Professional Documents
Culture Documents
net/publication/313237218
CITATIONS READS
4 5,104
4 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Eswaran Subrahmanian on 05 March 2018.
Abstract—This paper presents a category theoretic ontology humming or singing. However, the vast majority of MIR is
of Carnatic music. Our goals here are twofold. First, we will metadata-based; we retrieve music segments by searching for
demonstrate the power and flexibility of conceptual modeling text related to the segment, such as its title or artist. In this
techniques based on a branch of mathematics called category
theory (CT), using the structure of Carnatic music as an example. paper, we will focus exclusively on metadata-based MIR.
Second, we describe a platform for collaboration and research Although metadata-based retrieval is convenient for users
sharing in this area. The construction of this platform uses for- and easy to implement, the metadata on which these searches
mal methods of CT (colimits) to merge our Carnatic ontology depend must be expressive enough and accurate enough to
with a generic model of music information retrieval tasks. The support retrieval tasks. This makes metadata creation and
latter model allows us to integrate multiple analytical methods,
such as hidden Markov models, machine learning algorithms, and evaluation one of the most significant challenges in metadata-
other data mining techniques like clustering, bagging, etc., in the based MIR. In many cases metadata collection requires explicit
analysis of a variety of different musical features. Furthermore, supervision to ensure accurate results. It may also be difficult
the framework facilitates the storage of musical performances to maintain a consistent format (including the choice of data
based on the proposed ontology, making them available for fields) in our music databases, especially when these grow
additional analysis and integration. The proposed framework
is extensible, allowing future work in the area of rāga recog- rapidly.
nition to build on our results, thereby facilitating collaborative One way to ease, though not eliminate, these challenges is
research. Generally speaking, the methods presented here are through the use of domain modeling. If we have some rules
intended as an exemplar for designing collaborative frame- about the way that different pieces of metadata relate to one
works supporting reproducibility of computational analysis and another, then we can evaluate new data against these rules to
simulation.
recognize some incorrect inferences. Similarly, modeling the
Index Terms—Categorical framework for Carnatic music, structure of a musical domain can allow us to cross-check
categorical structure for rāga, category theory (CT), related inferences.
ontology.
There is a significant body of literature modeling Western
classical music, including tempo estimation, beat tracking,
instrumental music segmentation, instruments classification,
I. I NTRODUCTION and transcription [4]–[9]. By comparison, there has been
ROUND the world, a tremendous number of audio music little effort to analyze Indian classical music and, correspond-
A files are accessed every day for personal usage, research,
and analysis purposes. The area of music information retrieval
ingly, most MIR technologies have not been applied in this
context [10]–[12]. Here, we try to address this gap.
(MIR) studies techniques and methods for helping users find Indian classical music has two main traditions, namely,
the music files they want. Thus, MIR concentrates on archival Carnatic and Hindustani. Carnatic music is popular in the
methods, providing metadata, annotations, search algorithms, southern part of India while Hindustani is popular in the north.
and analysis of music to help the users filter an ocean of musi- Though these two traditions are similar in certain respects,
cal content. Especially as listening habits migrate from hard in this paper, we choose to focus only on Carnatic music,
drives to the cloud, MIR is becoming an increasingly critical leaving the extension to Hindustani music and the relation-
area of research [1]. ship between the two branches for future work. A reader
There are two general classes of MIR: 1) content-based interested in learning more about Hindustani music should
and 2) metadata-based [2], [3]. The former involves retrieval consult [13], [14].
based on musical content, which a user might provide by A song in Carnatic music is composed in a specific
melody (rāga) but, unlike the case in Western classical music,
Manuscript received May 12, 2016; revised August 29, 2016; accepted
November 4, 2016. This paper was recommended by Associate Editor the rhythm can be rendered differently by different musi-
L. Sheremetov. cians [13], [15]. This is primarily because Carnatic music has
This paper has supplementary downloadable multimedia material available been handed down from teacher to student in an oral tradi-
at http://ieeexplore.ieee.org provided by the authors.
The authors are with the Department of ITL, National Institute tion. Consequently, Carnatic music typically does not have
of Standards and Technology, Gaithersburg, MD 20899 USA (e-mail: standardized notations as are available in Western music, and
sarala.padi@nist.gov). what does exist varies from school to school. This adds an
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. additional difficulty for the archiving and analysis of Indian
Digital Object Identifier 10.1109/TSMC.2016.2631130 classical music.
2168-2216 c 2017 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/
redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE I
C ARNATIC M USIC Svaras AND T HEIR F REQUENCY R ATIOS
(a)
(b)
Fig. 2. Svara sequence (both ascending and descending) and its generating
regular expression for a Melakartha rāga system.
A. Rāga
The melodic component of a piece of Carnatic music is
By contrast, Janya rāgas are much more flexible. Their
called its rāga. Rāgas are constructed by grouping together
sequences usually contain fewer than seven notes, although
the svaras introduced above in different ways, and each of
sometimes they may have seven or more notes. A given
these groupings are associated with one of nine correspond-
sequence in a Janya rāga may also repeat certain svaras, and
ing emotions [15]. The rāgas are developed to elicit these
need not be strictly ascending or descending. For a comparison
emotions and, in fact, the word “rāga” is derived from the
of Melakartha and Janya rāgas [see Fig. 2(b)].
Sanskrit word for “color” or “passion.”
There are further restrictions on Melakartha rāgas which
A rāga is uniquely determined by a sequence of svaras.
Janya rāgas do not share. For Melakartha rāgas, both the
Additionally, this sequence can be decomposed into an
aarohana and avarohana sequences must contain exactly the
aarohana (ascending) sequence followed by an avarohana
same svaras, and these sequences must be exactly the reverse
(descending) sequence. In the aarohana sequence the pitch
of one another. Thus, all together there are 72 possible
tends to increase, beginning at the tonic and ending again at
Melakartha rāgas constructed from the allowed combinations
the tonic, one full scale higher (i.e., at twice the frequency of
of svaras indicated in Fig. 2.
the initial tonic). The avarohana sequence, on the other hand,
There is, however, an important relationship between the
tends to decreases, beginning at the high tonic and ending at
two types of rāgas: every Janya rāga is derived from a
the low.
Melakartha rāga. Starting from a Melakartha rāga, one usu-
Rāgas are broadly grouped into two classes, namely,
ally drops or adds a small number of svaras to arrive at an
Melakartha and Janya rāgas. While both classes contain
associated Janya rāga.
aarohana and avarohana sequences, the Melakartha rāgas are
This relationship is also reflected in an additional char-
more restrictive in the sequences they allow. Rāgas may be
acteristic of rāgas: the emotion associated with a rāga.
further classified as sampoorna (complete) or asampoorna.
Traditionally, each rāga is associated with one of nine emo-
A sampoorna rāga includes exactly seven svaras in both
tions, include Bhakti (ritual or devotional) and Viram (bravery
the aarohana and the avarohana sequence. Any other rāga
or fury). A thorough discussion is given in [30]. Here, the
is asampoorna. Therefore, all Melakartha rāgas are sam-
important observation is that a Janya rāgas shares the same
poorna rāgas while Janya rāgas, may be either sampoorna
emotion as the Melakartha rāgas that it was derived from.
or asampoorna rāgas.
In Melakartha rāgas, both the aarohana and avarohana
sequences are required to contain exactly one note from each
svara class (i.e., the horizontal groupings as shown in Fig. 1). B. Tāla
Moreover, in the aarohana sequence the pitch of each svara In the Carnatic music tradition tāla and rāga are concepts
must be higher than the one before. Thus, e.g., the svara R2 of equal importance in rendering a composition. Tāla defines
can be followed by G2 or G3 , but not by G1 . Similar (but the rhythmic structure or framework using which music is
reversed) requirements hold for the avarohana sequence. This performed. In general tāla means clap and is used to main-
means that the frequency of notes in a rāga gradually increases tain the time which in turn determines the rhythmic structure
from the tonic to its middle note (one full scale higher), and or pattern for a musical composition. Generally, there are no
then gradually descends to return to the tonic at the end of special instruments to maintain the tāla in Indian classical
the rāga. Fig. 2(a) gives a finite state machine and a regular music performance. It is usually maintained by the musician
expression which will produce all of the aarohana sequences by tapping of the hand on the lap or using both the hands like
in Melakartha rāgas. clapping. As shown in Table II, seven tālas are characterized
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE II TABLE IV
Tāla NAMES AND C ORRESPONDING S TRUCTURES (PATTERN ) IN L IST OF G ATIS AND THE N UMBER OF N OTES P ER B EAT
C ARNATIC M USIC T RADITION . T HE N OTATIONS IN THE TABLE
ARE : laghu—I, drutam—O, AND anudrutam—U
TABLE III
JATI NAMES AND N UMBER OF B EATS FOR
laghu (I) S PECIFIED IN tāla Pattern
Fig. 5. Methods and features used for rāga recognition task in Carnatic
music tradition.
2) Representing a problem and providing the required There is also a broad and substantial literature on the
knowledge to solve the problem. use of CT in formal modeling. One line of inquiry fol-
3) Validating the appropriateness of the knowledge before lowing on this database work can be found in the work
solving the problem. of Johnson and Rosebrugh [63], who provide a categori-
4) Solving the problem computationally and evaluating the cal interpretation of entity–attribute–relation diagrams. This
correctness of the solution to the problem. paper was further elaborated by Diskin [66], [67], and
5) Comprehensive representation of the knowledge so that a Rutle et al. [68], with a particular emphasis on software
computer can solve the problem for a particular domain. engineering. Most importantly, whereas earlier work was
There are many models for knowledge representation. One largely theoretical, much of this more recent work has been
class of such models are called ontologies, which provide an implemented and is directly informed by engineering practice.
explicit specification and abstraction of knowledge [49]–[52] Along broadly similar lines, CT can be used to under-
in either human-readable form or a formal language. These stand the object-oriented class modeling as found, for example,
typically begin by specifying the types of entities which occur in the universal modeling language (UML). Although it is
in the problem domain, then supplemented with a collection ubiquitous in (especially the early stages of) software engi-
of rules that the entities are expected to obey. neering, UML lacks well-defined semantics. Both Diskin [69]
The most popular approach to ontologies rely on a family of and Padi et al. [70] have associated (fragments of) the UML
Web ontology languages (OWL) [53]–[55]. In MIR, these have class diagram syntax with constructions in CT. Turning this
been applied to model music for segmentation and the extrac- relationship around, CT straight-forwardly inherits many of
tion of semantic information [56], [57]. The OWL approach the methods used in object-oriented class modeling.
is based on description logic and is often used in conjunc- The formal mathematics of CT allow us to sidestep some of
tion with other technologies, such as the resource description the difficulties of other ontological representations. For exam-
framework and the SPARQL the query language, and can also ple, one advantage of categorical ontologies is that we can
be mapped to relational database schemas for storage and express their inter-relationships using maps called functors.
retrieval [58], [59]. More generally, a diagram of functors can describe sophisti-
One shortcoming of existing ontological approaches is a cated relationships between families of ontologies, which can
lack of extensibility. It can be difficult to modify an existing then be merged together using a construction called a colimit.
ontology in order to extend or debug its domain. In particu- Kan extensions provide a formal method for data migration
lar, it is difficult to relate different ontologies to one another, between schemas. These powerful formal methods allow us to
particularly in cases where neither ontology embeds into the manipulate categorical ontologies in a way that is both correct
other. A lack of formal relationships between ontologies also and mathematically justified [71]–[73].
leads to difficulties when migrating data from one ontology to The previous section described some fundamental aspects of
another. Carnatic music. In this section, we translate that description
Similar issues arise for ontological integration. Domain- into a category-theoretic ontology. Our supplementary mate-
specific OWL ontologies are often specialized from large and rials provide a brief introduction to the methods of CT (as
complex “upper” ontologies, e.g., [60]. The advantage is that well as our notation); we strongly advise those without a
this method can align multiple small ontologies derived from background in CT to study that material before proceeding.
the same upper ontology. The disadvantage is that these upper In our categorical model, each object represents a set of
ontologies can be difficult for developers to write and debug entities, and each arrow represents a function between these
and for users to navigate. A more modular approach, based on sets. In diagrams the objects will be represented as a box con-
identification of overlap between small ontologies, depends on taining text which describe a typical element of the set. For
a concrete representation of that overlap. example:
In this paper, we investigate an alternative approach to
ontologies based on CT. CT was developed in the 1940s
by Eilenberg and MacLane [61] to study the relationship
between two areas of mathematics: 1) topology and 2) algebra. represents the set of svaras (notes) in Carnatic music. For
Subsequent research has led to applications throughout mathe- readability, we use corner braces rather than full boxes when
matics as well as in theoretical physics and computer science. discussing objects in the text: e.g., a svara.
In 1990s, Rosebrugh and Wood [62] showed that a database As discussed in Section II, a Carnatic melody is built up
schema can be viewed as a category, and a database instance from 16 basic notes, spread across 12 pitches and 7 articula-
as a functor on this category [63]. More recently, Spivak [64] tions. Each of these will become an object in our categorical
has used this point of view to apply the methods of CT to representation. Since each of these objects represents a single
problems of data management. Generally speaking, this line individual, these can all be modeled by terminal objects in our
of work provides a dictionary of categorical interpretations category. We then group these notes by articulation, as in the
for the standard vocabulary of databases, such as schemas, horizontal groupings of Fig. 1. As shown in Fig. 6 and 7,
instances, queries, updates, and data migration. This paper has categorically, these groupings take the form of coproducts,
also been implemented in the functorial query language, soft- allowing us to represent the objects Rishabham, Gandharam,
ware for building and analyzing databases from the categorical Dhaivata, Nishada, Madhyama, Sadja, and Panchama as finite
perspective [65]. sets (also called enumerations).
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 16. Using the UMP of the coproduct to define the type of a rāga.
Fig. 17. Illustrating the relationship between Melakartha rāga and Janya
rāga.
Fig. 18. A tāla determines a sequence (list) of beats (of type O, U or I).
Fig. 20. A jati associates each type of beat (O, U or I) with a natural number,
its beat count. This figure number should be 19.
any processing task, share the code, use the existing results
for analysis purposes and verifying the results. This platform
creates a common workspace, where anyone can use a shared
database and perform analysis in a consistent manner.
Notice, in particular, the dotted line
Fig. 31. A commutative diagram expressing coherence between the task and
result of an experiment.
If we let meta-data for MIR denote the coproduct represents a ground truth. Every file under consideration
records a particular piece of Carnatic music, so there is such
a function. However, this information is, in general, not avail-
able to us. In some sense, our goal is to identify (the metadata
Attached to this object there is a typing map constructed of) the recorded segment based on characteristics of the file.
in the same fashion as the rāga typing map introduced in Alternatively, in situations where we know the ground truth of
Section III (see Fig. 29). a music segment, this provides us with additional paths from
Next, we have the notion of an experiment as depicted an experiment to meta-data for MIR (refer to Fig. 33).
in Fig. 30. Roughly speaking, an experiment is an instance These alternate paths can help to verify our metadata.
of a particular task which analyzes a raw file to generate a Ideally, these paths should agree (for any particular task
single result. This result can be mapped to meta-data for MIR, type), but in practical experimental evaluations, this cannot
corresponding to an arrow (see Fig. 32). be expected in all cases. One path is derived from ground
truth about music segment and another path is taken from the
experimental evaluation. We can compare the results of both
the paths to judge the accuracy of our experimental methods.
For example, Fig. 35 provides an instance of the task model
There is an obvious consistency check here, which can be rep- for rāga recognition task using the HMM method applied to
resented as a commutative diagram: the type of result from the pitch feature. It also gives an example of one element in
an experiment is the same as the type of the task which the model, traced through the various maps. Other tasks may
the experiment performed. This is shown diagrammatically involve different parameters, but this basic model provides a
in Fig. 31. framework enabling end users to integrate various pieces of
As shown in Fig. 30, we can also keep track of some addi- information for implementing other tasks like tāla recognition,
tional information like the researcher and the date on which the singer identification, song name identification, etc.
experiment is conducted. This categorical model allows group
of people to do the experiments on a given music segment
with different methods and features and different users can V. U SE C ASES
store, retrieve, and update the results. This framework can be This section provides experimental evaluation and database
used by group of people for collaborative analysis and imple- schemas for the categorical models discussed in Section IV.
mentation purposes. Notice that the models of an experiment A database is a collection of data that is stored and organized
and a task may be glued together along their overlap using the so that the end user can easily and quickly access, manage, and
same colimit methods discussed above. Moreover, in any par- update the collection. In particular, relational database models
ticular domain the feature parameters, method parameters and involve the storage of data in tables (also called relations).
results objects may be expanded in a similar way to represent In the relational models, each column in a table is called an
structured data. attribute and each row is called as an entity or record. Each
Fig. 32 illustrates the complete categorical model for analy- record is uniquely determined by an identifier called a primary
sis of Carnatic music with experimental evaluation, including key. Columns of other tables which refer to this primary key
the tasks performed on music segments. As suggested by the are called foreign keys, and these can be regarded as arrows
figure, an experiment is performed on a raw file which contains in a category.
a recording of a Carnatic music segment. Every experiment In fact, Spivak has shown [64], [65] that (finitely presented)
involves a task which is performed using features and meth- categories can be converted into database specifications (and
ods with appropriate parameters. The parameters for feature vice versa), so in defining our categorical models we have
and method vary based on type of feature and type of method already provided a database model. The fact that categorical
with respect to a particular task. models can be translated into a database schemas facilitates the
The idea behind this categorical model is that any user can storage, analysis and fine tuning of the data based on our mod-
execute the model and store his/her outputs for a particu- els. The resulting database provides the users access to the data
lar task using different features and methods. These can be for further analysis and processing tasks. See [64], [65] for an
accessed or cross verified by other people who work on the explanation of how a categorical model can be represented as
same music segment. The model shares information between a database schema along with its integrity constraints.
a group of people and allows them to collaborate on research Via this approach we can use our existing models of rāga,
using the same music databases. This collaborative platform tāla, etc., to define our underlying database models. In fact, in
allows for sharing the common model for modeling music for most cases we do not want to store all of the data associated
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 37. Graphical interpretation of generic model for analysis of Carnatic music.
visualized as a three-layered structure. The first layer concen- for music segments, but these efforts have been somewhat dis-
trates on a categorical representation of the characteristics of parate, making it difficult to compare their results. This paper
Carnatic music. This layer also details the methods, features, discusses a categorical approach for modeling the Carnatic
and tasks which are used to perform a particular analysis. music for various processing tasks for MIR applications.
While implementing such a task, we may pull characteristics Despite the complexity of traditional Carnatic music and its
of music from its corresponding models. basis in rāga and tāla structures, we have formally charac-
After concluding a task, its results are stored in the database terized these complexities using the categorical approach. The
of the second layer. conceptual model proposed in this paper provides a means to
As explained in Fig. 36, the second layer in the generic model any music segment for finding metadata for information
model is a database schema which mediates our category the- retrieval purposes.
oretic ontology and the query processing system presented to With respect to collaborative research, the proposed categor-
users. This layer allows us to create database schema based ical model provides a common platform that allows users or
on (a fragment of) the CT models provided in first layer so researchers to share data and verify the results on a common
that end user can access the required information from the database. This paper also addresses the mapping of categori-
database through query processing system. cal structures into database schemas for storage and retrieval
The last layer of the model supports query analysis for informa- purposes. The framework developed in this paper will allow a
tion retrieval purposes. This layer also integrates the processing group of people to conduct analysis using a categorical model
of results based on different methods which may be evaluated on a common corpus of music segments collaboratively.
on different features. This layer allow the end user to access This paper provides a framework to formalize the relation-
or retrieve information about desired music segments. The end ship between Carnatic music structure, prior research, tasks
user can also find the methods and features used for a particular previously attempted, methods and features used for differ-
task for further analysis purposes. This layer extracts the data ent music processing tasks. The general model proposed in
from the database for any query analysis. In this way, the unified this paper can be adapted to other domains by replacing our
conceptual model allows a group of people to share the results model of Carnatic music by models applicable to other domain
and to get complete details about the tasks, features and results problems. As future work, the proposed categorical model can
of a music segment in the database. We can say that this model be extended to characterize other musical traditions, including
allows the people to do collaborative research. Western classical music and Hindustani music.
R. Wisnesky and S. Adhinarayanan for their valuable com- [24] J. C. Baez and A. Lauda, “A prehistory of n-categorical physics,” in
ments and suggestions for improving the consistency and Deep Beauty: Understanding the Quantum World Through Mathematical
Innovation. 2009, pp. 13–128.
readability of this paper. [25] J.-C. Letelier, J. Soto-Andrade, F. G. Abarzua, A. Cornish-Bowden, and
M. Luz Cárdenas, “Organizational invariance and metabolic closure:
Analysis in terms of (M,R) systems,” J. Theor. Biol., vol. 238, no. 4,
R EFERENCES pp. 949–961, 2006.
[26] D. Ellerman, “Determination through universals: An application of cate-
[1] N. Orio, “Music retrieval: A tutorial and review,” Found. Trends Inf. gory theory in the life sciences,” unpublished paper. [Online]. Available:
Retrieval, vol. 1, no. 1, pp. 1–90, 2006. https://arxiv.org/abs/1305.6958
[2] M. A. Casey et al., “Content-based music information retrieval: [27] V. G. Paranjape, “Indian music and aesthetics,” J. Music Acad. Madras,
Current directions and future challenges,” Proc. IEEE, vol. 96, no. 4, vol. XXVIII, pp. 68–71, 1957.
pp. 668–696, Apr. 2008. [28] P. Lavezzoli, The Dawn of Indian Music in the West. New York, NY,
[3] R. Typke, F. Wiering, and R. C. Veltkamp, “A survey of music USA: Bhairavi, Continuum, 2006.
information retrieval systems,” in Proc. ISMIR, London, U.K., 2005, [29] A. Bellur, V. Ishwar, X. Serra, and H. A. Murthy, “A knowledge based
pp. 153–160. signal processing approach to tonic identification in Indian classical
[4] E. D. Scheirer, “Tempo and beat analysis of acoustic musical signals,” music,” in Proc. 2nd CompMusic Workshop, Istanbul, Turkey, 2012,
J. Acoust. Soc. America, vol. 103, no. 1, pp. 588–601, 1998. pp. 113–116.
[5] M. A. Alonso, G. Richard, and B. David, “Tempo and beat estimation of [30] G. K. Koduri and B. Indurkhya, “A behavioral study of emotions in
musical signals,” in Proc. ISMIR, Barcelona, Spain, 2004, pp. 158–163. South Indian classical music and its implications in music recommen-
[6] P. Herrera-Boyer, G. Peeters, and S. Dubnov, “Automatic classification of dation systems,” in Proc. ACM Workshop Soc. Adapt. Pers. Multimedia
musical instrument sounds,” J. New Music Res., vol. 32, no. 1, pp. 3–21, Interact. Access, Florence, Italy, 2010, pp. 55–60.
2003. [31] P. Sarala, V. Ishwar, A. Bellur, and H. A. Murthy, “Applause identifica-
[7] T. Kitahara, M. Goto, K. Komatani, T. Ogata, and H. G. Okuno, tion and its relevance to archival of Carnatic music,” in Proc. Workshop
“Instrument identification in polyphonic music: Feature weighting to Comput. Music, Istanbul, Turkey, Jul. 2012, pp. 66–71.
minimize influence of sound overlaps,” EURASIP J. Appl. Signal [32] V. Ishwar, A. Bellur, and H. A. Murthy, “Motivic analysis and its rel-
Process., vol. 1, no. 1, p. 155, 2007. evance to raga identification in Carnatic music,” in Proc. Workshop
[8] B. L. Sturm, M. Morvidone, and L. Daudet, “Musical instrument Comput. Music, Istanbul, Turkey, Jul. 2012, pp. 153–157.
identification using multiscale mel-frequency cepstral coefficients,” in [33] S. Gulati, J. Salamon, and X. Serra, “A two-stage approach for tonic
Proc. 18th Eur. Signal Process. Conf., Aalborg, Denmark, Aug. 2010, identification in Indian art music,” in Proc. Workshop Comput. Music,
pp. 477–481. Istanbul, Turkey, Jul. 2012, pp. 119–127.
[9] M. Marolt, “SONIC: Transcription of polyphonic piano music with [34] G. K. Koduri, J. Serrá, and X. Serra, “Characterization of into-
neural networks,” in Proc. Workshop Current Res. Directions Comput. nation in Karnataka music by parametrizing context-based Svara
Music, Barcelona, Spain, 2001, pp. 217–224. Distributions,” in Proc. Workshop Comput. Music, Istanbul, Turkey,
[10] H. G. Ranjani, S. Arthi, and T. V. Sreenivas, “Carnatic music analy- Jul. 2012, pp. 128–132.
sis: Shadja, swara identification and raga verification in alapana using [35] J. C. Ross and P. Rao, “Detection of raga-characteristic phrases from
stochastic models,” in Proc. IEEE Workshop Appl. Signal Process. Audio Hindustani classical music audio,” in Proc. Workshop Comput. Music,
Acoust. (WASPAA), New Paltz, NY, USA, Oct. 2011, pp. 29–32. Istanbul, Turkey, Jul. 2012, pp. 133–138.
[36] A. Vidwans, K. K. Ganguli, and P. Rao, “Classification of Indian clas-
[11] R. Sridhar and T. V. Geetha, “Raga identification of Carnatic music for
sical vocal styles from melodic contours,” in Proc. Workshop Comput.
music information retrieval,” Int. J. Recent Trends Eng., vol. 1, no. 1,
Music, Istanbul, Turkey, Jul. 2012, pp. 139–146.
pp. 571–574, 2009.
[37] B. Bozkurt, “Features for analysis of Makam music,” in Proc. Workshop
[12] R. Sridhar and T. V. Geetha, “Music information retrieval of Carnatic
Comput. Music, Istanbul, Turkey, Jul. 2012, pp. 61–65.
songs based on Carnatic music singer identification,” in Proc. Int. Conf.
[38] A. Srinivasamurthy, S. Subramanian, T. Gregoire, and P. Chordia,
Comput. Elect. Eng., 2008, pp. 407–411.
“A beat tracking approach to complete description of rhythm in Indian
[13] L. Pesch, The Oxford Illustrated Companion to South Indian Classical
classical music,” in Proc. Workshop Comput. Music, Istanbul, Turkey,
Music. New Delhi, India: Oxford Univ. Press, 2009.
Jul. 2012, pp. 72–78.
[14] R. M. Rangayyan, “An introduction to the classical music of India,” [39] P. Sarala and H. A. Murthy, “Cent filter-banks and its relevance
Dept. Elect. Comput. Eng., Univ. Calgary, Calgary, AB, Canada. to identifying the main song in Carnatic music,” in Proc. Comput.
[15] T. M. Krishna, A Southern Music: The Karnatik Story, HarperCollins, Music Multidscipl. Res. (CMMR), Marseilles, France, Oct. 2013,
Noida, India, 2013. pp. 659–681.
[16] M. Andreatta, A. Ehresmann, R. Guitart, and G. Mazzola, “Towards [40] V. Ishwar, S. Dutta, A. Bellur, and H. A. Murthy, “Motif spot-
a categorical theory of creativity for music, discourse, and cogni- ting in an alapana in Carnatic music,” in Proc. Int. Soc. Music Inf.
tion,” in Mathematics and Computation in Music. Heidelberg, Germany: Retrieval (ISMIR), Curitiba, Brazil, Nov. 2013, pp. 499–504.
Springer, 2013, pp. 19–37. [41] A. Bellur and H. A. Murthy, “A novel application of group delay function
[17] E. Dennis-Jones and D. E. Rydeheard, “Categorical ML—Category- for identifying tonic in Carnatic music,” in Proc. EUSIPCO, Marrakesh,
theoretic modular programming,” Formal Aspects Comput., vol. 5, no. 4, Morocco, 2013, pp. 1–5.
pp. 337–366, 1993. [42] J. C. Ross, T. P. Vinutha, and P. Rao, “Detecting melodic motifs from
[18] K. Williamson, M. Healy, and R. Barker, “Industrial applications of audio for Hindustani classical music,” in Proc. Int. Soc. Music Inf.
software synthesis via category theory—Case studies using Specware,” Retrieval (ISMIR), Porto, Portugal, Oct. 2012, pp. 193–198.
Autom. Softw. Eng., vol. 8, no. 1, pp. 7–30, 2001. [43] P. Sarala and H. A. Murthy, “Inter and intra item segmentation of
[19] R. Tate, M. Stepp, and S. Lerner, “Generating compiler optimizations continuous audio recordings of Carnatic music for archival,” in Proc.
from proofs,” ACM SIGPLAN Notices, vol. 45, no. 1, pp. 389–402, Int. Soc. Music Inf. Retrieval (ISMIR), Curitiba, Brazil, Nov. 2013,
2010. pp. 487–492.
[20] J. C. Reynolds, “Using category theory to design implicit conversions [44] J. Kuriakose, J. C. Kumar, P. Sarala, H. A. Murthy, and U. K. Sivaraman,
and generic operators,” in Semantics-Directed Compiler Generation. “Akshara transcription of mrudangam strokes in Carnatic music,” in
New York, NY, USA: Springer, 1980, pp. 211–258. Proc. 21st IEEE Nat. Conf. Commun. (NCC), Mumbai, India, 2015,
[21] Z. Diskin and T. Maibaum, “Category theory and model-driven engi- pp. 1–6.
neering: From formal semantics to design patterns and beyond,” [45] CUED. (2002). HTK Speech Recognition Toolkit. [Online]. Available:
in Model-Driven Engineering of Information Systems: Principles, http://htk.eng.cam.ac.uk
Techniques, and Practice. Toronto, ON, Canada: Apple Acad. Press, [46] B. Logan, “Mel frequency cepstral coefficients for music modeling,” in
2014, p. 173. Proc. Int. Symp. Music Inf. Retrieval, Plymouth, MA, USA, 2000.
[22] T. Giesa, D. I. Spivak, and M. J. Buehler, “Category theory based solu- [47] M.-L. Zhang and Z.-H. Zhou, “ML-KNN: A lazy learning approach to
tion for the building block replacement problem in materials design,” multi-label learning,” Pattern Recognit., vol. 40, no. 7, pp. 2038–2048,
Adv. Eng. Mater., vol. 14, no. 9, pp. 810–817, 2012. 2007.
[23] D. I. Spivak, T. Giesa, E. Wood, and M. J. Buehler, “Category theoretic [48] D. Gerhard, “Pitch extraction and fundamental frequency: History and
analysis of hierarchical protein materials and social networks,” PLoS current techniques,” Dept. Comput. Sci., Univ. Regina, Regina, SK,
ONE, vol. 6, no. 9, pp. 1–15, Sep. 2011. Canada, Tech. Rep. TR-CS 2003-6, 2003.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
[49] B. Chandrasekaran, J. R. Josephson, and V. R. Benjamins, “What are Sarala Padi received the Ph.D. degree from the
ontologies, and why do we need them?” IEEE Intell. Syst., vol. 14, no. 1, Indian Institute of Technology Madras, Chennai,
pp. 20–26, Jan. 1999. India, in 2014.
[50] P. M. Simons, Parts: A Study in Ontology. Oxford, U.K.: Oxford Univ. She is currently a Guest Researcher with the
Press, 1987. National Institute of Standards and Technology,
[51] D. Fensel, Ontologies: A Silver Bullet for Knowledge Management and Gaithersburg, MD, USA. Her area of specializations
Electronic Commerce, 2nd ed. Heidelberg, Germany: Springer-Verlag, are machine learning and signal processing tech-
2003. niques for efficient music information retrieval and
[52] D. Fensel, “Ontology-based knowledge management,” Computer, automatic indexing purposes, text mining, and NLP
vol. 35, no. 11, pp. 56–59, 2002. methods for processing of text documents for various
[53] G. Antoniou and F. Van Harmelen, “Web ontology language: Owl,” applications. Her current research interests include
in Handbook on Ontologies. Heidelberg, Germany: Springer, 2004, applying category theory for modeling and knowledge representation, and
pp. 67–92. applying deep learning models for medical image analysis purposes.
[54] G. Antoniou, E. Franconi, and F. Van Harmelen, “Introduction to seman-
tic Web ontology languages,” in Reasoning Web. Heidelberg, Germany:
Springer, 2005, pp. 1–21.
[55] D. L. McGuinness and F. Van Harmelen, “Owl Web ontology lan-
guage overview,” W3C Recommendation, vol. 10, no. 10, 2004. [Online]. Spencer Breiner received the master’s and
Available: https://www.w3.org/TR/owl-features/ Ph.D. degrees from Carnegie Mellon University,
[56] S. Song, M. Kim, S. Rho, and E. Hwang, “Music ontology for mood Pittsburgh, PA, USA, in 2010 and 2013,
and situation reasoning to support music retrieval and recommendation,” respectively.
in Proc. 3rd Int. Conf. Digit. Soc. (ICDS), Cancún, Mexico, Feb. 2009, He joined the U.S. National Institute of Standards
pp. 304–309. and Technology, Gaithersburg, MD, USA, in 2015
[57] B. Fields, K. Page, D. De Roure, and T. Crawford, “The segment ontol-
under a Post-Doctoral Grant from the National
ogy: Bridging music-generic and domain-specific,” in Proc. IEEE Int.
Research Council. His current research interest
Conf. Multimedia Expo (ICME), Barcelona, Spain, 2011, pp. 1–6.
includes potential applications of the mathematical
[58] A. Gali, C. X. Chen, K. T. Claypool, and R. Uceda-Sosa, “From
field of category theory to real-world applications.
ontology to relational databases,” in Conceptual Modeling for
Advanced Application Domains. Heidelberg, Germany: Springer, 2004,
pp. 278–289.
[59] B. Motik, I. Horrocks, and U. Sattler, “Bridging the gap between owl
and relational databases,” Web Semantics Sci. Services Agents World
Wide Web, vol. 7, no. 2, pp. 74–89, 2009. Eswaran Subrahmanian received the Ph.D. degree
[60] A. Gangemi, N. Guarino, C. Masolo, A. Oltramari, and L. Schneider, from Carnegie Mellon University, Pittsburgh, PA,
“Sweetening Ontologies with DOLCE,” in Proc. Int. Conf. Knowl. Eng. USA, in 1987.
Knowl. Manag., Sigüenza, Spain, 2002, pp. 166–181. He is a Research Professor with the Institute
[61] S. Eilenberg and S. MacLane, “General theory of natural equivalences,” for Complex Engineered Systems and Engineering
Trans. Amer. Math. Soc., vol. 58, no. 2, pp. 231–294, 1945. and Public Policy, Carnegie Mellon University,
[62] R. Rosebrugh and R. J. Wood, “Relational databases and indexed cat- Pittsburgh, PA, USA. He has published over
egories,” in Proc. Int. Category Theory Meeting CMS Conf., vol. 13. 100 refereed papers, co-edited three books on
1992, pp. 391–407. Empirical Studies in Design, Design engineer-
[63] M. Johnson and R. Rosebrugh, “Sketch data models, relational schema ing, and Organizational Implications knowledge
and data specifications,” Electron. Notes Theor. Comput. Sci., vol. 61, Management. He also co-authored a book setting the
pp. 51–63, Jan. 2002. global research agenda for ICT in Development. He has also co-edited three
[64] D. I. Spivak, Category Theory for the Sciences. Cambridge, MA, USA: special issues in the areas of design theory, engineering informatics, and anno-
MIT Press, 2014. tations in engineering design. His current research interests include design
[65] H. Forssell, H. R. Gylterud, and D. I. Spivak, “Type theoretical
theory, design support systems, information modeling for engineering, col-
databases,” in Proc. Int. Symp. Logic. Found. Comput. Sci., 2016,
laborative engineering, and engineering education.
pp. 117–129.
Dr. Subrahmanian was a recipient of the Steven Fenves Award for Systems
[66] Z. Diskin, “Generalized sketches as an algebraic graph-based framework
Engineering at CMU. He is a Distinguished Scientist of the Association
for semantic modeling and database design,” Faculty Phys. Math., Univ.
of Computing Machinery, and a Fellow of the American Association of
Latvia, Riga, Latvia, Tech. Rep. M-97, 1997.
Advancement of Science and a member of the Design Society.
[67] Z. Diskin and U. Wolter, “A diagrammatic logic for object-oriented
visual modeling,” Electron. Notes Theor. Comput. Sci., vol. 203, no. 6,
pp. 19–41, 2008.
[68] A. Rutle, A. Rossini, Y. Lamo, and U. Wolter, “A category-theoretical
approach to the formalisation of version control in MDE,” in Proc.
Int. Conf. Fundam. Approaches Softw. Eng., York, U.K., 2009, Ram D. Sriram (S’82–M’85–SM’00–F’17) was
pp. 64–78. a Faculty of Engineer with the Massachusetts
[69] Z. Diskin, “Mathematics of UML,” in Practical Foundations of Business Institute of Technology, Cambridge, MA, USA,
System Specifications. Amsterdam, The Netherlands: Springer, 2003, from 1986 to 1994 and was instrumental in setting
pp. 145–178. up the Intelligent Engineering Systems Laboratory.
[70] S. Padi, S. Breiner, E. Subrahmanian, and R. D. Sriram, “Category He is currently a Division Chief of the Software
theoretical approaches for deconstructing the semantics of UML class and Systems Division with the National Institute
diagrams,” in preparation. of Standards and Technology, Gaithersburg, MD,
[71] I. Cafezeiro and E. H. Haeusler, “Semantic interoperability via cate- USA. He is a Distinguished Alumni with the
gory theory,” in Proc. 26th Int. Conf. Conceptual Model. Tuts. Posters Indian Institute of Technology and Carnegie Mellon
Panels Ind. Contribut. (ER), vol. 83. Auckland, New Zealand, 2007, University, Pittsburgh, PA, USA. He has co-authored
pp. 197–202. or authored nearly 250 papers, books, and reports, including several books. His
[72] F. McNeill, A. Bundy, and C. Walton, “Diagnosing and repairing onto- current research interests include developing knowledge-based expert systems,
logical mismatches,” in Proc. Stairs 2nd Starting Ai Res. Symp., vol. 109. natural language interfaces, machine learning, object-oriented software devel-
Amsterdam, The Netherlands, 2004, p. 241. opment, life-cycle product and process models, geometrical modelers, object-
[73] F. McNeill, “Dynamic ontology refinement,” Ph.D. dissertation, Div. oriented databases for industrial applications, health care informatics, bioin-
Informat., Univ. at Edinburgh, Edinburgh, U.K., 2006. formatics, and bioimaging.
[74] M. Barr and C. Wells, Eds., Category Theory for Computing Science, Dr. Sriram was a recipient of the NSF’s Presidential Young Investigator
2nd ed. Hertfordshire, U.K.: Prentice-Hall, 1995. Award in 1989, the ASME Design Automation Award in 2011, the ASME
[75] S. M. Lane, Categories for the Working Mathematician (Graduate Texts CIE Distinguished Service Award in 2014, and the Washington Academy of
in Mathematics), vol. 5. New York, NY, USA: Springer-Verlag, 1971. Sciences’ Distinguished Career in Engineering Sciences Award in 2015.