Professional Documents
Culture Documents
VMAP. A Dublin Core Application Pro Le For Musical Resources
VMAP. A Dublin Core Application Pro Le For Musical Resources
1 Introduction
According the Lynch [1], the new context for bibliographic control in the new
millennium will certainly reorder priorities for investment in bibliographic con-
trol practices and will change the way that cataloguing information is used.
He identifies three general approaches to identifying potentially relevant infor-
mation: through bibliographic surrogates that represent an intellectual analy-
sis and description of aspects and attributes of a work; through content-based
techniques that compare queries to parts of the actual works themselves (or to
computationally-derived surrogates for the works); and through social processes
that exploit the opinions and actions of communities that author, read, and eval-
uate works, and the information seeker’s view of those communities of people
involved.
This paper details a Dublin Core Application Profile (DCAP) defined for cat-
aloguing musical resources defined within the European eContentPlus project
Variazioni. Variazioni project aims at enriching musical contents provided by
musical institutions and end-users by combining the three processes: manual cat-
aloguing based on well-defined metadata by musical institutions and end users;
automatic cataloguing based on audio analysis, and social tagging. This paper
is focused on the definition of the metadata schema for manual cataloguing
The article is structured as follows. First, section 2 reviews the limitations of
existing metadata standards and projects for cataloguing musical assets. Then,
?
This research has been co-funded by the European Community under the programme
eContentPlus. The authors are solely responsible for this article and it does not
represent the opinion of the European Community. The European Community is
not responsible for any use that might be made of information contained within it.
??
This work has been partially funded by the Spanish Ministry of Industry, Trade and
Tourism under the Avanza Programme in the project Musiteca.
section 3 presents the model of the Variazioni Musical Dublin Core Application
Profile (VMAP). Finally, section 4 draws out some conclusions and future work.
In this section it is discussed how the FRBR conceptual model can be applied
for cataloguing different types of musical contents, such as master classes videos
or digitalised scores. In order to understand better the relationship with FRBR,
a first identification of FRBR entities per musical content type has been carried
out as shown in table 1.
From the previous table, the following issues can be pointed out.
– Expression and Work entities are not easy to identify in some cases, such
as Master Classes or Conferences. This happens because the intellectual
or artistic activity (Work) emerges while the activity (Expression) is being
carried out. A similar issue has been previously reported for Western Music
or Jazz improvisation in FRBRList [26] or MusicAustralia [20].
– According to FRBR, an Expression is the realisation of one and only one
Work [6]. This can create some problems while cataloguing if the final digital
file contains several Expressions (for example, a video recording with several
performances or a digitalised score book with several scores, or a CD in only
one track) and there is not a segmentation tool available in the system. The
main Work entity in the music domain is Composition. Nevertheless, in some
musical contents, such as Master Classes or Conferences, the Composition is
not the intellectual / artistic activity of the Master class / Conference, but
it is commonly used to exemplify a concept. They are used as subjects.
– Managing image and ’event material’. The image content is problematic. For
example, let us consider a concert, where there are a video recording, an au-
dio recording and photos of the event. One natural alternative is considering
that all of them are ’Manifestations’ of the same Expression (the Concert)
but recorded in different media (image, video or sound). The main problem
is that the photo may not be easily linked to the performance of one partic-
ular Work, but to the general event. A similar case happens for cataloguing
related material such as the announcement poster of the Concert. According
to [29], these augmentations (illustrations, notes, glosses, etc.) of the Expres-
sion should be considered separate Expressions of their own separate works,
but this makes the cataloguing more laborious.
– In digital libraries, the distinction between Manifestation and Item is not so
relevant, since there is only one copy of the work (the digital media). FRBR
cannot be considered as a data model, but a conceptual schema. FRBR does
not even require implementing the four entities of the first FRBR Group [29].
– While FRBR follows a top-down approach for cataloguing, cataloguing fol-
lows a bottom-up approach. Users or librarians catalogue an Item, not a
Work. Users should have an easy interface in order to catalogue their media
files, without being aware of the FRBR model. Expertise in implementing
FRBR in standard databases [20] has shown its utility for end users to find
relationships between items, which were hidden before its implementation.
Nevertheless, these experiences have shown that since FRBR provides sev-
eral alternatives during the cataloguing process, this can make the process
difficult to understand. Some examples of these difficulties are to decide
whether music and lyrics should be catalogued as different items, the defi-
nition of relationships between expressions (i.e. an interpretation (e1) based
on a libretto (e2) of a work (o1)), as the cataloguing of expressions based on
improvisation, such as jazz music and folk traditions.
– Cataloguing is commonly carried out in an iterative way in musical institu-
tions. Depending on the available resources, a media file can be catalogued
with very few metadata and catalogued more exhaustively when there are
more resources. This include the identification of the Composition (Work),
which can be delayed.
In order to clarify these elements, an example of how the same items are cat-
alogued according to standard FRBR and Variazioni Music Application Profile
is shown in tables 2 and 3 respectively.
From this example, the main differences of the model can be outlined. First of
all, according to FRBR, and Expression has one and only one Work, and this has
supposed the shift in focus from the resource (Manifestation) in the traditional
cataloguing world to the Work in FRBR. Our proposal consists of modifying
MC1. Score. Transcription for classic guitar by Santley Yates
C1: J. S. Bach’s Six suites for unaccompanied cello
P1: Book edition
MF1: Media file of the book (page range if book includes more compositions)
P2 Separata of the guitar edition
MF2: Media file of the separata
MC2. Studio Recording. Performances by Janos Starker recorded in 1963 and 1965
C1: J. S. Bach’s Six suites for unaccompanied cello
P3: Recordings released on 33 1/3 rpm sound discs in 1965 by Mercury.
MF3: Suite 1 media file (and details of the fragment, full or time range)
C2: J. S. Bach Suite 1 for unaccompanied cello [is-part-of C1]
MF4: Suite 2 media file (and details of the fragment, full or time range)
C3: J. S. Bach Suite 2 for unaccompanied cello [is-part-of C1]
...
MF8: Suite 6 media file (and details of the fragment, full or time range)
C7: J. S. Bach Suite 1 for unaccompanied cello [is-part-of C1]
P4: Recordings re-released on CD in 1991 by Mercury
MF9: Single media file of the suites
the cardinality of the relationship hasWork between Work and Expression, from
1-1 in FRBR to M-M (many-to-many). This allows solving some of the previous
issues pointed out: (a) , since Compositions (Works) are not mandatory for a
Musical Content (Expression); and (b), since one Musical Content (Expression)
can have more than one associated Compositions (Works).
Another interesting change is the usage of the relationship hasSubject, in
particular for linking any element of the model with Composition. FRBR only
considers this relationship for Works. In our case, for example, for Master classes,
several Compositions could be the subject (or example) of a master class. In the
example previously presented, a Composition can be assigned as subject of a Me-
dia Fragment, avoiding the need for creating a new Expression. This is depicted
in figure 1, which points out two different kind of semantic relationships be-
tween Composition and Musical Content: isRealizedAs and hasSubject. In terms
of searchability, we have not found the need to distinguish between both in the
implementation of the model. Furthermore, it is possible to define the subject
of a media fragment, allowing a direct asignation
The relationships between the entities of the VMAP model are shown in
figure 2. It is important to point out the cardinality between Composition and
Musical Content. A Musical Content can have more than one Composition (as a
realisation or as a subject). In addition, Media Fragments can have one or more
Compositions as a subject or realisation. In order to simplify the cataloguing
process, it is not distinguished between the relationships isPerformedAs and
hasAsSubject, shown previously in figure 1.
Fig. 2. VMAP model
For each musical content type, specific metadata have been identifed as shown
in figure 4 and is formalised in [30].
This model has been formalised as a Dublin Core Application Profile for
describing digital musical contents hold by musical institutions or end users.
The concept of Application Profile has emerged within the Dublin Core Meta-
data Initiative. Dublin Core Application Profiles (DCAP) can be defined [31] as
schemas consisting of data elements drawn from one or more name spaces, com-
bined together by implementers, and optimised for a particular local application
Fig. 3. Musical Content Entity
Dublin Core Application Guidelines [32] and the Singapore guidelines [33] have
been followed to document attributes drawn from other name spaces and the
declaration of new elements under the name space VMAP (Variazioni Musical
Application Profile)
The resulting DCAP is available at [30] and has been formalised with DC-
Text [34], where all the entities and properties have been formalised, as well
as 26 classification schemes. VMAP reuses several controlled vocabularies, such
as Getty Thesaurus for Geographical names [35], the thesaurus of Musical In-
struments [36] as well as metadata schemas such as FOAF [37] In addition, a
mapping to Simple Dublin Core has been defined in order to provide OAI-PMH
interoperability.
The model has been implemented following the aspect oriented content model
of Alfresco [38] and is publicly available at the Variazioni Content Enrichment
Model (CEP) [39].
as discussed in [40]. This could suggest the need of defining a more extensible
model based on FRBR, which would improve the interoperability. Although
VMAP has been defined for the musical domain, it could be applicable in other
arts, which is left to future work.
Defining a metadata schema which provides quality metadata is not an easy
task. It enables a trade-off between different quality properties, such as con-
sistency, completeness, accuracy, shareability, economic feasibility and usability.
One could think that the most complete and more metadata a schema has, the
more quality it has. Nevertheless, one of the common pitfalls pointed out in [41]
for providing shareable metadata is too much information. Economic and human
factors are also important in the definition of metadata, in order to provide un-
derstandable and effective metadata. Furthermore, natural resistance to change
is a challenge for adopting a new metadata model.
In order to define quality metadata, the process defined in [42] has been fol-
lowed for defining Variazioni Metadata Model, identifying external and internal
requirements. The implementation of the project has provided feedback about
the decisions taken in this definition.
Currently partners of Variazioni are working in the dissemination of the
VMAP model as well as the portal. Feedback from users will guide its evo-
lution. In addition, the Spanish project Musiteca has started a living lab for
classic musical contents where this DCAP will be validated with end users and
classical music content providers.
References