of metadata, since semantic web technologies can helpin checking the consistency of the cataloguing.The inclusion of semantic web technology pointsout several challenges. Firstly, it is needed to define anontology that contains the concepts of the Harmostaxonomy. Secondly, since musical analysts should notbe aware of the usage of semantic web technology forcataloguing, it is needed to develop easy interfaces inorder to catalogue semantically. Thirdly, it is neededmigrating all the Harmos catalogued multimediacollection to the new semantic schema. Finally, it isneeded to evaluate the current status of semantic webtechnology in terms of throughput and performance,given the size of the multimedia collection. This articlewill cover the first objective.
4. Methodology for transformingtaxonomies into ontologies
The central aim of Semusici is to provide asemantic structure that fits the former conceptstaxonomy. The purpose of this new approach is togather information about the relationships betweendisjoint leaves and build a new representation of boththe concepts and these relationships. This leads to aricher representation of the knowledge that is reallyassociated to the digital audiovisual items. This impliesa deep understanding of the subject domain. There isno definite methodology for this task, but a generalprocess together with best practices has been proposed.Once the problem has been well defined and all therequirements have been identified, a suitable structurehas to be chosen. As we are looking to take advantageof the technologies of the semantic web, the mostappropriate structure is an ontology. The reason whywe have chosen this structure is that it provides aformal way to represent roles and their correspondingrelations in a specific domain. By placing a concept insuch a structure, we are stating that it has certainproperties and satisfies some restrictions about hismeaning. In other words, each leaf of an ontologyrepresents the definition of a certain resource.The main difference between an ontology and ataxonomy is the kind of structure in which each of them is based. A taxonomy can be represented as a treewhere each leaf is a class. No connections are allowedbetween disjoint branches. Relations between classescan only be established between a concept and itsdirect children. So an instance of a certain class can bedefined as ”a kind of” its parent class. An ontology is agraph in which richer definitions can be expressedthrough a more extensive set of relations. This meansthat any class can be defined in terms of any otherresource that is connected to it, not necessarily beingits parent or child. Therefore ontologies can store moresemantic information than taxonomies, allowing us toinfer undeclared knowledge by studying the relationsand restrictions of a certain class.
5.1. First step: choosing the appropriate tools
There is a wide variety of tools available to create,edit, browse and store ontologies. There are also manyinference engines or reasoners, which are veryimportant to obtain knowledge from the ontology.Several tools have been examined in order to choosethe most suitable framework for our purposes. Some of these were Protégé [2], RacerPro [3], Sesame [4],SWOOP [5], WebODE [6], etc. A survey was carriedout in order to find distinctive features. Thereforeeleven parameters were chosen and thirteen tools wereevaluated according to these key features. Some of these parameters were the supported languages,consistency check support, availability, maintenance,etc. As a result, Protégé and Sesame were chosen.All these tools support a number of languages.Choosing the right language to implement an ontologyis probably the most important step in the process. Thisdepends on how thorough the ontology is intended tobe. For Semusici, our initial choice was RDFS as it isthe main language in Sesame. It proved to be completeenough to allow the building of a basic version of theontology. Later we decided to include somerestrictions to enforce the definition of the elementsthat we have already defined. These restrictions werealso intended to help us perform consistency checkswhen adding new contents. For that purpose, newOWL statements were added.
5.2. Semusici knowledge base
There are two distinct parts in the knowledge basethat is to be represented by the ontology. One isintended to capture all the information that is notdirectly related to the collection and can be useful tolocate a recording. The aim of this is to answer anyquery that is not directly related to the contents of therecording itself. For instance, “give me all therecordings related to composers born in the 18
th
century”.The other part of the knowledge base is theconcepts taxonomy. The features of this structure havealready been discussed. This taxonomy contains over200 pedagogical concepts that are used as tags todescribe the recordings. In the process of cataloguingthe content, these recordings are to be labelled
Leave a Comment