You are on page 1of 2

Mining for Content Re-Use and Exchange -- Solutions and Problems*

Aga the Merceron † . Ca rlo s Oliveira ‡ ,


Michel Scholl † , Carsten Ullr ich §

Eco le Supérieure d'Ingénieu rs Léonard de Vinci, ‡ Faculty o f Engineering
of the Universit y of Port o, § Saarland Un ivers it y
agathe.merceron@dev inci.fr, colive@fe.up .pt,
michel.scholl@devinci.fr, cu llrich@ags.un i-sb.de
1 Problem Statement *
Valuable information can be gained by mining meta- 3 Case Studies
data of educational resources. However, if the mined We will now describe two case studies that show how
data is annotated using IEEE Learning Objects and data mining benefits from information contained in a
Metadata standard (LOM), then significant pedagogi- pedagogical ontology.
cal information is missing.
In this paper, we will describe how an ontology of 3 .1 Mini ng o f Le a rning Pa ths
pedagogical objects that captures the “pedagogical Teachers need to be aware of whether and how learn-
semantics” of a learning resource helps data mining ing resources are used by students while learning.
to retrieve more precise information. Two case stud- This can be achieved by mining students's logs and
ies illustrate the advantages of an ontological ap- homework for associations. Association mining algo-
proach. Finally, we will point out shortcomings of rithms produce results of the form A,B->C, 80%,
this approach and propose a solution based on fusion 95% which describe the relation that if students use
of ontologies. learning resources A and B, then they use learning
resource C with a support of 80% and a confidence of
2 An Ontology of Pedagogical Objects 95%. Here, a support of 80% signifies that 80% of
In the emerging educational specification for learning the students use learning resources A, B and C. Con-
content SCORM, course structure is built up out of fidence is a measure of how much C is implied by A
self-contained units of learning. The description of and B. The higher the value, the greater the depend-
these resources is covered in LOM, which provides a ency.
detailed description of learning resources. But LOM If learning resources are annotated using a peda-
does not directly address the pedagogical purpose of gogical ontology, and additional user related infor-
a resource. The available Educational elements, in mation is taken into account, much finer association
particular the "Learning Resource Type", do not in- rules giving more information to teachers can be ob-
clude recommended values to, for instance, describe tained.
that a web page provides a definition or a counter- An example of an advanced rule is the following:
example of an item. students who study concept A, then example B, then
Ullrich [2004] describes an ontology of pedagogi- example C complete successfully test D with a sup-
cal objects that provides a vocabulary capturing the port of 80% and a confidence of 95%. This gives
“pedagogical semantics” of a virtual or text-book hints to reuse resources and compose a course for
learning resource. In general, each paragraph in a text new students. Resources that make students success-
book and each learning object in an online course ful should be recommended (as 'preferred learning
serves a particular pedagogical role. These roles are tracks') to other students or should be preferred when
reflected in the classes of the ontology. building a new course.
The ontology distinguishes between two main Additionally, dominant learning styles in learner
classes, pedagogical concepts and satellite elements. groups can be assessed. Let’s assume an association
Concepts describe the central pieces of knowledge, shows that students study a concept then solve 10
the main pieces of information being taught in a exercises in a row and complete successfully a test
course. Subclasses of concepts are fact, definition, with a support of 25% and a confidence of 98%. This
and different kinds of laws and processes. Satellites puts in evidence practical students who learn by do-
provide additional information about the concepts. ing. To cater for them, teachers have to reuse and
Direct subclasses of satellites are interactive element, share with their fellow teachers many exercises.
example, evidence, and explanation. This approach generalizes the use of associations
presented in Merceron and Yacef [2003] where stu-
* dents' homework is mined to find mistakes made
This work was supported by IST TEL NoE Kaleidoscope.
while solving exercises in propositional logic.
3 .2 Mini ng fo r Q ua lity Co ntro l 4 Limitations
Most Learning Management Systems (LMS) give ac- It is often the case when sharing pedagogical re-
cess to several statistics related to the levels of activity sources between separate sites that sites are not only
of course participants and to the course content usage. physically distributed but more important have dif-
Typical reports include the number of published docu- ferent usages. Even in the case they run the same
ments, the number of viewers, the last access, etc.. Sys- scenario, significant heterogeneity in the pedagogical
tems give also access to general statistics such as resources and description is expected. In our context
browser and operating system. it might happen that some other e-learning commu-
This is useful for teachers to understand what is hap- nity would like to use a similar but different peda-
pening during the course, but provides little insight into gogical ontology. One solution is the fusion of the
the factors that influenced the pedagogical results. For two ontologies or an upgrading of the initial one to
instance, most LMS do not provide the specific kinds of take into account the requirements and enrichments
aggregated information about courses and users which proposed by the other. If the requirements for updates
are of interest for learning managers. are infrequent and hit a few terms only the fusion or
Oliveira and Domingues [2004] describe how in- updating of an ontology is realistic. In contrast if the
formation discovery of behaviours and trends can be two communities have divergent –although similar-
improved by using multidimensional data analysis goals and evolutions, a looser connection between
and mining technologies, while providing the basis ontologies is to be preferred.
for adaptive learning and automatic course creation. A simple known model consists in connecting two
Several types of metadata were considered in this similar terms from different ontologies by an isa link.
context: student interaction metadata provides a Then, if term t in peer 1 is connected to term t’ in
measure of the progress and rate of learning and of peer 2, all resources under t become resources under
the collaboration extent; content and courseware t’. Given such a model, an interesting issue is to dis-
metadata enables quality control of the provided con- cover resources in neighbour peers and to find the
tent and path optimization based in the pedagogical best strategies for this distributed query processing,
purpose of content; evaluation and assessment meta- as described in Tzitzikas and Meghini [2003].
data helps to determine satisfaction and effectiveness
of learning and registers grades and other scores. Referen ces
Multidimensional views on these administrative [Ullrich, 2004] Carsten Ullrich. Description of an
metadata related with the course areas, the content
Instructional Ontology and its Application in Web
types, the communication tools, the user profiles or the Services for Education. Submitted to the Third Inter-
time axis, allow a deeper knowledge on the system us- national Semantic Web Conference, 2004.
age and, consequently, better informed decisions on
investment, user training, content reuse or pedagogic
[Merceron,Yacef, 2003] Agathe Merceron and Kalina
methodology developments. Yacef. A Web-Based Tutoring Tool with Mining Fa-
Examples of usage analysis include questions like cilities to Improve Learning and Teaching. In Proc.
"what are the preferred document types?", or scenarios
of the 11th International Conference on Artificial
like "why do we have high rates in message answering Intelligence in Education, Sydney, Australia, 2003,
delay?" or "what if we do not allow internal linking to U. Hoppe and F. Verdejo and J. Kay Eds., IOS Press,
teachers' homepages?". pages 201-208.
While LOM and SCORM provide a framework for
the representation and processing of the metadata, they [Oliveira, Domingues, 2004] Carlos Oliveira and
fall short in including the needed semantic density for Manuel Domingues. Multidimensional Analysis of
more specific pedagogical tracking. We thus argue that
Administrative Data in eLearning Systems. In Proc.
the use of a pedagogical ontology provides a higher of the 10th European University Information Systems
level of decision support analysis and mining, based in Conference, Bled, Slovenia, 2004, V. Mahnic and B.
qualitative issues like: the pedagogic methodologies
Vilfan Eds., University of Ljubljana Press, pages
used, the collaborative degree of activities or the under-
172-178.
standing expressed in the assessments.
For example, we could make an analysis of the com- [Tzitzikas, Meghini, 2003] Yannis Tzitzikas and
pleteness of existent learning objects, in terms of con- Carlo Meghini. Query Evaluation in Peer-to-Peer
cept description, example coverage and interactivity Networks of Taxonomy-based Sources. In Proc. of
level, and explore detected trends. The correlation be- the 10th International Conference on Cooperative
tween learner interaction with the content and exercise Information Systems, Catania, Italy, 2003.
scoring can yield important clues on the relevance of
pedagogic methodologies used and raise management
alerts where untypical patterns are detected.

You might also like