Professional Documents
Culture Documents
A Decision Support System To Improve E-Learning Environments (With Framework)
A Decision Support System To Improve E-Learning Environments (With Framework)
Environments
Marta Zorrilla Diego García Elena Álvarez
Department of Mathematics, Statistics Department of Mathematics, Statistics Department of Applied Mathematics
and Computation, University of and Computation, University of and Computer Science University of
Cantabria Cantabria Cantabria
Avda. Los Castros s/n Avda. Los Castros s/n Avda. Los Castros s/n
Santander, Spain Santander, Spain Santander, Spain
34 942 20 20 63 34 942 20 14 20 34 942 20 18 44
zorrillm@unican.es diego.garcias@alumnos.unican. alvareze@unican.es
es
• Finding out the questions which students fail more For this case study we have used the data from a virtual course
frequently entitled “Introduction to multimedia methods”. It is a subject of 6
ECTS which was taught in the first semester of 2009 at the largest
• Discovering the resources which are commonly used virtual campus in Spain, called G9 (this group is composed of 9
together Spanish universities; one of them is the University of Cantabria).
As it has been said, we have a data warehouse database, so we It is a practical subject in which a multimedia tool is taught. The
have the parameters from LCMS database already prepared to course is designed by means of web pages conformed to SCORM
make decisions. This means that the pre-processing tasks are [24] and include some video tutorials, flash animations and
reduced. Table 1 gathers the set of variables with which we can interactive elements. It is registered in Blackboard LMS.
work. These can be obtained with different levels of aggregation. Although the number of students enrolled in the course was 80,
For example, student time spent can be calculated per session, per only 45 did the first assignment, whose submission was 15 days
resource, per action, per week, etc. after the beginning of the course, and finally, 37 students followed
the course until the end.
First, we analyse the student profile. This template utilizes the
Table 1. Some of available variables. following input parameters: gender, age, number of sessions in the
course, time spent in the course, average sessions per week,
Variables average time spent per week.
Nº sessions Learner gender
These parameters were chosen in accordance with the instructor,
Nº hits Learner academic level since the model obtained will be explained depending on them.
Time spent Learner age We use EM (Expectation-Maximization) and KMeans as
clustering algorithms [30]. Given that the EM algorithm provides
Delay among sessions Nº chat room entered a probability distribution that can be used as a similarity criterion
Nº content pages viewed Nº wiki page edited to characterize the data, we utilize it in order to know the number
of clusters with which the KMeans algorithm will be executed
Nº messages post to forum Nº quizzes done (required parameter). We show the result obtained with KMeans
because it is easier to understand it graphically and statistically.
Nº messages read on forum Nº quizzes passed Each cluster is represented by its centroid, which means, the
Nº messages replied to forum Nº quizzes failed "average" of all its points.
As can be extracted from Figure 2, there is a 44% of females with
Nº messages sent by mail Nº items in test an age around 22 and they carry out more sessions than the rest
and these are longer. Males about 24 represent 26% of the
Nº messages read on mail Nº attempts in item population and dedicate less time per session (practically half) and
Nº assignments read Grade in item the number of visits is lower. The youngest males (31%) have a
behavior quite similar to the female group. The image on the right
Nº assignments submitted Mark in each assessment, test, shows each cluster graphically, using the percentile in which the
or assessed task cluster variable value is located. Cluster 1 and 2 are very similar,
the difference is in the gender variable which is not represented
due to the fact that it is not numerical.
In general, numerical data will be discretized into categorical
classes that are easier to understand for the instructor (categorical Next, we show the template for discovering the resources which
values are more user-friendly for the instructor than precise are commonly used together. This contains the following
magnitudes and ranges) [7]. For example, the mark could be parameters: session id, and a Boolean variable indicating if each
divided in 4 classes: fail, pass, good and excellent if the system one of the available resources in the course was visited: content-
has the criteria to do it. The equal-width method will be generally page, mail, discussion, chat, assignments, weblink, organizer,
learning objectives, assessment, calendar and others.
Average Cluster 0 Cluster 1 Cluster 2
Age 22,4872 24 22,1765 21,6667
Gender Male Male Female Male
TotalTime 1976,8065 1313,4806 2290,4953 2085,185
Sessions 128,0323 80,6032 146,4175 141,5108
AvgTimeWeek 115,8065 76,8806 134,26 122,1022
AvgSessionWeek 7,0323 4,3032 8,1233 7,7608
Instances: Instances: Instances:
26% (10) 44% (17) 31% (12)