Professional Documents
Culture Documents
IN
T
else. Among the widely proposed recommendation techniques, content-based filtering [2] [3] and collaborative
filtering [4] [5] have been the most famous one in the
literature [6]. Content-based filtering is done under the
assumption that users future preferences are similar to
those they liked in the past, while collaborative filtering
is done under the assumption that if two users have
similar preferences in the past, they will have similar
preferences in the future. Between them, collaborative
filtering is more widely used and therefore attracts more
interest from researchers [7][8]. Collaborative filtering
techniques are rating-oriented and involve the participation of a high number of users who provide fewer
ratings than the items they consume. Taking that into
consideration, questions such as how to alleviate the data
sparsity while increasing the recommendation accuracy
are the main concerns in the related works.
Recently, some approaches considering factors outside
users ratings have been proposed in the literature [9].
Since recommender systems can naturally be applied
in various fields where items are categorized, their
associated datasets provide not only users ratings but
also items categories. Based on this information, many
recommender system extensions have been made.
In this paper, we propose a movie genre prediction
model based on users ratings. We apply multi-variate
Bernoulli model to estimate likelihoods that are used
in Nave Bayes rule to predict movie genres. We also
calculated the genre correlations to check if incorrectly
predicted genre is correlated with the correctly predicted
one. In general, a recommender predicting an items
category is important in sense that it can complement the
items categories assigned by a human expert, therefore
increasing the user satisfaction by providing surprising
recommendations.
The proposed approach has the following combination
of contributions in order to expand traditional recommender systems :
(1) We propose a new approach that expands the traditional recommender systems by predicting the category
of an item under evaluation.
PR
EP
R
I. I NTRODUCTION
Nowadays, web users are no longer simply considered as consumers of information but also as active
sources that generate large volume of data online. Consequently, the amount and the diversity of information
on the Internet increase exponentially. The whole body
of information that confronts the users online can cost
them more time and effort without any guarantee of
finding what they are looking for. Aiming to solve these
problems, researchers in the academe and/or industries
have suggested the use of recommender systems [1]
that overcome the information overload by facilitating
search and access to information by providing users
with relevant items in the shortest time possible. In this
context, items can be of any kind, namely a movie to
watch, a soundtrack to listen to, a webpage to visit, or
vm,u (r) =
if u rated movie m as r,
otherwise.
(1)
IN
T
P (m | g, r) =
[vm,u (r)P (u | g, r)
(2)
uU
PR
EP
R
P (u | g, r) =
P
1 + mM vm,u P (m | g, r)
P
|U | + mM P (m | g, r)
(3)
P (m | g, r) P (g | r)
P (m)
(4)
(5)
P (ujg)
0.01
900
800
0
IN
T
700
600
500
400
300
10
200
12
14
users
100
16
genres
18
-1000
PR
EP
R
B. Evaluation
In this section, we demonstrate the correctness of our
proposed method.
Movies were selected randomly in both of our
training and testing approaches. To test the prediction
power of the proposed model the following portion of
training size were chose: : 1%, 5%,10%,...,75%, 80%.
As for the testing set we always used 20% of the whole
set. None of items of testing set were included in the
training.
The first step in our algorithm is to estimate
P (u | g, r) based on the user ratings, that is described
by eq. 3. Figure 1, depicts the probability of the
user u rating a movie of category g as rating r = 1
i.e. user preference models. Figure 2, illustrates the
probability P (m | g, r) that is obtained based on
the Bernoulli event model (2) and plays a role of a
likelihood in the Bayesian eq. 4. The probability of
a failure or absence of the event vm,u (r) is given by
(1 vm,u (r))(1 P (u | g, r)). When the prediction
is incorrect, the genre correlation matrix (fig. 3) is used
to check whether the incorrectly predicted genre is
correlated with the true movies genre. If it is correlated
(cor > 0.1), we accept this prediction as a prediction
of a similar genre.
To measure the accuracy of the proposed approach on
movie genre recommendations, we plot the predictions
based on every rating from one to five. For every
training size we repeated the process of randomly
selecting training samples 20 times.
0.02
P (mjg)
-2000
1600
18
16
1400
14
1200
12
1000
10
800
8
600
6
genres
400
4
2
200
movies
0.7
TABLE I
P REDICTION RATE WHEN 80% OF THE SET USED FOR TRAINING
r=2
51.6 2.5
r=3
58.8 2.6
r=4
56.7 2.4
r=5
52.9 2.8
53.6 2.2
55.9 2.8
69.2 2.9
66.6 2.4
61.0 3.2
Correct prediction
exact
exact w/
similar
r=1
51.2 2.2
Prediction rate
Prediction rate including correlated genres
0.65
0.55
0.6
0.55
0.5
0.45
0.4
0.35
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
IN
T
0.45
0.4
0.7
0.35
Prediction rate
Prediction rate including correlated genres
0.3
1
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
Prediction rate
Prediction rate including correlated genres
0.65
0.6
Correct prediction
Correct prediction
0.5
0.55
0.5
0.45
0.4
0.6
Prediction rate
Prediction rate including correlated genres
0.35
0.55
10
PR
EP
R
Correct prediction
0.3
15
20
25
30
35
40
45
50
55
60
65
70
75
80
75
80
0.5
0.45
0.65
0.4
Prediction rate
Prediction rate including correlated genres
0.6
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
Correct prediction
0.35
0.55
0.5
0.45
0.4
0.35
0.3
10
15
20
25
30
35
40
45
50
55
60
65
70
PR
EP
R
[1] L, L., Medo, M., Yeung, C. H., Zhang, Y. C., Zhang, Z. K., &
Zhou, T, Recommender systems, Physics Reports , 519(1), 1-49,
2012.
[2] Bogers, T., & Van den Bosch, A, Collaborative and contentbased filtering for item recommendation on social bookmarking
websites, Submitted to CIKM, 9, 2009.
[3] Basilico, J., & Hofmann, T, Unifying collaborative and contentbased filtering, In Proceedings of the twenty-first international
conference on Machine learning, (p. 9), ACM, 2004, July.
[4] Su, X., & Khoshgoftaar, T. M, Review Article A Survey of
Collaborative Filtering Techniques, Advances in Artificial Intelligence, 2009.
[5] Braida, F., Mello, C. E., Pasinato, M. B., & Zimbro, G, Transforming collaborative filtering into supervised learning, Expert
Systems with Applications, 42(10), 4733-4742, 2015.
[6] Park, D. H., Kim, H. K., Choi, I. Y., & Kim, J. K, A literature review and classification of recommender systems research,Expert
Systems with Applications, 39(11), 10059-10072, 2012.
[7] Liu, Q., Chen, E., Xiong, H., Ding, C. H., & Chen, J, Enhancing
collaborative filtering by user interest expansion via personalized
ranking, Systems, Man, and Cybernetics, Part B: Cybernetics,
IEEE Transactions on, 42(1), 218-233, 2012.
[8] Vargas, S., Baltrunas, L., Karatzoglou, A., & Castells, P, Coverage, redundancy and size-awareness in genre diversity for recommender systems, In Proceedings of the 8th ACM Conference on
Recommender systems (pp. 209-216), ACM, 2014, October.
[9] Huang, Z., Chen, H., & Zeng, D, Applying associative retrieval
techniques to alleviate the sparsity problem in collaborative filtering, ACM Transactions on Information Systems (TOIS), 22(1),
116-142, 2004.
[10] Manzato, M. G, Discovering latent factors from movies genres
for enhanced recommendation, In Proceedings of the sixth ACM
conference on Recommender systems (pp. 249-252), ACM, 2012,
September.
[11] Sollenborn, M., & Funk, P, Category-based filtering and user
stereotype cases to reduce the latency problem in recommender
systems, In Advances in Case-Based Reasoning (pp. 395-405),
Springer Berlin Heidelberg, 2002.
[12] Tilwani and Tiwari, Implementation of category based recommendation module of SEP architecture using PBTA, International
Journal of Advanced Research in Computer Science and Software
Engineering, volume 3, 2013.
[13] Choi, S. M., Ko, S. K., & Han, Y. S, A movie recommendation
algorithm based on genre correlations, Expert Systems with
Applications, 39(9), 8079-8085, 2012.
[14] http://grouplens.org/datasets/movielens/100k/
IN
T
R EFERENCES