Professional Documents
Culture Documents
Abstract- Recommender systems which are simulations of standard ranking. This standard ranking yields into the list of
web personalization are now-a-days widely integrated in most relevant movies for those particular users at the top in
various domains for improving quality of fetched order. Out of this list only top-N movies will be actually
information. The most critical job for any recommender recommended to the users. Figure 1 shows the general steps
system is to provide more and more utilizable items to its followed by conventional recommender systems in general.
users. Today lots of knowledge discovery techniques are
lip
invented to dig into huge chunks of user-item information Da t a s e t
and accurately read users mind for selecting list of items .
Recent studies have shown the importance of diversified
recommendations to increase item utilization . In short
both accurate and diversified recommendations are key
quality factors of user satisfaction. Re-ranking techniques
are one of the solutions proven to be best for balancing
both accuracy and diversity. After using collaborative
filtering for standard ranking, we propose use of various
content based attributes for re-ranking recommendations
to introduce aggregate diversity selectively and flexibly.
We experimented on movilens dataset and finally propose
multi-attributed hybrid movie's content based re-ranking
technique (MCBRT) for recommendation.
alp
Re C O lll m e n d a t i o n s
2
recommenders can increase sales, and recommenders that C. Other Related Work
discount popularity appropriately may increase sales more. The K. Alodhaibi et al. [ 1 1 ] built a recommender algorithm
M. Zhang and N. Hurley [5] proposed a approach that seeks that works for compound products and services instead of just
to find out best possible subset of items to be recommended individual items. For this they implemented CARD framework
over all possible subsets. Here resultant list's similarity to target which basically separates the utility space and diversity space
query and diversity within list these two are taken as a binary to avoid the tradeoff between similarity and diversity. The
optimization problem. A new evaluation metrics, item novelty, algorithm he designed is computationally efficient and
is proposed. Item novelty means how much an item is different outperforms in terms of diversity.
than current item list. Item novelty depends upon other existing
items in user profile. Item novelty brings certain level of M. Ge et al. [ 1 2] proposed an approach where position of
difficulty for recommendations and hence can be used to items in recommendation list has given most importance.
generate useful test cases. By adjusting the novelty value the Authors said that though diversity weather individual or
tolerance in accuracy loss is balanced. Author points out that aggregate is achieved, it is not necessary that user will
probability of recommending novel items is low whenever perceive its advantages. A well diversified list may display the
similarity is the basic selection criterion. similar items at the top of the list and diverse items may be at
bottom which is not desirable always. Especially it is highly
P. Castells et at. [6] said that though various novelty and
important in case of small screen devices where user is
diversity metrics are popular in literature of recommender
systems, they do not address two important characteristics item interested in only top few recommendations.
ranking and relevance. Author concentrated on two ground B. Wang et al. [ 1 3 ] said that user's interests are always full
concepts, namely item similarity and user-item interaction. of uncertainty which could not be addressed by top-N list of
User-item interaction is modeled based on three conditions recommendations easily. Instead of this author proposed a
choice, discovery and relevance. They tried to cover and cloud model which is powerful at solving knowledge
generalize the old metrics and put in better format. uncertainties.
S. Vergas [7] showed that intent oriented information
R. B . Wagh and J. B . Patil [ 1 4] discussed about web
retrieval diversity can be applied improving recommendation
personalization techniques and use of web mining for web
diversity. They formalized the diversity and novelty metrics
and their results showed that resulting diversification personalization. They suggested novel clustering methods for
techniques can give best results. In addition these proposed web page recommendations.
Oataset I/ P
datasets have shown that diversity is improved with
MovieLe n s Predictio n
insignificant loss in recommendation accuracy. t'----lI t----�
Tec h n iq u e
B. Aggregate Diversity
T. Zhou et at. [9] developed an approach that combines a
accuracy focused algorithm and a diversity focused algorithm.
According to author such collaborations can yield best results
balancing both accuracy and diversity, without relying on any
semantic or context specific information. They used averaging
process in their algorithm that supports diversity F i n d top·k genres Find Release_year
enhancements. a n d Gen_categories categories
3
A . Use of Neighbourhood Based Collaborative Filtering (CF) Moviei d 23 107 56 78 301 211 590
Pred icted
As A Prediction Technique
4.8 4.78 4.62 4. 10 4.08 3.74 3,44
R ati n g
We have used user-based technique of neighborhood
based CF to predict the unknown ratings of different users for
Genre
different movies. We calculated similarities between users. H' H' H' 0' 0' H' 0'
We used cosine similarity metric [ 1 0] to calculate similarity category
Rel ease_year
: E iF-: I(u.,Ll,1) R(u, 1.) , R(ur O)
H" H" H"
values. Similarity of user u with user u' is given as follows:
0" H" 0" 0"
S!: n'l.(u" iLl )
catego ry
-;::======-r=======
= I I T --.:...-'-------------
.:.. movies and user ' s age attribute . Depending on the age of
I
c ategory H H H 0 0 H 0 0 0 0 H 0 0
I �
!
-l Th =2.4
Movi e i d 78 3 01 590 122 700 23 1 07 56 21 1 89 1001 1 89 34
P re d i cted 4. 1 0 4. 08 3 . 44 3 .3 2 3.10 4. 8 4.7 8 4. 62 3 .74 --- 2. 8 2 .7 5 2 .45 2. 1 ---
. ( b)
Rati ng
C ategory 0 0 0 0 0 H H H H 0 H 0 0
J
I
J- Tr=3.4 Th =2.4
Movi e i d 78 301 590 23 107 56 21 1 122 700 89 1001 1 89 34
(c) P re d i cted 4. 1 0 4. 0 8 3 . 44 4. 8 4.7 8 4. 62 3 .74 3 .32 3.10 --- 2 .8 2 .7 5 2 .45 2. 1 ---
Rati ng
C ategory 0 0 0 H H H H 0 0 0 H 0 0
Figure 4. General idea of proposed re-ranking technique multi-sattributed hybrid MCBRT with respect to standard ranking
4
As shown in figure 4 (a) the movies for user u are sorted in 20% split, are already available in the dataset itself. We have
descending order of their predicted rating values obtained in verified our results on all this disjoint sets.
step- I . The Top-5 movies to be recommended contains all TABLE T
highest predicted ratings but some of them are from home BASIC INFORMATION OF INPUT DATASET
category itself. We can take chance here and replace home
category movies with next highly rated movies of other
category in sequence as shown in figure 4 (b). We can see N u m be r of u s e rs 94 3
movies 23, 1 07 and 56 are replaced by next movies 590, 1 22
and 700 respectively. But it may be too risky for maintaining N u m be r o f mov i e s 1682
accuracy level to include certain movies like 1 22 and 700
having comparatively low values of predictions, especially N u m be r o f ra ting s 100000
where accuracy is vital. For this reason depending on the need
of situation we should have flexibility to decide required
accuracy and diversity levels . It is achieved using Ranking b) Parameters
threshold (Tr) as shown in fig. 4 (c) where Tr is set as 3 .4 and
During the whole experiment, we have used following
hence movies 1 22 and 700 are out of the top-5 competition
different parameters shown in table II with different values to
and next movies in sequence from other category is tried to
crosscheck versatility of performance of our method.
find out . As now none of the movie from other category
satisfying ranking threshold is available hence remaining TABLE II
highly ranked home category movies are appeared inside the SETTING PARAMETER VALUES
top-5 window at the empty places sequentially.
Pa rameter Va l ues
TABLE III
Accuracy vs. Diversity performance of multi-attributed hybrid MCBRT and other ranking techniques for 50 neighbors, top-N=5, top-k=3 and Th=2.5
D A D A D A D A D A
2 .5 620 0 .835 600 0.827 580 0.188 609 0.1lt 500 0.814
3 .0 590 0 .842 588 0.830 518 0.198 609 0.152 430 0.813
3 .5 580 0 .855 588 0.832 560 0.8 13 530 0.189 430 0.827
4 .5 530 0 .856 500 0.855 419 0.851 420 0.855 330 0.865
5 .0 425 0 .868 390 0.861 400 0.855 380 0.854 320 0.851
5
7 10 0
675
6 5 10
""
625
'"
6 10 0
�
� -... ....
575
5 5 10 " "' � M CBR T
'¥- "" \.
,\ \
...
525 r-- M u lti - a,tt r i b ute d M CB RT
"- "toil (\
I::: - ......
,:t::=
!!l 5 10 0
+-A- A ve r age
Q 475
"- \
"" \.
4 5 10 I t e m p - 10 0 10 u I a r it','
� \ -
425
4 10 0
375 J •
3 5 10
325
3 10 0
10 . 7 10 . 7 10 . 7 28 . 7 30 . 7 40 . 7 50 . 7 60 . 7 70 . 7 00 . 7 9 0 . 8 . 8 10 . 8 28 . 8 30 . 8 40 . 8 50 . 8 60 . 8 70 . 8 00 . 8 9 10 . 9
A O C Il <iI" , <i CV
Figure 5. Graph for accuracy vs. diversity performance of all re-ranking techniques for 50 neighbors, top-N=5, top-k=3 and Th=2.5