You are on page 1of 12

Decision Support Systems 63 (2014) 52–63

Contents lists available at ScienceDirect

Decision Support Systems


journal homepage: www.elsevier.com/locate/dss

Object typicality for effective Web of Things recommendations


Yi Cai a, Raymond Y.K. Lau b,⁎, Stephen S.Y. Liao b, Chunping Li c, Ho-Fung Leung d, Louis C.K. Ma e
a
School of Software Engineering, South China University of Technology, China
b
Department of Information Systems, City University of Hong Kong, Hong Kong Special Administrative Region
c
School of Software, Tsinghua University, 100084 Beijing, China
d
Department of Computer Science and Engineering, The Chinese University of Hong Kong, China
e
SCOPE, City University of Hong Kong, Hong Kong Special Administrative Region

a r t i c l e i n f o a b s t r a c t

Available online 23 September 2013 With the rapid growth of “Web of Things” (WoT), there is a pressing need to develop effective mechanisms for
the intelligent discovery and selection of these things (items). Recommender systems are viable solutions to ad-
Keywords: dress the issue of WoT discovery and selection. However, classical recommender systems are weak in handling
Object typicality sparse recommendation spaces which characterize most WoT recommendations. Moreover, classical recom-
Recommender systems
mender systems may not be able to scale up to efficiently process a large number of things on the Web, and
Situation awareness
yet these systems may produce big-error recommendations that diminish users' trusts on utilizing WoT. The
Web of Things
main contribution of our research is the design and development of a novel recommendation method which is
underpinned by the principle of object typicality verified in the field of cognitive psychology to address the afore-
mentioned issues related to WoT recommendations. Based on the MovieLens benchmark data set, our experi-
mental results show that the proposed recommendation method is effective and produces the least big-errors.
Since the proposed method exploits data generalization by operating at item group and user group level during
recommendation time, it is more effective and efficient than other baseline methods given sparse training data.
Based on the Netflix benchmark data set that simulates a large WoT recommendation space, the proposed meth-
od also significantly outperforms state-of-the-art recommendation methods in terms of Mean Absolute Error
(MAE). The business implication of our research is that the proposed recommendation method can enhance
the situation awareness of WoT applications which facilitate the reuse of enterprise resources and the interoper-
ability among enterprises.
© 2013 Elsevier B.V. All rights reserved.

1. Introduction as humans to find, select, and utilize smart things in an effective way
[11,25]. Due to the problem of information overload [19,38,47], it is ex-
The transition from the Social Web to the Semantic Web has trig- tremely difficult for Web users to develop the situation awareness about
gered increasingly more physical devices (i.e., smart things) such as the huge number of things initiated on the Web everyday. Accordingly,
RFID chips, wireless sensors, actuators, and mobile phones to be there is a pressing need to examine the issue of smart things discovery
connected to the digital world for the development of useful real- and selection on the Web. Since recommender systems have been
world applications on the Internet (i.e., the “Internet of Things”) [12]. shown to be viable solutions for the discovery and selection of services
More recently, the vision of “Web of Things” (WoT) leads to the design on the Web [49], this paper focuses on the design and development of a
of more sophisticated applications or services by interconnecting any novel recommendation method that can facilitate users (humans or ma-
objects through the Web layer (e.g., interconnecting via the HTTP proto- chines) to discover and select relevant smart things. For example, the
col) [8,12]. For example, a movie recommendation service (i.e., a smart proposed WoT recommendation service can autonomously suggest
thing) is connected to a sensor installed at a cinema; a new WoT appli- some useful things to users based on their previous usage experience
cation is then composed to timely inform users once tickets of the rec- and the preferences of other like-minded users.
ommended movies are nearly sold out at that cinema. A large body of research about recommender systems has been
However, it has been pointed out that the emerge of a large number performed in the past two decades. Existing recommender systems
of smart things on the Web causes great difficulties to computers as well are usually classified under one of the three broad categories, namely
collaborative filtering (CF) [17,36], content-based recommendation
(CB) [28,33], and hybrid recommendation [2,24,27]. Content-based
⁎ Corresponding author.
E-mail addresses: ycai@scut.edu.cn (Y. Cai), raylau@cityu.edu.hk (R.Y.K. Lau),
recommendation methods suggest items to a user based on similar
issliao@cityu.edu.hk (S.S.Y. Liao), cli@tsinghua.edu.cn (C. Li), lhf@cuhk.edu.hk items s/he having consumed before. On the other hand, collaborative fil-
(H.-F. Leung), islma@cityu.edu.hk (L.C.K. Ma). tering methods recommend items to a user based on the preferences of

0167-9236/$ – see front matter © 2013 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.dss.2013.09.008
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 53

similar users. Hybrid recommendation methods represent a model- is referred to [29]. According to the prototypical view of concepts,
based or heuristic-based combination of the aforementioned recom- each concept is represented by the best prototype capturing the salient
mendation methods. Although these recommendation methods have properties of objects belonging to that category [26]. Vanpaemel et al.
been widely used in electronic commerce, they are faced with the fol- [42] extended the prototypical view of concepts by developing methods
lowing challenges for WoT recommendations: to identify the prototypes of a concept based on typical objects. In par-
ticular, an object is considered to be an instantiation from the most sim-
• First, data sparsity is a big challenge given few ratings for a large num-
ilar abstraction (prototype). Barsalou [3] proposed two quantitative
ber of items on the Web.
measures, namely central tendency and frequency of instantiation to esti-
• Second, computational efficiency is another concern given the sheer
mate the typicality of an object with respect to a given concept. Central
volume of information about user preferences and items on the
tendency refers to the degree of an object's “family resemblance”. An
Web. Nevertheless, it is desirable for recommender systems to gener-
object is considered to have a high central tendency if it is similar to
ate recommendations in real-time to facilitate the timely composi-
other members of the same category and it is different from the mem-
tions of WoT applications.
bers of other categories. On the other hand, frequency of instantiation
• Third, existing recommender systems may generate big-error predic-
refers to the frequency of an object being referred to by people when
tions. In other words, some items suggested by a recommender sys-
a specific concept is examined. If an object is often used as an exemplar
tem are very different from the actual preferences of a user. These
for a concept, it has a high frequency of instantiation, and therefore it is
big-error predictions diminish users' trusts on WoT applications.
considered a typical one with respect to that concept.
The main contribution of the research work presented in this paper Rifqi [34] proposed a computational method to estimate object typ-
is that we exploit the principle of “object typicality” extensively studied icality in large databases. In particular, the typicality of an object is esti-
in the field of cognitive psychology [9] to develop a novel, cognitively mated according to its resemblance to other members of the same
motivated recommendation method called Recommendation based category and its dissimilarities to the members of other categories.
On Typicality (ROT) to address the aforementioned issues arising in Lesot et al. [21] developed a similar computational method in the con-
WoT recommendations, and hence to improve the situation awareness text of fuzzy systems. Desclés and Pascu [6] applied the notion of object
of WoT applications. Since recommender systems deal with human per- typicality to construct new quantifiers for natural language processing
ceptions about objects, there is a distinct advantage of designing recom- and common sense reasoning. Cai and Leung [5] formalized object typ-
mender systems grounded in the principle of human cognition. To the icality with reference to an ontology. Hua et al. [14] applied the principle
best of our knowledge, this is the first successful design of a cognitively of object typicality to develop a typicality-based query operator that en-
motivated recommender system to enhance WoT recommendations. In hances the effectiveness of query processing in databases.
particular, ROT exploits high-level item- and user-based similarities to
facilitate WoT recommendations. The basic intuition of the proposed 2.2. Recommender systems
method is that a “typical” user of a specific user interest group should
be recommended “typical” items that the group is most interested in. The assumption of content-based recommender systems is that peo-
For example, a typical user of the “war movies” interest group tends to ple prefer items similar to those that they positively evaluate before. For
prefer the typical movies (e.g., “The Longest Day”) characterizing the in- content-based recommender systems, the central issue is to examine
terests of the group. computational methods for learning user profiles and measuring item
More specifically, the proposed ROT method first exploits the natural similarity. For example, Pazzani and Billsus [33] applied the naive Bayes
partitions of items of a given recommendation space to establish item classifier to construct a user profile that captured “relevant” and “non-
groups. Then, the user interest group pertaining to each item group is relevant” Web pages for the user. Mooney and Roy [28] developed the
identified. Finally, items are recommended to a user according to a LIBRA system for the recommendations of books. A detailed account of
novel typicality-based computational mechanism that exploits the content-based recommender systems is provided by Pazzani and Billsus
user's fuzzy memberships pertaining to various user interest groups [32].
and the typical items characterizing each of these groups. Pragmatically, On the other hand, collaborative filtering-based recommender sys-
items are recommended according to the match between user groups tems suggest items to a user based on the preferences of other like-
and item groups at execution time. In other words, the ROT method minded users. Since the collaborative filtering approach does not require
operates at a high level of data granularity (data generalization) [47]. well-structured item descriptions, it has been widely used to recommend
This is a novel way on how ROT addresses the issues of data sparsity a variety of items including images, videos, and music [1]. For example,
and computational complexity arising in WoT recommendations. GroupLens [17] and PHOAKS [40] were developed based on the collabo-
The remainder of the paper is organized as follows. Section 2 dis- rative filtering approach. The user-based CF approach first identifies the
cusses related research of recommender systems and compares existing nearest “neighbors” of a user by exploiting the user similarity relations.
work with our proposed approach. The computational details about the Then, the system predicts the rating of an unrated item based on the rat-
proposed object typicality based recommendation method are illustrat- ings given by these nearest “neighbors” [13]. Moreover, Zheng et al. [49]
ed in Section 3. In Section 4, we discuss the results of our empirical ex- developed a user-based CF approach for the discovery and selection of
periments based on the MovieLens data set1 and the Netflix data set.2 Web services. In contrast, item-based CF approach recommends items
We then summarize the main characteristics of the proposed ROT to a user based on other users' ratings of similar items. More specifically,
method in Section 5. Finally, we offer concluding remarks and describe an item-based CF system first identifies the nearest “neighbors” of an
future directions of our research work. unrated item by examining the item similarity relations. Then, the system
predicts the rating of the unrated item based on other users' ratings
2. Related work assigned to these nearest “neighbors” of items [7,36].
Hybrid recommender systems combine CB- and CF-based approaches
2.1. Research on object typicality to address the limitations of individual recommendation method
[2,27,37]. Melville et al. [27] applied a content-based approach to aug-
Psychologists have found that people are more interested in typical ment a user-item rating matrix, and then used a collaborative filtering ap-
objects than atypical ones when a concept (i.e., a category of objects) proach to generate the final recommendations. Xue et al. [46] developed a
cluster-based Pearson Correlation Coefficient method (SCBPCC) that
1
http://www.grouplens.org/. exploited both item similarity and user similarity in collaborative filtering.
2
http://archive.ics.uci.edu/ml/datasets.html. Wang et al. [43] proposed the similarity fusion method to unify user- and
54 Y. Cai et al. / Decision Support Systems 63 (2014) 52–63

item-based CF methods. Ma et al. [24] proposed an effective missing and country of production, whereas a Web service is represented by
data prediction (EMDP) method that leveraged the advantages of both the service provider, service quality, expected response time, and so
item- and user-based CF approaches. Recently, Li et al. [22] developed on. It should be noted that an item can be represented by any properties
a transfer learning-based recommendation method by means of the ex- rather than restricting to keywords. Formally, each item is represented
traction of cross-domain contextual information. Kim et al. [16] pro- by an item vector as follows.
posed a collaborative filtering approach to enrich user modeling for
more personalized recommendations. Xu et al. [45] combined social Definition 1. Item vector
network analysis and semantic concept analysis to facilitate personal-
ized researcher recommendations. An item Oi is represented by a vector of property–value pair such as
D     E
!
p Oi ¼ pOi ;1 : lOi ;1 ; pOi ;2 : lOi ;2 ; ⋯; pOi ;n : lOi ;n where lOi ; j ∈½0; 1 in-
2.3. Differences between the proposed ROT method and existing methods
dicates the fuzzy degree of Oi possessing the property pOi ; j . If lOi ; j ¼ 0 is
established, it indicates that item Oi does not possess property pOi ; j at all,
The proposed ROT method is grounded in the sound principle of
object typicality which has been verified based on numerous empirical whereas lOi ; j ¼ 1 suggests that Oi definitely possesses property pOi ; j .
tests in the field of cognitive psychology [26,29,42]. In addition, the For example, a computer Oi is represented by an item vector ! p Oi =

principle of object typicality has been successfully applied to enhance b (‘has intelCPU’: 1), …, (‘has wideScreen’: 0.8) N. For the proposed
user query processing in database systems [14]. The basic intuition of computational method, a set of fuzzy clusters is first constructed
the proposed ROT method is that a typical user of a specific user interest based on all the items under consideration. Each fuzzy cluster is
group should be recommended typical items that the user group is most called an item group which consists of similar items. An item be-
interested in. The proposed ROT method differs from existing content- longs to some fuzzy clusters with various degrees. For example,
!
based recommendation approach in that it does not directly recom- p Oi = b (‘has intelCPU’: 1), …, (‘has wideScreen’: 0.8) N may belong

mend new items to a user based on the items s/he positively rated be- to the item group “desktop computer” and the item group “netbook”
fore. Moreover, it differs from CF-based recommendation approach in with various degrees. From a cognitive perspective, each item group
that it does not need to compute the “nearest neighbors” of an individ- represents a concept of a specific domain. Formally, an item group is
ual user when a recommendation for that user is generated. In contrast, represented by a fuzzy cluster which is underpinned by the theory
the ROT method represents each user by a typicality vector which cap- of fuzzy sets [48]. Essentially, each cluster is represented by the
tures the user's typicality degrees with respect to various user interest same set of properties (e.g., keywords). However, each cluster
groups. The user typicality vectors and user groups are constructed by owns these properties with various degrees (i.e., fuzzy member-
means of an off-line system training process. ships). An item group is defined as follows.
The ROT method is different from economic choice models in that it
generates recommendations based on the typicality of a user with re- Definition 2. Item group
spect to various user interest groups and the typical items that these
groups prefer instead of the joint statistical distributions of item proper- An item group kj is a fuzzy cluster of similar objects represented by:
n o
w w w
ties and user preferences. Pragmatically, ROT operates at item group and k j ¼ O1 j;1 ; O2 j;2 ; ⋯; Om j;m where m represents the number of items; Oi
user group level rather than individual user and item levels during rec- is an item, and wj,i is the typicality degree of item Oi for the cluster kj.
ommendation time. This is a unique way on how the proposed ROT Formally, the construction of fuzzy clusters is represented by a map-
method addresses the issues of data sparsity and computational efficien- ping function (i.e., a fuzzy membership function): m : O × K ↦ [0,1],
cy for WoT recommendations. Moreover, there is a good potential for where O is a set of items and K = {k1,k2, ⋯,kn} is a set of clusters. The
the ROT method to address the problem of big-error predictions due term n denotes the number of clusters induced based on the natural pat-
to the sound principle of object typicality. It should be noted that the terns of the set of items O. The term wj,i denotes the typicality degree
proposed ROT method differs from clustering-based CF methods such (i.e., fuzzy membership) of an item Oi for the item group kj. For the pro-
as SCBPCC [46] in that ROT applies data clustering to identify item posed ROT method, different clustering methods can be applied to con-
groups and user groups, whereas SCBPCC invokes clustering processes struct the fuzzy clusters (i.e., item groups) [23,44]. However, the search
to predict missing ratings. Though the proposed ROT method utilizes for the most effective clustering method given a specific domain will be
item descriptions to construct item groups, it does not involve extra left as part of our future work. If a clustering method is effective, the se-
data processing when compared to existing CB and hybrid methods mantic distances among different fuzzy clusters tend to be large (i.e., the
which also leverage item descriptions to generate recommendations. overlapping among item groups is small).
According to the prototypical view of concepts [5,29], each item
3. The computational model for typicality-based recommendations group can also be represented by a single prototype induced based
on the set of object instances described by the corresponding con-
In this section, we illustrate the computational details of the pro- cept. For a computer-based implementation of concept prototype,
posed typicality-based recommendation model which predicts users' previous research work has proposed using a prototype vector to
ratings on items based on both user typicality and item typicality. The represent a concept prototype which is an abstraction of a group of
concept of typicality provides a sound mechanism to rank items in a similar objects [5,29,42]. A prototype vector is induced based on
way that is close to human perception [9,29]. Since the main functional- the set of similar objects described by the concept. For example, a
ity of recommender systems is to predict users' preferences about items, prototype vector can be composed based on the mean, median, or
it is desirable to design recommender systems that can imitate the way mode vector derived from the set of object (item) vectors of a specif-
how people evaluate real-world items. ic category (concept) [21,29]. Formally, a prototype vector is defined
as follows.
3.1. The computational apparatus of object typicality
Definition 3. Prototype vector
To illustrate the computational apparatus of the proposed typicality-
based recommendation method, we first formally define items, item D     
groups, users, and user groups, respectively. Let U and O denote a set A prototype vector !
t kj ¼ pk j ;1 : r k j ;1 ; pk j ;2 : r k j ;2 ;⋯; pk j ;m : r k j ;m

of users and a set of items (objects). Each item is represented by some is a vector of property–value pairs which represent the prototype of an
properties. For example, a movie is represented by directors, actors, item group kj. The term m denotes the number of properties of the
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 55

prototype kj, and r k j ;i ∈½0; 1 represents the fuzzy degree of kj possessing items that the user group is most interested in. Therefore, the recom-
the property pk j ;i . mendation strategy of the proposed ROT method is that if an item Oi
For the special case, if r k j ;i ¼ 0 holds, it means that the item group kj is a typical (atypical) item of the item group kj, and a user Uj is a typ-
does not possess the property pk j ;i at all. In contrast, if r k j ;i ¼ 1 holds, it ical (atypical) user of the user group gj that corresponds to the item
suggests that kj definitely possesses the property pk j ;i . For the set of group kj, item Oi should be recommended to the user Uj with a high
users U, each individual user may have specific favorites or preferences (low) recommendation score. In this case, the pair (kj,gj) (i.e., the
for some kinds of items (i.e., item groups). For instance, a user Bob may matching item group and user group) represents a specific “recom-
be interested in “Horror” movies, while his wife Amy may prefer mendation context” discovered via un-supervised item clustering
“Adventure” movies, and yet their son Tom likes “Thrillers” movies. and user clustering. Formally, the recommendation score of Oi for
For each induced item group ki, the proposed recommendation method U
!j is!estimated
 via the typicality-based recommendation function RS
O i ; U j : O×U↦½0; 1, where O is a set of items represented by the cor-
constructs a fuzzy set [48] of users who like the corresponding item
group. Within a user interest group gi ⊂ U, each user Ux ∈ gi is interest- responding item group vectors, and U is a set of users represented by
ed in the group of items to various degrees. For example, Bob likes the corresponding user group vectors. More specifically, the pro-
“Horror” movies to a higher degree than Tom does. The set of users posed recommendation function RS should observe the following
pertaining to an item group ki is called a user group gi, and it is repre- axioms.
sented by a fuzzy set. The semantics of gi is that it describes the set of  !
Axiom 1. For a user Uj and an item Oi, if ∀ x, vx,j = 0, then RS !
O i ; U j ¼ 0.
users who like the particular item group ki. The membership of each
member Ux ∈ gi indicates the typicality degree of Ux with respect to gi.
 !2. For a user Uj and an item Oi, if ∃ x, vx,j = 1 and wx,i = 1, then
Axiom
The fuzzy membership of Ux for gi is called the user typicality. The ratio- RS !
O i ; U j ¼ 1.
nale of using fuzzy sets to represent item groups and user groups is that
an item may belong to multiple item groups and a user may also have Axiom 3. For a user Uj and two items Ox and Oy,if ∃ ki : wi,x N wi,y, and
! 
multiple interests pertaining to different user interest groups in the ∀ h ≠ i : wh,x = wi,y, then RS ! ! !
O x ; U j NRS O y ; U j .

real-world.
Axiom 4. For two users Ux and Uy and one item Oi, if ∃ gj : vj,x N vj,y, and
! ! 
Definition 4. User group ∀ h ≠ j : vh,x = vh,y, then RSð! !
O i ; U x ÞNRS O i ; U y .
Axioms 1 and 2 specify the boundary cases of generating the recom-
n o mendation scores. If a user does not like any item groups at all (i.e., the
v v v
A user group g i ¼ U 1i;1 ; U 2i;2 ; ⋯; U mi;m is a fuzzy set of users who like
user is not typical in any user group), it means that the user does not like
the corresponding item group ki. The term m denotes the number of any items for all item groups, and so the recommendation score of any
users of the group gi, and Ux represents a user. The term vi,x ∈ [0,1] item for the user should be zero. For a user Uj and an item Oi, if Uj likes
(i.e., fuzzy membership) is the typicality degree of the user Ux for the an item group kx with the highest degree (i.e., Uj is the most typical
user group gi. user of the user group gx), and Oi is the most typical item of the item
There are four important components of the proposed recom- group kx, the ROT system should recommend Oi to Uj with the highest
mender system, namely items, users, item groups, and user groups. recommendation score. On the other hand, Axioms 3 and 4 specify the
Fig. 1 shows the relationships between these four components. For influence of user typicality degrees from user groups and item typicality
each item, it has different typicality degrees with respect to the degrees from item groups, respectively. If an item is more typical for
item groups. Similarly, each user has different typicality degrees some item groups, and a user is more typical for the corresponding
with respect to different user groups. It should be noted that item user groups, the ROT system should recommend the item to the user
groups are first constructed via a chosen clustering method, and with a relatively high recommendation score. Accordingly, we propose
then the user group pertaining to each item group is composed. An a novel typicality-based recommendation function RS that observes all
item Oi may have a fuzzy membership wj,i N 0 for multiple item the aforementioned axioms.
groups, whereas a user Ux may also have a fuzzy membership vi,
Xn
x N 0 for multiple user groups.   wx;i  vx; j
! !
O i; U j ¼ ð1Þ
x¼1
According to the principle of “object typicality”, a “typical” user of RS
n
a specific user interest group should be recommended “typical”
The term n represents the number of item groups (also the number
of user groups). Moreover, wx,i is the typicality degree of the item Oi for
the item group kx, and vx,j is the typicality degree of the user Uj for the
user group gx that corresponds to the item group kx. As can be seen,
the proposed RS function is underpinned by the typicality degrees of
items and the typicality degrees of users. The computational details of
deriving these typical degrees will be illustrated in Sections 3.2 and
3.3, respectively. It should be noted that Eq. (1) does not rate an item
based on the average rating (i.e., simply recommending popular items
to a user). In contrast, it takes a user's specific interests into account
by evaluating their typicality with respect to various user interest
groups. Even though Eq. (1) produces a recommendation score that
falls in the unit interval, the proposed ROT system can simply invoke a
linear conversion function ϑ : [0,1] ↦ [Min, Max] that maps the unit
interval to any desirable range [Min, Max] of recommendation scores
for a variety of WoT recommendation applications.

3.2. Estimating typicality degrees of items in item groups

Fig. 1. The relationships among items, item groups, users, and user groups of the ROT Previous research in the field of cognitive psychology shows that
model. central tendency (i.e., an object's similarity to a concept prototype) is
56 Y. Cai et al. / Decision Support Systems 63 (2014) 52–63

one of the main factors that largely influences an object's typicality the following aggregation function to estimate central tendency ct
with respect to a concept [4,5]. When the principle of object typical- of an item Oi.
ity is applied to design the proposed recommender system for WoT      
recommendations, the central tendency of an item (i.e., an object) ! ! ! ! ! !
τk j ðOi Þ ≈ ct pO ; t c
i
¼ simint pO ; t c
i
 dissimext pO ; t c
i
ð5Þ
with respect to an item group (i.e., a concept) is estimated based
on the similarity between the corresponding item vector and proto-
type vector of the item group. More specifically, the central tendency For the proposed ROT method, each item has a typicality degree
of an item is estimated based on its internal similarity and external against each item group. Accordingly, an item typicality matrix is need-
dissimilarity [3,5]. Internal similarity refers to the similarity between ed to characterize item typicality. Formally, an item typicality matrix is
an object and the prototype of the corresponding concept, whereas defined as follows:
external dissimilarity is the distance between the object and the pro- Definition 5. Item typicality matrix
totypes of other concepts. As discussed in Section 3.1, objects and
prototypes of concepts are represented by the corresponding item An item typicality matrix MIK captures the typicality degree for each
vectors and prototype vectors of item groups. Conceptually, a simi- item Oi against each item group kj of a recommendation domain, and it
larity function sim is applied to map the product of the set of items has the form:
and the set of prototypes of item groups to the unit interval:
sim : P × T ↦ [0,1], where T is a set of prototype vectors of item 8       9
< k1 : w1;1 ; k2 : w2;1 ; ⋯; kn : wn;1 >
> =
groups, and P is a set of property vectors of items. For ! each pair of
! MIK ¼    ⋯   
item vector ! !
p Oi ∈P and prototype vector t c ∈T , if sim t c ; p Oi ¼ 1 is >
: k :w >
;
1 1;m ; k2 : w2;m ; ⋯; kn : wn;m
established, it means that the item Oi is certainly typical with respect
to the prototype c of the item group kj. On the other hand, if sim
! !  where w j;i ¼ τk j ðOi Þ is the typicality degree of item Oi for the item group
t c ; p Oi ¼ 0 is true, it suggests that item Oi is totally atypical with re-
spect to the prototype c of the item group kj. kj.
For the proposed ROT method, any standard similarity functions
discussed in the literature can be used [35]. For the implementation 3.3. Estimating typicality degrees of users in user groups
of our current prototype system, we employ the following similarity
function: For the proposed ROT method, estimating the typicality degrees of
users with respect to various user groups is also an important proce-
  0 ∀k; ∃lOi ;k ¼ 0 dure. Similar to the item typicality matrix, a user typicality matrix cap-
!
sim !pO ; t c ¼ ð2Þ
i κ otherwise tures the typicality degrees of a user against various user groups.

0 0vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1θ 1 Definition 6. User typicality matrix


u n  2
uX 
κ ¼ exp@−@t r c;k −lOi ;k  A A ð3Þ
k¼1
A user typicality matrix MIG captures the typicality degree for each
user Ui against each user group gj of a recommendation domain, and it
where n represents the number of properties of the universe of dis- has the form:
course. The term lOi ;k represents the degree to which item Oi possesses 8       9
>
< g 1 : v1;1 ; g 2 : v2;1 ; ⋯; g n : vn;1 > =
the property pOi ;k , while rc,k indicates the degree to which prototype c
MIG ¼    ⋯   
possesses the property pc,k. The parameter θ ∈ R+ is a positive real num- >
: g :v >
;
1 1;m ; g 2 : v2;m ; ⋯; g n : vn;m
ber which is applied to tune the rate of decay of the exponential function
so that the proposed similarity function is effective for different recom-
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
 2ffi where v j;i ¼ τg j ðU i Þ is the typicality degree of user Ui for the user group
n  
mendation applications. The Euclidean distance ∑k¼1 r c;k −lOi ;k  is gj.
applied to estimate the similarity between an item vector ! p O and a pro-
In order to estimate the typicality degree of a user for each user
i
group, we need to interpret the principle of central tendency in
totype vector !t c . The proposed exponential function produces a high
the context of typical user group formation. In particular, a user in-
typicality degree if an item is extremely close to a prototype. On the
terest group gj is constructed with respect to a specific item group
other hand, the resulting typicality degree substantially decreases if
kj (i.e., the items generally preferred by the group of users). Under
the item is only moderately similar to the prototype [29].
such a circumstance, if a user Ui often rates the items of kj, s/he has
The internal similarity of an item Oi with respect
 to the prototype
! ! c of
! the tendency to prefer these items. Consequently, s/he is likely to
an item group is computed according to simint ! p Oi ; t c ¼ sim p Oi ; t c . In
 !
 be the typical member of the corresponding user interest group gj.
addition, the external dissimilarity dissimext ! p Oi ; t c is taken as the av-
In fact, it reveals Ui's central tendency in gj. Moreover, if the user Ui
erage of dissimilarities between the item and other prototypes exclud-
highly rates the items of kj, it also shows their tendency of preferring
ing the current one. More specifically, the external dissimilarity
those items. As a result, Ui has a high central tendency in gj. Accord-
between an item Oi and the prototype c of an item group is defined as
ingly, we estimate the central tendency, and hence the typicality de-
follows:
gree of a user Ui with respect to a user group gj based on the
X   frequency of Ui rating each item Oi ∈ kj and the rating that Ui spec-
!
  dissim !pO ; t x
dissimext ! !
pO ; t c ¼
x∈C=fcg i
ð4Þ ifies for Oi ∈ kj. Formally, these two factors are expressed by sig ; f
jC j−1
i j
and sig j ;r , respectively:

where C is the set of prototypes of the item groups for a given rec- i N j;i
ommendation domain. For the dissimilarity function dissim, it is de- sg j ; f ¼ Xn ð6Þ
    N
y¼1 y;i
! !
fined by dissim ! i
!
p O ; t c ¼ 1−sim p O ; t c
i
. Finally, the central
tendency of an item with respect to an item group is estimated XN
i y¼1
j;i
w j;y  Ri;y
based on an aggregation function that combines the internal simi- sg j ;r ¼ ð7Þ
larity and external dissimilarity [21]. More specifically, we propose N j;i  R max
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 57

where n is the total number of item groups. The term Nj,i represents compare our experimental results with published results. Formally,
the number of items that user Ui has rated for an item group kj. Ri,y is MAE is defined by:
the rating that Ui assigns to the item Oy ∈ kj. The term wj,y repre- Xn
sents the typicality degree of Oy for the item group kj, and Rmax is j f i −hi j
MAE ¼ i¼1
the maximal rating value for any items. n
Finally, the typicality degree v j;i ¼ τg j ðU i Þ of a user Ui for a user
group gj is approximated by the aggregation of the above two where n is the number of recommendations. fi is the user-specified
measures through a standard convex function. The weight factor rating, and hi is the prediction provided by a recommender system.
α = 0.5 is specified to assign equal weight for sig ; f and sig j ;r , respec- If the MAE score is low, it suggests that the recommender system
j
tively. The proposed user typicality function has the property that can predict users' ratings with a high accuracy.
if the user Ui often assign high rating values for the items Oi ∈ kj, Another common metric for the evaluation of recommender sys-
Ui's typicality degree for the user group gj that corresponds to kj tems is root mean square error (RMSE). Since the errors are squared be-
will be high. fore they are averaged, the RMSE measure is more sensitive to big-error
predictions when compared to the MAE measure. In general, the RMSE
i i score achieved by a recommender system is always greater than or
τg j ðU i Þ ≈ α  sg j ; f þ ð1−α Þ  sg j ;r ð8Þ equal to its MAE score. The greater difference between the MAE score
and the RMSE score achieved by a system, the larger variance of the in-
dividual errors produced by the system will be. Formally, the RMSE
4. System evaluation measure is defined as follows.
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
sX
n 2
In this section, we discuss the experiments that evaluate the perfor- i¼1 i
j f −hi j
RMSE ¼
mance of the proposed typicality-based recommendation method when n
compared to that of state-of-the-art recommendation methods. In par-
ticular, we try to answer the following research questions by running
two series of experiments: 4.2. The experimental procedures
1. How does item clustering affect the recommendation quality of the
For all the experiments reported in this paper, we applied the topic
proposed ROT method?
modeling based clustering method to generate item groups [39]. After
2. Does the proposed ROT method produce less big-error predictions
the clustering process, a prototype vector was extracted for each cluster
when compared to existing recommendation methods?
(i.e., item group). In particular, the top k most probable (typical) key-
3. Does the proposed ROT method achieve good performance given
words were applied to construct each prototype vector after the proba-
sparse training data?
bilistic topic modeling process was invoked. For our experiments, the
4. Is the proposed ROT method more efficient than some classical rec-
parameter k = 20 was empirically established. Table 1 shows the sam-
ommendation methods?
ple prototype vector for the “crime movie” item group generated by
5. Is the proposed ROT method effective when compared to state-of-
means of a topic modeling process [39]. According to our empirical ob-
the-art recommendation methods?
servation, relevant keywords often appear in movie descriptions. As a
result, topic modeling processes tend to extract useful topics that char-
4.1. Evaluation data sets
acterize different types of movies. However, we do not make a claim
that topic modeling is the best clustering method. Searching for the
Since a benchmark data set specifically constructed for the eval-
most effective clustering method for a given domain will be left as
uation of WoT recommender systems is not available, we applied
part of our future work. After the construction of item groups, the user
the common MovieLens [24,36] and Netflix [18] data sets to evalu-
group corresponding to each item group was constructed according to
ate the effectiveness of the proposed ROT prototype system. The
the computational method illustrated in Section 3.3. To answer the
MovieLens data set we used contains 100,000 ratings generated
aforementioned research questions, we performed two series of
by 943 users for 1682 movies. Each user has rated at least 20
experiments.
movies, and the ratings are in the range between 1 (bad) and 5 (ex-
The objective of the first series of experiments was to examine the
100;000
cellent). The sparsity level of this data set is 1−943×1;682 ¼ 0:9369. We
impact of the number of item groups and the sparsity of training data
downloaded the descriptions of movies from the IMDB movie site.3
on the recommendation quality of the proposed ROT prototype system.
Then, the properties of the movies (items) were identified by
In addition, the computational efficiency of the ROT system was evalu-
extracting the corresponding keywords from these descriptions.
ated when compared to several classical recommender systems. The
Since only the descriptions of 1334 movies were available, we
classical recommender systems involved in our experiments included
used this subset of movies in our experiments. In addition, we
a content-based recommender system (CB) that used the cosine
also evaluated the ROT system based on a subset of the Netflix
data set which contained 100,480,507 ratings offered by 480,189
users for 17,770 movies. The sparsity level of our Netflix data set Table 1
100;480;507
is 1−480;189×17;770 ¼ 0:9882. The Prototype of the Item Group “Crime Movies”.

Properties Membership Properties Membership


4.1. Performance metrics
Murder 0.9830 Police 0.8216
Prison 0.3689 Suicide 0.3662
To measure the accuracy of the experimental system, we adopted Revenge 0.3642 Serial-killer 0.3628
the Mean Absolute Error (MAE) which is defined as the average abso- Courtroom 0.3612 Obsession 0.3030
lute difference between the system predicted ratings and the actual Investigation 0.2564 Lawyer 0.2404
user ratings [1]. MAE is commonly used by researchers to evaluate Robbery 0.1830 Police-officer 0.1216
Fugitive 0.1089 Judge 0.1066
the effectiveness of recommender systems, and so it is easy to
Chase 0.1042 Confession 0.0982
Deception 0.0961 Smoking 0.0933
3 Gunfight 0.0864 Escape 0.0840
http://www.imdb.com/interfaces.
58 Y. Cai et al. / Decision Support Systems 63 (2014) 52–63

similarity function to compute item similarity, a user-based collabora- Table 2


tive filtering system (UBCF) that employed the Pearson Correlation Co- Sensitivity of MAE w.r.t. K and training/test ratios.

efficient to compute user similarity, an item-based collaborative 0.1 0.3 0.5 0.7 0.9
filtering system (IBCF) that used Pearson Correlation Coefficient to com-
K = 10 0.8291 0.7804 0.7722 0.7664 0.7624
pute item similarity, a naive hybrid recommender system, and a collab- K = 15 0.8176 0.7774 0.7611 0.7587 0.7529
orative filtering system that supported missing data prediction (EMDP) K = 20 0.8231 0.7806 0.7689 0.7645 0.7582
[24]. For these experiments, we divided the whole data set into two K = 25 0.8262 0.7843 0.7688 0.7654 0.7607
K = 30 0.8308 0.7850 0.7686 0.7657 0.7624
parts namely, a training set and a test set. Each system under testing
K = 35 0.8414 0.7887 0.7754 0.7675 0.7645
was first trained using the training set, and then its performance was
evaluated based on the test set. We applied 5-fold cross-validation to
compute the average MAE and RMSE scores for each system. To assess clusters) has a high predictive power [41]. We tried different numbers
ROT's performance under the condition of sparse training data, various of topics and computed the corresponding perplexities. Fig. 2 shows
divisions of the training and the test sets were tried [36]. For instance, our experimental results; it seems that K ∈ [15,28] leads to lower per-
a splitting ratio x = 0.8 indicates that 80% of the evaluation data are plexity values. We then adopted the range of K ∈ [10,35] to establish
randomly selected to build the training set and the remaining 20% of the number of item groups for the proposed ROT system. For each cho-
data are used to create the test set. sen K value, we applied a different training/test splitting ratio
For the second series of experiments, we compared the performance x ∈ [0.1,0.9] to simulate different data sparsity conditions. Finally, 5-
of ROT with that of state-of-the-art recommender systems based on fold cross-validation was applied to compute the MAE and the RMSE
both the MovieLens [24,36] and the Netflix [18] benchmark data sets. scores achieved by the ROT system. Tables 2 and 3 report the MAE
The state-of-the-art recommender systems included cluster-based and the RMSE scores achieved by the ROT system with respect to vari-
Pearson Correlation Coefficient (SCBPCC) [46], Weighted Low-rank Ap- ous numbers of item groups and training/test splitting ratios.
proximation (WLR) [30], and Transfer Learning-based Collaborative Fil- Our experimental results show that both the number of item groups K
tering (CBT) [22]. Similar to previous research [22,24], we extracted a and the training/test splitting ratio have an impact on the system's recom-
subset of 500 users from the MovieLens data set. For instance, we ran- mendation performance. For example, both the MAE and the RMSE scores
domly selected the first 100, 200 and 300 users from the benchmark are the lowest when K = 15 is applied. The MAE and the RMSE scores
data set to build different training sets such as ML100, ML200, and slightly increase when K is deviated from 15 (e.g., K = 10 and K = 20).
ML300, respectively. The remaining 200 users were then applied to However, the absolute differences of the MAE and the RMSE scores are
build the test set. Moreover, similar to previous research, three different small after a reasonable range of K values are established. Accordingly,
sizes of the observed ratings such as 5 ratings (Given5), 10 ratings we applied the empirically established parameter value (e.g., K = 15)
(Given10), and 15 ratings (Given15) were applied to compute user sim- to the remaining experiments. On the other hand, more training data
ilarity for each user [22,24]. To better simulate a WoT recommendation (e.g., x = 0.9) lead to better recommendation performance, that is, a
environment, the larger and more sparse Netflix data set was applied to lower MAE or RMSE score. Although the proposed empirical method
further evaluate the performance of the ROT system. Based on this larg- may not be able to identify the optimal number of item groups, applying
er benchmark data set, we compared the performance of ROT with that more sophisticated methods such as genetic algorithms [20] for parame-
of other state-of-the-art recommender systems such as SVD [31], ter tuning will only further improve the performance of the proposed ROT
SVD++ [18], and socialMF [15] which were made available via the pub- system reported in this paper. However, more sophisticated parameter
licly accessible MyMediaLite library [10]. tuning will be left as part of our future work.

4.3. Experimental results 4.3.2. Recommendation quality


We performed a comparative evaluation of the performance of ROT
4.3.1. The impact of number of item groups on recommendation performance when compared to that of classical recommender systems under differ-
For the proposed ROT system, determining the number of item ent training/test splitting ratios. Figs. 3 and 4 show the comparative
groups (clusters of items) is an important design issue. For topic model- MAE and RMSE performance of the experimental and the baseline sys-
ing based conceptual clustering, perplexity is a common measure ap- tems. It is obvious that the proposed ROT system outperforms all the
plied to estimate a reasonable number of clusters with respect to the baseline systems for both MAE and RMSE under various training/test
natural partitions of a given data set [41]. Accordingly, we empirically splitting ratios. For example, ROT achieves 0.7529 and 1.0327 for MAE
established an appropriate number of item groups K based on the and RMSE, respectively for a training/test splitting ratio x = 0.9, while
movie descriptions pertaining to the MovieLens data set. In general, a EMDP (the best baseline system) achieves 0.8040 and 1.0826, respec-
small perplexity value implies that the learned model (e.g., the derived tively. Under a testing/test splitting ratio of x = 0.3, ROT achieves
0.7774 and 1.0705 for MAE and RMSE, while the best performing base-
line system IBCF achieves 0.8803 and 1.1544, respectively. This experi-
mental result indicates that the recommendation quality of the ROT
system is higher than that of classical recommender systems.
Tables 4 and 5 depict the details of MAE and RMSE improvements
achieved by the proposed ROT system. When compared to the EMDP
baseline system, the performance improvement achieved by ROT

Table 3
Sensitivity of RMSE w.r.t. K and training/test ratios.

0.1 0.3 0.5 0.7 0.9

K = 10 1.1290 1.0703 1.0592 1.0526 1.0442


K = 15 1.1161 1.0705 1.0434 1.0451 1.0327
K = 20 1.1234 1.0705 1.0524 1.0463 1.0365
K = 25 1.1203 1.0719 1.0523 1.0503 1.0454
K = 30 1.1272 1.0781 1.0563 1.0491 1.0425
K = 35 1.1440 1.08174 1.0636 1.0528 1.0487
Fig. 2. Perplexity vs. number of item groups.
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 59

Table 4
Comparative MAE improvements of ROT under different training/test ratios.

0.1 0.3 0.5 0.7 0.9

EMDP 17.56% 14.92% 9.64% 8% 6.35%


CB 14.17% 17.09% 18.54% 18.78% 19.34%
IBCF 9.89% 11.69% 12.08% 12.1% 12.03%
UBCF 11.77% 12.27% 11.55% 10.53% 9.63%
Naive hybrid 19.08% 20.62% 14.97% 12.91% 11.39%

construction of item groups does not rely on any rated training data at
all. Consequently, the proposed ROT system is less susceptible to sparse
training data. From a cognitive perspective, the proposed typicality-
based recommendation method operates based on the sound object
typicality principle in that the typical user of a specific user interest
Fig. 3. Comparative MAE performance of ROT. group should be recommended the typical items that the user group is
most interested in. From a computational perspective, the proposed
typicality-based recommendation method mainly operates at the
reduces with the increment of the training/test splitting ratio. However,
group level rather than individual item or user level. More specifically,
when compared to the IBCF and the CB baseline systems, ROT achieves a
item groups are matched with user groups in order to generate recom-
larger performance improvement with the increment of the training/
mendations during the recommendation time. This can be seen as a
test splitting ratio. For the training/test splitting ratio x N = 0.4, ROT
kind of “generalization” approach to address the data sparsity issue of
outperforms the best baseline system EMDP by at least 6.35% in terms
WoT recommendations.
of MAE, and at least 4.6% in terms of RMSE. On the other hand, for train-
ing/test splitting ratio x b 0.4, IBCF is the best performing baseline sys-
tem. ROT outperforms IBCF by at least 9.89% in terms of MAE and at 4.3.4. Big-error predictions
least 5.67% in terms of RMSE. According to paired one-tail t -tests, all As mentioned before, big-error predictions produced by recom-
the performance improvements achieved by the ROT system are statis- mender systems can seriously harm users' trusts on recommender sys-
tically significant. tems. For this experiment, we evaluated the big-errors produced by the
ROT and other baseline systems. For the MovieLens data set, a rating
4.3.3. The impact of sparse training data value between 1 and 5 is assigned to an item. So, the biggest prediction
Figs. 3 and 4 also highlight an interesting property of the ROT system error is 4. Fig. 5 shows the comparison of the prediction errors (PE)
in that it achieves the best MAE and RMSE scores when compared to committed by the ROT and other baseline systems. Our experimental re-
other baseline systems even with a low training/test splitting ratio sult was obtained with 15 item groups (K = 15) and a training/test
(i.e., sparse training data). For instance, ROT achieves a MAE score of splitting ratio of x = 0.9. The legend PE = 0 means that there is no pre-
0.8176 and a RMSE score of 1.1161 for the lowest training/test splitting diction error, and PE = 4 implies that the difference between a system
ratio x = 0.1. The EMDP baseline system achieves similar results only predicted rating and the actual user rating is 4 (the big-error). As shown
when sufficient training data is available (e.g., a training/test splitting in Fig. 5, the proposed ROT system produces the most correct predic-
ratio x = 0.7). Such a characteristic of ROT is desirable for a WoT recom- tions (e.g., 38.58% predictions with PE = 0) and the most small errors
mendation environment where little user ratings (i.e., training data) are (48.92% predictions with PE = 1) among all the systems. Besides, the
available given a huge user-item recommendation space. ROT system only generates 0.05% predictions with PE = 4 (the big-
The reason of such a significant performance improvement of ROT errors). As a whole, ROT generates 12.5% prediction errors for PE ≥ 2,
over other baseline systems is that it is difficult for collaborative filtering while the EMDP and the CB baseline systems produce 16% and 29% pre-
based systems to accurately identify the nearest “neighbors” for each diction errors at the same level. The IBCF, UBCF and Naive Hybrid base-
user given sparse training data (e.g., commonly rated items). As a result, line systems all produce 17% or above prediction errors at the same
the recommendation accuracy of these systems is generally low. In con- level. It is obvious that the proposed ROT system produces the least
trast, the ROT system relies on the item typicality of item groups and the big-error predictions when compared to other baseline systems. For
user typicality of user groups to generate a recommendation. The other training/test splitting ratios (i.e., x b 0.9), similar results were
observed. For the reason of brevity, we only report the typical result
(e.g., x = 0.9) in this paper.

4.3.5. Evaluation of computational efficiency


Since a WoT recommendation environment involves a large number
of items and users, computational efficiency is a major concern. Fig. 6
shows the recommendation time consumed by all systems under the

Table 5
Comparative RMSE improvements of ROT under different training/test ratios.

0.1 0.3 0.5 0.7 0.9

EMDP 10.1% 9.32% 7.12% 5.26% 4.6%


CB 8.91% 11.08% 12.72% 12.53% 13.5%
IBCF 5.67% 7.26% 8.29% 8.13% 8.84%
UBCF 7.33% 8.02% 7.89% 6.94% 7.07%
Naive hybrid 11.97% 13.34% 10.06% 8.09% 7.39%
Fig. 4. Comparative RMSE performance of ROT.
60 Y. Cai et al. / Decision Support Systems 63 (2014) 52–63

Table 6
Comparison with state-of-the-art methods on MAE.

Training set Methods Given5 Given10 Given15

ML100 SCBPCC 0.874 0.845 0.839


WLR 0.915 0.875 0.890
CBT 0.840 0.802 0.786
ROT 0.807 0.797 0.781
ML200 SCBPCC 0.871 0.833 0.828
WLR 0.941 0.903 0.883
CBT 0.839 0.800 0.784
ROT 0.813 0.771 0.764
ML300 SCBPCC 0.870 0.834 0.819
WLR 1.018 0.962 0.938
CBT 0.840 0.801 0.785
ROT 0.785 0.755 0.753

Fig. 5. Comparative performance of big-error predictions. rich training data (e.g., ML300 and Given15), ROT achieves the lowest
MAE scores when compared to state-of-the-art systems such as
SCBPCC, WLR, and CBT.
training/test splitting ratio of x = 0.9. For the proposed ROT system, We also compared the performance of ROT with that of three state-
item groups and user groups are computed in advance during the sys- of-the-art recommender systems, SVD, SVD++, and SocialMF because
tem training time. During the recommendation time, the ROT system these systems achieved very good performance under the larger Netflix
only needs to aggregate the typicality degrees of items and users in benchmark data set [18]. For this experiment, we utilized the Netflix
order to generate the final recommendation scores. As a result, the benchmark data set and adopted a training/test splitting ratio of x =
ROT system can efficiently generate the recommendation scores on 0.8. The experimental results that were generated based on 5-fold
the fly. The CB baseline system is the second best system in terms of rec- cross-validation are plotted in Figs. 7 and 8, respectively. According to
ommendation time because it only needs to compute the similarities Fig. 7, the proposed ROT system significantly outperforms SVD,
between a target item and all the items rated by the current user. Both SocialMF, and SVD++ by 5.27% (t(4) = 30.0, p b. 01), 4.45% (t(4) =
the UBCF and the IBCF baseline systems need to identify the “nearest 73.4, p b. 01), and 3.16% (t(4) = 61.6, p b. 01) in terms of MAE, re-
neighbors” of an item or a user in the entire recommendation space. spectively. Moreover, the ROT system achieves a comparable RMSE per-
As a result, they are not as efficient as the ROT system and the CB sys- formance with that of state-of-the-art systems. Our post-experimental
tem. The EMDP baseline is the least efficient system since it needs to analysis found that some users were not assigned to any user groups.
predict the missing data first, and then it combines the ratings obtained Accordingly, the ROT system only generated a default rating for an
through a user-based CF method and an item-based CF method. For item (i.e., the average rating of that item) with respect to these users.
other training/test splitting ratios, we obtained similar experimental re- Nevertheless, these recommendations may lead to big-error predic-
sults. For the reason of brevity, we only report the result under the con- tions. Since the metric of RMSE is sensitive to big-error predictions,
dition x = 0.9. the RMSE score achieved by the ROT system is not as good as those
achieved by other state-of-the-art systems. We will explore other
4.3.6. Comparison with the state-of-the-art methods methods (e.g., default user profiling) to specifically deal with the cold-
To further evaluate the performance of the proposed ROT system, we start problem so that new users can also be classified to some user inter-
adopted a similar experimentation procedure and data set as reported est groups. However, this line of research will be conducted as part of
in the literature [22]. Table 6 shows the comparative performance of our future work. As a whole, we compare the performance of the pro-
ROT and other state-of-the-art recommender systems reported in the posed ROT system with quite a number of state-of-the-art recommend-
literature [22]. For the reason of limited space, we only show the exper- er systems based on two different benchmark data sets. The ROT system
imental result based on MAE. According to Table 6, ROT outperforms all significantly outperforms state-of-the-art systems based on the
state-of-the-art systems under different configurations. In particular, MovieLens data set. As for the Netflix data set, the results are mixed.
given both sparse training data (e.g., ML100 and Given5) and relatively The ROT system outperforms state-of-the-art systems in terms of

Fig. 6. Comparison of recommendation time. Fig. 7. Comparative MAE performance of ROT based on the Netflix data set.
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 61

6. Conclusions

With the rapid growth of “Web of Things” applications, there is a


pressing need to develop an effective and efficient method for the dis-
covery and selection of smart things on the Web. This paper proposes
a recommender system-based solution to address WoT discovery and
selection, and hence to improve the situation awareness of WoT applica-
tions. Since the recommendation space of WoT is typically sparse, and
computational efficiency as well as users' trusts on recommendations
are also the primary concerns, the main contribution of our research
work is the design of a novel typicality-based recommendation method
that addresses the aforementioned critical issues for WoT recommenda-
tions. As the proposed recommendation method is underpinned by the
“object typicality” principle empirically verified in the field of cognitive
psychology, it tends to produce less big-error recommendations, and
hence it promotes users' trusts on WoT applications. In addition, since
Fig. 8. Comparative RMSE performance of ROT based on the Netflix data set.
the proposed method exploits data generalization by operating at the
item group and the user group level during recommendation time, it al-
leviates the problems of data sparsity and computational efficiency for
MAE, but it only achieves a comparable performance with state-of-the- large-scale WoT recommendations.
art systems in terms of RMSE. Based on the benchmark MovieLens data set, our experimental re-
sults confirm that the proposed ROT prototype system significantly out-
performs all the baseline recommender systems even under the
5. Discussions condition of sparse training data. Moreover, the ROT system is more ef-
ficient than other baseline recommender systems and it produces the
The proposed typicality-based recommendation method is novel least big-error recommendations. Based on the Netflix data set that sim-
since it can take into account the way how people evaluate objects ulates a large WoT recommendation space, the ROT system significantly
with reference to a concept (e.g., a user's specific interest). More specif- outperforms state-of-the-art recommender systems in terms of MAE,
ically, the proposed recommendation method is underpinned by the and it achieves comparable performance with that of state-of-the-art
principle of object typicality empirically evaluated in the field of cogni- recommender systems in terms of RMSE. As a whole, our experimental
tive psychology. The proposed method estimates the specific interest results reveal that the proposed ROT prototype system has great poten-
of a user in terms of the typicality of the user within a particular user in- tial to support large-scale WoT discovery and recommendation. To the
terest group and the typicality of items that the user group is most inter- best of our knowledge, it is the first successful research work which in-
ested in. It addresses the issues of data sparsity and computational volves the design and development of a cognitively motivated recom-
efficiency pertaining to the WoT recommendation environment by mender system for WoT recommendations. The business implication
recommending things at the level of item groups and user groups in- of our research work is that the proposed recommendation method
stead of directly evaluating individual items and users on the fly. can enhance the situation awareness of WoT applications which facili-
These are the reasons why the proposed ROT system outperforms tate the reuse of enterprise resources and the interoperability among
(e.g., in terms of MAE and computational time) all the baseline systems enterprises.
that directly operate at individual item and user level under the condi- For our current experiments, only keyword-based properties (e.g., the
tion of sparse training data. textual descriptions of movies) have been explored to construct item
For example, for the MovieLens-based experiments, the ROT system groups. Future work will explore a richer model of item properties to en-
predicted a rating of 5 for the movie “Usual Suspects, The (1995)” hance the effectiveness of the proposed ROT system. Another limitation of
(movie id 12) and the movie “Pulp Fiction (1994)” (movie id 56) with our current work is that the number of item groups identified by the ROT
respect to the user 99 (a user with benchmark id 99). The user 99 is system may not be optimal. More sophisticated methods such as genetic
very typical (with a typicality degree of 0.9) for the user interest algorithms will be explored to identify optimal or near optimal clusters of
group “Crime Movies”. Besides, the typicality degrees of “Usual Sus- items. In addition, alternative clustering algorithms and similarity func-
pects, The (1995)” and “Pulp Fiction (1994)” for the item group tions will be applied to bootstrap the performance of item group and
“Crime Movies” are 1.0 and 0.9517, respectively. Since the user 99 is typ- user group construction. A hybrid method will be examined to tackle
ical for the user interest group “crime movie”, and the two movies are the cold-start problem such that new users of the system can also be
the typical items that the user group likes, the ROT system generates assigned to some user groups. A comparative study about the recommen-
high ratings for these movies according to the principle of object typical- dation performance of ROT and that of economic choice models will be
ity. In fact, the predictions provided by the ROT system are exactly the performed. Finally, we will explore Big Data Analytics (e.g., MapReduce)
same as user 99's actual ratings for the respective movies. to support parallel incremental clustering of item groups and user groups
Basically, all the research questions have been answered through the to further improve the computational efficiency of the ROT system for
two series of benchmark experiments. The number of item groups K has real-time WoT recommendations.
an impact on the performance of the ROT prototype system. The empir-
ical parameter tuning method that we adopted can identify a reasonable Acknowledgments
range of K values for the ROT system. Our experiments also show that
the ROT system is more effective (e.g., producing the least big-errors) The work described in this paper was supported by a grant from
than other baseline systems given sparse training data. Moreover, it is the Research Grants Council of the Hong Kong Special Administra-
more efficient than other baseline systems because it mainly operates tive Region, China (project no. CityU 115910), City University of
at the item group and user group level during the recommendation Hong Kong (project no. 7008138), the Shenzhen Municipal Science
time. Finally, the ROT system significantly outperforms other state-of- and Technology R&D Funding — Basic Research Program (project no.
the-art recommender systems in terms of MAE based on both the JCYJ20130401145617281), the Shenzhen Research Institute, City University
MovieLens and the Netflix benchmark data sets. of Hong Kong, National Natural Science Foundation of China (project no.
62 Y. Cai et al. / Decision Support Systems 63 (2014) 52–63

61300137), the Guangdong Natural Science Foundation of China [27] Prem Melville, Raymod J. Mooney, Ramadass Nagarajan, Content-boosted collabora-
(project no. S2011040002222 and S2013010013836), the Funda- tive filtering for improved recommendations, Proceedings of the Eighteenth National
Conference on Artificial Intelligence, American Association for Artificial Intelligence,
mental Research Funds for the Central Universities, SCUT (project Menlo Park, CA, USA, 2002, pp. 187–192.
no. 2012ZM0077). [28] Raymond J. Mooney, Loriene Roy, Content-based book recommending using learn-
ing for text categorization, DL'00: Proceedings of the Fifth ACM Conference on Dig-
ital Libraries, ACM, New York, NY, USA, 2000, pp. 195–204.
[29] Gregory L. Murphy, Gregory L. Murphy, The Big Book of Concepts, MIT Press,
References 2002.
[1] G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: [30] Nathan Srebro Nati, Tommi Jaakkola, Weighted low-rank approximations, In 20th
a survey of the state-of-the-art and possible extensions, IEEE Transactions on International Conference on Machine Learning, AAAI Press, 2003, pp. 720–727.
Knowledge and Data Engineering 17 (6) (2005) 734–749. [31] A. Paterek, Improving regularized singular value decomposition for collabora-
[2] Marko Balabanović, Yoav Shoham, Fab: content-based, collaborative recommenda- tive filtering, Proceedings of the ACM SIGKDD Cup and Workshop, ACM, 2007,
tion, Communications of the ACM 40 (3) (1997) 66–72. pp. 39–42.
[3] Lawrence W. Barsalou, Ideals, central tendency, and frequency of instantiation as [32] M.J. Pazzani, D. Billsus, Content-based recommendation systems, The Adaptive
determinants of graded structure in categories, Journal of Experimental Psychology: Web: Methods and Strategies of Web Personalization, 2007. 325–341.
Learning, Memory, and Cognition 11 (4) (October 1985) 629–654. [33] Michael Pazzani, Daniel Billsus, Learning and revising user profiles: the identifica-
[4] Lawrence W. Barsalou, Cognitive Psychology: An Overview for Cognitive Scientists, tion of interesting web sites, Machine Learning 27 (3) (1997) 313–331.
Lawrence Erlbaum As. Pu, 1992. [34] M. Rifqi, Constructing prototypes from large databases, Proceedings of the
[5] Yi Cai, H.F. Leung, Multi-prototype concept and object typicality in ontology, Pro- Information Processing and Management of Uncertainty Conference, 1996,
ceedings of the 21st International Florida Artificial Intelligence Research Society pp. 301–306.
Conference, AAAI Press, 2008, pp. 470–475. [35] Simone Santini, Ramesh Jain, Similarity matching, Proceedings of the Second Asian
[6] Jean-Pierre Desclés, Anca Pascu, Logic of determination of objects: the meaning of Conference on Computer Vision, 1995, pp. 571–580.
variable in quantification, International Journal of Artificial Intelligence Tools 15 [36] Badrul Sarwar, George Karypis, Joseph Konstan, John Reidl, Item-based collaborative
(6) (2006) 1041–1052. filtering recommendation algorithms, Proceedings of the 10th International World
[7] Mukund Deshpande, George Karypis, Item-based top-n recommendation algo- Wide Web Conference, ACM, New York, NY, USA, 2001, pp. 285–295.
rithms, ACM Transactions on Information Systems 22 (1) (2004) 143–177. [37] Ian M. Soboroff, Charles K. Nicholas, Combining content and collaboration in text fil-
[8] Carolina Fortuna, Matevz Vucnik, Blaz Fortuna, Klemen Kenda, Alexandra Moraru, tering, In Proceedings of the IJCAI'99 Workshop on Machine Learning for Informa-
Dunja Mladenic, Towards building a global oracle: a physical mashup using artificial tion Filtering, 1999, pp. 86–91.
intelligence technology, in: Simon Mayer, Dominique Guinard, Erik Wilde (Eds.), [38] D. Song, R.Y.K. Lau, P.D. Bruza, K.F. Wong, D.Y. Chen, An adaptive information agent
Third International Workshop on the Web of Things, WoT'12, Newcastle, United for document title classification and filtering in document-intensive domains, Deci-
Kingdom, June 19, 2012, ACM, 2012. sion Support Systems 44 (1) (2008) 251–265.
[9] K.M. Galotti, K.M. Galotti, Cognitive Psychology In and Out of the Laboratory, third [39] Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, Zhong Su, Arnetminer: extraction
edition, Wadsworth, Belmont, CA, 2004. and mining of academic social networks, Proceedings of the 14th ACM SIGKDD Inter-
[10] Zeno Gantner, Steffen Rendle, Christoph Freudenthaler, Lars Schmidt-Thieme, national Conference on Knowledge Discovery and Data Mining, ACM, New York, NY,
Mymedialite: a free recommender system library, Proceedings of the Fifth ACM USA, 2008, pp. 990–998.
Conference on Recommender Systems, 2011, pp. 305–308. [40] Loren Terveen, Will Hill, Brian Amento, David McDonald, Josh Creter, Phoaks: a
[11] Dominique Guinard, Mathias Fischer, Vlad Trifa, Sharing using social networks in a system for sharing recommendations, Communications of the ACM 40 (3) (1997)
composable Web of Things, Proceedings of the 8th IEEE International Conference 59–62.
on Pervasive Computing and Communications, IEEE, 2010, pp. 702–707. [41] J.P. Ueberla, An extended clustering algorithm for statistical language models, IEEE
[12] Jing He, Yanchun Zhang, Guangyan Huang, Jinli Cao, A smart Web service based Transactions on Speech and Audio Processing 4 (4) (1996) 313–316.
on the context of things, ACM Transactions on Internet Technology (TOIT) 11 [42] W. Vanpaemel, G. Storms, B. Ons, A varying abstraction model for categorization,
(3) (January 2012)(Article 13). Proceedings of the 27th Annual Cognitive Science Conference, Lawrence Erlbaum,
[13] Jonathan L. Herlocker, Joseph A. Konstan, Al Borchers, John Riedl, An algorithmic Mahwah, NJ, 2005, pp. 2277–2282.
framework for performing collaborative filtering, Proceedings of the 22nd Annual [43] Jun Wang, Arjen P. de Vries, Marcel J.T. Reinders, Unifying user-based and
International ACM SIGIR Conference on Research and Development in Information item-based collaborative filtering approaches by similarity fusion, Proceedings of
Retrieval, ACM Press, New York, NY, USA, 1999, pp. 230–237. the 29th Annual International ACM SIGIR Conference on Research and Development
[14] Ming Hua, Jian Pei, Ada W. Fu, Xuemin Lin, Ho-Fung Leung, Top-k typicality queries in Information Retrieval, ACM, New York, NY, USA, 2006, pp. 501–508.
and efficient query answering methods on large databases, The VLDB Journal 18 (3) [44] Rui Xu, D. Wunsch, Survey of clustering algorithms, IEEE Transactions on Neural
(2009) 809–835. Networks 16 (3) (2005) 645–678.
[15] Mohsen Jamali, Martin Ester, A matrix factorization technique with trust propagation [45] Xu. Yunhong, Xitong Guo, Jinxing Hao, Jian Ma, Raymond Y.K. Lau, Wei Xu,
for recommendation in social networks, Proceedings of the Fourth ACM Conference Combining social network and semantic concept analysis for personalized aca-
on Recommender Systems, RecSys'10, ACM, New York, NY, USA, 2010, pp. 135–142. demic researcher recommendation, Decision Support Systems 54 (1) (2012)
[16] Heung-Nam Kim, Inay Ha, Kee-Sung Lee, GeunSik Jo, Abdulmotaleb El-Saddik, Col- 564–573.
laborative user modeling for enhanced content filtering in recommender systems, [46] Gui-Rong Xue, Chenxi Lin, Qiang Yang, WenSi Xi, Hua-Jun Zeng, Yong Yu, Zheng
Decision Support Systems 51 (4) (2011) 772–781. Chen, Scalable collaborative filtering using cluster-based smoothing, Proceed-
[17] Joseph A. Konstan, Bradley N. Miller, David Maltz, Jonathan L. Herlocker, Lee R. ings of the 28th Annual International ACM SIGIR Conference on Research and
Gordon, John Riedl, GroupLens: applying collaborative filtering to Usenet news, Development in Information Retrieval, ACM, New York, NY, USA, 2005,
Communications of the ACM 40 (3) (1997) 77–87. pp. 114–121.
[18] Yehuda Koren, Robert M. Bell, Chris Volinsky, Matrix factorization techniques for [47] Xin Yan, Raymond Y.K. Lau, Dawei Song, Xue Li, Jian Ma, Toward a semantic granu-
recommender systems, IEEE Computer 42 (8) (2009) 30–37. larity model for domain-specific information retrieval, ACM Transactions on Infor-
[19] Raymond Y.K. Lau, Peter D. Bruza, Dawei Song, Towards a belief-revision-based mation Systems 29 (3) (2011)(Article 15).
adaptive and context-sensitive information retrieval system, ACM Transactions on [48] L.A. Zadeh, Fuzzy sets, Journal of Information and Control 8 (1965) 338–353.
Information Systems 26 (2) (2008)(Article 8). [49] Zibin Zheng, Hao Ma, Michael R. Lyu, Irwin King, WSRec: a collaborative filtering
[20] Raymond Y.K. Lau, M. Tang, O. Wong, S. Milliner, Y. Chen, An evolutionary learning based web service recommender system, Proceedings of the 2009 IEEE International
approach for adaptive negotiation agents, International Journal of Intelligent Sys- Conference on Web Services, IEEE, 2009, pp. 437–444.
tems 21 (1) (2006) 41–72.
[21] Marie-Jeanne Lesot, Laure Mouillet, Bernadette Bouchon-Meunier, Fuzzy prototypes
based on typicality degrees, Proceedings of the 8th Fuzzy Days'04, volume 33, Spring-
er, 2004, pp. 125–138. Yi Cai received his Ph.D. degree in computer science from the Chinese University
[22] Bin Li, Qiang Yang, Xiangyang Xue, Can movies and books collaborate? Cross-domain of Hong Kong. He is currently an associate professor of the School of Software Engi-
collaborative filtering for sparsity reduction, in: Craig Boutilier (Ed.), Proceedings of neering at the South China University of Technology, Guangzhou, China. His research
the Twenty-first International Joint Conference on Artificial Intelligence, 2009, interests are recommendation system, personalized search, semantic web and data
pp. 2052–2057. mining.
[23] Cen Li, Gautam Biswas, Unsupervised learning with mixed numeric and nomi-
nal data, IEEE Transactions on Knowledge and Data Engineering 14 (2002)
673–690.
[24] Hao Ma, Irwin King, Michael R. Lyu, Effective missing data prediction for collabora- Raymond Y. K. Lau is an associate professor in the Department of Information Systems at
tive filtering, Proceedings of the 30th Annual International ACM SIGIR Conference the City University of Hong Kong. He received his Ph.D. degree from the Queensland
on Research and Development in Information Retrieval, ACM, New York, NY, USA, University of Technology in 2003. He is the author of over 100 refereed internation-
2007, pp. 39–46. al journals and conference papers. His research work has been published in re-
[25] Simon Mayer, Dominique Guinard, An extensible discovery service for smart things, nowned journals such as the MIS Quarterly, ACM Transactions on Information
in: Dominique Guinard, Vlad Trifa, Erik Wilde (Eds.), Second International Work- Systems, IEEE Transactions on Knowledge and Data Engineering, INFORMS Journal
shop on the Web of Things (WoT 2011), San Francisco, California, June 2011. on Computing, Journal of MIS, etc. His research interests include information re-
[26] D.L. Medin, E.E. Smith, Concepts and concept formation, Annual Review of Psychol- trieval, WoT, and social media analytics. He is a senior member of IEEE and ACM,
ogy 35 (1984) 113–138. respectively.
Y. Cai et al. / Decision Support Systems 63 (2014) 52–63 63

Stephen S.Y. Liao is a professor and the director of the Advanced Transportation Information Ho-fung Leung is currently a professor and the chairman of the Department of Com-
Systems (ATIS) Research Center at the City University of Hong Kong. He received a bachelor's puter Science and Engineering, at the Chinese University of Hong Kong, China. He has
degree from Beijing University and a Ph.D. from the University of Aix-Marseille III and Insti- been active in research on intelligent agents, multi-agent systems, game theory, and
tute of France Telecom. His research work has been published in journals including the MIS semantic web.
Quarterly, Decision Support Systems, Communications of the ACM, Information Science, etc.
His current research interests include the use of information technology in mobile commerce
applications and intelligent business systems, especially intelligent transportation systems. Louis C. K. Ma is the acting director of the School of Continuing and Professional
Education at the City University of Hong Kong. He obtained his MBA from the
University of Technology, Sydney, and his Ph.D. from the University of
Chunping Li received his Ph.D. degree from Darmstadt University of Technology, Warwick, UK. His areas of research interest are in IS policy and strategy, pro-
Germany in 1999. He is currently an associate professor in Tsinghua University, China. ject management, and e-markets. His publications have appeared in major IS
His research interests include machine learning, data mining, text classification and topic journals such as the Journal of MIS, Information & Management, and Decision Sup-
modeling. He has published more than eighty research papers in related fields. port Systems.

You might also like