Professional Documents
Culture Documents
Collaboration Recommendation On Academic Social Networks
Collaboration Recommendation On Academic Social Networks
1 Introduction
J. Trujillo et al. (Eds.): ER 2010 Workshops, LNCS 6413, pp. 190–199, 2010.
c Springer-Verlag Berlin Heidelberg 2010
Collaboration Recommendation on Academic Social Networks 191
In this context, the research area of Social Networks has emerged. Specifically,
the Social Network Analysis (SNA) is based on the assumption that the relation-
ship’s importance between interaction units is a central point to the evaluation
and the analysis of social interactions. The increasing interest in researching on
Social Networks was encouraged by the popularization of online Social Networks,
which are very interesting Web applications. Nowadays, this type of network is
commonly used, and each network connects millions of users. Examples of online
Social Networks include MySpace, Facebook, Hi5, Orkut, among others. Another
example of the Social Network’s concept application is a co-authorship Social
Network representing a scientific collaboration network.
In this “social growth of the web”, the number of studies involving Social Net-
work Analysis and the number of applications aiming to improve recommender
system have increased. This fact shall lead to advances in both areas and en-
courages the improvement or the development of new approaches to analyze
connections and recommend new ones. In this context, our paper presents an
innovative approach to recommend collaborations on the context of academic
Social Networks. The contributions of this paper can be summarized as follows:
we introduce the architecture for such approach and the metrics involved in rec-
ommending collaborations (Sect. 2) and we present the results of an initial case
study to validate our approach (Sect. 3).
2 Collaborations Recommendation
In this section, we give an overview of our approach to perform the recommenda-
tion of collaborations on the context of academic Social Networks, as illustrated
in Fig. 1. The first step consists on selecting the target user of the recommenda-
tion and the collaborative network to be used (i.e., a group of individuals from a
digital community such as DBLP1 ) (1). Then, the Social Network formed by se-
lected individuals (or a subgroup of them) is constructed and the weights of the
relationships among them are attributed (2). These weights are calculated based
on the information about the publications of the authors obtained from exist-
ing Digital Library (3). These weights indicate the level of “global cooperation”
between pairs of the researchers involved.
Indeed, the researchers’ profiles (profiles of the authors of the collaborative
network) to be used by the recommender system are constructed based on: the
information available about these authors in some digital library (4) and the
classification of the authors’ publications made with the aid of a research area
ontology (taxonomy) (5). Further details about this kind of ontology can be ob-
tained in [3]. Our approach considers the research areas in which the researchers
work to determine the “global correlation” index. The “global cooperation” and
the “global correlation” indexes between pairs of authors are then used (6 and 7)
as the basis to generate recommendations of authors with whom the user could
establish new collaborations. Finally, these recommendations are presented to
the user (8). The approach is detailed in the following subsections.
1
DBLP Computer Science Bibliography, http://www.informatik.uni-trier.de/˜ley/db
192 G.R. Lopes et al.
Digital
(4) Library
(7)
(5)
Research Areas
Researchers Profile
Research
areas
Ontology
(8)
|aj co authorship|
wtCa(ai →aj ) = (1)
|ai author|
where:
– wtCa(ai →aj ) corresponds to the weight of the collaboration based on the co-
author relationship (the weight is different according to the relation direc-
tion, i.e., the weight ai → aj is different from aj → ai );
– |aj co authorship| corresponds to the number of times that author aj has
co-authored a paper with ai ;
– |ai author| corresponds to the total number of papers of the author ai .
In other words, the higher the wtCa weight is, the more relevant is the relationship
between author aj to the author ai . The use of wtCa metric implies that there
Collaboration Recommendation on Academic Social Networks 193
is a graph with 0 or 2 links between two authors. The weights represent the
degree of collaboration in co-authorship between these authors. This metric is
an asymmetric variant of the Jaccard Coefficient and it has already been applied
in the context of Social Networks [4,5].
where |ai authorresearch area x | corresponds to the number of papers that author
ai published in the research area x, and |ai author| to the total number of pa-
pers by ai ’. Then, each area x has a corresponding weight that indicates the
contributions of the researcher in that area.
The weights indicate the grade of contributions of the researcher in each re-
search area, and they are used in the calculation of the global correlation between
pairs of authors. Vector Space Model (VSM) [6] is used in our work to perform
this computation. The VSM uses an n-dimensional space to represent the terms,
i.e. n corresponds to the number of distinct research areas. Each author profile
is represented by a research area’s vector, and the weights represent the vector’s
coordinates in the corresponding dimension. Based on the VSM, similarity is
calculated between pairs of authors and the index terms correspond to the re-
search areas of the authors. The weight assigned to each research area allows to
distinguish between the areas, and it is calculated according to their importance
to the author considered by using equation (2). These weights vary continuously
between 0 and 1. Values near to 1 correspond to more important research areas
while values near to 0 correspond to less important research areas. The VSM
principle is based on the inverse correlation between the distance (angle) among
term vectors in the space and the similarity between the information (in this case
the authors profile) that they represent. To calculate the similarity score, that
will represent the global correlation between two authors, the cosine is used as in
equation 3. The resultant value indicates the global correlation degree between
the two authors profiles (ai and aj ), where wRa (ai , x) represents the weights of
194 G.R. Lopes et al.
the research areas that composed the user’s profile, and n represents the total
number of research areas.
n
k=1 wRa (ai , xk ).wRa (aj , xk )
global correlation(ai , aj ) = (3)
n 2 n 2
k=1 (wRa (ai , xk )) . k=1 (wRa (aj , xk ))
2.3 Recommendation
Our approach to generate cooperation recommendations considers the analysis
of all indexes previously described. The relationship between the “global co-
operation” and the “global correlation” for each pair of authors establish the
necessity (or not) of having more research interaction between them. For this
kind of analysis, we establish degrees to represent the different ranges of val-
ues that are possible for these weights. The degrees are “high”, “medium” and
“low”. Based on this analysis, recommendation actions can be established to
indicate possible cooperations between pairs of authors. The combination be-
tween degrees and the interpretation of them can be observed in Table 1, where
the action of recommending a collaboration is indicated by the term “recom-
mend”. For instance, pairs of authors with high global correlation between them
but with low global cooperation must be recommended to intensify their coop-
eration. Moreover, pairs of authors with low global correlation between them
and high or medium global cooperation could also be recommended. The term
“ok” indicates the ideal relationship. The term “alert” indicates situations in
which the global cooperation and global correlation degrees are different but do
not characterize a recommendation action. For example, pairs of authors with
high or medium global cooperation that have lesser global correlation degrees
do not need to be recommended to cooperate, since the cooperation between
the authors is already happening in a more intense relation than their profiles
correlation suggests.
In the case study presented in Sect. 3, we presented an example of translation
of the ranges of numeric values for global cooperation and global correlation into
the degrees defined as “low”, “medium” and “high”.
3 Case Study
In order to analyze how the proposed approach performs, we consider a case
study from the InWeb (Brazilian National Institute of Science and Technology
Collaboration Recommendation on Academic Social Networks 195
for the Web)2 . The Institute is formed by 27 researchers and their students. All
researchers are professors in a Brazilian major education institution (UFMG,
UFRGS, UFAM and CEFET-MG) with graduate program in Computer Science.
For validating our metrics, we have implemented a tool to automatically gen-
erate a Social Network. This SN was built using information about authors
provided by the DBLP on January 21, 2009. It is important to notice that this
library is exported as an XML document. Instead of using the whole dataset, we
extracted from the library just the papers written by the considered researchers
and published in conference proceedings or in journals (as elements inproceed-
ings or article). This data gathering process summed up 677,345 authors; 692,431
conference proceedings papers and 432,663 journal articles. Such a subset was
chosen because this information is significantly important for representing the
co-author relationship between authors and, consequently, to determine the re-
search collaborations among them.
The actors (vertices or nodes) of the SN can be chosen at will. In here, they are
a subset of authors with scientific papers indexed by the DBLP. The relational
ties (linkages or edges) between actors are the relationships between pairs of
authors. These social ties represent the co-author relationships. The weights of
the linkages are determined by the equation 1 (presented in the Sect. 2). In that
equation, |ai author| corresponds to the total number of papers of the author ai ,
and it considers all papers to this author ai indexed at DBLP (papers published
in conferences proceedings or in journals), including those papers that are not
co-authored by authors in the selected SN (that can be a sub-network of the
global SN). These weights represent the “global cooperation” index between
researchers. The SN will be represented by a directional graph where the edges
have a direction associated, according to the equation previously defined. The
InWeb SN was automatically constructed using our developed system. The result
is presented in Fig. 2.
The recommendation method was implemented as presented in Sect. 2. The
profiles of the researchers (author profiles of the collaborative network) used
by the recommendation method were built based on information available about
these authors in the DBLP and on the classification of authors publications made
using an ontology of research areas. The ontology (taxonomy) for classifying au-
thors papers was proposed by Loh et al. [3]. This ontology uses a classification of
computer science areas similar to the ACM (Association for Computing Machin-
ery) classification3 and provides associated weights to the keywords according
to their relevance to each research area. In this case study, only the keywords
present on the publication’s title were used. The proposed approach considers
the research areas in which researchers work to determine a “global correlation”
index. The “global cooperation” and “global correlation” indexes between pairs
of authors are used as a base to generate recommendations for authors with
whom they should establish new collaborations, or that they must intensify col-
laborations. For the definition of recommendations, we have used the method
2
InWeb, http://www.inweb.org.br/
3
ACM Computing Classification, http://www.acm.org/about/class/ccs98-html
196 G.R. Lopes et al.
Legend:
UFAM
1.1 Altigran Soares da Silva
1.2 Edleno Silva de Moura
1.3 João M. B. Cavalcanti
UFRGS
2.1 Carlos A. Heuser
2.2 José Palazzo Moreira de Oliveira
2.3 Leandro Krug Wives
2.4 Renata de Matos Galante
2.5 Viviane Moreira Orengo
CEFET-MG
3.1 Cristina D. Murta
3.2 Evandrino G. Barros
3.3 Fabiano C. Botelho
UFMG
4.1 Adriano M. Pereira
4.2 Alberto H. F. Laender
4.3 Arnaldo de Albuquerque Araújo
4.4 Berthier A. Ribeiro-Neto
4.5 Clodoveu A. Davis
4.6 Dorgival Olavo Guedes Neto
4.7 Genaı́na Nunes Rodrigues
4.8 Gisele L. Pappa
4.9 Jussara M. Almeida
4.10 Marcos André Gonalves
4.11 Mirella Moura Moro
4.12 Nivio Ziviani
4.13 Raquel Oliveira Prates
4.14 Renato Ferreira
4.15 Virgı́lio A. F. Almeida
4.16 Wagner Meira Jr.
presented in Sect. 2.3. We have used a linear scale to translate the ranges of
numeric values for global cooperation and global correlation into the degrees
defined as “low”, “medium” and “high”. The “low”, “medium” and “high” de-
grees correspond, respectively, to values v in the ranges {v ∈ |0.0 < v ≤ 1/3},
{v ∈ |1/3 < v ≤ 2/3} and {v ∈ |2/3 < v ≤ 1.0}.
After the identification of the resulting degree, it is possible to establish recom-
mendation actions according to Table 1. As shown in that table, the recommen-
dation action will only happen when there is a “low” value of global cooperation
between pairs of authors and a “medium” or “high” value of global correla-
tion. Moreover, when the value of global cooperation is effectively null (zero), a
recommendation to initiate collaboration will be made to the authors. Indeed,
when the value of global cooperation is a “low” but not null, an intensification
of existing collaboration will be recommended.
An example of interface for presenting recommendations for the researcher
“José Palazzo Moreira de Oliveira” (InWeb member), can be seen in Fig. 3. In
the recommendation to “initiate collaboration”, the value of the global correla-
tion index obtained between the recommended research and the target researcher
is shown. When it is recommended to “intensify cooperation”, the ratio of global
cooperation and global correlation is presented. Furthermore, recommendations
are presented in ranked lists: in the “initiate collaboration” list, the researchers
recommended are in descending order of global correlation index, since the higher
the value of global correlation found, the greater the possibility of collaboration
be interesting to the target researcher; and in the “intensify cooperation” list,
the recommended researchers are presented in increasing order of ratio between
Collaboration Recommendation on Academic Social Networks 197
global cooperation and global correlation, so that the recommendations are pre-
sented in order of intensification necessity.
4 Related Works
5 Final Remarks
of the researchers associated to the InWeb project. In our case study, the Social
Network is constructed based on data available by the DBLP digital library. In
future works, we will provide more experiments to evaluate the approach aiming
to show the viability and applicability of our proposal in a real world application.
References
1. Smeaton, A.F., Callan, J.: Personalisation and recommender systems in digital
libraries. Int. J. on Digital Libraries 5, 299–308 (2005)
2. Perugini, S., Gonçalves, M.A., Fox, E.A.: Recommender systems research: A
connection-centric survey. J. Intell. Inf. Syst. 23, 107–143 (2004)
3. Loh, S., et al.: Constructing domain ontologies for indexing texts and creating
users’ profiles. In: Work. on Ontologies and Metamodeling in Software and Data
Engineering, Brazilian Symp. on Databases, UFSC, Florianópolis, pp. 72–82 (2006)
4. Aleman-Meza, B., et al.: Semantic analytics on social networks: experiences in
addressing the problem of conflict of interest detection. In: Intl. Conf. on World
Wide Web, pp. 407–416. ACM Press, New York (2006)
5. Mika, P.: Social networks and the semantic web. In: IEEE/WIC/ACM Intl. Conf.
on Web Intelligence, pp. 285–291. IEEE Press, New York (2004)
6. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval.
Inf. Process. Manage. 24, 513–523 (1988)
7. Ogata, H., Yano, Y., Furugori, N., Jin, Q.: Computer supported social networking
for augmenting cooperation. Comput. Supp. Coop. Work 10, 189–209 (2001)
8. Golbeck, J., Hendler, J.: Filmtrust: movie recommendations using trust in web-
based social networks. In: Consumer Communications and Networking Conf., pp.
282–286. IEEE Press, New York (2006)
9. Quercia, D., Capra, L.: Friendsensing: recommending friends using mobile phones.
In: ACM Conference on Recommender Systems, pp. 273–276. ACM Press, New
York (2009)
10. Karagiannis, T., Vojnovic, M.: Behavioral profiles for advanced email features. In:
Intl. Conf. on World Wide Web, pp. 711–720. ACM Press, New York (2009)
11. Chen, J., et al.: Make new friends, but keep the old: recommending people on
social networking sites. In: SIGCHI Conference on Human Factors in Computing
Systems, pp. 201–210. ACM Press, New York (2009)
12. Kautz, H., Selman, B., Shah, M.: Referral web: combining social networks and
collaborative filtering. Commun. ACM 40, 63–65 (1997)
13. McDonald, D.W.: Recommending collaboration with social networks: a compara-
tive evaluation. In: SIGCHI Conference on Human Factors in Computing Systems,
pp. 593–600. ACM Press, New York (2003)
14. Zaiane, O.R., Chen, J., Goebel, R.: Dbconnect: mining research community on
dblp data. In: WebKDD and SNA-KDD Work. on Web Mining and Social Network
Analysis, pp. 74–81. ACM Press, New York (2007)
15. Weng, S.S., Chang, H.L.: Using ontology network analysis for research document
recommendation. Expert Syst. Appl. 34, 1857–1869 (2008)