You are on page 1of 5

International Journal of Application or Innovation in Engineering & Management (IJAIEM)

Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 12, December 2013 ISSN 2319 - 4847

A Mechanism for Handling Cold Start in Book Recommender System by Sharing Student Profile.
Umar Bashir Mir1 Mubbashir Nazir2 and Anuj Yadav3
3

M.Tech Scholar Department of Computer Science & Information Technology DIT University Dehradun Assistant Professor Department of Computer Science & Information Technology DIT University Dehradun

1,2

Abstract
In Recommendation system, one of the main concerns is the insufficient amount of information available about the new users or new items. In literature this problem is termed as Cold Start, which affects the actionability of recommendation results. In this paper we have devised a methodology for handling cold start problem in Recommendation System (CF). The Proposed approach provides solution to this problem by sharing user profile.

Keywords: Cold start, profile sharing, collaborative filtering, recommender systems.

1. INTRODUCTION
Recommender systems are software tools which represent user preferences for the purpose of suggesting items to purchase or examine. Recommender system play a vital role in ecommerce where recommendation is made by processing huge information in order to suggest items which best meet user needs and preferences. Many techniques have been proposed for providing recommendations based on techniques such as Collaborative Filtering (CF), Content Based (CB) etc. Among them CF technique is considered one of the most successful recommendation technique. This technique allows recommending an item to a user based on the user profiles that are closest to him [9, 28, 29].Although CF has been used extensively; it also has some disadvantages, such as the cold-start problem. Cold start problem occurs when insufficient rating data is available in the initial state. The recommender system lacks data to produce appropriate recommendations. It occurs because of the presence of new user or new item. New User Problem: When recommendations follow from user-to-user correlations based on the accumulation of ratings, a user with few ratings is difficult to categorize. New Item Problem: An item with few ratings cannot easily be recommended. This problem occurs particularly in domains with many new items (e.g. news articles) [10].

2. Related Work
Collaborative filtering (CF) systems use community ratings to determine recommendations for users. CF systems typically work by associating a user with a group of likeminded users, called neighbourhoods, then recommending items enjoyed by others in the group. The movie domain is probably the most widely studied application area for CF systems [9, 28, 29, 25, 17, 32], though the same methodology is applicable across a number of domains, including books, music [30], web pages [6], jokes [8], news, newsgroups [27, 14, 19], advertisements, and more. CF algorithms range from simple nearestneighbour methods [6, 27, 29] (Breese et al. [6] call these memory-based methods) to more elaborate model-based methods which first train or compile a model for example, a Bayesian network [6], a classifier [5, 20], a co-clustered matrix [31], or a truncated singular value decomposition matrix [5, 8,28]based on historical data, then use the trained model to generate recommendations. Hundreds of CF algorithms have been proposed and studied, including machine learning methods [5, 4, 15, 26], graph-based methods [1, 12], linear algebra based methods [5, 8, 28], and probabilistic methods [11, 23, 24]. A number of hybrid methods combining information filtering (IF) and CF techniques have also been proposed [3, 2, 4, 24, 18, 9], which are especially useful when data is sparse, for example in cold-start situations. In fact, in the extreme cold-start setting, CF methods cannot provide recommendations at all, and IF methods or hybrid IF/CF methods are needed. We base our methodology on sharing users (in our case student) profile in a controlled manner for generating recommendations. This shared profile is then treated as a separate user in a CF system.

3. THE PROPOSED APPROACH


The Proposed approach handles cold-start problem in recommendation systems (Book Recommendation System) by formulating User Profiles (students in our case) taking into consideration the various attributes of users such as class, stream, preferred authors (local or international) etc. The profile template will be common to all students. 3.1 Methodology Using appropriate methodologies is as important as identifying the right recommendations. The Proposed approach is formed of three main phases: Fetch, Process and Truncate. The Fetch phase is concerned with getting the required

Volume 2, Issue 12, December 2013

Page 318

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 12, December 2013 ISSN 2319 - 4847
parameters such as Global Student Profile or Shared Profile (Profile Token) for Process Phase. Process is the main engine where the actual recommendations are generated based on inputs from the Fetch phase. Truncate is involved with discarding of the profile-tokens used during the Fetch phase. Algorithm for Sharing Profile Fetch Phase Stage 1: Use Global student profile (say GSP) or Shared profile (Profile Token). Stage 2: If Profile Token is chosen from Stage 1 then enter Profile-Token. Process Phase Stage 3: Clone selected profile (whichever is selected from Fetch phase) as students personalized profile. Stage 4: Generate initial recommendations using cloned profile (students personalized profile from stage 3). Stage 5: Analyse students response to these recommendations and continuously update personalized profile for this student. Truncate Phase Stage 6: Discard Profile-token.

Fig 1: General Architecture of Profile Sharing Mechanism in www.xbooksindia.com Since attributes of the students are bound to change but the only difference is that some will be changing frequently like subjects, price sensitivity etc. and other will be changing less frequent like class stream etc. taking into consideration the various factors such as Semester timing, time-to-exams etc. the process of determining the price sensitivity should be based on the data captured on-line(Real Time Data) as well as off-line(Stored Data). 3.2 Algorithm for Price Sensitivity Stage 1: Identify unique students .

Volume 2, Issue 12, December 2013

Page 319

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 12, December 2013 ISSN 2319 - 4847
Stage 2: Stage 3: Stage 4: Stage 5: Stage 6: Store a list L of books displayed to the student. Store the price for each book in list L. Arrange books in list L according to their categories (like programming, Networking etc) Calculate Pmin (minimum book price) and Pmax (maximum book price) for each category in the list L For each book clicked by the user calculate the normalized book price as

where PPn is the normalized book price , Pp is the clicked book price, Pmin is the minimum book price in the list L and Pmax is the maximum book price in the list Stage 7: Calculate the price sensitivity of each student, as For a new user

where PSu is the price sensitivity of a student, NCu is the total no of products clicked by the student, PPn normalized book price. For a returning user

is the

where PSp is the previous price sensitivity of a returning student. Stage 8: Store the Price sensitivity of each student.

Figure 2 Price Sensitivity Score (tested on dummy data).

Figure 3 Segment Price Sensitivity (tested on dummy data)

Figure 4 Price Sensitivity score (tested on dummy data)

Volume 2, Issue 12, December 2013

Page 320

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 12, December 2013 ISSN 2319 - 4847

Fig 4: Flow Chart depicting holistic flow in the system

4. Conclusion and future work


Thus the research work presented will handle cold-start problem by sharing students profile in a controlled manner (without personal details like age, email etc.). The proposed work is implemented in an ecommerce website xbooksindia (www.xbooksindia.com) which is currently in its final phase. Also price sensitivity module has been successfully tested in xbooksindia (www.xbooksindia.com) website where it was observed that the systems overall performance was improved by good margin. This mechanism is specifically focused on students and books, with some modifications it can be extended to other domains as well.

References
[1] C. C. Aggarwal, J. L. Wolf, K.-L. Wu, and P. S. Yu. Horting hatches an egg: a new graph-theoretic approach to collaborative filtering. In ACM KDD, pages 201212, 1999.

Volume 2, Issue 12, December 2013

Page 321

International Journal of Application or Innovation in Engineering & Management (IJAIEM)


Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 12, December 2013 ISSN 2319 - 4847
[2] M. Balabanovic and Y. Shoham. Fab: content-based, collaborative recommendation.Communications of the ACM ,40(3):6672, 1997. [3] J. Basilico and T. Hofmann. A joint framework for collaborative and content filtering. In ACM SIGIR , 2004. [4] C. Basu, H. Hirsh, and W. W. Cohen. Recommendation as classification: Using social and content-based information in recommendation. In AAAI/IAAI, pages 714720, 1998. [5] D. Billsus and M. J. Pazzani. Learning collaborative information filters. In ICML, pages 4654, 1998. [6] J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative filtering. In UAI, pages 4352, 1998. [6] Shaghayegh Sahebi, William W. Cohen. Community-based recommendations: a solution to the Cold-Start Problem WOODSTOCK97 El Paso, Texas USA [7] M. Deshpande and G. Karypis. Item-based top-n recommendation algorithms. ACM TOIS, 22(1):143177, Jan 2004. [8] K. Goldberg, T. Roeder, D. Gupta, and C. Perkins. Eigentaste:A constant time collaborative filtering algorithm. Information Retrieval, 4(2):133151, 2001. [9] N. Good, J. B. Schafer, J. A. Konstan, A. Borchers, B. M.Sarwar, J. L. Herlocker, and J. Riedl. Combining collaborativ e iltering with personal agents for better recommendations.In AAAI/IAAI, pages 439446, 1999. [10] Thomas Hess, Recommendation Engines Seminar Paper, February 1,2009 [11] T. Hofmann and J. Puzicha. Latent class models for collaborative filtering. In IJCAI, pages 688693, 1999. [12] Z. Huang, H. Chen, and D. Zeng. Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering. ACM TOIS , 22(1):116142, Jan 2004. [12] Z. Huang, H. Chen, and D. Zeng. Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering. ACM TOIS, 22(1):116142, Jan 2004. [13] J. A. Konstan, B. N. Miller, D. Maltz, J. L. H. L. R. Gordon, and J. Riedl. GroupLens: applying collaborative filtering to Usenet news. Communications of the ACM, 40(3):7787, 1997. [14] B. Marlin. Collaborative filtering: A machine learning perspective. Masters thesis, University of Toronto, Computer Science Department. [15] S. McNee, S. Lam, J. Konstan, and J. Riedl. Interfaces for eliciting new user preferences in recommender systems. In UM, pages 178188, 2003. [16] P. Melville, R. Mooney, and R. Nagarajan. Content-boosted collaborative filtering. In AAAI, 2002. [17] B. N. Miller, J. T. Riedl, and J. A. Konstan. Experience with grouplens: Making usenet useful again. In USENIX annual technical conference, pages 219231, 1997. [18] K. Miyahara and M. J. Pazzani. Collaborative filtering with the simple bayesian classifier. In PRICAI, pages 679 689, 2000. [19] D. Pennock, E. Horvitz, S. Lawrence, and C. L. Giles. Collaborative filtering by personality diagnosis: A hybrid memory- and model-based approach. In UAI, pages 473480, 2000. [20] A. Popescul, L. Ungar, D. Pennock, and S. Lawrence. Probabilistic models for unified collaborative and contentbased recommendation in sparse-data environments. In UAI, pages 437444, 2001. [21] A. Rashid, I. Albert, D. Cosley, S. Lam, S. Mcnee, J. Konstan, and J. Riedl. Getting to know you: Learning new user preferences in recommender systems. In IUI, pages 127134, 2002. [22] J. Rennie and N. Srebro. Fast maximum margin matrix factorization for collaborative prediction. In ICML , 2005. [23] P. Resnick, N. Iacovou, M. Suchak, P. Bergstorm, and J. Riedl. GroupLens: An Open Architecture for Collaborative Filtering of Netnews. In ACM CSCW , pages 175186, 1994. [24] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Application of dimensionality reduction in recommender systemsa case study. In ACM WebKDD Workshop, 2000. [25] B. M. Sarwar, G. Karypis, J. A. Konstan, and J. Reidl. Item-based collaborative filtering recommendation algori thms. In WWW, pages 285295, 2001. [26] U. Shardanand and P. Maes. Social information filtering:Algorithms for automating word of mouth. In CHI, 1995. [27] L. Ungar and D. Foster. Clustering methods for collaborative filtering. In Workshop on Recommendation Systems at AAAI, 1998. [28] M. R. W. Hill, L. Stead and G. Furnas. Recommending and evaluating choices in a virtual community of use. In ACM CHI, pages 194201, 1995.

Volume 2, Issue 12, December 2013

Page 322

You might also like