Uploaded by rohananil

Abstract

This document describes our final submission for track 1 of KDD Cup 2011. We achieved a final RMSE of 23.5797 on the test set using an ensemble of Alternating Least Squares (ALS) and a Latent Feature Log-Linear (LFL) approach, which placed us 38th on the track 1 leaderboard. Our main contribution is the parallelization of LFL training using the joint SGD update by grouping strategy.

Contents

1 Notation
2 Alternating least squares based Matrix Factorization (ALS)
3 Parallelism for ALS training
    3.1 Alternating update and grouping strategy
4 Latent Feature log linear model
5 Parallelism for LFL training
    5.1 Joint SGD Update by grouping strategy
6 Results
7 Timing Information

1 Notation

True rating for user $u$ and item $i$ ($r_{u,i}$)
Predicted rating for user $u$ and item $i$ ($\hat{r}_{u,i}$)
Latent feature vector for user $u$ ($U_u$)
Latent feature vector for item $i$ ($I_i$)
Size of the feature vector, i.e. the number of latent factors ($k$)
Concatenated feature matrix for all users ($U$)
Concatenated feature matrix for all items ($I$)
Number of users
Number of items
Learning rate parameter ($\eta$)
Regularization parameter ($\lambda$)
Sigmoid on $x$ ($\sigma(x)$)

2 Alternating least squares based Matrix Factorization (ALS)

This method was first presented in [2]. The main differences compared to previously discussed methods are a) the update rule for $U_u$ or $I_i$ is the least squares solution and b) the regularization parameter is multiplied by the number of ratings for that user ($n_u$) or item ($n_i$).

Objective function:

$$E = \sum_{(u,i)} \left( r_{u,i} - U_u^T I_i \right)^2 + \lambda \left( \sum_u n_u \|U_u\|^2 + \sum_i n_i \|I_i\|^2 \right)$$

Least squares solution for $U_u$ and $I_i$:

$$\left( M_{I(u)} M_{I(u)}^T + \lambda n_u E \right) U_u = V_u$$

where $\lambda$ is the regularization parameter, $M_{I(u)}$ is the sub-matrix of $I$ whose columns are the items that user $u$ has rated, $E$ on the left-hand side is the identity matrix, and $V_u = M_{I(u)} R^T(u, I(u))$.

Optimization type: LS
Update rule: $U_u \leftarrow A_u^{-1} V_u$, where $A_u = M_{I(u)} M_{I(u)}^T + \lambda n_u E$
             $I_i \leftarrow B_i^{-1} Y_i$; the derivation is similar to that for $U_u$
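The least squares update for one user can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the code used for the submission: the function name `als_update_user`, the column-per-item layout of `I`, and the argument names are all hypothetical.

```python
import numpy as np

def als_update_user(I, rated_items, ratings, lam):
    """Solve (M M^T + lam * n_u * E) U_u = V_u for a single user.

    I           : (k, n_items) item feature matrix, one column per item (assumed layout)
    rated_items : indices of the items this user has rated
    ratings     : the user's ratings for those items, in the same order
    lam         : regularization parameter
    """
    M = I[:, rated_items]                          # k x n_u sub-matrix M_{I(u)}
    n_u = len(rated_items)                         # number of ratings by this user
    A = M @ M.T + lam * n_u * np.eye(I.shape[0])   # k x k system matrix A_u
    V = M @ np.asarray(ratings, dtype=float)       # V_u = M_{I(u)} R^T(u, I(u))
    return np.linalg.solve(A, V)                   # U_u = A_u^{-1} V_u
```

Solving the linear system directly avoids forming the explicit inverse of $A_u$; the item update would follow the same pattern with the roles of users and items swapped.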

3 Parallelism for ALS training

3.1 Alternating update and grouping strategy

In this scheme, the updates for U and I are decoupled. The U matrix is updated while fixing I, and vice versa (alternating). This allows us to exploit the inherent parallelism in the matrix updates: the matrix being updated is split into N groups and each group is updated independently.

[Figure: groups of U updated independently while I is held fixed.]
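A rough sketch of the grouping idea is shown below: with I held fixed, users are split into N groups and each group is solved independently. The helper names (`solve_user`, `update_group`, `update_users_in_groups`), the use of a thread pool, and the column-per-item layout of `I` are assumptions made for illustration; the source does not say how the groups were scheduled.

```python
from concurrent.futures import ThreadPoolExecutor
import numpy as np

def solve_user(I, items, ratings, lam):
    # Regularized least squares update for one user with I held fixed.
    M = I[:, items]
    A = M @ M.T + lam * len(items) * np.eye(I.shape[0])
    return np.linalg.solve(A, M @ ratings)

def update_group(I, group, user_items, user_ratings, lam):
    # A group only reads I and produces vectors for its own users,
    # so no group depends on the result of another.
    return {u: solve_user(I, user_items[u], user_ratings[u], lam) for u in group}

def update_users_in_groups(I, user_items, user_ratings, lam, n_groups):
    users = np.arange(len(user_items))
    groups = np.array_split(users, n_groups)       # N groups of user indices
    U = np.zeros((I.shape[0], len(users)))
    with ThreadPoolExecutor(max_workers=n_groups) as pool:
        futures = [pool.submit(update_group, I, g, user_items, user_ratings, lam)
                   for g in groups]
        for f in futures:
            for u, vec in f.result().items():
                U[:, u] = vec
    return U
```

Because every group reads the same fixed I, the result does not depend on the order in which the groups finish.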

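4 Latent Feature log linear model

In the LFL model [1] we restrict output ratings to the set $\{0, 10, 20, \ldots, 100\}$, whose elements $R^c$ correspond to the 11 classes $c \in \{0, \ldots, 10\}$, and learn latent features for each of the ratings. We fix $U^0$ and $I^0$ to zero, i.e. we keep class 0 as the base class.

Objective function:

$$E = \left( r_{u,i} - \frac{\sum_c R^c \exp\left( (U_u^c)^T I_i^c \right)}{Z} \right)^2 + \lambda \sum_c \left( \|U_u^c\|^2 + \|I_i^c\|^2 \right)$$

where $\lambda$ is the regularization parameter and $Z$ is the normalization term:

$$Z = \sum_c \exp\left( (U_u^c)^T I_i^c \right)$$

$$p(c \mid U^c, I^c) = \frac{\exp\left( (U_u^c)^T I_i^c \right)}{Z}$$

$$\hat{r}_{u,i} = \frac{\sum_c R^c \exp\left( (U_u^c)^T I_i^c \right)}{Z} = \sum_c R^c \, p(c \mid U^c, I^c)$$

Derivative with respect to each example, for each class $c$:

$$\frac{\partial}{\partial U_{uk}^c} \left( r_{u,i} - \hat{r}_{u,i} \right)^2 = -2 \left( r_{u,i} - \sum_{c'} R^{c'} p(c' \mid U^{c'}, I^{c'}) \right) p(c \mid U^c, I^c) \left( R^c - \sum_{c'} R^{c'} p(c' \mid U^{c'}, I^{c'}) \right) I_{ik}^c$$

$$\frac{\partial}{\partial I_{ik}^c} \left( r_{u,i} - \hat{r}_{u,i} \right)^2 = -2 \left( r_{u,i} - \sum_{c'} R^{c'} p(c' \mid U^{c'}, I^{c'}) \right) p(c \mid U^c, I^c) \left( R^c - \sum_{c'} R^{c'} p(c' \mid U^{c'}, I^{c'}) \right) U_{uk}^c$$

Optimization type: SGD
Update rule: $U_{uk}^c \leftarrow U_{uk}^c - \eta \left( \frac{\partial E}{\partial U_{uk}^c} + \lambda U_{uk}^c \right)$
             $I_{ik}^c \leftarrow I_{ik}^c - \eta \left( \frac{\partial E}{\partial I_{ik}^c} + \lambda I_{ik}^c \right)$

Here $\eta$ is the learning rate, and $\partial E / \partial U_{uk}^c$ and $\partial E / \partial I_{ik}^c$ denote the squared-error derivatives given above.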
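The prediction and the per-example gradient above can be sketched in NumPy as follows. This is an illustration under assumptions rather than the submitted implementation: the array layout (one row per class), the names `lfl_predict`, `lfl_grad_U`, `sgd_step_U`, and the arguments `eta` and `lam` are hypothetical.

```python
import numpy as np

RATINGS = np.arange(0, 101, 10, dtype=float)    # R^c for classes c = 0..10

def lfl_predict(Uu, Ii):
    """Uu, Ii: (n_classes, k) arrays; row c holds U_u^c and I_i^c."""
    scores = np.sum(Uu * Ii, axis=1)            # (U_u^c)^T I_i^c for each class
    scores = scores - scores.max()              # stabilises exp; p is unchanged
    p = np.exp(scores)
    p /= p.sum()                                # p(c | U^c, I^c)
    return p, float(RATINGS @ p)                # class probabilities, predicted rating

def lfl_grad_U(Uu, Ii, r):
    """Gradient of (r - r_hat)^2 with respect to every U_uk^c."""
    p, r_hat = lfl_predict(Uu, Ii)
    coeff = -2.0 * (r - r_hat) * p * (RATINGS - r_hat)   # one scalar per class
    return coeff[:, None] * Ii                  # multiply by I_ik^c

def sgd_step_U(Uu, Ii, r, eta, lam):
    """One SGD step on the user features for a single (u, i) example."""
    new_U = Uu - eta * (lfl_grad_U(Uu, Ii, r) + lam * Uu)
    new_U[0] = 0.0                              # class 0 stays the base class
    return new_U
```

The gradient with respect to the item features has the same form with `Uu` and `Ii` exchanged.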

5 Parallelism for LFL training

5.1 Joint SGD Update by grouping strategy

In this scheme, the SGD updates for U and I are parallelized by creating two disjoint sets of (u, i) pairs, as illustrated in the figure below. The scheme can be applied recursively to each disjoint set for further levels of parallelism. To create the disjoint sets we used the modulo operator to partition the (u, i) pairs. It turns out that on this dataset the modulo operator produces disjoint sets of almost equal size. One of the main advantages of this strategy over the alternating strategy is that the trained model is identical to the one obtained from sequential SGD training, whereas the alternating strategy produces a different model altogether.

[Figure: partition of the (u, i) pairs into disjoint sets; axes: Item, User.]
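One way to read the partitioning step is sketched below. The source only states that the modulo operator was used, so the details here are assumptions: the sketch takes both ids modulo 2, which yields two rounds, each holding two sets of (u, i) pairs that share neither a user nor an item. The function name `disjoint_sets` is hypothetical.

```python
import numpy as np

def disjoint_sets(users, items):
    """Split rating indices into two rounds of two mutually disjoint sets.

    users, items : integer id arrays of equal length, one entry per rating.
    Within a round the two sets have no user and no item in common, so
    their SGD updates touch different feature vectors.
    """
    ub = np.asarray(users) % 2                  # user parity
    ib = np.asarray(items) % 2                  # item parity
    idx = np.arange(len(ub))
    round_a = (idx[(ub == 0) & (ib == 0)], idx[(ub == 1) & (ib == 1)])
    round_b = (idx[(ub == 0) & (ib == 1)], idx[(ub == 1) & (ib == 0)])
    return round_a, round_b
```

Applying the same split again inside each set, for example on the ids divided by 2, would give the further levels of parallelism mentioned above.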

6 Results

The results below are from experiments in which both the training and validation sets were used during training. The ensemble coefficients were learned using linear regression on the validation set, with models trained on the training set.

Method (parameters)                       RMSE
ALS with validation set (1/-/200)         23.88
LFL with validation set (10/.0001/120)    23.87
Ensemble of LFL and ALS                   23.57

Table 4. Current Results on Test Set
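Learning ensemble coefficients by linear regression on held-out predictions can be sketched as follows. The function names, the intercept term, and the use of `numpy.linalg.lstsq` are assumptions for illustration; the source does not describe how the regression was set up.

```python
import numpy as np

def fit_ensemble(pred_als, pred_lfl, truth):
    """Least squares weights for two prediction vectors plus an intercept."""
    X = np.column_stack([pred_als, pred_lfl, np.ones(len(truth))])
    coef, *_ = np.linalg.lstsq(X, truth, rcond=None)
    return coef                                  # [w_als, w_lfl, bias]

def apply_ensemble(coef, pred_als, pred_lfl):
    return coef[0] * pred_als + coef[1] * pred_lfl + coef[2]

def rmse(pred, truth):
    return float(np.sqrt(np.mean((pred - truth) ** 2)))
```

In this setup the weights would be fitted on validation-set predictions and then applied to the test-set predictions of the same models.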

7 Timing Information

All of these runs used 8 cores on the same node. It takes around 250 seconds to load all the files into memory for track 1 on a single compute node. On vSMP the loading time is around 400 seconds.

Method (k)    Time per epoch (s)
ALS (200)     4000
LFL (120)     1200

Table 4. Run times on a single node

References

1. Menon, A.K., Elkan, C.: A log-linear model with latent features for dyadic prediction. In: IEEE International Conference on Data Mining (ICDM), Sydney, Australia (2010)
2. Zhou, Y., Wilkinson, D.M., Schreiber, R., Pan, R.: Large-Scale Parallel Collaborative Filtering for the Netflix Prize. In: AAIM (2008) 337-348
