A New Instance-Based Label Ranking Approach Using the Mallows Model
Weiwei Cheng and Eyke Hüllermeier
Mathematics and Computer Science, University of Marburg, Germany
{cheng,eyke}@mathematik.uni-marburg.de
Abstract
In this paper, we introduce a new instance-based approach to the label ranking problem. This approach is based on a probability model on rankings which is known as the Mallows model in statistics. Probabilistic modeling provides the basis for a theoretically sound prediction procedure in the form of maximum likelihood estimation. Moreover, it allows for complementing predictions by diverse types of statistical information, for example regarding the reliability of an estimation. Empirical experiments show that our approach is competitive to state-of-the-art methods for label ranking and performs quite well even in the case of incomplete ranking information.
Keywords
Instance-based learning, Label ranking, Classification, Maximum likelihood estimation
1 Introduction
The topic of learning preferences has attracted increasing attention in the recent machine learning literature [1]. Label ranking, a particular preference learning scenario, studies the problem of learning a mapping from instances to rankings over a finite number of predefined labels. It can be considered as a natural generalization of the conventional classification problem, where only a single label is requested instead of a ranking of all labels.

Various approaches for label ranking have been proposed in recent years. Typically, these are extensions of learning algorithms used in binary classification problems. Ranking by pairwise comparison (RPC) is a natural extension of pairwise classification, in which binary preference models are learned for each pair of labels, and the predictions of these models are combined into a ranking of all labels [1]. Two other approaches, constraint classification (CC) and log-linear models for label ranking (LL), seek to learn linear utility functions for each individual label instead of preference predicates for pairs of labels [2,3].

In this paper, we are interested in an alternative to model-based approaches, namely the use of an instance-based approach. Instance-based or case-based learning algorithms have been applied successfully in various fields, such as machine learning and pattern recognition, for a long time [4]. These algorithms simply store the training data, or at least a selection thereof, and defer the processing of this data until an estimation for a new instance is requested, a property distinguishing them from typical model-based approaches. Instance-based approaches therefore have a number of potential advantages, especially in the context of the label ranking problem.

As a particular advantage of delayed processing, these learning methods may estimate the target function locally instead of inducing a global prediction model for the entire input domain (instance space) $\mathcal{X}$. Predictions are typically obtained using only a small, locally restricted subset of the entire training data, namely those examples that are close to the query $x \in \mathcal{X}$ (hence $\mathcal{X}$ must be endowed with a distance measure). These examples are then aggregated in a reasonable way. As aggregating a finite set of objects from an output space $\Omega$ is often much simpler than representing a complete $\mathcal{X} \to \Omega$ mapping in an explicit way, instance-based methods are especially appealing if $\Omega$ has a complex structure.

In label ranking, $\Omega$ corresponds to the set of all rankings of an underlying label set $\mathcal{L}$. To represent an $\Omega$-valued mapping, the aforementioned model-based approaches encode this mapping in terms of conventional binary models, either by a large set of such models in the original label space $\mathcal{L}$ (RPC), or by a single binary model in an expanded, high-dimensional space (CC, LL). Since for instance-based methods, there is no need to represent an $\mathcal{X} \to \Omega$ mapping explicitly, such methods can operate on the original target space $\Omega$ directly.

The paper is organized as follows: In Section 2, we introduce the problem of label ranking in a more formal way. The core idea of our instance-based approach to label ranking, namely maximum likelihood estimation based on a special probability model for rankings, is discussed in Section 4. The model itself is introduced beforehand in Section 3. Section 5 is devoted to experimental results. The paper ends with concluding remarks in Section 6.
2 Label Ranking
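Label ranking can be seen as an extension of the conventional setting of classification. Roughly speaking, the former is obtained from the latter through replacing single class labels by complete label rankings. So, instead of associating every instance $x$ from an instance space $\mathcal{X}$ with one among a finite set of class labels $\mathcal{L} = \{\lambda_1, \ldots, \lambda_n\}$, we now associate $x$ with a total order of the class labels, that is, a complete, transitive, and asymmetric relation $\succ_x$ on $\mathcal{L}$, where $\lambda_i \succ_x \lambda_j$ indicates that $\lambda_i$ precedes $\lambda_j$ in the ranking associated with $x$. It follows that a ranking can be considered as a special type of preference relation, and therefore we shall also say that $\lambda_i \succ_x \lambda_j$ indicates that $\lambda_i$ is preferred to $\lambda_j$ given the instance $x$. To illustrate, suppose that instances are students (characterized by attributes such as sex, age, and major subjects in secondary school) and $\succ$ is a preference relation on a fixed set of study fields such as Math, CS, Physics.

Formally, a ranking $\succ_x$ can be identified with a permutation $\pi_x$ of the set $\{1, \ldots, n\}$. It is convenient to define $\pi_x$ such that $\pi_x(i) = \pi_x(\lambda_i)$ is the position of $\lambda_i$ in the ranking. This permutation encodes the (ground truth) ranking:

\[ \lambda_{\pi_x^{-1}(1)} \succ_x \lambda_{\pi_x^{-1}(2)} \succ_x \cdots \succ_x \lambda_{\pi_x^{-1}(n)} , \]

where $\pi_x^{-1}(j)$ is the index of the label at position $j$ in the ranking. The class of permutations of $\{1, \ldots, n\}$ (the symmetric group of degree $n$) is denoted by $\Omega$. By abuse of terminology, though justified in light of the above one-to-one correspondence, we refer to elements $\pi \in \Omega$ as both permutations and rankings.

In analogy with the classification setting, we do not assume that there exists a deterministic $\mathcal{X} \to \Omega$ mapping. Instead, every instance is associated with a probability distribution over $\Omega$. This means that, for each $x \in \mathcal{X}$, there exists a probability distribution $\Pr(\cdot \mid x)$ such that, for every $\pi \in \Omega$,

\[ \Pr(\pi \mid x) \tag{1} \]

is the probability that $\pi_x = \pi$.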
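To make the correspondence between a ranking and its permutation concrete, the following minimal sketch converts between the two representations described above: the list of label indices ordered from most to least preferred (that is, $\pi_x^{-1}(1), \ldots, \pi_x^{-1}(n)$) and the position vector $\pi_x$. The function names and the list-based encoding are assumptions made for illustration; they do not appear in the paper.

```python
def positions_from_order(order):
    """Map a ranking given as label indices, most preferred first (1-based),
    to the permutation pi with pi[i-1] = position of label i in the ranking."""
    pi = [0] * len(order)
    for position, label in enumerate(order, start=1):
        pi[label - 1] = position
    return pi


def order_from_positions(pi):
    """Inverse mapping: order[j-1] = index of the label at position j,
    i.e. the value pi^{-1}(j)."""
    order = [0] * len(pi)
    for label, position in enumerate(pi, start=1):
        order[position - 1] = label
    return order


# Illustrative labels: lambda_1 = Math, lambda_2 = CS, lambda_3 = Physics.
labels = ["Math", "CS", "Physics"]

# Hypothetical student preference: CS > Physics > Math.
order = [2, 3, 1]
pi = positions_from_order(order)          # Math is 3rd, CS is 1st, Physics is 2nd
ranked_names = [labels[i - 1] for i in order_from_positions(pi)]
```

Here `pi` plays the role of $\pi_x$ and `order` the role of $\pi_x^{-1}$, so applying one function after the other returns the original list.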
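The goal in label ranking is to learn a "label ranker" in the form of an $\mathcal{X} \to \Omega$ mapping. As training data, a label ranker uses a set of instances $x_k$, $k = 1, \ldots, m$, together with information about the associated rankings $\pi_{x_k}$. Ideally, complete rankings are given as training information. From a practical point of view, however, it is also important to allow for incomplete information in the form of a ranking

\[ \lambda_{\pi_x^{-1}(i_1)} \succ_x \lambda_{\pi_x^{-1}(i_2)} \succ_x \cdots \succ_x \lambda_{\pi_x^{-1}(i_k)} , \]

where $\{i_1, i_2, \ldots, i_k\}$ is a subset of the index set $\{1, \ldots, n\}$ such that $i_1 < \cdots < i_k$. For example, for an instance $x$, it might be known that $\lambda_2 \succ_x \lambda_1 \succ_x \lambda_5$, while no preference information is given about the labels $\lambda_3$ or $\lambda_4$.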
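As a small illustration of incomplete ranking information, the sketch below derives an incomplete ranking from a complete one by keeping only the labels at selected positions, and checks whether a complete ranking agrees with a given incomplete ranking. Rankings are encoded as lists of label indices, most preferred first; this encoding and the helper names `restrict` and `is_consistent` are assumptions made for illustration only.

```python
def restrict(order, positions):
    """Keep only the labels found at the given 1-based positions
    i_1 < ... < i_k of a complete ranking, preserving their order."""
    return [order[j - 1] for j in sorted(positions)]


def is_consistent(order, partial):
    """Return True if the complete ranking `order` ranks the labels of the
    incomplete ranking `partial` in the same relative order."""
    rank = {label: pos for pos, label in enumerate(order)}
    ranks = [rank[label] for label in partial]
    return all(a < b for a, b in zip(ranks, ranks[1:]))


# Hypothetical complete ranking: lambda_2 > lambda_3 > lambda_1 > lambda_4 > lambda_5.
complete = [2, 3, 1, 4, 5]

# Observing only positions 1, 3 and 5 yields lambda_2 > lambda_1 > lambda_5.
partial = restrict(complete, {1, 3, 5})

agrees = is_consistent(complete, partial)              # same relative order
disagrees = is_consistent([1, 2, 3, 4, 5], partial)    # ranks lambda_1 before lambda_2
```

Nothing is asserted about the labels missing from `partial`; a complete ranking may place them anywhere and still be consistent.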
To evaluate the predictive performance of a label ranker, a suitable loss function on $\Omega$ is needed. In the statistical literature, several distance measures for rankings have been proposed. One commonly used measure is the number of discordant pairs,

\[ D(\pi, \sigma) = \#\{ (i, j) \mid i < j,\ \pi(i) > \pi(j) \text{ and } \sigma(i) < \sigma(j) \} , \tag{2} \]

which is closely related to the Kendall's tau coefficient. In fact, the latter is a normalization of (2) to the interval $[-1, 1]$ that can be interpreted as a correlation measure (it assumes the value 1 if $\sigma = \pi$ and the value $-1$ if $\sigma$ is the reversal of $\pi$). Kendall's tau is a natural, intuitive, and easily interpretable measure [5]. We shall focus on (2) throughout the paper, even though other distance measures could of course be used.

A desirable property of any distance $D(\cdot)$ is its invariance toward a renumbering of the elements (renaming of labels). This property is equivalent to the right invariance of $D(\cdot)$, namely $D(\sigma\nu, \pi\nu) = D(\sigma, \pi)$ for all $\sigma, \pi, \nu \in \Omega$, where $\sigma\nu = \sigma \circ \nu$ denotes the permutation $i \mapsto \sigma(\nu(i))$. The distance (2) is right-invariant, and so are most other commonly used metrics on $\Omega$.
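The distance (2), its normalization to Kendall's tau, and the right-invariance property can be checked numerically with a short sketch. Permutations are encoded as position lists (entry $i$ holds $\pi(i)$), and the function names are hypothetical. As an assumption of this sketch, a pair of labels is counted as discordant whenever the two rankings order it oppositely, in either direction; this keeps the value symmetric in its arguments and puts a ranking and its reversal at the maximal distance $n(n-1)/2$, as the normalization to $[-1, 1]$ requires. The normalization $\tau = 1 - 4D / (n(n-1))$ is the standard one and is not stated explicitly in the text.

```python
from itertools import combinations


def discordant_pairs(pi, sigma):
    """Count label pairs {i, j} that pi and sigma order oppositely."""
    return sum(
        1
        for i, j in combinations(range(len(pi)), 2)
        if (pi[i] - pi[j]) * (sigma[i] - sigma[j]) < 0
    )


def kendall_tau(pi, sigma):
    """Normalize the discordant-pair count to [-1, 1] (requires n >= 2)."""
    n = len(pi)
    return 1.0 - 4.0 * discordant_pairs(pi, sigma) / (n * (n - 1))


def compose(sigma, nu):
    """Return sigma o nu, the permutation i -> sigma(nu(i)) (1-based values)."""
    return [sigma[k - 1] for k in nu]


pi = [1, 2, 3, 4]
sigma = [2, 1, 4, 3]       # swaps the first two and the last two labels
reversal = [4, 3, 2, 1]    # reversal of pi
nu = [3, 1, 4, 2]          # an arbitrary renumbering of the labels

d = discordant_pairs(pi, sigma)
d_renamed = discordant_pairs(compose(pi, nu), compose(sigma, nu))
```

In this example `d` and `d_renamed` coincide, which is what right invariance predicts for a renaming of labels by `nu`.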
3 The Mallows Model
So far, we did not make any assumptions about the probability measure (1) despite its existence. To become more concrete, we resort to a distance-based