Year Tasks Participants2006 Opinion Finding Task 14Open Task 52007 Opinion Finding Task 20Polarity Subtask 11Blog Distillation Task 9Table 3: Tasks run overthe first two years of the TREC Blogtrack.In the remainder of this paper, we present in details thetwo main tasks that have ran at the TREC Blog track. Wedescribe the tasks in details, as well as the most effectiveretrieval approaches that the participating groups have de-ployed. We provide insights on the performances of searchengines across a variety of topic categories, as well as howthe topic categories were affected by spam.
Opinion Finding
Many blogs are created by their authors as a mechanismfor self-expression encouraged by the freely accessible blogsoftware, communicating their opinions and thoughts onany topic of their choice. A study conducted in (Mishne &de Rijke 2006) shows that many queries received by blogsearch engines seem to be of an opinion, or perspective-finding nature, rather than fact-finding. The opinion findingtask is an articulation of an information need that aimsto uncover the public sentiment towards a given targetentity such as a product, an organisation or a location. Aretrieval engine allowing for an effective opinion findingmight naturally be used as a tool for supporting manybusiness-intelligence tasks such as brand monitoring,consumer-generatedviews and feedback analysis, and moregenerally media analysis. It can also help users make aninformedchoice before buyinga givenproduct,attendinganentertainment event, or taking a holiday in a given location.Several commercial blog search engines aim to allowusers to find out about the opinions and thoughts of otherpeople, who happily share their thoughts on the blogo-sphere. These thoughts range from anger at some products,politicians or organisations, to good reviews of products orappraisal of cultural events.In the Blog track, the opinion retrieval task involved lo-cating blog posts that express an opinion about a given tar-get (Ounis
et al.
2007). The target can be a “traditional”named entity, e.g. a name of a person, location, or organi-sation, but also a concept (such as a type of technology), aproduct name, or an event. The task can be summarised as
What do people think about X
,
X
being a target. The topicof the post is not required to be the same as the target. How-ever, for a post to be judged relevant, an opinion about thetargethadto be present in the post orone ofthe commentstothe post, as identified by the permalink. To create a realisticsetting where the topics are actual representations of real in-formationneeds, assessors selected queries froma querylogofa commercialblogsearchengine,andexpandedthemintofully-describedtopics by making a reasonable interpretationof the query. This process was used to generate 50 topics for
<top><num> Number: 930 </num><title> ikea </title><desc> Description:Find opinions on Ikea or its products.</desc><narr> Narrative:Recommendations to shop at Ikea are relevant opinions.Recommendations of Ikea products are relevant opinions.Pictures on an Ikea-related site that are not relatedto the store or its products are not relevant.</narr></top>
Figure 1: Blog track 2007, opinion retrieval task, topic 930.the 2006 Blog track and another50 topics for the 2007 Blogtrack. Figure 1 shows an example topic.The relevance assessment procedure had two levels (Ou-nis
et al.
2007; Macdonald, Ounis, & Soboroff 2008). Thefirst level assesses whether a given blog post, i.e. a perma-link, contains information about the target and is thereforerelevant. The second level assesses the opinionated natureof the blog post, if it was deemed relevant in the first assess-ment level. A workable definition of
subjective
or
opinion-ated
content was used. In particular, a post is assumed tohave a subjective contentif it contains an explicit expressionof opinion or sentiment about the target, showing a personalattitude of the writer. Rather than attempting to provide aformal definition, the human assessors were given a numberof examples,which illustrated the two levels of assessments.Given a topic and a blog post, assessors were asked to judgethe content of the blog posts. The following scale was usedfor the assessment:
0
Not relevant
. The post and its comments were exam-ined, and does not contain any information about the target,or refers to it only in passing.
1
Relevant
. The post or its comments contain informationaboutthe target, but do not express an opiniontowards it. Tobe assessed as “Relevant”, the information given about thetargetshouldbesubstantial enoughto be includedin a reportcompiled about this entity.If the post or its comments are not only on target, but alsocontain an explicit expression of opinion or sentiment aboutthe target, showing some personal attitude of the writer(s),thenthedocumenthadtobejudgedusingoneofthreelabels:
2
Negative opinionated
. Contains an explicit expressionof opinion or sentiment about the target, showing some per-sonal attitude of the writer(s), and the opinion expressed isexplicitly negative about, or against, the target.
3
Mixed opinionated
. Same as (2), but contains both pos-itive and negative opinions.
4
Positive opinionated
. Same as (2), but the opinion ex-pressed is explicitly positive about, or supporting,the target.Posts that are opinionated, but for which the opinion ex-pressed is ambiguous, mixed, or unclear, were judged sim-ply as “mixed” (3 in the scale).Following the TREC paradigm described in the previoussection, the relevance assessments were formed using the
Leave a Comment