Professional Documents
Culture Documents
net/publication/221248089
CITATIONS READS
7 58
1 author:
Asbjørn Følstad
SINTEF
126 PUBLICATIONS 4,336 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Asbjørn Følstad on 28 May 2015.
"Why include artists you cannot listen to?" "Annoying with songs in the list that you cannot listen to"
Fair
match "The house icon is usually front page, slightly "Difficult to understand the house-icon of the player leading to the
misguiding, but ok if it is there. Somewhat discreet!" artist's own page. Interpreted as link to the Urørt main page."
"Name of the song or the artist? Doubled up?" "Difficult to differentiate between song name and artist name"
Good
match "The player opens in a separate window. Would prefer a "Annoying that the player opens in a separate window. Wish the
player embedded in the artist’s page" player to be embedded at the artist’s own page, as MySpace"
After the matching, the following analyses were conducted: Table 3: Distribution of discarded vs. sustained individually
New usability issues predicted by the group (H1): Plenary items recorded items, across severity categories.
not matching any individual’s items were regarded as new; Severity Modified or
plenary items matching individual’s items were regarded as Discarded items
(individual unchanged items
sustained. The proportion of new items to the totality of plenary ratings) Count % Count %
items was calculated. The severity ratings of new vs. sustained
items were compared. Cosmetic 25 58 36 27
The groups’ discarding or modification of usability issues Serious 9 21 52 39
predicted by individuals (H2): Individual items not matching Critical 3 7 28 21
any plenary items were regarded as discarded; individual items
associated with the categories Weak match and Fair match were Not rated 6 14 16 12
regarded as modified. The proportions and severity ratings of Sum 43 100 132 100
discarded, modified and unchanged items were calculated.
Changes in severity rating between individuals and group (H3):
The severity rating of each sustained plenary item was compared Results of relevance to H3: Forty-four of the plenary items had
with the average severity rating of its associated items recorded been rated both for severity and associated with one or more
by individual evaluators. Average severity ratings were individually recorded items rated for severity. Of these plenary
calculated upon assigning the following values to each severity items, 21 had a severity rating identical to the average severity
level: Cosmetic=1, Serious=2, Critical=3. score of their individually recorded counterparts, 16 had higher
severity ratings and 7 had lower. The severity ratings of plenary
items vs. individual’s items were compared through a Wilcoxon
5. RESULTS signed rank test. No significant difference between the groups
The individual evaluators’ notes contained a total of 175 items was found (Z=1.40; p(two tailed)=0.16).
(an additional four items were discarded due to interpretational
issues). The mean number of items for each individual evaluator
was 12.5 (SD=5.2). Thirty-six percent of the individually 6. DISCUSSION
recorded items were classified as cosmetic, 35 percent serious The discussion is structured according to the three research
and 18 percent critical. Eleven percent of the individual hypotheses and is concluded by suggestions for refined
evaluators’ items were not rated for severity. hypotheses and further work.
The plenary discussions resulted in 76 items. Thirteen percent of 6.1 New usability issues
the plenary recorded items were classified as cosmetic, 38 One fourth of the items of the plenary records were interpreted
percent serious and 21 percent critical. Twenty eight percent of as new, resulting from the group discussions. These new items
the plenary recorded items were not rated for severity. were not found to differ in severity from the sustained items,
Results of relevance to H1: Twenty-five percent of the plenary implying that the usability issues generated in the group
recorded items were not found to match any individually discussions were judged by the evaluators to be of equal
recorded item, and thus regarded as new. The severity ratings of important to the issues brought into the discussion by one or
new vs. sustained items were compared through a Fischer Exact more of the group’s individuals.
test. Items not rated for severity were excluded from the test. No H1 is sustained. Group discussions indeed seem to enable the
significant difference was found (n=55; p(two tailed)=0.19). identification of new usability issues. However, the great
Results of relevance to H2: The distribution of individually majority of the issues already being identified by the individuals
recorded items as discarded, modified, or unchanged is of the groups, the importance of the group discussions as such
presented in Table 2. seemed to be a fairly modest source of new issues.