Professional Documents
Culture Documents
Verbal fluency tasks require spontaneous generation category or semantic fluency (producing words within a
of words in a specific amount of time, typically 30 or specific category such as animals or supermarket items),
60 s, with a predetermined set of criteria (e.g., producing have been used to assess and monitor executive function-
words in a semantic category such as animals). Verbal flu- ing and lexical access in a variety of neurogenic popula-
ency tasks, including phonemic or letter fluency (produc- tions (Alegret et al., 2018; Henry et al., 2004; Macoir
ing words beginning with a specified sound or letter) and et al., 2019; Östberg et al., 2005; Paek & Murray, 2021;
Rodrigues et al., 2015). Category fluency tasks in which
the participant is asked to produce verbs of a specific
Correspondence to Eun Jin Paek: epaek@uthsc.edu. Publisher Note: semantic category (as opposed to nouns of a specific cate-
This article is part of the Special Issue: Select Papers From the 51st
Clinical Aphasiology Conference. Disclosure: The authors have
gory) are a more recent addition to the set of verbal flu-
declared that no competing financial or nonfinancial interests existed at ency tasks and were first studied by Piatt, Fields, Paolo,
the time of publication. and Troster (1999). The “verb fluency tasks” have been
American Journal of Speech-Language Pathology • Vol. 32 • 2589–2601 • October 2023 • Copyright © 2023 The Authors 2589
This work is licensed under a Creative Commons Attribution 4.0 International License.
shown to be valid and reliable measures of multiple and MCI to AD (Alegret et al., 2018; Östberg et al.,
domains of cognition and predictors of future cognitive 2005) or from Parkinson’s disease to Parkinson’s disease
declines in the assessment of cognitive–linguistic skills in with dementia (Piatt, Fields, Paolo, Koller & Troster,
adults (Macoir et al., 2019; Östberg et al., 2005; Paek & 1999). Other studies have investigated verb fluency task
Murray, 2021; Zhao et al., 2013; to Piatt, Fields, Paolo, performance in different types of dementia (e.g., Davis
Koller, & Troster, 1999; Piatt, Fields, Paolo, & Troster, et al., 2010; Delbeuck et al., 2013). Given the increasing
1999). For example, those with neurological disorders number of people diagnosed with Alzheimer’s disease and
such as aphasia or Alzheimer’s dementia (AD) produced related dementias (Alzheimer’s Association, 2022) and the
fewer verbs in a given time than cognitively healthy older lack of cures for the diseases, there is a critical need for
adults (CHOAs; i.e., adults with no history of cognitive early detection and prevention of the disease and related
decline, cerebrovascular accidents, or other neurological disorders, as well as intervention and management strate-
disorders; Alegret et al., 2018; Beber et al., 2015; Davis gies that are appropriate for the level of cognition in each
et al., 2010; Forlenza et al., 2012; Paek et al., 2019) due individual. Therefore, it is imperative to have reliable and
to the decreased cognitive–linguistic abilities observed in valid measures to assist in the evaluation of cognitive and
those populations. linguistic symptoms, as well as distinguish characteristics
of CHOA, SCD, MCI, and AD (Östberg et al., 2005).
Verb fluency can be classified as a type of semantic
fluency, because it assesses a participant’s ability to gener- There are a variety of methods to analyze verbal flu-
ate words related to a specific semantic category, namely, ency performance in addition to calculating the number of
verbs that represent actions that people can perform. correct responses that can further capture cognitive
However, it differs from other types of semantic fluency, declines. For example, analyses of word clusters (i.e., the
such as noun or category fluency (e.g., animals or super- size or number of clusters of words that are semantically
market items), because it may involve distinct aspects of or phonologically related) may provide additional insights
language processing, including more complex semantics into the cognitive–linguistic characteristics of individuals
and syntax. For example, verb fluency tasks may engage with Alzheimer’s dementia (IwDs). Clusters are consecu-
processes related to motor aspects of language production, tive responses grouped together based on predetermined
as well as syntactic processes that involve verb-argument categorization criteria. The average cluster size has been
structure (Qiu & Johns, 2021; Williams et al., 2021). The suggested to reflect lexical retrieval skills and semantic
constraint in the instructions to produce actions that could networks, whereas the number of switches (i.e., change
be performed by people specifically highlights this motor from one cluster to a different cluster) reflects executive
aspect of language and sets it apart from other forms of functioning skills such as organization in verbal fluency
semantic fluency. Moreover, verbs generally possess higher tasks (Troyer et al., 1997). On the other hand, the number
semantic complexity compared with nouns, making them of clusters is thought to reflect both lexical retrieval and
a potentially more sensitive measure for detecting seman- executive function abilities (Troyer et al., 1997). Although
tic declines associated with AD, which could contribute to clustering and switching can drive overall productivity
early detection and better management of the disease during verbal fluency and are strongly correlated with the
(Kim & Thompson, 2004; Williams et al., 2021). There- total number of correct responses (e.g., Gordon & Chen,
fore, verb fluency tasks offer important and distinct 2022), research has demonstrated their potential diagnostic
insights into language abilities in individuals with AD that and clinical value in predicting disease progression and
go beyond what semantic and phonemic fluency measures differential diagnosis, especially in the context of neurode-
can capture. Despite its potential benefits, there is cur- generative disorders such as AD. Previous studies have
rently a dearth of information on verb fluency in AD, in shown that these measures (switching and clustering) offer
contrast to other types of verbal fluency such as animal unique information about the underlying cognitive pro-
fluency or phonemic fluency. cesses, highlighting their significance in evaluating individ-
uals with AD (Beatty et al., 1997; Fagundo et al., 2008;
Previous studies indicated that decreased verb flu-
Haugrud et al., 2011; Tröster et al., 1998; Weakley &
ency is associated with reduced lexical access to verbs and
Schmitter-Edgecombe, 2014).
executive dysfunction (Davis et al., 2010; Macoir et al.,
2019; Östberg et al., 2005), and thus, the tasks are particu- Despite the research analyzing word clusters in ver-
larly useful during diagnostic procedures and evaluations bal fluency tasks, studies investigating clustering and
of symptoms for people with AD, including the earlier switching have reported a substantial amount of rater bias
stages of the progressive disorder such as subjective cogni- (Troyer et al., 1997; Vonk et al., 2019). Moreover, verb
tive decline (SCD) and mild cognitive impairment (MCI; fluency in particular has less data on cluster analyses and
Östberg et al., 2005; Zhao et al., 2013). The task is also a the data available involves methods of clustering that are
useful tool to indicate the progression from SCD to MCI heterogeneous, making comparison and validation of the
Table 1. Demographic information on the individuals with dementia (IwDs) and cognitively healthy older adults (CHOAs).
Looking at the interrater and intrarater reliability results. This also suggests that the two methods may
analyzed in this study, subjective clustering methods pro- capture different aspects of verb semantics compared
duced excellent intrarater reliability on all variables. This with human raters. Upon comparison, we observed that
indicates that one rater can repeatedly categorize the the disparities between the two methods can be ascribed
responses similarly on different occasions—an important to the more elaborate categorization used by either the
consideration for assessment particularly when tracking objective or subjective clustering approach (although
disease progression or treatment effectiveness over time. these differences do not encompass all possible explana-
The subjective clustering methods revealed questionable tions for the observed disparities; thus, further research is
interrater reliability, especially on the average cluster size warranted).
variable in comparison with other variables. Such poten-
For example, although human raters would cluster
tial for inconsistency is problematic for a reliable and
talk, laugh, and cry together as a cluster associated with
valid assessment. Previous studies that have investigated
face, the Lancaster Norms used a more detailed categori-
the interrater reliability between two raters for clustering/
zation, with talk and laugh clustered for mouth and cry
switching analyses for other verbal fluency tasks (i.e., ani-
for head (see Table 4). On the other hand, garden, swim,
mal fluency) reported strong correlations overall (LeDoux
and plant were all clustered together by the Lancaster
et al., 2014; Ross et al., 2007). These studies that have
Norms into the category of hand (arm), whereas human
reported interrater reliability for clustering and switching
raters agreed that swim belongs to the whole body, and
analyses on animal fluency or letter fluency tasks have uti-
the other two (garden and plant) belong either to arms or
lized criteria from Troyer et al. (1997) and LeDoux et al.
hands. However, the relatively weak concordance between
(2014), establishing widely accepted and replicable criteria,
objective and subjective methods is comparable with the
which could contribute to the higher interrater reliability
degree of reliability observed between two human raters
observed in those cluster analyses. Given the variability of
(i.e., ICCs between the human raters). That is, the discrep-
the interrater reliability between two raters reported in this
ancies between objective and subjective methods are not
study during a verb fluency task, there appears to be
unique to the comparison between objective and subjective
increased opportunities for rater inconsistency especially
methods. This underscores the need for further research
when the clustering criteria are vague.
on a comprehensive and reliable set of criteria for cluster
Additionally, the ICCs between the subjective and analysis and how sensorimotor features can contribute to
objective clustering methods were classified as moderate verb processing and semantics in establishing the criteria.
(for the cluster size) or good (for the number of clusters Because the use of Lancaster Norms contributed to differ-
and switches) according to Koo and Li (2016). This indi- entiating the two groups in terms of the number of clus-
cates that there might be a commonality between the two ters and switches, which is consistent with subjective clus-
clustering methods lending some support to the validity of tering methods, this method may be another viable option
the objective clustering method, but not strong enough in clinical diagnostic processes to investigate cognitive pro-
ICC to claim that both methods would yield the same cesses underlying verbal fluency impairment.
Lancaster Lancaster
Variables Response Score Normsa Rater 1 Rater 2 Response Score Normsa Rater 1 Rater 2
Talk 1 mouth face mouth Garden 1 hand_arm arms hands
Smile 1 mouth face mouth Swim 1 hand_arm whole body whole body
Cry 1 head face mouth Plant 1 hand_arm arms hands
Laugh 1 mouth face mouth Walk 1 foot_leg legs legs
Run 1 foot_leg legs legs Talk 1 mouth face face
Jump 1 foot_leg legs legs Laugh 1 mouth face face
Walk 1 foot_leg legs legs Cry 1 head face face
Talkb 0 n/a n/a n/a
Blink 1 head face face
Swallow 1 mouth face mouth
Sneeze 1 head face mouth
Clap 1 hand_arm arms hand
Count 1 head mind hand
Swim 1 hand_arm whole body whole body
Number of clusters 2 3 4 2 1 1
Average cluster size 1.5 2.33 1.75 1.5 2 2
Number of switches 9 5 5 3 4 4
Note. Responses in the italicized areas indicate that they were clustered together. The bold text symbolizes clusters per each rater. CHOA =
cognitively healthy older adult; AD = Alzheimer’s dementia.
a
The dominant action effector was determined using the Lancaster Norms database. bThis is an incorrect response and therefore was not
included in the analysis.
Although clustering and switching analyses from cluster size during the animal fluency task although
category fluency tasks have been extensively studied in CHOAs produced the largest cluster sizes, followed by
neurological populations, verb clustering and switching MCI participants, and IwDs produced the smallest
analyses have received limited attention in research. cluster sizes (i.e., AD < MCI < CHOA). They concluded
Because these measures provide valuable insight into cog- that these results could highlight semantic memory retrieval
nitive processes underlying impaired verbal fluency perfor- processes in IwDs being inefficient, but still effective, which
mance, exploring verb fluency abilities through cluster and could be present in this study considering similar results
switch analyses can significantly enhance our understand- on a different verbal fluency task.
ing of AD beyond traditional accuracy-based scoring
Another possible explanation for the lack of signifi-
approach. Similar to the findings of Macoir et al. (2019),
cance in group differences for the average cluster size is
this study reports no significant difference between people
the heterogeneous presentation of AD. The large stan-
with neurodegenerative disorders and CHOAs regarding
dards of deviation in the IwDs, especially for the total
the average cluster size in verb fluency responses. With
number of words (SD = 3.28) and number of switches
other verbal fluency tasks such as animal fluency, there
(SD = 2.13) reflect the heterogeneity of verb fluency perfor-
are variable results regarding the group differences with
mance in the dementia population even within the mild
respect to average cluster size in IwDs compared with
and moderate stages of the disorder and precipitating fac-
CHOAs. Some studies have indicated that IwDs (or those
tors, which has been reported in previous studies (Delbeuck
with MCI) do not produce significantly smaller cluster
et al., 2013). As the heterogeneity of AD is commonly
sizes in comparison with healthy controls (e.g., Peter
observed and has been reported previously, the objective
et al., 2016; Raoux et al., 2008; Tröger et al., 2019;
clustering method used is a viable option despite the lack
Weakley et al., 2013). On the other hand, other studies
of group differences on the average cluster size variable.
have reported a significant difference between the two
groups regarding average cluster size with IwDs produc- With the objective clustering method, there were sig-
ing significantly smaller cluster sizes (e.g., Murphy et al., nificantly fewer switches in IwDs compared with CHOAs.
2006; Oh et al., 2019). Tröger et al. (2019) reported a These results are consistent with previous category fluency
lack of statistically significant differences on the average studies using animal fluency (Raoux et al., 2008; Rofes