Professional Documents
Culture Documents
Alh Mia Pigi Pigi
Alh Mia Pigi Pigi
To cite this article: Angela K. Troyer , Larry Leach & Esther Strauss (2006): Aging and Response
Inhibition: Normative Data for the Victoria Stroop Test, Aging, Neuropsychology, and Cognition: A
Journal on Normal and Dysfunctional Development, 13:1, 20-35
This article may be used for research, teaching, and private study purposes. Any
substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing,
systematic supply, or distribution in any form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation
that the contents will be complete or accurate or up to date. The accuracy of any
instructions, formulae, and drug doses should be independently verified with primary
sources. The publisher shall not be liable for any loss, actions, claims, proceedings,
demand, or costs or damages whatsoever or howsoever caused arising directly or
indirectly in connection with or arising out of the use of this material.
Aging, Neuropsychology, and Cognition, 13:20–35, 2006
Copyright © Taylor & Francis Group, LLC
ISSN: 1382-5585/05 print; 1744-4128 online
DOI: 10.1080/138255890968187
1
Baycrest Centre for Geriatric Care and 2University of Victoria
ABSTRACT
Increased difficulty with response inhibition occurs with age, although there is some
controversy as to whether increased interference on Stroop tasks reflects difficulties
with response inhibition or simply reflects generalized cognitive slowing. The Victoria
Stroop Test (VST) is a brief, easily administered, psychometrically sound version of
Stroop’s original task. Performance on the VST by adults across a wide age span was
examined to determine the association between age and various measures of interfer-
ence. In addition, normative data for the VST were calculated. Participants were 272
healthy, community-dwelling adults age 18 to 94. Age and speed were negatively cor-
related on all trials of the VST. Importantly, however, interference scores that were
corrected for baseline slowing remained highly correlated with age. Similarly, age and
error scores on the interference trial were positively correlated, indicating decreased
accuracy with age. These findings suggest that increased interference on Stroop tasks
with age is not accounted for by simple cognitive slowing and more likely reflects
other cognitive changes, such as decreased response inhibition. The VST has a number
of administrative and psychometric strengths, and the provision of normative data
should enhance its potential for clinical and research applications.
Address correspondence to: Angela Troyer, Psychology Department, Baycrest Centre for Geriatric
Care, 3560 Bathurst Street, Toronto, Ontario, Canada, M6A 2E1. E-mail: atroyer@baycrest.org
20
AGING AND RESPONSE INHIBITION 21
Stroop in 1935. There are a number of versions of the Stroop task, and each
involves: (a) a control task that requires the individual to name the colors of
nonword stimuli such as dots or strings of Xs; and (b) an interference task
that requires the individual to name the colors of color names that are printed
in contrasting colors (e.g., red is written in blue ink). This latter task, thus,
requires the individual to inhibit an automatic reading response and to
produce a more effortful color-naming response. The interference effect is
determined by calculating the extra time required to name colors in the
interference task in comparison to the time required to name colors in the
control task.
Studies of aging and inhibition using the Stroop task have been some-
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
dementia (e.g., Bondi et al., 2002; Nathan et al., 2001; Spieler et al., 1996).
In contrast, there is evidence that patients with nonfrontal focal lesions show
normal interference (Stuss et al., 2001).
Demographic variables other than age have less of an association with
Stroop performance. Previous studies have shown that sex is not consistently
related to Stroop interference scores (Ivnik et al., 1996; Moering et al., 2004;
Trenerry et al., 1989). Education tends to show a modest relation (r < .30)
with Stroop interference scores (Anstey et al., 2000; Ivnik et al., 1996),
although Moering and colleagues found a somewhat larger effect of educa-
tion on Stroop interference (accounting for 9 to 26% of the variance) in a
sample of African Americans.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
items on each of three tasks (naming the color of dots, of neutral words, and
of color words printed in contrasting colors). Thus, not only does it require
less administration time, but it may be ideal for detecting difficulties with
response inhibition because the examinee does not get extended practice
with the task. Second, the VST includes an additional training task (i.e.,
naming the colors of neutral words) that helps examinees establish the
appropriate response set (i.e., color naming) without exposure to the interfer-
ence condition (although it has been argued that presenting a color naming
trial decreases the interference effect on the subsequent trial—i.e., the
Comalli-Kaplan version in Mitrushina et al. (1999)). Third, scores that are
relatively independent of cognitive speed can be calculated, including an
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
error score and an interference score (calculated as a ratio) that corrects for
generalized slowing. Similar to the other versions of the Stroop test, the VST
materials are easy to obtain. The VST is in the public domain, and users may
make their own stimuli (as described in Spreen & Strauss (1998)) or pur-
chase them from the University of Victoria. In addition, adequate psycho-
metric data, including reliability and validity, have been obtained for the
VST (reviewed in Spreen & Strauss (1998)).
The purpose of this article is twofold: (a) Because the literature about
the effect of age on Stroop measures of response inhibition is contradictory,
we examined the association between age and various performance mea-
sures on the VST. Of greatest interest were examinations of time and error
measures in the baseline condition (i.e., Dot task) and interference condition
(i.e., Color Word task) across age. We calculated ratio time scores based on
baseline and interference conditions to compare across age. (b) The clinical
use of the VST to date has been hampered by a lack of normative data on a
large number of healthy controls. In this article, we provide normative data
for adults between the ages of 18 and 94.
METHODS
Participants
Participants were 272 healthy, community-dwelling adults living in
Victoria, British Columbia or Toronto, Ontario. They were recruited from a
university-based, older-adult subject pool, senior centers, and advertise-
ments posted in the community and at the university. Participants were inter-
viewed to screen for neurological disorders (e.g., loss of consciousness
exceeding 1 hour, stroke, epilepsy, multiple sclerosis) and psychiatric disor-
ders (e.g., depression or anxiety requiring hospitalization) that could affect
cognitive functioning; interviews were conducted by graduate students
trained by a registered psychologist. Demographic information, including
age, level of education, and sex, is provided in Table 1. Erroneously,
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
Age Group 18–39 30–49 40–59 50–64 60–69 65–74 70–79 75–84 80–94
n* 64 42 54 62 55 60 61 53 38
Age 26.8 (6.0) 38.5 (6.5) 51.3 (5.4) 57.6 (4.5) 64.6 (3.0) 69.6 (2.9) 74.5 (3.0) 79.1 (2.7) 84.3 (3.8)
Education n/a n/a 13.2 (3.0) 13.2 (2.4) 13.6 (2.5) 13.6 (2.9) 12.9 (2.8) 12.1 (2.7) 11.5 (3.1)
24
% Female 55 60 63 73 78 62 61 68 61
Note. Standard deviations are shown in parentheses. Education information was not recorded for the youngest age groups.
*Overlapping age groups were used.
Total n = 272.
AGING AND RESPONSE INHIBITION 25
education was not recorded for the youngest participants. Testing was
conducted in English, and all subjects were fully fluent in English.
Victoria Stroop Test
The Victoria Stroop Test is described in detail by Spreen and Strauss
(1998). Briefly, the stimuli include three cards, each of which contains six
rows of four items. Within each row, one item is presented in each of the
following colors: red, blue, yellow, and green. On each of the three tasks,
participants are asked to scan the items across rows from left to right and to
name the color of each item as quickly and accurately as possible. On the
first task, the Dot task, items are colored dots. On the second task, the
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
Neutral Word task, the items are common words (i.e., when, hard, and, over)
printed in color. On the third and final task, the Color Word task, the items
are color names (i.e., red, blue, yellow, green) printed in colors not corre-
sponding to the words themselves.
Standard administration of the VST (Spreen & Strauss, 1998) requires
the examiner to interrupt the examinee as soon as an error is made, after
which the examinee must correct the error and continue with the next item.
Mistakes spontaneously corrected by the individual before being interrupted
by the examiner are not counted as errors. The examiner records the amount
of time required to complete each task (to the nearest full second) and the
number of errors that are not spontaneously corrected.
Data Analyses
To address the first purpose of this article (i.e., examining interference
effects with age), correlations between age and several Stroop variables
were performed. Because many of the Stroop variables were skewed, we cal-
culated log transformation of the scores then performed correlations with
transformed rather than raw Stroop scores. Individually calculated ratio
interference time scores (i.e., time to complete the Color Word task ÷ time to
complete the Dot task) were also calculated, and correlations between age
and these ratio interference scores were computed. To address the second
purpose of this article (i.e., providing normative data), correlations with
demographic variables were performed to determine appropriate sample
groups, and normative data were calculated within those groups.
RESULTS
Data Transformations
Many of the distributions of Stroop scores were significantly and posi-
tively skewed, with skewness values ranging from 1.5 for time to complete
the Dot task to 4.5 for number of errors on the Neutral Word task. For the
26 ANGELA K. TROYER ET AL.
not statistically significant for any of the Stroop variables. All effect sizes
were negligible, with d’s less than 0.20.
Determination of Age Groups
Given that education and sex did not show sizable associations with
Stroop variables, normative data were tabulated according to age alone. An
examination of scatter plots depicting the relation between age and the seven
Stroop variables suggested a slower rate of change among the younger ages
than the older ages. Consequently, to maximize the number of participants in
each group, approximately 20-year age ranges were used for normative data
up to age 59, and approximately 10-year age ranges were used for normative
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
DISCUSSION
We obtained evidence for increased interference with age among healthy
adults on the Victoria Stroop Test. This interference effect is not accounted
for by the generalized slowing known to occur with age, that is: (a) individu-
ally calculated interference ratio scores (i.e., Color Word time ÷ Dot time)
were positively correlated with age but not correlated with baseline speed
(i.e., Dot time); and (b) error scores in the interference condition increased
with age. In general, the lower correlation between age and error scores (in
comparison to age and time scores) may reflect the restricted range due to an
overall low number of errors.
The nature of the relationship between age and interference on the
Stroop task, therefore, appears to involve slowed response rates and
increased errors. It is not possible to determine the extent to which speed
versus accuracy independently account for age-related changes because, on
the VST, the two scores are related. That is, the occurrence of errors
increases the time score obtained because participants are required to correct
mistakes as they are made. Regardless of whether decreased speed or
decreased accuracy is the prime contributor, it is apparent that a significant
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
TABLE 2. Scaled Score Equivalents for Dot Time Scores (in Seconds), by Age Group
Age 18–39 (29) 30–49 (40) 40–59 (50) 50–64 (57) 60–69 (65) 65–74 (70) 70–79 (75) 75–84 (80) 80–94 (87)
Mean 11.0 11.1 12.3 12.0 12.1 13.3 14.2 15.1 15.1
SD 2.5 1.9 2.4 2.3 2.3 3.6 3.9 3.8 3.8
SS
17 <7 <8 <9 <9 <9 <9
16 7 <8 <8 <8 8 9 9 9 9
15 8 8 8 8 9 10 10 10 10
14 9 9 10 11 11
13 9 9 10 10 11 11 12 12
12 10 11 12 13 13
28
11 10 11 11 12 13 14 14
10 11 12 12 12 13 14 15 15–16
9 11 12 13 13 13 14 15 16 17
8 12 13 14 14 14 15 16–17 17 18–19
7 13 14 15 15 15 16–17 18–19 18–20 20–21
6 14–15 15–16 16 16 16–17 18–20 20–21 21–22 22–23
5 16–17 17 17 17 18–19 21–27 22–27 23–27 24–29
4 18 18 18 18 20 28–29 28–29 28–29 > 29
3 19 19 19 19 21 30 > 29 > 29
2 > 19 > 19 > 19 > 19 > 21 > 30
Note. Midpoint ages are shown in parentheses.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
TABLE 3. Scaled Score Equivalents for Neutral Word Time Scores (in Seconds), By Age Group
Age 18–39 (29) 30–49 (40) 40–59 (50) 50–64 (57) 60–69 (65) 65–74 (70) 70–79 (75) 75–84 (80) 80–94 (87)
Mean 13.0 13.9 15.2 15.4 15.9 16.9 18.6 20.7 22.1
SD 2.9 2.6 2.9 3.2 5.1 5.1 5.4 6.7 6.0
SS
17 < 10 < 10 < 10 < 11 < 11
16 <9 < 10 10 10 10 11 11–12 < 14 < 14
15 9 10 11 11 11 12 13 14 14
14 10 11 12 12 13 14 15 15
13 11 12 13 13 15 16 16
12 12 13 14 16 17 17–18
29
11 12 13 14 14 14 15 17 18 19–20
10 13 14 15 15 15 16 18 19 21–22
9 14 15 16 16 16 17 19 20–22 23–24
8 15 16 17 17 17 18–19 20–21 23–25 25–26
7 16–17 17 18 18 18 20–21 22–23 26–29 27–29
6 18–19 18–19 19 19–20 19–24 22–24 24–29 30–33 30–33
5 20 20 20 21–26 25–41 25–41 30–41 34–45 34–45
4 21 21 21–24 27–28 42–45 42–45 42–46 > 45 > 45
3 22 > 21 > 24 > 28 > 45 > 45 > 46
2 > 22
Note. Midpoint ages are shown in parentheses.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
TABLE 4. Scaled Score Equivalents for Color Word Time Scores (in Seconds), by Age Group
Age 18–39 (29) 30–49 (40) 40–59 (50) 50–64 (57) 60–69 (65) 65–74 (70) 70–79 (75) 75–84 (80) 80–94 (87)
Mean 22.1 25.7 27.8 28.5 29.4 32.6 37.1 43.3 50.4
SD 7.2 9.0 8.2 9.5 9.0 9.6 11.9 17.7 23.9
SS
17 < 11 < 15 < 15 < 15 < 15 < 19 < 19 < 22 < 22
16 11 15 15–17 15–17 15–17 19–20 19–21 22 22–25
15 12–13 16 18 18 18–19 21 22–23 23–24 26–28
14 14–15 17 19 19 20 22 24 25–26 29–30
13 16 18 20–21 20–21 21–22 23–25 25–28 27–29 31–32
12 17–18 19–20 22–23 22–23 23–24 26–28 29–31 30–32 33–36
30
11 19 21–22 24–25 24–25 25–26 29–31 32–34 33–36 37–44
10 20–22 23–24 26–27 26–28 27–29 32–33 35–39 37–44 45–53
9 23–24 25–28 28–30 29–30 30–33 34–36 40–42 45–52 54–59
8 25–28 29–33 31–34 31–37 34–37 37–41 43–47 53–63 60–68
7 29–30 34–38 35–39 38–42 38–42 42–44 48–55 64–66 69–83
6 31–35 39–48 40–48 43–51 43–56 45–57 56–65 67–74 84–112
5 36–48 49–50 49–51 52–57 57–58 58–68 66–69 75–109 113–137
4 > 48 > 50 > 51 58–59 > 58 69–70 70–71 110–112 > 137
3 > 59 > 70 > 71 > 112
2
Note. Midpoint ages are shown in parentheses.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
Age 18–39 (29) 30–49 (40) 40–59 (50) 50–64 (57) 60–69 (65) 65–74 (70) 70–79 (75) 75–84 (80) 80–94 (87)
Mean 2.0 2.3 2.3 2.4 2.5 2.6 2.7 2.9 3.2
SD 0.6 0.8 0.7 0.8 0.8 0.9 1.0 1.0 1.6
SS
18 < 1.2 < 1.2 < 1.5
17 < 1.1 < 1.2 < 1.2 < 1.3 < 1.4 1.2–1.3 1.2–1.3 1.5
16 < 1.1 1.1–1.2 1.2 1.2 1.3–1.4 1.4 1.4–1.5 1.4–1.5 1.6–1.7
15 1.1–1.2 1.3 1.3–1.4 1.3–1.4 1.5–1.6 1.5–1.6 1.6 1.6–1.8 1.8
14 1.3 1.4–1.5 1.5 1.5–1.6 1.7–1.8 1.7–1.8 1.7–1.9 1.9–2.0 1.9–2.0
13 1.4–1.5 1.6–1.7 1.6–1.7 1.7–1.8 1.9 1.9 2.0 2.1 2.1–2.2
31
12 1.6–1.7 1.8–1.9 1.8–1.9 1.9–2.0 2.0 2.0–2.1 2.1 2.2 2.3
11 1.8–1.9 2.0–2.1 2.0–2.1 2.1 2.1 2.2 2.2–2.4 2.3–2.4 2.4–2.7
10 2.0–2.1 2.2–2.3 2.2–2.4 2.2–2.4 2.2–2.4 2.3–2.5 2.5–2.6 2.5–2.8 2.8–3.4
9 2.2–2.3 2.4–2.5 2.5–2.6 2.5–2.7 2.5–2.7 2.6–2.8 2.7–3.0 2.9–3.3 3.5–3.6
8 2.4–2.5 2.6–2.8 2.7–2.8 2.8–3.2 2.8–3.2 2.9–3.2 3.1–3.5 3.4–3.9 3.7–3.9
7 2.6–2.8 2.9–3.7 2.9–3.7 3.3–3.7 3.3–3.7 3.3–4.0 3.6–4.1 4.0–5.0 4.0–5.1
6 2.9–3.5 3.8–3.9 3.8–3.9 3.8–4.0 3.8–4.1 4.1–5.0 4.2–5.0 5.1–5.5 5.2–5.5
5 3.6–4.0 4.0–4.5 4.0–4.5 4.1–4.7 4.2–4.7 5.1–6.0 5.1–6.0 5.6–6.1 5.6–10.0
4 4.1–4.2 4.6 4.6 4.8–4.9 4.8–4.9 6.1–6.3 6.1–6.4 6.2–6.5 > 10.0
3 > 4.2 > 4.6 > 4.6 > 4.9 > 4.9 > 6.3 > 6.4 > 6.5
2
Note. Midpoint ages are shown in parentheses. Interference scores were calculated as the number of seconds required to complete the Color Word task divided by the num-
ber of seconds required to complete the Dot task.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
TABLE 6. Means and Cumulative Percentages Associated with Raw Error Scores on Color Word Task, By Age Group
Age 18–39 (29) 30–49 (40) 40–59 (50) 50–64 (57) 60–69 (65) 65–74 (70) 70–79 (75) 75–84 (80) 80–94 (87)
Mean 0.8 0.8 0.7 0.6 0.5 0.6 1.1 1.7 2.1
SD 1.0 1.0 1.0 1.0 0.9 1.2 1.6 1.8 2.0
Errors
0 100 100 100 100 100 100 100 100 100
1 47 50 43 35 27 33 47 64 79
32
2 19 24 22 19 14 17 29 45 50
3 9 9 4 5 4 5 18 28 32
4 3 2 3 2 2 6 11 16
5 2 5 8 10
6 2 3 6 8
7 2 2 4 8
8 3
Note. Midpoint ages are shown in parentheses.
AGING AND RESPONSE INHIBITION 33
for narrow age bands. (e) Interference can be examined without the contami-
nating effects of generalized slowing, by calculating ratio interference scores
and by examining error scores. Given these strengths, the VST may be par-
ticularly useful in many clinical and research applications.
REFERENCES
Anstey, K. J., Matters, B., Brown, A. K., & Lord, S. R. (2000). Normative data on neuropsy-
chological tests for very old adults living in retirement villages and hostels. The Clinical
Neuropsychologist, 14, 309–317.
Arbuckle, T. Y., & Pushkar Gold, D. (1993). Aging, inhibition, and verbosity. Journal of
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
Nathan, J., Wilkinson, D., Stammers, S., & Low, J. L. (2001). The role of tests of frontal
executive function in the detection of mild dementia. International Journal of Geriatric
Psychiatry, 16, 18–26.
Perret, E. (1974). The left frontal lobe of man and the suppression of habitual responses in
verbal categorical behavior. Neuropsychologia, 12, 323–330.
Raz, N. (2000). Aging of the brain and its impact on cognitive performance: Integration of
structural and functional findings. In F. I. M. Craik & T. A. Salthouse (Eds.), The hand-
book of aging and cognition (2nd ed., pp. 1–90). Mahweh, NJ: Lawrence Erlbaum.
Regard, M. (1981). Cognitive rigidity and flexibility: A neuropsychological study. University
of Victoria. unpublished doctoral dissertation.
Salthouse, T. A. (1985). Speed of behavior and its implications for cognition. In J. E. Birren &
K. W. Schaie (Eds.), Handbook of the psychology of aging (2nd ed., pp. 400–426). New York:
Von Nostrand Reinhold.
Downloaded by [Syracuse University Library] at 11:19 18 June 2013
Spieler, D. H., Balota, D. A., & Faust, M. E. (1996). Stroop performance in healthy younger
and older adults and in individuals with dementia of the Alzheimer’s type. Journal of
Experimental Psychology: Human Perception and Performance, 22, 461–479.
Spreen, O., & Strauss, E. (1998). A compendium of neuropsychological tests (2nd ed.).
New York: Oxford University Press.
Stroop, J. R. (1935). Studies of interference in serial verbal reaction. Journal of Experimental
Psychology, 18, 643–662.
Stuss, D. T., Floden, D., Alexander, M. P., Levine, B., & Katz, D. (2001). Stroop perfor-
mance in focal lesion patients: Dissociation of processes and frontal lobe lesion location.
Neuropsychologia, 39, 771–786.
Tabachnick, B. G., & Fidell, L. S. (1983). Using multivariate statistics. New York, NY:
Harper & Row.
Trenerry, M. R., Crosson, B., DeBoe, J., & Leber, W. R. (1989). Stroop Neuropsychological
Screening Test. Odessa, FL: Psychological Assessment Resources.
Uttl, B., & Graf, P. (1997). Color-Word Stroop Test performance across the adult life span.
Journal of Clinical and Experimental Neuropsychology, 19, 405–420.
Vakil, E., Weisz, H., Jedwab, L., Groswasser, Z., & Aberbuch, S. (1995). Stroop color-word
task as a measure of selective attention: Efficiency in closed-head-injured patients.
Journal of Clinical and Experimental Neuropsychology, 17, 335–342.
Vendrell, P., Junqué, C., Pujol, J., Jurado, M. A., Molet, J., & Grafman, J. (1995). The role of
prefrontal regions in the Stroop task. Neuropsychologia, 33, 341–352.
Verhaeghen, P., & De Meersman, L. (1998). Aging and the Stroop effect: A meta-analysis.
Psychology and Aging, 13, 120–126.
West, R. (1999). Age differences in lapses of intention in the Stroop task. Journals of Geron-
tology, 54B, P34–P43.
Whelihan, W. M., & Lesher, E. L. (1985). Neuropsychological changes in frontal functions
with aging. Developmental Neuropsychology, 1, 371–380.