KADSvs - BDIscreeningfordepression LeBlanc2002JChildAdolescPsychopharm

See discussions, stats, and author profiles for this publication at: https://www.researchgate.
net/publication/11198652
Screening for Adolescent Depression: Comparison of the Kutcher Adolescent

Depression Scale with the Beck Depression Inventory
Article in Journal of Child and Adolescent Psychopharmacology · February 2002

DOI: 10.1089/104454602760219153 · Source: PubMed
CITATIONS READS
114 6,702
4 authors, including:
John C Leblanc Stan Kutcher

Dalhousie University Dalhousie University
93 PUBLICATIONS 3,342 CITATIONS 347 PUBLICATIONS 10,740 CITATIONS
SEE PROFILE SEE PROFILE
Some of the authors of this publication are also working on these related projects:
New Initiative Grant, UWI, Mona View project
school mental health literacy View project
All content following this page was uploaded by John C Leblanc on 03 January 2015.
The user has requested enhancement of the downloaded file.

JOURNAL OF CHILD AND ADOLESCENT PSYCHOPHARMACOLOGY
Volume 12, Number 2, 2002
© Mary Ann Liebert, Inc.
Pp. 113–126
Screening for Adolescent Depression:

Comparison of the Kutcher Adolescent
Depression Scale with the Beck Depression Inventory
John C. LeBlanc, M.D., M.Sc., FRCPC,1,2 Anthony Almudevar, Ph.D.,3

Sarah J. Brooks, Ph.D.,1 and Stan Kutcher, M.D., FRCPC1
ABSTRACT
Self-report instruments commonly used to assess depression in adolescents have limited or

unknown reliability and validity in this age group. We describe a new self-report scale, the
Kutcher Adolescent Depression Scale (KADS), designed specifically to diagnose and assess
the severity of adolescent depression. This report compares the diagnostic validity of the full
16-item instrument, brief versions of it, and the Beck Depression Inventory (BDI) against the
criteria for major depressive episode (MDE) from the Mini International Neuropsychiatric
Interview (MINI). Some 309 of 1,712 grade 7 to grade 12 students who completed the BDI had
scores that exceeded 15. All were invited for further assessment, of whom 161 agreed to as-
sessment by the KADS, the BDI again, and a MINI diagnostic interview for MDE. Receiver
operating characteristic (ROC) curve analysis was used to determine which KADS items best
identified subjects experiencing an MDE. Further ROC curve analyses established that the
overall diagnostic ability of a six-item subscale of the KADS was at least as good as that of
the BDI and was better than that of the full-length KADS. Used with a cutoff score of 6, the
six-item KADS achieved sensitivity and specificity rates of 92% and 71%, respectively—a
combination not achieved by other self-report instruments. The six-item KADS may prove to
be an efficient and effective means of ruling out MDE in adolescents.
INTRODUCTION been shown to have weak reliability and/or

validity in this age group (see Brooks and
T HE K UTCHER ADOLESCENT D EPRESSION SCALE

(KADS) was developed for the purpose of
diagnosing and monitoring the severity of de-
Kutcher 2001).
The diagnostic ability of an instrument can
be evaluated against a “gold standard” method
pression in adolescents by self-report. Many using receiver operating characteristic (ROC)
self-rated and observer-rated instruments curve analysis (Metz 1978). An ROC curve de-
commonly utilized to measure depression picts the trade-off between an instrument’s
have had little testing in adolescents or have true positive rate (its sensitivity; i.e., the pro-
1 Department of Psychiatry, Dalhousie University, Halifax, Nova Scotia, Canada.

2 Department of Pediatrics, Community Health and Epidemiology, Dalhousie University, Halifax, Nova Scotia,
Canada.
3 Department of Mathematics and Computing Science, St. Mary’s University, Halifax, Nova Scotia, Canada.
113
114 LEBLANC ET AL.
portion of positive cases it correctly identifies The Children’s Depression Inventory (CDI;
as such) and its false positive rate (1 minus its Kovacs 1992) is a 27-item self-report scale
specificity; i.e., specificity being the proportion requiring 10 to 20 minutes to complete. It
of true negative cases correctly identified as was developed as a format- and language-
such by the instrument) for all possible cutoff modified version of the BDI to measure the se-
scores. By tradition, the x-axis and y-axis dis- verity of depression in children age 7 and
play the false positive and true positive rates, older. Although its reliability and internal con-
respectively, with both axes scaled from 0 to 1. sistency are satisfactory, and its sensitivity
The area under the ROC curve (AUC) is then to change is well established, its ability to
an index of the instrument’s ability to identify discriminate depressed and nondepressed
whether subjects are cases, and ranges in value youngsters has been questioned (Fundudis et
from 0.5 (chance level performance) to 1 (per- al. 1991; McCauley et al. 1983; Weissman et al.
fect identification). Henderson (1993) sug- 1980). In their study of the criterion validity of
gested the following correspondence of AUC the CDI and of Birleson’s (1981) 18-item De-
values to diagnostic accuracy level: AUC # 0.7, pression Self-Rating Scale (DSRS) against clini-
low; 0.7 < AUC # 0.9, moderate; AUC > 0.90, cal diagnoses of depression in a clinical sample
high. ROC analyses have been carried out on of youngsters (age range 12.8 to 16 years), Fun-
several of the instruments reviewed below. dudis et al. (1991) reported minimum misclas-
The 21-item Beck Depression Inventory sification rates of 23% and 26%, respectively
(BDI; Beck et al. 1961) was the first formal at- (AUCs were not reported). Our abstraction of
tempt to use self-ratings to assess the severity Fundudis et al.’s (1991) graphs (their Figs. 1
of depression. Its reliability and sensitivity to and 2, not reproduced here) suggests that a re-
change are well documented in adults. How- quirement for sensitivity at or >90% would
ever, it has been claimed that adolescents’ mean acceptance of specificity <50% with ei-
scores on the BDI are generally less stable than ther instrument. Of course these results do not
those of depression scales designed specifi- necessarily mean that the CDI and DSRS do
cally for them (Reynolds 1994). Furthermore, not perform well when restricted to younger
some authors suggest that BDI scores are not subjects.
specific to clinical depression and may instead The Mood and Feelings Questionnaire
index subjects’ dissatisfaction, demoralization, (MFQ; Costello and Angold 1988) is a 32-item
dysphoria, or anxiety (Gotlib 1984; Kutcher self-report scale designed for use in epidemio-
and Marton 1989). In a community study of logical studies of depression in youngsters aged
1,704 high school students (47% male, 53% fe- 8 to 18 years. A 13-item version has also been
male), Roberts et al. (1991) used ROC analysis developed. Both the long and short versions
to assess the diagnostic validity of the BDI have satisfactory reliability and internal consis-
against Diagnostic and Statistical Manual of tency (Angold et al. 1995a; Costello et al. 1991);
Mental Disorders, third edition, revised (DSM- sensitivity to change has yet to be well estab-
III-R; American Psychiatric Association 1987) lished. The diagnostic accuracy of the long
diagnoses of major depressive disorder MFQ was assessed by Wood et al. (1995)
(MDD). The authors reported AUCs of 0.93 against DSM-III-R diagnoses of MDD made
and 0.83 for males and females, respectively, using the Schedule for Affective Disorders and
which suggests an AUC of 0.88 assuming Schizophrenia–Child version (K-SADS; Am-
equal numbers of males and females. How- brosini et al. 1989). The authors obtained an
ever, although Roberts et al.’s (1991) ROC AUC of 0.82 with their sample of 104 outpa-
curve (their Fig. 1, not reproduced in this arti- tients (10 to 19 years) and an AUC of 0.76 with
cle) suggested it was possible to attain high a subset of 60 of these subjects age 14 and
sensitivity (93%) and high specificity (87%) older. Wood et al.’s (1995) first graph (their Fig.
concurrently for males, the ROC curve for fe- 1, not shown here) shows that the minimum
males showed that sensitivity >90% would misclassification rate was 22% for the total
come at the price of very low specificity sample, and that when sensitivity reached
(<40%). 90%, specificity was close to 50%. Thapar and
THE KUTCHER ADOLESCENT DEPRESSION SCALE 115
McGuffin (1998) assessed the performance of been developed, but Faulstich et al. (1986)
the short MFQ in a community sample of found that it did not discriminate effectively
twins aged 8 to 16 years. The authors used An- between depressed and nondepressed young-
gold et al.’s (1995b) Child and Adolescent Psy- sters. Various authors have claimed that the
chiatric Assessment interview to produce the scale measures general emotional turmoil
gold standard diagnoses according to ICD-10 rather than depression (Doerfler et al. 1988;
(World Health Organization 1992) and to Faulstich et al. 1986; Garrison et al. 1991).
DSM-III-R criteria for a major depressive epi- Roberts et al. (1991) concluded that neither
sode (MDE). The respective AUCs were low the BDI nor the CES-D was adequate on its
(0.63) and moderate (0.72). Thapar and own for screening adolescents for depression
McGuffin’s (1998) second pair of graphs (their and that the development of more efficient
Fig. 2, not shown here) indicated that if high screening instruments would require greater
sensitivity ($80%) was required, specificity attention to the criterion validity of the items
would be no better than 52%. chosen. Roberts et al. (1989) recommended the
The Reynolds Adolescent Depression Scale development of depression scales that explic-
(RADS; Reynolds 1987) is a 30-item self-report itly use items that reflect the diagnostic criteria
scale that measures the severity of symptoms of an accepted classification system, such
of depression in adolescents aged 13 to as DSM-IV (American Psychiatric Association
18 years. It is designed for use in schools or 1994). Today, there is still a need for a time-
clinical settings and takes 5 to 10 minutes to efficient screening tool for adolescent depres-
complete. The RADS has well-documented re- sion whose sensitivity and specificity will
liability, validity, and normative data (Rey- ensure useful positive and negative predictive
nolds 1987), and there is some evidence for its values in common clinical situations.
sensitivity to change (e.g., Reynolds and Coats In line with these recommendations and in
1986). It has yet to be subjected to ROC analy- view of the aforementioned limitations of
sis. However, when used with its suggested commonly used self-report instruments when
cutoff score of 77 and assessed against diag- used with adolescents, one of the authors (SK)
noses of depression made using the Hamilton developed the 16-item KADS. Its principal fea-
Depression Rating Scale (Hamilton 1960) or tures are: (a) items address the core symptoms
the Schedule for Affective Disorders and of adolescent depression, (b) symptoms are
Schizophrenia (Endicott and Spitzer 1978), its described using both standard and colloquial
specificity was high (>90%) but its sensitivity terminology, and (c) items measure the fre-
was fairly low; approximately one third of quency of occurrence (in the case of 14 items)
depressed teens were misclassified as not de- or the severity (in the case of the other two
pressed (Evert and Reynolds 1986, unpub- items) of the specified symptoms.
lished manuscript, cited in Reynolds 1987, By focusing attention on the core symp-
pp 27–28). toms of depression, the authors expect that
The 20-item Center for Epidemiological the KADS will correctly screen for depres-
Studies–Depression scale (CES-D; Radloff sion in adolescents achieving clinically ac-
1977) is a self-report instrument designed ceptable sensitivity and specificity. In this
to measure symptoms of depression in adult first study assessing the properties of the
community studies. However, significant KADS, the authors compared the instrument
weaknesses have been observed when the against the BDI in a large, randomly selected
CES-D is used with adolescents. Roberts et al. school-based cohort of secondary school stu-
(1991) observed fairly poor test-retest reliabil- dents. The authors refined the instrument by
ity (r = 0.61) using a 1- to 4-week intertest in- omitting items with the lowest predictive
terval, and they also reported a very low power for MDE and compared the abilities of
concordance (kappa = 0.11) between clinical the full and short versions of the KADS to di-
diagnoses of MDD and cases of depression agnose MDE against that of the BDI. Poten-
suggested by high CES-D scores. A version tial methods of implementing the KADS are
modified for children and adolescents has discussed.
116 LEBLANC ET AL.
METHODS for diagnosing an MDE consists of nine items

that assess symptoms of depression experi-
Subjects and procedures enced over the previous 2 weeks. The outcome
is confirmation or otherwise of a current
Subjects were recruited from a school-based and/or a past MDE; there are no associated
survey designed to measure the prevalence of rating scores. The MINI’s reliability and valid-
a variety of sleep habits and their relationship ity have been ascertained in a number of stud-
to symptoms of depression (LeBlanc et al. ies (Amorim et al. 1998; Lecrubier et al. 1997;
1999). In total, 1,712 students attending Sheehan et al. 1997, 1998).
grades 7 through 12 in Halifax Nova Scotia MINI diagnostic assessments were com-
schools, serving a mixed urban-rural munici- pleted for all 161 subjects who were inter-
pality (population 330,000), were selected viewed. Thirty subjects (18.6%) met the
using probability-based survey sampling tech- schedule’s criteria for a current MDE, and
niques. Subjects (771 males, 920 females, 131 (including the female subject who scored
21 gender unrecorded; mean age = 15.0 years, 15 on the initial BDI) did not. Nine subjects,
SD = 1.8 years, range 12 to 20 years) completed four of whom met the MDE criteria, were ex-
a sleep habits questionnaire and the BDI. Some cluded from further analysis because they
301 subjects had a BDI score of 16 while an ad- omitted 16 or more items on either the KADS
ditional 7 had BDI scores of < 16 but a score (n = 4) or the BDI (n = 5). These nine subjects
of 2 or 3 on the suicide item. These subjects did not differ from the rest of the sample in
(87 males, 217 females, 4 gender unrecorded; terms of their demographic characteristics or
mean age 15.2 years, SD = 1.8 years) and one in their original BDI scores. Furthermore,
additional 16-year-old female who scored their scores on the one self-report instrument
15 on the BDI were invited for a second inter- that they did complete during the interview
view to assess whether they had a depressive were neither systematically high nor system-
illness. atically low compared to those of the rest of
One hundred sixty-one subjects were inter- the sample. This left 152 subjects for data
viewed (48 males, 113 females; mean age analysis, 26 (17.1%) of whom met the criteria
15.1 years, SD = 1.8 years, range 12 to 20 years). for a current MDE. One hundred fifty of the
Of these subjects, 135 (84%) were Caucasian, analyzed subjects completed all items; 2 sub-
7 (4%) were multiracial, 5 (3%) were African jects omitted no more than two items on both
Canadian, 2 (1%) were Native Canadian, instruments.
10 (6%) were of other (non-Latino) races, and
for 2 subjects (1%) this information was not
The Kutcher Adolescent Depression Scale
recorded. One hundred forty-nine subjects
(48% of those invited) either refused interview The KADS consists of 16 items that cover the
or did not show. These noninterviewees did core symptoms of depression (see Appendix).
not differ significantly in age, gender, or eth- Fourteen of the items require subjects to indi-
nicity from those who were interviewed. Non- cate the frequency of occurrence of particular
interviewees also did not differ in their BDI experiences over the previous week by endors-
scores (mean = 24.0, SD = 9.2) from those who ing one of four possible responses, coded 0
were interviewed (mean = 23.6, SD = 7.3). to 3: “hardly ever,” “much of the time,” “most
During a 1-hour interview, subjects met of the time,” “all of the time.” Response op-
with a health professional and completed a tions for items 12 and 13 are worded slightly
follow-up BDI and the KADS, and were differently but are also listed in ascending
administered the depression section of the order of morbidity; four response options are
Mini International Neuropsychiatric Interview provided for item 12 (coded 0 to 3) and five for
(MINI, version 4.4; Lecrubier et al. 1997). The item 13 (coded 0 to 4). The total score of the in-
MINI is a fully structured diagnostic interview strument is the sum of all 16 items. Item 8 ad-
schedule that covers 15 major axis I psychiatric ditionally includes a probe in which subjects
disorders using DSM-IV criteria. The module whose appetite has changed can indicate
whether it has increased or decreased; this re- compared by noting the proportion of the sam-
sponse does not contribute to the total score. ples for which the AUC for one instrument ex-
ceeded the AUC for the other. This proportion
of 10,000 replications can be interpreted as a
Data analysis
p value.
Data analysis consisted of (a) assessment of
the performance of individual items, (b) cre-
ation of reduced versions of the KADS, and
RESULTS
(c) comparison of the diagnostic validity of the
full-length and reduced versions with that of
Assessment of individual items
the BDI. For (a), the assessment of individual
items, the means and standard deviations of Table 1 summarizes various properties of in-
item scores, and item-total score correlations dividual items of the KADS. Columns 2 and 3
were calculated. The diagnostic validity of show the means and standard deviations of
individual KADS items was then assessed subjects’ item scores (i.e., response levels).
against the MINI diagnoses by calculating the Item 7 (trouble concentrating, daydreaming)
area under each item’s ROC curve (Hanley and item 13 (thoughts/actions regarding sui-
and McNeil 1982). For (b), KADS items were cide/self-harm), respectively, were the most
reduced in number to shorten the administra- and least highly endorsed symptoms. Column 4
tion time of the instrument without unduly shows the Pearson correlation coefficients ob-
reducing internal consistency or diagnostic tained between each item score and the total
power. Cronbach’s (1951) alpha and ROC score of all other items. Each item correlated sig-
curves were the tools employed. The ROC nificantly with the total score of all other items
AUC for the total score of all original items (p < 0.01). The magnitude each item’s ROC AUC
was calculated. Items were then removed one (an index of diagnostic accuracy) is presented in
at a time starting with the one that had the column 5. Item 13 (thoughts/actions regarding
least predictive power (i.e., the smallest AUC). suicide/self-harm) ranked highest in this re-
After each removal, Cronbach’s alpha and the spect, and item 3a (trouble falling asleep, lying
AUC for the remaining set of items were recal- awake) ranked lowest.
culated. This generated a hierarchy of candi- Figure 1 presents a visual summary of the
date subscales where a suitable subscale was diagnostic ability of each response level of
defined as one that exhibited high predictive each KADS item to identify cases of MDE. For
power while maintaining acceptably high in- each item, the plots show the proportion of
ternal consistency. MDE cases among each subset of subjects who
For (c), the ROC curves of two different ver- selected a particular response level. (It should
sions of the KADS and the BDI were compared be noted that these plots do not have a direct
graphically. The statistical significance of dif- correspondence to ROC curves.) Although
ferences between the magnitudes of their some allowance should be made for variation
AUCs was estimated using the following boot- due to sampling error, the proportion of cases
strap method (see Hamilton 1992 for overview of MDE should in general increase with in-
of bootstrap methods). The study’s 152 sub- creasing response level.
jects (with their relevant KADS and BDI data Item 13 (suicide/self-harm) is notable in
and MINI diagnoses) were selected randomly that it yielded the largest individual item
with replacement (i.e., any given subject could AUC, although its plot in Fig. 1 does not show
be selected more than once) to produce a the expected increase in proportion of cases of
new study “sample” of 152 subjects. This pro- MDE with higher response levels. This item’s
cess was carried out 10,000 times to yield large AUC is due to the vast majority of sub-
10,000 replications of the original study, each jects endorsing one of its lower three response
with 152 subjects. The AUCs of the three in- levels (n = 143, 94% of subjects) in a manner
struments were recalculated for each of these appropriate for their diagnostic status (i.e.,
new samples. Pairs of instruments were then very few MDE cases selected response
118 LEBLANC ET AL.
TABLE 1. PSYCHOMETRIC P ROPERTIES OF INDIVIDUAL ITEMS
Mean Item score Item-total Item Item

item standard score ROC area ROC area
Item Item content score deviation correlation magnitude rank
1 Low mood, depressed, can’t be bothered 0.93 0.87 0.55 0.75 5

2 Irritable, losing temper 1.09 0.89 0.37 0.71 8
3a Trouble falling asleep, lying awake 1.05 1.07 0.50 0.57 16
3b Poor sleep during night, waking up 0.70 0.90 0.51 0.69 9
3c Waking too early in morning 0.59 0.89 0.31 0.61 13
3d Sleeping during day, taking naps 0.68 0.98 0.37 0.58 15
4 Less interest in friends, work, going out 0.59 0.82 0.48 0.74 7
5 Feeling worthless, hopeless 0.78 0.91 0.57 0.78 4
6 Tired, low in energy, low motivation 1.22 0.98 0.60 0.78 3
7 Trouble concentrating, daydreaming 1.41 1.03 0.55 0.59 14
8 Appetite changes 0.84 0.96 0.52 0.68 10
9 Less experience of fun, pleasure 0.66 0.79 0.60 0.80 2
10 Feeling worried, nervous, panicky 0.99 1.00 0.59 0.75 6
11 Headaches, butterflies, nausea, etc. 0.86 0.99 0.56 0.66 11
12 Interest in/thoughts re: sex, sexual arousal 0.59 0.94 0.25 0.64 12
13 Thoughts/actions re: suicide/self-harm 0.54 0.96 0.47 0.81 1
ROC = receiver operating characteristic.
level “0,” but relatively large numbers of cases results of the bootstrap indicated that the most
selected response levels “1” and “2”). None- reliable difference in AUC magnitude was that
theless, the results provide little support between the six-item and 16-item KADS; of the
for having five different response levels for two, the six-item KADS’s diagnostic accuracy
item 13. For this reason, future versions of the was better, with a significance level of 0.053
KADS will be produced with the upper two re- (two-sided). The six-item KADS also per-
sponse levels collapsed into one (plans and/or formed better than the BDI, but the difference
actions). could be explained by chance (p < 0.293).
Item selection Criterion scores

Table 2 lists the 16 KADS items in accor- Inspection of subjects’ six-item KADS scores
dance with their individual AUC ranks (see indicated that if minimal overall misclassifica-
Table 1) and displays the AUC and Cronbach’s tion were desired, a cutoff score of either 10
alpha values for the hierarchical sequence of (misclassification = 13%, sensitivity = 62%,
candidate instruments. The values in a given specificity = 93%) or 11 (misclassification =
row are those for the instrument comprising 13%, sensitivity = 42%, specificity = 97%)
the items in and above that row. If we select should be used. However, if a high level of
the six-item subscale (comprising items 13, 9, sensitivity is required in conjunction with an
6, 5, 1, and 10), we maintain a Cronbach’s acceptable level of specificity, a considerably
alpha coefficient of 0.80, with an AUC of 0.89, lower cutoff score, such as 6, should be chosen.
both considered acceptable. This represents a This cutoff score yields overall misclassifica-
good balance between the brevity required for tion, sensitivity, and specificity rates of 25%,
an instrument to be used in a busy clinical set- 92%, and 71%, respectively.
ting and the ability to predict MDE status. When it is known whether or not a random
Figure 2 shows the ROC curves for the BDI, subject’s score is below the chosen cutoff, the
six-item KADS, and 16-item KADS. Respec- probability of MDE becomes a function of
tively, their AUCs were 0.86, 0.89, and 0.85. All the instrument’s specificity and sensitivity and
are comparable, although the ROC curves of the prevalence rate for MDE in the sample
the 16-item KADS and the BDI are mostly from which the subject was drawn. The posi-
dominated by that of the six-item KADS. The tive predictive value (PPV) of the instrument is
FIG. 1. Visual summary of the diagnostic power of each Kutcher Adolescent Depression Scale item. The possible re-
sponse levels for particular items are presented along the horizontal axes. The plots show the proportion of respon-
dents who met the major depressive episode criteria. Although for some items, the number of subjects endorsing high
response levels was relatively low, for most items, the proportion of respondents who met the major depressive epi-
sode criteria can be seen to increase with increases in the response level.
the probability that the disorder is present Table 3 summarizes the instrument’s sensi-
given a criterion score (i.e., a six-item KADS tivity and specificity rates across a range of
score $ 6, say). The negative predictive value possible cutoff scores. The table also illustrates
(NPV) is the probability that the disorder is ab- the PPV and NPV probabilities for two differ-
sent given a noncriterion score (i.e., a six-item ent prevalence rates: one for the community
KADS score < 6, say). These probabilities can sample of the current study and another for a
be calculated as follows: putative clinical setting where, for illustrative
purposes, the probability of an adolescent pre-
senting to clinic with an MDE is set at 10%.
PPV = (Sensitivity 2 Prevalence)/ The prevalence rate (preassessment likelihood
[(Sensitivity 2 Prevalence) + (1 2 Specificity) of MDE) for the total sample of grade 7 to
2 (1 2 Prevalence)] grade 12 students of this study (n = 1,712) was
NPV = [Specificity 2 (1 2 Prevalence)]/ estimated as 30/161 2 309/1,712 = 0.034,
[(1 2 Sensitivity) 2 Prevalence + Specificity where 30/161 was the fraction of interviewees
2 (1 2 Prevalence)] confirmed as cases of MDE and 309/1,712 was
120 LEBLANC ET AL.
TABLE 2. ROC AREAS AND CRONBACH’S ALPHA COEFFICIENTS FOR CANDIDATE SUBSCALES OF THE KADS
Subscale Subscale: Subscale:
composition Content of item added to subscale ROC area Cronbach’s alpha
Item 13 alone Thoughts/actions re: suicide/self-harm 0.81 —

Adding item 9 Less experience of fun, pleasure 0.86 0.47
Adding item 6 Tired, low in energy, low motivation 0.88 0.62
Adding item 5 Feeling worthless, hopeless 0.89 0.72
Adding item 1 Low mood, depressed, can’t be bothered 0.89 0.78
Adding item 10 Feeling worried, nervous, panicky 0.89 0.80
Adding item 4 Less interest in friends, work, going out 0.89 0.81
Adding item 2 Irritable, losing temper 0.90 0.82
Adding item 3b Poor sleep during night, waking up 0.90 0.81
Adding item 8 Appetite changes 0.89 0.83
Adding item 11 Headaches, butterflies, nausea, etc. 0.89 0.84
Adding item 12 Interest in/thoughts re: sex, sexual arousal 0.90 0.83
Adding item 3c Waking too early in morning 0.89 0.83
Adding item 7 Trouble concentrating, daydreaming 0.88 0.85
Adding item 3d Sleeping during day, taking naps 0.87 0.85
Adding item 3a Trouble falling asleep, lying awake 0.85 0.87
Items are listed according to their individual ROC area ranks (see Table 1). Values in a given row are for the subscale
comprising items listed in and above that row. KADS = Kutcher Adolescent Depression Scale; ROC = receiver operat-
ing characteristic.
FIG. 2. Receiver operating characteristic curves for the Beck Depression Inventory (BDI), six-item Kutcher Adoles-
cent Depression Scale (KADS), and 16-item KADS. For the most part, the curve of the six-item KADS dominates over
those of the other two instruments.
TABLE 3. D IAGNOSTIC VALIDITY STATISTICS FOR THE 6-ITEM KADS OVER A RANGE OF
C UTOFF SCORES, FOR THE COMMUNITY SAMPLE STUDIED, AND FOR A PUTATIVE CLINICAL
SETTING IN WHICH THE PREVALENCE R ATE IS 0.10
Prevalence = 0.034a Prevalence = 0.10

Cutoff Sensitivity Specificity
score rate (%) rate (%) PPV NPV PPV NPV
6 92 71 0.10 1.00 0.26 0.99

7 81 78 0.11 0.99 0.29 0.97
8 81 86 0.16 0.99 0.39 0.98
9 69 90 0.19 0.99 0.43 0.96
10 62 93 0.23 0.99 0.49 0.96
11 42 97 0.32 0.98 0.60 0.94
KADS = Kutcher Adolescent Depression Scale; MDE = major depressive episode.

a The prevalence rate (preassessment likelihood of MDE) for the total sample of grade 7
to grade 12 students (n = 1,712) of this study was estimated as 30/161 2 309/1,712 = 0.034,
where 30/161 was the fraction of interviewees confirmed as cases of MDE, and 309/1,712
was the fraction of all original subjects invited for interview.
the fraction of all original subjects invited for The criterion standard for diagnosis used in
interview. this study (i.e., the MINI) will not be in perfect
agreement with other standards (e.g., a lay per-
son using another structured instrument such
DISCUSSION as the Diagnostic Interview Schedule for Chil-
dren Version IV; Shaffer et al. 2000), a clinician
The KADS was designed to address limita- using a semistructured instrument such as the
tions of previous instruments used to assess Schedule for Affective Disorders and Schizo-
depression in adolescents. Its items address phrenia for school-age children (Puig-Antich
the core symptoms of depression, giving it the and Chambers 1978, unpublished manuscript),
potential to have good predictive or criterion a psychiatrist applying DSM-IV criteria without
validity as well as good evaluative properties a standardized interview schedule, and the like.
(i.e., the ability to assess change over time). No two standards will agree completely with
This first study of its characteristics compared one another on which subjects are cases and
the diagnostic ability of the complete 16-item which subjects are not cases; furthermore, there
version and a brief six-item version of the is no agreed upon gold standard diagnostic
KADS with the often-used BDI for adolescents method. The implications of this for our analy-
diagnosed with MDE. In this school-based sis of the KADS are difficult to predict. If, for ex-
sample of adolescents, the brief version out- ample, another standard were preferred, and
performed both the full version of the KADS the KADS agreed better with this standard than
and the BDI, and bootstrapping results sug- did the BDI, the KADS would further outper-
gest that the six-item KADS would perform at form the BDI, as measured by the area under
least as well as the BDI in similar samples of the ROC curve. On the other hand, if the BDI
adolescents. It has the major advantage of agreed better with another standard, the small
brevity (6 items vs. 21 in the BDI), which al- advantage enjoyed by the KADS would de-
lows it to be used as a practical screening tool crease and perhaps reverse. It would be useful,
in a clinical setting where, for example, it in a future study, to compare the instruments
could be filled in by a patient before he or she directly to another standard such as DSM-IV di-
sees a mental health professional. As well, the agnosis by at least two trained professionals,
small number of items poses a low respondent using a protocol where interrater disagree-
burden, thereby allowing it to be combined ments are acknowledged and resolved.
with other brief instruments that screen for To a small extent, the study design may have
different mental health conditions. affected the diagnostic validity results of the
122 LEBLANC ET AL.
KADS and of the BDI in that the BDI itself was in the items. This is intended as a means of
used earlier as the screening tool to determine identifying cases of adolescent depression and
which subjects to invite for interview. There of monitoring the severity of these core symp-
may have been a small number of subjects toms via self-report. As such it is not intended
who were screened out by the initial BDI but to deepen our understanding of adolescent de-
who might have been correctly identified as pression itself. It is noteworthy that most of
cases by the KADS and/or by the second BDI the questions in the six-item version address
had they not been screened out. Contrarily, specific symptoms of depression such as anhe-
there may have been a few subjects who were donia and affect, whereas many of the other
screened out by the initial BDI but who might questions in the full version address symp-
have been wrongly identified as cases by the toms that occur in mood disorders but are not
KADS and/or by the second BDI had they not specific to them. The six-item KADS per-
been screened out. We predict that any bias formed at least as well as the full-length ver-
would likely favor the BDI in a comparison sion and would be a worthwhile candidate for
against the KADS because all of its true posi- screening in a busy primary care practice set-
tives have an opportunity for further testing, ting. Whether this short version has reason-
whereas some true positives that might have able evaluative properties remains to be
been identified by the KADS but not the BDI assessed using longitudinal data. Studies with
would not have been assessed further. It is longitudinal data are necessary to determine
likely that such a bias is small and clinically whether the final KADS will consist of a single
unimportant. instrument (e.g., six-item version) that has rea-
We chose a cutoff score of 6 for the six-item sonable diagnostic and evaluative properties,
KADS to enable it to serve as a relatively sensi- or a package with a six-item version intended
tive screening instrument. A researcher or clin- for screening and a longer version (between
ician could choose a different cutoff according 7 and 16 questions) that can be used for evalu-
to the sensitivity and specificity desired for a ative purposes. One such study is currently in
particular application. As the cutoff score is in- progress.
creased, sensitivity will decrease and speci- Readers interested in obtaining further in-
ficity will increase. Our data indicate that if the formation about the KADS should contact
overall prevalence of MDE is any value < 10%, Stan Kutcher at the Department of Psychiatry,
a six-item KADS score < 6 should virtually Dalhousie University, Queen Elizabeth II
rule out MDE, because the NPV would imply Health Sciences Centre, Abbie J. Lane Build-
a probability of MDE # 1%. A criterion score ing, Suite 9212, 5909 Veterans’ Memorial Lane,
would merely indicate the need for further as- Halifax, Nova Scotia, Canada B3H 2E2,
sessment because the corresponding PPV (e-mail: Stan.Kutcher@dal.ca).
would still imply a probability of < 50% (un-
less the prevalence rate is $ 10% and/or the
KADS score is $ 11). Although diagnostic sus-
ACKNOWLEDGMENTS
picion should be raised, it is still most likely
that the individual is not experiencing an
This work was supported in part by a con-
MDE. Low PPVs are common to all existing
tract from Servier Canada, Ltd. (GlaxoSmith-
self-report depression scales when they are
Kline, Canada) and by the Designated
used with cutoff scores that are chosen to
Mental Health Research Fund, Province of
maintain high sensitivity. In achieving a speci-
Nova Scotia.
ficity rate of > 70% while maintaining very
high sensitivity (>90%), however, the KADS is
less afflicted than other such instruments in
REFERENCES
this respect.
The 16-item KADS has excellent content va- Ambrosini PJ, Metz C, Prabucki K, Lee J-C: Videotape re-
lidity in that the core symptoms of depression liability of the third revised edition of the K-SADS. J
(as specified in DSM-IV) are coded explicitly Am Acad Child Adolesc Psychiatry 28:723–728, 1989.
American Psychiatric Association: Diagnostic and Statis- Gotlib H: Depression and general psychopathology in
tical Manual of Mental Disorders, 3rd ed., rev. (DSM- university students. J Abnorm Psychol 93:19–30, 1984.
III-R) Washington (DC), American Psychiatric Hamilton LC: Bootstrap methods. In: Regression with
Association, 1987. Graphics. Pacific Grove (California), Brooks/ Cole
American Psychiatric Association: Diagnostic and Statistical Publishing, 1992, pp 313–326.
Manual of Mental Disorders, 4th ed. (DSM-IV). Washing- Hamilton M: A rating scale for depression. J Neurol Neu-
ton (DC), American Psychiatric Association, 1994. rosurg Psychiatry 23:56–62, 1960.
Amorim P, Lecrubier Y, Weiller E, Hergueta T, Sheehan Hanley JA, McNeil BL: The meaning and use of the area
D: DSM-III-R psychotic disorders: Procedural validity under a receiver operating characteristic (ROC) curve.
of the Mini International Neuropsychiatric Interview Radiology 143:29–36, 1982.
(MINI): Concordance and causes for discordance with Henderson AR: Assessing test accuracy and its clinical con-
the MIDI. Eur Psychiatry 13:26–34, 1998. sequences: A primer for receiver operating characteris-
Angold A, Costello EJ, Messer SC, Pickles A, Winder F, tic curve analysis. Ann Clin Biochem 30:521–539, 1993.
Silver D: Development of a short questionnaire for Kovacs M: The Children’s Depression Inventory. North
use in epidemiological studies of depression in chil- Tonawanda (New York), Mental Health Systems, 1992.
dren and adolescents. Int J Methods Psychiatr Res Kutcher S, Marton P: Utility of the Beck Depression In-
5:237–249, 1995a. ventory with psychiatrically disturbed adolescent
Angold A, Prendergast M, Cox A, Harrington R, Simonoff outpatients. Can J Psychiatry 34:107–109, 1989.
E, Rutter M: The child and adolescent psychiatric as- LeBlanc JC, Morehouse R, Rajda M, Kutcher S, Hayden
sessment (CAPA). Psychol Med 25:739–753, 1995b. D, Thompson S, Rusak B: Prevalence of Delayed Sleep
Beck AT, Ward C, Mendelson M, Mock J, Erbaugh J: An Phase Syndrome in Adolescents: A School-Based Sur-
inventory for depression. Arch Gen Psychiatry vey [abstract]. San Francisco, Society for Pediatric Re-
4:561–571, 1961. search, May 1999.
Birleson P: The validity of depressive disorder in child- Lecrubier Y, Sheehan DV, Weiller E, Amorim P, Bonora I,
hood and the development of a self-rating scale: A re- Harnett Sheehan K, Janavs J, Dunbar GC: The Mini In-
search report. J Child Psychol Psychiatry 22:73–88, 1981. ternational Neuropsychiatric Interview (MINI): A short
Brooks SJ, Kutcher S: Diagnosis and measurement of diagnostic structured interview: Reliability and validity
adolescent depression: A review of commonly uti- according to the CIDI. Eur Psychiatry 12:224– 231, 1997.
lized instruments. J Child Adolesc Psychopharmacol McCauley E, Mitchell JR, Burke P, Moss S: Cognitive at-
11:341–376, 2001. tributes of depression in children and adolescents. J
Costello EJ, Angold A: Scales to assess child and adoles- Consult Clin Psychol 56:903– 908, 1988.
cent depression: Checklists, screens, and nets. J Am Metz CE: Basic principles of ROC analysis. Semin Nucl
Acad Child Adolesc Psychiatry 27:726–737, 1988. Med 8:283–298, 1978.
Costello EJ, Benjamin R, Angold A, Silver D: Mood vari- Radloff LS: A CES-D scale: A self-report depression scale
ability in adolescents: A study of depressed, nonde- for research in the general population. Applied Psy-
pressed and comorbid patients. J Affect Disord chological Measurement 1:385–401, 1977.
23:199–212, 1991. Reynolds WM, Coats KI: A comparison of cognitive-be-
Cronbach LJ: Coefficient alpha and the internal structure havioral therapy and relaxation training for the treat-
of tests. Psychometrika 16:297–334, 1951. ment of depression in adolescents. J Consult Clin
Doerfler LA, Felner RD, Rowlinson RT, Evans E, Raley Psychol 54:653–660, 1986.
PA: Depression in children and adolescents: A com- Reynolds WM: Reynolds Adolescent Depression Scale
parative analysis of the utility and construct validity Professional Manual. Odessa (Florida), Psychological
of two assessment measures. J Consult Clin Psychol Assessment Resources, 1987.
56:769–772, 1988. Reynolds WM: Assessment of depression in children and
Endicott J, Spitzer R: A diagnostic interview: A schedule adolescents by self-report questionnaires. In: Hand-
for affective disorders and schizophrenia. Arch Gen book of Depression in Children and Adolescents.
Psychiatry 35:837–844, 1978. Edited by Reynolds W, Johnston H. New York,
Faulstich ME, Carey MP, Ruggiero L, Enyart P, Gresham Plenum Press, 1994, pp 209–234.
F: Assessment of depression in childhood and adoles- Roberts RE, Vernon SW, Rhoades HM: Effects of lan-
cence: An evaluation of the Center for Epidemiologi- guage and ethnic status on reliability and validity of
cal Studies Depression Scale for Children (CES-DC). the Center for Epidemiologic Studies– Depression
Am J Psychiatry 143:1024–1027, 1986. Scale with psychiatric patients. J Nerv Ment Dis 177:
Fundudis T, Berney T, Kolvin I, Famuyiwa OO, Barrett L, 581–592, 1989.
Bhate S, Tyrer SP: Reliability and validity of two self- Roberts RE, Lewinsohn PM, Seeley JR: Screening for
rating scales in the assessment of childhood depres- adolescent depression: A comparison of depression
sion. Br J Psychiatry 11 (Suppl):36–40, 1991. scales. J Am Acad Child Adolesc Psychiatry
Garrison CZ, Addy CL, Jackson KL, MeKeown RE, 30:58–66, 1991.
Waller JL: The CES-D as a screen for depression and Shaffer D, Fisher P, Lucas CP, Dulcan MK, Schwab-Stone
other psychiatric disorders in adolescents. J Am Acad M: NIHM Diagnostic Interview Schedule for Children
Child Psychiatry 30:636–641, 1991. Version IV (NIMH DISC-IV): Description, differences
124 LEBLANC ET AL.
from previous versions, and reliability of some com- Wood A, Kroll L, Moore A, Harrington R: Properties of
mon diagnoses. J Am Acad Child Psychiatry 39:28–38, the Mood and Feelings Questionnaire in adolescent
2000. psychiatric outpatients: A research note. J Child Psy-
Sheehan DV, Lecrubier Y, Sheehan KH, Janavs J, Weiller E, chol Psychiatry 36:327–334, 1995.
Keskiner A, Schinka J, Knapp E, Sheehan MF, Dunbar World Health Organization: The ICD-10 Classification of
GC: The validity of the Mini International Neuropsy- Mental and Behavioral Disorders. Geneva, World
chiatric Interview (MINI) according to the SCID-P and Health Organization, 1992.
its reliability. Eur Psychiatry 12:232–241, 1997.
Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs
J, Weiller E, Hergueta T, Baker R, Dunbar GC: The
Address reprint requests to:
Mini-International Neuropsychiatric Interview Stanley C. Kutcher, M.D., M.Sc., FRCPC
(M.I.N.I.): The development and validation of a struc- Department of Psychiatry
tured diagnostic psychiatric interview for DSM-IV and Dalhousie University
ICD-10. J Clin Psychiatry 59 (Suppl 20):22–33, 1998. Queen Elizabeth II Health Sciences Centre
Thapar A, McGuffin P: Validity of the shortened Mood Abbie J. Lane Building
and Feelings Questionnaire in a community sample of Suite 9212
children and adolescents: A preliminary research 5909 Veterans’ Memorial Lane
note. Psychiatry Res 81:259–268, 1998. Halifax, Nova Scotia
Weissman MM, Orvashel H, Padian N: Children’s symp-
Canada B3H 2E2
tom and social functioning self-report scales: Com-
parison of mothers’ and children’s reports. J Nerv
Ment Dis 168:736–740, 1980.
E-mail: Stan.Kutcher@dal.ca
APPENDIX a. hardly ever

THE KUTCHER ADOLESCENT b. much of the time
DEPRESSION SCALE c. most of the time
Over the last week, how have you been “on d. all of the time
average” or “usually” regarding the following 5. Feelings of worthlessness, hopelessness,
items: letting people down, not being a good
person.
1. Low mood, sadness, feeling blah or a. hardly ever
down, depressed, just can’t be bothered. b. much of the time
a. hardly ever c. most of the time
b. much of the time d. all of the time
c. most of the time 6. Feeling tired, feeling fatigued, low in
d. all of the time energy, hard to get motivated, have to
2. Irritable, losing your temper easily, feel- push to get things done, want to rest or
ing pissed off, losing it. lie down a lot.
a. hardly ever a. hardly ever
b. much of the time b. much of the time
c. most of the time c. most of the time
d. all of the time d. all of the time
3. Sleep difficulties: different from your 7. Trouble concentrating, can’t keep your
usual (over the years before you got sick). mind on schoolwork or work, day-
3a.Trouble falling asleep, lying awake in dreaming when you should be working,
bed. hard to focus when reading, getting
a. hardly ever “bored” with work or school.
b. much of the time a. hardly ever
c. most of the time b. much of the time
d. all of the time c. most of the time
3b.Sleeping poorly during the d. all of the time
night,wak-ing up, getting out of bed. 8. Appetite changing from usual (before
a. hardly ever you got sick); not feeling hungry, not
b. much of the time wanting to eat or feeling really hungry,
c. most of the time wanting to eat a lot.
d. all of the time a. hardly ever
3c.Waking up too early in the morning, b. much of the time
at least 1 hour before you want to or c. most of the time
need to get up. d. all of the time
a. hardly ever Note: increased __ or decreased __
b. much of the time 9. Feeling that life is not very much fun, not
c. most of the time feeling good when usually (before get-
d. all of the time ting sick) would feel good, not getting as
3d.Sleep during the day, taking naps, much pleasure from fun things as usual
lying down to rest. (before getting sick).
a. hardly ever a. hardly ever
b. much of the time b. much of the time
c. most of the time c. most of the time
d. all of the time d. all of the time
4. Feeling decreased interest in hanging out 10. Feeling worried, nervous, panicky, tense,
with friends; being with your best friend; keyed up, anxious.
being with your spouse/boyfriend/girl- a. hardly ever
friend; going out of the house; doing b. much of the time
schoolwork or work; doing hobbies, c. most of the time
sports, or recreation. d. all of the time
126 LEBLANC ET AL.
11. Physical feelings of worry like: head- b. occasionally think about it

aches, butterflies, nausea, tingling, rest- c. seldom think about it
lessness, diarrhea, shakes or tremors. d. never think about it
a. hardly ever 13. Thoughts, plans, or actions about suicide
b. much of the time or self-harm.
c. most of the time a. no thoughts or plans or actions
d. all of the time b. occasional thoughts, no plans or actions
12. Interest in sex, thoughts about sex, sex- c. frequent thoughts, no plans or actions
ual arousal (compared to before you d. plans
were ill), sexual fantasies. e. actions
a. as usual or think about it more
This article has been cited by:
1. Ming-Shun Chung, Hsien-Jane Chiu, Wen-Jung Sun, Chieh-Nan Lin, Chien-Cheng Kuo, Wei-Che Huang,
Ying-Sheue Chen, Hui-Ping Cheng, Pesus Chou. 2011. A rapid screening test for depression in junior high
school children. Journal of the Chinese Medical Association . [CrossRef]
2. Teresa Carnevale. 2011. An Integrative Review of Adolescent Depression Screening Instruments:
Applicability for Use by School Nurses. Journal of Child and Adolescent Psychiatric Nursing 24:1, 51-57.
[CrossRef]
3. Pim Cuijpers, Petra Boluijt, Annemieke Straten. 2008. Screening of depression in adolescents through the
Internet. European Child & Adolescent Psychiatry 17:1, 32-38. [CrossRef]
4. Nancy DeRosa, M. Cynthia Logsdon. 2006. A Comparison of Screening Instruments for Depression in
Postpartum Adolescents. Journal of Child and Adolescent Psychiatric Nursing 19:1, 13-20. [CrossRef]
5. Graham J Emslie, Taryn L Mayes, Maryse Ruberu. 2005. Continuation and Maintenance Therapy of Early-
Onset Major Depressive Disorder. Pediatric Drugs 7:4, 203-217. [CrossRef]
6. Sarah Brooks. 2004. The Kutcher Adolescent Depression Scale (KADS). Child and Adolescent
Psychopharmacology News 9:5, 4-6. [CrossRef]
7. HIMANSHU P. UPADHYAYA, KATHLEEN T. BRADY, WEI WANG. 2004. Bupropion SR in
Adolescents With Comorbid ADHD and Nicotine Dependence: A Pilot Study. Journal of the American
Academy of Child & Adolescent Psychiatry 43:2, 199-205. [CrossRef]
8. Lili Penkower, Mary Amanda Dew, Demetrius Ellis, Susan M. Sereika, Julius M. M. Kitutu, Ron Shapiro.
2003. Psychological Distress and Adherence to the Medical Regimen Among Adolescent Renal Transplant
Recipients. American Journal of Transplantation 3:11, 1418-1425. [CrossRef]
9. Sarah J. Brooks, Stanley P. Krulewicz, Stan Kutcher. 2003. The Kutcher Adolescent Depression Scale:
Assessment of Its Evaluative Properties over the Course of an 8-Week Pediatric Pharmacotherapy Trial.
Journal of Child and Adolescent Psychopharmacology 13:3, 337-349. [Abstract] [Full Text PDF] [Full Text
PDF with Links]
View publication stats

KADSvs - BDIscreeningfordepression LeBlanc2002JChildAdolescPsychopharm

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

KADSvs - BDIscreeningfordepression LeBlanc2002JChildAdolescPsychopharm

Uploaded by

Copyright:

Available Formats

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Screening for Adolescent Depression: Comparison of the Kutcher Adolescent

Article in Journal of Child and Adolescent Psychopharmacology · February 2002

John C Leblanc Stan Kutcher

SEE PROFILE SEE PROFILE

New Initiative Grant, UWI, Mona View project

school mental health literacy View project

The user has requested enhancement of the downloaded file.

Screening for Adolescent Depression:

John C. LeBlanc, M.D., M.Sc., FRCPC,1,2 Anthony Almudevar, Ph.D.,3

Self-report instruments commonly used to assess depression in adolescents have limited or

INTRODUCTION been shown to have weak reliability and/or

T HE K UTCHER ADOLESCENT D EPRESSION SCALE

1 Department of Psychiatry, Dalhousie University, Halifax, Nova Scotia, Canada.

METHODS for diagnosing an MDE consists of nine items

TABLE 1. PSYCHOMETRIC P ROPERTIES OF INDIVIDUAL ITEMS

Mean Item score Item-total Item Item

1 Low mood, depressed, can’t be bothered 0.93 0.87 0.55 0.75 5

ROC = receiver operating characteristic.

Item selection Criterion scores

Item 13 alone Thoughts/actions re: suicide/self-harm 0.81 —

Prevalence = 0.034a Prevalence = 0.10

6 92 71 0.10 1.00 0.26 0.99

KADS = Kutcher Adolescent Depression Scale; MDE = major depressive episode.

APPENDIX a. hardly ever

11. Physical feelings of worry like: head- b. occasionally think about it

View publication stats

You might also like