Professional Documents
Culture Documents
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
562 ARCHIVES OF GENERAL PSYCHIATRY
for the measurement of depression; its scales B. Administration of the Inventory.—The in¬
are based on the old psychiatric nomen¬ ventory was administered by a trained interviewer
(a clinical psychologist or a sociologist) who read
clature; and factor analytic studies reveal aloud each statement in the category and asked
that the Depression Scale contains a num¬ the patient to select the statement that seemed
ber of heterogeneous factors only one of to fit him the best at the present time. In order
which is consistent with the clinical concept that the instrument reflect the current status of
of depression.3 Jasper's Depression-Elation the patient, the items were presented in such a
9 way as to elicit the patient's attitude at the time
test was derived from a study of normal of the interview. The patient also had a copy
college students, and his report does not of the inventory so that he could read each
refer to any studies with a psychiatric statement to himself as the interviewer read each
statement aloud. On the basis of the patient's
population.
response, the interviewer circled the number ad¬
Method jacent to the appropriate statement.
In addition to administering the Depression In¬
A. Construction of the Inventory.—The items in
this inventory were primarily clinically derived. ventory, the interviewer collected relevant back¬
In the course of the psychoanalytic psychotherapy
ground data, administered a short intelligence test,
and elicited dreams and other ideational material
of depressed patients, the senior author made sys¬
relevant to the psychoanalytic hypotheses being
tematic observations and records of the character¬
istic attitudes and symptoms of depressed patients.
investigated. These additional procedures were all
administered after the Depression Inventory.
He selected a group of these attitudes and symptoms C. Description of Patient Population.—The pa¬
that appeared to be specific for these depressed
tients were drawn from the routine admissions to
patients and which were consistent with the de¬ the psychiatric outpatient department of a univer¬
scriptions of depression contained in the psychi¬ sity hospital (Hospital of the University of
atric literature.10 On the basis of this procedure,
he constructed an inventory composed of 21 cate¬
Pennsylvania) and to the psychiatric outpatient
department and psychiatric inpatient service of a
gories of symptoms and attitudes. Each category metropolitan hospital (Philadelphia General Hos¬
describes a specific behavioral manifestation of
pital). The outpatients were seen either on the
depression and consists of a graded series of 4 same day of their first visit to the outpatient
to 5 self-evaluative statements. The statements
department or a specific appointment was made
are ranked to reflect the range of severity of the for them to come back a few days later for the
symptom from neutral to maximal severity. Nu¬ complete work-up. Hospitalized patients were all
merical values from 0-3 are assigned each state¬ seen the day following their admission to the
ment to indicate the degree of severity. In many hospital, i.e., during their first full day in the
categories, 2 alternative statements are presented hospital. The demographic features of the popu¬
at a given level and are assigned the same weight ; lation are listed in Table 1. It will be noted
these equivalent statements are labeled a and b that there are 2 patient samples, one the original
(for example, 2a, 2b) to indicate that they are group (226 patients), the other the replication
at the same level. The items were chosen on the group (183 patients). The original sample (Study
basis of their relationship to the overt behavioral I) was taken over a 7-month period starting in
manifestations of depression and do not reflect any June, 1959, the second (Study II) over a 5-month
theory regarding the etiology or the underlying period. The completion of the first study coin¬
cided with the introduction of some new pro¬
psychological processes in depression. jective tests not relevant to this report.
The symptom-attitude categories are as follows :
The most salient aspects of this table are the
a. Mood k. Irritability predominance of white patients over Negro pa¬
b. Pessimism 1. Social Withdrawal tients, the age concentration between 15 and 44,
c. Sense of Failure m. Indecisiveness and the high frequency of patients in the lower
d. Lack of Satis- n. Body Image socioeconomic groups (IV and V). The social
faction o. Work Inhibition position was derived from Hollingshead's Two
e. Guilty Feeling p. Sleep Disturbance Factor Index of Social Position,7 which uses the
f. Sense of Punish- q. Fatigability factors of education and occupation in the class
ment r. Loss ofAppetite level determination.
g. Self-Hate s. Weight Loss The distribution of diagnoses was similar for
h. Self Accusations t. Somatic Pre- Studies I and II. Patients with organic brain
i. Self Punitive occupation damage and mental deficiency were automatically
Wishes u. Loss of Libido excluded from the study. The proportions among
j. Crying Spells the major diagnostic categories were psychotic
54- Vol. 4, June, 1961
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
DEPTH OF DEPRESSION 563
100.0
D. External Criterion.—The patient was seen
M H »H
either directly before or after the administration
of the Depression Inventory by an experienced
+ psychiatrist who interviewed him and rated him
on a 4-point scale for the Depth of Depression.
The psychiatrist also rendered a psychiatric diag¬
nosis and filled out a comprehensive form de¬
signed for the study. In approximately half the
cases, the psychiatrist saw the patient first ; in
the remainder, the Depression Inventory was ad¬
ministered first.
Four experienced psychiatrists participated in
the diagnostic study.* They may be characterized
as a group as follows :
approximately 12 years
experience in psychiatry, holding responsible
teaching and training positions, certified by the
American Board of Psychiatry, interested in re¬
search, and analytically oriented.
The psychiatrists had several preliminary meet¬
•s
ings during which they reached a consensus re¬
garding the criteria for each of the nosological
entities and focused special attention on the various
types of depression. In every case, the Diagnostic
and Statistical Manual of Mental Disorders of the
American Psychiatric Association1 was used, but
it was found that considerable amplification of the
diagnostic descriptions was necessary. After they
had reached complete agreement on the criteria
to be used in making their clinical
judgments, the
psychiatrists composed a detailed instruction man¬
ual to serve as a guide in their diagnostic eval¬
uations.
The psychiatrists then participated in a series
of interviews, during which two of them jointly
interviewed a patient while the other two ob¬
served through a one-way screen. This served as
-i G
* the initial group of 226 patients, some of
the diagnostic evaluations were made by a "non-
standard diagnostician," that is, a psychiatrist other
than the 4 regular psychiatrists. In all, 40 patients
were seen by these psychiatrists.
Beck et al. 55
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
564 ARCHIVES OF GENERAL PSYCHIATRY
a practical testing ground for the application of To establish the degree of agreement, the
the agreed-on instructions and principles and al¬ psychiatrists interviewed 100 patients and made in¬
lowed further discussion of interview techniques, dependent judgments of the diagnosis and the Depth
the logic of diagnosis, and the pinpointing of of Depression. All 4 diagnosticians participated
specific diagnoses. in the double assessment and were randomly paired
Since the main focus of the research was to be with one another so that each of the patients was
on depression, the diagnosticians also established seen by 2 diagnosticians. The procedure was to
specific indices to be used in making a clinical have one psychiatrist interview the patient and
estimation of the Depth of Depression. These then after a resting period of a few minutes, the
indices represented the pooled experience of the other psychiatrist would interview the patient.
4 clinicians and were arrived at independently After the second interview was conplete, the
of the Depression Inventory. For each of the clinicians generally would meet and discuss the
specified signs and symptoms the psychiatrists cases seen concurrently to ascertain the reasons
made a rating on a 4-point scale of none, mild, for disagreement (if any).
moderate, and severe. The purpose of specifying
these indices was to facilitate uniformity among Results
the psychiatrists; however, in making the over-all
rating of the Depth of Depression, they made a A. Reliability of Psychiatrists' Ratings.—
global judgment and were not bound by the rat¬ The agreement among the psychiatrists re¬
ings in each index.f They also concentrated on garding the major diagnostic categories of
the intensity of depression at the time of the
interview; hence, the past history was not as im¬ psychotic disorder, psychoneurotic disorder,
portant as the mental status examination. and personality disorder was 73% in the
The indices of depression which were devised 100 cases that were seen by 2 psychiatrists.^
and used by the psychiatrists were as follows : This level of agreement, while higher than
I. Apperance II. Thought Content that reported in many investigations, was
Facies Reported Mood considered too low for the purposes of our
Gait Helplessness study.
Posture Pessimism The degree of agreement, however, in the
Crying Feelings of In-
Speech adequacy and rating of "Depth of Depression" was much
Volume Inferiority higher. Using the 4-point scale (none, mild,
Key Somatic pre- moderate, and severe) to designate the in¬
Speed occupation tensity of depression, the diagnosticians
Amount Conscious guilt showed the
Suicidal content following degree of agreement:
Complete agreement 56%
III. Vegetative Signs IV. Psychosocial One degree of disparity 41%
Sleep Performance Two degrees of disparity 2%
Appetite Indecisiveness Three degrees of disparity 1%
Constipation Loss of drive This indicates that there was agreement
Loss of interest within one degree on the 4-point scale in
Fatigability
The diagnosticians also rated
the patient on the
97% of the cases.
degree of agitation and overt anxiety and filled B. Reliability of Depression Inventory.—
out a check list to indicate the presence of other Two methods for evaluating the internal
specific psychiatric and psychosomatic symptoms
and disturbances in concentration, memory, recall,
consistency of the instrument were used.
First, the protocols of 200 consecutive cases
judgment, and reality testing. He also made a rat¬ were analyzed. The score for each of the
ing of the severity of the present illness on a 21 categories was compared with the total
4-point scale.
score on the Depression
t A number of problems arose in assessing the Inventory for each
individual. With the use of the Kruskal-
relative degree of depression of patients with con-
trasting clinical pictures. For example, would a Wallis Non-Parametric Analysis of Vari-
patient who is regressed and will not eat be rated \s=d \AA detailed description of the reliability studies
as more depressed than a patient who is not re- will be reported in a separate article.3 The types
gressed but has made a genuine suicidal attempt? of disagreement regarding the nosological cate-
Such problems involved complex clinical judgments gories and the reasons for disagreement are being
and will be the subject of a later report. systematically investigated in another study.
56 Vol. 4, June, 1961
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
DEPTH OF DEPRESSION 565
ance by Ranks,14 it was found that all of Depression was made by one of the
categories showed a significant relationship psychiatrists. The interval between the 2
to the total score for the inventory.§ Sig¬ tests varied from 2 to 6 weeks. It was found
nificance was beyond the 0.001 level for all that changes in the score on the inventory
categories except category S (Weight-loss tended to parallel changes in the clinical
category), which was significant at the 0.01 Depth of Depression, thus indicating a con¬
level. sistent relationship of the instrument to the
The second evaluation of internal con¬ patient's clinical state. (These findings are
sistency was the determination of the split- discussed more fully in the section on valida¬
half reliability. Ninety-seven cases in the tion studies.)
first sample were selected for this analysis. An indirect measure of inter-rater relia¬
The Pearson r between the odd and even bility was achieved as follows : Each of the
categories was computed and yielded a re¬ scores obtained by each of the 3 interviewers
liability coefficient of 0.86; with a Spear¬ was plotted against the clinical ratings. A
man-Brown correction, this coefficient rose very high degree of consistency among the
to 0.93.5 interviewers was observed for the mean
Certain traditional methods of assessing scores respectively obtained at each level of
the stability and consistency of inventories depression. Curves of the distribution of
and questionnaires, such as the test-retest the Depression Inventory scores plotted
method and the inter-rater reliability method, against the Depth of Depression were
were not appropriate for the appraisal of
notably similar, again indicating a high de¬
the Depression Inventory for the following gree of correspondence among those who
reasons: If the inventory were readmin- administered the inventory.
istered after a short period of time, the C. Validation of the Depression Inven¬
correlation between the 2 sets of scores could tory.—-The means and standard deviations
be spuriously inflated because of a memory for each of the
factor. If a long interval was provided,
Depth of Depression cate¬
the consistency would be lowered because
gories are presented in Table 2. It can be
seen from inspection that the differences
of the fluctuations in the intensity of de¬
among the means are as expected; that is,
pression that occur in psychiatric patients. with each increment in the magnitude of
The same factors precluded the successive
administration of the test by different in¬
depression, there is a progressively higher
mean score. The Kruskal-Wallis One-way
terviewers.
Two indirect methods of estimating the
Analysis of Variance by Ranks was used
to evaluate the statistical significance of these
stability of the instrument were available. differences; for both the original group
The first was a variation of the test-retest
method. The inventory was administered to (Study I) and the replication group
a group of 38 patients at two different
(Study II), the /»-value of these differ¬
ences is < 0.001.
times. At the time of each administration
of the test, a clinical estimate of the Depth
Since the Kruskal-Wallis test evaluates
the over-all association between the scores
\s=s\This procedure is designed to assess whether on the Depression Inventory and the
Depth
variation in response to a particular category is
associated with variation in total score on the
of Depression ratings, the Mann-Whitney
inventory. For each category, the distribution of U test14 was used to appraise the power of
total inventory scores for individuals selecting a the Depression Inventory to discriminate
particular alternative response was determined. between specific Depth of Depression cate¬
The Kruskal-Wallis test was then used to assess
whether the ranks of the distribution of total scores
gories. It was found that all differences
increased significantly as a function of the differ-
between adjacent categories (none, mild,
ences in severity of depression indicated by these moderate, and severe) in both studies were
alternative responses. significant at <0.0004 with the exception of
Beck et al. 57
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
566 ARCHIVES OF GENERAL PSYCHIATRY
Nw
*
Pearson biserial r.
Q ^ » ta
the differences between the moderate and
severe categories, which had a ¿»-value of
n
•S rt' =>' =' It will be noted that there are fewer false
·<
H negatives and false positives in Study II
than in Study I. This may be accounted for
M 00 o
by the fact that in Study II only the 4
"standard" psychiatrists were used and by
•a the fact that with increasing experience in
assessing the severity of depression, they
-"- a achieved greater precision in their judg¬
>> 1» ¿2
aw
SS
a
o
ments. As expected, the most clear-cut dis¬
O
mCQ
criminations occurred when extreme groups
(none vs. severe) were compared. In Study
I the cutting score discriminated between
58 Vol. 4, June, 1961
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
DEPTH OF DEPRESSION 567
•S ?
S s "
predicted by a change in the Depression
S
§•„0
œ
Inventory score; in 28 out of the 33 cases
Qa
oS
(85%), the change in the clinical Depth of
•te. Depression was correctly predicted.
Comment
The Depression Inventory was subjected
H
< Ss to a variety of tests to determine its re-
II
g CO
6
Table 5.—Relationship of Changes in Depth of
Depression Rating to Changes in Depression
Inventory Scores
2
CQ M "a
OD O Depth of Depression
Eh
g S&
o
X¡ Decreased Increased
255 Depression Decreased 26 2
Inventory
Score Increased 3 2
Beck et al. 59
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
568 ARCHIVES OF GENERAL PSYCHIATRY
liability and validity. The high correlation primary diagnosis of depression only 50%
coefficient on the split-half item analysis of the time.2 This problem was solved by
and the significant relationship between the having the diagnosticians make judgments
individual category scores and the total of the intensity of depression. When this
scores indicate the instrument is highly re¬ was done, a high degree of consistency was
liable. The highly significant relationship found among the psychiatrists' ratings.
between the scores on the inventory and While this classification disregarded the
the clinical ratings of Depth of Depression primary diagnosis, it did employ the same
and the power to reflect clinical changes in diagnostic signs and symptoms that are gen¬
the Depth of Depression attest to the validity erally considered characteristic of primary
of this instrument. depression. The change from the usual diag¬
When the question arises of assessing nostic procedure was to treat depression as a
some diagnostically relevant behaviors as, personality dimension and not simply as a
for example, are presented by states of de¬ discrete nosological entity. It was found,
pression, the clinician is quite naturally moreover, that this particular cluster of
disposed to rely upon clinical observation symptoms occurred in association with al¬
and tends to mistrust personality inventories. most every other nosological category. In
This objection to so-called "objective" in¬ fact, in only about 26% of the cases was
struments is formally expressed by Horn,8 depression found to be completely absent.
who, in commenting on the "relative ste¬ While the psychiatrists showed a close
rility" of personality inventories in predict¬ agreement on the estimate of the Depth of
ing behavior, challenges the assumption that Depression, one can still raise questions as
the items in an inventory convey the same to whether they were actually assessing de¬
or similar meaning to everyone who takes pression, or whether it might have been some
the test. He argues that "a personality self- other personality variable. While there is
rating questionnaire is in the nature of a no reason to assume that clinical evaluation
projective test: each item serves as an am¬ is the ultimate criterion, as long as one is
biguous stimulus whose interpretation is dealing with a clinical phenomenon we will
affected by the subject's needs, wishes, fears, have to rely on expert judgment as our
etc." This approach, in his opinion, re¬ criterion until other measures are developed.
moves from consideration the efficacy of a The ability of the inventory to approxi¬
self-rating inventory as an accurate self- mate clinical judgments of intensity of de¬
evaluation. However, the adequacy of any pression offers a number of advantages in
test as an accurate index of what it is sup¬ its use for research purposes. First, it meets
posed to measure is essentially an empirical the problem of the variability of clinical
question and cannot be resolved by fiat. judgment of nosological entities and provides
Thus, in the case of the Depression In¬ a standardized, consistent measure that is
ventory, it was possible to obtain self- not sensitive to the theoretical orientation or
evaluations from the patient that were idiosyncrasies of the individual who ad¬
consistent with the total behavior of the pa¬ ministers it. Second, since the inventory can
tient as observed by the clinician. be administered by an interviewer who is
A formidable problem in evaluating the easily trained in its use, it is far more
validity of an inventory centers around the economical than a clinical psychiatric in¬
adequacy of the external criterion. In view terview. Third, since the inventory provides
of the well known variability of psychiatric a numerical score, it facilitates comparison
diagnoses, it is necessary to have some other with other quantitative data. Finally, since
consistent standard against which the in¬ the inventory reflects changes in the Depth
ventory score can be judged. In our study, of Depression over time, it provides an ob¬
for example, there was concurrence on the jective measure for judging improvement
60 Vol. 4, June, 1961
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
DEPTH OF DEPRESSION 569
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
570 ARCHIVES OF GENERAL PSYCHIATRY
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018
DEPTH OF DEPRESSION 571
M (Indecisiveness) Q (Fatigability)
0 I make decisions about as well as ever 0 I don't get any more tired than usual
1 I am less sure of myself now and try 1 I get tired more easily than I used to
to put off making decisions 2 I get tired from doing anything
2 I can't make decisions any more with¬ 3 I get too tired to do anything
out help R (Loss of Appetite)
3 I can't make any decisions at all any 0 My appetite is no worse than usual
more 1 My appetite is not as good as it used
(Body Image) to be
0 I don't feel I look any worse than I 2 My appetite is much worse now
used to 3 I have no appetite at all any more
1 I am worried that I am looking old or S (Weight Loss)
unattractive 0 I haven't lost much weight, if any,
2 I feel that there are permanent changes lately
in my appearance and they make me 1 I have lost more than 5 pounds
look unattractive 2 I have lost more than 10 pounds
3 I feel that I am ugly or repulsive 3 I have lost more than 15 pounds
looking (Somatic Preoccupation)
0 I am no more concerned about my
O (Work Inhibition)
0 I can work about as well as before health than usual
la It takes extra effort to get started at 1 I am concerned about aches and pains
or upset stomach or constipation or
doing something other unpleasant feelings in my body
lb I don't work as well as I used to
2 I am so concerned with how I feel or
2 I have to push myself very hard to do
what I feel that it's hard to think of
anything much else
3 I can't do any work at all
3 I am completely absorbed in what I
(Sleep Disturbance) feel
0 I can sleep as well as usual
U (Loss of
1 I wake up more tired in the morning
Libido)
0 I have not noticed any recent change
than I used to in my interest in sex
2 I wake up 1-2 hours earlier than usual 1 I am less interested in sex than I used
and find it hard to get back to sleep to be
3 I wake up early every day and can't 2 I am much less interested in sex now
get more than 5 hours sleep 3 I have lost interest in sex completely
Beck et al. 63
Downloaded From: by a Tufts Univ. Hirsh Health Sciences Library User on 07/01/2018