Professional Documents
Culture Documents
net/publication/312596477
CITATIONS READS
339 2,286
15 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Darrel A Regier on 29 April 2019.
Background: One of the aims of the World Health termine interrater reliability, 141 of the IPDEs (20%) were
Organization/Alcohol, Drug Abuse, and Mental Health Ad- independently rated by a silent observer. To determine
ministration joint program on psychiatric diagnosis and temporal stability, 243 patients (34%) were reexamined
classification is the development and standardization of after an average interval of 6 months.
diagnostic assessment instruments for use in clinical re-
search worldwide. The International Personality Disor- Results: The IPDE proved acceptable to clinicians and
der Examination (IPDE) is a semistructured clinical in- demonstrated an interrater reliability and temporal sta-
terview compatible with the International Classification of bility roughly similar to instruments used to diagnose the
Diseases, Tenth Revision, and the DMS-III-R classification psychoses, mood, anxiety, and substance use disorders.
systems. This is the first report of the results of a field trial
to investigate the feasibility of using the IPDE to assess Conclusion: It is possible to assess personality disor-
personality disorders worldwide. ders with reasonably good reliability in different na-
tions, languages, and cultures using a semistructured clini-
Methods: The IPDE was administered by 58 psychia- cal interview that experienced clinicians find relevant,
trists and clinical psychologists to 716 patients enrolled meaningful, and user-friendly.
in clinical facilities at 14 participating centers in 11 coun-
tries in North America, Europe, Africa, and Asia. To de- (Arch Gen Psychiatry. 1994;51:215-224)
IN
try, Cornell University Medical 1979, the World Health Orga¬ try (SCAN) ,4 a descendant of the ninth edi¬
College, New York, NY (Drs Lo- nization (WHO) and US Alcohol, tion ofthe Present State Examination (PSE) ,5
ranger and Jacobsberg); World Drug Abuse, and Mental Health is intended for use by clinicians, ie, those
Health Organization (Dr Sarto-
rius), and Institutions Universi- Administration (ADAMHA) joined capable of making independent psychiat¬
taires de Psychiatrie-Gen\l=e'\ve forces in mounting a large-scale re¬ ric diagnoses. Except for the inclusion of
(Dr Andreoli), Geneva, Switzer- view and assessment of the knowledge base antisocial disorder in the CIDI, neither in¬
land; Psychiatrische Universi-
t\l=a"\tsklinik,Vienna, Austria (Dr of psychiatric diagnosis and classification. terview provides coverage of the person¬
Berger); Nervenklinik der Univer- One consequence of this review was the es¬ ality disorders.
sit\l=a"\tMunchen, Munich, Germany tablishment of a long-term program of ac¬ This is the first report of the results of
(Dr Buchheim); National Institute
tivities aimed at developing a battery of stan¬ a field trial to investigate the feasibility of us¬
of Mental Health and Neurosci-
ences, Bangalore, India (Dr Chan- dardized diagnostic assessment instruments ing a semistructured clinical interview de¬
nabasavanna); Institute of Psychia- for use in clinical and epidemiologie research veloped within the WHO/ADAMHA program
try, London, England (Dr Coid); worldwide. ' This was intended to facilitate to diagnose personality disorders in differ¬
Universitetet I Oslo Psychiatrisk
Institutt, Oslo, Norway (Dr Dahl); comparisons of clinical and research find¬ ent languages and cultures. The principal ob¬
Rijksuniversiteit Te Leiden, Leiden, ings from different nations and cultures. jectives ofthe trial were to determine the cul¬
the Netherlands (Dr Diekstra);
Stonebridge Research Centre, Not- One instrument developed in this pro¬ tural acceptability, user friendliness, inter-
tingham, England (Dr Ferguson); gram, the Composite International Diag¬
Max-Planck-Institut f\l=u"\rPsychiat- nostic Interview (CIDI) ,2 a modification of
rie, Munich (Dr Mombour); Cen-
tre Hospitalier de Luxembourg, the Diagnostic Interview Schedule (DIS) ,3
Luxembourg (Dr Pull); Keio Uni- is highly structured, computer scored, and
versity School of Medicine, Tokyo, designed for use by lay interviewers in epi¬
Japan (Dr Ono); and National
Institute of Mental Health, Rock- demiologie studies. Another, the Schedules
ville, Md (Dr Regier). for Clinical Assessment in Neuropsychia-
During the development of the IPDE from 1985 to 1987, Without knowledge of the diagnostic conclusions of the
the principal investigators at each center had participated screening clinicians, the psychiatrists and clinical psycholo¬
in several meetings and workshops concerning the instru¬ gists who administered the IPDE also conducted a general
ment. After they had acquired proficiency in its adminis¬ clinical evaluation of the patients and provided the ICD-10
tration and scoring, they assumed responsibility for the train¬ diagnoses of any disorders other than personality disor¬
ing and supervision of the psychiatrists and clinical ders. At several centers, this evaluation included a semi-
psychologists who served as examiners at their centers. In structured interview: DIS5 at Rijksuniversiteit Te Leiden,
early 1988, the coordinator of the project (A.W.L.) visited Leiden, the Netherlands; Schedule for Affective Disorders
each site and conducted workshops that used videotaped and Schizophrenia —Lifetime version10 at Institute of Psy¬
demonstrations and lengthy discussions of principles of ad¬ chiatry, London, England; and Structured Interview for DSM-
ministration and problems in scoring. The examiners were ÍII-R (SCID)11 at Cornell Medical Center, New York, NY;
required to administer a minimum of 10 interviews before Stonebridge Research Centre, Nottingham, England; Uni-
they were considered sufficiently experienced to evaluate versitetet I Oslo Psychiatrisk Institutt, Oslo, Norway; and
study subjects. To monitor scoring practices during the study, Keio University School of Medicine, Tokyo, Japan.
videotaped interviews conducted in English were circu¬
lated for rating by the participating clinicians. COURSE OF THE TRIAL
PATIENT SELECTION The first patients entered the study in April 1988 and the
last patient was examined in December 1990. After the
The investigators were asked to select approximately 50 pa¬ project coordinator verified all of the forms returned by the
tients, ranging between ages 21 and 55 years, from admis¬ centers, the information was entered in a computer at the
sions to their facilities. In the judgment of a senior screen¬ ADAMHA data processing facility in Rockville, Md. In Au¬
ing clinician, the patients should have had either a person¬ gust 1991, the investigators met at WHO headquarters in
ality disorder according to ¡CD-10 and/or DSM-III-R criteria Geneva, Switzerland, to analyze and discuss the results. They
or another common mental disorder important in the dif¬ also reviewed the replies to a questionnaire completed by
ferential diagnosis of a personality disorder. To ensure an all the participants about the strengths and limitations of
adequate test of the discriminating properties of the IPDE, the IPDE, including its user friendliness, cultural rel¬
the screening clinicians were asked to provide a sample that evance, and apparent clinical validity.
rater reliability, and temporal stability of the instrument. THE INTERNATIONAL PERSONALITY DISORDER
When the study was planned, it was considered inappro¬ EXAMINATION
priate to undertake the development of a highly structured
lay-administered interview such as the CIDI. The state of The International Personality Disorder Examination (IPDE)
the art ofstandardized interviews for personality disorders is an outgrowth of the Personality Disorder Examination
was in its infancy and considerably behind that of the clini¬ (PDE)6 that was modified for international use and com¬
cal assessment of other mental disorders. The complexity patibility with the International Classification of Diseases,
of the diagnostic criteria and the level ofinference and clini¬ Tenth Revision (1CD-10)7 and the DSM-III-R8 classification
cal sophistication that is required to make personality dis¬ systems. A detailed history of its development, rationale,
order diagnoses, especially in those with other mental dis¬ structure, and translations is available elsewhere.9 In brief,
orders, were additional reasons for favoring an instrument the IPDE surveys the behavior and life experiences relevant
for use by clinicians rather than trained lay interviewers. to the more than 150 criteria used in making personality
disorder diagnoses according to DSM-Í/I-R and ICD-10. Al¬ It also provided a dimensional score for all subjects on each
though it contains a prescribed set of carefully selected and disorder, regardless of whether they receive the diagnosis
researched questions, the examiner is expected to conduct or meet any of the criteria for the disorder. It is calculated
an adequate clinical examination of the subject to provide by summing the individual scores for all of the criteria on
clarification and confirmatory examples, anecdotes, and de¬ the disorder.
tails. A manual defines the scope and meaning ofeach criter¬
ion and provides guidelines and anchor points for scoring. RESULTS
The IPDE requires that a behavior or trait be present
at least 5 years before it should be considered a manifes¬ FINAL PATIENT SAMPLE
tation of personality. This conservative and somewhat ar¬
bitrary convention is intended to reflect the relatively en¬ At the conclusion of the study, 716 patients had been ex¬
during nature of personality traits and to avoid confusing amined, 243 reexamined, and 141 of the interviews rated
them with transient or situational phenomena and episodic by an observer. The average interval between the initial
abnormalities of mental state and behavior. The IPDE also and second IPDE was 6 months, with approximately 85%
requires that at least one criterion of a disorder be fulfilled of the repetitions occurring between 2 months and 1 year.
before age 25 years, before that particular disorder can be Table 1 provides details of the sample sizes at each cen¬
diagnosed. This is in keeping with clinical tradition and DSM- ter. Tables containing the demographic characteristics of
III-R and ICD-10, which date the first indications of a per¬ the sample and the clinical ÍCD-J0 diagnoses, exclusive
sonality disorder to childhood, adolescence, or the early adult of personality disorders, are available on request. Table 2
years. Age 25 years rather than an earlier age was selected provides the IPDE personality disorder diagnoses in the
to allow more informed and accurate judgments about many DSM-III-R and ÍCD-I0 systems. They are based on the
ofthe adult-oriented personality disorder criteria in the DSM examiner's scoring of the initial interview.
and ICD systems.
Scoring ofthe 157 items was as follows: absent or within IPDE INTERRATER RELIABILITY AND
normal range, 0; present to an accentuated degree, 1; TEMPORAL STABILITY
pathological/meets criterion, 2. The final algorithmic inte¬
gration of the individually scored items was done clerically Intraclass correlation coefficients12 were used to determine
or by computer. It included the following information for the examiner-observer agreement in scoring each of the 157
each disorder: the presence or absence of each criterion, items on the IPDE and their stability from the initial to the
whether the scoring of a criterion was based solely on the second examinations. To provide a more accurate estimate
interview or included information from informants, the num¬ of stability per se, with adjustment for the less-than-perfect
ber of criteria met for each disorder, and the diagnostic de¬ interrater reliability of the instrument, correlations with a
cision about each disorder (definite, probable, or negative). correction for attenuation13 are also included with the sta-
COMMENT
INTERRATER AGREEMENT
*IPDE Indicates International Personality Disorder Examination; ICD-10,
International Classification of Diseases, Tenth Revision. Personality
disorder diagnoses include patients with more than one type of personality The validity of any diagnostic instrument presupposes a rea¬
disorder. Of the 366 patients with a DSM-III-R personality disorder, 111 sonable degree of interrater reliability in its application. The
(30.3%) had more than one type of disorder, including 55 (15.0%) with
two disorders, 32 (8.7%) with three disorders, and 24 (6.6%) with more IPDE was administered in different languages, in a wide va¬
than three disorders. Of the 283 patients with an ICD-10 personality
disorder, 96 (33.9%) had more than one type of disorder, including 57
riety of national and cultural settings, and by a large num¬
(20.1%) with two disorders, 27 (9.5%) with three disorders, and 12 ber of clinicians. This was an unusually exacting test of re¬
(4.2%) with more than three disorders.
\Did not fulfill diagnostic criteria for any specific personality disorder
liability to which no other interview for personality disor¬
but met 10 or more of the 110 DSM-III-R personality disorder criteria.
ders has been subjected. The results, nevertheless, compare
%Did not fulfill diagnostic criteria for any specific personality disorder quite favorably with published reports on semistructured
but met 10 or more of the 56 ICD-10 personality disorder criteria. interviews that are used to diagnose the psychoses, mood,
anxiety, and substance use disorders. Such comparisons,
bility coefficients. The same statistics were also used to mea¬ however, should be viewed as rough approximations. There
sure the examiner-observer agreement regarding the dimen¬ are differences in the heterogeneity of the patient samples
sional scores and the number of criteria met on each dis¬ and the base rates of the individual disorders and variations
order and their stability from the initial to the second in the methods used to determine reliability.
examinations. These correlations are presented in Table 3. With these caveats in mind, we compared the results
We used 14 to measure the interrater agreement and ofthe present study with those of the SCID Axis I field trial.17
temporal stability of the personality disorder diagnoses. Be¬ That study involved 390 patients at four locations in the United
cause of its known instability when based on samples with States and one in Germany. The median values for those
an infrequent number of cases of a disorder, the recommen¬ individual disorders with abase rate of at least 5% were 0.64
dation was followed that itbe calculated only when the preva¬ for current diagnoses and 0.68 for lifetime diagnoses. In the
lence of a disorder is at least 5%.15 To provide more oppor¬ present study, the median values for the individual per¬
tunities for the calculation of , it was also determined by sonality disorders (diagnosis definite) with base rates
of at least 5% were 0.70 in DSM-III-R and 0.72 in ICD-10. The SCID study involved a test-retest design in which
The overall weighted values in the SCID study were 0.61 the interview was given by different examiners at least 1 day
for current diagnoses and 0.68 for lifetime diagnoses. In the but no more than 2 weeks apart. This is likely to result in
present study, the overall weighted values for the defi¬ somewhat lower reliabilities than when an examiner and
nite diagnoses ofthe specific personality disorders were 0.57 observer rate the same interview, as in the present study.
in DSM-III-R and 0.65 in ICD-10. The median values for The pursuit of an additional obj ective in the IPDE study (the
an IPDE diagnosis that was definite or probable were 0.73 determination of temporal stability over an extended pe¬
for DSM-III-R and 0.77 for ICD-10. The corresponding riod) weighed against the use of a similar test-retest method
IPDE weighted values for a diagnosis that was definite because of the practical and methodological problems as¬
or probable were 0.65 for DSM-III-R and 0.72 for ICD-10. sociated with too frequent repetitions of a lengthy interview.
The SCID study did not identify probable cases. The IPDE also fared well compared with other criteria-
which is no longer extant. Pfohl et al26 reported on the sta¬ terchangeable is reminiscent of the Heisenberg principle
bility of the Structured Interview for DSM-III Personality in physics : one cannot study the phenomenon without some¬
Disorders in 36 depressed inpatients after 6 to 12 months. how tampering with it in the process.
The values, which ranged from 0.16 to 0.84, are prob¬ Another source of instability is the possibility that pa¬
lematic because of the small sample sizes of the individual tients in a dysphoric state may have a selective recall or
disorders. Similar findings with the Structured Interview distorted perception of certain personality traits. They may
for DSM-III Personality Disorders were obtained in a study also confuse them with the symptoms of another (Axis I)
in the Netherlands.27 mental disorder. An earlier version of the PDE proved re¬
The belief that interviewers are perfectly interchange¬ sistant to changes in symptoms of anxiety and depression
able would seem naive in view of the potential influence during the course of treatment.25 In that study, the ma¬
that the age, sex, and personality of an interviewer might jority of patients had mood or anxiety disorders of mild
have on the information volunteered by a patient. The as¬ to moderate severity. The finding has since been replicated
sumption made by those who use semistructured interviews with the present version of the PDE (A.W.L. and Mark F.
is that such factors ordinarily are not a major source of er¬ Lenzenweger, PhD, unpublished data, February 1992).
ror. In planning the present study, consideration was given There is, however, a contradictory report based on a group
to a design in which halfofthe interviews would be repeated of depressed patients treated with cognitive therapy.28 That
by the same examiner and half by a different examiner. This study used an earlier version of the PDE and the authors
would have helped to determine how much the interview¬ failed to specify the professional status of the interview¬
ers themselves, in addition to their ratings, contributed to ers, a potentially relevant variable. It may require an ex¬
the instability of the measures. However, concerns about perienced psychiatrist or clinical psychologist to distin¬
scheduling influenced the decision to use the same inter¬ guish personality traits from transient pathological men¬
viewer whenever possible. As a result, 93% of the interviews tal states and the symptoms of other disorders. The reliability
were given by the same examiner on both occasions. and validity of the IPDE, like that of any semistructured
A second interview, whether conducted by the same interview, cannot be judged apart from the qualifications
or a different person, may be contaminated by the experi¬ of the interviewers. At times, semistructured interviews
ence of the first interview. Repetitions can lead to boredom have assumed a mystique of their own, and that caveat is
and decreased motivation. Patients may believe that the in¬ all too often ignored. The IPDE is intended for use by those
terviews are no longer for their benefit but for that of the who have the clinical sophistication and training required
examiner. Repetitions can also produce fantasies that the to make psychiatric diagnoses independently, ie, without
interviewer is dissatisfied with the previous interview or is a semistructured interview. This is not to imply that the
checking on the consistency of the responses. Patients may IPDE or any other personality disorder interview is nec¬
also refrain from providing as many positive replies as pre¬ essarily impervious to the influence of abnormal mental
viously owing to a heightened awareness that these invite states, particularly those characterized by severe symptoms.
further probing and prolong the interview. Elsewhere,25 one We are encouraged, however, by the evidence that some
of us (A.W.L.) has argued that the problem of attempting clinical states do not appear to invalidate the assessment
to measure the precise degree to which interviewers are in- of personality. In any event, no attempt was made to de-
ported the overall rates of the individual disorders for all disorder that does not meet the requirements for any of
of the centers combined and believe that it would be po¬ the specific types. There is no method of identifying these
tentially misleading to report the rates for each center. It patients without invoking some arbitrary standard. The
is noteworthy, however, that the overwhelming majority IPDE assigns a residual diagnosis to anyone who does not
of personality disorder subtypes were diagnosed in most meet the requirements for a specific disorder, but never¬
of the 11 nations represented in the study. theless accumulates 10 or more criteria from the various
It is also of interest that the two most frequently di¬ disorders. There are more opportunities to obtain the di¬
agnosed types in the sample as a whole are disorders that agnosis in DSM-III-R than in ICD-10 because the former
were not included in the 1CD-9, or the DSM-ÍI. They are has 110 criteria and the latter only 56. It is not surprising,
borderline (DSM-III-R) or emotionally unstable, border¬ then, that approximately twice as many patients received
line type (ICD-10), and avoidant (DSM-III-R) or anxious a nonspecific diagnosis of personality disorder in DSM-
(ICD-10). At least one case of these two disorders oc¬ III-R as in ICD-10
(12.8% vs 6.8%).
curred at every center with the exception of Bangalore, This, of course, does not address the question of
India (no avoidant diagnosis) and Nairobi, Kenya (no bor¬ whether the two classification systems actually identi¬
derline diagnosis). fied the same patients as having a particular disorder. That
Two controversial disorders, sadistic and self-defeating, can be determined by the statistic, particularly for those
are not included in ICD-10 and have been relegated to the disorders with prevalence of at least 5%. There are only
a
appendix of DSM-III-R. Both were among the three least two of them; the ks are 0.66 (borderline) and 0.52 (anx¬
frequent diagnoses in the entire sample. Interestingly, the ious/avoidant), evidence of moderate but far from per¬
third one, narcissistic, was not included in DSM-II and is fect agreement. With the distribution of cases in the sample
still not recognized in ICD-10. It occurred in only 1.3% of of 716 patients, these ks were associated with 92% agree¬
study patients. This contrasts with the findings regarding ment regarding the diagnosis of borderline and 89% for
passive aggressive, which is also not included in ICD-10 anxious/avoidant. The values for the remaining disor¬
but was diagnosed in 5% of study patients and appeared ders should be viewed as relatively unstable because of
in all centers but one. the base rate problem. They range from 0.32 (dissocial/
DownloadedViewFrom:
publicationhttp://archpsyc.jamanetwork.com/
stats by a University of Toronto User on 03/11/2014