Professional Documents
Culture Documents
In this paper we report the clinical utility of the diagnostic guidelines for ICD-11 mental, behavioural and neurodevelopmental disorders as as-
sessed by 339 clinicians in 1,806 patients in 28 mental health settings in 13 countries. Clinician raters applied the guidelines for schizophrenia and
other primary psychotic disorders, mood disorders (depressive and bipolar disorders), anxiety and fear-related disorders, and disorders specifical-
ly associated with stress. Clinician ratings of the clinical utility of the proposed ICD-11 diagnostic guidelines were very positive overall. The guide-
lines were perceived as easy to use, corresponding accurately to patients’ presentations (i.e., goodness of fit), clear and understandable, providing
an appropriate level of detail, taking about the same or less time than clinicians’ usual practice, and providing useful guidance about distinguish-
ing disorder from normality and from other disorders. Clinicians evaluated the guidelines as less useful for treatment selection and assessing prog-
nosis than for communicating with other health professionals, though the former ratings were still positive overall. Field studies that assess per-
ceived clinical utility of the proposed ICD-11 diagnostic guidelines among their intended users have very important implications. Classification is
the interface between health encounters and health information; if clinicians do not find that a new diagnostic system provides clinically useful in-
formation, they are unlikely to apply it consistently and faithfully. This would have a major impact on the validity of aggregated health encounter
data used for health policy and decision making. Overall, the results of this study provide considerable reason to be optimistic about the perceived
clinical utility of the ICD-11 among global clinicians.
Key words: International Classification of Diseases, ICD-11, diagnosis, mental disorders, clinical utility, ease of use, goodness of fit, treatment
selection, assessing prognosis
The World Health Organization (WHO) has released the fication construct, category, or system as depending on: a) its
11th revision of the International Classification of Diseases value in communicating (e.g., among practitioners, patients,
and Related Health Problems to its member states to prepare families, administrators); b) its implementation characteristics
for implementation1. The new classification will be presented in clinical practice, including its goodness of fit (i.e., accuracy
for approval by the World Health Assembly, the WHO’s gov- of description), its ease of use, and the time required to use it
erning body, in May 2019. (i.e., feasibility); and c) its usefulness in selecting interventions
As we have previously described2-5, an important focus in and in making clinical management decisions2. This definition
the development of the ICD-11 chapter on Mental, Behaviour- is based in part on those proposed by M. First and colleagues6,7.
al and Neurodevelopmental Disorders by the WHO Depart- Similar concepts had also been included in the ICD-10 field
ment of Mental Health and Substance Abuse has been to im- trails8,9, which asked clinicians to provide ratings of goodness
prove its clinical utility. of fit, confidence in their selected diagnosis, ease or difficulty
For the purpose of developing the ICD classification of men- of making a diagnosis, and adequacy of the diagnostic guide-
tal disorders, the WHO has defined the clinical utility of a classi- lines for cases evaluated as a part of the study.
Implementation characteristics
Which of the following statements best describes your evaluation of the level of detail and specificity of the essential features for the diagnosis or diagnoses that
you applied to this patient?
Insufficient: About the right amount: Too much:
148 (4.1%) 3,275 (90.8%) 185 (5.1%)
Please rate the extent to which the guidelines imposed assessment requirements that were difficult to apply to this patient (e.g., requirements that rely too much
on the patient’s memory of remote events or the patient’s ability to report temporal relationships between symptoms):
Very difficult: Somewhat difficult: Quite easy: Extremely easy: Quite + extremely easy:
35 (1.0%) 518 (14.4%) 2,752 (76.3%) 303 (8.4%) 3,055 (84.7%)
How would you describe the amount of time that it took you to apply all of the Essential Features to this patient for the diagnosis or diagnoses that you selected,
in comparison to your usual clinical practice?
Much longer: Somewhat longer: About the same: Shorter:
30 (0.8%) 472 (13.1%) 2,669 (74.0%) 437 (12.1%)
Specific sections
Please rate the extent to which the description of the boundary between disorder and normality contained in the guidelines was useful as applied to this patient:
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
78 (2.2%) 770 (21.3%) 2,304 (63.9%) 456 (12.6%) 2,760 (76.5%)
Please rate the extent to which the description of the boundary between this patient’s disorder and other disorders (section on differential diagnosis) was useful
as applied to this patient:
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
49 (1.4%) 762 (21.1%) 2,322 (64.4%) 475 (13.2%) 2,797 (77.5%)
Specific uses
How useful would the diagnostic guidelines be in helping you to select a treatment for this patient?
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
70 (1.9%) 887 (24.6%) 2,223 (61.6%) 428 (11.9%) 2,651 (73.5%)
How useful would the diagnostic guidelines be in helping you to assess this patient’s prognosis?
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
83 (2.3%) 1,055 (29.2%) 2,104 (58.3%) 366 (10.1%) 2,470 (68.5%)
How useful would the diagnostic guidelines be in helping you to communicate about this patient with a colleague or other health care professional?
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
49 (1.4%) 746 (20.7%) 2,216 (61.4%) 597 (16.5%) 2,813 (78.0%)
How useful would the diagnostic guidelines be in helping you to educate this patient and/or family about his or her condition?
Not at all: Somewhat: Quite: Extremely: Quite + extremely:
52 (1.4%) 884 (24.5%) 2,236 (62.0%) 436 (12.1%) 2,672 (74.1%)
For implementation characteristics, a large majority in- same amount of time or less time than their usual practice
dicated that the guidelines did not impose assessment re- (86.1%).
quirements that were difficult to apply (84.7%), provided Regarding specific sections, the manner in which the guide-
about the right level of detail (90.4%), and took about the lines provided guidance about differentiating disorders from
Age, years 39.9 ! 13.7 32.9 ! 9.6 39.8 ! 14.2 43.9 ! 15.6 36.5 !11.4 41.4 !11.2 47.0 !15.1 36.4 !12.5 38.1 !13.0 37.5 !12.2 36.3 !11.7 35.1 !11.0 52.0 ! 16.2 43.2 ! 12.6
(mean ! SD)
Gender, N (%)
Male 908 (50.3) 62 (62.0) 19 (35.8) 123 (60.6) 120 (57.4) 50 (50.0) 72 (42.9) 38 (36.9) 48 (31.4) 65 (49.2) 44 (42.3) 133 (63.9) 26 (37.1) 108 (53.2)
Female 897 (49.7) 38 (38.0) 33 (62.3) 80 (39.4) 89 (42.6) 50 (50.0) 96 (57.1) 65 (63.1) 105 (68.6) 67 (50.8) 60 (57.7) 75 (36.1) 44 (62.9) 95 (46.8)
Relationship status, N (%)
Single 992 (54.9) 81 (81.0) 22 (41.5) 110 (54.2) 66 (31.6) 71 (71.0) 77 (45.8) 68 (66.0) 91 (59.5) 68 (51.5) 65 (62.5) 167 (80.3) 28 (40.0) 78 (38.4)
Married/ 597 (33.1) 12 (12.0) 17 (32.1) 75 (36.9) 133 (63.6) 19 (19.0) 64 (38.1) 20 (19.4) 42 (27.5) 41 (31.1) 22 (21.2) 25 (12.0) 28 (40.0) 99 (48.8)
cohabitating
Separated/divorced 177 (9.8) 6 (6.0) 13 (24.5) 15 (7.4) 4 (1.9) 7 (7.0) 21 (12.5) 15 (14.6) 20 (13.1) 18 (13.6) 13 (12.5) 13 (6.3) 9 (12.9) 23 (11.3)
Widowed 40 (2.2) 1 (1.0) 1 (1.9) 3 (1.5) 6 (2.9) 3 (3.0) 6 (3.6) 0 0 5 (3.8) 4 (3.8) 3 (1.4) 5 (7.1) 3 (1.5)
Employment, N (%)
Full time 403 (22.3) 4 (4.0) 14 (26.4) 47 (23.2) 69 (33.0) 11 (11.0) 26 (15.5) 16 (15.5) 17 (11.1) 41 (31.1) 22 (21.2) 22 (10.6) 26 (37.1) 88 (43.3)
Part time 142 (7.9) 5 (5.0) 6 (11.3) 3 (1.5) 12 (5.7) 9 (9.0) 14 (8.3) 11 (10.7) 31 (20.3) 11 (8.3) 6 (5.8) 8 (3.8) 3 (4.3) 23 (11.3)
Unemployed 1009 (55.9) 76 (76.0) 30 (56.6) 80 (39.4) 110 (52.6) 74 (74.0) 109 (64.9) 66 (64.1) 79 (51.6) 64 (48.5) 53 (51.0) 167 (80.3) 20 (28.6) 81 (39.9)
Student 136 (7.5) 6 (6.0) 4 (7.5) 15 (7.4) 15 (7.2) 4 (4.0) 10 (6.0) 15 (14.6) 30 (19.6) 10 (7.6) 7 (6.7) 12 (5.8) 2 (2.9) 6 (3.0)
Retired 152 (8.4) 10 (10.0) 1 (1.9) 62 (30.5) 3 (1.4) 2 (2.0) 15 (8.9) 0 5 (3.3) 8 (6.1) 18 (17.3) 0 22 (31.4) 6 (3.0)
Treatment setting, N (%)
Outpatient 801 (44.4) 82 (82.0) 53 (100) 0 122 (58.4) 67 (67.0) 48 (28.6) 14 (13.6) 135 (88.2) 84 (63.6) 4 (3.8) 0 49 (70.0) 143 (70.4)
Inpatient 994 (55.0) 18 (18.0) 0 203 (100) 87 (41.6) 33 (33.0) 120 (71.4) 89 (86.4) 17 (11.1) 48 (36.4) 91 (87.5) 207 (99.5) 21 (30.0) 60 (29.6)
Other 11 (0.6) 0 0 0 0 0 0 0 1 (0.7) 0 9 (8.7) 1 (0.5) 0 0
Ease of use
Not at all Somewhat Quite Extremely Quite + extremely
Brazil (N=200) 4 (2.0%) 30 (15.0%) 125 (62.5%) 41 (20.5%) 166 (83.0%)
Canada (N=106) 0 19 (17.9%) 71 (67.0%) 16 (15.1%) 87 (82.1%)
PR China (N=405) 3 (0.7%) 62 (15.3%) 306 (75.6%) 34 (8.4%) 340 (84.0%)
India (N=418) 3 (0.7%) 46 (11.0%) 291 (69.6%) 78 (18.7%) 369 (88.3%)
Italy (N=200) 0 13 (6.5%) 125 (62.5%) 62 (31.0%) 187 (93.5%)
Japan (N=336) 13 (3.9%) 161 (47.9%) 147 (43.8%) 15 (4.5%) 162 (48.2%)
Lebanon (N=206) 1 (0.5%) 15 (7.3%) 147 (71.4%) 43 (20.9%) 190 (92.2%)
Mexico (N=306) 1 (0.3%) 25 (8.2%) 213 (69.6%) 67 (21.9%) 280 (91.5%)
Nigeria (N=264) 0 13 (4.9%) 185 (70.1%) 66 (25.0%) 251 (95.1%)
Russian Fed. (N=208) 0 25 (12.0%) 166 (79.8%) 17 (8.2%) 183 (88.0%)
Spain (N=140) 0 3 (2.1%) 133 (95.0%) 4 (28.6%) 137 (97.9%)
South Africa (N=413) 3 (0.7%) 25 (6.1%) 303 (73.4%) 82 (19.9%) 385 (93.2%)
Tunisia (N=406) 4 (1.0%) 119 (29.3%) 259 (63.8%) 24 (5.9%) 283 (69.7%)
Goodness of fit
Not at all Somewhat Quite Extremely Quite + extremely
Brazil (N=200) 6 (3.0%) 31 (15.5%) 120 (60.0%) 43 (21.5%) 163 (81.5%)
Canada (N=106) 1 (0.9%) 28 (26.4%) 63 (59.4%) 14 (13.2%) 77 (72.6%)
PR China (N=405) 4 (1.0%) 58 (14.3%) 293 (72.3%) 50 (12.3%) 343 (84.6%)
India (N=418) 3 (0.7%) 49 (11.7%) 293 (70.1%) 73 (17.5%) 366 (87.6%)
Italy (N=200) 0 11 (5.5%) 123 (61.5%) 66 (33.0%) 189 (94.5%)
Japan (N=336) 7 (2.1%) 168 (50.0%) 149 (44.3%) 12 (3.6%) 161 (47.9%)
Lebanon (N=206) 1 (0.5%) 20 (9.7%) 139 (67.5%) 46 (22.3%) 185 (89.8%)
Mexico (N=306) 2 (0.7%) 37 (12.1%) 209 (68.3%) 58 (19.0%) 267 (87.3%)
Nigeria (N=264) 0 22 (8.3%) 195 (73.9%) 47 (17.8%) 242 (91.7%)
Russian Fed. (N=208) 0 28 (13.5%) 162 (77.9%) 18 (8.7%) 180 (86.5%)
Spain (N=140) 0 7 (5.0%) 127 (90.7%) 6 (4.3%) 133 (95.0%)
South Africa (N=413) 2 (0.5%) 27 (6.5%) 360 (87.2%) 24 (5.8%) 384 (93.0%)
Tunisia (N=406) 2 (0.5%) 118 (29.1%) 264 (65.0%) 22 (5.4%) 286 (70.4%)
Brazil (N=100) .61 (.39 to .79) .45 (.14 to .73) .85 (.56 to 1.00) .43 (".03 to .78) -
Canada (N=53) - - - .65 (.30 to .90) .85 (.68 to .96)
PR China (N=203) .96 (.92 to .99) - .87 (.78 to .95) .32 (".02 to .66) .71 (.55 to .84)
India (N=209) .90 (.82 to .96) .59 (".01 to .91) .88 (.78 to .96) .76 (.61 to .87) .85 (.70 to .97)
Italy (N=100) .85 (.74 to .96) .79 (.59 to .93) .95 (.84 to 1.00) - -
Japan (N=168) .90 (.82 to .97) - .77 (.53 to .94) .77 (.61 to .90) .75 (.61 to .87)
Lebanon (N=103) .95 (.86 to 1.00) .82 (.64 to .95) .82 (.67 to .93) - .64 (.29 to .88)
Mexico (N=153) .87 (.76 to .96) .38 (".02 to .74) - .46 (.27 to .62) .64 (.52 to .76)
Nigeria (N=132) .93 (.86 to .98) .71 (.45 to .89) .83 (.68 to .94) .93 (.72 to 1.00) -
Russian Fed. (N=104) .54 (.33 to .73) .45 (.20 to .66) .52 (".02 to .88) - -
South Africa (N=208) .71 (.60 to .81) .68 (.55 to .80) .80 (.71 to .88) - .76 (.40 to 1.00)
Spain (N=70) .84 (.51 to 1.00) - .86 (.70 to .97) .58 (.24 to .84) .83 (.58 to 1.00)
Tunisia (N=203) .84 (.75 to .92) .59 (.30 to .80) .69 (.52 to .84) .63 (.41 to .80) .50 (.24 to .71)
Overall .87 (.84 to .89) .66 (.58 to .72) .84 (.81 to .87) .64 (.57 to .77) .74 (.69 to .79)
Cells without values are those with an insufficient number of observations to calculate kappa
Table 5 Clinical utility ratings for three core questions for five most common diagnoses
Ease of use
Not at all Somewhat Quite Extremely Quite + extremely
Schizophrenia 4 (0.3%) 127 (10.0%) 896 (70.9%) 237 (18.8%) 1133 (89.6%)
Schizoaffective disorder 0 24 (11.1%) 166 (76.5%) 27 (12.4%) 193 (88.9%)
Bipolar type I disorder 1 (0.2%) 64 (10.8%) 412 (69.8%) 113 (19.2%) 525 (89.0%)
Single episode depressive disorder 1 (0.4%) 56 (21.5%) 165 (63.5%) 38 (14.6%) 203 (78.1%)
Recurrent depressive disorder 4 (0.9%) 78 (18.4%) 290 (68.6%) 51 (12.1%) 341 (80.6%)
Goodness of fit
Not at all Somewhat Quite Extremely Quite + extremely
Schizophrenia 3 (0.2%) 141 (11.2%) 897 (71.0%) 223 (17.6%) 1120 (88.6%)
Schizoaffective disorder 0 33 (15.2%) 163 (75.1%) 21 (9.7%) 184 (84.8%)
Bipolar type I disorder 1 (0.2%) 65 (11.0%) 446 (75.6%) 78 (13.2%) 524 (88.8%)
Single episode depressive disorder 1 (0.4%) 58 (22.3%) 173 (66.5%) 29 (11.2%) 202 (77.7%)
Recurrent depressive disorder 3 (0.7%) 81 (19.1%) 284 (67.1%) 55 (13.0%) 339 (80.1%)
This analysis excluded diagnostic formulations in which more than one of the five index diagnoses included in the table had been assigned (N=853)