Professional Documents
Culture Documents
Objective: To determine the estimates of minimal, Results: A minimal CID was 2.3 to 2.7 points on the
moderate, and large clinically important differences UPDRS motor score and 4.1 to 4.5 on the UPDRS total
(CIDs) for the Unified Parkinson’s Disease Rating score. A moderate CID was 4.5 to 6.7 points on the UPDRS
Scale (UPDRS). motor score and 8.5 to 10.3 on the UPDRS total score. A
large CID was 10.7 to 10.8 points on the UPDRS motor
Design: Cross-sectional analysis of the CIDs for UPDRS score and 16.4 to 17.8 on the UPDRS total score.
total and motor scores was performed on patients with
Parkinson disease (PD) using distribution- and anchor- Conclusions: Concordance among multiple ap-
based approaches based on the following 3 external stan- proaches of analysis based on subjective and objective
dards: disability (10% on the Schwab and England Ac- data show that reasonable estimates for the CID on the
tivities of Daily Living Scale), disease stage (1 stage on UPDRS motor score are 2.5 points for minimal, 5.2 for
the Hoehn and Yahr Scale), and quality of life (1 SD on moderate, and 10.8 for large CIDs. Estimates for the
the 12-Item Short Form Health Survey). UPDRS total score are 4.3 points for minimal, 9.1 for mod-
erate, and 17.1 for large CIDs. These estimates will as-
Setting: University of Maryland Parkinson Disease and sist in determining clinically meaningful changes in PD
Movement Disorders Center, progression and response to therapeutic interventions.
Patients: Six hundred fifty-three patients with PD. Arch Neurol. 2010;67(1):64-70
A
CLINICALLY IMPORTANT DIF- proaches.7,8 The distribution-based ap-
ference (CID) is the proach relies on the empirical distribu-
amount of change on a tion of a measure in a population and the
measure that patients can derived effect size. The anchor-based ap-
recognize and value. 1 proach uses a familiar and relevant exter-
Growing interest in CIDs stems from a nal standard to determine the correspond-
greater emphasis on evidence-based and ing magnitude of change. This study uses
patient-centered medicine.2 Large ran- both of these approaches and relies on 3
domized clinical trials frequently show sig- different external standards (anchors) to
nificant differences on outcome mea- assess the CID on the UPDRS total scale
sures that are so small that clinicians are and its motor subscale.
unsure how to apply them to clinical de- The minimal clinically important
cision making.3 The Movement Disorder change on the UPDRS was previously de-
Society Task Force on Rating Scales for termined based on data from 2 clinical
Parkinson’s Disease highlighted the im- trials of dopamine agonist monotherapy
portance of identifying thresholds on the in early Parkinson disease (PD).9 Study
Unified Parkinson’s Disease Rating Scale limitations included the inability to gen-
(UPDRS) that represent clinically rel- eralize the results to more advanced PD
Author Affiliations: evant differences.4,5 The US Food and Drug and the reliance on a clinician-based mea-
Departments of Neurology Administration also described the need to sure (Clinical Global Impression of Im-
(Drs Shulman, Anderson,
define minimally important differences on provement) as the anchor for assessing
Fishman, Reich, and Weiner),
Epidemiology and Preventive patient-reported outcome measures used clinical relevance. Assessments by clini-
Medicine (Dr Gruber-Baldini), to support the labeling claims of medical cians do not always match patient evalu-
and Psychiatry (Dr Anderson), products.6 ations because of limitations in aware-
University of Maryland School The 2 key methods of CID assessment ness of the patient experience.6,10 The belief
of Medicine, Baltimore. are the distribution- and anchor-based ap- that clinically relevant differences in health
Abbreviations: SE, Schwab and England Activities of Daily Living Scale; UPDRS, Unified Parkinson’s Disease Rating Scale.
a The regression coefficients (4.5 [0.22] for the motor score and 8.6 [0.27] for the total score) show the UPDRS score change for every level of change on the
SE Scale with level of significance. Level of significance is also indicated for post hoc t test comparisons between adjacent levels on the SE Scale. Differences
between regression coefficients were significant at P ⬍.001.
b P ⬍ .01.
c P ⬍ .001.
d P ⬍ .05 (trend).
ally progressive disorder, moving from one stage to another gen- motor scores ranged from a low of 15.3 (8.2) for the sub-
erally takes several years. Therefore, the HY stages are clini- jects reporting no disability (SE Scale, 100% [com-
cally relevant and represent a relatively large change in disease pletely independent]) to a high of 60.0 (7.1) for sub-
severity. The standard deviation on the HY stages for our sample jects rated as totally dependent (SE Scale, 10%
was 0.9 (Table 1). Therefore, a change of 1 stage on the HY
[bedridden]) (Table 2). Based on the HY stages, the mean
stages is equivalent to 1.1 SD (1 stage per 0.9 SD), or greater
than the large effect size (0.8) based on the Cohen effect size. UPDRS motor scores ranged from a low of 11.2 (4.9) for
subjects with unilateral parkinsonism (stage 1) to a high
of 54.4 (11.4) for subjects assessed as wheelchair bound
DATA ANALYSIS or bedridden (stage 5) (Table 3). Regression analysis
Because the HY stages and SE Scale are not normally distributed
showed the average difference on the UPDRS motor score
(and because the HY scale is not an interval or ratio scale but rather to be 4.5 points for a 10% change on the SE Scale and
an ordinal scale), general linear model analyses (using SAS sta- 10.8 points for a 1-stage change on the HY stages. On
tistical software, version 9.1; SAS Institute Inc, Cary, North Caro- the SF-12, 1 unit on the PH or the MH summary score
lina) were performed to calculate averaged groups for the UPDRS was equivalent to a change of 0.47 points on the UPDRS
total and motor scores by the SE and HY groups (analysis of vari- motor score. Therefore, a difference of 1 SD (defined as
ance model). Average differences between these group means were 10 units) was 4.7 points on the UPDRS motor score and
then calculated. Regression models (general linear model and or- half of a standard deviation was equivalent to 2.3 or 2.4
dinary least squares) were run to examine linear changes for the points (Table 4). The distribution-based analysis showed
UPDRS measures by differences on the SF-12 because the SF-12 that the minimal CID was 2.7 points, the moderate CID
is a normally distributed interval scale. We reported the regres-
sion weight change per specified unit on the SF-12 (eg, 5 units
was 6.7, and the large CID was 10.7 (Table 4). Based on
as a small change and 10 units as a moderate change). A separate a combination of the anchor- and distribution-based analy-
general linear model was run for every predictor (HY stages, SE ses (averaging across the results), 2.5 points is an appro-
Scale, and SF-12 PH and MH) on each outcome (UPDRS total priate estimate for the minimal CID, 5.2 points for the
and motor scores), resulting in 8 separate analyses. The critical moderate CID, and 10.8 points for the large CID (Table 4).
P value for interpretation was set to P⬍.01 to adjust for multiple
comparisons. Unless otherwise indicated, scores are expressed as UPDRS TOTAL SCORE
mean (SD).
The mean UPDRS total score (subscales I, II, and III) was
RESULTS 41.0 (20.5). Based on the SE Scale ratings, the mean
UPDRS total score ranged from a low of 19.9 (9.5) for
The study sample of 653 subjects with PD is described in subjects reporting no disability (SE Scale, 100%) to a high
Table 1. The sample was predominantly white, male, and of 107.5 (7.2) for subjects rated as bedridden (SE Scale,
married, with relatively high education and income. 10%) (Table 2). Based on the HY stages, the mean UPDRS
total score ranged from a low of 16.6 (6.6) for subjects
UPDRS MOTOR SCORE with unilateral parkinsonism (stage 1) to a high of 90.2
(25.0) for subjects assessed as wheelchair bound or bed-
The mean UPDRS motor score (subscale III) was 27.2 ridden (stage 5) (Table 3). Regression analysis showed
(13.4). Based on the SE Scale ratings, the mean UPDRS the average change on the UPDRS total score to be 8.6
Abbreviations: ⌬, change; HY, Hoehn and Yahr Scale (“on” for patients whose stage fluctuated); UPDRS, Unified Parkinson’s Disease Rating Scale.
a The regression coefficients were 10.8 (0.44) for the UPDRS motor score and 17.8 (0.61) for the UPDRS total score.
b All increments are significantly different at P ⬍.001.
Table 4. Summary of Distribution- and Anchor-Based Analyses of the CID on the UPDRS Total and Motor Scores
Abbreviations: CID, clinically important difference; ellipses, not applicable. For other abbreviations, see Table 1.
points for a 10% change on the SE Scale and 17.8 points The minimal clinically important change on the UPDRS
for a 1-stage change on the HY stage. On the SF-12, 1 unit was previously studied in a sample of individuals with
on the PH summary score was equivalent to a change of early PD who participated in 2 clinical trials of dopa-
0.85 points on the UPDRS total score, and 1 unit on the mine agonist monotherapy.9 An anchor-based analysis
MH summary score was 0.91 points on the UPDRS total using the Clinical Global Impression of Improvement
score. Therefore, a change of 1 SD (defined as 10 units) found that the minimal clinically important change was
was 8.5 (PH) or 9.1 points (MH) on the UPDRS total score, 5 points on the UPDRS motor score and 8 points on the
and half of an SD was equivalent to 4.2 or 4.5 points, re- UPDRS total score.9 Our results show that the minimal
spectively (Table 4). The distribution-based analysis showed CID was smaller: 2.5 points on the UPDRS motor score
that the minimal CID was 4.1 points, the moderate CID and 4.5 points on the UPDRS total score. Because the CID
was 10.3 points, and the large CID was 16.4 points on the may vary at different stages of disease, this discrepancy
UPDRS total score (Table 4). Based on a combination of may indicate that the CID is larger in earlier PD. Samsa
the anchor- and distribution-based analyses, a change of et al7 questioned whether the initial decrement from per-
4.3 points is an appropriate estimate for the minimal CID, fect health to early symptoms may be more meaningful
9.1 points for the moderate CID, and 17.1 points for the than the impact of similar decrements in the middle part
large CID on the UPDRS total score (Table 4). of the scale. The discrepancies in the results between the
present and earlier studies underscore the importance of
COMMENT replicating these analyses in different sample popula-
tions and in longitudinal studies. There were several dif-
Concordance across a combination of distribution- and ferences between these 2 studies, including stage of dis-
anchor-based approaches for analysis of the CID dem- ease, the anchor chosen, and the clinical setting
onstrates that the moderate CID for the UPDRS motor (naturalistic vs clinical trial).
score is approximately 5 points and for the UPDRS total The presence of a larger CID in early PD is not sup-
score it is 9 points. Variability across sample popula- ported by our analysis of UPDRS ratings by 10% decre-
tions and clinical settings suggests that a range of CID ments on the SE Scale (Table 2). If this were the case,
values is likely to be more useful than a single esti- one would anticipate larger increments in the UPDRS rat-
mate.18,23 This study describes a range from minimal to ings between SE levels associated with earlier disability.
moderate to large CID, corresponding to about 2.5, 5, However, the largest increment in the UPDRS motor score
and 11 points for the UPDRS motor score and 4.5, 9, and was associated with the change in SE Scale ratings from
17 points for the UPDRS total score. 30% to 20% in advanced PD, corresponding to the change
Abbreviations: ⌬, change; PD, Parkinson disease; PSG, Parkinson Study Group; UPDRS, Unified Parkinson’s Disease Rating Scale.
a Data are not comparable owing to differences in adjustment for placebo and study duration in the different clinical trials. Missing data reflect absence of data in
the publication.
in SE responses from “With effort, now and then does a ing) based on patient responses by history and subscale
few chores alone. . . . Much help needed” to “Nothing III (motor examination) based on clinical observation.
alone. Can be a slight help. . . . Severe invalid.” These The CID analysis was initially developed as a tool for pa-
larger UPDRS score increments associated with selected tient-reported outcomes, particularly quality-of-life mea-
levels of 10% change on the SE Scale may simply iden- sures,7 and more recently has been applied to a greater
tify SE Scale cut points that signal clinical distinctions diversity of measures, including physical performance
more clearly. A single aberration is seen between SE Scale measures.25,45
scores of 60% and 50%, where the UPDRS motor score Clinically important differences may vary across dis-
goes down rather than up. The reason is unclear and re- eases, ethnicity, and socioeconomic status. Therefore,
quires further investigation. these findings may not be applicable to patients who are
Clinical trials in PD have applied arbitrary defini- nonwhite, have lower socioeconomic status, or have
tions of responders such as improvement of 20%, 30%, other forms of parkinsonism. The CID in a cross-sec-
3 points, and 5 points on the UPDRS motor scale.31-34 Three tional sample is conceptually distinct from CID analysis
to 5 points on the UPDRS motor scale is precisely the in a longitudinal sample, although cross-sectional and
minimal to moderate CID range in this analysis. longitudinal analyses have yielded similar results in
Establishing the CID for a measure is particularly mean- previous studies.7 The predetermined estimates of
ingful when this magnitude of improvement can be real- minimal, moderate, and large CID for each of 3 anchors
istically achieved. In fact, the CID results in this study are may be subject to criticism as inaccurate representa-
consistent with effect sizes on the UPDRS found in recent tions of a clinically meaningful difference in PD. How-
clinical trials (Table 5). For example, in the Earlier vs Later ever, the results compare favorably with the calculated
Levodopa Therapy in Parkinson Disease study,43 in which values of CID based on accepted standard effect sizes21
3 dosages of levodopa were studied (150, 300, and 600 mg/ in the distribution-based analysis, and the ranges of
d), the change in UPDRS total score was 5.9 points for the computed values of CID based on each of the methods
low and moderate doses and 9.2 for the highest dose, cor- were in close agreement.
responding to the minimal to moderate CID in this study. The minimum CID was originally defined as “the small-
However, UPDRS score changes of 1 to 2 points for sele- est difference in score in the domain of interest which
giline hydrochloride35 and rasagiline mesylate42 were be- patients perceive as beneficial and which would man-
low the minimal CID. Although the effect sizes in differ- date, in the absence of troublesome side effects and ex-
ent trials described in Table 5 are not comparable owing cessive cost, a change in the patient’s management.”1 The
to differences in study duration and variable adjustment role of a CID in clinical decision making is highlighted
for placebo, the range of effect sizes (1.1-8.4 for the UPDRS in this definition, although there is controversy about
motor score and 1.8-9.2 for the UPDRS total score) are in whether the CID is more applicable to the interpreta-
the range of the CIDs found in this study. tion of group or individual differences.8,10,18,46 Indeed, as-
This study relied on a combination of patient- sessing the risk to benefit ratio may be more straightfor-
reported (SF-12) and clinician-reported (SE Scale and HY ward on the individual level, where the treatment response
stage) assessments. Similarly, the UPDRS has elements may need to be especially robust to compensate for
of patient and clinician assessment, with subscales I (men- troublesome adverse effects or financial limitations. A
tation, behavior, and mood) and II (activities of daily liv- range of CID values as demonstrated in this study (mini-
Announcement