Professional Documents
Culture Documents
Objective. To prospectively validate the preliminary criteria for clinical inactive disease (CID) in patients with select
categories of juvenile idiopathic arthritis (JIA).
Methods. We used the process for development of classification and response criteria recommended by the American
College of Rheumatology Quality of Care Committee. Patient-visit profiles were extracted from the phase III randomized
controlled trial of infliximab in polyarticular-course JIA (i.e., patients considered to resemble those with select categories
of JIA) and sent to an international group of expert physician raters. Using the physician ratings as the gold standard, the
sensitivity and specificity were calculated using the preliminary criteria. Modifications to the criteria were made, and
these were sent to a larger group of pediatric rheumatologists to determine quantitative, face, and content validity.
Results. Variables weighted heaviest by physicians when making their judgment were the number of joints with active
arthritis, erythrocyte sedimentation rate (ESR), physician’s global assessment, and duration of morning stiffness. Three
modifications were made: the definition of uveitis, the definition of abnormal ESR, and the addition of morning stiffness.
These changes did not alter the accuracy of the preliminary set.
Conclusion. The modified criteria, termed the “criteria for CID in select categories of JIA,” have excellent feasibility and
face, content, criterion, and discriminant validity to detect CID in select categories of JIA. The small changes made to the
preliminary criteria set did not alter the area under the receiver operating characteristic curve (0.954) or accuracy (91%),
but have increased face and content validity.
This criteria set has been approved by the American College of Rheumatology (ACR) Board of Directors as Provisional. This signi-
fies that the criteria set has been quantitatively validated using patient data, but it has not undergone validation based on an exter-
nal data set. All ACR-approved criteria sets are expected to undergo intermittent updates.
As disclosed in the manuscript, these criteria were developed with partial financial support from industry sources. The industry
supporters were not involved in any stage of criteria development. As a courtesy, the authors sent copies of submitted manuscripts
to their industry supporters, but review and approval of the manuscripts were neither requested nor given.
Although current ACR practice is to decline requests for review of criteria that have been supported by industry, an exception
was made in this case due to prior ACR project support and because the ACR policy change took place after the industry support
was solicited and received by the investigators. ACR is an independent professional, medical and scientific society which does not
guarantee, warrant or endorse any commercial product or service. The ACR reviewed this manuscript on its merits and found the crite-
ria to be methodologically rigorous and clinically meaningful. The ACR received no compensation for its approval of these criteria.
Supported by Centocor through the investigator-initiated ward H. Giannini, MSc, DrPH, Bin Huang, PhD, Lukasz
study program and by the National Institute of Arthritis and Itert, MS: Cincinnati Children’s Hospital Medical Center
Musculoskeletal and Skin Diseases, NIH (grant P60-AR- and University of Cincinnati College of Medicine, Cincin-
44059). nati, Ohio; 3Nicolino Ruperto, MD, MPH: IRCCS G. Gaslini,
1
Carol A. Wallace, MD: Seattle Children’s Hospital and Pediatria II Reumatologia, Paediatric Rheumatology Inter-
University of Washington School of Medicine, Seattle; 2Ed- national Trials Organisation, Genoa, Italy.
929
930 Wallace et al
models included only those variables that remained sig- Table 3. Generalized estimating equations estimates
nificant at the 0.05 level. This allowed for identification of finding the best-fit model to physician ratings of inactive
those variables that influenced most heavily the physi- disease versus active disease*
cians’ determination of the disease state of the patient.
OR (95% CI)
Step 5: application of the best-fit model to predict how
No joints with active arthritis 121.3 (59.63–247.05)
the remaining 1,036 profiles would have been scored by ESR up to 110% the ULN for the 13.2 (1.32–22.57)
the physician raters. In step 5, the remaining 1,036 pa- test used†
tient-visit profiles (1,096 minus the 60 profiles actually Physician’s global assessment ⫽ best 8.9 (4.59–17.42)
scored by physician raters) that were not directly rated by attainable score on the scale used
physicians were computer scored using the best-fit regres- Duration of morning stiffness of ⱕ15 2.7 (1.04–6.78)
sion model from step 4. This allowed us to use all of the minutes
patient visits from the trial by predicting, with a consid-
* Area under the curve ⫽ 0.942. OR ⫽ odds ratio; 95% CI ⫽ 95%
erable amount of precision, how each of the profiles would confidence interval; ESR ⫽ erythrocyte sedimentation rate; ULN ⫽
have been scored, had the physicians rated each one. The upper limit of normal.
resulting predicted probabilities of patient status were † The ESR was used in the randomized controlled trial of infliximab
in juvenile idiopathic arthritis; the C-reactive protein level was not
compared to the ratings of disease status by the prelimi- assessed. The ESR criterion was later modified as shown in Table 5
nary criteria using the Kruskal-Wallis test. and described in the text.
Table 4. Agreement between preliminary criteria and Table 5. Criteria for defining clinical inactive disease in
predicted physician ratings based on the best-fit model oligoarticular (persistent and extended), polyarticular
for disease activity among 1,036 patient profiles* (RF ⴙ and ⴚ), and systemic JIA*
suggest that physicians believe that DMS of a short dura- have been classified previously as pauciarticular onset–
tion (i.e., ⱕ15 minutes) can represent residua of previously polyarticular course. Additionally, some patients in the
active disease without current active disease. However, trial had experienced a systemic onset, but followed a
DMS of a longer duration was considered to be sufficient polyarticular course. Therefore, while results presented
cause by itself to classify the patient as being in a state of here are likely most applicable to patients with polyarthri-
AD. Despite these modifications, the revised criteria set tis, oligoarticular and systemic-onset patients were in-
yielded the same area under the curve as did the prelimi- cluded in the data set from the trial. Still, results of crite-
nary criteria when using the physician ratings of patient rion and content validity and reliability may have been
profiles as the gold standard. Several reasons can be pos- different if patients with other forms of JIA, particularly
tulated for this finding. First, all patients classified as those with active systemic features, had been included.
being in a state of ID by the preliminary criteria had a DMS Patients with juvenile psoriatic arthritis, those with en-
ⱕ15 minutes. Next, the change in the criterion for inactive thesitis-related arthritis, and patients with uveitis were not
uveitis would not be expected to change the scoring, since included in this data set. Additional clinical trial data-
no patients had uveitis and the modification represents bases now in development will serve as the basis on which
only a further refinement of the definition of inactive eye to validate the criteria in other JIA disease categories as
involvement. well as permit further estimations of sensitivity and spec-
An important change from the preliminary criteria is ificity.
that the ESR may be elevated, and therefore the addition of Recently, the use of a 21-circle VAS rather than a 10-cm
the term “or, if elevated, not attributable to JIA.” No spec- line or 11-circle (0 –10 Likert-like scale) instrument to
ification of an upper limit of normal for the ESR is appro- assess PGA has become popular. The database used in this
priate due to the multitude of methods for its determina- exercise used the 11-circle scale, and therefore we were
tion, many of which have different limits of normality. unable to assess the whether the 21-circle VAS (with a
Importantly, many children with new-onset active JIA and gradation of 0.5) would have allowed more patients to be
a flare of JIA do not have elevation of the ESR. For this classified as being in ID, and increase the sensitivity of the
reason, comments from the surveys repeatedly empha- criteria set. Other databases will need to be used to inves-
sized the fact that elevations of acute-phase reactants fre- tigate this possibility.
quently are caused by conditions unrelated to JIA, and that The reliability exercises described in this manuscript do
such an elevation should not, by itself, be the sole justifi- not estimate either inter- or intrarater reliability of the
cation for classifying a patient as having active JIA. The assessment of individual components of the criteria. Non-
option of finding another source for an elevated ESR may reliability is known to exist among physician raters when
introduce bias into the definition of ID and could allow for judging the specific clinical parameters in the current cri-
potential misinterpretation by physicians. However, these teria set. Interphysician reliability coefficients of an estab-
clinical interpretation issues are the same for the joint lished criteria set cannot be estimated using data from this
examination and the PGA and reinforce the appropriate- exercise. Agreement among physicians from the first sur-
ness of calling these criteria for CID, since a biomarker is vey helped us establish what the criteria should be, not the
not currently available for the state of ID. Recently, the reliability of an established criteria set. The analysis of
C-reactive protein (CRP) level has gained popularity as the intraphysician reliability is more useful in the conven-
acute-phase reactant of choice. Therefore, clinicians and tional sense. Physicians who were able to make a determi-
investigators should feel free to use either the ESR or CRP nation of patient status on the initial survey were quite
level as a laboratory measure of inflammation. reliable in their reassessment at a later date of the same
The approach described in this article has limitations, patient profile.
and additional validation in other data sets is appropriate. The convergent validity subtype of construct validity
When the patient profile survey was conducted, we used has been estimated in prior retrospective validation exer-
the term ID rather than CID. Results of ratings may have cises using existing instruments for describing remission
been different if CID had been used in the survey rather in JIA. Analyses of these subtypes of validity were not
than ID. We believe that the term “clinical inactive dis- performed in the current effort, as other remission criteria
ease” is best to use, as these criteria have not been shown used variables that were not collected during the inflix-
to identify biologically inactive disease. It is hoped that in imab trial. Therefore, further work is needed to establish
the near future, translational research will develop such a both convergent and divergent validity, although prelimi-
definition or biomarker. Because systemically ill and per- nary results from prior retrospective work are encouraging,
sistent oligoarthritis patients were not included in the RCT as published earlier (12).
of infliximab in JIA, the results shown here apply most Van Tuyl and colleagues recently have described the
directly to those with polyarticular-course JIA (RF⫹ or collaborative process between the American College of
RF⫺). The systemic criteria in the current set are those that Rheumatology and the European League Against Rheuma-
had high face validity to physicians who participated in tism to redefine remission in adults with rheumatoid ar-
synthesis of the preliminary set. The clinical trial used for thritis (30). Two forms of remission have been developed
this exercise used the former classification of juvenile for use in clinical trials and in the clinic (30,31). In con-
rheumatoid arthritis, as defined by the American College trast, the current efforts in pediatric rheumatology have
of Rheumatology (29). However, some patients included been focused on a single definition of CID, CRM, and CR.
in the current study did have an oligoarticular onset Our ultimate goal is to establish worldwide criteria for
that progressed to extended oligoarthritis, which would CID, CRM, and CR for JIA that can be easily used in clinical
ACR Provisional Criteria for Inactive Disease in JIA 935
care and research settings. Criteria for CID have been pro- course juvenile rheumatoid arthritis. Arthritis Rheum 2007;
spectively validated and modified as a result of the vali- 56:3096 –106.
3. Lovell DJ, Reiff A, Ilowite NT, Wallace CA, Chon Y, Lin SL, et
dation process. The modified criteria, while having the
al, for the Pediatric Rheumatology Collaborative Study Group.
same sensitivity, specificity, and accuracy as the prelimi- Safety and efficacy of up to eight years of continuous etaner-
nary criteria, likely have greater face and content validity. cept therapy in patients with juvenile rheumatoid arthritis.
Future efforts are necessary to 1) validate these criteria in Arthritis Rheum 2008;58:1496 –504.
additional prospectively collected data sets (32), 2) esti- 4. Lovell DJ, Ruperto N, Goodman S, Reiff A, Jung L, Jarosova K,
et al, for the Paediatric Rheumatology International Trials
mate the conditional probability of CRM, given CID and Organisation and the Pediatric Rheumatology Collaborative
CR given CRM, and 3) establish the predictive validity of Study Group. Adalimumab with or without methotrexate in
CRM and CR. juvenile rheumatoid arthritis. N Engl J Med 2008;359:810 –20.
5. Ruperto N, Lovell DJ, Quartier P, Paz E, Rubio-Perez N, Silva
CA, et al, for the Paediatric Rheumatology International Trials
Organisation and the Pediatric Rheumatology Collaborative
ACKNOWLEDGMENTS Study Group. Abatacept in children with juvenile idiopathic
The authors wish to acknowledge the important participa- arthritis: a randomised, double-blind, placebo-controlled
tion by the following members of the PRCSG, CARRA, and withdrawal trial. Lancet 2008;372:383–91.
PRINTO: L. Abramson, S. Bowyer, B. Chalom, R. Cron, M. 6. Raine R, Sanderson C, Hutchings A, Carter S, Larkin K, Black
N. An experimental study of determinants of group judgments
Elder, H. Gewanter, N. Ilowite, L. Jung, Y. Kimura, D.
in clinical guideline development. Lancet 2004;364:429 –37.
Kingsbury, C. Lindsley, D. Lovell, D. McCurdy, R. Moore, 7. Bowles N. The Delphi technique. Nurs Stand 1999;13:32– 6.
K. O’Neil, L. Rider, C. Rose, K. Schickler, D. Sherry, J. 8. Horton JN. Nominal group technique: a method of decision-
Soep, L. Stein, R. Vehe, L. Wagner-Weiner, L. Zemel (US); making by committee. Anaesthesia 1980;35:811– 4.
B. Lang, R. Laxer, R. Schneider, L. Tucker (Canada); S. 9. Wallace CA, Ruperto N, Giannini E. Preliminary criteria for
clinical remission for select categories of juvenile idiopathic
Al-Mayouf (Saudi Arabia); B. Andersson-Gare (Sweden); arthritis. J Rheumatol 2004;31:2290 – 4.
T. Avcin (Slovenia); B. Flato (Norway); R. Cimaz, F. De 10. Boers M, Brooks P, Strand CV, Tugwell P. The OMERACT
Benedetti, F. Fantini, A. Martini, A. Ravelli (Italy); C. De filter for outcome measures in rheumatology [editorial].
Cunto, S. Garay (Argentina); C. Saad-Magalhaes, S. J Rheumatol 1998;25:198 –9.
Oliveira (Brazil); J. de Inocencio (Spain); P. Dolezalova 11. Wallace CA, Huang B, Bandeira M, Ravelli A, Giannini EH.
Patterns of clinical remission in select categories of juvenile
(Czech Republic); D. Foell, G. Horneff (Germany); J. Melo- idiopathic arthritis. Arthritis Rheum 2005;52:3554 – 62.
Gomes (Portugal); S. Nielsen (Denmark); H. Ozdogan, S. 12. Wallace CA, Ravelli A, Huang B, Giannini EH. Preliminary
Ozen (Turkey); P. Quartier (France); F. Kanakoudi-Tsaka- validation of clinical remission criteria using the OMERACT
lidou (Greece); Y. Uziel (Israel); R. Vesely (Slovakia); and filter for select categories of juvenile idiopathic arthritis.
J Rheumatol 2006;33:789 –95.
N. Wulffraat (The Netherlands). 13. Oen K. Long-term outcomes and predictors of outcomes for
patients with juvenile idiopathic arthritis. Best Pract Res Clin
Rheumatol 2002;16:347– 60.
AUTHOR CONTRIBUTIONS 14. Fantini F, Gerloni V, Gattinara M, Cimaz R, Arnoldi C, Lupi E.
All authors were involved in drafting the article or revising it Remission in juvenile chronic arthritis: a cohort study of 683
critically for important intellectual content, and all authors ap- consecutive cases with a mean 10 year followup. J Rheumatol
proved the final version to be published. Dr. Wallace had full 2003;30:579 – 84.
access to all of the data in the study and takes responsibility for 15. Flato B, Lien G, Smerdel A, Vinje O, Dale K, Johnston V, et al.
the integrity of the data and the accuracy of the data analysis. Prognostic factors in juvenile rheumatoid arthritis: a case-
Study conception and design. Wallace, Giannini, Ruperto. control study revealing early predictors and outcome after
Acquisition of data. Wallace, Giannini, Itert, Ruperto. 14.9 years. J Rheumatol 2003;30:386 –93.
Analysis and interpretation of data. Wallace, Giannini, Huang, 16. Knowlton N, Jiang K, Frank MB, Aggarwal A, Wallace C,
Ruperto. McKee R, et al. The meaning of clinical remission in polyar-
ticular juvenile idiopathic arthritis: gene expression profiling
in peripheral blood mononuclear cells identifies distinct dis-
ROLE OF THE STUDY SPONSOR ease states. Arthritis Rheum 2009;60:892–900.
17. Classification and Response Criteria Subcommittee of the
This study was supported by an unrestricted grant from Cen- American College of Rheumatology Committee on Quality
tocor. As disclosed in the manuscript, these criteria were devel- Measures. Development of classification and response criteria
oped with partial financial support from industry sources. The for rheumatic diseases [editorial]. Arthritis Rheum 2006;55:
industry supporters were not involved in any stage of criteria 348 –52.
development. As a courtesy, the authors sent copies of submitted 18. Bombardier C, Tugwell P. A methodological framework to
manuscripts to their industry supporters, but review and approval develop and select indices for clinical trials: statistical and
of the manuscripts were neither requested nor given. judgmental approaches. J Rheumatol 1982;9:753–7.
19. Felson DT, Anderson JJ, Boers M, Bombardier C, Furst D,
Goldsmith C, et al. American College of Rheumatology pre-
REFERENCES liminary definition of improvement in rheumatoid arthritis.
Arthritis Rheum 1995;38:727–35.
1. Lovell DJ, Giannini EH, Reiff A, Cawkwell GD, Silverman ED, 20. Magni-Manzoni S, Ruperto N, Pistorio A, Sala E, Solari N,
Nocton JJ, et al, for the Pediatric Rheumatology Collaborative Palmisani E, et al. Development and validation of a prelimi-
Study Group. Etanercept in children with polyarticular juve- nary definition of minimal disease activity in patients with
nile rheumatoid arthritis. N Engl J Med 2000;342:763–9. juvenile idiopathic arthritis. Arthritis Rheum 2008;59:
2. Ruperto N, Lovell DJ, Cuttica R, Wilkinson N, Woo P, Espada 1120 –7.
G, et al, for the Paediatric Rheumatology International Trials 21. Ruperto N, Ravelli A, Oliveira S, Alessio M, Mihaylova D,
Organisation and the Pediatric Rheumatology Collaborative Pasic S, et al, for the Pediatric Rheumatology International
Study Group. A randomized, placebo-controlled trial of inf- Trials Organization (PRINTO) and the Pediatric Rheumatol-
liximab plus methotrexate for the treatment of polyarticular- ogy Collaborative Study Group (PRCSG). The Pediatric Rheu-
936 Wallace et al
matology International Trials Organization/American College 28. Jabs DA, Nussenblatt RB, Rosenbaum JT. Standardization of
of Rheumatology provisional criteria for the evaluation of uveitis nomenclature for reporting clinical data: results of the
response to therapy in juvenile systemic lupus erythem- First International Workshop. Am J Ophthalmol 2005;140:
atosus: prospective validation of the definition of improve- 509 –16.
ment. Arthritis Rheum 2006;55:355– 63. 29. Cassidy JT, Levinson JE, Brewer EJ Jr. The development of
22. Giannini EH, Ruperto N, Ravelli A, Lovell DJ, Felson DT, classification criteria for children with juvenile rheumatoid
Martini A. Preliminary definition of improvement in juvenile arthritis. Bull Rheum Dis 1989;38:1–7.
arthritis. Arthritis Rheum 1997;40:1202–9. 30. Van Tuyl LH, Vlad SC, Felson DT, Wells G, Boers M. Defining
23. Rider LG, Giannini EH, Brunner HI, Ruperto N, James-Newton remission in rheumatoid arthritis: results of an initial Amer-
L, Reed AM, et al, for the International Myositis Assessment ican College of Rheumatology/European League Against
and Clinical Studies Group. International consensus on pre-
Rheumatism consensus conference. Arthritis Rheum 2009;61:
liminary definitions of improvement in adult and juvenile
704 –10.
myositis. Arthritis Rheum 2004;50:2281–90.
31. Felson DT, Smolen JS, Wells G, Zhang B, van Tuyl LH, Funo-
24. Khanna D, Lovell DJ, Giannini E, Clements PJ, Merkel PA,
Seibold JR, et al. Development of a provisional core set of vits J, et al. American College of Rheumatology/European
response measures for clinical trials of systemic sclerosis. League Against Rheumatism provisional definition of remis-
Ann Rheum Dis 2008;67:703–9. sion in rheumatoid arthritis for clinical trials. Arthritis
25. Fleiss JL. Measuring nominal scale agreement among many Rheum 2011;63:573– 86.
raters. Psychol Bull 1971;76:378 – 82. 32. Foell D, Wulffraat N, Wedderburn LR, Wittkowski H, Frosch
26. Davies EH, Surtees R, DeVile C, Schoon I, Vellodi A. A sever- M, Gerss J, et al, for the Paediatric Rheumatology Interna-
ity scoring tool to assess the neurological features of neurono- tional Trials Organization (PRINTO). Methotrexate with-
pathic Gaucher disease. J Inherit Metab Dis 2007;30:768 – 82. drawal at 6 vs 12 months in juvenile idiopathic arthritis in
27. Landis JR, Koch GG. The measurement of observer agreement remission: a randomized clinical trial. JAMA 2010;303:1266 –
for categorical data. Biometrics 1977;33:159 –74. 73.