Professional Documents
Culture Documents
Validation of The Modi Fied Checklist For Autism in Toddlers, Revised With Follow-Up (M-CHAT-R/F)
Validation of The Modi Fied Checklist For Autism in Toddlers, Revised With Follow-Up (M-CHAT-R/F)
Connecticut
WHAT THIS STUDY ADDS: The Modified Checklist for Autism in
KEY WORDS Toddlers, Revised with Follow-up (M-CHAT-R/F), simplifies wording
autism, screening, toddlers
of the original M-CHAT. The current validation study indicates that
ABBREVIATIONS the M-CHAT-R/F improves the ability to detect autism spectrum
ASD—autism spectrum disorder
CI—confidence interval
disorders in toddlers screened during well-child care visits.
GSU—Georgia State University
M-CHAT—Modified Checklist for Autism in Toddlers
M-CHAT/F—Modified Checklist for Autism in Toddlers with
Follow-up
M-CHAT-R—Modified Checklist for Autism in Toddlers, Revised
M-CHAT-R/F—Modified Checklist for Autism in Toddlers, Revised abstract
with Follow-up
OBJECTIVE: This study validates the Modified Checklist for Autism in
PPV—positive predictive value
STAT—Screening Tool for Autism in Two-Year-Olds Toddlers, Revised with Follow-up (M-CHAT-R/F), a screening tool for low-
UConn—University of Connecticut risk toddlers, and demonstrates improved utility compared with the
WCC—well-child care original M-CHAT.
Dr Robins conceptualized and designed the study, contributed to
the proposal for National Institutes of Health funding,
METHODS: Toddlers (N = 16 071) were screened during 18- and 24-
participated in data collection, analyzed the data, and drafted month well-child care visits in metropolitan Atlanta and Connecticut.
the initial manuscript; Ms Casagrande participated in data Parents of toddlers at risk on M-CHAT-R completed follow-up; those who
collection and assisted with management of the study, data
continued to show risk were evaluated.
analyses, and drafting and editing of the initial manuscript;
Dr Barton contributed to the conceptualization of the study, RESULTS: The reliability and validity of the M-CHAT-R/F were demon-
participated in data collection, and critically reviewed the strated, and optimal scoring was determined by using receiver operating
manuscript; Dr Chen contributed to data analyses and critically
reviewed the manuscript; Dr Dumont-Mathieu contributed to characteristic curves. Children whose total score was $3 initially and
conceptualization of the study, participated in data collection, $2 after follow-up had a 47.5% risk of being diagnosed with autism
and critically reviewed the manuscript; Dr Fein conceptualized spectrum disorder (ASD; confidence interval [95% CI]: 0.41–0.54) and
and designed the study, led the efforts to obtain National
Institutes of Health funding, participated in data collection,
a 94.6% risk of any developmental delay or concern (95% CI: 0.92–0.98).
contributed to the interpretation of data, and critically reviewed Total score was more effective than alternative scores. An algorithm
the manuscript; and all authors approved the final manuscript based on 3 risk levels is recommended to maximize clinical utility and to
as submitted.
reduce age of diagnosis and onset of early intervention. The M-CHAT-R
(Continued on last page) detects ASD at a higher rate compared with the M-CHAT while also
reducing the number of children needing the follow-up. Children in
the current study were diagnosed 2 years younger than the national
median age of diagnosis.
CONCLUSIONS: The M-CHAT-R/F detects many cases of ASD in toddlers;
physicians using the 2-stage screener can be confident that most
screen-positive cases warrant evaluation and referral for early
intervention. Widespread implementation of universal screening can
lower the age of ASD diagnosis by 2 years compared with recent
surveillance findings, increasing time available for early intervention.
Pediatrics 2014;133:37–45
FIGURE 1
Flowchart indicating screening results. aEvaluations based on cases detected through STAT screen positive (n = 20), physician concern (n = 18), and alternate
scoring (n = 4). bDetected through STAT (n = 6), physician concern (n = 9), and alternate scoring (n = 3). Neg, negative; Pos, positive.
38 ROBINS et al
Downloaded from by guest on November 23, 2015
ARTICLE
TABLE 1 Total Sample Demographic Characteristics by Site and Level of Completion Childhood Autism Rating Scale–2,17 the
UConn GSU Total Toddler Autism Symptom Interview,18
Complete Incomplete Complete Incomplete Mullen Scales of Early Learning,19
Total sample 5932 115 9680 344 16 071
Vineland Adaptive Behavior Scales–II,20
Sex Behavioral Assessment System for
Male 3048 71 4745 192 8056 Children–2,21 and a developmental his-
Female 2877 43 4693 143 7756
Not reported 7 1 242 9 259
tory form.
Race
White/Caucasian 3712 47 4850 89 8698 Procedures
Black/African American 426 15 2874 185 3500
Asian/Pacific Islander 316 15 412 9 752 Parents completed the M-CHAT, Revised
Native American/ Alaskan Native 26 0 11 1 38 (M-CHAT-R), and provided informed
Bi- or multiracial 464 9 465 17 955 consent and demographic character-
Other 46 3 56 5 110
Unknown 942 26 1012 38 2018 istics during their child’s 18- or 24-
Ethnicity month WCC visit (41 sites at GSU, 44
Hispanic 1361 35 435 7 1838 sites at UConn). Pediatricians were
Non-Hispanic 4571 80 9245 337 14233
Maternal education asked to indicate concern about ASD,
Less than high school 429 15 379 36 859 based on their clinical judgment, by
High school/GED 1149 36 1389 82 2656 checking a box at the top of the
Some college 1356 25 2201 70 3652
College degree 1512 19 2477 60 4068
screener. Completed M-CHAT-R forms
Advanced degree 1284 14 1764 29 3091 were scored at GSU or UConn. Re-
Unknown 202 6 1470 67 1745 search staff contacted parents of
ASD concernsa 13 6 32 13 65
Age at screening, mean (SD), mo 20.86 (3.20) 20.53 (3.04) 21.01 (3.36) 20.71 (3.57) 20.94 (3.30)
screen-positive children to complete
M-CHAT-R total score, mean (SD)b 0.68 (1.22) 4.63 (2.54) 0.64 (1.24) 4.10 (2.96) 0.76 (1.44) the follow-up by telephone; children
M-CHAT-R/F total score, mean (SD)b 1.24 (2.09) 3.15 (1.75) 1.17 (2.15) 3.06 (2.20) 1.43 (2.20) who continued to screen positive on
Data are presented as n unless otherwise indicated. Incomplete cases were eligible to complete additional steps based on the M-CHAT-R/F or whose physician had
M-CHAT-R scores but did not complete the study. Incomplete cases include those who initially screened positive and did not
complete the follow-up as well as those who continued to screen positive on follow-up and did not attend the evaluation. concerns were offered a diagnostic
Completed cases completed all eligible steps of the study, but in most cases, children who screened negative were not asked evaluation. Evaluations were conducted
to continue to any additional assessment. GED, general educational diploma.
a Physicians noted ASD concerns by checking a box on the M-CHAT-R protocol. by a team consisting of a licensed
b Note that the incomplete columns contain those cases who screened positive but did not complete the follow-up or
psychologist/developmental pediatri-
evaluation. Therefore, it is expected that they show elevated scores relative to the completed cases.
cian supervising a graduate student
and research assistants; team mem-
structured follow-up questions to ob- with one finger….” Finally, examples bers were research reliable on all mea-
tain additional information and exam- provided developmental context and sures they administered.
ples of at-risk behaviors, which takes clarity. The final diagnosis integrated all
∼5 to 10 minutes with a professional The original M-CHAT recommended available information and used the
(ie, nurse or physician’s assistant).13 a threshold of $3 items total or $2 psychologist/developmental pediatri-
The M-CHAT-R/F14 incorporated 5 mod- critical items identified through dis- cian’s clinical judgment to assess Di-
ifications to improve utility. Three items criminant function analysis.15 However, agnostic and Statistical Manual of
that performed poorly were dropped analyses of larger samples indicated Mental Disorders, 4th edition, text re-
(peek-a-boo, playing with toys, and that the critical score did not improve vision (DSM-IV-TR)22 criteria for Autistic
wandering without purpose). The re- sensitivity above the total score.12 The Disorder and Pervasive Developmental
maining 20 items were reorganized current study tested several scoring Disorder, Not Otherwise Specified.
to remove agreement bias. The items methods. A threshold based on total When ASD was ruled out, diagnoses of
that comprised the Best7 score (see score had strong psychometric prop- Global Developmental Delay, Language
Supplemental Information) were placed erties and was more parsimonious Delay, or other DSM-IV-TR disorders
within the first 10 items. Language was than combinations of total and alter- were considered. Children who did
simplified to improve comprehension. native scorings (see Supplemental In- not meet criteria for any diagnosis
For example, “Does your child ever use formation). were classified as typically developing
his/her index finger to point…” was Clinical measures included the Autism or as having developmental concerns,
rephrased as “Does your child point Diagnostic Observation Schedule,16 which were operationally defined as
40 ROBINS et al
Downloaded from by guest on November 23, 2015
ARTICLE
Total2 as the threshold on the follow-up screen positive. The sample was ex- 8–20; warrants immediate referral
halves the number of missed cases, amined to determine (1) the number of for evaluation and intervention) (see
significantly increasing sensitivity cases who reverted from screen posi- Fig 3). In the current sample, 75 chil-
(McNemar’s test, P , .001) and dem- tive to screen negative during the dren scored in the high-risk range on
onstrates an area under the curve of follow-up and (2) the initial M-CHAT-R M-CHAT-R and completed the evalua-
0.907 (see Table 3 and Fig 2). Psycho- scores for those children diagnosed tion, all of whom were diagnosed with
metrics were verified on the sub- with ASD to arrive at the following risk developmental disorders or concerns
sample ascertained after the score classifications: low risk (total score: (44 ASDs, 27 non-ASD disorders, 4
change was implemented (n = 7579; 60 0–2; requires no further evaluation developmental concerns). Compared
diagnosed with ASD). unless other risk factors are present), with a positive predictive value (PPV) of
To increase utility of a 2-stage screening medium risk (total score: 3–7; requires the initial questionnaire of 0.26 for any
tool in busy pediatric settings, it would administration of the follow-up to de- developmental delay or concern (con-
be helpful to bypass the follow-up in termine whether referrals are war- fidence interval [95% CI]: 0.20–0.32),
cases who are likely to continue to ranted), and high risk (total score: the PPV for high-risk scores is 1.0. It is
FIGURE 2
A: ROC curve for 2-stage M-CHAT-R/F. AUC, area under the curve; ROC, receiver operating characteristic. B: Table showing sensitivity and specificity for each
M-CHAT-R/F total score.
M-CHAT improved the tool for use in d VABS-II standard scores (mean = 100, SD = 15).
42 ROBINS et al
Downloaded from by guest on November 23, 2015
ARTICLE
may bypass the follow-up. On the basis The performance of the M-CHAT-R/F was seen. Therefore, the rate of detection of
of initial screening only, the total sam- compared with the published studies 1 in 127 may not be far off from the
ple of screen-positive children have a using the M-CHAT/F in low-risk sam- actual prevalence in 2-year-olds.
27% risk of any developmental delay or ples.12 Overall, the revision significantly
concern (95% CI: 0.20–0.32), whereas reduced the initial screen-positive rate, Limitations and Future Directions
all cases in the high-risk range were which means that fewer children re- One limitation of level 1 screening re-
diagnosed with delays or concerns, jus- quire follow-up. Also indicating im- search is that it is impossible to eval-
tifying immediate referrals for evalua- provement of the tool, the rate of ASD uate all screen-negative cases to identify
tion and possible early intervention. detection increased for M-CHAT-R/F. misses and calculate true sensitivity.
Children who screened positive on M- Although it is impossible to rule out An additional challenge is that many
increasing ASD prevalence as contrib- parentsofchildrenwhoinitiallyscreened
CHAT-R/F were 114 times more likely
uting to this finding, this finding sug- positive did not complete additional
to receive an ASD diagnosis than chil-
gests that the reduction in the initial steps in the study.27 The current study
dren who screened negative. In addi-
screen-positive rate is not negatively had a disproportionately high number
tion, 94.6% of children evaluated for ASD
affecting sensitivity. When M-CHAT-R/F of African-American families who did
risk on the 2-stage M-CHAT-R/F showed
was compared with physician clinical not complete the study (eg, follow-up
developmental delay or concern that
judgment, sensitivity was significantly or evaluation), indicating that barriers
warranted referrals to early interven-
higher for M-CHAT-R/F; when these continue to exist even under stan-
tion (95% CI: 0.92–0.98). Although one
methods were combined, ASD detection dardized protocols. In addition, mater-
might interpret this finding to mean
was very high, indicating that stan- nal education was significantly higher
that the M-CHAT-R/F may be screening
dardized screening in conjunction with in the GSU sample (mean: 14.93 years;
more broadly than for ASD, it is not routine developmental surveillance SD: 2.53 years) than the UConn sample
justified to use the screener for that optimizes early detection for ASD. (mean: 14.57 years; SD: 2.46 years) (t
purpose, given that the sensitivity of the
It is important to examine the preva- [13 938] = 28.37; P , .001), although
tool for non-ASD delays is not known.
lence of ASD detected in the current the effect size was very small (h2 =
One study directly comparing M-CHAT
sample to evaluate utility of the 0.005). These variables are complex
to the Parents’ Evaluation of Devel- and are addressed in other articles.28
screening tool. The M-CHAT-R/F detected
opmental Status (PEDS)24 found that
ASD at a rate of 1 per 149 cases. This rate The current study used multiple ap-
25% of children demonstrated risk for
is notably below the published preva- proaches to detect possible false-
a broad range of developmental con-
lence of ASD as 1 in 882; however, the negative cases, improving accuracy of
cerns on the PEDS, far exceeding the
Centers for Disease Control and Pre- sensitivity estimates. Both sites asked
screen-positive rate of the M-CHAT.25 An vention’s prevalence data were ascer- physicians to identify cases of possible
important finding from the study is that tained on the basis of review of school ASD, and these families were offered
the average age of diagnosis was just and health records for 8-year-old chil- evaluation regardless of M-CHAT-R/F
after the second birthday, which is 2 dren, and it is not expected that all ASD score. In some cases, when physicians
years earlier than the median age of cases will be detectable in toddlers. had ASD concerns but the child screened
diagnosis2; this finding suggests that Furthermore, with enhanced methods negative on the M-CHAT-R/F, the children
implementing standardized screening to detect missed cases, such as fol- were found to have other develop-
and expeditious evaluation for positive lowing up on physician concern and mental delays; however, 9 ASD cases
cases can greatly increase the time sampling screen-negative cases, 123 were detected with physician concern
that children are eligible for early- ASD cases were detected in the sample, but had negative M-CHAT-R/F. Not all
intervention services and therefore which is 1 in 127. It is likely that many of physicians applied this surveillance
improve the outcome. However, it is the remaining children who will later component equally in their practices,
important to note that ASD screening be diagnosed with ASD, such as those and further research may identify fac-
continues to be challenging. Because with Asperger disorder without early tors that predict use of surveillance,
no screening tool can have perfect developmental delays, are not showing screening, and their integration. A sec-
sensitivity and specificity, providers significant symptoms at this young ond approach to find false negatives
should continue to perform develop- age26; in addition, later detection may conducted at 1 site (GSU) invited a
mental surveillance in addition to us- occur in mild cases only in the school sample of screen-negative cases for
ing validated screening tools. setting where peer interactions can be play-based screening. The sample was
REFERENCES
1. Autism Spectrum Disorder. American Psy- 6. Robins DL, Fein D, Barton M. The Modified 12. Chlebowski C, Robins DL, Barton ML, Fein D.
chiatric Association. Diagnostic and Sta- Checklist for Autism in Toddlers (M-CHAT). Large-scale use of the Modified Checklist
tistical Manual of Mental Disorders. 5th ed. Self-published; 1999 for Autism in low-risk toddlers. Pediatrics.
Arlington, VA: American Psychiatric Pub- 7. Canal-Bedia R, García-Primo P, Martín- 2013;131(4):e1121–e1127
lishing; 2013:50–59 Cilleros MV, et al. Modified Checklist for 13. Nygren G, Sandberg E, Gillstedt F, Ekeroth G,
2. Autism and Developmental Disabilities Autism in Toddlers: cross-cultural adapta- Arvidsson T, Gillberg C. A new screening
Monitoring Network Surveillance Year 2008 tion and validation in Spain. J Autism Dev programme for autism in a general pop-
Principal Investigators; Centers for Disease Disord. 2011;41(10):1342–1351 ulation of Swedish toddlers. Res Dev Dis-
Control and Prevention. Prevalence of au- 8. Inada N, Koyama T, Inokuchi E, et al. Re- abil. 2012;33(4):1200–1210
tism spectrum disorders—Autism and liability and validity of the Japanese ver- 14. Robins DL, Fein D, Barton M. The Modified
Developmental Disabilities Monitoring Net- sion of the Modified Checklist for Autism in Checklist for Autism in Toddlers, Revised
work, 14 sites, United States, 2008. MMWR Toddlers (M-CHAT). Res Autism Spectr Dis-
with Follow-Up (M-CHAT-R/F). Self-published;
Surveill Summ. 2012;61(3):1–19 ord. 2011;5(1):330–336
2009
3. Myers SM, Johnson CP; American Academy 9. Gura GF, Champagne MT, Blood-Siegfried JE.
15. Robins DL, Fein D, Barton ML, Green JA. The
of Pediatrics Council on Children With Autism spectrum disorder screening in
Modified Checklist for Autism in Toddlers:
Disabilities. Management of children with primary care. J Dev Behav Pediatr. 2011;32
autism spectrum disorders. Pediatrics. an initial study investigating the early de-
(1):48–51
2007;120(5):1162–1182 tection of autism and pervasive develop-
10. Dumont-Mathieu T, Fein D. Screening for
mental disorders. J Autism Dev Disord.
4. Johnson CP, Myers SM; American Academy autism in young children: the Modified
of Pediatrics Council on Children With 2001;31(2):131–144
Checklist for Autism in Toddlers (M-CHAT)
Disabilities. Identification and evaluation of and other measures. Ment Retard Dev 16. Lord C, Risi S, Lambrecht L, et al. The
children with autism spectrum disorders. Disabil Res Rev. 2005;11(3):253–262 Autism Diagnostic Observation Schedule-
Pediatrics. 2007;120(5):1183–1215 11. Kleinman JM, Robins DL, Ventola PE, et al. Generic: a standard measure of social and
5. Mandell DS, Listerud J, Levy SE, Pinto- The Modified Checklist for Autism in Tod- communication deficits associated with the
Martin JA. Race differences in the age at dlers: a follow-up study investigating the spectrum of autism. J Autism Dev Disord.
diagnosis among Medicaid-eligible chil- early detection of autism spectrum disor- 2000;30(3):205–223
dren with autism. J Am Acad Child Adolesc ders. J Autism Dev Disord. 2008;38(5):827– 17. Schopler E, Reichler RJ, Renner BR. The
Psychiatry. 2002;41(12):1447–1453 839 Childhood Autism Rating Scale-2. Los
44 ROBINS et al
Downloaded from by guest on November 23, 2015
ARTICLE
Angeles, CA: Western Psychological Ser- 22. American Psychiatric Association. Di- online December 21, 2012]. Autism. 2012.
vices; 2010 agnostic and Statistical Manual of Mental Available at: http://aut.sagepub.com/content/
18. Barton M, Boorstein H, Herlihy L, Dumont- Disorders. 4th ed, text revision. Wash- early/2012/12/21/1362361312466962.full.pdf.
Mathieu T, Fein D. Toddler ASD Symptom ington, DC: American Psychiatric Associa- (Accessed September 4, 2013)
Interview. Self-published; 2012 tion; 2000 26. Marks K, Glascoe FP, Aylward GP, Shevell MI,
19. Mullen EM. Mullen Scales of Early Learning. 23. Stone WL, Coonrod EE, Ousley OY. Brief re- Lipkin PH, Squires JK. The thorny nature of
port: Screening Tool for Autism in Two-year- predictive validity studies on screening
Circle Pines, MN: American Guidance Ser-
olds (STAT): development and preliminary tests for developmental-behavioral prob-
vice, Inc; 1995
data. J Autism Dev Disord. 2000;30(6):607– lems. Pediatrics. 2008;122(4):866–868
20. Sparrow SS, Cicchetti DV, Balla DA. Vineland 612 27. Pierce K, Carter C, Gallagher N, et al.
Adaptive Behavior Scales. 2nd ed. (Vineland
24. Glascoe FP. Parents’ Evaluation of De- Detecting, studying, and treating autism
II) Survey Interview Form/Caregiver Rating velopmental Status (PEDS). Nolensville, TN: early: the one-year well-baby check-up ap-
Form. Livonia, MN: Pearson Assessments; PEDSTest.com, LLC; 2010. Available at: www. proach. J Pediatr. 2011;159(3):458–465
2005 pedstest.com. Accessed September 9, 2013 28. Herlihy L, Brooks B, Dumont-Mathieu T,
21. Reynolds CR, Kamphaus RW. Behavior As- 25. Wiggins LD, Piazza V, Robins DL. Compari- et al. Standardized screening facilitates
sessment System for Children (BASC-2). son of a broad-band screen versus disorder- timely diagnosis of autism spectrum dis-
2nd ed. Bloomington, MN: Pearson Assess- specific screen in detecting young children order in a diverse sample of low-risk tod-
ments; 2004 with an autism spectrum disorder [published dlers. J Dev Behav Pediatr. 2013, In press
The online version of this article, along with updated information and services, is
located on the World Wide Web at:
/content/133/1/37.full.html