Professional Documents
Culture Documents
Purpose: To determine the diagnostic performance of the COVID-19 Reporting and Data System (CO-RADS) and the Radiological
Society of North America (RSNA) categorizations in patients with clinically suspected coronavirus disease 2019 (COVID-19)
infection.
Materials and Methods: In this meta-analysis, studies from 2020, up to August 24, 2020, were assessed for inclusion criteria of studies
that used CO-RADS or the RSNA categories for scoring chest CT in patients suspected of having COVID-19. A total of 186 studies
were identified. After review of abstracts and text, a total of nine studies were included in this study. Patient information (n¸ age, sex),
CO-RADS and RSNA scoring categories, and other study characteristics were extracted. Study quality was assessed with the Quality
Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool. Meta-analysis was performed with a random effects model.
Results: Nine studies (3283 patients) were included. Overall study quality was good, except for risk of nonperformance of repeated
reverse-transcriptase polymerase chain reaction (RT-PCR) testing after negative initial RT-PCR testing and persistent clinical suspicion
in four studies. Pooled COVID-19 frequencies in CO-RADS categories were: 1, 8.8%; 2, 11.1%; 3, 24.6%; 4, 61.9%; and 5, 89.6%.
Pooled COVID-19 frequencies in RSNA classification categories were: negative, 14.4%; atypical, 5.7%; indeterminate, 44.9%; and
typical, 92.5%. Pooled pairs of sensitivity and specificity using CO-RADS thresholds were the following: at least 3, 92.5% (95% CI:
87.1, 95.7) and 69.2% (95%: CI: 60.8, 76.4); at least 4, 85.8% (95% CI: 78.7, 90.9) and 84.6% (95% CI: 79.5, 88.5); and 5, 70.4%
(95% CI: 60.2, 78.9) and 93.1% (95% CI: 87.7, 96.2). Pooled pairs of sensitivity and specificity using RSNA classification thresholds
for indeterminate were 90.2% (95% CI: 87.5, 92.3) and 75.1% (95% CI: 68.9, 80.4) and for typical were 65.2% (95% CI: 37.0,
85.7) and 94.9% (95% CI: 86.4, 98.2).
Conclusion: COVID-19 infection frequency was higher in patients categorized with higher CO-RADS and RSNA classification
categories.
© RSNA, 2021
This copy is for personal use only. To order printed copies, contact reprints@rsna.org
Diagnostic Performance of CO-RADS and the RSNA Classification System
Study Selection
Abbreviations
Original studies that provided data on the diagnostic perfor-
CO-RADS = COVID-19 reporting and data system, COVID-19
= coronavirus disease 2019, QUADAS-2 = Quality Assessment of mance of the CO-RADS or RSNA classification system in
Diagnostic Accuracy Studies 2, RSNA = Radiological Society of evaluating patients with clinically suspected COVID-19 infec-
North America, RT-PCR = reverse-transcription polymerase chain tion, and in which reverse-transcription polymerase chain re-
reaction action (RT-PCR) was the reference standard, were eligible for
Summary inclusion. Reviews, abstracts, and studies were excluded for the
The frequency of coronavirus disease 2019 infection was higher in following reasons: (a) included fewer than 10 patients, (b) re-
patients with higher CO-RADS and RSNA classification categories, ported insufficient data to compose a 2 3 2 contingency table
which supports the order of grading used by both systems. to calculate sensitivity and specificity on per-patient level for
any CO-RADS or RSNA classification system threshold, and
Key Points
(c) only provided data on the performance of artificial intel-
n Using the lowest clinically meaningful thresholds of CO-RADS of
at least 3 and indeterminate according to the RSNA classification, ligence–based analyses. When overlapping data were presented
sensitivity values were 92.5% and 90.2%, which implies that CO- in more than one study, the study with the largest number of
RADS 1 and 2 and RSNA classification categories negative and patients was selected. Titles and abstracts of retrieved studies
atypical certainly do not exclude COVID-19. were reviewed using the aforementioned selection criteria. The
n Using the highest thresholds of CO-RADS 5 and typical accord- full-text version of each potentially eligible study was then re-
ing to the RSNA classification, specificity values increased up to
93.1% and 94.9% at the cost of sensitivity, with values of 70.4% viewed to definitively determine whether the study fulfilled the
and 65.2%, respectively. selection criteria.
Figure 1: Flow diagram of study selection. The asterisk indicates that there were duplicate studies.
Cochran Q and x2 tests were performed to test for heterogene- evaluated with CO-RADS and 1400 patients were evaluated
ity between studies, which was defined as P , .10. Statistical with the RSNA classification system.
analyses were performed using the Open Meta-Analyst soft-
ware package (12) and Meta-analysis of Diagnostic Accuracy Study Quality
Studies package in R software (13,14). Figure 2 provides a summary of the QUADAS-2 quality as-
sessments. In one study (19), it was unclear whether pa-
Results tients were enrolled consecutively or randomly. There was
no risk of bias with regard to patient selection in the other
Literature Search studies or with regard to index test. Risk of bias with respect
Figure 1 displays the study selection process. A total of 182 to reference test was rated high in three studies (22,25,26)
studies were eligible for inclusion after searching databases. Af- because repeated RT-PCR testing was not used in all pa-
ter screening titles and abstracts, 168 studies were excluded, tients with a negative initial RT-PCR result and persistent
leaving 14 studies that were potentially eligible for inclusion. clinical suspicion of COVID-19. Risk of bias with respect
After reading the full text of the 14 studies, three studies to reference test was rated unclear in one study (20) be-
(15–17) were excluded because the diagnostic performance of cause it was not clear whether all patients with an initial
either CO-RADS or the RSNA classification system was not negative RT-PCR result and a persistent clinical suspicion
investigated, one study (5) was excluded because no data on a of COVID-19 underwent repeated RT-PCR testing. In one
per-patient level were reported, and another study (18) was ex- study (19), there was potential risk of bias with regard to
cluded because there were overlapping data with another study flow and timing because the time interval between CT and
(7) which comprised a larger number of patients. Nine studies RT-PCR testing was not reported. There was no risk of bias
were eventually included (7,19–26). with regard to flow and timing in the other studies because
The main study characteristics are shown in Table 1 and Table the maximum time interval between chest CT and RT-PCR
E3 (supplement). All assessed studies were performed between did not exceed 7 days (21). There were no applicability
January and June 2020. The median number of patients per concerns.
study was 312 (range, 71–859), and the total number of patients
of all studies combined was 3283. All nine studies included pa- Diagnostic Performance of CO-RADS
tients with a clinical suspicion of COVID-19. The mean fre- The frequency of COVID-19 in each of the categories of
quency of COVID-19 was 48.7% (range, 41.7%–59.8%). Of CO-RADS is displayed in Table 2. With higher CO-RADS
all patients included in the nine studies, 1979 patients were classification, the frequency of COVID-19 increased.
Pooled frequency of COVID-19 in CO-RADS categories Diagnostic Performance of the RSNA Classification System
1, 2, 3, 4, and 5 were 8.8%, 11.1%, 24.6%, 61.9%, and The frequency of COVID-19 in each of the categories of the
89.6%, respectively. Pooled sensitivity and specificity of the RSNA classification systems is displayed in Table 4. With
CO-RADS and the RSNA classification system at specific higher RSNA classification, the frequency of COVID-19 in-
thresholds are displayed in Table 3. Pooled pairs of sensitiv- creased. Pooled frequencies of COVID-19 in RSNA classifi-
ity and specificity using CO-RADS thresholds were the fol- cation categories negative, atypical, indeterminate, and typical
lowing: at least 3, 92.5% (95% CI: 87.1, 95.7) and 69.2% were 14.4%, 5.7%, 44.9%, and 92.5%, respectively. Pooled
(95% CI: 60.8, 76.4); at least 4, 85.8% (95% CI: 78.7, sensitivity and specificity of the RSNA classification system
90.9) and 84.6% (95% CI: 79.5, 88.5); and 5, 70.4% (95% at specific thresholds are displayed in Table 5. Pooled pairs of
CI: 60.2, 78.9) and 93.1% (95% CI: 87.7, 96.2). sensitivity and specificity using RSNA classification thresholds
Figure 2: Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) quality assessments of included studies.
Hermans et al (23) 6.1% (6/99) 9.4% (3/32) 9.1% (4/44) 64.5% (20/31) 90.1% (100/111)
Korevaar et al (24)† 5.9% (4/68) 17.2% (5/29) 82.4% (117/142)
de Jaegere et al (7)* 11.1% (1/9) 3.1% (1/32) 38.5% (5/13) 76.9% (10/13) 96.6% (28/29)
Dofferhoff et al (26) 10.2% (9/88) 14.3% (3/21) 19.4% (6/31) 63.0% (17/27) 82.1% (119/145)
Pooled frequency‡ 8.8% (6.2, 11.4) 11.1% (4.3, 18.0) 24.6% (12.8, 36.5) 61.9% (45.0–78.7) 89.6% (85.6, 93.7)
P value for heterogene- .35 .048 ,.001 ,.001 .04
ity§
Note.—The 95% CI is shown within parenthesis for the pooled frequency.
* Data from the first reader.
†
CO-RADS categories 1 and 2, and CO-RADS categories 4 and 5 were merged.
‡
CO-RADS 1, 2, 4, and 5 data from the study of Korevaar et al (24) were not included in the pooled analysis.
§
Statistical heterogeneity between studies was defined as P , .10.
Table 4: Frequency of COVID-19 in Each of the Categories of the RSNA Classification System
Table 5: Pooled Sensitivity and Specificity at Specific Thresholds according to the RSNA Classification
System
RSNA classification, sensitivity values were 92.5% (95% CI: sensitivity and specificity were 94.6% (95% CI: 91.9, 96.4) and
87.1, 95.7) and 90.2% (95% CI: 87.5, 92.3), respectively. 46.0% (95% CI: 31.9, 60.7), respectively (4). Using CO-RADS
These findings imply that CO-RADS 1 and 2 and RSNA of at least 3 and RSNA classification indeterminate as diagnostic
classification categories negative and atypical do not exclude thresholds, similar sensitivity values of 92.5% (95% CI: 87.1,
COVID-19. Furthermore, when using these low diagnostic 95.7) and 90.2% (95% CI: 87.5, 92.3) can be achieved, while
thresholds, specificity is only moderate with values of 69.2% relatively higher specificity values of 69.2% (95% CI: 60.8,
(95% CI: 60.8, 76.4) for CO-RADS of at least 3 and 75.1% 76.4) and 75.1% (95% CI: 68.9, 80.4) are obtained. Thus,
(95% CI: 68.9, 80.4) for RSNA indeterminate. If higher diag- when using CO-RADS or the RSNA classification system in-
nostic thresholds are applied, specificity naturally increases at stead of nonstandardized criteria, it appeared that specificity may
the cost of sensitivity. Using CO-RADS of at least 5 and the be improved without sacrificing sensitivity.
RSNA classification typical as diagnostic thresholds, specificity If a low threshold is being used (eg, any lung abnormality
values increased up to 93.1% (95% CI: 87.7, 96.2) and 94.9% on chest CT is considered positive for COVID-19), virtually all
(95% CI: 86.4, 98.2). However, when using these high diag- COVID-19 cases with lung abnormalities will be correctly clas-
nostic thresholds, sensitivity is only moderate with values of sified, but all non–COVID-19 cases with any lung abnormality
70.4% (95% CI: 60.2, 78.9) and 65.2% (95% CI: 37.0, 85.7). at chest CT will be incorrectly classified as having COVID-19
Methodologic quality of the studies included in the current (28). By applying standardized diagnostic criteria such as CO-
meta-analysis generally appeared to have higher quality than RADS or the RSNA classification system, a higher proportion
studies included within prior meta-analyses (4,8). In two prior of non–COVID-19 cases with lung abnormalities due to other
meta-analyses, high risk of bias was present in all six included lung diseases will be correctly classified as not having COVID-19
studies (100%) (4) and in 10 of 13 included studies (77%) (8). but an alternative lung disease. It should be noted that the stud-
In our current meta-analysis, the reference standard was the only ies in our meta-analysis included patients between January and
QUADAS-2 item which was deemed to be of high risk of bias. June 2020, a period with a high COVID-19 frequency (mean
This item applied to three of the nine included studies (33%) be- of 48.7%; range, 41.7%–59.8%). Specificity is likely to decrease
cause repeated RT-PCR testing was not used in all patients with with lower COVID-19 frequency and increasing frequency of
a negative initial RT-PCR result and persistent clinical suspicion other viral lung infections, such as influenza (29).
of COVID-19 (22,25,26). Our study had some limitations. First, the included studies
Importantly, we provided a meta-analysis that specifically fo- used RT-PCR, which is an imperfect reference standard with a
cused on the diagnostic performance of chest CT in COVID-19 reported sensitivity of 89% (95% CI: 81, 94) (30). Sensitivity of
by selecting studies that used standardized diagnostic criteria. RT-PCR appears to be lower in elderly patients (30), which may
Therefore, our study results were more generalizable and useful be due to sampling error in these patients who are more likely
to clinical practice compared with other prior meta-analyses on to have poorer performance status (25), Furthermore, vendor-
CT for COVID-19 assessment. Our finding that CO-RADS 1 specific effects and differences in the quality assurance process
and 2 and RSNA classification categories negative and atypical may affect the performance of RT-PCR (30). However, RT-PCR
do not exclude COVID-19 was in line with the results of a meta- is still the recommended method to confirm current COVID-19
analysis in nearly 3500 patients, which reported an estimated infection (31–33). Second, because of the relatively low num-
frequency of 10.6% for normal chest CT findings in symptom- ber of included studies, we did not perform subgroup or meta-
atic patients with COVID-19 (27). In a prior meta-analysis of regression analyses to explain statistical heterogeneity between
six studies which did not use uniform diagnostic criteria, pooled studies. Geographic differences, nonreported prevalence of other
lung diseases, interobserver variability in chest CT assessment, 13. The R Project for Statistical Computing. https://www.r-project.org/. Accessed
August 24, 2020.
RT-PCR performance, and some methodologic quality issues 14. Mada-package. https://www.rdocumentation.org/packages/mada/ver-
may have been potential sources of heterogeneity. Note that in- sions/0.5.8/topics/mada-package. Accessed August 24, 2020.
terobserver agreement varied from substantial to almost perfect 15. Byrne D, O’Neill SB, Müller NL, et al. RSNA Expert Consensus State-
ment on Reporting Chest CT Findings Related to COVID-19: Interob-
for the CO-RADS (7,19) and from moderate to substantial for server Agreement Between Chest Radiologists. Can Assoc Radiol J 2020.
the RSNA classification system (7,22). 10.1177/0846537120938328. Published online July 2, 2020.
In conclusion, COVID-19 infection frequency was higher in 16. Lang M, Som A, Mendoza DP, et al. Detection of Unsuspected Coronavirus
Disease 2019 Cases by Computed Tomography and Retrospective Imple-
patients categorized with higher CO-RADS and RSNA classifi- mentation of the Radiological Society of North America/Society of Thoracic
cation categories. Radiology/American College of Radiology Consensus Guidelines. J Thorac
Our data may be useful for deciding on the probability of Imaging 2020. 10.1097/RTI.0000000000000542. Published online June
17, 2020.
COVID-19 based on chest CT (along with clinical information 17. Adams SJ, Dennie C. Chest imaging in patients with suspected COVID-19.
and RT-PCR). CMAJ 2020;192(25):E676.
18. Kwee RM, Krdzalic J, Fasen BACM, de Jaegere TMH; COVID-19 CT
Author contributions: Guarantors of integrity of entire study, R.M.K., T.C.K.; Investigators South-East Netherlands (CISEN) Study Group. CT Scanning
study concepts/study design or data acquisition or data analysis/interpretation, all in Suspected Stroke or Head Trauma: Is it Worth Going the Extra Mile
authors; manuscript drafting or manuscript revision for important intellectual con- and Including the Chest to Screen for COVID-19 Infection? AJNR Am J
tent, all authors; approval of final version of submitted manuscript, all authors; Neuroradiol 2020;41(7):1165–1169.
agrees to ensure any questions related to the work are appropriately resolved, all 19. Fujioka T, Takahashi M, Mori M, et al. Evaluation of the Usefulness of
authors; literature research, all authors; statistical analysis, all authors; and manu- CO-RADS for Chest CT in Patients Suspected of Having COVID-19.
script editing, all authors. Diagnostics (Basel) 2020;10(9):E608.
20. De Smet K, De Smet D, Ryckaert T, et al. Diagnostic Performance of Chest
CT for SARS-CoV-2 Infection in Individuals with or without COVID-19
Disclosures of Conflicts of Interest: R.M.K. disclosed no relevant relation-
Symptoms. Radiology 2021;298(1):E30–E37.
ships. H.J.A.A. disclosed no relevant relationships. T.C.K. disclosed no relevant
21. Falaschi Z, Danna PSC, Arioli R, et al. Chest CT accuracy in diagnosing
relationships.
COVID-19 during the peak of the Italian epidemic: A retrospective cor-
relation with RT-PCR testing and analysis of discordant cases. Eur J Radiol
References 2020;130:109192.
1. Johns Hopkins University School of Medicine. Coronavirus Resource Center. 22. Ciccarese F, Coppola F, Spinelli D, et al. Diagnostic Accuracy of North
https://coronavirus.jhu.edu/. Updated June 13, 2020. Accessed August 24, America Expert Consensus Statement on Reporting CT Findings in Patients
2020. with Suspected COVID-19 Infection: An Italian Single Center Experience.
2. Rubin GD, Ryerson CJ, Haramati LB, et al. The Role of Chest Imaging in Radiol Cardiothorac Imaging 2020;2(4):e200312.
Patient Management During the COVID-19 Pandemic: A Multinational Con- 23. Hermans JJR, Groen J, Zwets E, et al. Chest CT for triage during CO-
sensus Statement From the Fleischner Society. Chest 2020;158(1):106–116. VID-19 on the emergency department: myth or truth? Emerg Radiol
3. Huang C, Wang Y, Li X, et al. Clinical features of patients infected with 2019 2020;27(6):641–651.
novel coronavirus in Wuhan, China. Lancet 2020;395(10223):497–506. 24. Korevaar DA, Kootte RS, Smits LP, et al. Added value of chest computed
4. Adams HJA, Kwee TC, Yakar D, Hope MD, Kwee RM. Systematic Review tomography in suspected COVID-19: an analysis of 239 patients. Eur Respir
and Meta-Analysis on the Value of Chest CT in the Diagnosis of Coronavirus J 2020;56(2):2001377.
Disease (COVID-19): Sol Scientiae, Illustra Nos. AJR Am J Roentgenol 25. Miranda Magalhães Santos JM, Paula Alves Fonseca A, Pinheiro Zarattini
2020;215(6):1342–1350. Anastacio E, Formagio Minenelli F, Furtado de Albuquerque Cavalcanti C,
5. Prokop M, van Everdingen W, van Rees Vellinga T, et al. CO-RADS: Borges da Silva Teles G. Initial Results of the Use of a Standardized Diagnostic
A Categorical CT Assessment Scheme for Patients Suspected of Having Criteria for Chest Computed Tomography Findings in Coronavirus Disease
COVID-19-Definition and Evaluation. Radiology 2020;296(2):E97–E104. 2019. J Comput Assist Tomogr 2020;44(5):647–651.
6. Simpson S, Kay FU, Abbara S, et al. Radiological Society of North America 26. Dofferhoff ASM, Swinkels A, Sprong T, et al. Diagnostic algorithm for
Expert Consensus Statement on Reporting Chest CT Findings Related to COVID-19 at the ER [in Dutch]. Ned Tijdschr Geneeskd 2020;164:D502.
COVID-19. Endorsed by the Society of Thoracic Radiology, the American 27. Adams HJA, Kwee TC, Yakar D, Hope MD, Kwee RM. Chest CT Imaging
College of Radiology, and RSNA - Secondary Publication. J Thorac Imaging Signature of Coronavirus Disease 2019 Infection: In Pursuit of the Scientific
2020;35(4):219–227. Evidence. Chest 2020;158(5):1885–1895.
7. de Jaegere TMH, Krdzalic J, Fasen BACM, Kwee RM. Radiological 28. Adams HJA, Kwee TC, Kwee RM. Coronavirus Disease 2019 and Chest CT:
Society of North America Chest CT Classification System for Reporting Do Not Put the Sensitivity Value in the Isolation Room and Look Beyond
COVID-19 Pneumonia: Interobserver Variability and Correlation with the Numbers. Radiology 2020;297(1):E236–E237.
Reverse-Transcription Polymerase Chain Reaction. Radiol Cardiothorac 29. Altmayer S, Zanon M, Pacini GS, et al. Comparison of the computed
Imaging 2020;2(3):e200213. tomography findings in COVID-19 and other viral pneumonia in im-
8. Suchá D, Van Hamersvelt RW, van den Hoven AF, de Jong PA, Verkooijen munocompetent adults: a systematic review and meta-analysis. Eur Radiol
HM. Suboptimal Quality and High Risk of Bias in Diagnostic Test Ac- 2020;30(12):6485–6496.
curacy Studies on Chest Radiography and Computed Tomography in the 30. Kim H, Hong H, Yoon SH. Diagnostic Performance of CT and Reverse
Acute Setting of the COVID-19 Pandemic: A Systematic Review. Radiol Transcriptase Polymerase Chain Reaction for Coronavirus Disease 2019: A
Cardiothorac Imaging 2020;2(4):e200342. Meta-Analysis. Radiology 2020;296(3):E145–E155.
9. PRISMA. Transparent reporting of systematic reviews and meta-analyses. 31. World Health Organization. Laboratory testing for 2019 novel coronavirus
http://www.prisma-statement.org/. Updated 2015. Accessed August 24, (2019-nCoV) in suspected human cases. https://www.who.int/publications/i/
2020. item/10665-331501. Updated March 19, 2020. Accessed August 24, 2020.
10. Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: a revised tool 32. World Health Organization. Advice on the use of point-of-care immunodiag-
for the quality assessment of diagnostic accuracy studies. Ann Intern Med nostic tests for COVID-19. https://www.who.int/news-room/commentaries/
2011;155(8):529–536. detail/advice-on-the-use-of-point-of-care-immunodiagnostic-tests-for-
11. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. covid-19. Updated April 8, 2020. Accessed August 24, 2020.
Bivariate analysis of sensitivity and specificity produces informative summary 33. National Institute for Public Health and the Environment. Testing for
measures in diagnostic reviews. J Clin Epidemiol 2005;58(10):982–990. COVID-19. https://www.rivm.nl/en/novel-coronavirus-covid-19/testing-
12. OpenMeta[Analyst]. http://www.cebm.brown.edu/openmeta/. Accessed for-covid-19. Updated August 17, 2020. Accessed August 24, 2020.
August 24, 2020.