You are on page 1of 9

INT J TUBERC LUNG DIS 22(3):328–335

Q 2018 The Union


http://dx.doi.org/10.5588/ijtld.17.0520

Detection of tuberculosis patterns in digital photographs of


chest X-ray images using Deep Learning: feasibility study

A. S. Becker,* C. Blüthgen,* V. D. Phi van,* C. Sekaggya-Wiltshire,† B. Castelnuovo,† A. Kambugu,†


J. Fehr,‡ T. Frauenfelder*
*Institute of Diagnostic and Interventional Radiology, University Hospital of Zurich, Zurich, Switzerland;

Infectious Disease Institute, College of Health Sciences, Makerere University, Kampala, Uganda; ‡Division of
Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland

SUMMARY

O B J E C T I V E : To evaluate the feasibility of Deep Learn- R E S U LT S : The study cohort was 138 patients with
ing-based detection and classification of pathological human immunodeficiency virus (HIV) and TB co-
patterns in a set of digital photographs of chest X-ray infection (median age 34 years, IQR 28–40); 54 patients
(CXR) images of tuberculosis (TB) patients. were female. Localisation of pathological areas was
M A T E R I A L S A N D M E T H O D S : In this prospective, excellent (area under the ROC curve 0.82). The software
observational study, patients with previously diagnosed could perfectly distinguish pleural effusions from intra-
TB were enrolled. Photographs of their CXRs were parenchymal changes. The most frequent misclassifica-
taken using a consumer-grade digital still camera. The tions were consolidations as cavitations, and miliary
images were stratified by pathological patterns into patterns as interstitial patterns (and vice versa).
classes: cavity, consolidation, effusion, interstitial chang- C O N C L U S I O N : Deep Learning analysis of CXR photo-
es, miliary pattern or normal examination. Image graphs is a promising tool. Further efforts are needed to
analysis was performed with commercially available build larger, high-quality data sets to achieve better
Deep Learning software in two steps. Pathological areas diagnostic performance.
were first localised; detected areas were then classified. K E Y W O R D S : CXR; teleradiology; TB; Deep Learning;
Detection was assessed using receiver operating charac- chest radiograph
teristics (ROC) analysis, and classification using a
confusion matrix.

IT IS ESTIMATED THAT, WORLDWIDE, 8 million these handheld devices (Figure 1). As the sending and
people develop active tuberculosis (TB) every year. receiving of images between mobile phones and a
The disease, if left untreated, has a mortality rate of server can be implemented easily today, we focus on
~70%,1 with 2 million people dying each year from the actual image analysis (Figure 1, right-hand
TB or its complications. As TB is only contagious in image).
its active, pulmonary form, chest X-ray (CXR) The human visual system allows for a very high
images play a crucial role in identifying affected level of abstraction.6 Radiologists can therefore
patients and taking preventive measures.2,3 detect abnormalities on original X-ray examinations
The prevalence of TB is highly dependent on socio- as well as on digital photographs of X-ray films.7 As
economic factors. This is evident by its decline in an example, to the reader of this article the two CXR
Western Europe in the past century with increasing images in the top row of Figure 2 will look very
wealth, long before effective chemotherapies were similar. To the computer, however, these images are
available.4 Today, the majority of the disease burden fundamentally different, as indicated by the histo-
is concentrated in only six countries,5 one of the grams in the middle row. Not only does the digital
common denominators being large areas with poor photograph come at a lower resolution than the
socio-economic development. In these areas, very few original image, the addition of spurious colour
health care facilities and physicians are available. channels by the digital camera adds unnecessary
However, with the advent of smartphones and mobile noise and lowers the signal-to-noise ratio (SNR),
internet, it may be feasible to offer help in the form of which may lower the ability of both computers and
automated diagnosis using the integrated camera of humans to detect subtle lesions close to the inherent

Correspondence to: Anton S Becker, Institute for Diagnostic and Interventional Radiology, University Hospital Zurich,
Raemistrasse 100, 8091 Zurich, Switzerland. e-mail: anton.becker@usz.ch
Article submitted 26 July 2017. Final version accepted 15 November 2017.
Deep Learning on TB CXR photographs 329

Figure 1 Proposed image processing pipeline. Possible workflow for a teleradiological TB


detection service for health care providers working remotely (schematic). After taking a
photograph of the original chest X-ray with a smartphone application, the image is transferred
wirelessly to a remote server, where image analysis is carried out in two steps: anomaly detection
and classification. The server then sends the image with an overlaid heatmap and a structured
report back to the user (health care provider), who may use it to augment his/her clinical
judgement for optimal management of the patient.

noise limit (bottom row). Furthermore, other objects stratified by the dominant pathological pattern:
in the image (such as the Post-itw Notes in our cavity, consolidation, effusion, interstitial changes,
example) may add further ‘real-world noise’. For a miliary pattern or normal examination.
computer algorithm to be successful in this task, it
would thus need to exhibit a high ability for Image analysis
abstraction, similar to that for human readers. A For image analysis, industrial-grade Deep Learning
novel machine-learning technique called Deep Learn- image analysis software (Suite v2.0; ViDi Systems,
ing has recently yielded remarkable results, with its Villaz-Saint-Pierre, Switzerland) was used.9,10 In Deep
performance becoming increasingly human-like in Learning, artificial neural networks are arranged in
recent years.8 multiple layers, which imitates the brain, their natural
The purpose of the present study was to evaluate counterpart.11 The neurons of the mammalian neocor-
the feasibility of Deep Learning-based detection and tex are organised in multiple layers, which is particu-
classification of pathological patterns in a set of larly apparent in the human visual cortex. This means
digital photographs of CXR images of TB patients that any given input percept will be deconstructed and
from Uganda. represented at multiple levels of abstraction, each
corresponding to a different layer or cortical area.6
Although currently not approved for routine clinical
STUDY POPULATION AND METHODS
use, ViDi has recently shown human-like performance
Patient population in the detection of breast cancer.12
This was a prospective observational study of patients Computations were carried out using a GeForce
enrolled at the integrated TB-HIV (human immunode- GTX 1080 graphics processor unit (Nvidia, Santa
ficiency virus) out-patient clinic of the Infectious Clara, CA, USA). Anomalies were first contoured by
Diseases Institute in Kampala, Uganda. CXRs were two investigators in consensus (ASB, TF) using the
taken in the Radiology Department of Mulago ViDi Detection tool for supervised training. A
National Referral Hospital, which is the main national randomly chosen subset of images (n ¼ 117, 85%)
and teaching hospital providing specialised care to the was used to train the software, and the remaining
whole country. The study received ethics approval from cases (n ¼ 21) were used to validate the resulting
the Joint Clinical and Research Centre Ethics Com- model (cross-validation). The detected anomalies
mittee, Kampala, and the Uganda National Council for were then transferred as individual image patches to
Science and Technology, Kampala (HS 1303). the ViDi classification tool and labelled by the same
All photographs were taken using a digital still investigators. Again, a randomly chosen subset of
camera (DMC-TZ56; Panasonic, Osaka, Japan; see image patches (n ¼ 150, 80%) was used to train the
Table 1 for full details). The images were screened by software, and the remaining patches (n ¼ 40) were
two investigators in consensus (ASB, TF) and used for validation.
330 The International Journal of Tuberculosis and Lung Disease

Figure 2 Illustrated comparison of original X-ray and photograph. Examples of an original digital chest X-ray image (DCXr, top left)
and a digital photograph of a film radiograph (photo, top right). Although to the human observer the images look very much alike, the
histogram (middle row) indicates that they look fundamentally different to a computer. While the DCXr only contains greyscale
information, a photograph typically contains three additional separate colour channels, which are an ‘artifact’ of the camera sensor
and do not convey any meaningful information, as the original radiograph only contains greyscale information as well. This scenario
results in a decreased signal-to-noise ratio (bottom row) and hence a smaller real-world information content per image, which lowers
the conspicuousness of subtle differences/lesions (top right and bottom left circles).

Table 1 Technical details of the Panasonic Lumix DM-TZ57 camera*


Sensor Sensor size/total pixels/filter 1/2.33-type high-sensitivity MOS sensor/total pixel number 17.5 megapixels/primary colour filter
Pixels Camera effective pixels 16 megapixels
Lens
Aperture F3.3–6.4/multistage iris diaphragm (F3.3–8.0 (W), F6.4–8.0 (T))
Optical zoom 203
Focal length f ¼ 4.3–86.0 mm (24–480 mm in 35 mm equivalent)
Lens Lumix DC Vario/12 elements in 10 groups/(3 aspherical lenses/6 aspherical surfaces/2 extra-low
dispersion lenses)
Focus
Focusing area Normal: wide 50 cm–infinity/tele 200 cm–infinity/AF macro/intelligent auto/motion picture: wide 3
cm– infinity/tele 100 cm–infinity
Focus Normal/AF macro/macro zoom/quick AF (always on), continuous AF (only for motion picture)/AF
tracking
Shutter Shutter speed Approximately 4–1/2000 s (0.2 s)
Exposure parameters
ISO sensitivity Auto/i.ISO/100/200/400/800/1600/3200/high-sensitivity mode (ISO1600–6400)
White balance Auto/daylight/cloudy/shade/incandescent/white set/white balance adjustment (except auto)
Flash Not used
* Panasonic, Osaka, Japan.
MOS ¼ metal–oxide–semiconductor; AF ¼ auto focus; ISO ¼ International Organization for Standardization.
Deep Learning on TB CXR photographs 331

Table 2 Disease patterns in the study cohort Image analysis


Pattern n The training time for pathology detection was 2 min.
Cavity 10 The processing time per validation image with the
Consolidation 68 final model was 53.4 6 1.4 msec. Detection
Effusion 18 performance was excellent, with AUC ¼ 0.98
Interstitial 14
Miliary 8 (validation set 0.82) (Figure 3A).
None (controls) 20 More than one feature was often present in the
patches detected (e.g., consolidation around a cavity,
as shown in Figure 4A, which looks similar to the
Statistical analysis
pure consolidation in Figure 4B). As the software
Due to the preliminary nature of the present study, the does not support multiple labels per patch, and due to
statistical analysis is mainly descriptive. Continuous
the limited size of our data set, the most dominant
variables are expressed as mean and standard devia-
feature was used when labelling the patches. Training
tion; categorical variables as counts and percentages.
time was 3 min; processing time per validation image
Detection performance was assessed using receiver
was 39.4 6 9.7 msec. The confusion matrix is
operating characteristics (ROC) analysis with the area
illustrated in Figure 3B: overall, the software was
under the curve (AUC). Pathology classification was
able to perfectly distinguish pleural effusions from
assessed with a confusion matrix generated using
intraparenchymal patterns (specificity and PPV ¼ 1.0
Python 3.5.2, seaborn 0.7.1, numpy 1.12.0 and pandas
0.19.2 packages (Python Software Foundation, Beaver- overall and in the validation set, see example in Figure
ton, OR, USA). The specificity and positive predictive 4C). The most frequent false diagnoses were cavita-
value (PPV) were calculated for each feature in the tions misclassified as consolidations or vice versa (n ¼
whole data set as well as for the validation data only. 4), and a miliary pattern mistaken for an interstitial
pattern (n ¼ 5) and vice versa, resulting in the
markedly lower sensitivity and PPV for these patterns
RESULTS (Table 3). Furthermore, miliary and interstitial
Study cohort patterns were sometimes misclassified as consolida-
Our cohort comprised 138 HIV-infected patients tions, but not vice versa (see example in Figure 4D).
diagnosed with TB (median age 34 years, interquar-
tile range 28–40). Over 95% of these participants DISCUSSION
presented to the clinic with a cough of 72 weeks’
duration, and .80% had fever and weight loss, in The detection and classification of pathology as seen
accordance with their diagnosis of TB. The number of in digital photographs of CXR images of TB patients
different patterns is given in Table 2. is feasible with Deep Learning. Despite the draw-

Figure 3 A) Receiver operating characteristic curve of disease detection in the whole cohort demonstrates the excellent ability of the
software to localise abnormal areas in the chest X-ray photograph (AUC 0.98). B) Confusion matrix of the classification of the
different patterns. Pleural effusions were detected with excellent accuracy (1.0), whereas cavities were sometimes misdiagnosed as
consolidations (probably due to surrounding consolidations). Diagnosis of interstitial and miliary patterns also exhibited a greater
degree of uncertainty. AUC ¼ area under the receiver operating characteristic curve.
332 The International Journal of Tuberculosis and Lung Disease

Figure 4 A) This cavity with surrounding consolidations in the right upper lobe was correctly
detected and diagnosed. B) Sample image of consolidations in the left upper lobe, which were
detected and classified correctly by the software. C) An example of a small left-sided pleural
effusion, which was reliably detected and classified. D) Female patient with extensive bilateral
miliary tuberculosis. Although pathological changes were detected correctly, they were falsely
identified as consolidations.
Deep Learning on TB CXR photographs 333

Table 3 Sensitivity, specificity, PPV and AUC for the different patterns overall (all) and untrained
samples only (valid)
Sensitivity Specificity PPV AUC
All Valid All Valid All Valid All Valid
Cavity 0.86 0.50 0.98 0.92 0.67 0.25 0.98 0.90
Consolidation 0.95 0.74 0.95 0.76 0.95 0.74 0.95 0.75
Effusion 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00
Interstitial 0.88 0.43 0.97 0.85 0.85 0.38 0.95 0.78
Miliary 0.84 0.29 0.99 0.94 0.93 0.50 0.96 0.83
PPV ¼ positive predictive value; AUC ¼ area under the ROC curve.

backs of digital photography, image analysis yielded solving has been tremendously successful in the past
useful preliminary results. few years in financial modelling, marketing and
Due to the often negative smear results in HIV/ education, as well as manufacturing and medical
AIDS (acquired immune-deficiency syndrome) pa- imaging.19 The latter two fields have a key compo-
tients, CXR may be the main tool for establishing a nent in common: there is often only a small amount of
diagnosis of active TB in these patients and is high-quality data available for a given manufacturing
endorsed by the World Health Organization,13 line, or body part and modality. To put things in
despite some conflicting evidence in the literature.14 perspective, in machine learning research, a large data
One possible reason for this could be the shortage of set would contain ~14 million annotated images with
expert physicians in affected areas. Non-expert 750 000 samples per high-level category (‘ImageNet
readers, even after several training sessions and Summary and Statistics’. http://image-net.org/about-
implementation of a systematic reading and reporting stats accessed July 2017).
system, may exhibit only limited diagnostic accuracy We therefore chose to evaluate Deep Learning
in detecting X-ray signs of active TB.15–17 With software conceived for industrial use because we
rapidly expanding access to smartphone cameras and hypothesised that due to these shared problems, our
mobile internet in the developing world, diagnosing medical image analysis may benefit from the solu-
active TB with the help of these tools appears to be a tions engineered into such software. The drawback to
logical next step in resource-limited settings. this method is the proprietary network architecture,
However, digitising the film using a consumer- which makes it impossible to understand or improve
grade digital camera sensor adds spurious colour the underlying algorithm itself. In contrast, current
channels and electronic noise to the original greyscale state-of-the-art open-source machine learning frame-
image. In larger centres with an existing digital works such as TensorFlow (Google, Mountain View,
workflow and a medical-grade film digitiser, the CA, USA) or Keras (Massachusetts Institute of
‘smartphone approach’ may be of limited use, as Technology, Cambridge, MA, USA) enable access to
Deep Learning can be directly integrated using the the network architecture and hyperparameters as well
original Digital Imaging and Communications in as visualisation of activation patterns in each layer of
Medicine (DICOM) files over a local area network the network. We project that such freely available,
(LAN) connection. This has already been practised open-source software will play an important role in
for decades in Europe and the United States using the research and development of machine learning
traditional computer-aided detection (CAD) pro- tools for the detection of TB in CXR examinations.
grammes. Due to the higher SNR, more homoge- There are three inherent limitations to Deep
neous image characteristics and faster data transfer, Learning. First, especially in smaller data sets, there
more reliable results with less training data could be is a significant danger of overfitting the model. This
expected. However, the majority of health care means that the trained network will not generalise
facilities in the developing world remain dependent well to new, unseen cases. Usually, this is assessed by
on plain-film CXR examinations and a non-digital (cross-)validation, i.e., setting aside a percentage of
workflow. Studying the challenges associated with an the data ‘not shown’ to the algorithm and evaluating
indirect workflow via a digital (smartphone) camera the performance on this validation set. Second, the
thus seems justified. exact computations that take place are, due to the
Machine learning is a field in computer science in sheer complexity of the network, not traceable or
which the computer is trained with sample data comprehensible for the human observer. This means
instead of being explicitly programmed to perform a that the resulting model is something of a ‘black box’.
task. The machine learning algorithm, Deep Learning The use of heat maps which highlight the features
in our case, analyses the existing relationships and learned to be suspicious by the neural network may
features of training data and attempts to find useful somewhat compensate for this drawback.20 Third,
patterns.18 This data-driven approach to problem many authors use excessive downsampling of the
334 The International Journal of Tuberculosis and Lung Disease

images. For example, in a recent study on identifying mediastinal involvement (enlarged and/or calcified
TB patterns in CXR images using Deep Learning by lymph nodes or pericardial effusions), airway affec-
Lakhani and Sundaram, images were scaled down to tion (bronchiectasis or tracheobronchial stenosis) or
a resolution of 256 3 256 pixels.21 This is usually sequelae such as aspergilloma (present in up to 11%
necessary to improve efficiency in terms of memory of patients with chronic disease27) or scar-based
and computing time. However, in contrast to, for changes. For the latter category, it would be helpful to
example, object identification in the real world, evaluate the course of the disease on the basis of serial
diagnostic imaging often relies on the detection of CXR images, a problem suitable for recurrent neural
subtle changes. In particular, initial changes in early networks.28
disease stages are often only detectable at full In conclusion, analysis of even a few CXR
resolution. It may thus be preferable to build photographs from TB patients is feasible using Deep
networks which take advantage of the high resolution Learning software. Although further efforts are
as, for example, in a recent study on mammogra- needed to build larger, high-quality data sets and
phy.22 achieve better diagnostic performance, our approach
The first step, the detection of pathological may be useful to develop a machine learning-based
patterns, worked very well, despite the small number application to help diagnose TB in remote or
of cases. Keeping in mind that the neural networks of underdeveloped areas.
the software were engineered to find anomalies in
highly repetitive textures and patterns, this remains Acknowledgements
surprising, as CXR examinations are composed of The authors thank all patients for their participation and their
fairly complex patterns with a high degree of natural families for their support, as well as the health care workers at
and normal variability. Moreover, performance was Infectious Diseases Institute, Kampala, Uganda; M Bauer for image
slightly higher (AUC 0.82) than traditional computer- digitalisation and N Eberhard for her support while working in
aided systems specifically developed for TB detection Uganda.
Conflicts of interest: none declared.
(AUC 0.71–0.75),23,24 which is comparable with
human performance.25 When classifying the different
manifestations of the disease, our results showed a References
clear strength in differentiating between effusion and 1 Tiemersma E W, van der Werf M J, Borgdorff M W, Williams B
parenchymal changes. As demonstrated by the worse G, Nagelkerke N J D. Natural history of tuberculosis: duration
performance in the validation set, differentiation and fatality of untreated pulmonary tuberculosis in HIV
negative patients: a systematic review. PLOS ONE 2011; 6:
between intraparenchymal patterns proved to be
e17601.
more challenging for three main reasons. First, for 2 Skoura E, Zumla A, Bomanji J. Imaging in tuberculosis. Int J
our pilot study, only a limited data set was available. Infect Dis 2015; 32: 87–93.
We therefore expect our performance to improve with 3 Somashekar N, Chadha V K, Praseeja P, et al. Role of pre-
more experience (which equates to more data), for Xpertw screening using chest X-ray in early diagnosis of smear-
which we have planned a follow-up study. Second, negative pulmonary tuberculosis. Int J Tuberc Lung Dis 2014;
18: 1243–1244.
purely isolated patterns are rarely present in real-
4 Davies P, Gordon S B, Davies G. Clinical tuberculosis. Boca
world clinical scenarios. Cavities may often be Raton, FL, USA: CRC Press Book, 2014.
surrounded by consolidations or prominent intersti- 5 World Health Organization. Global tuberculosis report, 2016.
tial markings or, in addition to a miliary pattern, WHO/HTM/TB/2016.13. Geneva, Switzerland: WHO, 2016.
consolidations or thickened interstitial septae may be 6 Serre T, Kreiman G, Kouh M, Cadieu C, Knoblich U, Poggio T.
present. This leads to a certain degree of ‘confusion’ A quantitative theory of immediate visual recognition. In: Cisek
P, Drew T, Kalaska J, eds. Computational neuroscience:
in the neural network due to the inherent imprecision
theoretical insights into brain function. Progress in Brain
of the data. As a result, inexperienced human readers Research. Vol 165. Amsterdam, The Netherlands: Elsevier,
also exhibit poor consensus with regard to character- 2007: pp 33–56.
ising intraparenchymal patterns in bedside CXR 7 Szot A, Jacobson F L, Munn S, et al. Diagnostic accuracy of
images but not when diagnosing pleural effusions.26 chest X-rays acquired using a digital camera for low-cost
Finally, small fluid- or debris-filled cavities could be teleradiology. Int J Med Inform 2004; 73: 65–73.
8 Esteva A, Kuprel B, Novoa R A, et al. Dermatologist-level
present within areas of consolidation. As Deep
classification of skin cancer with deep neural networks. Nature
Learning inherits the physical limitations of the given 2017; 542: 115–118.
modality, the density difference between fluid and 9 LeCun Y A, Bottou L, Orr G B, Müller K-R. Efficient BackProp.
soft tissue is too small to be discriminated on plain X- In: Neural networks: tricks of the trade. Lecture Notes in
ray examinations. Computer Science. Vol 7700. Berlin, Heidelberg, Germany:
In our study, we focused our analysis on active Springer Berlin Heidelberg, 2012: pp 9–48.
10 Hinton G E. To recognize shapes, first learn to generate images.
airspace disease and pleural effusions. In a clinical
Prog Brain Res 2007; 165: 535–547.
setting, these are probably the most important 11 Bengio Y. Learning Deep architectures for AI. In: Jordan M, ed.
findings, with immediate consequences for patient Foundations and trendsw in machine learning. Delft, The
management. Future studies could also investigate Netherlands: Now Publishers Inc, 2009.
Deep Learning on TB CXR photographs 335

12 Becker A S, Marcon M, Ghafoor S, Wurnig M C, Frauenfelder 21 Lakhani P, Sundaram B. Deep Learning at chest radiography:
T, Boss A. Deep Learning in mammography: diagnostic automated classification of pulmonary tuberculosis by using
accuracy of a multipurpose image analysis software in the convolutional neural networks. Radiology 2017; 284: 574–
detection of breast cancer. Invest Radiol 2017; 52: 434–440. 582.
13 Harries A D, Maher D, Graham S. TB/HIV: a clinical manual. 22 Geras K J, Wolfson S, Kim S G, Moy L, Cho K. High-resolution
Geneva, Switzerland: World Health Organization, 2004. breast cancer screening with multi-view deep convolutional
14 Davis J L, Worodria W, Kisembo H, et al. Clinical and neural networks. Computer Vision and Pattern Recognition, 6
radiographic factors do not accurately diagnose smear-negative November 2017.
tuberculosis in HIV-infected inpatients in Uganda: a cross- 23 Steiner A, Mangu C, van den Hombergh J, et al. Screening for
sectional study. PLOS ONE 2010; 5: e9859. pulmonary tuberculosis in a Tanzanian prison and computer-
15 Kosack C S, Spijker S, Halton J, et al. Evaluation of a chest aided interpretation of chest X-rays. Public Health Action
radiograph reading and recording system for tuberculosis in a 2015; 5: 249–254.
HIV-positive cohort. Clin Radiol 2017; 72: 519.e1–519.e9. 24 Muyoyeta M, Maduskar P, Moyo M, et al. The sensitivity and
16 Semakula-Katende N S, Andronikou S, Lucas S. Digital specificity of using a computer aided diagnosis program for
platform for improving non-radiologists’ and radiologists’ automatically scoring chest X-rays of presumptive TB patients
interpretation of chest radiographs for suspected tuberculosis— compared with Xpert MTB/RIF in Lusaka Zambia. PLOS ONE
a method for supporting task-shifting in developing countries. 2014; 9: e93757.
Pediatr Radiol 2016; 46: 1384–1391. 25 Maduskar P, Muyoyeta M, Ayles H, Hogeweg L, Peters-Bax L,
17 Seddon J A, Padayachee T, Plessis du A-M, et al. Teaching chest van Ginneken B. Detection of tuberculosis using digital chest
X-ray reading for child tuberculosis suspects. Int J Tuberc Lung radiography: automated reading vs. interpretation by clinical
Dis 2014; 18: 763–769. officers. Int J Tuberc Lung Dis 2013; 17: 1613–1620.
18 Kohli M, Prevedello L M, Filice R W, Geis J R. Implementing 26 Martini K, Ganter C, Maggiorini M, et al. Interpretation of
machine learning in radiology practice and research. AJR Am J bedside chest X-rays in the ICU: is the radiologist still needed? J
Roentgenol 2017; 208: 754–760. Clin Imaging 2015; 39: 1018–1023.
19 Jordan M I, Mitchell T M. Machine learning: trends, 27 Fraser R S, Muller N L, Colman N, Pare P D. Fraser and Paré’s
perspectives, and prospects. Science 2015; 349: 255–260. diagnosis of diseases of the chest. Vols 1–4. Philadelphia, PA,
20 Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A. Learning USA: W B Saunders, 1999.
deep features for discriminative localization. 2016 IEEE 28 Schmidhuber J. Deep Learning in neural networks: an overview.
Conference on Computer Vision and Pattern Recognition Neural Networks 2015; 61: 85–117.
(CVPR), 27–30 June 2016, Las Vegas, NV, USA: pp 2921–
2929.
Deep Learning on TB CXR photographs i

R É S U M É
OBJECTIF : Evaluer la faisabilité de l’apprentissage de d’efficacité du receveur (ROC) et la classification par
la détection et de la classification de profils matrice de confusion.
pathologiques sur un ensemble de photographies R É S U L T A T S : La cohorte a inclus 138 patients
numériques de radiographie pulmonaires (CXR) de coinfectés par la TB et le virus de l’immunodéficience
patients atteints de tuberculose (TB). humaine, d’âge médian 34 ans (intervalle interquartile
M A T E R I E L S E T M E T H O D E S : Dans cette étude 28–40), dont 54 ont été des femmes. La localisation des
prospective d’observation, les patients atteints de TB zones pathologiques a été excellente (zone sous la courbe
déjà diagnostiquée ont été enrôlés. Les photographies de ROC 0,82). Le logiciel a été capable de distinguer
leurs CXR ont été prises avec un appareil photo parfaitement les épanchements pleuraux des
numérique grand public. Les images ont été stratifiées modifications intra-parenchymateuses. Les erreurs de
en fonction des profils pathologiques dans les classes classification les plus fréquentes ont été les
suivantes : caverne, consolidation, épanchement, consolidations prises pour des cavernes et les miliaires
modifications interstitielles, miliaire ou aspect normal. confondues avec un profil interstitiel (et vice versa).
L’analyse des images a été réalisée avec un logiciel C O N C L U S I O N : L’apprentissage de l’analyse de
d’apprentissage disponible dans le commerce en deux photographies de CXR est un outil prometteur.
étapes. D’abord, les zones pathologiques ont été Davantage d’efforts sont requis pour créer des
localisées, puis les zones détectées ont été classées. La ensembles de données plus vastes et de bonne qualité
détection a été évaluée par une analyse de fonction et aboutir à une meilleure performance de diagnostic.

RESUMEN
O B J E T I V O: Evaluar la viabilidad de un método basado R E S U L T A D O S: La cohorte incluy ó 138 pacientes
en el aprendizaje profundo, con el fin de detectar y aquejados de coinfecci ón por el virus de la
clasificar de perfiles patológicos en un conjunto de inmunodeficiencia humana y TB, con una edad
fotografı́as digitales de radiografı́as de tórax (CXR) de mediana de 34 años (amplitud intercuartil 28–40).
pacientes con tuberculosis (TB). Cincuenta y cuatro pacientes eran de sexo femenino.
M A T E R I A L E S Y M É T O D O S: En el presente estudio La calidad de la localización de las zonas patológicas fue
prospectivo observacional, se incluyeron pacientes con excelente (área bajo la curva de eficacia diagnóstica
un diagnóstico anterior de TB. Se tomaron fotografı́as 0,82). El programa informático pudo diferenciar
de las CXR de los pacientes con una cámara fotográfica exactamente el derrame pleural de los cambios
digital de calidad para el consumidor. Las imágenes se intraparenquimatosos. Los errores más frecuentes
estratificaron por perfiles patológicos en las siguientes fueron clasificar las cavernas como consolidaciones y
clases: caverna, consolidación, derrame, infiltrados el patrón miliar como un patrón intersticial (y viceversa).
intersticiales, patrón miliar o examen normal. Se llevó C O N C L U S I Ó N: El análisis por aprendizaje profundo de
a cabo un análisis de las imágenes con un programa las fotos de CXR aparece como un instrumento
informático de aprendizaje profundo en dos etapas. En prometedor. Se precisan nuevas iniciativas que
primer lugar, se localizaron las zonas patológicas y luego contribuyan a crear conjuntos de datos más extensos
se clasificaron. Se evaluó la detección mediante una de gran calidad y lograr un mejor rendimiento
curva de eficacia diagnóstica y la clasificación con una diagnóstico.
matriz de confusión.

You might also like