Professional Documents
Culture Documents
ABSTRACT
Pneumonia is an infection in one or both lungs. It can be caused by bacteria, viruses, or
fungi. Pneumonia causes inflammation in the air sacs in your lungs, which are called
alveoli. The alveoli fill with fluid or pus, making it difficult to breathe. According to
National Institutes of Health (NIH), chest x ray is the best test for pneumonia diagnosis.
However, reading x ray images can be tricky and requires domain expertise and
experience. It would be nice if we can just ask a computer to read the images and tell us
the results.
The chest X-ray is one of the most commonly accessible radiological examinations
for screening and diagnosis of many lung diseases. A tremendous number of X-ray
imaging studies accompanied by radiological reports are accumulated and stored in
many modern hospitals. we present a new chest X-ray database, namely “ChestX-
ray”, which comprises frontal-view X-ray images of some unique patients with the text-
mined eight disease image labels (where each image can have multi-labels), from the
associated radiological reports. Importantly, we demonstrate that these commonly
occurring thoracic disease(pneumonia) can be detected and even spatially-located via a
unified weakly-supervised multi-label image classification and disease localization
framework, which is validated using our proposed dataset. Although the initial
quantitative results are promising as reported, deep convolution neural network based
“reading chest X-rays” (i.e., recognizing and locating the common disease patterns
trained with only image-level labels) remains a strenuous task for fully-automated high
precision CAD systems.
LITERATURE SURVEY
More than 1 million adults are hospitalized with pneumonia and around 50,000 die from
the disease every year in the US alone (CDC, 2017). Chest X-rays are currently the best
available method for diagnosing pneumonia (WHO, 2001), playing a crucial role in
clinical care (Franquet, 2001) and epidemiological studies (Cherian et al., 2005).
However, detecting pneumonia in chest X-rays is a challenging task that relies on the
availability of expert radiologists. In this work, we present a model that can automatically
detect pneumonia from chest X-rays at a level exceeding practicing radiologists. This
model is a multiple layered convolutional neural network that inputs a chestX-ray image
and outputs the probability of pneumonia along with a heat map localizing the areas of
the image most indicative of pneumonia. We train model on the recently released ChestX-
ray dataset (Wanget al., 2017), which contains 112,120 frontal-view chest X-ray images
individually labelled with up to 14 different thoracic diseases, including pneumonia.
They use dense connections (Huang et al., 2016) and batch normalization (Ioffe &
Szegedy, 2015) to make the optimization of such a deep network tractable. Detecting
pneumonia in chest radiography can be difficult for radiologists. The appearance of
pneumonia in X-ray images is often vague, can overlap with other diagnoses, and can
mimic many other benign abnormalities. These discrepancies cause considerable
variability among radiologists in the diagnosis of pneumonia (Neuman et al., 2012;
Davies et al., 1996; Hopstakenet al., 2004). To estimate radiologist performance, we
collect annotations from four practicing academic radiologists on a subset of 420 images
from ChestX-ray14. On these 420 images, we measure performance of individual
radiologists using the majority vote of other radiologists as ground truth, and similarly
measure model performance.
We find that the model exceeds the average radiologist performance at the pneumonia
detection task on both sensitivity and specificity. To compare CheXNet against previous
work using ChestX-ray14, we make simple modifications to CheXNet to detect all 14
diseases in ChestX-ray14, and find that we outperform best published results on all 14
diseases. Automated detection of diseases from chest X-rays at the level of expert
radiologists would not only have tremendous benefit in clinical settings, it would also be
invaluable in delivery of health care to populations with inadequate access to diagnostic
imaging specialists.
2 CheXNet
The pneumonia detection task is a binary classification problem, where the input is a
frontal-view chest X-ray image X and the output is a binary label y 2 f0; 1g indicating the
absence or presence of pneumonia respectively. For a single example in the training set,
we optimize the weighted binary cross entropy loss L(X; y) = w+ _ y log p(Y =
1jX)w _ (1 y) log p(Y = 0jX); where p(Y = ijX) is the probability that the network
assigns to the label i, w+ = jNj=(jPj+jNj), and w = jPj=(jPj+jNj) with jPj and jNj the
number of positive CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays
with Deep Learning cases and negative cases of pneumonia in the training set
respectively.
3. Data
3.1. Training
We use the ChestX-ray14 dataset released by Wang et al. (2017) which contains 112,120
frontal-views X-ray of 30,805 unique patients. Wang et al. (2017) annotate each image
with up to 14 different thoracic pathology labels using automatic extraction methods on
radiology reports. We label images that have pneumonia as one of the annotated
pathologies as positive examples and label all other images as negative examples. For the
pneumonia detection task, we randomly split the dataset into training (28744 patients,
98637 images), validation (1672 patients, 6351 images), and test (389 patients, 420
images). There is no patient overlap between the sets. Before inputting the images into the
network, we downscale the images to 224_224 and normalize based on the mean and
standard deviation of images in the Image Net training set. We also augment the training
data with random horizontal flipping.
3.2. Test
We collected a test set of 420 frontal chest X-rays. Annotations were obtained
independently from four practicing radiologists at Stanford University, who were asked to
label all 14 pathologies in Wang et al. (2017). The radiologists had 4, 7, 25, and 28 years
of experience,
and one of the radiologists is a sub-specialty fellowship trained thoracic radiologist.
Radiologists did not have access to any patient information or knowledge of disease
prevalence in the data. Labels were entered into a standardized data entry program.
4.1Comparison
We assess radiologist performance on the test set on the pneumonia detection task. Recall
that each of the images in the test set has a ground truth label from four practicing
radiologists. We evaluate the performance of an individual radiologist by using the
majority
vote of the other three radiologists as ground truth. Similarly, we evaluate CheXNet using
the majority vote of three of four radiologists, repeated four times to cover all groups of
three. We compare CheXNet against radiologists on the Receiver Operating
Characteristic (ROC) curve, which plots model sensitivity against 1 - specificity. Figure 2
illustrates the model ROC curve as well as the four radiologists and average radiologist
operating points: a single radiologist's performance is represented by an orange marker,
while the average is represented by green. CheXNet outputs the probability of detecting
pneumonia in a Chest X-ray, and the ROC curve is generated by varying the thresholds
used for the classification boundary. CheXNet has an AUROC of 0.828 on the test set.
The sensitivity-specificity point for each radiologist and for the average lie below the
ROC curve, signifying that CheXNet is able to detect pneumonia at a level matching or
exceeding radiologists.
We randomly split the dataset into training (70%), validation (10%), and test (20%) sets,
following previous work on ChestX-ray14 (Wang et al., 2017; Yao et al., 2017). We
ensure that there is no patient overlap between splits. We compare the per-class AUROC
of the model against the previous state of the art held by Yet al. (2017) on 13 classes and
Wang et al. (2017) on the remaining 1 class.
We find that CheXNet achieves state of the art results on all 14 pathology classes. Table 1
illustrates the per class AUROC comparison on the test set. On Mass, Nodule,
Pneumonia, and Emphysema, we outperform previous state of the art considerably (>
0:05 increase in AUROC).
9.References
Wang, Xiaosong, Peng, Yifan, Lu, Le, Lu, Zhiyong,
Bagheri, Mohammadhadi, and Summers, Ronald M.
Chestx-ray8: Hospital-scale chest x-ray database
and benchmarks on weakly-supervised classification
and localization of common thorax diseases. arXiv
preprint arXiv: 1705.02315, 2017.
Gulshan, Varun, Peng, Lily, Coram, Marc, Stumpe,
Martin C, Wu, Derek, Narayanaswamy, Arunachalam,
Venugopalan, Subhashini, Widner, Kasumi,
Madams, Tom, Cuadros, Jorge, et al. Development
and validation of a deep learning algorithm for detection
of diabetic retinopathy in retinal fundus photographs.
Jama, 316(22):2402{2410, 2016.
Problem Identification
Pneumonia accounts for over 15% of all deaths of children under 5 years old
internationally. In 2015, 920,000 children under the age of 5 died from the disease.
Pneumonia is an acute respiratory infection which affects the lungs. It causes difficulty in
breathing and limits oxygen intake. It can be caused by bacteria, fungi or viruses and is a
contagious disease.
While common, accurately diagnosing pneumonia is a tall order. It requires review of a
chest radiograph (CXR) by highly trained specialists and confirmation through clinical
history, vital signs and laboratory exams. Pneumonia usually manifests as an area or areas
of increased opacity [3] on CXR. However, detecting pneumonia in chest X-rays is a
challenging task that relies on the availability of expert radiologists. It would be nice if
we can just ask a computer to read the images and tell us the results.
Problem Solution
The above challenge can be solved by building an algorithm to detect a visual signal for
pneumonia in medical images. Specifically algorithm needs to automatically locate lung
opacities on chest radiographs.
We develop an algorithm that can detect pneumonia from chest X-rays at a level
exceeding practicing radiologists. Here algorithm is a layered convolutional neural
network trained on ChestX-ray.
Objectives
1) This project could help deliver accurate diagnoses quickly and accurately without the
need for an expert radiologist.
2) Automating this process would help expedite diagnosis in clinical settings, helping
patients get early diagnosis and treatment.
3) For instance, in the case of pneumonia, this could provide early diagnosis leading to
early treatment, which is critical for preventing complications and death.
4) This could also help fill the need for expert diagnosis in the many places around the
world with inadequate access to radiologists.
5) In this way, it could help deliver healthcare to vulnerable populations.
6) This study not only shows the potential for an automated diagnosis for pneumonia, but
also for 13 other thoracic pathologies, including cardiomegaly, edema, emphysema and
hernia.
7) And the technology may be tweaked to provide even better diagnosis.
8) This study only allowed the radiologists and developer to analyze the X-ray images
with a frontal view.
9) However, the algorithm may be modified to also interpret images that present a lateral
view of the patient, potentially increasing the accuracy of diagnosis by 15 percent.
Methodology
1) We use deep learning to build this project.
2) Construction of Hospital-scale chest X-ray Database.
3) Processing Chest X-ray images.
4) Common Pneumonia diseases detection.
5) Localizing using Convolution neural network.
6) Drawing a heat maps for cnn using tensor flow and keras.
HeapMap Localization
Convolutional neural network that takes a chest X-ray image as input, and outputs the
probability of pathology. On this example, CheXNet correctly detects pneumonia and
also localizes areas in the image most indicative of the pathology.
Pathology Localization
Global Pooling Layer and Prediction Layer:
In our multi-label image classification network, the global pooling and the predication
layer are designed not only to be part of the DCNN for classification but also to generate
the likelihood map of pathologies, namely a heatmap. The location with a peak in the
heatmap generally corresponds to the presence of disease pattern with a high probability.
Limitations
1) Neither the model nor the radiologist were permitted to use the prior examinations
or patient history which have been shown to decrease radiologist performance.
2) Only frontal radiographs were presented to the radiologist and the model during
diagnosis, but it has been shown that up to 15% of accurate diagnoses require the
lateral view.
Conclusion
1) Early diagnosis and treatment of pneumonia is critical to preventing complications
including death.
2) There is a shortage of experts who can interpret X-rays even when imaging
equipment is available to prevent this project will exceeds the performance of
radiologist.