Professional Documents
Culture Documents
FCN and Conditional Random Fields (CRF). Ronneberger et al.[11] technique given by Adrian Rosebrock[15]. Figure 1 illustrates the
and ÃĞiÃğek et al.[12] used UNet architecture which consisted of aforesaid procedure.
two paths. These paths are namely down-sampling and up-sampling
path. The former tries to capture contextual features while the latter
is enables accurate localization with 3D extension. The methods in
2-dimensions ignored information about the depth. Lai[13] used the
depth information by implementing a 3D convolution model which
utilized the correlation between slices. The 3D convolution network
requires a large number of parameters. Moreover, when a dataset
with fewer samples is used, 3D CNN often overfit.
In this paper, different pretrained models are applied using the
transfer learning approach. These are namely VGG-16[3], Inception-
Fig. 1. Finding extreme points in contours and clipping them[15]
v3[4] and ResNet-50[5]. The main reason for choosing transfer learn-
ing is that since these models are already trained on huge datasets,
so their weights can be utilized to predict the brain tumor class by
simply changing the number of target classes in the output layer, 3.1.3 Resizing of images . The input dataset contains images with a
which in this case is two. different dimensions and with different aspect ratio. Therefore, the
images in the dataset are resized to a preset format, since the pre-
3 METHODOLOGY trained models used require the images to be 224×224×3 dimensions.
The proposed methodology is organized into three different steps.
This paper uses Brain MRI Images for Brain Tumor Detection dataset,
which contains a total of 253 images. The dataset has 155 samples of 3.2 Artificial data augmentation
MRI images with a malignant tumor and 98 samples benign tumor. Deep neural networks provide a significant boost in performance
In the first step, data-preprocessing on Brain MRI Images for tumor and produce skilful models whenever trained on more data. It is a
detection dataset[14] is done to run and test the developed model. method to artificially enhance the size of image training data by
In the second step, data augmentation is performed since the size of generating modified images using the original dataset. By using this
the dataset is small. Finally, in the third step, different pre-trained technique, variations are introduced in the images which enhance
CNN models are trained and then utilised to classify a given tumor the capability of the model to learn and generalize better on future
as malignant or benign using transfer learning. unseen data. Thus by introducing variation in the training dataset,
the model becomes generalized and in turn, becomes less prone to
overfitting. In Figure 2, the first image s the original image present
3.1 Data preprocessing in the dataset and the remaining twenty images are generated by
the technique of artificial data augmentation[17].
Data preprocessing can be termed as a data mining technique which
involves the transformation of raw data to a format which is more
interpretable and makes the images more suitable for further pro-
cessing. The following preprocessing steps were involved:
XXX
XXXMetric
Model XXX κ F1-score AUC-ROC Training time(sec) Accuracy
VGG-16 0.80 0.909 0.90 2158.31 90%
Inception-V3 0.10 0.689 0.55 2152.92 55%
Resnet-50 0.90 0.952 0.95 2183.72 95%
had zero false-negative rate on the test data, which makes it suitable [7] El-Dahshan, El-Sayed A., et al. "Computer-aided diagnosis of human brain tumor
for practical purposes. The area under the ROC curve is highest for through MRI: A survey and a new algorithm." Expert systems with Applications
41.11 (2014): 5526-5545.
Resnet-50 and least for inception-V3. The performance of Inception- [8] Emblem, Kyrre E., et al. "Predictive modeling in glioma grading from MR perfusion
v3 was very close to a random classifier since its AUC is 0.55. Resnet- images using support vector machines." Magnetic Resonance in Medicine: An
Official Journal of the International Society for Magnetic Resonance in Medicine
50 achieves the highest test accuracy while the inception-V3 suffered 60.4 (2008): 945-952.
from overfitting. Cohen’s kappa score is again least for inception- [9] Rahmani, Masoumeh, and Gholamreza Akbarizadeh. "Unsupervised feature learn-
v3 and highest for Resnet-50. Thus, by analyzing and comparing ing based on sparse coding and spectral clustering for segmentation of synthetic
aperture radar images." IET Computer Vision 9.5 (2015): 629-638.
the overall effect of all performance metrics, it can be seen that [10] Chang, Peter D. "Fully convolutional deep residual neural networks for brain
Resnet-50 has outperformed the other two models in brain tumor tumor segmentation." International Workshop on Brainlesion: Glioma, Multiple
classification. Sclerosis, Stroke and Traumatic Brain Injuries. Springer, Cham, 2016.
[11] Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. "U-net: Convolutional net-
works for biomedical image segmentation." International Conference on Medical
image computing and computer-assisted intervention. Springer, Cham, 2015.
6 CONCLUSION AND FUTURE SCOPE [12] Cicek, Ozgun, et al. "3D U-Net: learning dense volumetric segmentation from
In this paper, three pre-trained CNN models using transfer learn- sparse annotation." International conference on medical image computing and
computer-assisted intervention. Springer, Cham, 2016.
ing technique are applied to classify brain tumor MRI images into [13] Lai, Matthew. "Deep learning for medical image segmentation." arXiv preprint
malignant and benign. After preprocessing the dataset, the MRI arXiv:1505.02000 (2015).
images are cropped out of dark corners using crop normalization [14] Chakrabarty, Navoneel. âĂIJBrain MRI Images for Brain Tumor Detection.âĂİ
Kaggle, 14 Apr. 2019, https://www.kaggle.com/navoneel/brain-mri-images-for-
technique. Following that data augmentation is performed to intro- brain-tumor-detection.
duce variations in the data samples by increasing the size of training [15] Rosebrock, Adrian. âĂIJFinding Extreme Points in Contours with OpenCV.âĂİ
PyImageSearch, 11 2016, https://www.pyimagesearch.com/2016/04/11/finding-
sets which in turn helped to create a generalizable model and thus extreme-points-in-contours-with-opencv/.
reducing overfitting. Resnet-50 had the best F1 score and highest [16] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classifica-
test accuracy with the zero false-negative rate, which is the pri- tion with deep convolutional neural networks." Advances in neural information
processing systems. 2012.
mary concern in an automatic diagnostic system and thus making [17] Tanner, Martin A., and Wing Hung Wong. "The calculation of posterior distri-
it suitable for practical applications. The area under the ROC curve butions by data augmentation." Journal of the American statistical Association
and Cohen’s Kappa coefficient(κ) are also highest for Resnet-50 82.398 (1987): 528-540.
REFERENCES
[1] Ritchie, Hannah. âĂIJHow Many People in the World Die from Cancer?âĂİ Our
World in Data, Institute for Health Metrics and Evaluation (IHME) , 1 Feb. 2018,
https://ourworldindata.org/how-many-people-in-the-world-die-from-cancer.
[2] Thrun, Sebastian, and Lorien Pratt, eds. Learning to learn. Springer Science &
Business Media, 2012.
[3] Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks
for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).
[4] Szegedy, Christian, et al. "Rethinking the inception architecture for computer
vision." Proceedings of the IEEE conference on computer vision and pattern
recognition. 2016.
[5] He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of
the IEEE conference on computer vision and pattern recognition. 2016.
[6] Beno, M. Marsaline, et al. "Threshold prediction for segmenting tumour from
brain MRI scans." International Journal of Imaging Systems and Technology 24.2
(2014): 129-137.