You are on page 1of 6

Transfer learning based Deep learning framework

for identification of type of Brain Tumor

1stG.JayaLakshmi 2ndDr.Sangeetha yalamanchili


Assistant Professor Associate Professor
Department of Information Technology Department of Information Technology
V R Siddhartha Engineering College V R Siddhartha Engineering College
Vijayawada,AndhraPradesh Vijayawada,AndhraPradesh
jaya1123@vrsiddhartha.ac.in ysangeetha@vrsiddhartha.ac.in
https://orcid.org/0000-0002-4044-4138

Abstract—Brain tumors are not a condition that is very accurate details, it is used in conjunction with further
frequent, but they are between the most deadly cancers. imaging modalities, such as computed tomography (CT),
Various Magnetic Resonance Imaging (MRI) modalities are positron emission tomography (PET) and magnetic
currently state-of-the-art for the non-invasive diagnosis of resonance spectroscopy (MRS),providing the majority
brain tumors. There are several imaging techniques that are correct tumor morphology and metabolism details.
used to diagnose tumors in the brain. Because of its unrivaled
image quality and the fact that it does not rely on ionizing Deep Learning (DL) is a form of AI method that follows
radiation, Magnetic Resonance Imaging (MRI) is generally the implementation of an individual brain in the dealing out
used for such a mission. Deep learning, especially in division of data and sample generation, helpful in creation appropriate
and classification of brain tumors, has demonstrated an decisions.DL computations allow use of different non-linear
impressive presentation. For the classification of types of brain levels that are fine structured for an image to extract features.
tumors into malignant and benign using publicly available The effect of each prearranged layer is the involvement of the
datasets, the Deep Learning and Transfer learning methods are following layer, which helps with details (Deng & Yu,
suggested in this work. On the dataset, the proposed approach 2014).The Convolution Neural Network (CNN) is branch of
was validated with an precision of 99.04 percent .The outcome the DL relation and is commonly used for visual scrutiny and
accomplished demonstrates the ability of the proposed negligible pre-processing. Convolution Neural Networks'
algorithm to identify brain tumors. The newly built primary points of interest are their capacity to learn features
architecture may be used in medical diagnostics as an
and have limitless accuracy compared to traditional AI
important decision-support method for radiologists.
techniques by rising the number of trials used for training
Keywords—transfer learning, deep learning, alexnet, and thereby contributing to a very efficient and exact
googlenet, fully CNN, brain tumor model(Litjens et al., 2017).
A powerful and original method for DL using Transfer
I. INTRODUCTION (HEADING 1)
Learning (TL) practices to detect as well as classify BTs by
Brain tumors are not very common with an occurrence of mining pivotal characteristics from a standard dataset is the
less than 1hin in the western populace, but they are amongst main contribution of the presented study. To analyze the five
the majority deadly cancers [1]. A latest study anticipated the distinct DL models such as Alexnet, Googlenet, ResNet50,
US prevalence rate for most important brain or nervous ResNet101, and SqueezeNet using BT MR images as well as
system tumors to be about 25 per 100,000 adults with around apply TL processes to the certain dataset and to there an in-
one-third of the tumors life form malignant and the depth assessment of the presentation of important features
remainder either benign or intermediate malignant [2].Brain affecting the modification of pre-trained structures and to
tumors can be categorized according to their basis or degree conduct an investigative comparison between the beyond
of hostility. Major brain tumors develop in the brain, while models using different dynamic constraints for BT detection
other regions of the body also cause metastatic brain tumors. [4].
In adults, gliomas are the mainly frequent primary brain
tumors, counting for 70% of adult malignant main brain
tumors.MRI offers information about the location and tumor
size despite numerous clinical advances. By maneuvering
radio frequency waves and restoring their stable state, their
work relies on the behavior of protons in an immense
magnetic field (Pereira et al., 2016).It is very capable and
more and more sensitive to changes in tissue firmness needed
for pathological discussion to determinedly distinguish
fragile tissues with high accurateness MR technology. In
non-invasive brain studies, the MRI images are categorized
into T1-weighted (T1-w) images that be continuously used. Fig. 1. T1-weighted MR images

Magnetic resonance imaging is the preferred procedure II. RELATED WORK


for the diagnosis of brain tumors (MRI) [1].MRI is a non-
invasive technique that offers excellent contrast to soft tissue While the most common cancers are not high-grade
[3] and is commonly available in clinics. To provide the most gliomas, they be among the mainly dangerous. In order to

XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE


advance discover the reaction of gliomas to various healing overlapping minute dimensions. The two well-known
regimens, new therapies have enhanced the prediction of features, mostly used by the pooling layers,
layers are average and
patients and new clinical experiment designs include become pooling (Liu et al., 2018).A batch normalization layer is
relevant. Therefore,, the most detailed explanation of widely used for standardizing feature maps after the
transforms in tumor size and composition is needed. activation layer to normalize the feature map.
For the segmentation and categorization of brain tumors, Toğaçar et al.[35] have suggested a combination system
Sharif et al.[5] introduced an dynamic deep learning method. for the taxonomy of brain tumors, depending on CNN and
Initially,, they performed contrast enhancement, and for the feature selection. They obtained an improved precision of
creation of a saliency map,, the resulting image be transferred over 90%.They did not include the computation time in the
to the Saliency-based
based Deep Learning (SbDL) process. In the above methods. For each automated system, however, the
next step, the threshold was applied with the resulting images computation time was most required for this contemporary
were used to adjust the pre-trained .In In addition, prevailing period. A more recent comprehensive study on the
rotated confined binary pattern (DRLBP)features, merged categorization of multi-grade
grade brain tumors was given by
with CNN features, were also extracted. A PSO-based Muhammad et al.[7].They
].They provided a detailed explanation of
optimization be later achieved and the finest vector was the classification
lassification of brain tumor(BT)
tumor(BT measures, such as
approved for final classification to the Softmax classifier. tumor preprocessing, deep learning features, as well as
In[6], the authors provided a scheme for thehe taxonomy of cataloging. Comprehensive shortcomings with
brain tumors based on CNN.They measured the issue of the accomplishments of current deep learning techniques
tumor's structural heterogeneity around the neighboring intended for BTC were addressed. In accumulation, they
regions. To this end, tiny kernelss were engineered to keep the have accessible the significance of transfer learning for the
weight of each neuron very small. They obtained an accuracy extraction of DL functionality.
of 97.5percent, taking benefit of these weights. A sub-
III. PROPOSED METHODOLOGY
category of the artificial NN is deep learning networks or
CNNs. The various layers of CNN and the pooling layers be The goal of this research is in the direction of improve
used as a means to match multi-dimensional
nsional MR images to the accuracy of the identification of brain MR images by
generate optimal result subsequent to they are build using DL methods and the approach of Transfer Learning
(Lundervold & Lundervold, 2019). (TL).TL is the task of using the knowledge specified by a
The advantage of by means of DL is that while getting pre-trained framework to find out novel models given by
educated, the device by itself finds away how to accomplish original data. It’s typically much easier and simpler to
the feature mining.DL
.DL networks use their own kernels or calibrate a pre-trained
trained system with TL than starting from
convolutional filters to extract the function. In addition, the basic.
Convolutional Layers (CL) has a variety of tiny sorts. These It starts with the dataset of the MR brain picture that
sorts are added to all layers in order to construct a function was compiled as well as approved into benign in addition to
tensor. The tensor in the activation layer of feature maps malignant MR slices. The proposed approach involves the
created by a CL is resolute by a Rectified Linear Unit (RLU) following stages: pre-processing,
processing, data distribution with
.In DL system,, the RLU is the commonly used activation augmentation, feature extraction supported on DL, and
function that is used to limit any negative value in the aspect finally the classification of the tumor type. The process of
map to zero. The rectified features are linked to the pooling this exploration is presented in fig 2.
layers to reduce the measurements by producing non non-

Fig. 2. Block diagram illustration of classification using AlexNet and the GoogleNet

glioblastoma [8].As shown in Fig 1wewe chose T1-weighted


T1
images. It comprises a total of 696 MR images, 224 of
which are categorized as benign with 472 are malignant
Brain MR dataset for assessment of the proposed work images. The file size of each image is position in 225X225
integrates datasets from patients with acknowledged JPEG format.
GoogleNet for various applications without its final layer as a
stable task extractor.
A. Preprocessing
The key move is to scale back the unprocessed MR image Depending on its hidden states, the classification layer
for preprocessing to decrease measurements of will help us turn a new area image into a multi-dimensional
dimensionality and help the device in far less time to show a vector, thus enabling us to extract features from a new
superior performance. At that point, before separating them, domain, using the information from a pre-trained network
image data is disorderly up to maintain the network to run on (PTN) domain mission. For any pre-trained network, the
an unprepared dataset along with to focus on a limited extraction of visually evident aspects is achieved by using
collection of the entire dataset. enhancement. In comparison to stand-in and retraining the
complete architecture of the classifier, the weights of CNN
B. Training and test datasets are applied from the apex of the PTN.This condition works
Image data is next divided into three sections: training, by shifting the PTN weights from the resource to the target
testing, as well as validation, among 70% for training as data. The basic process is to replace the PTN's softmax layer
well as 15% for testing and validation, correspondingly. and replace it with a latest softmax layer that is important for
Finally, image augmentation such as flipping, mirroring, the task projected.
furthermore rotating is performed with the intend of D. Regularization function
producing a excess of network records that is usually used to A regularization feature is used to prevent over-fitting
avoid the network commencing overfitting along with when practicing. It means the correct use of a function
progress device toughness [9].In addition to these image solver to avoid network overfitting. To avoid overfitting
augmentations, several salt noise is often applied to the data during preprocessing as well as training phases, various
for a grayscale deformation and consists of 45 degrees of methods have been used. Initially, image information is
flipping, mirroring, salt noise addition, furthermore image improved to avoid overfitting [10].Optimization methods are
rotation. considered an objective function for whichever optimizing
C. Feature Extraction using Deep Learning and Transfer or reducing a function f(x) by changeable, and it is called the
Learning approaches cost function or loss function until the function is
A section of the artificial NN is DL networks or CNNs. minimized. An adaptive learning strategy is RMSprop It is
The multiple layers of CNN and the pooling layers be used as planned to resolve the issue of the drastically declining
a way of matching multi-dimensional MR images to generate learning rate. Once a convex bowl is located, it formulates
an optimal result later they are trained. The advantage of use use of an exponentially moldering average to organize of the
DL is that the method itself perform how to extract the past so that it can converge easily.
function when being trained. The tensor in the activation
layer of feature maps created by a CL is resolute by a Various system constructs are then attempted to ward
Rectified Linear Unit (RLU).In DL systems, the RLU is the off system anomalies. Later, dropout layers are used fig 3, to
commonly used activation function that is used to limit any dismiss unknown weights relatively.
negative value in the feature map to zero. A batch
normalization layer is widely used for standardizing feature
maps subsequent to the activation layer to normalize the
aspect map. The network is governed through this
normalization layer and the training is accelerated. The
ending layer of the method is usually used as the Softmax
layer that is used for image taxonomy and is devised as

Xz = ∑

Finally, a classification layer is used with the intention of


relies on cross-entropy to determine the loss because of
classification along with which provides the very last labels
required for each MR image.
Transfer Learning (TL) is an stimulation for overcoming Fig. 3. Dropout layer with dropout probability
remote learning models and using insightful information
gained for one challenge to resolve related ones. It is the
project of using the in sequence given by a pre-trained
method to find out new models accomplish by new data. DL
frameworks are the fundamental concept of transfer learning E. Metrics for assessment
along with representations are layered structures that find out The feasibility of the proposed recognition and
different characteristics at different levels and after that relate identification system for BT is evaluated by measuring
to the final layer called a completely associated layer to essential results that are used to verify the classifier's
generate the result. Such a layered arrangement enables one efficiency.
to employ a pre-trained framework such as Alexnet, The proposed output of the system is determined as follows:
Accuracy: is the capacity of a device to correctly establish Using the nine pre-trained
trained architectures, we test the
the BT type and is specified by: classification efficiency along with review our findings in the
outline of tables.

TABLE I. TRANSFER LEARNING USING ALEXNET


Specificity: It is a system's ability to properly classify the Tested
real BT with is calculated as: Augmented image sets
constraints
SGDM RMSProp ADAM
Batch 10 10 8
dimension
Maximum 6 6 6
Sensitivity: takes account of a model's ability to properly epochs
categorize the BT along with is intended as:
Training 43:31 64:41 49:13
time
(min/s)
Accuracy 99.04 99.04 98.65
Initial 1.00E-04 1.00E
1.00E-04 1.00E-04
Precision: It is definite as the nearness to each other of the Learn Rate
two calculated values furthermore is specified by
TABLE II. GOOGL NET
TRANSFER LEARNING USING GOOGLE

Tested
Augmented image sets
constraints
SGDM RMSProp ADAM
Matthews correlation coefficient (MCC): It is referred to as Batch 10 10 10
a metric to determine classification dominance furthermore dimension
is specified by: Maximum 5 6 6
epochs
Training 43:31 112:11 118:54
time
(min/s)
Error Rate: The proportion of all false predictions to Accuracy 95.13 98.02 96.9
the entire purpose dataset is calculated and devised as Initial 1.00E-04 1.00E
1.00E-04 1.00E-04
Learn Rate

F1Score: It is used to calculate a test's accuracy and is given


by:

IV. EXPERIMENTAL RESULTS


Transfer learning with deep learning frameworks is used
in this work to reliably distinguish benign and malignant BT Fig.4. Accuracy and loss metrics
types. The algorithm is trained to access the best precision of
the system using different deep learning pre-trained
trained networks
such as AlexNet, GoogleNet use se softmax layers of prepre- A. Performance Classification
trained networks by enhancement the image recognition. In fig 5, the results of the uncertainty matrix are shown. A
Here, the visually evident trained aspectss from each DL description of the predictions made by the model reveals the
network be adapted to the goal dataset and the ssoftmax layer performance effects of the confusion matrix classification,
conducts the BT classification by installing the extent of where every row correspond to the actual class with each
neurons into two groups. These perfect parameters are not column correspond to the predicted class. Table 4 tabulates
self trained, so setting the optimized parameters according to the performance of measurement parameters in terms of
the result of trained MR images for performance precision, error rate, sensitivity, specificity, accuracy, and
augmentation is necessary. recall for the AlexNet and GoogleNet Net architectures.
For every one of the above trained networks, the system TABLE III. PERFORMANCES METRICS FOR THE ALEXNET AND
is trained several
everal times using various well commended GOOGLENET.
optimizers such as Stochastic gradient descent through Model Accura Sensitivi Specifici Precisi Reca Erro
momentum(SGDM), Root means square propagation cy ty ty on ll r
(RMSProp), as well as Adaptive moment estimation Rate
(ADAM) to train the network to achieve the finest trained GoogleN 95.13% 95.6% 95.8% 95.6% 47.8 7.8%
method possible. et %
AlexNet 99.045 93% 94.3% 88.4% 46.1 5.1%
% %
architectures for programmed classification of brain cancer
types as of T1-w MRI images with deep TL
models.Glioma and meningioma are two forms of brain
tumors.
REFERENCES

[1] L M DeAngelis.Brain tumors.The New England journal of


medicine,344(2):114–23, January 2001. Available
from:http://www.getcited.org/pub/102462096http://www.ncbi.nlm.
nih.gov/pubmed/11150363
[2] Betsy A Kohler, Elizabeth Ward, Bridget J McCarthy, Maria J
Schymura, Lynn A G Ries, Christie Eheman, Ahmedin
Jemal,Robert N Anderson, Umed A Ajani, and Brenda K
Edwards.Annual report to the nation on the status of cancer, 1975-
2007, featuring tumors of the brain and other nervous
system.Journal of the National Cancer Institute,103(9):714–36,
May 2011. Available
from:http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3
086878&tool=pmcentrez&rendertype=abstract
[3] Zhi-Pei Liang and Paul C. Lauterbur.Principles of Magnetic
Resonance Imaging: A signal processing perspective. Wiley -
Fig.5. Comparison of performance metrics IEEE Press, 2000.
The AlexNet and GoogleNet haveachieved greater [4] Isensee F., Kickingereder P., Wick W., Bendszus M., Maier-Hein
K.H. Brain tumor segmentation and radiomics survival prediction:
precision than previous versions.This is because the Contribution to the brats 2017 challenge; Proceedings of the
functionality of transfer learning (TL) in the AlexNet International MICCAI Brainlesion Workshop, BrainLes 2017;
andGoogleNet architectures has been applied to the last Quebec City, QC, Canada. 14 September 2017; pp. 287–297.
three layers as replacement layers. [5] Sharif M.I., Li J.P., Khan M.A., Saleem M.A. Active deep neural
network features selection for segmentation and recognition of
Author Year Method Accuracy/Error brain tumors using MRI images. Pattern Recognit. Lett.
Rate 2020;129:181–189. doi: 10.1016/j.patrec.2019.11.019.
[6] Seetha J., Raja S.S. Brain tumor classification using convolutional
He, K., 2015 ResNet 3.57% neural networks. Biomed. Pharmacol. J. 2018;11:1457–1461. doi:
Zhanget al. 10.13005/bpj/1511.
[11] [7] Deep Learning for Multigrade Brain Tumor Classification in Smart
Healthcare Systems: A Prospective Survey.Muhammad K, Khan S,
Szegedy et 2016 ResNet- 3.08% Ser JD, de Albuquerque VHC IEEE Trans Neural Netw Learn Syst.
al. [12] inception 2020 Jun 30; PP():.
[8] Clark K., Vendt B., Smith K., Freymann J., Kirby J., Koppel P., …
Hu, J.,et al. 2017 SENet 2.251% R., Tarbox L.The cancer imaging archive (TCIA): maintaining and
[13] operating a public information repository Journal of Digital
Imaging, 26 (6) (2013), pp. 1045-1057
Paul et 2017 Deep 91.4% [9] Wong S.C., Gatt A., Stamatescu V., McDonnell
al.[14] Learning M.D.Understanding data augmentation for classification: when to
CNN warp? 2016 international conference on digital image computing:
techniques and applications (DICTA), IEEE (2016), pp. 1-6
Afshar et 2018 CapstNets 90.09% [10] Ciregan D., Meier U., Schmidhuber J.Multi-column deep neural
al.[15] networks for image classification 2012 IEEE conference on
computer vision and pattern recognition, IEEE (2012), pp. 3642-
Tang, T.T., 2019 Random 84% 3649
Zawaski, et forest [11] He, K., Zhang, X., Ren, S., Sun, J.: Deep Residual Learning for
al.[16] classification Image Recognition, arXiv:1512.03385 [cs], December 2015
model [12] Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.: Inception-v4,
Inception-ResNet and the Impact of Residual Connections on
Our 2021 AlexNet and 99.04% Learning, arXiv:1602.07261 [cs], February 2016
Proposed GoogleNet [13] Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-
Model Excitation Networks, arXiv:1709.01507 [cs], September 2017
[14] J. S. Paul, A. J. Plassard, B. A. Landman, and D. Fabbri, “Deep
learning for brain tumor classification,” Biomed. Appl. Mol. Struct.
Funct. Imaging, vol. 10137, no. 2, p. 1013710, 2017, doi:
CONCLUSION 10.1117/12.2254195
[15] P. Afshar, K. N. Plataniotis, and A. Mohammadi, “Capsule
Deep Learning-based Pre-Trained Convolutional in the Networks for Brain Tumor Classification Based on MRI Images
and Coarse Tumor Boundaries,” ICASSP, IEEE Int. Conf.
presented paper. The Neural Network architecture is used Acoust. Speech Signal Process. -Proc., vol. 2019-May, pp.
for MRI image classification. A benign and malignant 1368–1372, 2019, doi: 10.1109/ICASSP.2019.8683759
form of BT. The paper discusses different aspects of Pre- [16] Tang, T.T., Zawaski, J.A., Francis, K.N. et al. Image-based
trained networks to improve BT’s MR image Classification of Tumor Type and Growth Rate using Machine
classification. It is obvious that learning transfers through Learning: a preclinical study. Sci Rep 9, 12529 (2019).
AlexNet offers 99.04 percent overall performances https://doi.org/10.1038/s41598-019-48738-5
amongst all PTNs.Used in this study. We came across a [17] Mrs. G Jayalakshmi (Ph.D), V.N.V. Srikanth, V. Ramya Devi, T.
Kalyan Babu, V. Rajitha , " ANALYZING AND PERFORMING
various interpretations observationally. The training PRIVACY PRESERVING DATA MINING ON ELECTRONIC
outcomes of various pre-trained DL networks, to begin HEALTH RECORDS" , International Journal of Emerging Trends
with. It demonstrates that the efficiency of a PTN depends & Technology in Computer Science (IJETTCS) , Volume 6, Issue
greatly on the form of the preferred optimizer. This study 2, March - April 2017 , pp. 046-050 , ISSN 2278-6856.
provides a comparison of the AlexNet and the GoogleNet

You might also like