Intelligent Diagnostic System For The Diagnosis and Prognosis of Breast Cancer Using ANN

JOURNAL OF COMPUTING, VOLUME 3, ISSUE 3, MARCH 2011, ISSN 2151-9617
HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/
WWW.JOURNALOFCOMPUTING.ORG 93
Intelligent Diagnostic System for the

diagnosis and prognosis of Breast
Cancer using ANN
R.R.Janghel1, Anuj Mehra2, Anupam Shukla3, Ritu Tiwari4
Soft Computing and Expert System Laboratory
ABV-Indian Institute of Information Technology and Management, Gwalior, India
Abstract- Breast cancer occurs mostly in women and figures states that almost one out of eight women suffers from this disease. Breast
cancer is second only to lung cancer that causes the majority of deaths among women and is difficult to identify accurately and treat.
Diagnostic computational tools can be used effectively, with high degree of accuracy, which may help in improving the specificity and
sensitivity of diagnoses. There are two known types of breast cancer namely benign and malignant which are cured mostly by mammography,
FNA (Fine Needle Aspirate) and surgical biopsy. In this research work, our main objective is to develop and compare a diagnostic system for
diagnosis; prognosis and prediction of breast cancer with the usage of the computational intelligence namely Back Propagation (BPA),
Learning vector Quantization (LVQ), Support vector machine (SVM) and Adaptive Neuro-fuzzy Inference System (ANFIS). These modeling
tools can highlight the important features that play pivotal roles in the classification and aid physicians to diagnose and prognosticate breast
cancer. The database used in developing the above discussed model in this research work is adopted from the University of Wisconsin (UCI)
Machine Learning Repository. Experimental result shows that different models give optimal performance for the data set. However, all the
models are able to solve the problem up to a greater extent but a maximum accuracy of 96.50% is being achieved on the usage of LVQ which
outdo all the other models.
Keywords- Artificial Neural Networks, Breast Cancer, Expert System, Back Propagation Algorithm, Learning vector Quantization, Support
vector machine and Neuro-fuzzy Inference System.
of care as well as patient and public opinion
1. Introduction satisfaction. Recently, computer‐based medical data
Breast cancer has become one of the most perilous processing research has yielded methods and tools for
carcinomas in the world among middle‐aged and old managing the task away from the hospital management
age women. Breast cancer is the second leading cause of level and closer to the desired disease and patient
cancer deaths in women worldwide and occurs in management level.
nearly one out of eight women [1]. Breast cancer occurs
mostly in women but hardly ever it is found to occur in 2. Literature Review
men. Computational intelligence is a stimulating field
and from the last decade, Artificial Neural Networks Yuanjiao et al. proposed a technology to extract micro‐
(ANN) has become important in medical applications calcifications clusters with accurate edge effects which
due to their accuracy for predictive inference, with canʹt be detected by the naked eye on mammograms to
potential to support large data sets. Feedforward obtain much more hidden information in order to help
Neural Network (FNN) is a kind of ANN, which has a the doctors in diagnosing breast cancer [2]. Another
better structure but due to slow training rate, easy to important research was been carried out by Heng‐Da et
trap local minimum point and poor ability in global al. who has proposed a computerized micro‐
search prevents its usage as alone. Learning involves calcification detection based on fuzzy logic, vibro‐
the extraction of rules or patterns from the set of data. acoustography and probabilistic neural network on
Further the time and memory requirement would be mammograms for breast cancer [3]. Image feature
reasonably less as the system has already summarized extraction was utilized to retrospectively analyze
the data into some patterns or rules. In the computer‐ screening mammograms taken prior to the detection of
aided decision system, a system is considered as a malignant mass was done by Mohd. Sameti et al.[4].
effective and accurate if it shows a very high Al Mutazet al. uses statistical texture features for breast
generalizing capability which would help a physician cancer detection and few ANN [5].
to diagnose a patient correctly. However, unfortunately T.Z. Tan et al.uses thermogram which is a promising
clinicians can act in different ways depending on their front‐line screening tool to warn women of cancer [6].
knowledge and experience which highlighted the need Laufer et al. proposed a modified self‐organizing map
to introduce diagnostic tools to support the scientific with nonlinear weight adjustments to reduce number of
homogeneity and accountability of healthcare decisions unnecessary biopsies with a minimal number of
and actions. The benefits expected from such actions subsequent [7]. Taio et al. used Kohonen self organizing
include an overall reduction in cost, improved quality map and multilayer perceptron trained with the
backpropagation algorithm [8]. Xiong et al. used Collect data set for Breast Cancer
statistics methods like PCA, PLS linear regression
analysis, data mining methods, and hybrid system of
Data Feature Extraction and analysis.
combination rough set and probabilistic neural Processing Removal OR Filling of missing data.
network. Probabilistic neural network to perform Data Normalization.

supervised classification and rough sets was able to
reduce the number of attributes in the dataset without Models
Simulated
sacrificing classification accuracy in [9, 10, 11].
Karabatak et al. used ANN and ANFIS with linear
Training and Testing Phase
discriminant analysis and principal component analysis
for diagnosis of breast cancer. [12, 13]. Ky Van Ha et al.
Diagnosis Decision
constructed a hierarchical evolutionary RBF network
and employed it to detect the breast cancer. [15, 16].
The optimum network for classification of breast cancer Benign Malignant
cells was found using Hybrid Multilayer Perceptron
(HMLP) network. [17, 18].
Performance Comparison of computational models
Jain et al. used fuzzy‐logic, Hybrid Neuro‐Fuzzy
generator based on the Knowledge Oriented Design
(KOD) concept and Cooperative Neuro‐Fuzzy systems Figure 1: Block Diagram of Complete Methodology
using Genetic Algorithms were used for the
classification (diagnosis) of breast cancer [19, 20]. Pena‐ 3.1. Diagnosis using Back Propagation Algorithm
Reyes et al. proposed fuzzy‐genetic breast cancer
identification. [21]. Seker et al. proposed a methodology In back propagation, the partial derivatives of the cost
with neural network, fuzzy logic, FK‐NN and statistical function with respect to the free parameters of the
method to prognostic analysis of cytometric image network is being determined by back propagation error
data. [22]. signals through the network, layer by layer. The feed‐
forward neural network architecture used in this
experiment consists of a single hidden layer along with
3. Methodology
input and output. The employed neural network is a
In this research work, computational intelligence feed forward neural network and it is shown in the
models have been used for the diagnosis and prediction following figure
of breast cancer. The learning takes place through an
iterative process of weight adjustments applied to its
initial weight after epoch iteration of the learning
process. Figure 1, shows the overall diagram of
complete methodology. Mammography, biopsy and
FNA database of breast cancer microscopic and clinical
tests reports are used in our diagnostic system which
helps in prediction of cancer as either Benign or
Malignant. In this research work, we have employed
ANN like Back Propagation (BPA), Support vector

machine (SVM), learning vector Quantization (LVQ)
Figure 2. Back Propagation Multi‐layer neural network
and Adaptive Nero‐fuzzy Inference System (ANFIS) for
structure.
training of the dataset. Among all the above mentioned

ANN, LVQ achieved the 96.50% of accuracy which
The transfer function in hidden layer neurons and
comes out to be highest.
output layer neurons are sigmoid and purelin

respectively. The performance function used was Mean

Sum‐squared Error (MSE).

For hidden units, we must propagate the error back

from the output nodes; equation (1) gives us the

equation of error back propagation

E neti y j
j   

(1)
iPj neti y j net j

where, first factor represents the error of node i, second 3.3. Diagnosis using Adaptive Neuro-Fuzzy

factor is the weight from unit j to i, and third factor is Inference System
the derivative of node jʹs activation function [23].
The basic structure of the type of fuzzy inference
3.2. Diagnosis using Learning Vector system seen thus far is a model that maps input
Quantization Network characteristics to input membership functions, input
membership function to rules, rules to a set of output
LVQ network are based on supervised classification characteristics, output characteristics to output
algorithm. LVQ networks are designed to make membership functions, and the output membership
decision boundaries in the input space, for a particular function to a single‐valued output or a decision
set of training data. According to structure, the network associated with the output. ANFIS are a class of
contains an input layer, a competitive layer and an adaptive networks that are functionally equivalent to
output layer. fuzzy inference systems. These represent Sugeno e
Tsukmaoto fuzzy models and uses a hybrid learning
algorithm [26].

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5
A1 A1 R1
x1 R1

x1
x1
A2  R2
x1 A2 R2

wR3 C1
C1
Figure 3. Learning Vector Quantization Neural A3 A3 R3 R3 wR6
y
Network Structure wR1 

B1 B1 R4 R4 wR2
x2 C2
Here, input layer consists of data provided. In the wR4
C2
R5
competitive layer, each competitive unit corresponds to x2 x2  wR5
B2 B2 R5
a cluster and the output layer has the neurons equal to x2
R6
the no of classes. B3
B3 R6
According to LVQ algorithm,

(a) If  w   xi
c
wc ( n  1)  wc ( n)   n [ xi  wc ( n)]
Figure 4. Adaptive Neuro‐fuzzy Inference Systems
where, 0 <  n < 1. (2)
else  w   xi The ANFIS can be trained by a hybrid learning
c
algorithm [28]. In the forward pass the algorithm uses
wc ( n  1)  wc ( n)   n [ xi  wc ( n)] (3) least‐ squares method to identify the consequent
Where, {w j}lj1 represent the set of Voronoi vectors, parameters while in the backward pass the errors are
propagated backward and the premise parameters are
w c
represent the class associated with the Voronoi updated by gradient descent [27].
vector, {x i }iN1 represent the set of input vectors, Layer 1 is the input layer.
 x represent the class label of the input vector xi, and
i yi(1)  xi(1) (4)
 n denotes the learning constant with the number of
iterations n.
Layer 2 is the fuzzification layer.
(b) The other Voronoi vectors are not modified.  b
0, if xi( 2 )  a
LVQ network has an advantage over perceptron as they 2
can classify any set of input vectors whereas perceptron 
 2 xi  a
( 2)
( 2)  b b
can classify only linearly separable sets of input vectors. yi  1  , if a   xi( 2 )  a 
The only requirement is that the competitive layer must
 b 2 2 (5)

have enough neurons and each class must be assigned  b
enough competitive neurons [23][24][25]. 0, if xi( 2 )  a 
 2

subject to (for any 1=1,2,3,………….,n)
Layer 3 is the inference layer. (14)

yi(3)  x1(i3)  x2(3i )    xki(3) (6) 4. Experiments and ( Results
The results are 6 measured against the following
diagnostic performance measures:
)
Layer 4 is the output membership layer 1. True positive (TP): The number of positive

cases correctly detected,
y
i
( 4)
 x ( 4)
1i  x ( 4)
2i    x ( 4)
li (7)
2. True negative (TN): The number of negative
cases correctly detected,

3. False positive (FP): The number of negative
Layer 5 is the defuzzification layer. For, the datasets C1
cases diagnosed as positive, and
and C2, the weighted average would be calculated as,
4. False negative (FN): The number of positive

cases diagnosed as negative.
μ a b μ a b
y C1 C1 C1 C2 C2 C2 (8) The Wisconsin breast cancer diagnosis (WBCD)
(
μ b μ b database is the result of the efforts made at the
C1 C1 C2 C2 8
University of Wisconsin Hospital for accurately
)
diagnosing breast masses based solely on an FNA test.

The database comprises diagnosis results of 699
3.4. Diagnosis using Support Vector Machine
patients beyond which 458 are benign and 241 are
(SVM)
Support Vector Machine (SVM) is based on the linear malignant [25]. Different neural network models were
classifiers and on a general method known as “kernel simulated for different parameter settings. The various
tricks”, which is based on the concepts of duality and performance measures are summarized and
kernels that allow algorithms to be formulated such experimental results of breast cancer system using
that they could be systematically extended to non‐ computational intelligence models are shown in table 1
linear cases. and table 2 respectively. LVQ has emerged as the
Support Vector Machine (SVM) models are similar to optimal network, which uses 10 hidden neurons and
multilayer perceptron neural networks. Using a kernel 0.01 learning rate.
function these are an alternative training method for
polynomial, radial basis function and multi‐layer Table 1. Diagnostic performance measurement for
perceptron classifiers. These classifiers are found to be Breast cancer.
helpful in solving a quadratic programming problem
Cancer Total
with linear constraints as compared to others. Present Absent
Test
In SVM, let us assume the training data to be D, a set of
True False [TP +FP]
n points of the form
Positive Positive Positive
(9)
[TP] [FP]
where, ci is either 1 or −1, indicating the class to which
False True [FN+TN]
the point Xi belongs. Each Xi is a p‐dimensional real
Negative Negative negative
vector. Any hyperplane can be written as the set of
[FN] [TN]
points X satisfying
(10) [TP + FN+ TN
Total [TP+ FN] [TN + FP]
We want to choose the W and b to maximize the + FP]
margin. These hyperplanes can be described by the
equation
(11) Sensitivity (Sens.)
TP / (TP + FN)
And Specification (Spec.)
TN / (TN + FP)
(12) Accuracy (Acc.)
(TP + TN) / (TP + TN + FP +
If the training data are linearly separable then we can FN)
select the two hyperplanes of the margin in such a False Positive Rate FP / (TP+FP)
manner that there are no points between them and then (FPR)
try to maximize their distance False Negative Rate FN / (FN+TN)
for all 1≤ i ≤ n (13) (FNR)
We can put this together to get the optimization
problem:
Minimize (in w,b)
[4] Mohammad Sameti, Rabab Kreidieh Ward, Jacqueline

Morgan‐Parkes and Branko Palcic “Image Feature Extraction in
the Last Screening Mammograms Prior to Detection of Breast
Table 2. Performance comparisons of Computationals
Cancer” IEEE journal of selected topics in signal processing, vol. 3,
intelligence models no. 1, February 2009, pp 46‐52.
[5] Al Mutaz M, Abdalla , Safaai Deris, Nazar Zaki and Doaa M.
ANN Sens. Spec. Acc. FPR FNR TT Ghoneim “ Breast Cancer Detection Based on Statistical Textural
Model (sec.) Features Classification“ 2008 IEEE , pp 728‐730.
BPA 80.00 90.00 85.00 20.00 10.00 2.00 [6] T.Z. Tan , C. Quek , G.S. Ng a, E.Y.K. Ng “A novel cognitive
LVQ 95.00 98.00 96.50 05.00 02.00 57.00 interpretation of breast cancer thermography with
SVM 90.00 92.00 91.00 10.00 08.00 05.00 complementary learning fuzzy neural memory structure” Expert
ANFIS 88.00 90.00 89.00 12.00 10.00 10.00 Systems with Applications 33 (2007) 652–666.
[7] Shlomi Laufer and Boris Rubinsky “Tissue Characterization

with an Electrical Spectroscopy SVM Classifier” IEEE transactions
5. Conclusions on biomedical engineering, vol. 56, no. 2, February 2009 pp 525‐
Breast cancer is a major cause of deaths and 528.
predominantly affects women more than the age of 40. [8] Taio C. S. Santos‐Andrk and Anttinio C. Roque da Silva “A
All the systems mentioned in the introduction were Neural Network Made of a Kohonen’s SOM Coupled to a MLP
Trained Via Backpropagation for the Diagnosis of Malignant
simulated on the breast cancer database adopted from
Breast Cancer from Digital Mammograms” 1999 , IEEE pp 3647‐
the University of Wisconsin (UCI) Machine Learning 3650.
Repository. The ultimate goal is to make a diagnostic [9] Kenneth Revett, Florin Gorunescu, Marina Gorunescu, Elia El‐
system that would assist the doctors in diagnosis of the Darzi and Marius Ene “A Breast Cancer Diagnosis System: A
disease which would hence prove to be a very useful Combined Approach Using Rough Sets and Probabilistic Neural
Networks “, EUROCON 2005, Serbia & Montenegro, Belgrade,
system considering the present scenario where diseases
November, 22‐24, 2005,pp 1124‐1127.
are on a hike and there is a lack of availability of
[10] Xiangchun Xiong, Yangon Kim, Yuncheol Baek, Dae Wong
specialized doctors. Such a system learns from the past Rhee, Soo‐Hong Kim “Analysis of Breast Cancer Using Data
data which is a collection of a lot of information in Mining & Statistical Techniques” Proceedings of the Sixth
itself. The system tries to extract this hidden International Conference on Software Engineering, Artificial
information to make a generalized system for the Intelligence, Networking and Parallel/Distributed Computing and
First ACIS International Workshop on Self‐Assembling Wireless
detection of the disease. It may be noted that though we
Networks (SNPD/SAWN’05) 2005 IEEE.
had tried to build an effective system and obtain a [11] Emad Nabil, Amr Badr ,Ibrahim Farag, Mohamed Osama
accuracy of 100%, which is the ultimate goal of medical Khozium “ A Hybrid Artificial Immune Genetic Algorithm with
diagnosis but we were able to achieve a maximum Fuzzy Rules for Breast Cancer Diagnosis “INFOS2008, March 27‐
accuracy of 96.50% using LVQ with 10 hidden neurons 29, 2008 Cairo‐Egypt, pp 31‐40.
[12] David B. Fogel, Eugene C. Wasson, Edward M. Boughton,
and 0.01 learning rate which surpasses the SVM whose
Vincent W. Porto, and Peter J. Angeline “Linear and Neural
accuracy is found out to be 91%. It can be concluded Models for Classifying Breast Masses “IEEE transactions on
from the above result that LVQ should be used for medical imaging, vol. 17, no. 3, June 1998, pp 485‐488.
prediction of the breast cancer as compared to other [13] Shakti K. Davis, Barry D. Van Veen, Susan C. Hagness, and
computational intelligence models. Frederick Kelcz “ Breast Tumor Characterization Based on
Ultrawideband Microwave Backscatter” IEEE transactions on
biomedical engineering, vol. 55, no. 1, january 2008, pp 237‐246.
Acknowledgement [14] Lubomir Hadjiiski, Berkman Sahiner, Heang‐Ping Chan,
Nicholas Petrick and Mark Helvie “ Classification of Malignant
The authors sincerely acknowledge Prof. S.G. and Benign Masses Based on Hybrid ART2LDA Approach “IEEE
Deshmukh, Director ABV‐IIITM, Gwalior, India for transactions on medical imaging, vol. 18, no. 12, December 1999
encouraging and providing facilities to carry out this pp1178‐1187.
research work. This work is sponsored by ABV‐IIITM [15] Ky Van Ha “Hierarchical Radial Basis Function Networks”
1998 EEE 1893 PP 1893‐1898.
Gwalior.
[16] Yuehui Chen, Yan Wang, and Bo Yang “ Evolving
Hierarchical RBF Neural Networks for Breast Cancer Detection “
References ICONIP 2006, Part III, LNCS 4234, Springer‐Verlag Berlin
[1] http://www.breastcancer.org Heidelberg 2006 ,pp. 137–144, 2006.
[2] Yuanjiao MA, Ziwu WANG, Jeffrey Lian LU, Gang WANG, [17] Nuryanti Mohd Salleh , Harsa Amylia Mat Sakim and Nor
Peng LI Tianxin MA, Yinfu XIE ,Zhijie ZHENG “Extracting Micro‐ Hayati Othman “Neural Networks to Evaluate Morphological
calcification Clusters on Mammograms for Early Breast Cancer Features for Breast Cells Classification” IJCSNS International
Detection” Proceedings of the 2006 IEEE International Conference Journal of Computer Science and Network Security, VOL.8 No.9,
on Information Acquisition August 20 ‐ 23, 2006, Weihai, September 2008, pp 51‐58.
Shandong, China ,pp499‐504. [18] Nuryanti Mohd Salleh , Harsa Amylia Mat Sakim and Nor
[3] Heng‐Da Cheng, Yui Man Lui, and Rita I. Freimanis “A Novel Hayati Othman “Neural Networks to Evaluate Morphological
Approach to Microcalcification Detection Using Fuzzy Logic Features for Breast Cells Classification” IJCSNS International
Technique” IEEE transactions on medical imaging, vol. 17, no. 3, Journal of Computer Science and Network Security, VOL.8 No.9,
June 1998, pp442‐450. September 2008, pp 51‐58.
[19] A. Rahmoun, S. Berrani, “A Genetic‐Based Neuro‐Fuzzy

Generator: NEFGEN” 2001 IEEE pp 18‐23
[20] Carlos Andres Pena‐Reyes, Moshe Sipper “A fuzzy‐genetic
approach to breast cancer diagnosis “Artificial Intelligence in
Medicine 17 (1999) 131–155.
[21] C.A. Pena Reyes “Breast Cancer Diagnosis by Fuzzy CoCo”
Coevolutionary Fuzzy Modeling, LNCS 3204, Springer‐Verlag
Berlin Heidelberg 2004 pp 71‐87, 2004.
[22] H. seker, m.Odeyato, D. petrovie, R.N.G.Naguib, C. Bartoli, L.
Alasilo and G.V. Sherbet, “ Prognostic comparison of statistical,
neural and fuzzy methods of analysis of breast cancer image
cytometric data “2001 proc. Of the 23rd Annual IEEE MBS Int.
Conference October 25‐28, Istanbul, Turkey, pp 3811‐3814.
[23] Shukla Anupam, Tiwari Ritu,Janghel R.R.,and Kaur
Prabhdeep, Diagnosis of Thyroid Disorders using Artificial
Neural Networks on 2009 IEEE International Advance Computing
Conference (IACC 2009),Patiala, India, 6–7March 2009,pp:2722‐
2726.
[24] Janghel R.R , Shukla anupam,Tiwari Ritu and Pritesh Tiwari ,
“Clinical Decision support system for fetal Delivery using
Artificial Neural Network”2009 (NISS) IEEE International
Conference on New Trends in Information and Service Science,
June 30 – July 2, 2009, Beijing, China,pp:1070‐1075.
[25] Janghel R.R , Shukla anupam and Tiwari Ritu “ Decision
Support System for Fetal Delivery using Soft Computing
Techniques” (2nd ICIS), 2009 International Conference on
Interaction Sciences: Information Technology, Culture and
Human, IEEE CPS series ,24‐26 November, 2009 ‐ Seoul, Korea,
pp: 1514‐1519.
[26] S.N. Sivanandan,S.N Deepa 2007, principle of soft computing,
Wiley India private limited.
[27] Abraham, A. 2001, Neuro‐Fuzzy Systems: Stateof‐the‐Art
Modeling Techniques. In Connectionist Models of Neurons,
Learning Processes, and Artificial Intelligence, Springer, Berlin, J.
Mira,A. Prieto (Eds.).
[28] Jeen‐Shing Wang and C. S. George Lee, “Self‐Adaptive
Neuro‐Fuzzy Inference Systems for Classification Applications”,
IEEE Transactions on Fuzzy Systems, 2002.
[29] Jaime S. Cardoso, Joaquim F. Pinto da Costa and Maria J.
Cardoso,” SVMs Applied to Objective Aesthetic Evaluation of
Conservative Breast Cancer Treatment” Proceedings of
International Joint Conference on Neural Networks, Montreal,
Canada, July 31 ‐ August 4, 2005,pp 2481‐2486.

Intelligent Diagnostic System For The Diagnosis and Prognosis of Breast Cancer Using ANN

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Intelligent Diagnostic System For The Diagnosis and Prognosis of Breast Cancer Using ANN

Uploaded by

Copyright:

Available Formats

JOURNAL OF COMPUTING, VOLUME 3, ISSUE 3, MARCH 2011, ISSN 2151-9617

Intelligent Diagnostic System for the

where, first factor represents the error of node i, second 3.3. Diagnosis using Adaptive Neuro-Fuzzy

[4] Mohammad Sameti, Rabab Kreidieh Ward, Jacqueline

[19] A. Rahmoun, S. Berrani, “A Genetic‐Based Neuro‐Fuzzy

You might also like