Ga Based Neuro Fuzzy
1. Introduction
The objective of digital mammography is to detect breast cancer at an early stage of development. Masses appearing in the breast are three-dimensional lesions that may be a sign of breast cancer. Masses are described by their shape, margin and textural characteristics, and may affect the surrounding tissues. The margin, i.e. the border of the mass, is one of the most important criteria for determining whether a mass belongs to the benign or the malignant group. Round or round-to-oval masses with sharply defined borders have a high likelihood of being benign, and a benign mass generally possesses a circumscribed margin. In our earlier work [1],[2] we suggested a shape similarity measure for finding the prognosis of
3. Proposed Methodology
In the present study, tumor masses are extracted from the surrounding normal breast tissue by a fuzzy segmentation technique. The significant shape-based boundary features are searched by a genetic algorithm and fed to an adaptive neuro-fuzzy classifier, which decides whether the masses are benign or malignant. An overview of the proposed method is presented in Fig-2.
The fuzzy c-means clustering algorithm is used to develop a fully automated, intensity-based segmentation technique for masses. Three fuzzy cluster centers are chosen, as shown in Fig-3. Cluster A represents the healthy breast tissue, cluster B represents the false presence of masses (due to dense fibroglandular tissue), and cluster C represents the actual mass region.
The final fuzzy partition membership functions are shown in Fig-3, which shows that the membership functions overlap. The decision rule developed for the present problem is as follows: if the membership grade of a pixel in the mammogram is greater than 0.5 in cluster C, the pixel is considered to belong to a calcified lesion. According to this rule, the shaded region in Fig-3 indicates the actual mass. The other two clusters are suppressed to reduce the effect of false-positive masses.
Algorithm: Let $X = \{x_1, x_2, \dots, x_n\}$ be a set of given data. A fuzzy c-partition of X is a family of fuzzy subsets of X, denoted by $P = \{A_1, A_2, \dots, A_c\}$, which satisfies
$$\sum_{i=1}^{c} A_i(x_k) = 1, \quad k = 1, \dots, n \qquad (1)$$
The performance index of a fuzzy partition P, $J_m(P)$, is defined in terms of the cluster centers by the formula
$$J_m(A, v_1, \dots, v_c) = \sum_{k=1}^{n} \sum_{i=1}^{c} [A_i(x_k)]^m \, \|x_k - v_i\|^2 \qquad (2)$$
where $\|x_k - v_i\|^2$ represents the squared distance between $x_k$ and $v_i$ ($v_i$ is the center of cluster $i$). Clearly, the smaller the value of $J_m(P)$, the better the fuzzy partition P. Thus, the goal of the fuzzy c-means clustering method is to find a fuzzy partition P that minimizes the performance index $J_m(P)$, which offers
$$v_i = \frac{\sum_{k=1}^{n} [A_i(x_k)]^m \, x_k}{\sum_{k=1}^{n} [A_i(x_k)]^m}, \quad i = 1, \dots, c \qquad (3)$$
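The clustering step can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the fuzzifier value m = 2, the evenly spaced initial centers, and the membership update (the standard fuzzy c-means update, which the text does not state) are all assumptions. The sketch also assumes that cluster C is the cluster with the highest intensity center.

```python
import numpy as np

def memberships(x, v, m=2.0):
    """Standard FCM membership update (assumed; not given in the text)."""
    # Squared distances ||x_k - v_i||^2, floored to avoid division by zero.
    d2 = np.maximum((x[None, :] - v[:, None]) ** 2, 1e-12)
    inv = d2 ** (-1.0 / (m - 1.0))
    # Normalizing over clusters makes every column sum to 1, as in Eq. (1).
    return inv / inv.sum(axis=0, keepdims=True)

def fuzzy_c_means(x, c=3, m=2.0, n_iter=100, tol=1e-6):
    """Cluster 1-D intensities x; returns memberships (c, n) and centers (c,)."""
    x = np.asarray(x, dtype=float).ravel()
    v = np.linspace(x.min(), x.max(), c)  # assumed initialization
    for _ in range(n_iter):
        Am = memberships(x, v, m) ** m
        v_new = (Am @ x) / Am.sum(axis=1)  # Eq. (3): weighted means
        done = np.abs(v_new - v).max() < tol
        v = v_new
        if done:
            break
    return memberships(x, v, m), v

def performance_index(x, A, v, m=2.0):
    """Eq. (2): sum over k and i of A_i(x_k)^m * ||x_k - v_i||^2."""
    d2 = (np.asarray(x, dtype=float)[None, :] - v[:, None]) ** 2
    return float(((A ** m) * d2).sum())

def mass_mask(A, v, threshold=0.5):
    """Pixels whose membership in cluster C exceeds the 0.5 threshold."""
    c_idx = int(np.argmax(v))  # assumption: C has the brightest center
    return A[c_idx] > threshold
```

For a mammogram stored as a 2-D array `img` (a hypothetical name), one would call `fuzzy_c_means(img.ravel())` and reshape the mask back to `img.shape`.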
finding a feature subset of the input training and test patterns that is able to describe all of the information required to classify them. Boundary (margin) detection of masses based on Fourier descriptors introduces a large number of features. A standard classifier therefore needs a large number of inputs to classify different boundaries, which leads to over-learning and raises the chance of misclassification. To solve this problem we introduce an optimization technique for feature selection using a genetic algorithm (GA). A GA uses three operators, selection (or reproduction), crossover and mutation, to achieve the goal of evolution. Here the GA is used to search for two significant Fourier shape descriptors that are able to represent a particular class of tumors. A compactness measure is used as the third important shape feature. Different image boundaries are recognized on the basis of their Fourier descriptors, which play the role of objective functions that are maximized to search for the significant descriptors.
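A GA-based search for two descriptors might look like the sketch below. The text does not specify the encoding, the operator details or the exact objective function, so everything here is an assumption made for illustration: a chromosome is a pair of descriptor indices, selection is a binary tournament, crossover is one-point, the best individual is carried over unchanged, and the fitness is a hypothetical class-separability score (a summed Fisher ratio).

```python
import random
import numpy as np

def fisher_score(features, labels, idx):
    """Hypothetical fitness: summed Fisher ratio of the selected columns."""
    f = features[:, list(idx)]
    a, b = f[labels == 0], f[labels == 1]
    num = (a.mean(axis=0) - b.mean(axis=0)) ** 2
    den = a.var(axis=0) + b.var(axis=0) + 1e-12
    return float((num / den).sum())

def ga_select(features, labels, n_select=2, pop_size=20, generations=30,
              p_mut=0.2, seed=0):
    """Search for n_select (>= 2) feature indices that maximize the fitness."""
    rng = random.Random(seed)
    n = features.shape[1]

    def fitness(ch):
        return fisher_score(features, labels, ch)

    # Each chromosome is a tuple of distinct feature indices.
    pop = [tuple(rng.sample(range(n), n_select)) for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        nxt = [best]  # elitism (assumption): keep the best individual
        while len(nxt) < pop_size:
            # Selection: binary tournament between random individuals.
            p1 = max(rng.sample(pop, 2), key=fitness)
            p2 = max(rng.sample(pop, 2), key=fitness)
            # Crossover: one-point recombination of the two parents.
            cut = rng.randrange(1, n_select)
            child = list(p1[:cut] + p2[cut:])
            # Mutation: replace one gene with a random feature index.
            if rng.random() < p_mut:
                child[rng.randrange(n_select)] = rng.randrange(n)
            if len(set(child)) == n_select:  # reject repeated indices
                nxt.append(tuple(child))
        pop = nxt
        best = max(pop, key=fitness)
    return tuple(sorted(best)), fitness(best)
```

With only two descriptors to choose, an exhaustive search over all pairs is also feasible; the GA becomes useful as the number of candidate descriptors and selected features grows.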
The x-axis is treated as the real axis and the y-axis as the imaginary one. The Discrete Fourier Transform (DFT) of s(k) is given below
$$a(u) = \frac{1}{K} \sum_{k=0}^{K-1} s(k) \, e^{-j 2\pi u k / K}, \quad u = 0, 1, 2, \dots, K-1 \qquad (6)$$
The complex coefficients a(u) are known as the Fourier descriptors of the boundary. Compactness is another frequently used boundary descriptor. It is defined as $(\text{perimeter})^2 / \text{area}$. Compactness is minimal for a round (disk-shaped) figure and is insensitive to the orientation of the image. In this paper, the compactness measurement is used as an important shape feature.
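Both boundary features can be computed in a few lines of NumPy. The sketch below assumes the boundary is given as K ordered points (x(k), y(k)) of a closed contour and that s(k) = x(k) + j y(k), which follows from treating the x-axis as real and the y-axis as imaginary. The function names are illustrative, and the perimeter and area are those of the polygon through the boundary points.

```python
import numpy as np

def fourier_descriptors(x, y):
    """a(u) = (1/K) * sum_k s(k) * exp(-j*2*pi*u*k/K), s(k) = x(k) + j*y(k)."""
    s = np.asarray(x, dtype=float) + 1j * np.asarray(y, dtype=float)
    # np.fft.fft uses the same negative-exponent convention as Eq. (6).
    return np.fft.fft(s) / s.size

def compactness(x, y):
    """(perimeter)^2 / area of the closed polygon through the points."""
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    x_next, y_next = np.roll(x, -1), np.roll(y, -1)
    perimeter = np.hypot(x_next - x, y_next - y).sum()
    # Shoelace formula for the polygon area.
    area = 0.5 * abs(np.sum(x * y_next - x_next * y))
    return perimeter ** 2 / area
```

For a circle the compactness approaches its minimum value of 4π (about 12.57), while a square gives 16, which illustrates why larger values indicate a less round boundary.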
$$\mu_{A_i}(x) = \frac{1}{1 + \left| \dfrac{x - c_i}{a_i} \right|^{2b}} \qquad (7)$$
Layer 2: Every node in this layer is a fixed node labeled $\Pi$, whose output is the product of all the incoming signals:
$$O_{2,i} = w_i = \mu_{A_i}(x) \, \mu_{B_i}(y), \quad i = 1, 2 \qquad (8)$$
In general, any T-norm operator that performs fuzzy AND can be used as the node function in this layer.
Layer 3: Every node in this layer is a fixed node labeled N. The i-th node calculates the ratio of the i-th rule's firing strength to the sum of all rules' firing strengths:
$$O_{3,i} = \bar{w}_i = \frac{w_i}{w_1 + w_2}, \quad i = 1, 2 \qquad (9)$$
Layer 4: Every node i in this layer is an adaptive node with a node function
$$O_{4,i} = \bar{w}_i f_i = \bar{w}_i \, (p_i x + q_i y + r_i) \qquad (10)$$
where $\bar{w}_i$ is a normalized firing strength from layer 3 and $\{p_i, q_i, r_i\}$ is the parameter set of this node. Parameters of this layer are referred to as consequent parameters.
Layer 5: The single node in this layer is a fixed node labeled $\Sigma$, which computes the overall output as the summation of all incoming signals:
$$O_{5,1} = \sum_i \bar{w}_i f_i = \frac{\sum_i w_i f_i}{\sum_i w_i} \qquad (11)$$
3.4.1 Hybrid Learning Rule for Training ANFIS
The hybrid learning rule combines the gradient descent (GD) method and the least-squares estimator (LSE) for fast identification of parameters in the adaptive neuro-fuzzy model. In the hybrid learning approach, each epoch is composed of a forward pass and a backward pass, as shown in Table-1. The hybrid method converges much faster than the original pure back-propagation learning, since it reduces the dimension of the search space. The hybrid neuro-fuzzy approach is also robust and adaptive in noisy, uncertain environments. In the present paper we reduce the input feature vector size to 3, and two bell-shaped membership functions are assigned to each input variable. Thus the number of fuzzy if-then rules for ANFIS learning is $2^3 = 8$.
Table 1: Summary of hybrid learning procedure
#               Premise parameters   Consequent parameters   Signals
Forward pass    Fixed                LSE                     Node outputs
Backward pass   GD                   Fixed                   Error signals
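The five layers and the forward pass of the hybrid rule can be sketched in NumPy for the configuration described above (3 inputs, 2 bell-shaped membership functions per input, 8 rules). This is an illustrative sketch, not the authors' code: the function names and array layout are assumptions, each membership function is given its own exponent b, and the two-input consequent of Eq. (10) is extended to three inputs as a linear function of all inputs plus a constant. Only the forward pass (premise parameters fixed, consequent parameters by least squares) is shown; the gradient-descent backward pass is omitted.

```python
import itertools
import numpy as np

def bell(x, a, b, c):
    """Generalized bell membership function, as in Eq. (7)."""
    return 1.0 / (1.0 + np.abs((x - c) / a) ** (2.0 * b))

def normalized_firing_strengths(X, premise):
    """Layers 1-3. X: (n, d) inputs; premise: (d, 2, 3) array of (a, b, c)."""
    d = X.shape[1]
    # Layer 1: grade of each input in each of its two fuzzy sets, (d, 2, n).
    mu = np.array([[bell(X[:, j], *premise[j, k]) for k in range(2)]
                   for j in range(d)])
    # Layer 2: one rule per combination of fuzzy sets, product T-norm.
    rules = list(itertools.product(range(2), repeat=d))  # 2**d rules
    w = np.array([np.prod([mu[j, r[j]] for j in range(d)], axis=0)
                  for r in rules])
    # Layer 3: normalize so the strengths of all rules sum to 1 per sample.
    return (w / w.sum(axis=0, keepdims=True)).T  # shape (n, 2**d)

def design_matrix(X, w_bar):
    """Layer 4 regressors: w_bar_i * [x_1, ..., x_d, 1] for every rule i."""
    X1 = np.hstack([X, np.ones((X.shape[0], 1))])
    return (w_bar[:, :, None] * X1[:, None, :]).reshape(X.shape[0], -1)

def fit_consequents(X, t, premise):
    """Forward pass: premise parameters fixed, consequents solved by LSE."""
    Phi = design_matrix(X, normalized_firing_strengths(X, premise))
    theta, *_ = np.linalg.lstsq(Phi, t, rcond=None)
    return theta

def predict(X, premise, theta):
    """Layer 5: overall output, the sum of w_bar_i * f_i over all rules."""
    return design_matrix(X, normalized_firing_strengths(X, premise)) @ theta
```

Because the output is linear in the consequent parameters once the premise parameters are fixed, the least-squares step finds them in closed form. This is why the forward pass in Table 1 needs no iterative search for that half of the parameters.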
3.5 Decision Making Logic
Design of an appropriate decision rule is the most important step of a successful pattern recognition scheme. Here, tumor masses are classified by extracting shape features of the patterns. The proposed ANFIS model has been trained with round-shaped benign masses as indicated by the radiologists. Our objective is to classify each test mass as benign or malignant. For this purpose, we define a Euclidean distance function $\delta_1$ that measures the deviation from roundness:
$$\delta_1 = |D_1 - O_1| \qquad (12)$$
where $D_1$ is the desired output value of a benign mass and $O_1$ is the obtained output value of the test mass. The degree of malignancy is higher for a higher value of $\delta_1$. The decisions regarding the prognosis of test masses are defined below:
If $\delta_1 \le 20$, the shape and margin of the test mass are considered "Almost Round or Round to Oval Shape & Smooth Boundary" (Benign).
If $20 < \delta_1 \le 40$, the shape and margin of the test mass are considered "Lobulated & Non-Circumscribed Boundary" (Tendency towards Malignancy).
If $\delta_1 > 40$, the shape and margin of the test mass are considered "Irregular & Ill-defined Boundary" (Malignant).
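The decision logic reduces to a threshold function on the deviation. The sketch below assumes the deviation is the absolute difference between the desired and obtained outputs, and that a deviation of exactly 20 falls in the first class. The function name is hypothetical, and the returned labels are the ones used in the results table.

```python
def classify_mass(desired, obtained):
    """Map the deviation from the benign target to (shape, prognosis)."""
    # Assumption: deviation is the absolute difference of the two outputs.
    deviation = abs(desired - obtained)
    if deviation <= 20:
        return ("Almost Round or Round to Oval", "Benign Stage")
    if deviation <= 40:
        return ("Lobulated & Non-Circumscribed",
                "Tendency towards Malignant Stage")
    return ("Irregular & Ill-defined", "Possibly in Malignant Stage")
```

For example, `classify_mass(100, 70)` has a deviation of 30 and therefore falls in the lobulated class.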
4. Experimental Results
We have applied the proposed algorithm to a database of 200 images. The classifier was first trained with obvious benign masses identified by the radiologists, and the remaining non-obvious test cases were classified during the experiment. Segmentation and final classification of the tumor patterns as benign or malignant for a few non-obvious cases are given in Fig-5 and Table-2. The successful classification rate of the proposed methodology is almost 87%.
Table 2
       Decision on the Mass Shape       Final Decision on Prognosis
[1]    Almost Round or Round to Oval    Benign Stage
[2]    Irregular & Ill-defined          Possibly in Malignant Stage
[3]    Irregular & Ill-defined          Possibly in Malignant Stage
[4]    Irregular & Ill-defined          Possibly in Malignant Stage
[5]    Irregular & Ill-defined          Possibly in Malignant Stage
[6]    Lobulated & Non-Circumscribed    Tendency towards Malignant Stage
5. Discussion
The proposed methodology for breast tumor classification from mammograms is based on an adaptive neuro-fuzzy model applied to the extracted boundary of the lesion, which is the region of interest. The classification predicts the prognosis of the disease as tending towards benignancy or malignancy. The output node value of the classifier indicates the deviation, or distance function, of the test mass with respect to the trained benign masses. A genetic algorithm has been used to overcome the problem of over-learning and the chance of misclassification in feature extraction and representation for the adaptive neuro-fuzzy classifier. The performance of the proposed technique is satisfactory in 87% of cases. The result has been further verified by the physicians/radiologists.
Acknowledgement
The authors would like to thank Dr. S. K. Sharma of EKO X-ray and Imaging Institute, Kolkata. The authors also acknowledge CSIR, Govt. of India for financial support to continue this research work.
References
[1]. M. Bhattacharya, D. Dutta Majumder, Knowledge Based Approach to Medical Image Processing, in Pattern Directed Information Analysis (Algorithms, Architecture & Applications), publisher: New Age Wiley, 2008.
[2]. D. Dutta Majumder and M. Bhattacharya, Cybernetic Approach To Medical Technology: Application To Cancer Screening And Other Diagnostics, Millennium Volume of Kybernetes, International Journal of Systems & Cybernetics, MCB publications UK, vol. 29, no. 7/8, pp. 871-895, 2000.
[3]. D. H. Davies and D. R. Dance, Automatic computer detection of clustered calcifications in digital mammograms, Phys. Med. Biol., vol. 35, no. 8, pp. 1111-1118, 1990.
[4]. L. Shen, R. M. Rangayyan, and J. E. L. Desautels, Application of shape analysis to mammographic calcifications, IEEE Trans. Med. Imag., vol. 13, pp. 263-274, 1994.
[5]. J. Kilday, F. Palmieri, and M. D. Fox, Classifying mammographic lesions using computerized image analysis, IEEE Trans. Med. Imag., vol. 12, pp. 664-669, 1993.
[6]. A. J. Mendez, P. G. Tahoces, M. J. Lado, M. Souto, and J. J. Vidal, Computer-aided diagnosis: Automatic detection of malignant masses in digitized mammograms, Medical Physics, vol. 25, no. 6, pp. 957-964, 1998.
[7]. B. Sahiner, H. P. Chan, N. Petrick, M. A. Helvie, and L. M. Hadjiiski, Improvement of mammographic mass characterization using spiculation measures and morphological features, Med. Physics, vol. 28, pp. 1455-1465, 2001.
[8]. M. Bhattacharya and A. Das, Discrimination for Malignant and Benign Masses in Breast Using Mammogram: A Study on Adaptive Neuro-Fuzzy Approaches, Proc. of Indian International Conference on Artificial Intelligence (IICAI-07), Pune, India by Springer Link, pp. 1007-1026, 17-19 Dec. 2007.
[9]. J.-S. R. Jang, ANFIS: Adaptive-Network-based Fuzzy Inference Systems, IEEE Trans. Systems, Man and Cybernetics, vol. 23, no. 3, pp. 665-685, 1993.
[10]. L. M. Bruce and R. R. Adhami, Classifying Mammographic Mass Shapes Using the Wavelet Transform Modulus-Maxima Method, IEEE Trans. Medical Imaging, vol. 18, no. 12, pp. 1170-1177, 1999.
[11]. C. K. Abbey, R. J. Zemp, J. Liu, K. K. Lindfors, and M. F. Insana, Observer Efficiency in Discrimination Tasks Simulating Malignant and Benign Breast Lesions Imaged With Ultrasound, IEEE Trans. Medical Imaging, vol. 25, no. 2, pp. 198-209, 2006.