Professional Documents
Culture Documents
Abstract—Breast cancer is one of the most common causes caused by the digitization process image, it can be easily
of death among women globally. Accurate and early detection is removed by using image processing techniques depend on the
necessary for decreasing mortality and increase treatment spatial domain or frequency domain of an image [5]. Labels
success rates. Mammogram image is currently one of the best and small artifacts are removed by converting the image to a
ways to detect breast cancer in the early stages, but it contains binary format using an appropriate threshold and then
many artifacts such as noise, labels, and pectoral muscles, that arranging these components by area to isolate the small
must be deleted or suppressed because it greatly affects the spaces, including the labels [4]. The biggest obstacle is the
results of the diagnosis in the coming stages. Removing the removal of the pectoral muscle, whose presence significantly
pectorals muscle is the biggest problem because it possesses an
affects the results of the analysis image in the next stages, such
intensity tissue that closely resembles the tissue of fat, glands,
and tumors in the form of mammograms. In this paper, an
as segmentation and feature extraction. It appears as a dense
effective algorithm has been suggested by Hybridization area in the X-ray image and is sometimes similar to the rest of
Bounding Box and Region growing algorithm (HBBRG) the breast tissue, especially in the types of glandular and fatty,
algorithm to solve the problem of pectoral muscle removal so some algorithms used to suppression the pectoral muscles
which greatly affects the results of tumor detection in the next based on the threshold and intensity and morphological
stages by combines the Bounding Box (BB) and Region growing operation may not give good results in this case [6]. In this
(RG). To perform this work, pre-processing for mammogram paper, a hybrid method has been used to remove the pectoral
images was applied in two stages. In the first stage, a medium muscle based on the Bounding Box and Region growing
filter and binary image with a specific threshold were used to algorithms (HBBRG Algorithm) to obtain better results by
remove noise and label respectively. In the second phase, the taking e benefits of each method and reducing the defects
pectoral muscles were removed by applying the (BB) and (RG) existing.
algorithm separately, and then we proposed merging the two
methods to set up an HBBRG algorithm with the aim to get
better results for remove pectoral muscles. The proposed
algorithms were tested on all the Mammographic Image
Analysis Society (MIAS) database images, and the results
showed a significant advantage in the HBBRG algorithm
compared to other algorithms as it achieved results in over 98%
to completely remove the pectoral muscles of all types of images.
I. INTRODUCTION
Breast cancer infects nearly 8 percent of the world's
women throughout their lifetime. It ranked second among the
causes of female deaths worldwide. [1].According to Fig. 1. Shows types of noises observed in mammogram images and marked
statistical data conducted throughout the United States in with numbers as 1. Pictorial Muscle, 2.Tumor, 3.High Intensity, 4.Low
2015, breast cancer takes 29 percent of new cancer cases and Intensity, 5. Breast Part and 6. Background [4].
15 percent of cancer deaths [2]. Early and efficient detection
is the best way for limited deaths and successful treatment. II. RELATED WORK
Now, mammography is one of the best and most common Many image processing algorithms have been developed
methods for early diagnosis of breast cancer, it is a low dose by many researchers to suppress the pectoral muscles in the
X-ray that allows visualization of the Inner structure of the form of a mammogram. The most important research in this
breast [3]. A mammogram is a clinical image that contains field will be covered as follows:
many additions such as noise, labels, pectoral muscles, black In 2014 L. Aroquiaraj and K. Thangavel., they proposed
background, etc. that make it difficult to analyze and Hybrid Component Connected Labeling, Fuzzy, and Straight-
understand for diagnosis cancer in the early stages see Fig. 1. line technique to remove the pectoral muscles from the
Therefore, the suppression of these artifacts and the mammogram, to obtain only the breast image to be ready for
enhancement quality of an image is essential for the extraction the next treatment. The system was tested in 322 images from
Region of interest (ROI) [4]. Noises are most likely to be the MIAS database. Results indicated a 96% achieving for
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.
2020 International Conference on Computer Science and Software Engineering (CSASE), Duhok, Kurdistan Region – Iraq
removing pectoral muscles. The results indicated that this fast, powerful, easy to implement and can provide a high level
achieved good results, but it is considered complex of knowledge, On the other hand, she is suffering the problem
programmatically to be applicable[7] of automatically setting the seed point [15]. The steps of the
algorithm are explained in the following points:
In 2015 Sreedevi. S and Elizabeth. S., They used a
universality threshold for pectoral muscle removal and edge 1. At the start, a list of regions {Ri, i = 1, n} is formatted (n
detection techniques to define the breast area alone to remove represents a number of pixels in the image). Firstly each
pixels outside the breast area. Experimental results indicated region is consists of one pixel, so-named “seed point.
that the system received 90%. The results showed that you Mean value and neighboring regions are sitting for each
are the determinants of this method: cutting different parts of region.
the breast tissue beyond the pectoral muscle in some types of 2. For each region, Ri its neighboring regions N(Ri ) are
images[8] testing and the most similar neighboring region Rk ę
In 2016 Aziz .M and Bhagirathi .H ., They proposed a N(Ri ) is select and both regions are merged If D(Ri, Rk
method based on threshold techniques and modified region ) d T where Rk: the best neighbor of Ri, dist(Ri, Rk): is
growing to remove pectoral muscles, Also the proposed the Euclidian distance between the mean grey levels of
method shown that the Contrast Limited Adaptive Histogram the regions and T: threshold value
Equalization (CLAHE) and Wiener filter for enhancing the 3. Each time the area that is merged is deleted from the list
image quality. The system was tested on each MIAS database and mean value is updated
image with 98% accurate results [4]. 4. This process repeats even no joinable areas remain [16].
In 2017, Anitha. Immanuel.Sand J, Dinesh. P., in this VI. THE PROPOSED METHOD
work, the noise was removed by the application of the medium This paper focuses on removing the pectoral muscles to
filter 3 × 3, and then the labels are Marks suppressed by an obtain a better quality image that contains only the breast,
application global thresholding along with morphological helps in achieving more accurate diagnostic results and is
techniques like erode, dilation, open, and close. A single-seed ready for use in the next stages such as feature extraction and
region growing (SRG) has been utilized for pectoral muscle segmentation. Fig. 2 illustrates the proposed system stages and
removal [9]. the proposed algorithm for removing the pectoral muscles.
In 2018 K.U. Sheba and S. Gladston Raj2., in this research
pre-processing, have been implemented in two-phase. In the Start
first phase, the Median filter has been used for noise filtering,
noise filtering, global thresholding for small artifact and label
Image acquisition
removal and adaptive fuzzy logic based bi-histogram
equalization for enhanced image quality. In the second stage,
the pectoral muscles were suppressed by applying a Bounding L noise removal
Box technique[10].
III. DATASET USED Label removal
Due to personal privacy concerns and the difficulty of
obtaining real pictures for testing and analysis, Therefore the
Pectoral muscles removal By Hybrid Bounding Box
images available in the Mammogram image analysis and Region Growing (HBBRG) algorithm
Society(MIAS) were used for this purpose [11]. The image 3fffrgg
was presented by The United Kingdom national breast
screening program, and capturing from a film-screen
Extract ROI
mammographic. The database consists of 322 digitized
mammograms 208 normal and 114 abnormal, which
subdividing to 63 Benign and 51 malignant. The image of the
End
size is 1024 x 1024 pixels and format "PGM" [12].
Fig. 2. Block diagram of the proposed method
IV. MAGE SEGMENTATION (RG)
There are a lot of medical imaging methods Such as A. Noise Removal
mammography, Computed Tomography (CT) and magnetic Mammograms sometimes contain many errors such as
resonance imaging, which are widely used in hospitals and noise, scratches, weak borders, and poor contrast. This can
clinics. In general, image processing techniques such as result from fast or bad lighting conditions of the camera
feature extraction and segmentation are necessary to assist
shutter when taking an image. In this work, we used a 3 x 3
physicians and specialists to achieve more accurate diagnostic
results. Region growing One of the techniques that have median filters, it is a spatial domain filter work by replaces
proven successful in the analysis and partition of medical the pixels in a given block with the average values of those
images [13]. SRG is an algorithm that integrates adjacent pixels. It is easy to implement and gives good results to
pixels into larger and larger areas based on the similarity of suppress noise and reduce local contrast see ALGORITHM I.
pixel intensity by identifying the seed point as the primary B. Label Removal
point and maximum difference of intensity (threshold). The
algorithm tests the adjacent pixels of the seed point, then it Some mammogram image Contains a special mark (label
was decided whether to add to the seed point or not depend on ) for the hospital or clinic in which it was taken, it represents
the threshold that has been specified [14]. The algorithm is the High-Intensity artifacts. This also needs to be eliminated
147
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.
2020 International Conference on Computer Science and Software Engineering (CSASE), Duhok, Kurdistan Region – Iraq
to provide the correct interpretation of images. ALGORITHM the obstacles facing each algorithm separately. As an
I illustrates the process and Fig. 3 illustrates the results, this additional step, image enhancement techniques and
process has been used for the removal of the label and obtain morphological operation were used to fill the holes to remove
preprocessing images consist only of the breast and pectoral impurities and extreme edges for obtaining better results on
muscles. the mask, ALGORITHM VI illustrates the process and Fig. 6
illustrates the results.
Algorithm I: Remove label
Input: Image from MIA’s data base Algorithm II: Bounding Box algorithm (BB)
Output: Filtered image without label Input: preprocessing- image
Step 1: Apply 3*3 median filters. Output: image without pectoral muscle
Step2: convert the image to a binary forms with a Step1: Specify the breast orientation
Appropriate threshold. Step2: Create a zero array (SZ) with size equal to the
Step3: find the area for all components in the image were Size of the preprocessing image
Arranged from the largest to the smallest. Step3: find Bounding Box of the breast contour in the
Step4: Create Mask 1, which represents the two largest Mammogram represented Mask1 (Which contains
Areas in the image breast and background and Only the boundaries of the breast).
Exclude small areas including the label. Step4: Create a white mask (binary mask) called mask2
Step 5: multiply mask1 with filtered image to obtain the For the previous mask (mask1).
Pre-processing image. Step5: Find the upper triangle of the mask2 to create
Step 6: end BB mask by a specific diagonal related to the
Width of the image (k) and set it to ones value,
While the others is zeros.
Step6: Replace the BB mask with coordinates of the
Bounding box in the SZ array (mask3).
Step7: invert the values for all pixels of mask3 to form a
mask4, which has been used to remove muscle.
Step8: end
148
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.
2020 International Conference on Computer Science and Software Engineering (CSASE), Duhok, Kurdistan Region – Iraq
149
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.
2020 International Conference on Computer Science and Software Engineering (CSASE), Duhok, Kurdistan Region – Iraq
Mdb320
Mdb293
Mdb281
Mdb183
Mdb170
Mdb134
mdb168
Mdb75
Mdb21
Mdb9
HBBRG
results
Mammogram Image
TABLE II. THE DIFFERENCE BETWEEN THE REAL AREA WITH BB, RG
AND PROPOSED ALGORITHM, (IN PIXEL)
HBBRG
MIAS Image BB results[10] RG results[4,9]
results
Mdb9 26167 6230 5657
Mdb21 19404 8383 6961
Mdb75 21470 4590 8024
Mdb134 138918 7259 6030
Mdb170 24616 27695 14464
Mdb183 20654 98 1465
mdb168 20920 6343 5574
Mdb281 22104 71596 5577
Mdb293 11488 2174 4790
Mdb320 14841 21904 20314
Table II. Shows the Difference area between the real area of
the pectoral muscle that must be removed and the pectoral
muscles area removed by the proposed methods BB, and RG
algorithm. The results show that the HBBRG algorithm
achieved the lowest difference values for most images see
Fig. 9.
Fig. 7. Shows the comparison of the experimental results of the proposed
methods with the ideal results.
160000
Area in pixels
80000
MIAS Image BB results[10] RG results[4,9] HBBRG results 60000
Mdb9 27227 6356 5899 RG 40000
Mdb21 22312 8445 7373 results 20000
Mdb75 24528 5250 8094 0
Mdb320
Mdb293
Mdb281
Mdb183
Mdb170
Mdb134
Mdb21
mdb168
Mdb75
Mdb9
Table I.Show the Real area of the pectoral muscle and the
area removed by the proposed algorithms, BB and RG so that
the results of the HBBRG algorithm were closer to the Real
in most images as shown in Fig. 8.
150
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.
2020 International Conference on Computer Science and Software Engineering (CSASE), Duhok, Kurdistan Region – Iraq
results 0 (2014).
mdb168
Mdb9
Mdb75
Mdb21
Mdb134
Mdb320
Mdb293
Mdb281
Mdb183
Mdb170
151
Authorized licensed use limited to: University of Massachusetts Amherst. Downloaded on July 27,2020 at 13:05:23 UTC from IEEE Xplore. Restrictions apply.