
Data Mining in digital image processing using the Gabor filters algorithm

Oana Astrid Vatamanu (a), Mihaela Ionescu (a)

(a) Department of Medical Informatics,
University of Medicine and Pharmacy “Victor Babes”
Timisoara, Romania
astridv2002@yahoo.com

Abstract—This paper presents a method for the analysis and processing of digital images, with particular interest in the technique that uses the set of Gabor mathematical functions (Gabor wavelet). It also proposes a method of image indexing and searching based on the information contained in the image. This information is defined by a signature obtained using a Gabor filter. Medicine is one possible field of application, since the different types of images that appear in this area may be analyzed in this way: X-rays, ultrasounds, NMR, etc.

Keywords-Gabor; wavelet; image; digital; medicine; analysis

I. INTRODUCTION

During the past few years, digital images have become increasingly used, in step with the development of digital equipment that is cheaper and more accessible. The field of image analysis, especially texture analysis, has also received special attention. Since an image may be considered a mosaic of textures separated by certain borders, a pertinent analysis must comprise obtaining the textures, classifying them and segmenting the image. In order to classify and obtain textures, it is necessary to consider a function that measures the similarity between the given image and the images included in a database specifically selected for comparison. This function measures the distances between certain features of the textures. This is a two-step method: Feature Extraction (FE) and Similarity Measurement (SM). Until now, image comparison using signatures computed with Gabor filters was performed only for fragments of images, without taking into account the entire image [1].

We used the facilities of the .NET Framework, the Visual Studio programming environment and the C# programming language [2], as it is object oriented, easy to use for a desktop application, and a latest-generation programming language. Another advantage is that the code is easy to read and to “generate”, due to the multitude of predefined libraries offered by Microsoft [3].

II. DIGITAL IMAGES ANALYSIS AND PROCESSING

A digital image is a representation of a two-dimensional image, defined as a finite set of digital values named image elements or pixels. The digital image consists of a fixed number of pixel rows and columns. Pixels represent the smallest informational units of an image (usually associated with small dots) and contain quantifiable values for the brightness of the image dots. Normally, pixels are stored in memory as a two-dimensional matrix of small integer values. These values are transmitted or stored in compressed form.

Each pixel in an image is associated with a specific position in a two-dimensional area. It has a specific numeric value quantifying the image features (brightness, grey shades or color information) that define the corresponding image dot. Therefore, digital images may be classified according to the number and the nature of these samples: binary, grayscale, false-color, multispectral, thematic, graphical functions.

Even though the term “texture” refers to the properties of the external surfaces of objects and to the sensations induced by touch, it is sometimes used to describe non-tactile sensations. Textures may also represent sub-dimensioned models (especially in the case of non-tactile two-dimensional textures), where the individual elements are placed next to one another in order to generate indistinguishable models.

Color may be an integral part of the texture’s model, or it may be applied separately. In the latter case, a repetitive texture model may be used in a number of ways in computer graphics: green for grass, blue for water, grey for concrete, dark grey for nocturnal images, etc.

III. GABOR WAVELET

This paper presents texture analysis using Gabor filters, also evaluating the performance in identifying models in a large database of images representing textures. These images are compared with other representations of existing textures. A simple neural network algorithm is used to discover similarities in the segmentation of the texture space [4,5]. The performance of finding similar models increases significantly through the discovery of similarities [6]. An important aspect of this process is its use on real images. The extraction of texture patterns and the discovery of similarities are used for searching in high-dimensional digital images.
As mentioned, a function to measure the similarities is necessary; this function works in two steps: Feature Extraction (FE) and Similarity Measurement (SM).

As an example, we may consider two texture models: a weave model, which may be studied by frequency analysis, and a grass model, which may be analyzed through statistical descriptions [7]. Still, these models correspond to extreme cases that do not fit real images. The general FE and SM methods define constraints: the pattern must be small, it must correctly characterize the texture, and the numeric value that defines similarity must be accurate and small for similar features; conversely, a large value is needed in the other cases. Also, the features may be made invariant to translations and rotations, in order to accept two textures as equivalent when one derives from the other by translation or rotation [8].

Gabor functions are Gaussian functions modulated by complex sinusoids. In two dimensions, they may be defined as follows:

\[ g(x, y) = \frac{1}{2\pi\sigma_x\sigma_y} \exp\left[ -\frac{1}{2}\left( \frac{x^2}{\sigma_x^2} + \frac{y^2}{\sigma_y^2} \right) + 2\pi j W x \right] \tag{1} \]

The masks for the Gabor filters may be considered as orientation and scale regulators [9]. In a given region, statistics for these micro features may be used to characterize the information behind the texture. Gabor wavelets represent a class of this type of functions. The dictionary of similarity filters may be obtained by the corresponding dilation and rotation of g(x, y), through the generating function

\[ g_{mn}(x, y) = a^{-m}\, g(x', y'), \quad a > 1, \quad m, n \text{ integers} \tag{2} \]

\[ x' = a^{-m}(x\cos\theta + y\sin\theta), \qquad y' = a^{-m}(-x\sin\theta + y\cos\theta) \tag{3} \]

where \theta = n\pi / K, K being the number of orientations.

A measure of similarity in the feature space should capture the similarities between the initial image models [5]. Still, the scale of these values is often a subjective measure. A simple distance measurement, such as the Euclidean distance, may not preserve the initial similarity. Therefore, the computation of the right measure of similarity may be seen as a learning problem. The purpose of learning is to divide the original feature space into clusters with similar visual models. A large number of labeled images and the associated characteristic vectors are used during the last phase. When a texture model is presented, the network allocates a class, based on its characteristic vector. The final ordered set of results is then computed using the Euclidean distance within the same class [10].

Analysis based on texture properties is widely used in multimedia or image databases [1]. Usually, the images from these databases are very large, varying from several MB to hundreds of MB, which represents a serious challenge for image analysis and data visualization.

IV. EXPERIMENTAL RESULTS

In order to exemplify the use of the Gabor wavelet in image analysis, we built a tool consisting of two parts: a theoretical part and the program itself.

This tool was implemented in the C# programming language and it offers, besides the corresponding forms, certain classes, among them FeatureVector, GaborAnalyse and GaborFeatureExtraction. These classes define the objects used by the tool. By calling the corresponding method, we obtain the characteristic vector, which is later used for comparison.

Next, we exemplify the manner in which an image is interpreted. We used an ultrasound image (Figure 1). In order to reduce the analysis time, the image was resized to 200x250 pixels. The images received from the analysis laboratory all have the same size in pixels and are usually generated at very high resolution; to be displayed in the graphical user interface (GUI), each image is resized to fit the entire cell reserved for it. The way the images are displayed does not influence their analysis, as the tool analyzes the initial image files.

Figure 1. Ultrasound medical image

The workflow is the following: the image to be analyzed is chosen by pressing the corresponding button, and then the tool extracts its features and generates the characteristic vector using the Gabor wavelet and the “Energy Based” method, as well as the images corresponding to the filtering operation, in .png format.

The tool itself works with medical image databases (X-rays, ultrasounds, etc.). The databases are either newly created or already existing. In order to add an image to the database, it is necessary to press the “Add reference image” button (Figure 2), which triggers the extraction of features and the generation of the characteristic vector (using the same Gabor filters). Before performing these computations, the tool requires a description of the image in medical terms and a diagnosis as input data.
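The feature-extraction step can be illustrated with a minimal sketch. It is written in Python rather than the C# of the tool and is not the authors' implementation: the function names, the filter parameters (kernel size, sigma, W, a = 2) and the orientation angle θ = nπ/K are assumptions chosen for illustration. The paper does not detail the “Energy Based” method, so the sketch assumes one common choice, the mean and standard deviation of the response magnitude of each filter.

```python
import cmath
import math


def gabor_kernel(size, sigma_x, sigma_y, W, a=2.0, m=0, theta=0.0):
    """Sample g_mn(x, y) from equations (1)-(3) on a size x size grid (size odd)."""
    half = size // 2
    scale = a ** (-m)  # the factor a^(-m) in equations (2) and (3)
    norm = 1.0 / (2.0 * math.pi * sigma_x * sigma_y)
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotated and dilated coordinates, equation (3)
            xr = scale * (x * math.cos(theta) + y * math.sin(theta))
            yr = scale * (-x * math.sin(theta) + y * math.cos(theta))
            # Gaussian envelope times complex sinusoid, equation (1)
            envelope = math.exp(-0.5 * (xr ** 2 / sigma_x ** 2 + yr ** 2 / sigma_y ** 2))
            carrier = cmath.exp(2j * math.pi * W * xr)
            row.append(scale * norm * envelope * carrier)
        kernel.append(row)
    return kernel


def filter_magnitudes(image, kernel):
    """Response magnitude at every position where the kernel fits entirely.

    Uses plain correlation (kernel not flipped), which is enough for
    magnitude statistics in this sketch.
    """
    k = len(kernel)
    rows, cols = len(image), len(image[0])
    out = []
    for r in range(rows - k + 1):
        for c in range(cols - k + 1):
            acc = 0j
            for i in range(k):
                for j in range(k):
                    acc += image[r + i][c + j] * kernel[i][j]
            out.append(abs(acc))
    return out


def feature_vector(image, scales=2, orientations=4, size=5, sigma=1.5, W=0.25):
    """Concatenate (mean, std) of the response magnitude over the filter bank."""
    features = []
    for m in range(scales):
        for n in range(orientations):
            theta = n * math.pi / orientations  # assumed theta = n*pi/K
            kernel = gabor_kernel(size, sigma, sigma, W, 2.0, m, theta)
            mags = filter_magnitudes(image, kernel)
            mean = sum(mags) / len(mags)
            std = math.sqrt(sum((v - mean) ** 2 for v in mags) / len(mags))
            features.extend([mean, std])
    return features


# Tiny synthetic 8x8 "texture": vertical stripes with a period of 4 pixels
image = [[1.0 if (c // 2) % 2 == 0 else 0.0 for c in range(8)] for _ in range(8)]
vec = feature_vector(image)
print(len(vec))  # 16 values: 2 scales x 4 orientations x (mean, std)
```

A real ultrasound image of 200x250 pixels would be passed in the same way as a list of pixel rows; for images of that size, an FFT-based or library implementation of the filtering would be considerably faster than these explicit loops.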

Figure 2. Stages in image processing – loading the image database

The existing images in the database and the corresponding diagnoses can be viewed at any moment. A record can also be deleted from the database, if the user so desires. A record is the set of information that characterizes an image (name, the image itself, description, diagnosis).

After the new image is compared with the images already stored in the database, the following results are presented: the most similar image found in the Gabor database, the similarity percentage and a presumptive diagnosis (the one already known for the most similar image, Figure 3). If the user accepts the given result, a reference to the analyzed image is recorded in the database, in the corresponding category (Figure 4).

Figure 3. Stages in image processing – loading an image that will be compared with those from the database

Figure 4. Stages in image processing – recording the image to be included in the image database

The characteristic vectors obtained can also be exported in .xls format, in order to be processed and analyzed using other Data Mining tools.

A database of ultrasound images may be built using this procedure. Our database contains a total of 67 images for a total of 21 persons. For each person there are 2 to 5 images. The database images are indexed according to the signature of each image. Two types of tests were performed:

(A) A total of five people, whose ultrasound images are found in the database, were selected. Each of these people has a new ultrasound image. Using this set of images, the database was searched for persons with similar images. The results are presented in Table I.

The results (similarity percent) show that it is possible to find a similar image according to the information present in the image.

(B) A total of five people, whose scans are not in the database, were selected. Each of these people has an ultrasound image. For these people, the medical diagnosis is known. Using this set of images, the database was searched for persons with similar images. The results are presented in Table II.

The last row indicates whether the known diagnosis for each of these new people corresponds to that of the similar individual found. The results obtained show that it is possible to obtain a proper diagnosis of an unknown image by comparing the information present in an ultrasound image; in Table II, the diagnosis matched in two of the five cases.

TABLE I. THE MOST SIMILAR IMAGES FROM THE DATABASE FOR NEW ULTRASOUND IMAGES OF PEOPLE WHO HAVE IMAGES IN THE DATABASE

New image                               1     2     3     4     5
The most similar image from database    1     2     3     4     5
Similarity percent                      0.81  0.92  0.74  0.86  0.83
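
The search behind such a table can be sketched as a nearest-neighbour lookup over the stored characteristic vectors. This is a minimal Python illustration, not the tool's C# code: the record layout, the example vectors and the diagnosis labels are made up, and the mapping from Euclidean distance to a similarity score in (0, 1] is a hypothetical choice, since the paper does not state how the similarity percent is computed.

```python
import math


def euclidean_distance(u, v):
    """Euclidean distance between two characteristic vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def most_similar(query, database):
    """Return (record_id, diagnosis, similarity) of the closest stored record.

    database is a list of (record_id, diagnosis, vector) tuples.
    """
    best = min(database, key=lambda rec: euclidean_distance(query, rec[2]))
    distance = euclidean_distance(query, best[2])
    similarity = 1.0 / (1.0 + distance)  # hypothetical mapping, 1.0 for identical vectors
    return best[0], best[1], similarity


# Made-up records for illustration only
database = [
    (1, "diagnosis A", [0.2, 0.5, 0.1]),
    (2, "diagnosis B", [0.9, 0.1, 0.4]),
]
record_id, diagnosis, similarity = most_similar([0.2, 0.5, 0.1], database)
print(record_id, diagnosis, similarity)
```

The diagnosis of the returned record plays the role of the presumptive diagnosis, which the user can then accept or reject.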
TABLE II. THE MOST SIMILAR IMAGES FROM THE DATABASE FOR ULTRASOUND IMAGES OF PEOPLE WHO DO NOT HAVE IMAGES IN THE DATABASE

New image                               1     2     3     4     5
The most similar image from database    4     13    7     18    10
Similarity percent                      0.73  0.87  0.63  0.71  0.44
Similar diagnosis                       yes   no    no    yes   no

V. CONCLUSIONS

This paper proposes a method of image indexing and searching according to the information contained. This information is defined by a signature obtained using a Gabor filter. This method may be used in most fields that use image comparison, but it is mostly useful in medicine, where it may be developed towards an expert system, by combining the medical knowledge with mathematics.

REFERENCES

[1] B.S. Manjunath and W.Y. Ma, “Texture features for browsing and retrieval of image data”, CIPT TR-95-06, July 1995.
[2] D. Blythe, “Advanced Graphics Programming Techniques Using OpenGL”, Siggraph 1999.
[3] D.A. Forsyth and J. Ponce, “Computer Vision – A Modern Approach”, Prentice Hall 2003.
[4] J.G. Daugman, “Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression”, In: IEEE Trans. ASSP, July 1988, vol. 36, pp. 1169-1179.
[5] S.E. Grigorescu, N. Petkov and P. Kruizinga, “Comparison of texture features based on Gabor filters”, IEEE Trans. on Image Processing, 11 (10), 2002, 1160-1167.
[6] N. Petkov and E. Subramanian, “Motion detection, noise reduction, texture suppression and contour enhancement by spatiotemporal Gabor filters with surround inhibition”, Biological Cybernetics, 97 (5-6), 2007, 423-439.
[7] T. Chang and C.-C. Jay Kuo, “Texture analysis and classification with tree-structured wavelet transform”, In: IEEE Trans. Image Processing, October 1993, vol. 2, no. 4, pp. 429-441.
[8] G.M. Haley and B.S. Manjunath, “Rotation invariant texture classification using the modified Gabor filters”, In: Proc. IEEE Intl. Conf. on Image Processing '95, Washington D.C., October 1995, vol. I, pp. 265.
[9] A.D. Alexandrov, W.Y. Ma, A. El Abbadi and W.Y. Manjunath, “Adaptive filtering and indexing for image databases”, In: Proc. of SPIE, San Jose, CA, February 1995, vol. 2420, pp. 12-23.
[10] T. Kohonen, “The self-organizing map”, In: Proc. IEEE, 1990, vol. 78, no. 9, pp. 1464-1480.
