You are on page 1of 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/221296015

An enhanced shape descriptor based on radial distances

Conference Paper · November 2011


DOI: 10.1109/ICSIPA.2011.6144073 · Source: DBLP

CITATIONS READS
3 745

2 authors, including:

Tiagrajah v. Janahiraman
Aerodyne Group
17 PUBLICATIONS   94 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Intelligent systems in Turning operation View project

Face recognition View project

All content following this page was uploaded by Tiagrajah v. Janahiraman on 17 June 2019.

The user has requested enhancement of the downloaded file.


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

An Enhanced Shape Descriptor Based On Radial Distances


Tiagrajah V. J.1, Muhammad Razeen A. A. S.
Dept of Electronics and Communication Engineering,
College of Engineering, Universiti Tenaga Nasional,
Jalan IKRAM-UNITEN, 43000 Kajang, Selangor
1
tiagrajah@uniten.edu.my

Abstract new focus on developing image indexing techniques


which have the capability to retrieve image based on their
Contour and region based descriptors are among the contents. The technologies are now referred to as
main approaches in content based image retrieval Content-Based Image Retrieval (CBIR). CBIR is the
(CBIR). In the contour based approaches, Fourier retrieval of images based on visual features such as
Transform is applied on the shape signatures to obtain colour, texture and shape [2].
Fourier descriptors (FD), which has attractive properties Shape may be defined as the characteristic surface
such as rotational, scale, and translational invariance. configuration of an object, an outline or contour. It
However, the FD derived from different signatures can permits an object to be distinguished from its
have significant effect on the performance of retrieval. In surroundings by its outline. Shape retrieval involves three
this paper we introduce a new shape signature, which is primary issues which are shape representation, shape
called the radial distance with triangular centroid area similarity measure and shape indexing. Among them,
(RDTCA) signature. Based on our retrieval experiments shape representation is the most important issue in shape
using Kimia 99 and Kimia 216 dataset, FD derived from retrieval.
RDTCA signature outperformed the FD derived from Various shape representation methods, or shape
other signatures in terms of robustness and retrieval descriptors, exist in the literature are first classified into
performance. two categories which are region based and contour based.
In region-based techniques, all the pixels within a shape
Key Words: CBIR, Shape, Fourier descriptors, are taken into account to obtain the shape representation.
Retrieval. In boundary-based shape representation only the outer
boundary of the shape is used. Some of the region based
techniques are geometrical moments [1], pseudo-Zernike
1. Introduction moments [2], Zernike moments [3, 4] and Tchebichef
moments [5]. Many boundary-based techniques have been
Recent years there is the rapid increase of digital proposed in the literature, including Fourier Descriptors
images around the world due to the growing power of [6], wavelet descriptors [7], and Curvature Scale Space
workstations, decreasing storage and processing costs and (CSS) [8]. The most efficient boundary-based technique
the Internet. Variety of image application areas contribute was Fourier Descriptors (FDs). With Fourier descriptors,
to the generation of digital image. They have become global shape features are captured by the first few low
more and more dependent on the use of the generated frequency terms, while higher frequency terms capture
images. The use of the generated images can help produce finer features of the shape. Fourier Descriptors overcomes
new creations, save time and costs. However, instead of the weak discrimination ability of the moment descriptors
making things easier, the huge amount of digital images and the global descriptors and overcome the noise
stored around the world makes the utilization of images sensitivity in the shape signature representations. To
from existing database more difficult than ever. This is derive the FDs of an image, the 2-D image is converted to
due to the lack of a standard way of indexing and 1-D signature. Many signatures have been proposed in the
managing digital images. literature, including centroid distance (CD), complex
Traditionally, images are stored into database using coordinate (CC) and cord-length distance (CLD) [9]. In
textual information. The advantage of textual indexing of this paper I propose a new signature, namely Radial
image is that it can provide user with key word searching Distance with Triangular Centroid Area (RDTCA), and
and catalogue browsing. However, it has limitations. One compare it with other frequently used shape signature.
is that it is time consuming, when the database is large, it The paper is organized as follows: section 2 introduces
is almost impossible to manually annotate all the images. the proposed signature. Shape indexing using Fourier
The other is visual features of image are difficult to be Descriptors discussed in section 3. Experimental results
described using words [1]. Therefore, there has been a

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 472


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

are presented in section 4. Finally, conclusion is presented where 1 , 1 is the next boundary
in section 5. coordinate after point , , and , is the
centroid of the shape. As illustrated in Fig. 1, the
2. Radial Distance with Triangular Centroid signature is calculated by adding the Euclidian distances
Area Signature between point a and the centroid c and the triangular
centroid area formed by point a, the next boundary point
In this section a new shape signature is introduced, b and the centre of the object. The triangular centroid area
where the radial distance is combined with triangular integrates some information on the shape’s area together
centroid area, referred as RDTCA. with the radial distance, which enhances its robustness.
Fig. 2 shows the RDTCA signatures for shapes from 4
different classes.

(a)

Fig. 2 The RDTCA signature for shapes from four different


classes.

3. Derivation of Fourier Descriptors


Fourier descriptors are obtained by applying Fourier
transform on shape boundary usually represented by a
shape signature s(t), the Fourier transformed coefficients
are called the Fourier descriptors FDn, of the shape.
These descriptors represent the shape of the object in a
(b) frequency domain where the first few low frequency
Fig. 1 Example of the Radial Distance with Triangular Centroid terms capture global shape features of the object and
Area (RDTCA) signature. higher frequency terms capture finer features of the shape.

Initially, the preprocessing is performed by tracing the Discrete Fourier transform for s(t):
boundary of the object. The shape coordinates are
assumed to be (x(t), y(t)), t = 0, 1, …, N-1. The
1 2
computation of RDTCA signatures can be described by exp
the following equation. (2)

where 0,1, … , 1
| 1 1 | (1)

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 473


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

The coefficient bn, n= 0, 1, … N-1, are usually called


Fourier descriptors (FD) of the shape, denoted as FDn, n= Table 1. The first 8 feature vectors consist of Fourier
0, 1, ...., N-1. descriptors with RDTCA signatures of the image of hammer
Since RDTCA is relative to the centroid, FD are with rotational angle of 00, 900, 1800 and 2700.
translation invariant. In Fourier Transform, rotation in
spatial domain means phase-shift in frequency domain so
using magnitude values of coefficients make FD
rotationally invariant. Scale invariance is achieved by by
dividing the magnitude values of FD by the DC
component. The reason of choosing FD0 as the
normalization factor is because it is the average energy of
the signal. Since Radial Distance with Triangular
Centroid Area (RDTCA) is real valued, first half of FDs
are the same with second half. Therefore, first half of the
FD is sufficient to represent a shape. The normalized
invariant feature vector, f, used to index the shape is:
Table 2. The first 8 feature vectors consist of Fourier
descriptors with RDTCA signatures of the image of hammer
| | | | / with spatial resolutions ranging from 512×512 to 32×32.
, ,…, (3)
| | | | | |

Thus, , , ,…, (4)

The invariant features represent the shape of the object


in frequency domain, where the global features are
captured by the first few low frequency terms and finer
features are captured by the high frequency terms. In
some cases, noise component may be located in the higher
frequencies. Thus, selecting the first Nc (< M) terms
would be suitable for pattern classification tasks. An
image of hammer with different rotation angles (00, 900,
1800 and 2700) are shown in Fig. 3.

The standard deviation among the feature vectors, f,


are very low and significant difference can be observed
(a) (b) (c) (d) among the mean value of feature vectors. The mean value
of the feature vectors among the images are within the
Fig. 3. Image of hammer with rotational angles of 00, 900, range of 0.07 to 0.075 for different spatial resolutions,
1800 and 2700. except, when size of the image is reduced to 32 x 32, the
mean increases to 0.0806. Hence, low spatial resolution
The first 8 FD values of the RDTCA signatures of the seems to produce feature vectors which are less
images of hammer are tabulated in Table 1. Results show prominent. This is due to images with small resolution
that the standard deviations are very low among the will produce less number of boundary coordinates.
feature vector , , ,…, for different The feature vectors were also calculated on images
rotational angles. This is due to the property of DFT of belonging to 4 different classes (fish, cow, human and ray
being invariant to shapes which are rotated. In order to fish) to analyse its ability to produce discriminative
analyse the effect of changing spatial resolution, the features among the classes. A plot of the magnitude of the
invariant feature vector were calculated on the same feature vector consisting the first 10 Fourier descriptors
image of hammer by varying the spatial resolutions: are shown below each image in Fig 4. It can be observed
512x512, 256x256, 128x128, 64x64 and 32x32. Table 2 that the magnitudes for within class images are similar
shows the result of the first 8 normalised Fourier and a significant difference can be noticed for images
descriptor with RDTCA signatures. from different classes.

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 474


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

Fig. 4 The Fourier Descriptors for shapes from four different Fig. 5 The database of 99 shapes.
classes.
Kimia 216 database constructed from samples of a
For shape retrieval tasks, the query shape indexed by very large database of shapes created for testing the
normalised FD features are represented as fm= [fm1, fm2 , compression rates for MPEG7, provided by Latecki et al.
…, fmNc] and a target shape indexed by normalized FD [12]. This database consists of 18 categories: bones,
features are represented as fd = [fd1, fd2 , …, fdNc], the glasses, and so on with 12 shapes in each category, as
Euclidean distance between the two feature vectors can be shown in Fig 6.
calculated by:

/ (5)

4. Experiment result
The experiment results for shape retrieval are
composed of two major tasks: the feature extraction and
image recognition. In the feature extraction process, the
feature vector of a shape based image is obtained from the
Fourier Descriptors of the RDTCA signature. Then, the
recognition is performed by calculating the recognition
rate, which compares the accuracy of the features, in
terms of retrieval the same type of images from a
database.
To evaluate the performance of the proposed method,
experiments are conducted on two shape databases
(Kimia 99 and Kimia 216) provided by Kimia’s group
[10, 11]. In Kimia 99 dataset there are 99 shapes in total
from nine categories, including fish, rabbits, airplanes,
grebes, wrenches, hands, humans, quadrupeds and rays.
Each category contains 11 samples. These samples have
variations in form, or are exposed to other influences such
as occlusion, articulation and missing parts [10]. Some Fig. 6 The database of 216 shapes selected from the MPEG-7
samples of shape images from this database are shown in test database [15].
Fig. 5.

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 475


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

An experimental method recommended in [13], 5. Conclusion


suggests that each shape is used as a model to be matched
against all the others, and the results are ordered by the A new Fourier descriptors based on contour based
distances between each pair. The nearest matches by shape signature was presented in this paper. The proposed
category and the corresponding recognition rates based on shape signature is obtained by applying the Fourier
n-th retrieval are presented in Table 3 and Table 4. The Descriptors using radial distance with triangular centroid
proposed RDTCA signature is compared with the other area. The proposed method is compared with other shape
best performing signatures, such as centroid distance and signatures such as complex coordinate signature, centroid
complex coordinate signatures and also the recently distance signature and template-based shape
proposed template based shape representation technique representation technique using two shape databases
[14]. (Kimia 99 and Kimia 216) provided by Kimia’s group
[10, 11]. The experiment results show that the proposed
Table 3. Recognition rate of the top 10 retrieval based on method and the centroid distance signature yield
Kimia 99 database comparable results in a small database. However, the
proposed signature performs the best in larger databases.
This is due to the ability of RDTCA signature to capture
both the shape boundary and area information of the
object.

6. Reference
[1] M.Hu, ‘Visual Pattern Recognition by Moment Invariants,”
IRE Transactions on Information Theory, vol. IT-8,pp.179-187,
1962.
Based on Table 3, results shows that the proposed [2] S. O. Belkasim, M. Shridhar, and M. Ahmadi, "Pattern
method performed very similar to centroid distance but Recognition with Moment Invariants: A Comparative Study and
outperformed the complex coordinate and template based New Results," Pattern Recognition, vol. 24, pp. 1117- 1138,
1991.
shape representation.
[3] M. Teague, “Image analysis via the general theory of
moments,” Journal of the Optical Society of America, vol. 70,
Table 4. Recognition rate of the top 10 retrieval based on pp.920-930 1980.
Kimia 216 database [4] A. Khotanzad, “invariant Image Recognition by Zernike
Moments,” IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 12, pp. 489-497, 1990.
[5] R. Mukundan, S. H. Ong, and P. A. Lee, "Image Analysis
by Tchebichef Moments," IEEE Transactions on Image
Processing, vol. 10, pp. 1357-1364, 2001.
[6] T. P. Wallace and P A Wintz, "An efficient three
dimensional aircraft recognition algorithm using normalized
Fourier descriptors," Computer Graphics and Image Processing,
vol. 13, pp. 99-126, 1980.
[7] G. Chauang and C. Kuo, " Wavelet descriptor of planar
curves: Theory and applications," IEEE Transaction on Image
Results from Table 4 shows that the proposed method Processing, vol. 5, pp. 56-70, 1996.
performed better than centroid distance in the 2nd [8] F. Mokhtarian and A. Mackworth, "Scale-based description
retrieval by 1% and all the three methods performed the and recognition of planar curves and two-dimensional shapes,"
same at the 1st retrieval with 100% accuracy. From the IEEE Transactions on Pattern Analysis and Machine
3rd to 11th, the proposed method performed the best at Intelligence, vol. 8, pp. 34-43, 1986.
the 8th, 9th and 11th retrieval and the centroid distance [9] D. S. Zhang and G. Lu, "Study and Evaluation of Different
performed the best at the 4th and 10th retrieval. In overall, Fourier Methods for Image Retrieval," International Journal of
the proposed method performed better than other Computer Vision, vol. 23, pp. 33-49, 2005.
methods. Whereas complex coordinate signature’s [10] T. B. Sebastian, P. N. Klein and B. B. Kimia. “Recognition
of Shapes by Editing Their Shock Graphs”, PAMI, 26(5):550-
performed the worse. The centroid distance and radial 571, 2004.
distance with triangular centroid area signatures have [11] Z. Tu and A. L. Yuille. “Shape Matching and Recognition-
comparable results in smaller database. However, in the Using Generative Models and Informative Features”, ECCV,
case of large database, the RDTCA performs better than 3:95-209, 2004.
the centroid distance. [12] L.J. Latecki, R. Lakamper, and U. Eckhardt, “Shape
Descriptors for Non-Rigid Shapes with a Single Closed

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 476


2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA2011)

Contour,” IEEE Conf. Computer Vision and Pattern


Recognition, pp. 424-429, 2000.
[13] E. Milios and E. G. Petrakis, “Shape retrieval based on
dynamic programming,” IEEE Trans. Image Process., vol. 9,
no. 1, pp. 141–146, Jan. 2000.
[14] Yasser Ebrahim, Maher Ahmed, Siu-Cheung Chau and
Wegdan Abdelsalam, “Image analysis and recognition”,
Portugal: Springer Berlin / Heidelberg, 2008.
[15] L.J. Latecki, R. Lakamper, and U. Eckhardt, “Shape
Descriptors for Non-Rigid Shapes with a Single Closed
Contour,” IEEE Conf. Computer Vision and Pattern
Recognition, pp. 424-429, 2000.

978-1-4577-0242-6/11/$26.00 ©2011 IEEE 477

View publication stats

You might also like