Utilizing the benefits of sparse coding, the proposed system introduces a face detection scheme based on an inverted index generated from sparse representation. This work is intended to identify the problems in face image retrieval and to introduce a comprehensive coding technique for the face retrieval application. Further, the performance is improved by incorporating face attribute information into this algorithm. Here a retinal neural network inspects small regions of a given image and checks for a face. The neural networks are applied directly to the input image portions and the results are arbitrated [2].

The initial step in face detection is the segmentation of human skin to determine probable regions corresponding to human faces. Segmentation utilizes a model-based method to represent and differentiate skin colour from background colours. From the initial segmentation outputs, human faces are separated by using adaptive shape analysis. Each human face is located using a view-based face detection methodology. The Eigen maps learning method provided a robust representation under varying illumination conditions. Instead of intensity values, gradient orientation was considered. Han et al. [3] implemented a cosine kernel to generate image correlation to ensure robustness towards noise and other outliers. Su et al. [4] utilized the Dual Tree Complex Wavelet Transform (DTCWT) to extract image features. An extreme learning machine (ELM) is assigned to classify images. In ELM, iterative modification of weights is unnecessary and it is much faster than the gradient descent method proposed by Nikan et al. [5]. A proper number of DCT coefficients are discarded to reduce the illumination effect on the image, as in Nabatchian et al. [6]. Yang et al. [7] discussed the generation of dual eigenspaces by extracting the algebraic features of samples and performed face identification using a dual-layer minimum distance classifier.

The remainder of this paper is arranged as follows. Section II presents the proposed face detection and skin tone recognition scheme in detail. The experimental results with output images and performance analysis are illustrated in Section III. Finally, Section IV provides the conclusion for this paper.

II. METHODOLOGY

The workflow of the proposed facial image retrieval system is delineated in Fig. 1. A new face detection algorithm combining Haar wavelets and an Adaptive Boosting (AdaBoost) classifier is applied to all images in the database to find the face location. Along with face detection, the proposed system finds local patches of the face (eyes, nose, and mouth).

Fig. 1. Block Diagram of the Proposed System

Facial attributes such as face colour and patch similarity are determined separately for all images. Image patches are extracted and uniform LBP feature descriptors are calculated for use as local features. These local feature descriptors are then quantized to generate code words using optimal sparse coding. This is performed in the training stage of the retrieval system. For efficient detection, an attribute-embedded inverted index is then created. When a query image is given, the above procedures are followed to generate sparse code words. These code words, together with binary attributes, are used to retrieve images. Therefore, inverted indexing is done in the testing stage of these systems.
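The retrieval step described above can be illustrated with a small sketch. It assumes that each image has already been reduced to a set of integer code word identifiers and a bit field of binary attributes; the class name, method names, and the Hamming-distance filter are illustrative assumptions, not details given in this paper.

```python
from collections import defaultdict


def hamming(a, b):
    """Number of differing bits between two attribute bit fields."""
    return bin(a ^ b).count("1")


class AttributeInvertedIndex:
    """Hypothetical attribute-embedded inverted index (illustrative only)."""

    def __init__(self):
        # code word id -> list of (image_id, attribute_bits)
        self._postings = defaultdict(list)

    def add(self, image_id, codewords, attribute_bits):
        """Index one database image under each of its distinct code words."""
        for cw in set(codewords):
            self._postings[cw].append((image_id, attribute_bits))

    def query(self, codewords, attribute_bits, max_distance=0):
        """Rank images by shared code words, keeping only images whose
        attributes lie within max_distance bits of the query attributes."""
        scores = defaultdict(int)
        for cw in set(codewords):
            for image_id, bits in self._postings.get(cw, []):
                if hamming(bits, attribute_bits) <= max_distance:
                    scores[image_id] += 1
        # Highest score first; ties broken by image id for determinism.
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
```

Storing the attribute bits inside each posting lets the attribute filter run during the index lookup, so images with mismatched attributes are discarded before any similarity score is accumulated.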
14250
Published by: The Mattingley Publishing Co., Inc.
March-April 2020
ISSN: 0193-4120 Page No. 14249 - 14255
A. Face Detection
The proposed face detection method has three main steps. In the first step, an image descriptor named the integral image is computed for faster feature evaluation. The integral image is calculated using point operations on the image. The next step is to construct a classifier by selecting a small number of features using AdaBoost [8]. In a sub-image, the number of Haar features is higher than the total number of image pixels. The third step is to successively combine increasingly complex classifiers in a cascade to improve the speed of the face detector by targeting prominent image areas.

Fig. 3. Integral Image

The integral image is given by the equation

$$I(x, y) = \sum_{x' \le x,\; y' \le y} i(x', y') \tag{1}$$

$$p(x, y) = p(x, y - 1) + i(x, y) \tag{2}$$

$$I(x, y) = I(x - 1, y) + p(x, y) \tag{3}$$

where i(x, y) is the pixel value of the input image and p(x, y) is its running sum along y.
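As a minimal sketch of recurrences (2) and (3), the following pure-Python code builds the integral image in a single pass and then sums an arbitrary rectangle with four lookups. The function names and the zero boundary values for p(x, -1) and I(-1, y) are assumptions made for illustration.

```python
def integral_image(img):
    """Integral image I built from recurrences (2) and (3).

    img[y][x] holds the pixel value i(x, y). Values outside the image
    are treated as zero (an assumed boundary condition).
    """
    h, w = len(img), len(img[0])
    p = [[0] * w for _ in range(h)]  # running sum along y, Eq. (2)
    I = [[0] * w for _ in range(h)]  # integral image, Eq. (3)
    for y in range(h):
        for x in range(w):
            p[y][x] = (p[y - 1][x] if y > 0 else 0) + img[y][x]
            I[y][x] = (I[y][x - 1] if x > 0 else 0) + p[y][x]
    return I


def rect_sum(I, x0, y0, x1, y1):
    """Sum of pixels in the inclusive rectangle [x0..x1] x [y0..y1]."""
    total = I[y1][x1]
    if x0 > 0:
        total -= I[y1][x0 - 1]
    if y0 > 0:
        total -= I[y0 - 1][x1]
    if x0 > 0 and y0 > 0:
        total += I[y0 - 1][x0 - 1]
    return total
```

Because every rectangle sum costs at most four lookups regardless of its size, a Haar feature built from two or three rectangles can be evaluated in constant time.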
In the testing stage, when a new image arrives, the strong classifier classifies it. A series of AdaBoost classifiers is combined to form a chain of filters, as shown in Fig. 4. Each of these filters is an AdaBoost classifier with a minimum number of weak classifiers.

The binary image consists of 1s (skin) and 0s (non-skin). Morphological operations are then applied to separate the loosely connected (overlapped) skin areas. Initially, closing is performed on the binary image. The dilated image is then multiplied by the segmented output to retain the number of holes. In the later stage, these holes are used to filter out some non-face regions. Finally, the cross-correlation between the grayscale region and a template face is obtained. By choosing a suitable threshold, faces are classified as black or white.
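The template-matching step can be sketched as follows. This example uses zero-mean normalized cross-correlation, which is one common variant and not necessarily the exact form used in this work; the function names and the threshold value of 0.6 are hypothetical.

```python
import math


def normalized_cross_correlation(region, template):
    """Zero-mean normalized cross-correlation of two equal-size
    grayscale patches, each given as a list of rows. Result is in [-1, 1]."""
    a = [v for row in region for v in row]
    b = [v for row in template for v in row]
    if len(a) != len(b):
        raise ValueError("patches must have the same number of pixels")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    den = math.sqrt(
        sum((u - mean_a) ** 2 for u in a) * sum((v - mean_b) ** 2 for v in b)
    )
    # A flat patch has zero variance; report no correlation in that case.
    return num / den if den > 0 else 0.0


def matches_template(region, template, threshold=0.6):
    """Accept the region when its correlation reaches the threshold
    (0.6 is an assumed value, not one reported in the paper)."""
    return normalized_cross_correlation(region, template) >= threshold
```

Subtracting the means and dividing by the patch energies makes the score insensitive to uniform brightness and contrast changes, which is why a single fixed threshold can be applied across candidate regions.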
$$\sum_{l} s(t_l^{*})\, d(t_l^{*})$$
A major limitation of the LBP operator is the inability of its small 3×3 kernel to capture prominent features present in large structures. The face image is initially split into small regions and LBP features are extracted. These features are concatenated to form a feature histogram. In LBP, face images are considered a combination of micro-patterns that are invariant to grey-scale transforms. A global representation of the facial image is obtained by combining these micro-patterns. An LBP is considered uniform if it contains at most two bitwise transitions from 0 to 1 or from 1 to 0. An example of LBP is shown in Fig. 8.

MATLAB is used as the simulation tool to obtain the results. Images from the LFW dataset are used as input images for the experiment. In the LFW dataset, the face of the person in each image is labelled with a name. The local image patches are extracted and LBP features are obtained. Skin colour is considered an attribute of the faces and is classified. The simulation results obtained for the LFW dataset are given below.
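A minimal sketch of the basic 3×3 LBP operator and the uniformity test is given below. It assumes the standard definition, in which a pattern is uniform when its circular bit string contains at most two transitions; the clockwise neighbour ordering and the function names are illustrative choices.

```python
def lbp_code(patch):
    """8-bit LBP code of a 3x3 patch given as a list of rows.

    Each neighbour contributes a 1 bit when it is >= the centre pixel.
    Neighbours are read clockwise starting at the top-left corner.
    """
    c = patch[1][1]
    ring = [patch[0][0], patch[0][1], patch[0][2], patch[1][2],
            patch[2][2], patch[2][1], patch[2][0], patch[1][0]]
    code = 0
    for v in ring:
        code = (code << 1) | (1 if v >= c else 0)
    return code


def transitions(code, n_bits=8):
    """Count 0->1 and 1->0 changes around the circular bit string."""
    bits = [(code >> k) & 1 for k in range(n_bits)]
    return sum(bits[k] != bits[(k + 1) % n_bits] for k in range(n_bits))


def is_uniform(code):
    """A pattern is uniform when it has at most two circular transitions."""
    return transitions(code) <= 2


def lbp_histogram(img):
    """256-bin histogram of LBP codes over the interior pixels of img."""
    hist = [0] * 256
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            patch = [row[x - 1:x + 2] for row in img[y - 1:y + 2]]
            hist[lbp_code(patch)] += 1
    return hist
```

Computing one such histogram per face region and concatenating them yields the feature histogram described above.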
To obtain the skin colour, the segmented image is initially considered. Then the input gray image is extracted along with the template and the face region. The average gray level values are obtained and compared with a predefined threshold. For a given input image, the segmented image and matched output are shown in Fig. 11. The skin tone detection output is illustrated in Fig. 12.

Fig. 11. (a) Input Image, (b) Skin Segmented Image, (c) Template Matched Output

The means of the performance parameters are calculated to compare the performance of the proposed system with existing systems. The average accuracy of the proposed system is 98.65%, which is higher than the accuracy of existing face detection methods. The accuracy obtained using the combination of Principal Component Analysis (PCA) and Artificial Neural Network (ANN) is 96.94%, which is slightly lower than the accuracy of the proposed method. The average specificity of the proposed system is 98.11%, and all other methods exhibit lower specificity during detection. The specificity obtained using Support Vector Machine (SVM) is 74.86%, which is much lower than the specificity of the proposed method. The average sensitivity of the proposed method is 99.29%, which is higher than the sensitivity of the PCA-ANN method. Table 1 shows a detailed comparison of performance parameters for various face detection methods, and it is graphically represented in Fig. 13.
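For reference, the three performance parameters can be computed from confusion counts as sketched below. The standard definitions are assumed, since the paper does not state its formulas, and the function names are hypothetical.

```python
def detection_metrics(tp, tn, fp, fn):
    """Return (accuracy, specificity, sensitivity) in percent.

    tp, tn, fp, fn are the counts of true positives, true negatives,
    false positives, and false negatives.
    """
    total = tp + tn + fp + fn
    if total == 0 or (tn + fp) == 0 or (tp + fn) == 0:
        raise ValueError("need at least one positive and one negative sample")
    accuracy = 100.0 * (tp + tn) / total
    specificity = 100.0 * tn / (tn + fp)   # true negative rate
    sensitivity = 100.0 * tp / (tp + fn)   # true positive rate
    return accuracy, specificity, sensitivity


def mean_metrics(runs):
    """Average each metric over several runs of (tp, tn, fp, fn) counts."""
    per_run = [detection_metrics(*run) for run in runs]
    return tuple(sum(values) / len(per_run) for values in zip(*per_run))
```

For example, `detection_metrics(90, 80, 20, 10)` gives an accuracy of 85%, a specificity of 80%, and a sensitivity of 90%.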
Fig. 12. Skin Tone Detection Output in Command Window

Table 1. Performance of Face Detection Algorithms

Method      Accuracy (%)   Specificity (%)   Sensitivity (%)
SVM         78.95          74.86             77.46
KNN         80.91          79.93             79.97
FFNN        85.50          61.62             87.60
PCA-ANN     96.94          90.10             98.28
IPSONN      86.12          93.27             96.32
Proposed    98.65          98.11             99.29

Fig. 13. Comparison of Classification Performance

V. CONCLUSION

Here, face detection is carried out using optimal sparse coding. It provides high detection accuracy while minimizing computation time. To improve retrieval results, facial skin colour based attribute classification is also performed with low-level features. Facial attributes provide prominent morphological descriptions of an individual. Most of these attributes remain the same over a lifetime. LBP is tolerant to illumination fluctuations and has reduced computational complexity. However, LBP facial features are viewed as a combination of distinct tiny patterns. Optimal sparse code words for the input images are determined and the faces are retrieved based on similarity measures.

REFERENCES

[1] N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar, “Describable visual attributes for face verification and image search,” IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), Special Issue on Real World Face Recognition, Oct. 2011.