Professional Documents
Culture Documents
Abstract—Facial expression is an effective method of non- Usually, seven basic facial expressions are considered while
verbal communication and expressing feelings. A method dealing with FER problems and they are neutral, angry, fear,
to recognize emotions through facial expressions is happy, sad, surprise, disgust. Due to its immense applications,
proposed in this paper. After some preprocessing of the FER has been an active research topic in the field of computer
input image, the facial region is segmented into four vision and human-computer interaction for the last few
expression regions according to the highly effective decades. Many efforts have been made and many are still being
proposed segmentation method. Features from these made to develop a robust and accurate system. Basic steps of
segmented parts are extracted using a fusion of Histogram FER include image preprocessing, analyzing action units,
of Oriented Gradients (HOG) and Local Binary Patterns segmenting facial image, feature extraction, and classification.
(LBP). Reduction of the dimension of the feature vector is Popular feature extraction techniques include Gabor
done using Principal Component Analysis (PCA). For wavelets, Linear Discriminant Analysis (LDA), Principal
classifying the features and thus the expression images, Component Analysis (PCA), Speeded-Up Robust Features
multiclass Support Vector Machine (SVM) is used. The (SURF), moments, Scale Invariant Feature Transform (SIFT),
performance of the proposed method is measured using Gray Level Co-occurrence Matrix and many others. But each
three publicly available and highly used datasets (JAFFE, has some drawbacks. For example, 2D Gabor wavelets work
CK+, RaFD). Finally, achieved performance is compared well in FER problems but their dimensionality is a reason for
with performance on these datasets by other available not choosing them as the feature extraction technique. For
methods to indicate that the proposed method succeeds in extracting features, the proposed method uses a fusion of
achieving state-of-the-art performance. Histogram of Oriented Gradients (HOG) and Local Binary
Patterns (LBP). Combination of HOG and LBP has been
Index Terms— Facial Expression Recognition (FER), PCA, successfully used in many FER systems [2]. Different
Emotion Recognition, Image Segmentation, Fusion of HOG and classification algorithms have been used for classifying as
LBP, Multiclass Support Vector Machine (SVM). well. Nearest neighbour classification [3], Artificial Neural
Networks (ANN) [4], Extreme Learning Machine [5], random
I. INTRODUCTION forests [6], AdaBoost classification [7], Support Vector
Facial expressions are helpful in expressing the feelings of Machine (SVM) [6], [8-9] are among the few worth
a person. Mental condition, sentiment can also be analyzed mentioning.
using facial expression. So emotion recognition is highly Classifiers used in FER systems have some issues. Proper
dependent on facial expression recognition. Facial expressions distance metric, the number of neighbours to be considered,
can play a vital role in communicating nonverbally. In a classic inefficient memory usage are challenges with nearest
work [1] A. Mehrabian showed that facial expression neighbour classification. ANN requires long training time, a
contributes 55% to the speaker’s message, which is higher than lot of training samples and many parameters to be tuned. An
vocal and textual information’s contribution. So a speaker’s optimal number of nodes in hidden layers and overfitting are
feelings, emotion can be successfully analyzed if the facial issues with ELM. Visualization is a problem with random
expression of the speaker can be analyzed successfully. forests. Outliers and noisy data are challenging while using
Successfully recognized expressions can be used in many AdaBoost. To solve the problems mentioned, multiclass SVM
sectors of our lives for the betterment of everyday experience. is used for the classification task by the proposed method.
It can be used for security purposes, robots can be given this SVM is highly effective in high dimensional spaces and
ability to enhance their performance, automated machines can images usually have a lot of features. SVM is memory
be articulated with expression analyzing feature, expressions efficient, can separate linearly inseparable data, can be used
can be analyzed to allow or prohibit a person from doing with different kernel functions as well as custom kernel
crucial tasks and in many other sectors this facial expression functions. These attractive features forced to select SVM as the
recognition (FER) can be used. One of the most probable classification algorithm. The next section of this paper briefly
future use can be in social networks where the user can be describes the proposed method and the subsequent sections
suggested to post status depending on the expression of the describe each section of the proposed method in detail with
uploaded image. As the world is heading towards automation, implementation up to some extent. Last few sections of the
so the ultimate goal is to recognize expressions flawlessly and paper are dedicated to result analysis, state-of-the-art
spontaneously using machines, as humans can do. comparison and conclusion.
II. PROPOSED METHOD width, height values to segment the four parts from a 150×150
An input image, which may be colour or grayscale, is taken facial image are mentioned in TABLE I.
TABLE I
as the input of the system. Then it is converted to grayscale if VALUES FOR SEGMENTATION
it is a colour one. Then only the face region is detected from Facial Parts Coordinate (x,y) Width (w) Height (h)
the image using Viola-Jones face detection method [10] if Right Eye 23.55,46.58 46.01 30.67
there is any face in it. Then the detected facial region is resized Left Eye 88.29,46.58 44.00 29.15
to a fixed size for using it conveniently in the upcoming steps. Nose 54.33,81.84 45.43 38.00
These three steps are altogether defined as image Mouth 50.24,114.0 57.00 34.18
preprocessing. The resized image is then segmented into four
facial expression regions (right eye, left eye, nose, mouth) These values are defined by analyzing many facial images
according to the proposed image segmentation method. and the position of these four parts in these images. A
Features are extracted from the segmented parts using both challenge was to segment these four parts as accurately as
HOG and LBP. The dimension of the feature vector is reduced possible with the least possible dimensions. When these values
using PCA. Finally, multiclass SVM is used to train the system are applied to an image of size 150×150, the image gets
with some of the images and the rest of the images are used to segmented into four parts as illustrated in Fig. 3.
test the system. The methodology is illustrated in Fig. 1.
Image Preprocessing
Conversion Face Image
Image
to Grayscale Detection Resizing
PCA HOG+LBP
Image
(Dimension (Feature
Reduction) Extraction)
Segmentation Fig. 3 Proposed image segmentation method on a block of size 150×150.