Professional Documents
Culture Documents
Hand Gesture Recognition Based On Convolution Neural Network CNN and Support Vector Machine SVM
Hand Gesture Recognition Based On Convolution Neural Network CNN and Support Vector Machine SVM
Malaysia
School of Electrical Engineering Wireless High Speed Network Research Wireless High Speed Network Research
College of Engineering Group (WHiSNet) Group (WHiSNet)
Universiti Teknologi MARA School of Electrical Engineering School of Electrical Engineering
Shah Alam, Selangor, Malaysia College of Engineering College of Engineering
afiqre1998@gmail.com Universiti Teknologi MARA Universiti Teknologi MARA
Shah Alam, Selangor, Malaysia Shah Alam, Selangor, Malaysia
farahy@uitm.edu.my* roslina780@uitm.edu.my
Abstract— Gestural communication is a type of nonverbal computer's intelligence and make it easier for people to
communication in which visible body gestures are utilised to communicate with computers in more intricate ways [4].
communicate vital messages, either in place of speech or in Technology needs to be able to recognize, classify, and
conjunction with it. The problem of gesture division is interpret many simple hand gestures and use them in a wide
presented as a first step toward visual hand gesture range of situations [3].
recognition, i.e., the detection, analysis, and recognition of
gestures through real-time hand sequences. Visual hand The main purpose of this research is to recognize nine
recognition and motion tracking are quite challenging to solve (9) hand gestures which consist of “Call", "Fist", "Live
due to their inconvenient nature. This research seeks to Long", "Okay", "Peace", "Rock", "Stop", "Thumbs Up" and
address the issue by determining which classification "Thumbs Down" hand gestures. The recognition system was
technique, Convolutional Neural Network (CNN) or Support developed using the Python programming language. The
Vector Machine (SVM), is superior in recognising hand feature extraction technique namely hand skeletal-based
motions. The hand-skeletal was used as the features to technique was used to represent the hand gestures. Then,
represent the hand gestures. Both classification methods CNN and SVM were used as recognition algorithms. Finally,
utilised the same sample dataset and camera input to achieve a we analyse the recognition techniques based on their
fair comparison. Then, the performance in terms of accuracy accuracy and processing time.
and processing time being analysed. The results indicate that
the CNN excels in recognising hand gestures with an accuracy
of 97.78% compared to the SVM with 96.30%. In terms of II. LITERATURE REVIEW
processing time to train/process the datasets, SVM has the Hand gesture recognition has become one of research
upper hand by taking 5 minutes and 16 seconds. Meanwhile topics that gain attentions of many researchers including
the CNN used 8 minutes and 24 seconds.
works presented in [1] – [10]. There were many approaches
Keywords—Hand Gesture Recognition, Hand Skeletal, CNN, being introduced, but the one that we interested to
SVM, Accuracy, Processing Time investigate is work done by Md Abdur Rahim et al [1]. They
proposed Skeleton Distance Measurement (SDM) as feature
I. INTRODUCTION extraction technique to represent hand gestures. This
approach was very straight forward where prominent points
Hand gesture recognition is critical to the effectiveness of on hand being detected and distance among those points
human-computer interaction (HCI) technology, which is being calculated and became the input of the classifier. CNN
used as a helpful interface in a variety of challenging is one of the classifiers that gave high recognition rate in
scenarios [1]. The system allows for nonverbal hand gesture recognition research. For example, research
communication that is natural, inventive, and modern. It can presented by Md Rashedul Islam et al [9]. Their work able
also be used in a variety of settings [3]. As a result, fostering to obtain classification accuracy at 98.41%. Besides CNN,
natural HCI is critical to bridging the gap between humans SVM also able to give high accuracy in recognising hand
and computers [2]. Further, HCI technology can be used to gestures. Research done by Chin-Pan Huang et al [10] used
create a smart environment [4]. (SVM) algorithm as their recognition algorithm and gain
Hand gestures have become a popular way of recognition rate of 97.51%.
communicating simple thoughts, which are then translated Based on our reviews, both CNN and SVM able to give
into events by a gesture detection system [3]. However, very good accuracy which is more than 90% in hand gesture
analysing the complete number of features takes a long time recognition studies. Therefore, we chose to used both
[1]. Every level of development aimed to improve the
techniques and investigate their performance in terms of During training and validation process, total 2700
accuracy and processing time. images were used. Each gesture has 300 photo samples.
Meanwhile during testing, the images taken in real time
III. METHODOLOGY where 5 different people doing all 9 gestures were being
used.
A. Data Acquisition
1) CNN
Figure 2: Dataset of Hand Gesture Figure 5 above shows the flow of the CNN algorithm
used in this project. The data extracted via the skeletal-based
124
Authorized licensed use limited to: Mukesh Patel School of Technology & Engineering. Downloaded on September 21,2023 at 17:15:42 UTC from IEEE Xplore. Restrictions apply.
2023 IEEE 14th Control and System Graduate Research Colloquium (ICSGRC), 5 Aug 2023, Shah Alam, Selangor, Malaysia
hand detection in the CSV file was then be used to train the gestures. Nonetheless, the recognition rate is still good,
CNN model algorithm. This project utilized the CNN which is more than 80%. Next, the "Call" and "Fist gestures
training algorithm with the import of the TensorFlow has the same accuracy rate of the 94%. The overall
library, which contains the preinstalled programme of the accuracy of the CNN algorithm is 97.78%.
CNN architecture that can be found in the Python
programming language. The data from the CSV file were Table 1: Confusion Matrix for CNN technique
used to train the CNN model.
2) SVM
125
Authorized licensed use limited to: Mukesh Patel School of Technology & Engineering. Downloaded on September 21,2023 at 17:15:42 UTC from IEEE Xplore. Restrictions apply.
2023 IEEE 14th Control and System Graduate Research Colloquium (ICSGRC), 5 Aug 2023, Shah Alam, Selangor, Malaysia
126
Authorized licensed use limited to: Mukesh Patel School of Technology & Engineering. Downloaded on September 21,2023 at 17:15:42 UTC from IEEE Xplore. Restrictions apply.