Professional Documents
Culture Documents
Analysis
Arnab Pramanik
1
, Rajorshee Raha
2
1,2
Heritage Institute of Technology, Kolkata
1
arnabpramanik.ece@gmail.com
2
rajorshee87@gmail.com
Abstract The growth in wireless communication
and mobile devices has supported the development of
Speech recognition systems. So for any speech
recognition system feature extraction and patter
matching are two very significant terms. In this
paper we have developed a simple algorithm for
matching the patterns to recognize speech. We used
Mel frequency cepstral coefficients (MFCCs) as the
feature of the recorded speech. This algorithm is
implemented simply by using the principle of
correlation. All the simulation experiments were
carried out using MATLAB where the method
produced relatively good results. This paper gives a
details introduction of recorded speech processing,
design considerations and evaluation results.
Keywords Automated Speech Recognition (ASR), Mel-frequency
cepstral coefficients (MFCCs), Discrete Fourier 1ransform (DF1),
Fast Fourier 1ransforms (FF1, Mel-frequency cepstrum (MFC).
I. INTRODUCTION
The basic principle of Automatic Speech Recognition
System (ASR) is to recognize the spoken words irrespective
of the speakers and to convert them to text. The growth in
wireless communication and mobile devices has given us a
large access to a pool of different information resources and
service. In this case ASR is a key component. Present mobile
devices are having limited memory and processing capacities
which are adding several challenges to ASR. As a result ASR
systems executing on mobile devices only supports low
complexity recognition tasks such as simple name dialing.
In this paper we have shown a simple and less complex
algorithm for recognizing the spoken words. So many
researches have been made to recognize patterns for ASR. But
is this paper we use Mel frequency cepstral coefficients as the
feature of the speech sample. The proposed algorithm uses the
principle of Correlation to recognize the spoken word
perfectly.
In communication and signal processing, cross-
correlation is a measure of similarity of two waveforms as a
function of a time-lag applied to one of them. This is also
known as a sliding dot product or sliding inner-product. It is
commonly used for searching a long-signal for a shorter,
known feature. It also has applications in pattern recognition,
single particle analysis, electron tomographic averaging,
cryptanalysis, and neurophysiology.
For continuous functions, f' and g, the cross-
correlation is defined as