Professional Documents
Culture Documents
Developing An Isolated Word Recognition System in Matlab
Developing An Isolated Word Recognition System in Matlab
Products Used
MATLAB
Data Acquisition Toolbox
Signal Processing Toolbox
Statistics Toolbox
three steps:
Speech acquisition
Speech analysis
MATLAB Digest
Acquiring Speech
For training, speech is acquired from a microphone and brought into the development
environment for offline analysis. For testing, speech is continuously streamed into
the environment for online processing.
During the training stage, it is necessary
to record repeated utterances of each digit
in the dictionary. For example, we repeat
the word one many times with a pause
between each utterance.
Using the following MATLAB code with
a standard PC sound card, we capture ten
www.mathworks.com
Developing a Speech-Detection
Algorithm
The speech-detection algorithm is developed by processing the prerecorded speech
<DO _ SOMETHING>
end
% Stop acquisition
stop(ai);
% Disconnect/Cleanup
delete(ai);
clear ai;
MATLAB Digest
www.mathworks.com
for i = 1:nframes
% Get current frame for analysis
frame = speech(samp1:samp2);
% Do some analysis
<DO _ SOMETHING>
From the linear predictive filter coefficients, we can obtain several feature vectors
using Signal Processing Toolbox functions,
including reflection coefficients, log area ratio
end
at least 25 milliseconds.
PSD estimate:
order = 12;
nfft = 512;
Fs = 8000;
pyulear(speech,order,nfft,Fs);
challenge then becomes to select an appropriate PDF to represent the MFCC feature vector
distributions.
We modified an MFCC implementation from the Auditory Toolbox, available via the MATLAB Central Link Exchange. www.mathworks.com/auditory-toolbox
MATLAB Digest
www.mathworks.com
Figure 3a. Yule Walker PSD estimate of three different utterances of the word ONE.
Figure 3b. Yule Walker PSD estimate of three different utterances of the word TWO.
MATLAB Digest
www.mathworks.com
Figure 4a. Distribution of the first dimension of MFCC feature vectors for the digit one.
MATLAB Digest
Figure 4b. Overlay of estimated Gaussian components (red) and overall Gaussian mixture
model (green) to the distribution in 4(a).
www.mathworks.com
Resources
visit
www.mathworks.com
Technical Support
www.mathworks.com/support
Online User Community
www.mathworks.com/matlabcentral
Demos
www.mathworks.com/demos
Training Services
www.mathworks.com/training
www.mathworks.com/acoustic_demo
www.mathworks.com/cochlear_implant
testdata,
posterior(gmmmodel,testdata);
Third-Party Products
and Services
www.mathworks.com/connections
Worldwide CONTACTS
www.mathworks.com/contact
For Download
Auditory Toolbox and the MFCC
e-mail
info@mathworks.com
function
www.mathworks.com/auditory-toolbox
2009 The MathWorks, Inc. MATLAB and Simulink
are registered trademarks of The MathWorks, Inc. See
www.mathworks.com/trademarks for a list of additional
trademarks. Other product or brand names may be trademarks or registered trademarks of their respective holders.
91805v00 01/10
MATLAB Digest
www.mathworks.com