Professional Documents
Culture Documents
LPC analyses the speech signal by estimating the formants, applied as the voice signal is a time varying random signal
removing their effects from the speech signal, and estimating which means that the pitch of the signal will not remain
the intensity and frequency of the remaining buzz. The constant throughout the length of the song or speech.
process of removing the formants is called inverse filtering, Therefore the window function is applied to segment the
and the remaining signal after the subtraction of the filtered entire input audio into a number of smaller chunks off sample
modelled signal is called the residue. This property of the LPC lengths ranging from 30-50.
algorithm makes it a valuable tool in pitch detection process Fig. 1 explains the pitch detection process by using the LPC
of the voiced vocal signal. Speech signals vary with time and method for vocal signals.
therefore this process is done on short chunks of the speech
signal, which are called frames; generally 30 to 50 frames per
second give intelligible speech with good compression. The
LPC algorithm approximates the vocal signal as a linear
combination of the past samples.
Thus a voiced vocal signal can be represented in
accordance to the LPC algorithm by equation 1 as
............1
where,
X[n]: Present speech sample FIGURE 1: BLOCK DIAGRAM OF PITCH DETECTOR USING LPC
X[n-k]: previous speech samples The window function that are generally used in signal
p: order of the model processing include the Rectangular window, Hann window,
ak: prediction coefficient Hamming window, Tukey window, Cosine window, Lanczos
e[n]: prediction error window, Triangular windows, Gaussian windows, Bartlett–
For the detection of pitch of vocal signals, the prediction Hann window, Blackman windows, Kaiser windows.
model that is used is the Autoregressive model. The Of these windows, the rectangular window is generally
autoregressive (AR) model is a type of random process which used for the purpose of pitch estimation as it provides a
is used to model and predict an output of a system based on constant value inside the interval and a zero outside the
the previous outputs. interval. The rectangular window is the best choice for
The auto regressive model can be expressed as in equation detecting a sinusoid at low signal-to-noise ratios.
The choosing of the window function depends on a number
............2 of factors of which the attenuation in the stop band plays an
important role.
where,
The LPC algorithm is preferred over the other pitch
detection algorithms for the detection of pitch of vocal signals
are the parameters of the model because of the reason that the voice signal is made up of
c is a constant (often omitted for simplicity) formant frequencies which are the harmonics of the original
is white noise.
pitch of the signal. The LPC algorithm eliminates these
P is the order of the model
formant frequencies in vocal signal and detects the main
Thus a first order AR model can be represented in the form envelope of the signal.
of a mathematical equation as , The envelope that is detected is then passed through a peak
............3 detector where the peaks are detected based on a preset
Where, threshold value. The detected peaks are then counted by using
is a white noise process with zero mean and variance a counter.
To increase the accuracy of the prediction of the LPC The pitch of the signal is calculated by finding the number
algorithm the order of the model is usually high. To determine of ratio of the peaks to the total time period of the window.
the parameters of the model, the Yule-Walker equations are ............4
used and there is a direct correspondence between these Where,
parameters and the covariance function of the process, and - Number of peaks that is detected
this correspondence can be inverted to determine the
parameters from the autocorrelation function. - Time period of the window.
Fig. 2 shows the varied pitch detected by using different
IV. PITCH DETECTION USING LPC window functions. It can be seen from Fig 2 that the
Before the application of the LPC algorithm on the voice rectangular window offers exact segmentation of the audio
signal whose pitch is to be extracted, a window function is signal with minimum overlap and minimum computations
which makes the model an optimal one that can be realised in VI. CONCLUSION AND FUTURE SCOPE
a real time scenario. In this paper we have proposed and implemented a novel
pitch detection algorithm for detecting the pitch of singing
voice. The LPC technique that is discussed here uses spectral
analysis for the detection of pitch after finding the main
envelope of the vocal signal for a given window duration. The
main advantage of using the LPC algorithm for pitch detection
is that it is not affected by the problem of harmonic
frequencies which are the main formants in vocal signals as in
AMDF and ACF techniques, and it eliminates the unwanted
formants and thus helps in accurate pitch detection of vocal
signals which is not possible in other pitch detection
techniques. The technique was further implemented in
LABVIEW and the experimental results of the same show that
the detected pitch of the vocal signals lie in a Frequency band
and can also be used for automatic annotation of music by
comparing each musical note to a given band of frequencies.