Professional Documents
Culture Documents
Modeling of Speech Signals Using Fractional Calculus: Khaled Assaleh Wajdi M. Ahmad
Modeling of Speech Signals Using Fractional Calculus: Khaled Assaleh Wajdi M. Ahmad
CALCULUS
-1 xˆ (n) = ∑ µ k D α k x (8)
a=R r (3) k =1
It should be emphasized that a negative value of α
where R is a P×P Toeplitz autocorrelation matrix whose
corresponds to a fractional integral of order α, denoted as
first column is given by:
Iαx. Thus, for noise immunity and numerical stability,
[R(0) R (1) L R ( P − 1)] , where
T
equation (8) can be recast as a fractional integral equation
N −1 given by:
R ( k ) = ∑ x ( n) x ( n − k ) (4) Q Q
n =0 xˆ (n) = ∑ γ k I β k x = ∑ γ kψ k (n) (9)
and r = [R(1) R(2) L R( P)]
T k =1 k =1
1 -0.2
0.5
-0.4
0
-0.6
-0.5
-0.8
-0.2 Finally, in Fig. 5 we show the fitting error plots for the
three cases discussed above for the whole 512-sample
-0.4
frame; the 12th order LPC (top), the 2nd order LPC
-0.6
(middle), and the two-parameter FLP (bottom). It is quite
visible that the proposed modeling technique, with only
-0.8 two parameters, is far more superior to the LPC modeling
130 140 150 160 170 180 190 200 210
technique. The modeling error using a 2nd order LPC is
Sample # much larger in magnitude and frequency content than its
Figure 2. Fitting a speech signal using Two-parameter two-parameter FLP model. The figure also shows that
FLP model; showing part of the frame, its fit, and fitting increasing the LPC order to 12 fails to match the
error; SNR = 26.5 dB performance of the two-parameter FLP.
0.6 original
filtted REFERENCES
0.4 error
0.2
[1] I. Podlubny, “Fractional Differential Equations”,
Academic Press, 1999.
0
[2] K. Oldham and J. Spanier, “Fractional Calculus”,
-0.2 Academic press, New York, 1974
-0.4
[3] W. Ahmad and J. C. Sprott, “Chaos in fractional-
-0.6 order autonomous nonlinear systems”, Chaos,
Solitons, and Fractals, 16, 339-351, 2003
-0.8
130 140 150 160 170 180 190 200 210 [4] W. Ahmad and A. Harb, “On nonlinear control
Sample # design for autonomous chaotic systems of integer
Figure 4. Fitting a speech signal using 2nd order LPC and fractional orders”, Chaos, Solitons, and
model; showing part of the frame, its fit, and fitting error; Fractals, vol. 18, 693-701, 2003
SNR = 16.1 dB.
[5] R. El-Khazali, W. Ahmad, and Y. Al-Assaf,
0.2
“Sliding Mode Control of Fractional Chaotic
0.1 Systems”, Proc. IFAC workshop on Fractional
0
-0.1
Differentiation and its Application, France, 495-
-0.2 500, July 2004
50 100 150 200 250 300 350 400 450
0.2
0.1
[6] W. Ahmad, R. El-Khazali, and Y. Al-Assaf,
0 “Stabilization of Fractional Chaotic Systems Using
-0.1 State-Feedback Control”, Chaos, Solitons, and
-0.2
Fractals, vol 22/1, pp 141-150, 2004.
50 100 150 200 250 300 350 400 450
0.2
0.1 [7] J. Makhoul, “Linear prediction: a tutorial review,”
0
Proceedings of the IEEE, Vol. 63, No. 4, April
-0.1
-0.2 1975.
0 50 100 150 200 250 300 350 400 450 500
[8] J. P. Campbell, T. E. Tremain, and V. C. Welch,
th
Figure 5. Fitting errors; (top) 12 order LPC; (middle) 2 nd “The Federal Standard 1016 4800 bps CELP Voice
order LPC; (bottom) two-parameter [0.15 and 0.3] FLP. Coder,” Digital Signal Processing, Vol. 1, no. 3
(1991): 145 – 155.
4. CONCLUSION [9] K. T. Assaleh, and R. J. Mammone, “New LP-
Derived Features for Speaker Identification,” IEEE
A novel approach for speech signal modeling has been Transactions. on Speech and Audio Processing,
presented. The new approach utilizes fractional integrals Vol. 2, no. 4, pp 630-638, 1994
as basis functions to model the speech signal. It has been
demonstrated that fractional order linear prediction
models (FLP) with a smaller number of parameters give
better performance results than those obtained from the
commonly used LPC-based models in terms of the energy
of the fitting error and its smoothness. The smoothness of
the fitting error indicates that in the case of FLP less
information is contained in the fitting error. Furthermore,
we have showed that a two-parameter FLP model
outperforms 12th order LPC models. Hence, the proposed
FLP modeling technique shows considerable promise as a
viable and accurate technique for speech signal modeling.
Further investigations are underway for unvoiced signals.
The obtained preliminary results are encouraging to
further investigate the proposed modeling technique for
other applications like speech recognition and text-to-
speech synthesis.