Professional Documents
Culture Documents
Literature
Flanagan, J. 1972. Speech Analysis, Synthesis and Perception. Second Edition. Springer-Verlag Rabiner, L. and Schafer, R.W. 1978. Digital Processing of Speech Signals. Prentice-Hall.
Modeling Speech
A realistic model would have to model all of the above, plus loss due to soft tissue We start with a simpler model: the uniform lossless tube
Term Denitions
p = p(x, t) is the variation in sound pressure in the tube at position x and time t u = u(x, t) is the variation in volume velocity ow at position x and time t is the density of air in the tube c is the velocity of sound A = A(x, t) is the area function of the tube
Area Function
For an arbitrary tube A(x, t) would have some varying shape
(Rabiner & Schafer, Fig. 3.13a, p. 61) For a uniform tube, A(x, t) = A(x) = A is a constant
(2)
For instance, the rst equation states that the change in pressure with respect to distance along the tube is related to the change in volume velocity with respect to time, times a constant /A.
(4)
10
11
12
13
14
Continuity
Continuity must obtain, so that the pressure at the right edge of the kth tube must be the same as the pressure at the left edge of the (k + 1)st tube. Thus (R&S Eqn. 3.36, p. 84):
pk (lk , t) = pk+1(0, t) uk (lk , t) = uk+1(0, t) Substitution of 8 into 910 yields (R&S Eqn. 3.37a, p. 84): Ak+1 + [uk (t k ) + u(t + k )] = u+ (t) + u (t) k k+1 k+1 Ak where k = lk /c is the time required to travel the kth tube,
ECE 598: Vocal Tract and Lossless Tube Models
(9) (10)
(11)
15
And (R&S Eqn. 3.37b, p. 84): u+(t k ) u(t + k ) = u+ (t) u (t) k k k+1 k+1 (12)
16
Continuity
17
Continuity
It is important to note that it is not that u+(t k ) = u+ (t) k k+1 (13)
Rather it is only the sum of the forward and backward volume velocity that must be equal at the boundary. In particular, some of the forward wave will be reected back at the righthand boundary and some of the backward wave will be reected forward at the lefthand boundary.
18
Continuity Continued
From Equations 1112 (solving 12 for u(t + k ), substituting the result into k 11; and then subtracting 12 from 11) we have that (R&S Eqn. 3.38, p. 84): u+ (t) k+1 2Ak+1 Ak+1 Ak + = u (t k ) + u (t) Ak+1 + Ak k Ak+1 + Ak k+1 Ak+1 Ak + 2Ak uk (t k ) + u (t) Ak+1 + Ak Ak+1 + Ak k+1 (14)
u(t + k ) = k Here,
(15)
Ak+1 Ak (16) Ak+1 + Ak is termed the reection coecient since it determines the amount of u (t) k+1 that is reected at the junction. Note that 1 rk 1. rk =
ECE 598: Vocal Tract and Lossless Tube Models
19
Substituting, we have (R&S Eqn. 3.41, p. 85): u+ (t) = (1 + rk )u+(t k ) + rk u (t) k+1 k k+1 u(t + k ) = rk u+(t k ) + (1 rk )u (t) k k k+1 (17) (18)
20
Signal-Flow Representation
21
Signal-Flow Representation
Thus, for example u(t + k ) is obtained as follows: k
22
Boundary Conditions
We have so far ignored the boundary conditions at the lips and glottis. A good model of radiation at the lips is the spherical bae This turns out to be hard to model, so another approximation, which assumes that the size of the orice is relatively small compared to the curvature of the sphere, is the planar bae.
23
uN (lN , t) = u+ (t N ) u (t + N ) N N = (1 + rL)u+ (t N ) N
ECE 598: Vocal Tract and Lossless Tube Models
(21) (22)
24
25
(Rabiner & Schafer, Fig. 3.28, p. 79) Opening and closing of the glottis is controlled by: Air pressure in lungs Tension/stiness in vocal cords Area of glottal opening under rest conditions It is also aected by the vocal tract (coupling) so the system is non-linear.
ECE 598: Vocal Tract and Lossless Tube Models
26
27
1 + rG uG(t) + rGu(t) 1 2
(24)
rG =
ZG + c A
(25)
28
29
30
31
32
33
y(n) =
k=
(28)
(Where, f g
t f (t)g(t 0
34
X(z) =
n=
x(n)z n
(29)
where z is some complex number. Various properties of z-transforms include that the z-transform of x(n) h(n) is equal to X(z)H(z) If we set z = ej we get (R&S Eqn. 2.9a, p. 15):
X(ej ) =
n=
x(n)ej
(30)
35
Note that H(z), the z-transform of the unit sample response is called the system function.
36
Transfer Function
We seek the transfer function V (z) =
UL (z) UG (z)
Returning to our lossless tube model, suppose we have a model with N equal length sections, and we sample every T = 2 samples, where is the time required to traverse each section (the delay of each section). Then the previous equations for the volume velocity at each section become: 1 = (1 + ) + rk u (t) k+1 2 1 u(t) = rk u+(t 1) + (1 rk )u (t ) k k k1 2 u+ (t) k+1 rk )u+(t k (31) (32)
A property of the z-transform is that for x(n + n0) the z-transform is z n0 X(z) (shift). Also, for ax1(n) + bx2(n) we get aX1(z) + bX2(z) (linearity ).
ECE 598: Transfer Function
37
Transfer Function
Thus the z-transform for the junction equations are (R&S Eqn. 3.64, p. 92):
+ + Uk+1(z) = (1 + rk )z 1/2Uk (z) + rk Uk+1(z) + Uk (z) = rk z 1Uk (z) + (1 rk )z 1/2Uk+1(z)
(33) (34)
which can be solved to (R&S Eqn. 3.65, p. 9293): z 1/2 + rk z 1/2 Uk+1(z) Uk+1(z) = 1 + rk 1 + rk z 1/2 rk z 1/2 + Uk (z) = U (z) + U (z) 1 + rk k+1 1 + rk k+1
+ Uk (z)
(35) (36)
38
(37) (38)
39
Transfer Function
Let (R&S Eqn. 3.673.68, p. 93):
+ Uk Uk rk z 1+rk 1/2 z 1+rk
1/2
Uk = Qk =
(39) (40)
1/2
Then we can express equations 35-36 as the system (R&S Eqn. 3.66, p. 93): Uk = Qk Uk+1 Applying this equation iteratively yields :
ECE 598: Transfer Function
(41)
40
U1 = Q1 Q2 . . . QN UN +1
N
(42) (43)
=
k=1
Qk UN +1
u+(t) = 1
1 + rG uG(t) + rGu(t) 1 2
41
So (R&S Eqn. 3.70, p. 94): UG(z) = Or (R&S Eqn. 3.71, p. 94): 2 2rG , U1(z) 1 + rG 1 + rG 2rG 2 + U1 (z) U1 (z) 1 + rG 1 + rG (47)
UG(z) =
(48)
UN +1 =
UL(z)
(49)
42
Qk
k=1
1 0
(50)
43
Transfer Function
If we factor out the z 1/2, z 1/2 terms for Q as follows (R&S Eqn. 3.74, p. 94):
Qk
= z 1/2 = z 1/2Qk
1 1+rk rk z 1 1+r k
rk 1+r z 1+rk
1
(51) (52)
Qk
k=1
1 0
(53)
44
N k=1 (1
...
+ rk )z N/2 1 rN rN z 1 z 1
1 0
(54)
0.5(1 + rG)
+ rk )z N/2
D(z)
(55)
D(z) = 1
k=1
k z k
(56)
Transfer function of lossless tube has delay corresponding to number of sections, has no zeroes, only poles (resonances).
ECE 598: Transfer Function
45
If we assume rG = 1 (innite impedance at the glottis), then we can evaluate the following recursion to solve D(z) (R&S Eqn. 3.89, p. 96):