You are on page 1of 63

1

x l ting ni
Trnh Vn Loan
FIT-HUT
2
X l ting ni
Ti liu tham kho
1. La parole et son traitement automatique
Calliope, Masson, 1989
2. Traitement de la parole
Rene Boite et Murat Kunt, Presse
Polytechniques Romandes, 1987
3. Fundamentals of Speech Signal
Processing
Saito S., Nakata K. , Academic Press, 1985

3
X l ting ni
Ni dung
1. Mt s khi nim c bn
2. X l tin hiu ting ni
3. M ho ting ni
4. Tng hp ting ni
5. Nhn dng ting ni

4
X l ting ni ?
X l thng tin cha trong tn
hiu ting ni nhm truyn, lu tr
tn hiu ny hoc tng hp, nhn
dng ting ni.
1. Mt s khi nim c bn
Cc nghin cu c tin hnh
x l ting ni yu cu nhng hiu
bit trn nhiu lnh vc ngy cng
a dng: t ng m v ngn ng hc
cho n x l tn hiu...
5
Mc ch
M ho mt cch c hiu qu tn
hiu ting ni truyn v lu tr
ting ni.
1. Mt s khi nim c bn
Tng hp v nhn dng ting ni tin
ti giao tip ngi-my bng ting ni.
Tt c cc ng dng ca x l ting
ni u cn phi da trn cc kt qu
ca phn tch ting ni
6
1. Mt s khi nim c bn
Phn bit ting ni v m
thanh:



cc c tnh m hc c ngun
gc t c ch to ting ni.
C 2 loi ngun m
tun hon (dy thanh rung)
tp m (dy thanh khng rung)
7
B my pht m


1. Mt s khi nim c bn
Hng
Thc qun
Np thanh qun
Kh qun
Dy thanh
Vm ming cng
Vm ming mm
Khoang mi
Li
8
S khi b my pht m

1. Mt s khi nim c bn
9
Thanh mn (1)
1. Mt s khi nim c bn
Thanh mn
Dy thanh
10
Thanh mn (2)
cc v tr ht, th, pht m, ni th
tho
1. Mt s khi nim c bn
11
Dy thanh trong mt chu k dao
ng

1. Mt s khi nim c bn
12
Biu din tn hiu ting ni

Dng sng theo thi gian
1. Mt s khi nim c bn
13
Biu din tn hiu ting ni

Ph tn hiu ting ni
1. Mt s khi nim c bn
14
Biu din tn hiu ting ni

Spectrogram (Sonagram)
1. Mt s khi nim c bn
15
To m hu thanh . Formant v
antiformant

1. Mt s khi nim c bn
16
To m v thanh

1. Mt s khi nim c bn
17
Mt s c im ng m ting
Vit

n m tit
C thanh iu (6), bin i
thanh iu km theo bin i
ngha
Khng bin i hnh thi
1. Mt s khi nim c bn
18
M hnh to ting ni (Fant-1960)

) z 1 )( z 1 (
A
) z ( G
1 1
| + o +
=
[
=

+ +
=
K
1 k
2
k 2
1
k 1
) z b z b 1 (
B
) z ( V
) z 1 ( C ) z ( R
1
=
P
1. Mt s khi nim c bn
Lc thng
thp G(z)
Tuyn m
V(z)
Ti bc x
R(z)
19
A(z): Hm truyn t ca b lc o
M hnh ton im cc (AR)
) z ( A
) z ( R ) z ( V ) z ( G ) z ( T
o
= =
1. Mt s khi nim c bn

+
=

+ =
1 K 2
1 i
i
i
z a 1 ) z ( A
) n ( u ) i n ( x a ) n ( x
p
1 i
i
o = +

=
) z ( A
) z ( T
o
=
20
Nu tnh n khoang mi
xut hin cc im khng(ARMA)

) z ( A
) z ( C
) z ( A ) z ( A
) z ( T
2
2
1
1
o =
o
+
o
=
1 z c ) z ( C
-i
q
0 i
i
= =

=
0
c
1. Mt s khi nim c bn
) i n ( u c ) i n ( x a ) n ( x
q
0 i
i
p
1 i
i
o = +

= =
21
Bi tp
1. Mt s khi nim c bn
2
k k k
k
2
1 2
k k k
1 2 z cos z
H (z)
1 2 z cos z z z

u +
=
u +
Bi 1.
Hm truyn t ca mt b lc s tn s formant F
k
c cho bi:



trong , , T: chu k ly mu, : di thng.

1. V cc im cc ca H
k
(z) trong mt phng Z
2. Vit phng trnh sai phn m t quan h gia tn hiu ra y
k
(n) v tn hiu vo x
k
(n)
3. V s khi ca b lc s ny vi 3 b nhn.
4. Bng cch sp xp li cc s hng ca phng trnh sai phn,
v s khi ca b lc s ch c 2 b nhn

k
T
k
z e
o
=
k k
2 F T u = t
k
2o
22
Bi tp
1. Mt s khi nim c bn
Bi 1.
Hm truyn t ca mt b lc s tn s formant F
k
c cho bi:



trong , , T: chu k ly mu, : di thng.

2
k k k
k
2
1 2
k k k
1 2 z cos z
H (z)
1 2 z cos z z z

u +
=
u +
k
T
k
z e
o
=
k k
2 F T u = t
k
2o
k
' 2 2 2 2 2
k k k k k
1,2 k k k k
j
k
| z | cos | z | | z | sin
p | z | cos j | z | sin
| z | e
u
A = u = u
= u u
=
1. H(z) c 2 im cc l nghim ca mu s:
23
Bi tp
1. Mt s khi nim c bn
1
0,7
f
K
Di thng
F
k
B
k
24
Bi tp
1. Mt s khi nim c bn
u
k

u
k

Re(z)
Im(z)
|z
k
|
|z
k
|
p
1

p
2

25
Bi tp
1. Mt s khi nim c bn
a) Xc nh bin i z ca g(n): Tra bng
26

Phn tch ph tn hiu ting ni
2. X l tn hiu ting ni
2. X l tn hiu ting ni
B lc
hiu
chnh
Ca s FFT
Log
10
|.|
H(z)=1-az
-1

a = 0,95..0,98
Hamming
27
x(n)= x(n).w(n)
X(f) = X(f) * W(f)
28
s(n) = h(n) * e(n) S(e) = H(e).E(e)
2. X l tn hiu ting ni
X l ng hnh (homomorphisme)
log[S(e)]= log[H(e)]+ log[E(e)]
FFT
-1
{log[S(e)]}=
FFT
-1
{log[H(e)]}+ FFT
-1
{log[E(e)]}
FFT
-1
{log[S(e)]}: cepstrum:
FFT
-1
{log[H(e)]}: thng tin v`h(n):
FFT
-1
{log[E(e)]}: thng tin v`ngu`n:
s(n)
h(n)
e(n)
29
2. X l tn hiu ting ni
S khi x l ng
hnh
B lc
hiu
chnh
Ca
s
FFT
Log
10
|.| FFT
-1

c(n)
FFT
?
30
T
0
h(n)
= +

s(n) h(n) e(n)


31
2. X l tn hiu ting ni
Tin on tuyn tnh (Linear Prediction
Coding)
Tin
on

=
=
p
1 i
i
) i n ( x a ) n ( x
Sai s tin on
) n ( x ) n ( x ) n ( e =
Sai s bnh phng ton
phn

=
n
2
) n ( e E
Ti thiu ho sai
s
p ,..., 2 , 1 i , 0
a
E
i
= =
c
c

M hnh ton im
cc
) n ( u ) i n ( x a ) n ( x
p
1 i
i
o = +

=
32
2. X l tn hiu ting ni
Xc nh tn s c bn
Fo
Ging nam: 80 .. 250 Hz. Ging n:
150..500 Hz
Xc
nh
Fo
nh gi
kt qu
Tin
x l
Mt s phng php xc nh Fo
Da vo hm t tng quan
Da vo hm vi sai bin trung bnh
Dng b lc o v hm t tng quan
X l ng hnh
33
2. X l tn hiu ting ni
Da vo hm t tng
quan
1
1
( ) ( ) ( ) 0,1,...,
N k
n
r k x n x n k k K

=
= + =

Fs = 10 kHz, N = 300, K = 150


Tm cc i trong khong (0, K)
34
2. X l tn hiu ting ni
Da vo hm t tng
quan
Hn ch, loi b |x|<C
L

35
2. X l tn hiu ting ni
Da vo hm vi sai bin
trung bnh
(Average Magnitude Difference Function) (1)
( ) 0, 0,1,... D iP i = =
1/ 2
1 1
2
0 0
1 1
( ) ( )
N N
n n
u n u n
N N

= =
(
s
(


2
1/ 2
1
( ) [ ( ) ( )]
2[ (0) ( )] 0,1,...,
1
1/2
N-1
n=0


D k x n x n k
N
r r k k K


= +
`
)
= =
<

v i
K k k n x n x k D
k N
n
,..., 1 , 0 ) ( ) ( ) (
1
1
= =


=
36
2. X l tn hiu ting ni
Da vo hm vi sai bin
trung bnh
(Average Magnitude Difference Function) (2)
1000 1050 1100 1150 1200 1250
-0.25
-0.2
-0.15
-0.1
-0.05
0
0.05
0.1
0.15
0.2
0.25
x
(
n
)
n
50 100 150 200
0
10
20
30
40
50
D
(
k
)
k
37
2. X l tn hiu ting ni
Dng b lc o
(Simplified Inverse Filter Tracking)
Thng thp Thng thp 1-z
-1
W(n) LPC(p=4)
A(z) Hm t tng quan
Tm cc i Ni suy nh gi kt qu
Fo
HT/VT
10kHz
4700Hz
900Hz
38
39
2. X l tn hiu ting ni
X l ng hnh
40
2. X l tn hiu ting ni
X l ng hnh

Xc nh formant (1)
B lc
hiu
chnh
Ca
s
FFT
Log
10
|.|
FFT
-1
FFT
Wc(n)
41
2. X l tn hiu ting ni
X l ng hnh

Xc nh formant (1)
42
2. X l tn hiu ting ni
Tin on tuyn tnh (LPC)
F
k
,B
k

s(n)
B lc
hiu
chnh
Ca
s
Tnh
h.s.
a
i

Tnh1/
|A(e
je
)|
bng FFT
Tm
cc i
Tnh
nghim
ca A(z)
Quyt
nh
Xc nh formant (2)
43
3. M ho ting ni
3. M ho ting ni
Dy thao tc m ho v gii m
M ho F1 AD
Gii m DA
F2
Nhiu, suy gim,
sai s
Nhiu, suy gim,
sai s
44
3. M ho ting ni
Mt s tnh cht thng k ca
tn hiu ting ni
Mt xc sut
N

: s lng mu x(n)
c bin trong khong
[-A/2, +A/2]
ne[-N,...,N]
x egodic v dng

0
( ) lim[ /(2 1)]
x
N
p N N


A
= +
45
3. M ho ting ni
Gi tr trung bnh v phng sai
Gi tr trung bnh ca tn hiu dng


vi tn hiu ting ni, gi thit
x
= 0
Phng sai




1
( ) d lim ( )
2 1
N
x x
N
n N
p x n
N

= =
+

}
2 2 2
1
( ) d lim ( )
2 1
N
x x
N
n N
p x n
N
o

= =
+

}
46
3. M ho ting ni
Lng t tc thi (khng nh)
Lut lng t y = Q(x) c nh ngha:
(L+1) mc tn hiu x(0), x(1), ..., x(L)
L mc lng t ho
Mi mc lng t ho biu din bng t b bit
L = 2
b
.
Sai s lng t (tp m lng t) e = Q(x) - x
Bc lng t : hiu 2 mc tn hiu k nhau
o(i) = x(i)-x(i-1)
Thng lng I = bFs (bit/s). Fs : tn s ly mu
47
3. M ho ting ni
Thng lng (1)
Tn hiu lng t 8 bit (256 mc), Fs = 8
kHz Thng lng = 64 kbit/s
Tn hiu lng t 16 bit (65536 mc), Fs =
16 kHz Thng lng = 256 kbit/s ,
1 gi ting ni ~ 100 Mbyte
Cn phi m ho tn hiu ting ni (MPEG,
GSM, G723, ...) truyn ting ni trn mng
hoc lu tr


48
3. M ho ting ni
Thng lng (2)
Tn s ly
mu (kHz)
S bit cho
1 mu
Thng
lung kbit/s
Dung lng
/ pht
(kbyte)
Lnh vc
48 16 768 11520 Ghi m chuyn
nghip
44,1 16 705,6 10584 CD Audio
32 16 512 7680 Radio FM
22 8 264 3960 Radio AM
8 8 64 960 in thoi
49
3. M ho ting ni
Lng t u
Tng qut, bc lng t l hm ca bin tn
hiu x (lng t khng u) n gin nht l
lng t u.
Mc lng t c chn gia 2 mc tn hiu
y(i) = (1/2)[x(i-1)+x(i)]
Lut lng t u v i xng c trng bi:
- cc mc bo ho x
s
- mc lng t L hoc (L+1) = 2
b
.
Bc lng t o = 2x
s
/L

50
3. M ho ting ni
Lng t u
L = 9
51
3. M ho ting ni
Lng t u
L = ?
0 2 4 6 8 10 12 14
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
52
3. M ho ting ni
Lng t u
L = 16
0 2 4 6 8 10 12 14
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
53
3. M ho ting ni
Lng t u
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-0.2
0
0.2
Quantification Error
54
3. M ho ting ni
Cc tnh cht lng t u
Mt xc sut sai s lng t


phn b u gia -o/2 v + -o/2


Trung bnh tp m lng t = 0
Phng sai

( ) ( ), ( 1) / 2
e x
i
p p i L o
=
= + =

/ 2
2 2 2
/ 2
/ d /12
e
o
o
o o o

= =
}
( ) ( ), ( 1) / 2
e x
i
p p i L o
=
= + =

( ) 1/ , / 2
0, / 2
e
p o o
o
= s
= >
55
3. M ho ting ni
Cc tnh cht lng t u
T s tn hiu trn nhiu
2
2
10lg (dB) 6, 02 4, 77 20lg
x s
x e
x
SN b
o
o o
| | | |
= = +
| |
\ . \ .
max
4 (dB) 6 7,3 Nu
s
x SN b o = =
Vi b > 6, tng 6 dB mi khi tng 1 bit lng t
c cht lng thch hp cn c b > 11
56
3. M ho ting ni
T s tn hiu trn nhiu
s
n
dB 10
W N
SN
N W
SN 10log SN
ng l ng tn hiu
ng l ng nhiu
= =
=
dB 10
Bi
SN 20log
Bi
n tn hiu
n nhiu
=
Hoc
57
3. M ho ting ni
T s tn hiu trn nhiu
Nng lng SN (dB)
Tn hiu = Nhiu 0
Tn hiu = 2 Nhiu 2
Tn hiu = 10 Nhiu 10
Tn hiu = 100 Nhiu 20
Tn hiu = 1000 Nhiu 30
Tn hiu = 10
N
Nhiu N x 10
58
3. M ho ting ni
Lng t logarit
Sau khi ly logarit bin tn hiu s m ho
tuyn tnh
log[] Q[] M ho
signe[]
x(n) c(n)
y(n)

y'(n)
exp[] Gii m
signe[x(n)]
c(n)

x'(n)

x'(n)
59
3. M ho ting ni
Lng t logarit
Hai gii php dng cho in thoi
Lut (dng M)
log(1 )
log(1 )
x
y

+
=
+
Lut A (dng chu
u)
1 log
1 log
A x
y
A
+
=
+
255 87,56 A = =
8 bit logarit ~ 12 bit lng t u
60
3. M ho ting ni
Lng t thch nghi
Bc lng t tu thuc vo bin tn
hiu
Thch nghi trc
y(n)
Q[] M ho
x(n)
c(n)
Thch
nghi
k.i
G(n)
A G(n)

y(n)

y'(n)
Gii m
c(n)
:

y'(n)
x'(n) =
G'(n)
G(n)
61
3. M ho ting ni
Lng t thch nghi
Thch nghi sau
Q[] M ho
x(n)
c(n)
y(n)
Thch
nghi
k.i
G(n)
A

y(n)

y'(n)
Gii m
c(n)
:

y'(n)
x'(n) =
G'(n)
G(n)
Thch
nghi
k.i
62
3. M ho ting ni
M ho vi sai (DPCM)
Lng t ho
63
3. M ho ting ni
Mt s chun m ho
m thanh/ting ni
G.721 : ADPCM, 32 kbps, 4bits, 8kHz
G.722 : ~ADPCM, 48 n 64 kbps,
G.723 : ~ADPCM, 24 kbps, 3 bits, 8kHz
G.728 : 16 Kbps
GSM : in thoi di ng, 13 kbps
Linear Predictive Encoding (Xerox), 5 kbps
Code Excited Linear Prediction (CELP)
Digital Video Interactive : ~ADPCM, 4 n 8 bits
VoIP: G723.1 (6.4kbits/s), G728, G729 (8kbits/s)

You might also like