You are on page 1of 63

1

x l ting ni
Trnh Vn Loan
FIT-HUT
2
X l ting ni
Ti liu tham kho
1. La parole et son traitement automatique
Calliope, Masson, 1989
2. Traitement de la parole
Rene Boite et Murat Kunt, Presse Polytechniques
Romandes, 1987
3. Fundamentals of Speech Signal Processing
Saito S., Nakata K. , Academic Press, 1985
X l ting ni
Ni dung
1. Mt s khi nim c bn
2. X l tin hiu ting ni
3. M ho ting ni
4. Tng hp ting ni
5. Nhn dng ting ni
3
4
X l ting ni ?
X l thng tin cha trong tn hiu
ting ni nhm truyn, l- u tr tn hiu
ny hoc tng hp, nhn dng ting ni.
1. Mt s khi nim c bn
Cc nghin cu - c tin hnh x
l ting ni yu cu nhng hiu bit trn
nhiu lnh vc ngy cng a dng: t
ng mv ngn ng hc cho n x l
tn hiu...
5
Mc ch
M ho mt cch c hiu qu tn hiu
ting ni truyn v l- u tr ting ni.
1. Mt s khi nim c bn
Tng hp v nhn dng ting ni tin
ti giao tip ng- i-my bng ting ni.
Tt c cc ng dng ca x l ting
ni u cn phi da trn cc kt qu
ca phn tch ting ni
6
1. Mt s khi nim c bn
Phn bit ting ni v m thanh:
Ting ni - c phn bit vi cc m
thanh khc bi cc c tnh m hc c
ngun gc t c ch to ting ni.
C 2 loi ngun m
tun hon (dy thanh rung)
tp m (dy thanh khng rung)
7
B my pht m
1. Mt s khi nim c bn
Hng
Thc qun
Np thanh qun
Kh qun
Dy thanh
Vm ming cng
Vm ming mm
Khoang mi
Li
8
S khi b my pht m
1. Mt s khi nim c bn
9
Thanh mn (1)
1. Mt s khi nim c bn
Thanh mn
Dy thanh
10
Thanh mn (2)
cc v tr ht, th, pht m, ni th tho
1. Mt s khi nim c bn
11
Dy thanh trong mt chu k dao ng
1. Mt s khi nim c bn
12
Biu din tn hiu ting ni
Dng sng theo thi gian
1. Mt s khi nim c bn
13
Biu din tn hiu ting ni
Ph tn hiu ting ni
1. Mt s khi nim c bn
14
Biu din tn hiu ting ni
Spectrogram (Sonagram)
1. Mt s khi nim c bn
15
To m hu thanh . Formant v
antiformant
1. Mt s khi nim c bn
16
To m v thanh
1. Mt s khi nim c bn
17
Mt s c im ng m ting Vit
n m tit
C thanh iu (6), bin i thanh
iu kmtheo bin i ngha
Khng bin i hnh thi
1. Mt s khi nim c bn
18
M hnh to ting ni (Fant-1960)
) z 1 )( z 1 (
A
) z ( G
1 1
| + o +
=
[
=

+ +
=
K
1 k
2
k 2
1
k 1
) z b z b 1 (
B
) z ( V
) z 1 ( C ) z ( R
1
=
P
1. Mt s khi nim c bn
Lc thng
thp G(z)
Tuyn m
V(z)
Ti bc x
R(z)
19
A(z): Hm truyn t ca b lc o
M hnh ton im cc (AR)
) z ( A
) z ( R ) z ( V ) z ( G ) z ( T
o
= =
1. Mt s khi nim c bn

+
=

+ =
1 K 2
1 i
i
i
z a 1 ) z ( A
) n ( u ) i n ( x a ) n ( x
p
1 i
i
o = +

=
) z ( A
) z ( T
o
=
20
Nu tnh n khoang mi
xut hin cc im khng(ARMA)
) z ( A
) z ( C
) z ( A ) z ( A
) z ( T
2
2
1
1
o =
o
+
o
=
1 z c ) z ( C
-i
q
0 i
i
= =

=
0
c
1. Mt s khi nim c bn
) i n ( u c ) i n ( x a ) n ( x
q
0 i
i
p
1 i
i
o = +

= =
21
Bi tp
1. Mt s khi nim c bn
2
k k k
k
2
1 2
k k k
1 2 z cos z
H (z)
1 2 z cos z z z

u +
=
u +
Bi 1.
Hm truyn t ca mt b lc s tn s formant F
k
c cho bi:
trong , , T: chu k ly mu, : di thng.
1. V cc im cc ca H
k
(z) trong mt phng Z
2. Vit phng trnh sai phn m t quan h gia tn hiu ra y
k
(n) v tn hiu vo x
k
(n)
3. V s khi ca b lc s ny vi 3 b nhn.
4. Bng cch sp xp li cc s hng ca phng trnh sai phn,
v s khi ca b lc s ch c 2 b nhn
k
T
k
z e
o
=
k k
2 F T u = t
k
2o
22
Bi tp
1. Mt s khi nim c bn
Bi 1.
Hm truyn t ca mt b lc s tn s formant F
k
c cho bi:
trong , , T: chu k ly mu, : di thng.
2
k k k
k
2
1 2
k k k
1 2 z cos z
H (z)
1 2 z cos z z z

u +
=
u +
k
T
k
z e
o
=
k k
2 F T u = t
k
2o
k
' 2 2 2 2 2
k k k k k
1,2 k k k k
j
k
| z | cos | z | | z | sin
p | z | cos j | z | sin
| z | e
u
A = u = u
= u u
=
1. H(z) c 2 im cc l nghim ca mu s:
23
Bi tp
1. Mt s khi nim c bn
1
0,7
f
K
Di thng
F
k
B
k
24
Bi tp
1. Mt s khi nim c bn
u
k
u
k
Re(z)
Im(z)
|z
k
|
|z
k
|
p
1
p
2
25
Bi tp
1. Mt s khi nim c bn
a) Xc nh bin i z ca g(n): Tra bng
26
Phn tch ph tn hiu ting ni
2. X l tn hiu ting ni
2. X l tn hiu ting ni
B lc
hiu chnh
Ca s
FFT
Log
10
|.|
H(z)=1-az
-1
a = 0,95..0,98
Hamming
27
x(n)= x(n).w(n)
X(f) = X(f) * W(f)
28
s(n) = h(n) * e(n) S(e) = H(e).E(e)
2. X l tn hiu ting ni
X l ng hnh (homomorphisme)
log[S(e)]= log[H(e)]+ log[E(e)]
FFT
-1
{log[S(e)]}=
FFT
-1
{log[H(e)]}+ FFT
-1
{log[E(e)]}
FFT
-1
{log[S(e)]}: cepstrum:
FFT
-1
{log[H(e)]}: thng tin v`h(n):
FFT
-1
{log[E(e)]}: thng tin v`ngu`n:
s(n)

h(n)

e(n)

29
2. X l tn hiu ting ni
S khi x l ng hnh
B lc
hiu chnh
Ca s
FFT
Log
10
|.| FFT
-1
c(n)
FFT
?
30
T
0
h(n)

= +

s(n) h(n) e(n)


31
2. X l tn hiu ting ni
Tin on tuyn tnh (Linear Prediction Coding)
Tin on

=
=
p
1 i
i
) i n ( x a ) n ( x
Sai s tin on
) n ( x ) n ( x ) n ( e =
Sai s bnh ph- ng ton phn
=
n
2
) n ( e E
Ti thiu ho sai s
p ,..., 2 , 1 i , 0
a
E
i
= =
c
c

M hnh ton im cc
) n ( u ) i n ( x a ) n ( x
p
1 i
i
o = +

=
32
2. X l tn hiu ting ni
Xc nh tn s c bn Fo
Ging nam: 80 .. 250 Hz. Ging n: 150..500 Hz
Xc nh
Fo
nh gi
kt qu
Tin
x l
Mt s ph- ng php xc nh Fo
Da vo hm t t- ng quan
Da vo hm vi sai bin trung bnh
Dng b lc o v hm t t- ng quan
X l ng hnh
33
2. X l tn hiu ting ni
Da vo hm t t- ng
quan
1
1
( ) ( ) ( ) 0,1,...,
N k
n
r k x n x n k k K

=
= + =

Fs = 10 kHz, N = 300, K = 150


Tm cc i trong khong (0, K)
34
2. X l tn hiu ting ni
Da vo hm t t- ng
quan
Hn ch, loi b |x|<C
L
35
2. X l tn hiu ting ni
Da vo hm vi sai bin trung
bnh
(Average Magnitude Difference Function) (1)
( ) 0, 0,1,... D iP i = =
1/ 2
1 1
2
0 0
1 1
( ) ( )
N N
n n
u n u n
N N

= =
(
s
(


2
1/ 2
1
( ) [ ( ) ( )]
2[ (0) ( )] 0,1,...,
1
1/2
N-1
n=0


D k x n x n k
N
r r k k K


= +
`
)
= =
<

vi
K k k n x n x k D
k N
n
,..., 1 , 0 ) ( ) ( ) (
1
1
= =


=
36
2. X l tn hiu ting ni
Da vo hm vi sai bin trung
bnh
(Average Magnitude Difference Function) (2)
1000 1050 1100 1150 1200 1250
-0.25
-0.2
-0.15
-0.1
-0.05
0
0.05
0.1
0.15
0.2
0.25
x
(
n
)
n
50 100 150 200
0
10
20
30
40
50
D
(
k
)
k
37
2. X l tn hiu ting ni
Dng b lc o
(Simplified Inverse Filter Tracking)
Thng thp Thng thp 1-z
-1
W(n) LPC(p=4)
A(z) Hm t t- ng quan
Tm cc i Ni suy nh gi kt qu
Fo
HT/VT
10kHz
4700Hz
900Hz
38
39
2. X l tn hiu ting ni
X l ng hnh
40
2. X l tn hiu ting ni
X l ng hnh
Xc nh formant (1)
B lc
hiu chnh
Ca s FFT
Log
10
|.|
FFT
-1
FFT
Wc(n)
41
2. X l tn hiu ting ni
X l ng hnh
Xc nh formant (1)
42
2. X l tn hiu ting ni
Tin on tuyn tnh (LPC)
F
k
,B
k
s(n)
B lc
hiu chnh
Ca s
Tnh h.s.
a
i
Tnh1/ |A(e
je
)|
bng FFT
Tm
cc i
Tnh nghim
ca A(z)
Quyt nh
Xc nh formant (2)
43
3. M ho ting ni
3. M ho ting ni
Dy thao tc m ho v gii m
M ho F1 AD
Gii m DA
F2
Nhiu, suy gim,
sai s
Nhiu, suy gim,
sai s
44
3. M ho ting ni
Mt s tnh cht thng k ca
tn hiu ting ni
Mt xc sut
N

: s l- ng mu x(n)
c bin trong khong
[-A/2, +A/2]
ne[-N,...,N]
x egodic v dng
0
( ) lim[ /(2 1)]
x
N
p N N


A
= +
45
3. M ho ting ni
Gi tr trung bnh v ph- ng sai
Gi tr trung bnh ca tn hiu dng
vi tn hiu ting ni, gi thit
x
= 0
Ph- ng sai
1
( ) d lim ( )
2 1
N
x x
N
n N
p x n
N

= =
+

}
2 2 2
1
( ) d lim ( )
2 1
N
x x
N
n N
p x n
N
o

= =
+

}
46
3. M ho ting ni
L- ng t tc thi (khng nh)
Lut l- ng t y = Q(x) - c nh ngha:
(L+1) mc tn hiu x(0), x(1), ..., x(L)
L mc l- ng t ho
Mi mc l- ng t ho biu din bng t b bit
L = 2
b
.
Sai s l- ng t (tp m l- ng t) e = Q(x) - x
B- c l- ng t : hiu 2 mc tn hiu k nhau
o(i) = x(i)-x(i-1)
Thng l- ng I = bFs (bit/s). Fs : tn s ly mu
47
3. M ho ting ni
Thng lng (1)
Tn hiu lng t 8 bit (256 mc), Fs =
8 kHz Thng lng = 64 kbit/s
Tn hiu lng t 16 bit (65536 mc),
Fs = 16 kHz Thng lng = 256
kbit/s ,
1 gi ting ni ~ 100 Mbyte
Cn phi m ho tn hiu ting ni
(MPEG, GSM, G723, ...) truyn ting ni
trn mng hoc lu tr
48
3. M ho ting ni
Thng lng (2)
Tn s ly
mu (kHz)
S bit cho
1 mu
Thng
lung kbit/s
Dung lng
/ pht
(kbyte)
Lnh vc
48 16 768 11520 Ghi m chuyn
nghip
44,1 16 705,6 10584 CD Audio
32 16 512 7680 Radio FM
22 8 264 3960 Radio AM
8 8 64 960 in thoi
49
3. M ho ting ni
L- ng t u
Tng qut, b- c l- ng t l hm ca bin tn
hiu x (l- ng t khng u)n gin nht l
l- ng t u.
Mc l- ng t - c chn gia 2 mc tn hiu
y(i) = (1/2)[x(i-1)+x(i)]
Lut l- ng t u v i xng c tr- ng bi:
- cc mc bo ho x
s
- mc l- ng t L hoc (L+1) = 2
b
.
B- c l- ng t o = 2x
s
/L
50
3. M ho ting ni
L- ng t u
L = 9
51
3. M ho ting ni
L- ng t u
L = ?
0 2 4 6 8 10 12 14
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
52
3. M ho ting ni
L- ng t u
L = 16
0 2 4 6 8 10 12 14
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
53
3. M ho ting ni
L- ng t u
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-1
0
1
0 2 4 6 8 10 12
-0.2
0
0.2
Quantification Error
54
3. M ho ting ni
Cc tnh cht l- ng t u
Mt xc sut sai s l- ng t
phn b u gia -o/2 v + -o/2
Trung bnh tp m l- ng t = 0
Ph- ng sai
( ) ( ), ( 1) / 2
e x
i
p p i L o
=
= + =

/ 2
2 2 2
/ 2
/ d /12
e
o
o
o o o

= =
}
( ) ( ), ( 1) / 2
e x
i
p p i L o
=
= + =

( ) 1/ , / 2
0, / 2
e
p o o
o
= s
= >
55
3. M ho ting ni
Cc tnh cht l- ng t u
T s tn hiu trn nhiu
2
2
10lg (dB) 6, 02 4, 77 20lg
x s
x e
x
SN b
o
o o
| | | |
= = +
| |
\ . \ .
max
4 (dB) 6 7,3 Nu
s
x SN b o = =
Vi b > 6, tng 6 dB mi khi tng 1 bit l-ng t
c cht l-ng thch hp cn c b > 11
56
3. M ho ting ni
T s tn hiu trn nhiu
s
n
dB 10
W N
SN
N W
SN 10log SN
ng lng tn hiu
ng lng nhiu
= =
=
dB 10
Bi
SN 20log
Bi
n tn hiu
n nhiu
=
Hoc
57
3. M ho ting ni
T s tn hiu trn nhiu
Nng lng SN (dB)
Tn hiu = Nhiu 0
Tn hiu = 2 Nhiu 2
Tn hiu = 10 Nhiu 10
Tn hiu = 100 Nhiu 20
Tn hiu = 1000 Nhiu 30
Tn hiu = 10
N
Nhiu N x 10
58
3. M ho ting ni
L- ng t logarit
Sau khi ly logarit bin tn hiu s m ho tuyn
tnh
log[] Q[] M ho
signe[]
x(n) c(n)
y(n)

y'(n)
exp[] Gii m
signe[x(n)]
c(n)

x'(n)

x'(n)
59
3. M ho ting ni
L- ng t logarit
Hai gii php dng cho in thoi
Lut (dng M)
log(1 )
log(1 )
x
y

+
=
+
Lut A (dng chu u)
1 log
1 log
A x
y
A
+
=
+
255 87,56 A = =
8 bit logarit ~ 12 bit l-ng t u
60
3. M ho ting ni
L- ng t thch nghi
B- c l- ng t tu thuc vo bin tn hiu
Thch nghi tr- c
y(n)
Q[] M ho
x(n)
c(n)
Thch nghi
k.i G(n)
A G(n)

y(n)

y'(n)
Gii m
c(n)
:

y'(n)
x'(n) =
G'(n)
G(n)
61
3. M ho ting ni
L- ng t thch nghi
Thch nghi sau
Q[] M ho
x(n)
c(n)
y(n)
Thch nghi
k.i
G(n)
A

y(n)

y'(n)
Gii m
c(n)
:

y'(n)
x'(n) =
G'(n)
G(n)
Thch nghi
k.i
62
3. M ho ting ni
M ho vi sai (DPCM)
L- ng t ho
63
3. M ho ting ni
Mt s chun m ho
m thanh/ting ni
G.721 : ADPCM, 32 kbps, 4bits, 8kHz
G.722 : ~ADPCM, 48 n 64 kbps,
G.723 : ~ADPCM, 24 kbps, 3 bits, 8kHz
G.728 : 16 Kbps
GSM : in thoi di ng, 13 kbps
Linear Predictive Encoding (Xerox), 5 kbps
Code Excited Linear Prediction (CELP)
Digital Video Interactive : ~ADPCM, 4 n 8 bits
VoIP: G723.1 (6.4kbits/s), G728, G729 (8kbits/s)

You might also like