Professional Documents
Culture Documents
File Goc 768960 PDF
File Goc 768960 PDF
3.5
Nhim v ca m-un ny l thu tn hiu t micro,
dng k thut x l u cui pht hin phn tn 3
0.4
0.2
0
Am
p
-0.2
-0.4
Hnh 4 Tn hiu ca t ti.
-0.6
Qua qu trnh x l theo chu trnh trn ta c c
-0.8 th dng xung nh sau:
(a) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Time (s)
! Phn tch ph
Nu nhng gi tr c khong cch u nhau, tc l
2k
xem w = , th bin i Fourier ri rc (DFT)
N
ca tt c cc frame ca tn hiu l:
F
hay Fmel = 2595 . log 10 1 + Hz (2.3)
1000
Vic phn tch ph s th hin nhng c trng tn
hiu ting ni m do chnh hnh dng ca vng pht
m to ra. Nhng c trng ph ca tn hiu ting
ni s c c sau khi cho qua nhng b lc. i
vi thang tn s Mel th mt lc cho mi thnh
Hnh 6 Qu trnh tnh cc h s MFCC. phn tn s mong mun (hnh 7). B lc ny c p
! Ca s ho tn hiu (Windowing) ng tn s dng tam gic, v khong cch hay bng
thng c xc nh bi mt hng s Mel.
Nhng phng php nh gi ph c in ch ng
tin cy trong trng hp tn hiu dng (stationary
signal), v d mt tn hiu m nhng c trng l
bt bin i vi thi gian. i vi tn hiu ting ni
th iu ny ch c c trong mt khong thi gian
ngn, vic ny c th thc hin c bng cch
ca s ho mt tn hiu x(n) thnh mt chui
lin tc nhng ca s tun t xt(n), t=1,2,,T,
gi l nhng frame.
Trong h thng nhn dng t ng th dng ca s
thng dng nht l Hamming window, p ng
xung ca n l mt hm cosin tng:
2n
0.54 0.46 cos n = 0,..., N 1
w(n ) = N 1
0 n khac
Hnh 7 Mt v d v b lc thang Mel
! Tnh nng lng logarit (LOG)
Cc bc trc ng vai tr lm phng ph, thc
hin mt x l ging nh tai ca con ngi. n
bc ny tnh ton logarit ca bnh phng ln Hun luyn:
nhng h s ti ng ra b lc. Ch rng tai ngi
Nhng mu
thc hin rt tt vic x l ln v logarit. Hn Ti Lui Tri hun luyn
th na, x l ln th loi b nhng thng tin
khng cn thit trong khi x l logarit thc hin
mt nn ng, trch c trng t nhy i vi nhng
bin i ng.
! Tnh ph tn s mel c lng
Bc cui cng trong vic tnh ph tn s mel thng s
(MFCC) bao gm thc hin bin i ngc DFT
ti lui tri
trn ln logarit ca ng ra ca b lc.
Ch rng do nng lng ph log l thc v i Nhn dng:
xng nn bin i DFT ngc c ni gn l
chuyn i cosine ri rc (Discrete Cosine O = ,,,,,,
Transform DCT). Tnh cht ca DCT l to ra
nhng c trng rt khc nhau. DCT cng c tc
dng lm phng ph nu ch c nhng h s u P(O/ti) P(O/lui) P(O/tri)
tin c gi li. Trong nhn dng ting ni th s Hnh 9 S m hnh HMM
h s MFCC thng nh hn 15. [6]
ng vi mi t cn nhn dng th chng ta c mt
Sau khi tn hiu ting ni c trch c trng th c s d liu cc c trng t cc ln c khc
mi t c c c trng bi mt ma trn h s nhau (nh trn s l 3 ln ly mu). Sau ta s
thc. Do m hnh HMM ri rc c ng dng
c lng cc thng s ca m hnh = ( A, B, )
nhn dng nn nhng vector c trng ny phi
c c lng vector (VQ) thnh mt ch s xc sut P(O|) t cc i, tng ng vi mi
codebook ri rc. Thut ton ph bin dng thit t l mt xc nh. nhn dng mt t th ta ch
k codebook l LBG (Linde, Buzo v Gray). vic tnh xc sut chui quan st ca t ng vi
cc c hun luyn, v chn mu no c xc
sut ln nht.
Da vo cc ti liu tham kho v nhng thng tin
v cc h thng nhn dng xy dng thnh cng
chng ti thy rng: i vi nhn dng tn hiu
ting ni th m hnh HMM thng c chn l
m hnh tri phi (left-right) c t 5 n 6 trng
thi. Qua qu trnh th nghim, m hnh c 6 trng
thi cho kt qu tt hn nn trong chng trnh ca
mnh, cc tc gi xy dng mt HMM vi s
trng thi l 6, xem hnh 10.
ti
phi tri