Professional Documents
Culture Documents
Bao Cao Thuc Tap
Bao Cao Thuc Tap
() () ; (3.1)
Hm ca s thng c dng l lm ca s Hamming:
()
; (3.2)
Hnh 3.2. th hm ca s Hamming
3.1.3. Bin i Fourier ri rc
Tn hiu ca mt khung sau khi nhn vi hm ca s, c chuyn sang min
tn s bng bin i Fourier ri rc:
() ()
; (3.3)
3.1.4. Lc qua cc b lc mel-scale
Cc b lc mel-scale l cc b lc tam gic, t cch u nhau trong min tn
s nh hn 1kHz v khong cch tng theo hm m trong min t 1kHz n fs/2 ( mt
na ca dy tn s ly mu).
CHNG 1
L QUC T D10CQDT01_N [Type text]
Hnh 3.3. Cc b lc mel-scale tam gic
Vi M b lc , ta hon ton xc nh c h s nhn h
i
(k) ca mi b lc.
Kt qu lc i vi tn hiu min tn s qua cc b lc c tnh nh sau:
()
()
|()| ; (3.4)
Ch : X(k) l s phc nhng thng tin v pha ca X(k) khng quan trng nn
ta ch tnh kt qu lc vi modun ca X(k).
Vic nhn tn hiu min tn s vi cc b lc mel-scale chuyn biu din
min tn s t thang Hz sang thang mel mc dch l phn gii tn s theo c im
cm th m ca con ngi: tuyn tnh i vi tn s nh hn 1kHz v phi tuyn i
vi tn s trn 1kHz.
3.1.5. Logarit v bin i Fourier ngc
Ly logarit ca tn hiu min tn s (spectrum) ri bin i Fourier ngc s
a tn hiu v mt min gi l cepstrum c n v thi gian. Bin i t spectrum
sang cepstrum l mt bin i ng hnh.
Cng thc ca bc ny l :
CHNG 1
L QUC T D10CQDT01_N [Type text]
() (()) [
) ]
; (3.5)
Mc d bin i t spectrum sang cepstrum l bin i Fourier ngc, tuy
nhin do ta dng spectrum v cepstrum thc nn ch s dng bin i cosine ri rc
(DCT) tng hiu nng tnh ton.
Sau bc ny ta c vector cepstral p thnh phn. Thng thng ngi ta
thng nhn thm vo kt qu mt hm ca s sng sin ( gi l th tc liftering)
gim bt nh hng ca cc bin i n kt qu.
()
)
(3.6)
() () () (3.7)
3.1.6. Tnh ton nng lng
Km theo thng tin v nng lng ca tn hiu s tng thm thng tin cho nhn
dng (v d: phn bit cc khong cha tn hiu m v khong lng, phn bit vng tn
hiu cha nguyn m v ph m...)
Nng lng ca c khung c tnh qua cng thc:
(())
(3.8)
3.1.7. Tnh ton c trng delta
c trng delta l o hm bc nht (ri rc) ca c trng theo thi gian. C cc c
trng delta s lm tng thm thng tin cho nhn dng ( chng hn: xc nh cc vng
m ph tn hiu n nh...). c trng delta c tnh theo cng thc:
()
() ( )
(3.9)
Trong C(t) l c vector cepstral ti thi im t. T l mt hng s chn trc, thng
T = 3.
CHNG 1
L QUC T D10CQDT01_N [Type text]
3.2. PHN TCH LPC TRONG NHN DNG TING NI
3.2.1. Phn tch LPC
Phng php phn tch LPC (Linear Predictive Coding) hay cn c gi l
phn tch m d on tuyn tnh l mt trong cc phng php phn tch tn hiu ting
ni mnh nht v c s dng ph bin.
tng vic s dng m hnh LPC l vic c th xp x mt mu tn hiu ting
ni thi im n bt k, s(n), nh l mt t hp tuyn tnh ca p mu trc . Ni
cch khc:
()
( )
( )
( ) (3.10)
Cc h s a
1
,a
2
,...a
p
c gi thit l khng i trong khung phn tch tn hiu.
Biu thc (3.10) c th c vit li thnh ng thc nu ta thm vo mt thnh phn
kch thch Gu(n) :
()
( ) ()
(3.11)
Gi s rng t hp tuyn tnh ca cc mu trc thi im xem xt l mt c
lng ca tn hiu, k hiu l ()
()
( )
(3.12)
Khi sai s d tnh e(n) s c tnh l:
() () () ()
( )
(3.13)
Vn t ra i vi phng php phn tch LPC l xc nh tp cc h s a
k
mt
cch trc tip t tn hiu sau cho sai s d tnh e(n) l nh nht. tm ra cc h s d
on, chng ta nh ngha cc khung tn hiu ngn hn v tng ng sai s ngn hn:
() ( ) (3.14)
() ( ) (3.15)
CHNG 1
L QUC T D10CQDT01_N [Type text]
Chng ta cn ti thiu ha tn hiu sai s trung bnh bnh phng thi im n:
()
(3.16)
Biu thc (3.16) c th vit li bng cch s dng cc nh ngha
()v
()nh
sau:
()
( )
(3.17)
tm cc tiu ca (3.17) chng ta ly o hm ln lt theo cc h s a
k
v cho
chng bng khng:
(3.18)
Khi chng ta c:
*
()
()
( ) [
( )
(3.19)
( )
()
( )
( )
(3.20)
t
( )
( )
( )
, khi
( )
( )
()
(3.21)
Thay vo (3.20) ta c biu thc thu gn nh sau:
( )
( ) (3.22)
CHNG 1
L QUC T D10CQDT01_N [Type text]
V i c gi tr t 1 ti p, nn ta c h phng trnh sau:
{
()
( )
( )
( )
(3.23)
Chuyn h phng trnh trn di dng ma trn:
[
()
()
()
()
( )
( )
( )
( )
( )
] [
] [
()
()
( )
] (3.24)
Qu trnh phn tch LPC s tnh cc biu thc
( ) (3.25)
Khi , tn hiu u ra ca b tin nhn c th tnh nh sau:
() () ( ) (3.26)
Hnh 3.5. Ph bin ca mch tin nhn tn hiu
Hnh 3.5 biu din bin c tnh hm truyn t (
)vi gi tr . T
hnh v, chng ta c th quan st thy rng ti , tc l bng mt na tc
ly mu, c s gia tng bin khong 32dB sao vi bin tn s
b. Phn tch tnh t tng quan
Sau khi qua khi tin nhn tn hiu, ta phn khung tn hiu ri sau nhn vi
hm ca s tng t nh trong phn trch chn c trng MFCC. Khi kt
qu ca tn hiu mi khung s l:
() ( ) (3.27)
Kt qu t tng quan ca mi khung tn hiu s to hm :
( )
( )
( )
(3.28)
c. Phn tch LPC
Ta tm cc h s a
k
t ma trn :
[
()
()
()
()
( )
( )
( )
( )
( )
] [
] [
()
()
( )
] (3.29)
d. Chuyn i cc tham s LPC sang cc h s Cepstral
CHNG 1
L QUC T D10CQDT01_N [Type text]
( ) (3.30)
( ) (3.31)
Vi a
m
l cc h s LPC, C
m
l cc h s Cepstral. Ta s dng cc h s ny
c trng cho khung tn hiu ting ni cho qu trnh nhn dng.
CHNG 1
L QUC T D10CQDT01_N [Type text]
CHNG 4. NG DNG K THUT MFCC V MNG NRON
NHN DNG TING VIT
4.1. NHN DNG TING VIT
4.1.1. Mt s c im ng m ting Vit
Mt c im d thy l ting Vit l ngn ng n m ( monosyllable mi t
n ch c mt m tit), khng bin hnh ( cch c, cch ghi m khng thay i trong
bt c tnh hung ng php no). Ting Vit hon ton khc vi cc ngn ng n- u
nh ting Anh, ting php l cc ngn ng a m, bin hnh.
Theo thng k trong ting Vit c khong 6000 m tit. Nhn v mt ghi m:
m tit ting Vit c cu to chung l : ph m - vn. V d m xinh c ph m l x c
m vn l inh. Ph m l mt m v v m v ny lin kt rt lng lo vi phn cn li
ca m tit ( hin tng ni li).
Vn trong ting Vit li c cu to t cc m v nh hn, trong c mt m
v chnh l nguyn m.
Hnh sau l ph tn hiu ca m tit ba. Chng ta c th quan st v phm bit
r min nhiu nn, min ph ca ph m b v nguyn m a (min m hn l c mt
nng lng ln hn).
Hinh 4.1. Ph tn hiu ca m tit ba
CHNG 1
L QUC T D10CQDT01_N [Type text]
Quan st ph cc m tit tng t chng ta c th rt ra kt lun: cc ph m
v nguyn m u phn bit vi nhau rt r qua s phn b nng lng ti cc min
tn s, v d: ph m thn s thp, nng lng nh, nguyn m c nng lng ln v
c vng tn s cao. Vng khng c tn hiu ting ni (nhiu nn v khong lng) c
nng lng thp v ch tp trung cc tn s rt thp.
Cc nguyn m c thn ph (spectrum) khc nhau kh r, Hnh sau minh ha
s khc nhau v ph ca 5 nguyn m c bn. Min m l min c mt nng
lng cao.
Hnh 4.2. S khc nhau v ph ca 5 nguyn m c bn.
Xt v mt ng m- m v hc m tit ting Vit c lc nh sau:
Lt cho thy m tit ting Vit c cu trc r rng, n nh. Lt cn
cho thy ting Vit l ngn ng c thanh iu. H thng thanh iu gm 6 thanh: bng,
huyn, sc, hi, ng, nng.
Thanh iu trong m tit l m v siu on tnh (th hin trn ton b m
tit). Do c trng v thanh iu th hin trong tn hiu ting ni khng r nt nh
cc thnh phn khc ca m tit.
S khc bit v cch pht m ting Vit rt r rt theo gii, la tui v c
bit l theo v tr a l.
CHNG 1
L QUC T D10CQDT01_N [Type text]
4.1.2. Nhng thun li v kh khn trong nhn dng ting Vit
4.1.2.1. Thun li
- Ting Vit l ngn ng n m, s lng m tit khng qu ln. iu ny s gip h
nhn dng xc nh ranh gii cc m tit d dng hn nhiu.
- Ting Vit l ngn ng khng bin hnh t. m tit ting Vit n nh, c cu trc r
rng. c bit khong ch 2 m tit no c ging nhau m vit khc nhau. iu ny s
d dng cho vic xy dng cc m hnh m tit trong nhn dng.
4.1.2.2. kh khn
- Ting Vit l ngn ng c thanh iu ( 6 thanh). Thanh iu l m v siu on tnh,
c trng v thanh iu th hin trong tn hiu ting ni khng r nt nh cc thnh
phn khc ca m tit.
- Cch pht m ting Vit thay i nhiu theo a l. Ging a phng trong ting Vit
rt a dng.
- H thng ng php, ng ngha ting Vit rt phc tp, rt kh p dng vo h
nhn dng vi mc dch tng hiu nng nhn dng. H thng phin m cng cha
thng nht.
- Cc nghin cu v nhn dng ting Vit cng cha nhiu v t ph bin. c bit kh
khn ln nht l hin nay cha c mt b d liu chun cho vic hun luyn v kim
tra cc h thng nhn dng ting Vit.
4.2. MNG NRON NHN TO
B no con ngi, di gc tnh ton c th coi l mt h thng x l
song song ln v mt kt ni cao: phn t x l l cc nron l mt v kt ni l cc
dy thn kinh.
Kh nng tuyt vi ca b no ngi gi ln nhng tng v vic m
phng chng trong lnh vc tnh ton. V mng nron nhn to (artificial neural
network ANN) l kt qu ca nhng tng .
4.2.1. M hnh mng n ron
C nhiu m hnh mng nron khc nhau. M hnh mng n gin v ph
bin nht l m hnh mng perceptron truyn thng nhiu lp (multi layer perceptron
MLP). l m hnh mng c s dng trong ti ny.
4.2.1.1. M hnh mt nron perceptron
Mt n ron perceptron l mt phn t x l gm :
CHNG 1
L QUC T D10CQDT01_N [Type text]
n u vo x
i
, mi u vo ng vi mt gi tr thc w
i
gi l trng s
Mt gi tr thc b gi l ngng (bias).
Mt hm kch hot f
Gi tr ra y.
Hnh 4.3. M hnh mt n ron perceptron
Gi tr ra ca perceptron c tnh theo quy tc sau:
(4.1)
() (4.2)
Hm kch hot c dng ph bin l hm sigmoid ( cn gi l hm logistic)
do tnh phi tuyn kh vi:
()
(4.3)
Ngoi ra cn c mt s hm kch hot khc : hm tan hyperbolic (tanh), hm
softmax.
Kh nng tnh ton ca mt n ron perceptron kh hn ch. ci thin
ngi ta ni chng thnh mng. M hnh mng n gin nht l mng perceptron
truyn thng a lp MLP.
CHNG 1
L QUC T D10CQDT01_N [Type text]
4.2.1.2. M hnh mng n ron MLP
Mng n ron MLP n u vo, m u ra c m hnh nh sau:
Cc n ron c chia thnh cc lp: lp sau c ni vi lp trc.
Lp u tin l lp vo ( input ), lp cui cng l lp ra (output).
Gia lp vo v lp ra l cc lp n (hidden). Thng thng ch c
mt lp n.
Tt c cc n ron cng mt lp s dng chung mt vector u vo.
Mi lp khi nhn mt vector u vo s tnh ra u ra ca mi n
ron, kt hp thnh mt vector v ly lm u vo cho lp sau.
Mng MLP nhn u vo l mt vector n thnh phn, ly lm u
vo ca lp input v tnh ton cho n khi lp output c u ra, ly
lm u ra ca mng l mt vector m thnh phn.
Ton b cc n ron ca mng s dng chung mt hm kch hot,
thng l hm logistic.
Ngoi lp vo v lp ra,mng MLP thng c mt hay nhiu lp n. Thng
thng ngi ta ch s dng mt lp n. V vy i khi ngi ta hay ng nht MLP
vi MLP 3 lp.
Hnh 4.4. M hnh mng perceptron 3 lp