You are on page 1of 38

I HC BCH KHOA NNG

KHOA IN T VIN THNG



GVHD: PGS-TS PHM VN TUN
SVTH: Cao Xun Binh
L Cng Nam

M HA M THANH
CHUN WAV


Vai tr x l mthanh
iu xung m PCM
Cc chun nn audio
Cc khi nim trong m thanh
Gii thut nn m thanh
Cu trc Wave File
Ni dung
Vai tr ca x l m thanh hinh nh trong
truyn thng a phng tin

Xu hng pht trin ca vin thng: cc nhu cu v dch v v
cc hn ch ca cng ngh truyn dn, chuyn mch lin quan
thy c vai tr ca x l m thanh v hnh nh;

Cc chun nn m thanh v hnh nh ang c s dng ph bin
trong truyn thng: G711, G729; JPEG; MPEG; H264
Cc c im ca h thng thnh gic
con ngi
S cm th ca tai ngi i vi
m thanh
Khong cch di m ln nht v yu nht m tai con ngi c th
nghe l 120dB, tc l di 1 triu ln bin . Ngi nghe c th pht
hin s thay i n m thanh khi tn hiu b thay i khong 1dB
(bin thay i 12%)

Nhn bit mc n lin quan mt thit vi cng sut m thanh theo
bc m 1/3.
- Nu tng cng sut m ln 10 ln, ngi nghe nhn c rng n
tng ln tm 2 ln (101/32 ln)
- Di nghe ca con ngi thng thng t 20Hz n 20kHz, nhy
m ln nht t 1kHz n 4kHz.
- Kh nng xc nh hng ngun m tt nhng xc nh khong cch
n ngun m km.


Ngng nghe
Mt n thi gian






Mt n tn s










T/h s
010011...
T/h tng
t x
a
(t)
Lng
t ha
M
ha
Ly
mu
T/h ri rc
x(n)
T/h s
x
q
(n)
M ha
ADC
27/11/2013 9
Bin i ADC

iu xung m PCM c c trng bi 3 qu trnh :
Ly mu
Lng t ha
M ha
iu xung m PCM
Lng t ha v m ha khng u
Thc hin:
1. Nn tn hiu tng t bng b khuch i nn phi tuyn
2. Lng t ha & m ha PCM
Lut nn:
1. Lut (Smith 1957)
) 1 ln(
) ) t ( s 1 ln(
) t ( s
1
2
+
+
=
27/11/2013 11
K thut PCM
c tuyn nn lut A

s <
+
+
s s
+
=
1 ) t ( s
A
1
,
A ln 1
) ) t ( s A ln( 1
A
1
) t ( s 0 ,
A ln 1
) t ( s A
) t ( s
1
1
1
1
2
- 1.0
- 1.0
1.0
1.0 0
A=87.6
A=1
A=5
s
1
(t)
s
2
(t)
Nn lut A
27/11/2013 12
K thut PCM
c tuyn nn xp x ha tuyn tnh lut A
1.0 1/2 1/4 1/8 1/16
1/8
8
6
7
5
4
3
2
1
7/8
6/8
5/8
1.0
4/8
3/8
2/8
0
c tuyn xp x c 13 on: 6 on phn dng, 6 on
phn m v 1 on gia
27/11/2013 13
K thut PCM
Khi nim : Ly mu l qu trnh ri rc ha tn hiu theo thi gian
Ly mu
1. Lng t ha u: Chia bin xung ly mu thnh cc khong u
nhau, mi khong l mt bc lng t u
Lng t ha
2. Lng t ha khng
u : Chia bin xung
ly mu thnh cc
khong khng u nhau
theo nguyn tc khi
bin xung ly mu
cng ln th di
bc lng t cng ln.
Lng t ha
1. Chc nng:
Chuyn i bin
xung lng t
thnh mt t m 8
bt.
2.c tnh b m
ha A=87,6/13
M ha
iu xung m vi sai DPCM
Tip
M ha ngun: Mc ch l ly d liu ngun v thu nh
chng bng cch loi b nhng phn d tha khng cn thit
cn tn ti trong ngun, li phn ngun vi s lng bt t
hn, nhng nhiu tin tc.

M ha knh: L tm ra nhng m c th truyn thng nhanh
chng cha ng nhiu m k hp l v c th sa li hoc
t nht pht hin li (M khi tuyn tnh: m chn l, m
tun hon, M kt hp).
M hnh min thi gian
Vic m t m thanh trong min tn s rt c hiu qu, tuy nhin
trong mt vi ng dng, tin vic nghin cu vic tng hp m thanh,
vic phn tch trong min thi gian li c u th hn.


Realaudio: nh dng ca cng ty RealNetworks, ch yu dng
cho pht nhc trc tuyn, nh dng u tin ra i nm 1995, n
nay c RealAudio 10

WMA: nh dng m thanh ca Microsoft, ra mt nm 1999, trn
l thuyt c th nn 96 kbps vi cht lng ca MP3 128 kbps.
WMA cng ph bin trong th gii m thanh pht trc tuyn.


Chun nn audio

Hiu ng che (masking)
Ngng nghe v mc nhy cm

Che tn s (Frequency Masking)
Che nht thi. (che thi gian)
Cc khi nim trong m thanh
Hiu ng che (masking): l m ln t m b, m mnh t m yu:

Cc khi nim trong m thanh
Che tn s: (Frequency Masking)
Pht ra 1 m c tn s 1 kHz vi mc to c nh l 60dB, gi l m
che (masking tone). Pht ra mt m khc (gi l test tone) mc
tn s khc (v d 1.1kHz), v tng mc to ca m ny cho n khi
c th nghe c n (phn bit c m 1.1 kHz v m che 1kHz).


Cc khi nim trong m thanh
Lm th nghim vi cc masking tones c cc tn s khc nhau, ta c
c hnh v:

Che tn s (Frequency Masking)
Che nht thi. (che thi gian)
Nu ta nghe mt m thanh ln, ri ngng n li, mi mt lc sau ta mi
c th nghe c mt m ln cn nh hn .

Cc gii thut nn m thanh
Nn khng tn tht

M ha Huffman

M ha Huffman sa i

M ha s hc

Gii thut Lempel Ziv Welch (LZW)

Huffman:
Da vo m hnh thng k tnh tn sut xut hin cc k t.
Gn cho cc k t c tn xut cao bng mt t m ngn, cc k t c
tn xut thp bng mt t m di.
Thut ton:
Bc 1:
Tnh tn sut xut hin cc k t trong d liu gc
Xy dng bng m
Sp xp li bng m theo th t tn sut gim dn
Bc 2:
To cy huffman


Gii thut nn khng tn tht
Huffman:
- Khi to: a cc node vo danh sch open theo th t ti mi thi
im.
- Lp li cho n khi danh sch open ch cn mt node bn tri nh sau:
T danh sch open, chn hai node c xc sut thp nht ri to node cha cho
chng. Sau a node cha vo danh sch open. Gn m 0,1 vo cc
nhnh cy v xa cc node con khi danh sch open.
Gii thut nn khng tn tht
K t Tn sut K t Tn sut xc sut
"1" 152 "0" 1532 0.2770
"2" 323 "6" 602 0.1088
"3" 412 "." 536 0.0969
"4" 226 " " 535 0.0967
"5" 385 "3" 112 0.0746
"6" 602 "5 " 385 0.0696
"7" 92 "2" 323 0.0585
"8" 112 "_" 315 0.0569
"9" 87 "4" 226 0.0409
"0" 1532 "+" 220 0.0396
"." 536 "1" 152 0.0275
"+" 220 "8" 112 0.0203
"_" 315 "7" 92 0.0167
" " 535 "9" 87 0.0158
Bng tn xut sp xp theo th t gim dn
HUFFMAN
3. M ha s hc:
tng: gi s mu t l [X,Y] v P(X) = 2/3
P(Y) = 1/3.
Nu ta ch quan tm chiu di m ha l 2 thng ip th ta c
th nh x tt c nh thng ip c th vo nhng on trong
phm vi [0..1]

X Y
XX XY YX YY
Gii thut nn khng tn tht
4. Gii thut Lempel-Ziv-Welch(LZW).
- Gia s chung ta muon ma hoa cho mot cuon t ien Tieng Anh 159,000 t.
Nh vay moi t can 18 bit e ma hoa.
- Nhc im: Dung qua nhieu bit. Ch lam viec cho ky t tieng Anh.
- Giai phap: Can phai tm mot cach ma hoa cuon t ien cho thch hp.
- Giai thuat:
w = NIL;
while (read a character k)
{
if wk exists in the dictionary
w = wk;
else
add wk to the dictionary;
output the code for w;
w = k;
}
Gii thut nn khng tn tht

typedef struct {
WORD wFormatTag;
WORD nChannels;
DWORD nSamplesPerSec;
DWORD nAvgBytesPerSec;
WORD nBlockAlign;
} WAVEFORMAT;

Cu truc Wave File

- wFormatTag:
wFormatTag thng c gi tr l WAVE_FORMAT_PCM c nh
ngha trong tp tin MMSYSTEM.H nh sau :
#define WAVE_FORMAT_PCM 1

Gi tr ny bo cho phn mm ang c Wave File bit kiu m ha d
liu m thanh sang d liu s l kiu m ha PCM. Hin nay y l
kiu m ha duy nht ca Wave file.
- nChannels: nChannels: c hai gi tr bng 1 cho m thanh mono v
bng 2 cho m thanh stereo.

Cu truc Wave File
- nSamplesPerSec:
nSamplesPerSec: cho bit tc ly mu, c cc gi tr:
11025 -- 11.025 kHz
22050 -- 22.050 kHz
44100 -- 44.100 kHz

- nAvgBytesPerSec:
nAvgBytesPerSec: cho bit s bytes yu cu trung bnh trong mt giy
pht li mu d liu ca sng m.

Cu truc Wave File

- nBlockAlign:
nBlockAlign: cho bit s byte dng cha mt mu m thanh. Nh
vy mu 8 bit hay t hn s yu cu 1 byte, mu 9 n 16 bit s yu
cu 2 bytes. Nu m thanh l Stereo th yu cu gp 2 ln s byte
dng cho m thanh mono.

Cu truc Wave File

[1] K thut Multimedia PGS-TS. Phm Vn Tun - HBKN

[2] Bi ging x l m thanh v hnh nh (Hc vin cng ngh BCVT)

[3] Bi ging K thut Multimedia ca Thy Phm Duy Nht Vin- HBK N

[4] Ti liu X l m thanh Hnh nh ca Trung tm o to BCVT 1

[5] WWW. Tailieu.vn

[6] Cc ti liu khc



Tai liu tham kho

You might also like