Professional Documents
Culture Documents
Gri 2007 619
Gri 2007 619
y V
z
x
ii
Mèlh trimeloÔ
epitrop
: Kwnstantno
Kotrìpoulo
, Nikìlao
Lskarh
, kai Iwnnh
P ta .
Wnsh .
iii
iv
Perieqìmena
1 Eisagwg 1
1.1 Stìqoi diatrib
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Logismì
Tanust¸n 5
2.1 DianÔsmata kai Pnake
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1.1 Orismo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1.2 Idiìthte . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
v
PERIEQ
OMENA PERIEQ
OMENA
4.2.3 PARAFAC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.3.1 Montèlo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.5.1 Prìblhma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.5.3 Apìdeixh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
vi
PERIEQ
OMENA PERIEQ
OMENA
4.9 Parathr sei . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
5 Exagwg Qarakthristik¸n 75
5.1 Eisagwg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
6 Peirmata 87
6.1 Eisagwg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
vii
PERIEQ
OMENA PERIEQ
OMENA
viii
Keflaio 1
Eisagwg
Stìqo th diatrib enai h melèth tou pedou th anang¸rish mousikoÔ edou se polugram-
apì mousik kommtia kai kajstatai epitaktik h angkh orgnwsh twn dedomènwn. En¸
èqoun anaptuqje ergalea gia thn qeiroknhth orgnwsh twn arqewn, o ìgko tou enai
tìso meglo pou kajist to egqerhma sqedìn adÔnato. 'Ara, enai epitaktik h angkh
prìblhma th anagn¸rish mousikoÔ edou ìmw den enai tetrimmèno: enai qarakthristikì
ìti den èqei protaje kama genik taxinìmhsh gia mousik edh kai akìma kai h anjr¸pinh
Sthn bibliografa èqoun protaje pollè teqnikè anagn¸rish mousikoÔ edou , pou sti
sei thn taxinìmhsh twn arqewn katìpin ekpadeush . Ma llh logik enai h omadopohsh
twn arqewn me bsh ta qarakthristik perigraf tou , afoÔ oÔtw llw den uprqei
mousikoÔ edou , qrhsimopoi¸nta ma nèa logik : ìti to s ma tou qou den enai monodista-
to, all mpore na anaparastaje me ma poludistath dom , ekmetalleÔonta sto èpakro thn
plhrofora pou uprqei sto s ma, kai dnonta èmfash sthn allag twn qarakthristik¸n
pou sunteloÔntai ston qrìno. H lÔsh sto prìblhma th poludistath anaparstash enì
s mato dnetai apì to pedo th polugrammik lgebra (multilinear algebra ) kai pio sugke-
krimèna apì ti poludistate domè pou onomzontai tanustè ( tensors). Katìpin melèth
tou pedou th anlush tanust¸n, protenetai sthn diatrib ma teqnik gia thn anlush kai
1
KEF
ALAIO 1. EISAGWG
H
tanust¸n ( non-negative tensor fa torization ) kai baszetai sthn idèa ìti èna opoiod pote
poludistato s ma enai sunduasmì basik¸n shmtwn pou den enai grammik¸ exarthmèna.
Parìmoioi algìrijmoi èqoun protaje sth bibliografa, kurw sto pedo th epexergasa
eikìna , all sthn paroÔsa diatrib dnetai èmfash sth dhmiourga enì algorjmou pou na
efarmìzetai se tanustè ìlwn twn diastsewn kai qrhsimopoi¸nta ma omda apì mètra
omoiìthta , mh periorzonta thn efarmog tou algorjmou se èna jèma kai mìno.
AkoloÔjw , gia thn eplush tou probl mato dhmiourgoÔntai tanustè dedomènwn kai e-
tìnou. Gia plhrìthta, sta peirmata taxinìmhsh qrhsimopoioÔntai kai sun jei algìrij-
moi mhqanik mjhsh , kaj¸ kai algìrijmoi paragontopohsh mh arnhtik¸n pinkwn, pou
Sto Keflaio 2 dnetai ma eisagwg sti basikè ènnoie th grammik kai polugram-
mik lgebra . Dnetai èmfash ston orismì twn tanust¸n kai sti idiìthtè tou .
Epsh dnetai orismì twn apoklsewn Bregman , oi opoe apoteloÔn thn bsh twn
proteinìmenwn algorjmwn.
Ma sunoptik diereÔnhsh tou pedou th autìmath anagn¸rish mousikoÔ edou d-
algìrijmoi taxinìmhsh .
nust¸n, pou apotele genkeush tou prohgoumènou. Tèlo , protenetai èna taxinomht
2
1.2. DOM
H DIATRIB
HS
3
KEF
ALAIO 1. EISAGWG
H
4
Keflaio 2
Logismì Tanust¸n
lgebra . Perièqei basikoÔ orismoÔ kai idiìthte th grammik lgebra gia lìgou plh-
rìthta kai sÔgkrish . Sthn sunèqeia, dnetai orismì kai basikè idiìthte twn tanust¸n.
Sth sunèqeia gnetai ma parjesh statistik¸n mètrwn uyhl txh , ta opoa perigrfo-
ntai me thn qr sh twn tanust¸n. To keflaio klenei me anafor sti apoklsei Bregman ,
pou qrhsimopoioÔntai sthn paroÔsa diatrib , parajètonta ti idiìthtè tou ston tanustikì
q¸ro.
2.1.1 Orismo
1. a + d b 2 V
2. 9 a: a + ( a) = a a = 0
3. 9 mhdenikì dinusma: 0 + a = a
4. 1 a = a; 8a 2 V
5. a + b = b + a
6. a + (b + ) = (a + b) +
7. ( + d) a = a + d a,
(a + b) =
a +
b,
(
d) a =
(da)
Pardeigma 2.1.1. 'Estw a dinusma m kou
n:
a= a a : : : an
1 2
(2.1)
5
KEF
ALAIO 2. LOGISM
OS TANUST
WN
koÔ ginomènou gia tou
q¸rou
V ; V . 'Ena stoiqeo tou q¸rou dianusmatikoÔ ginomènou
1 2
'Ena pio praktikì orismì tou pnaka: ma disdistath ditaxh stoiqewn enì pedou
F, pou èqei m grammè kai n st le , onomzetai pnaka mn sto F. Sumbolzetai w
A = [aij ℄mn , ìpou oi dekte mn orzoun ti dunamikè perioqè twn deikt¸n ij , dhlad
: : : : :
n
22
:
C
C
C
2
(2.2)
A
am am : : : amn 1 2
2.1.2 Idiìthte
i=1
Ta dianÔsmata u kai v enai orjog¸nia an u v = 0.
Orismì
2.1.4. Monadiao
Pnaka
( ) Orzetai w
o tetragwnikì
pnaka
( m = n):
I = [Æij ℄nn (2.4)
i=1 j =1
6
2.2. ORISM
OS TANUST
WN
A B
pinkwn kai AB = [(AB)ij ℄mk
sumbolzetai ij , kai èqei stoiqeo:
n
(AB)ij =
X
air brj (2.6)
r=1
AT = A 1
(2.7)
tètoio ¸ste AA = I. 1
Oi tanustè ( tensors ) apoteloÔn ma genkeush twn dianusmtwn kai twn pinkwn, ennoi¸n
duðkì q¸ro ston dianusmatikì). Ston kldo th fusik qrhsimopoioÔntai gia na perigr-
youn posìthte ston trisdistato q¸ro. Epsh , o ìro tanust qrhsimopoietai kai gia
thn perigraf pedwn. Gia tou parapnw lìgou , qrhsimopoioÔntai ekten¸ sto pedo th
mpore na anaparastaje w èna pollaplì ( multiway ) pnaka . Akolouje o orismì twn ta-
w pollaplo pnake , èqei protaje to N-way Toolbox gia peribllon MATLAB [2℄. Orismì
7
KEF
ALAIO 2. LOGISM
OS TANUST
WN
me diastsei
I ; I ; : : : ; IN .
1 2 JewroÔme N dianÔsmata u 2 V ; u 2 V ; : : : ; 1 1 2 2
pragmatik¸n ( I1 I : : : IN
2 )-tanust¸n, kai sumbolzetai me R I1 I2 :::IN . O antstoiqo
0 1 0 1 0 1
1 1 1C 2 2 2C 3 3 3C
A = [Aij jAij jAij ℄ Aij Aij Aij
B B B
1 2 1 me 1 =B
1 1 1A
C
2 =B
2 2 2A
C
3 =B
3 3 3A
C (2.8)
1 1 1 2 2 2 3 3 3
Gia lìgou plhrìthta paratjetai o orismì twn tanust¸n me qr sh mh-grammik¸n me-
ij11ij22:::i
:::jQ
metasqhmatzontai se
P ~ij11ij22:::i
:::jQ
P metabanonta
apì to sÔsthma suntetagmènwn xip
(p)
sto sÔsthma suntetagmènwn x~(ipp) gia p = 1; : : : ; P kai apì yj(qq) se y~j(qq) gia q = 1; : : : ; Q me ton
akìloujo kanìna:
~ ij11ij22:::i
:::jQ
= rs11rs22:::r ::: Q ::: P
1 2
(2.9)
P P
y~j(1)
1
y~j(2)
2
y~jQ x~r x~r
( )
x~rP
(1)
1
(2)
2
( )
Gia tanustè peperasmènh txh kai diastsewn orzontai kai oi basikè prxei th polu-
grammik lgebra , pou apoteloÔn genkeush twn basik¸n prxewn th grammik lgebra .
Oi paraktw orismo dnontai gia migadikoÔ tanustè , kai profan¸ mporoÔn na efarmostoÔn
8
2.3. PR
AXEIS ME TANUST
ES
Anloga orzetai kai to bajmwtì ginìmeno enì
tanust A me ma pragmatik stajer ,
pou dhmiourge ènan tanust dia
txh
kai diastsewn me ton A .
B 2 C J J :::JN
1 2
. To eswterikì ginìmeno w
pro
tou
dekte
im kai in , me koin distash
jImj = jJnj = p , orzetai w
hA; Bim;n 2 C I I :::Im 1 2 1 Im+1 :::IM J1 J2 :::Jn 1 Jn+1 :::JN :
X
p
(hA; Bim;n)i :::im1 1 im+1 :::iM j1 :::jn 1 jn+1 :::jN = ai :::im
1 1 kim+1 :::iM bj1 :::jn 1 kjn+1 :::jN (2.11)
k=1
Enai dunatìn na oriste eswterikì ginìmeno pnw se polloÔ dekte . Epsh , sthn
dÔo tanust¸n, oi opooi èqoun ma koin distash. To apotèlesma enai èna tanust 2h
txh o opoo èqei orzetai sti dÔo mh koinè diastsei twn pinkwn.
koin :
2
X
(hA; Bi ; )i i j j =
1 1 2 3 2 3
aki i bkj j
2 3 2 3 (2.12)
k=1
'Ara, gia na upologsoume ta stoiqea tou tanust hA; Bi ; 2 R
1 1
2 2 2 2
qrhsimopoioÔme thn
9
KEF
ALAIO 2. LOGISM
OS TANUST
WN
(A
B)i :::iM j :::jN = ai :::iM bj :::jN
1 1 1 1 (2.13)
'Ara to exwterikì ginìmeno dÔo tanust¸n (to opoo sunanttai sthn bibliografa kai w
perptwsh pinkwn (tanust¸n txh 2) ston dio dianusmatikì q¸ro, to exwterikì ginìmeno
onomzetai ginìmeno Krone ker , lìgw tou GermanoÔ majhmatikoÔ Leopold Krone ker . Sthn
sh me to jroisma th summetablht txh twn dÔo kai antstoiqa prokÔptei tanust me
diwn diastsewn bkl = 2. To exwterikì ginìmeno twn dÔo pinkwn enai tanust me stoiqea:
(A
B)i i i i = ai i bi i
1 2 3 4 1 2 3 4
(2.14)
2 2 2 2
S' aut n thn perptwsh to apotèlesma enai pnaka
diastsewn 2222=44 .
orzetai w ma polumetablht genkeush tou Dèlta tou Krone ker kai èqei stoiqea:
N
Y
Ii :::iN j :::jN =
1 1
Æij11:::i
:::jN
N = Æik jk (2.16)
k=1
10
2.3. PR
AXEIS ME TANUST
ES
nadiawn pinkwn I . Koin¸ , o monadiao tanust orzetai gia rtie txei (akolouje
pardeigma).
Ii i j j = Æi j Æi j ;
1 2 1 2 1 1 2 2
(2.17)
diastsei 2222 , tìte parnei tim 1 sta stoiqea: (1,1,1,1), (1,2,1,2), (2,1,2,1),
(2,2,2,2). Antistoqw , parnei tim 0 sta stoiqea (1,1,1,2), (1,1,2,1), (1,2,1,1), (2,1,1,1),
Epeid ta stoiqea tou monadiaou tanust den exart¸ntai apì sugkekrimènh epilog b-
pou oi timè twn stoiqewn tou enai anallowte me thn peristrof tou sust mato sunte-
tagmènwn onomzetai isotropikì . 'Oloi oi tanustè mhdenik txh (bajmwt megèjh) enai
deÔterh txh enai to Dèlta tou Krone ker . O monadikì isotropikì tanust trth
( ontra tion ) tou A w pro tou dekte ip kai iq , me koin distash u, enai o tanust
hAip;q , me stoiqea:
u
X
(hAip;q)i :::ip1 1 ip+1 :::iq 1 iq+1 :::iN = ai :::ip
1 1 kip+1 :::iq 1 kiq+1 :::iN (2.19)
k=1
H sustol enì tanust mei¸nei thn txh tou kat 2. Enai epsh dunatìn na oriste
sustol gia perissìterou apì 2 dekte . Sthn perptwsh pnaka, h sustol enai antstoiqh
me thn eÔresh tou qnou ( tra e ) tou pnaka. Sthn perptwsh miktoÔ tanust , h sustol
11
KEF
ALAIO 2. LOGISM
OS TANUST
WN
Axzei na shmeiwje h sqèsh tou eswterikoÔ ginomènou, tou exwterikoÔ ginomènou kai th
ginomènou w pro dedomènou dekte , dnei dio apotèlesma me to eswterikì ginìmeno twn
timè
ai i i i = 1.
1 2 3 4
H sustol tou A w
pro
tou
dekte
i ;i2 3 enai:
2
X
(hAi ; )i ;i =
2 3 1 4
ai kki = ai
1 4 1 11 4i + ai i
1 22 4
(2.20)
k=1
!
2 2
(hAi ; )i ;i = A0 = (2.21)
2 2
2 3 1 4
pnaka B 2 C Jn In . To ginìmeno tou A ep ton B w
pro
thn koin distash In orzetai w
:
A n B = hA; Bin; : 2 (2.22)
koinoÔ dekte . Analutik, ta stoiqea tou ginomènou upologzontai me bsh thn paraktw
sqèsh:
In
(A n B)i :::in
X
1 1 jn in+1 :::iN = ai :::k:::iN bjn k
1
(2.23)
k=1
ìpou in enai o koinì
dekth
tou tanust kai tou pnaka. ParathroÔme ìti to ginìmeno
tanust me pnaka, o tanust pou prokÔptei èqei thn dia txh me ton arqikì tanust :
(A n B) 2 C I :::Jn:::IN
1
. Apl o n-ostì
dekth
an kei plèon sto Jn .
Jewr¸nta
tanust A2 C I1 I2 :::IN kai pnake
B 2 C
Jn In kai C 2 C Jm Im orzontai
(A n B) m C = (A m C) n B = A n B m C (2.24)
12
2.3. PR
AXEIS ME TANUST
ES
H exswsh (2.24) ekfrzei thn prosetairistik idiìthta tou pollaplasiasmoÔ tanust me pna-
ke . To sÔmbolo sthn (2.25) ekfrzei ton pollaplasiasmì dÔo pinkwn, ìpw diatup¸jhke
ston orismì (2.1.6). Apì thn (2.25) fanetai h shmasa tou sumbolismoÔ n .
to ginìmeno B A C
H mpore na grafe w
: A B C. Fanetai kajar ìti o B
1 2
ai i i = 1.
1 2 3
Epsh
jewroÔme pnaka B2 R I4 I1 , diastsewn 22 :
B = 20 12
To ginìmeno A B1 dnetai apì thn paraktw sqèsh:
B)i i i =
2
X
(A 1 4 2 3 aki i bi k = a i i bi k + a i i bi k
2 3 4 1 2 3 4 2 2 3 4
k=1
!
3 3 2 2
A B= 1
3 3 2 2
I1 X
X I2 IN
X
hA; Bi = ::: ai i :::iN bi i :::iN
1 2 1 2
(2.26)
i1 =1 i2 =1 iN =1
'Opw fanetai apì thn (2.26), to bajmwtì ginìmeno tanust¸n apotele genkeush tou
eswterikoÔ ginomènou dÔo dianusmtwn, pou orsthke ston Orismì 2.1.3. Sthn perptwsh
I1 X
I2
hA; Bi = ai i bi i = tr(BH A)
X
1 2
(2.27)
1 2
i1 =1 i2 =1
13
KEF
ALAIO 2. LOGISM
OS TANUST
WN
ìpou tr() sumbolzei to qno enì pnaka, dhlad to jroisma twn stoiqewn th kÔria
diagwnou tou. ParathroÔme ìti to bajmwtì ginìmeno dÔo tanust¸n isoÔtai me to eswterikì
tou ginìmeno, w pro ìlou tou (koinoÔ ) dekte . Me bsh ton orismì tou bajmwtoÔ
p
kAk = hA; Ai
2 (2.28)
H nìrma Frobenius mpore na qrhsimopoihje san dekth gia thn mètrhsh tou megèjou
O paraktw orismì twn amoibaa orjogwnwn ( mutually orthogonal ) tanust¸n apotele ge-
hA; Bi = 0 (2.29)
A = a
a
a N
(1) (2) ( )
(2.30)
ìpou ai
( )
dianÔsmata me ai ( )
2 C Ii ; i = 1; 2; : : : ; N
kai sumbolzei to ginìmeno Krone
ker .
0 1 0 1 0 1
0 0 0C 1 0 1C 0 0 0C
A ij = 0 0 0C
1
B
B
A
A ij =
2
B
B
0 0 0CA
A ij = 0 0 0C
3
B
B
A
0 0 0 1 0 1 0 0 0
O A enai aposuntejeimèno
, kaj¸
mpore na grafe w
to exwterikì ginìmeno tri¸n dianu-
14
2.3. PR
AXEIS ME TANUST
ES
a i ?b i
( ) ( )
, a i b i = 0; 8i = 1; : : : ; N:
( ) ( )
(2.31)
Pardeigma 2.3.7. JewroÔme ton tanust A tou Paradegmato 2.3.6, kaj¸ kai ton apo-
suntejeimèno tanust B 2 R I I I 1 2 3
. O B èqei san bsh ta dianÔsmata b = (0; 1; 0); a =
(1) (2)
a i ?b i
( ) ( )
a i = b i ;
( ) ( )
8i = 1; : : : ; N: (2.32)
Pardeigma 2.3.8. JewroÔme ton tanust A tou Paradegmato 2.3.6, kaj¸ kai ton apo-
suntejeimèno tanust B2 R I1 I2 I3 . O B èqei san bsh ta dianÔsmata b = (0; 1; 0); a =
(1) (2)
(0; 1; 0); a = ( 1; 0; 1)
(3)
. ParathroÔme ìti a (1)
?b ; a
(1) (2)
= b ; a = b . 'Ara prokÔ-
(2) (3) (3)
Me bsh tou OrismoÔ 2.3.10-2.3.13, prokÔptei h paraktw idiìthta gia dÔo aposunte-
O paraktw orismì perigrfei thn ènnoia tou upo-tanust , pou qrhsimopoietai ston
orismì enì orjog¸niou tanust . Se antistoiqa me ton orismì enì orjog¸niou pnaka
(tou opoou ìla ta dianÔsmata pou ton apoteloÔn enai orjog¸nia an dÔo), h sunj kh pou
dÔo.
in .
15
KEF
ALAIO 2. LOGISM
OS TANUST
WN
hAin ; Ain i = 0
= = (2.34)
Sto [49℄ dnontai oi orismo twn orjogwnwn tanust¸n, maz me basikè idiìthtè tou . E-
psh , protenontai algìrijmoi gia thn eÔresh orjog¸nia aposÔnjesh tanust¸n, dhlad thn
2.3.9).
Poll probl mata sthn anlush tanust¸n enai dunatìn na epilujoÔn topojet¸nta ti ti-
mè twn tanust¸n se pnake antstoiqou megèjou . Basikì lìgo enai ìti h anaparstash
enì tanust se ènan pnaka enai pio katanoht apì programmatistik all kai diaisjhtik
poyh. Epsh , pollè gl¸sse programmatismoÔ kai pakèta anlush dedomènwn den upo-
nì probl mato ston tanustikì q¸ro se eplush poll¸n upo-problhmtwn ston q¸ro twn
pinkwn.
ptugma tou A w
pro
ton n-ostì dekth enai o pnaka
An ( ) 2 C In In In :::IN I I :::In
( +1 +2 1 2 1)
.
To stoiqeo ai i :::iN
1 2
tou A brsketai sthn in seir tou A n , kai sthn st lh:
( )
2; aij = 3 (i; j = 1; 2; 3)
3 .
0 1
1 2 3 1 2 3 1 2 3C
A (1)
B
=B
1 2 3 1 2 3 1 2 3 A
C
1 2 3 1 2 3 1 2 3
16
2.3. PR
AXEIS ME TANUST
ES
0 1
1 1 1 2 2 2 3 3 3C
A = (2)
B
B
1 1 1 2 2 2 3 3 3CA
1 1 1 2 2 2 3 3 3
To anptugma A (3) tou A enai pnaka
diastsewn I3 I I =333= 39
1 2 :
0 1
1 1 1 1 1 1 1 1 1C
A (3)
B
=B
2 2 2 2 2 2 2 2 2CA
3 3 3 3 3 3 3 3 3
Ma polÔ qr simh idiìthta tou anaptÔgmato tanust enai sthn perptwsh aposÔnjesh
A = B C C N C N
1
(1)
2
(2) ( )
(2.35)
An
( ) = C n B n [C
C
C n
( )
( )
(1) (2) ( 1)
C n
C N ℄T
( +1) ( )
(2.36)
ìpou
sumbolzei to ginìmeno Krone
ker .
Sto pedo th grammik lgebra , o grammobajmì apl¸ bajmì ( rank ) enì pnaka
isoÔtai me ton arijmì twn grammik¸ anexrthtwn gramm¸n sthl¸n tou pnaka. 'Ena
enallaktikì orismì tou bajmoÔ enai o arijmì twn mh mhdenik¸n oriak¸n tim¸n (idiazous¸n
tim¸n - singular values ) tou pnaka. Sto pedo th polugrammik lgebra , h ènnoia tou
bajmoÔ genikeÔetai, anloga me th morf tou tanust . Sthn perptwsh miktoÔ tanust ,
lìgw twn periorism¸n pou efarmìzontai sthn kataskeu tou, o bajmì isoÔtai me ton arijmì
twn summetablht¸n kai antimetablht¸n deikt¸n. Sthn perptwsh tanust N -ost txh
17
KEF
ALAIO 2. LOGISM
OS TANUST
WN
O parapnw orismì brsketai se apìluth antistoiqa me ton orismì enì pnaka me bajmì
1 sthn grammik lgebra: o pnaka pou dhmiourgetai apì to exwterikì (kartesianì) ginìmeno
i=1
ìpou i > 0; i = 1; : : : ; r stajerè
kai U i ( )
2 C I1 I2 IN tanustè
me bajmì 1. O bajmì
tou
'Estw ìti o A ikanopoie thn sqèsh (2.37). O orjog¸nio
bajmì
( orthogonal rank A
) tou ,
o opoo sumbolzetai me rank? (A), enai to elqisto r ètsi ¸ste o A ikanopoie thn (2.37)
bajmoÔ.
( strong orthogonal rank ) touA , o opoo
sumbolzetai me rank?s (A), enai to elqisto r ètsi
¸ste o A ikanopoie thn (2.37) kai isqÔei ìti U i ?sU j ; 8i 6= j
( ) ( )
.
Axzei na shmeiwje ìti, sthn perptwsh tanust¸n deÔterh txh , h aposÔnjesh bajmoÔ,
Ta dianÔsmata n-ostoÔ dekth tou A èqoun distash In kai prokÔptoun èqonta eleÔjero
n-ostì bajmì tou A sumbolzetai me rankn(A) kai enai h distash tou dianusmatikoÔ
m kai dianusmtwn st lh enì pnaka. Antstoiqa, o n-ostì bajmì tanust apotele
genkeush tou grammobajmoÔ. Sthn perptwsh tou tanust ìmw
, oi n-osto bajmo den tau-
tzontai metaxÔ tou
aparathta. Kai sthn perptwsh pou tautzontai, uprqei perptwsh na
diafèroun apì ton bajmì tou tanust . H sqèsh pou isqÔei metaxÔ tou bajmoÔ tanust kai
18
2.3. PR
AXEIS ME TANUST
ES
H eÔresh tou bajmoÔ enì tanust enai polÔ pio sÔnjeto se sqèsh me thn eÔresh th
n-ostoÔ bajmoÔ, giat jewre ton tanust san ma n-distath posìthta. O kajorismì
tou
bajmoÔ enì
I I IN tanust enai akìma anoiktì prìblhma sthn bibliografa. 'Eqoun
1 2
protaje teqnikè gia thn eÔresh bajmoÔ tanust¸n sugkekrimènh txh kai diastsewn (pq.
gia tanustè 2 2 2 ). Epsh , apodeiknÔetai ìti to prìblhma eÔresh tou bajmoÔ
enì tanust 3h txh enai ekjetik poluplokìthta . Tèlo , apodeiknÔetai ìti h tim
tou bajmoÔ tou tanust exarttai apì to pedo sto opoo orzetai o tanust . Koin¸ , o
bajmì enì pragmatikoÔ tanust pou orzetai sto pedo twn pragmatik¸n arijm¸n den èqei
dia tim me ton bajmì tou diou tanust an oriste sto migadikì pedo. Me thn parapnw
parat rhsh fanetai h diafor sthn eÔresh tou bajmoÔ enì tanust se sqèsh me thn eÔresh
tou bajmoÔ enì pnaka, ìpou èqoun protaje pollè teqnikè gia thn eÔres th (anlush
ìson A=a
a
a
(1) (2) (3)
, o bajmì
tou A den mpore na enai megalÔtero
apì 1, sÔmfwna
me thn (2.37). Epeid profan¸ o bajmì enì tanust enai jetikì akèraio , o bajmì tou
A isoÔtai me 1.
!
1 1 2 2
A =(1)
2 2 1 1
O 1-ostì
bajmì
tou A isoÔtai me ton bajmì tou A (1) : rank 1 (A) = 2 .
!
1 2 1 2
A =(2)
2 1 2 1
O 2-ostì
bajmì
tou A isoÔtai me ton bajmì tou A (2) : rank 2 (A) = 2.
!
1 2 2 1
A =(3)
1 2 2 1
19
KEF
ALAIO 2. LOGISM
OS TANUST
WN
ìpou a = (0 1) a = (2 1)
(1)
, a = (1 0)
(2)
A
, kai
(3)
. 'Ara, o bajmì
tou den enai megalÔtero
Sto pedo th grammik lgebra , h ènnoia th summetra sunanttai se dÔo ekdoqè : thn
pragmatik kai thn ermitian summetra. Sto pedo th polugrammik lgebra , h pragmati-
Protenetai epsh kai h ènnoia th summetra an dÔo ( pairwise symmetry ), pou apotele
migadikoÔ tanust , h genkeush den enai tetrimmènh, kai sqetzetai me thn anaparstash
kai dinusma x 2 RI . To sqetizìmeno polu¸numo ( asso iated polynomial A(x) A ) tou or-
zetai w :
I X
X I I
X
A(x) = A x x : : : N x =
1 2 ::: ai i :::iN xi xi : : : xiN
1 2 1 2
(2.39)
i1 =1 i2 =1 iN =1
I X
X I I
X
A(x) = A x~ x~ : : : N x~ N =
1
(1)
2
(2) ( )
::: ai i :::iN x~i x~i : : : x~iNN
1 2
(1)
1
(2)
2
( )
(2.40)
i1 =1 i2 =1 iN =1
ìpou ta dianÔsmata x~ n
( )
isoÔntai ete me to xn ( )
ete me to x n (n = 1; 2; : : : ; N ).
( )
1
(2)
2
( N)
aP i i :::iN = ai i :::iN
, isqÔei ìti ( 1 2 ) 1 2
. 2) Gia kje
( 1)
(2)
(
N
2)
(
(
)
)
aP i i :::iN =
(1)
1
(2)
2
( )
isqÔei ìti ( 1 2 )
ai i :::iN .
1 2
20
2.3. PR
AXEIS ME TANUST
ES
Axzei na shmeiwje ìti to sqetikì polu¸numo enì uper-summetrikoÔ tanust parnei mìno
pragmatikè timè . Epsh , o orismì th uper-summetra enai anexrthto apì thn bsh
sthn opoa orzetai to dinusma x. Oi uper-summetriko tanustè èqoun meglh qr sh sto
proseq enìthta. Akolouje èna pio qalarì orismì th summetra tanust¸n, h summetra
O A enai summetrikì
an dÔo an, gia kje zeÔgo
deikt¸n ( in ; in ), uprqei ma antimetjesh
1 2
! !
1 2 2 3
aij = aij =
1
2 3 2
3 4
ParathroÔme ìti isqÔei h akìloujh sqèsh: ai i i = ai i i
1 2 3 2 3 1.
Epsh
parathretai ìti h para-
pnw sqèsh enai dunatìn na isqÔsei an gnoun dÔo antimetajèsei
deikt¸n ston ai i i i ;i
1 2 3: ( 1 2)
kai ( i ;i
1 3 ). 'Ara, o A enai summetrikì
an dÔo.
anmesa se ènan pnaka kai ènan tanust 3h txh . Profan¸ , apotele ma genkeush
J1 X
X J2 X
Jn
1
bi i :::in =
1 2 2
i i :::in
1 2 j j :::jn1 aj1 j2 :::jn1 :
2 1 2
(2.42)
j1 =1 j2 =1 jn1 =1
21
KEF
ALAIO 2. LOGISM
OS TANUST
WN
nta ta anaptÔgmata twn tanust¸n. Se aut n thn perptwsh, to anptugma tou B mpore na
upologiste me to ginìmeno twn pinkwn pou ekfrzoun ta anaptÔgmata twn A kai C . Na sh-
meiwje epsh ìti sthn perptwsh twn mikt¸n tanust¸n, o metasqhmatismì gnetai anmesa
H paroÔsa enìthta anafèretai ston orismì statistik¸n mètrwn uyhlìterh
txh
( higher-
order statisti
s ). Sundèontai me to pedo th
anlush
tanust¸n, kaj¸
oi ropogenn trie
kai susswreÔtrie enai sthn ousa uper-summetriko tanustè uyhl txh . Perigrafè twn
! ! k !nkn !
(2.46)
1 2 k
1
1
2
2
!
1= 2= !n
= =0
22
2.4. STATISTIK
A M
ETRA UYHL
OTERHS T
AXHS
r (!
; ! ; : : : ; !n)
k k :::kn = ( j )r 1 2
! k ! k !nkn !
1 2
(2.47)
1
1
2
2
1= 2= ! !n
= =0
Na shmeiwje epsh ìti gia diaforetikè tuqae metablhtè h rop deÔterh txh a-
nafèretai w eterosusqètish ( ross- orrelation ), en¸ h rop deÔterh txh gia ma tuqaa
m = Efx g;
1 1 m = Efx 2
2
1
g
m = Efx g;
3
3
1
m = Efx 4
4
1
g
kai sundèontai me ti
antstoiqe
susswreÔtriè
tou
me ti
akìlouje
sqèsei
:
=m ;
1 1
=m2 2 m 2
1
= m 3m m + 2m ;
3 3 2 1
2
1
=m4 4 4m m 3 1 3m + 12m m
2
2 2
2
1
6m 4
1
MN = fx
x
xg
E (2.48)
Praktik, h rop N -ost txh tuqaou dianÔsmato enai tanust N -ost txh . Gia
m = Efxg;
1 M = fx
x g
2 E
M = Efx
x
xg;
3 M = fx
x
x
xg
4 E
Gia stsime stoqastikè diadikase x(t), enai dunatìn na oristoÔn ropè me bsh ti
qronikè kajuster sei :
mn ( ; ; : : : ; n ) = Efx(t)x(t + ) x(t + n )g
1 2 1 1 1 (2.49)
23
KEF
ALAIO 2. LOGISM
OS TANUST
WN
Oi sqèsei
pou sundèoun ti
susswreÔtrie
me ti
ropè
th
x(t) gia thn pr¸th kai deÔterh
txh:
= m = Efx(t)g;
1 1
( ) = m ( ) (m )
2 1 2 1 1
2
n
Y
m( x ; x ; : : : ; n xn ) =
1 1 2 2 i m(x ; x ; : : : ; xn )
1 2 (2.50)
i=1
Yn
( x ; x ; : : : ; n xn ) =
1 1 2 2 i
(x ; x ; : : : ; xn )
1 2 (2.51)
i=1
paraktw sqèsei gia ti apì koinoÔ ropè kai susswreÔtrie gia ti tuqae metablhtè
x + y; x ; : : : ; xn :
1 2
(x + y; x ; : : : ; xn ) =
(x ; x ; : : : ; xn ) +
(y; x ; : : : ; xn )
1 2 1 2 2 (2.53)
Mx = Mx A A N A
~
N N 1 (2.54)
2
CNx = CNx
~
1 A A N A
2 (2.55)
24
2.4. STATISTIK
A M
ETRA UYHL
OTERHS T
AXHS
m(x ; x ; : : : ; xn ) = m(xP ; xP ; : : : ; xP n )
1 2 (1) (2) ( ) (2.56)
(x ; x ; : : : ; xn ) =
(xP ; xP ; : : : ; xP n )
1 2 (1) (2) ( ) (2.57)
Basik sunèpeia th Idiìthta 2.4.4 enai ìti oi ropè kai susswreÔtrie tuqawn dianu-
blht y. An h sunrthsh puknìthta
pijanìthta
th
y , p(y ) enai rtia, tìte oi ropè
kai
oi susswreÔtrie
peritt
txh
th
y mhdenzontai.
Idiìthta 2.4.6. Metatìpish ( ) JewroÔme tuqae
metablhtè
x ; x ; : : : ; xn
1 2 kai stajer
m( + x ; x ; : : : ; xn ) = m(x ; : : : ; xn ) + m(x ; : : : ; xn )
1 2 2 1 (2.58)
( + x ) = +
(x )
1 1 (2.59)
( + x ; x ; : : : ; xn ) =
(x ; x ; : : : ; xn )
1 2 1 2 (2.60)
x ; : : : ; xn .
2 An èna uposÔnolo twn n tuqawn metablht¸n enai anexrthto w
pro
ti
upì-
loipe
, tìte h apì koinoÔ susswreÔtria isoÔtai me mhdèn.
San sunèpeia th idiìthta 2.4.7, h susswreÔtria uyhl txh enì tuqaou dianÔsmato
e
metablhtè
x ; x ; : : : ; xn ; y ; y ; : : : ; yn.
1 2 1 2 An oi x ; x ; : : : ; xn
1 2 enai anexrthte
apì ti
(x + y ; x + y ; : : : ; xn + yn ) =
(x ; x ; : : : ; xn ) +
(y ; y ; : : : ; yn)
1 1 2 2 1 2 1 2 (2.61)
y pou akolouje thn kanonik katanom . H mèsh tim kai diakÔmansh th
y isoÔntai me autè
th
x. IsqÔei h akìloujh sqèsh gia txh N > 3:
25
KEF
ALAIO 2. LOGISM
OS TANUST
WN
pou akolouje kanonik katanom isoÔntai me 0. Genik, oi susswreÔtrie txh N >2 den
ephrezontai apì th mèsh tim tuqawn metablht¸n. Epsh , oi idiìthte twn susswreutri¸n
gia kanonikè tuqae metablhtè , kaj¸ kai gia amoibaa anexrthte metablhtè , odhgoÔn
T
^ xN =
m
1X xN
T t (2.63)
t=1
ton tÔpo:
sei sqetik me thn ektmhsh rop¸n kai idiìthte twn ektim¸menwn rop¸n uyhlìterh txh
Gia ton upologismì twn susswreutri¸n, èqei protaje h qr sh twn k-statistik¸n, pou
apoteloÔn mh polwmènou
ektimhtè
susswreutri¸n. Gia ton orismì twn k -statistik¸n, qrh-
T
~ xN =
m
1X (x ^ x)N
m
t (2.65)
T t=1
1
kx =
^x = m
1
^x
1 1
T
kx =
^x = ~
m x
2
T 1
2 2
T 2
kx =
^x =
2) m~
x
3
(T 1)(T
3 3
T 2
kx4
=
^x
4
= (T 1)(T 2)(T 3) (T + 1)m~ x 4
3(T 1)(m~ x)2
2
26
2.5. APOKL
ISEIS BREGMAN
ApodeiknÔetai ìti gia T ! 1 ^xN , h sugklnei sthn xN me pijanìthta 1. Epsh isqÔei ìti
E f g=
^xn
xN . Oi diakumnsei
th
^xN gia txei
1 èw
4 dnontai paraktw:
x
Varf
^xg = T
1
2
Varf
^xg =
T + 2(
x)
x 2
4 2
2
T 1
Varf
^xg = T +
T + 9(1
) + (T 6T1)(
9 (
x)
x x x x 2 3
6 4 2 3 2
3
T 2)
Varf
^xg =
T + 16
+ 48T
1 + 34(
)
x x x x x x 2
8 6 2 5 3 4
4
T 2)(T 3)
Tèlo , apodeiknÔetai ìti h ^xN asumptwtik akolouje thn kanonik katanom . H diakÔmansh
th ektim¸menh susswreÔtria uyhl txh epitrèpei thn ektmhsh tou arijmoÔ deigmtwn
H paroÔsa enìthta anafèretai se ma oikogèneia sunart sewn pou enai gnwstè w apokl-
sei Bregman . Protjhkan apì ton Bregman to 1967 [16℄. Ja dojoÔn oi basiko orismo
kai idiìthte , kaj¸ kai oi sqèsei pou anafèrontai sqetik me ti apoklsei Bregman ston
tanustikì q¸ro. Oi sqèsei pou anafèrontai ja qrhsimopoihjoÔn gia thn eÔresh tou algo-
kai orzetai w :
ìpou S sÔnolo, int(S ) to eswterikì sÔnolo (interior set) tou S , kai 0 (y ) h pr¸th pargwgo
th
.
jroisma dÔo apoklsewn Bregman enai epsh apìklish Bregman . AkoloÔjw , o orismì
X
D (X; Y) = D (xij ; yij ) (2.67)
ij
27
KEF
ALAIO 2. LOGISM
OS TANUST
WN
2 IN
I1 IX
D (X ; Y ) = D (xi i :::iN ; yi i :::iN )
1 2 1 2
(2.68)
i1 i2 :::iN
Paradegmata tupik¸n (x) pou qrhsimopoioÔntai gia thn dhmiourga apoklsewn Bergman
paratjentai ston Pnaka 2.1 [4℄ [83℄. H Nìrma Frobenius 2ou BajmoÔ, alli¸
tetragwnik
Eukledia apìstash, enai h pio eurèw qrhsimopoioÔmenh apìklish Bregman . Enai kurt
kai ston 2o ìro th . Sto sq ma 2.1 apeikonzetai h diadikasa upologismoÔ th apìklsh
h diafor metaxÔ dÔo katanom¸n puknìthta
pijanìthta
. Kai h apìklish Kullba
k-Leibler
enai kurt ston 2o ìro th
. H apìstash Itakura-Saito qrhsimopoietai sto pedo th
epexer-
28
2.5. APOKL
ISEIS BREGMAN
Sq ma 2.2: Apeikìnish upologismoÔ th apìklish Bregman gia (z ) = z log(z ) (apì to
[83℄).
29
KEF
ALAIO 2. LOGISM
OS TANUST
WN
30
Keflaio 3
3.1 Eisagwg
tìso apì thn ankthsh analogik¸n arqewn ìso kai apì nèa yhfiak dedomèna. Kajstatai
loipìn epitaktik h angkh gia th dhmiourga axiìpistwn kai gr gorwn ergalewn gia an-
lush dedomènwn gia na qrhsimopoihje se anaz thsh kai prìsbash se ulikì. Sto pedo th
anaz thsh kai ankthsh dedomènwn, ta mousik edh ( musi al genres) èqoun meglo rìlo,
kaj¸ qrhsimopoioÔntai ed¸ kai poll qrìnia sthn orgnwsh mousik¸n katalìgwn, mousik¸n
mousikoÔ kommatioÔ ( musi al tra k ) me kpoio mousikì edo enai shmantik sto na bohj sei
Parìla aut, to prìblhma th anagn¸rish mousik¸n eid¸n den enai tetrimmèno. En¸
uprqei diadedomènh qr sh twn mousik¸n eid¸n, aut paramènoun ma kak¸ orismènh ènnoia.
Enai qarakthristikì ìti h swst taxinìmhsh mousikoÔ edou apì anjr¸pou (mh eidikoÔ
sthn taxinìmhsh mousik¸n eid¸n) ftnei polÔ qamhl posost: gia to prìblhma th taxinì-
mhsh kommati¸n se 10 mousik edh, h anjr¸pinh akrbeia enai mìli 53% gia degmata 250
mse , kai anebanei sto 72% gia degmata 3 se [72℄. Tautìqrona, en¸ polÔ suqn qrhsimo-
kommti antistoiqe se perissìtera apì èna mousik edh. Tèlo , tjetai to z thma poÔ ja
efarmoste h taxinìmhsh mousikoÔ edou : sto kommti, sto lmpoum (sullog kommati¸n)
to lmpoum. 'Omw , gnetai katanohtì ìti afoÔ èna kommti mpore na an kei se perissìtera
apì èna edh, tìte èna lmpoum, to opoo mpore kai na perièqei eterogenè ulikì, den enai
31
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
'Ara, gia to prìblhma th taxinìmhsh mousik¸n eid¸n, apaitetai ma ierarqik taxinìmhsh
twn mousik¸n eid¸n, h opoa na kalÔptei ìso to dunatìn gnetai ìla ta edh, kai parllhla
na mhn uprqoun pollè epikalÔyei kai asfeie stou orismoÔ twn eid¸n. O Pa het ìmw
èdeixe ìti h dhmiourga ma tètoia ierarqa den enai aplì z thma [64℄. Gia pardeigma, to
site Amazon1 èqei kathgoriopoi sei kommtia qrhsimopoi¸nta
719 mousik edh. To site All-
musi
2 qrhsimopoie 531 edh kai to Mp3 3 qrhsimopoie 430 edh. Mìno 70 mousik edh tan
koin kai sta tra sites . Epsh , parathretai apì ton Pa het ìti oi asfeie kai diaforetikè
taxinom sei eid¸n den apoteloÔn kat angkh prìblhma gia tou qr ste , all profan¸
den prosfèrontai gia qr sh se sust mata autìmath taxinìmhsh [64℄. 'Ena llo prìblhma
prostjetai sto prìblhma eÔresh taxinìmhsh : ìti exarttai apì thn perioq sthn opoa
protenetai. Gia pardeigma, èna kommti ènteqh ellhnik mousik se mh ellhnikè uphre-
se ja taxinomhje w mousik tou kìsmou ( world musi ). Epsh , uprqei asfeia me bsh
ta krit ria apì ta opoa prokÔptei ma ierarqa: lle ierarqe prokÔptoun qarakthrzonta
qronikè periìdou (pq. tragoÔdia dekaeta tou 50), lle me bsh th q¸ra, me bsh th
gl¸ssa, me bsh to jèma twn kommati¸n, me bsh ton kallitèqnh. 'Ena teleutao prìblhma
pou prokÔptei, enai h prosj kh nèou mousikoÔ edou sthn dh uprqousa ierarqa. Ta nèa
mousik edh prokÔptoun sun jw sugqwneÔonta dh uprqonta edh, qwrzonta èna dh
uprqon edo se kathgore . To parapnw jèma apotele prìblhma gia èna sÔsthma autì-
math taxinìmhsh mousik¸n eid¸n, to opoo ja prèpei na metabllei dunamik thn ierarqa
tou.
To parìn keflaio suneqzei me ma anaskìphsh twn dedomènwn kai ierarqi¸n pou qrhsi-
mopoi jhkan se efarmogè autìmath taxinìmhsh mousik¸n eid¸n sthn enìthta 3.2. Sthn
enìthta 3.3 perigrfontai ta qarakthristik pou exgontai apì thn kumatomorf tou mousi-
koÔ kommatioÔ gia thn perigraf mousikoÔ edou . Sthn enìthta 3.4 perigrfontai sunoptik
algìrijmoi taxinìmhsh pou èqoun qrhsimopoihje gia thn eplush tou probl mato th au-
tìmath taxinìmhsh mousikoÔ edou . Tèlo , h enìthta 3.5 parajètei telik sumpersmata
gia to pedo.
natìthta prosj kh nèou mousikoÔ edou , kai h ierarqa na enai antikeimenik , dhlad na
1 http://www.amazon.
om
2 http://www.allmusi
.
om
3 http://www.mp3.
om
32
3.2. S
UNOLA DEDOM
ENWN KAI IERARQ
IES
mhn ephrezetai apì qronikoÔ kai topikoÔ pargonte [64℄. H ierarqa pou dhmiourg jhke
eqe san pr¸to eppedo basik mousik edh kai se deÔtero eppedo perigrafè exeidkeush
tou mousikoÔ edou (oi perigrafè anafèrontai se q¸ra, enorq strwsh, kai kallitèqnh).
Sunolik h ierarqa pou protjhke perieqe 378 mousik edh. 'Omw , telik o Pa het e-
gkatèleiye th dhmiourga th ierarqa [3℄, lìgw poll¸n epikalÔyewn pou up rqan anmesa
se edh kai se euaisjhsa th taxinìmhsh se nèa mousik edh pou prokÔptoun apì mexh
eid¸n pou up rqan sthn ierarqa. H nèa ierarqa pou protjhke apì ton Pa het - gia to
prìgramma anaz thsh mousik¸n eid¸n tou ereunhtikoÔ èrgou Cuidado - eqe 2 eppeda, to
pr¸to me 18 genik mousik edh kai to deÔtero me 250 upo-edh [65℄. Ta 18 genik mousik
Ambien
e, Blues, Classi
al, Country, Ele
troni
a, Folk, Hard, Hip Hop, Jazz, New
edh enai:
Oi Tzanetkh kai Cook dhmioÔrghsan to 2002 ma bsh dÔo epipèdwn gia peirmata
anagn¸rish mousikoÔ edou (GTZAN dataset ) [84℄. Sto pr¸to eppedo uprqoun 10 basik
mousik edh, en¸ to deÔtero eppedo perièqei 4 upo-edh gia thn kathgora Classi al kai 6
upo-edh gia thn kathgoraJazz . Gia kje mousikì edo kai upo-edo dhmiourg jhkan 100
se
arqea, dirkeia
30 to kajèna. Na shmeiwje ìti ta peirmata genik
taxinìmhsh
pou
pragmatopohse o Tzanetkh sto [84℄ èginan xeqwrist apì ta peirmata taxinìmhsh sta
upo-edh th klassik kai jazz mousik . H ierarqa th bsh dedomènwn tou Tzanetkh
apeikonzetai sto Sq ma 3.1. Peirmata sthn bsh tou Tzanetkh pragmatopoi jhkan apì
ton dio sto [84℄, apì ton Li sto [57℄, apì ton Lidy sto [58℄, kai pragmatopoioÔntai kai sthn
paroÔsa diatrib .
Choir
Classi
al Or
hestra
Country Piano
Dis
o String Quartet
HipHop Bigband
Jazz Cool
Ro
k Fusion
Blues Piano
Reggae Quartet
Pop Swing
Metal
O Ogihara sta peirmata pou pragmatopohse gia anagn¸rish mousik¸n eid¸n, dhmioÔr-
ghse ma bsh qrhsimopoi¸nta mousik kommtia apì proswpikè sullogè [57℄. H bsh
33
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
perieqe 756 arqea qou, ta opoa sugkentr¸jhkan apì 189 mousik lmpoum. Ta arqea
tan organwmèna se 5 mousik edh: Ambient, Classi al, Fusion, Jazz, Ro k kai . Kje arqeo
eqe dirkeia 30 se
. 109 arqea dhmiourg jhkan gia thn klsh Ambient , 164 gia thn Classi-
al , 136 gia thn Fusion , 251 gia thn Jazz
, kai 96 gia thn Ro
k . Ta arqea èqoun suqnìthta
anaptÔqjhke apì ton Ogihara den perilambnei arket mousik edh gia na qarakthriste
axiìpisth. Epsh , uprqei asfeia ston orismì twn eid¸n: to fusion apoteletai apì th
sugq¸neush dÔo perissìterwn mousik¸n eid¸n kai den prèpei na oriste apì mìno tou san
th (kommtia pou enai sthn kathgora Classi ial kai Jazz ja mporoÔsan na qarakthristoÔn
w Ambient ).
nismì anagn¸rish mousikoÔ edou . To pr¸to sÔnolo onomzetai ISMIRgenre kai perièqei
1458 arqea me genikè klsei mousik¸n eid¸n, kalÔptonta 6 genikè klsei . To deÔtero
sÔnolo perieqe mousik edh qor¸n, gia taxinìmhsh kommati¸n me bsh ton rujmì. Onomze-
tai ISMIRrhythm kai perièqei 698 arqea pou kalÔptoun 8 qoreutik mousik edh. Ta edh
pou kalÔptei h bsh ISMIRgenre parousizontai ston Pnaka 3.1 kai ta edh pou kalÔptei
h bsh ISMIRrhythm ston Pnaka 3.2. Peirmata kai sthn bsh ISMIRgenre pragmatopoi-
jhkan sta plasia tou diagwnismoÔ ISMIR 2004 apì tou
Lidy Ellis West Pampalk
, , , , kai
Tzanetkh [58℄. Peirmata sthn bsh ISMIRrhythm pragmatopoi jhkan apì tou
Dixon
[25℄, Gouyon [36℄, kai Lidy [58℄.
bsei dedomènwn apì diaforetikè phgè . H pr¸th bsh apoteletai apì 1.515 kommtia pou
kalÔptoun 10 mousik edh:
lassi
al, ambient, ele
troni
, new-age, ro
k, punk, jazz, blues,
folk, kaiethni
. H deÔterh bsh apoteletai apì 1.414 kommtia pou kalÔptoun 6 mousik
edh: ro k, hip-hop, ountry, new-age, kai reggae . Sugkentrwtik apotelèsmata gia ta dÔo
To 2005 o Meng dhmioÔrghse dÔo sÔnola dedomènwn gia peirmata se anagn¸rish mou-
sikoÔ edou
[62℄. To pr¸to sÔnolo perieqe 100 kommtia, organwmèno se 5 edh:
lassi
al,
hard ro
k, jazz, pop , kai te
hno . To deÔtero sÔnolo dedomènwn apoteletai apì 354 degmata
dirkeia 30 se to kje èna, proerqìmeno apì dwren degmata pou parèqei to Amazon . To
deÔtero sÔnolo organ¸netai se 6 mousik edh: lassi al, ountry, jazz, rap, ro k te hno , kai .
[19℄. Ta pr¸ta 2 eppeda apeikonzontai sto Sq ma 3.2. To edo Chamber Musi qwrzetai
4 http://ismir2004.ismir.net/ISMIR Contest.html
5 http://www.musi
-ir.org/evaluation/mirex-results/audio-genre/index.html
34
3.2. S
UNOLA DEDOM
ENWN KAI IERARQ
IES
Jazz - Blues 52
Metal - Punk 90
Ro k - Pop 203
World 244
Jive 60
Qui kstep 82
Rumba 98
Samba 86
Tango 86
Viennese Waltz 65
sta: Chamber musi with piano, Solo musi , String quartet, Other hamber ensembles .
dhmiourg jhke ma bsh dedomènwn me 50 mousik kommtia se kje kathgora, ìpou to kje
Musi
To 2007 o Barbedo prìteine ma ierarqik orgnwsh mousik¸n eid¸n me 4 eppeda [5℄.
35
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
To pr¸to eppedo qwrzei th mousik se polÔ genik edh: Classi al, Pop/Ro k, Dan e . To
Ele troni , kai thn Dan e se Vo al, Per ussion . Sto 4o eppedo, katal gei se 29 mousik
edh.
H enìthta anafèretai se qarakthristik pou epitrèpoun thn perigraf tou mousikoÔ edou .
Se orismène peript¸sei mousik , sugkekrimèna sthn orqhstrik dutik mousik , enai du-
ìpw h Musi XML6 tÔpou dedomènwn ìpw ta arqea MIDI kai .sib (tou progrmmato
sÔnjesh Sibelius ). Sthn prxh ìmw , den uprqoun anaparastsei uyhloÔ epipèdou domè
perigraf pou na kalÔptoun ìla ( ta perissìtera) mousik edh, an ton qrìno kai ton
tìpo.
'Ara, opoiad pote efarmog autìmath anagn¸rish mousikoÔ edou apaite thn qr sh
degmata den mporoÔn na qrhsimopoihjoÔn apeujea se efarmogè anagn¸rish , lìgw tou me-
glou ìgkou plhrofora pou perièqoun. Parllhla, h plhrofora pou uprqei sta degmata
th kumatomorf enai qamhloÔ epipèdou kai den mpore na qrhsimopoihje gia shmasiologik
anaparstash enì mousikoÔ edou . San apotèlesma, to pr¸to b ma sthn anagn¸rish mou-
sikoÔ edou enai h exagwg qarakthristik¸n mesaou kai uyhloÔ epipèdou apì ta hqhtik
degmata.
se 3 kathgore [65℄:
Qronik-rujmik qarakthristik
Rujmik qarakthristik
Melwdik-armonik qarakthristik.
6 http://www.musi xml.org/xml.html
36
3.3. QARAKTHRISTIK
A PERIGRAF
HS MOUSIKO
U E
IDOUS
Sthn paroÔsa diatrib ja qrhsimopoihje èna sunduasmì twn dÔo parapnw taxinom -
sewn qarakthristik¸n.
H qroi orzetai sth bibliografa w to qarakthristikì pou knei dÔo qou sthn dia jeme-
li¸dh suqnìthta kai me thn dia èntash na akoÔgontai diaforetiko [78℄. Ta qarakthristik
pou perigrfoun thn qroi exetzoun sun jw thn fasmatik katanom tou s mato , an kai
merik upologzontai sto pedo tou qrìnou. O Peeters dhmioÔrghse ma exantlhtik lsta
Qronik qarakthristik
Upologzontai gia kje qronikì plasio tou s mato
:
Rujmì mhdenism¸n ( Zero-Crossing Rate ): Upologzei ton arijmì for¸n pou mhdenze-
tai to s ma se èna qronikì parjuro. 'Ena enjìrubo s ma èqei meglh tim rujmoÔ
mhdenism¸n. Qrhsimopoi jhke apì tou Tzanetkh [84℄, Meng [62℄, Burred [19℄, Li [57℄,
pou jewretai ìti pargei to hqhtikì s ma, èqonta san esodo ma periodik diègersh,
pou perièqei plhrofora gia thn jemeli¸dh suqnìthta. Qrhsimopoi jhkan apì tou
Qarakthristik enèrgeia
Anafèrontai sto energeiakì perieqìmeno tou s mato
:
Tetragwnik rza th enèrgeia ( Root Mean Square Energy ): Enai mètro th isqÔo
Lìgo qamhl enèrgeia ( Low Energy Rate ): Posostì twn plaiswn enì s mato
pou èqoun tetragwnik rza th enèrgeia mikrìterh apì thn mèsh enèrgeia. Qrhsi-
mopoi jhke apì tou
Tzanetkh [84℄, Meng [62℄, Burred [19℄, Li [57℄, kai Cataltepe
[20℄.
Fasmatik qarakthristik
Perigrfoun to sq ma tou fsmato
isqÔo
enì
qronikoÔ plaisou:
37
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
Kèntro brou tou fsmato ( Spe tral Centroid ): orzetai w to kèntro brou tou
fsmato isqÔo kai enai mètro tou fasmatikoÔ sq mato . Qrhsimopoi jhke apì tou
apoteloÔn h diaspor tou fsmato ( Spe tral Spread ) kai h epipedikìthta tou fsmato
Suqnìthta apìsbesh ( Spe tral Roll-o Frequen y ): Metrei se pìso uyhlè suqnì-
thte (sun jw 85%) sunanttai èna sugkekrimèno posostì th enèrgeia tou s mato .
Qrhsimopoi jhke apì tou
Pa
het [3℄, Tzanetkh [84℄, Li [57℄, Barbedo [5℄, kai Catal-
tepe [20℄.
DiakÔmansh fsmato ( Spe trum Flux ): orzetai w h mèsh diakÔmansh tou fsmato
metaxÔ dÔo geitonik¸n diasthmtwn. Qrhsimopoi jhke apì tou
Tzanetkh [84℄, Burred
[19℄, Pa
het [3℄, Li [57℄, Barbedo [5℄, kai Cataltepe [20℄.
Mel -qasmatiko suntelestè ( Mel Frequen y Cepstral CoeÆ ients ): Upologzontai apì
rontai apì ma trpeza fltrwn. Qrhsimopoi jhkan apì tou
Tzanetkh [84℄, Burred
[19℄, Meng [62℄, Pa
het [3℄, Li [57℄, kai Cataltepe [20℄.
Antilambanìmena qarakthristik
Ta antilambanìmena qarakthristik ( per
eptual features ) upologzontai qrhsimopoi¸nta
Antilambanìmenh èntash ( Per eptual Loudness ): qrhsimopoie thn èntash enì diast -
mato qou metasqhmatismènh me bsh èna montèlo ako . Qrhsimopoi jhke apì ton
O Pa het uposthrzei ìti èna sÔsthma autìmath anagn¸rish mousikoÔ edou den prèpei
[3℄. Akrib orismì tou rujmoÔ den uprqei, all anafèretai sthn ènnoia th qronik
epanlhyh [78℄. Diaisjhtik, enai profanè ìti to rujmikì perieqìmeno enì kommatioÔ
mpore na qrhsimopoihje gia diaqwrismì èntona rujmik¸n kommati¸n (pq. ro k ) kai kommati¸n
qoun pollè diaforetikè ìyei sthn anaz thsh rujmoÔ: exagwg tempo , anaz thsh kroÔsh
38
3.3. QARAKTHRISTIK
A PERIGRAF
HS MOUSIKO
U E
IDOUS
dhmiourgetai apì thn sunrthsh autosusqètish tou s mato [84℄. Parathr¸nta ta brh
sti difore periodikìthte tou s mato , exgetai ma rujmik posìthta, pou suntele ston
diaqwrismì mousik¸n kommati¸n me èntono rujmì se sqèsh me aut pou den èqoun dunat thn
asjhsh tou rujmoÔ. 'Ena parìmoio istìgramma qrhsimopoietai apì ton Burred , apì to opoo
exgei dÔo rujmik qarakthristik: dÔnamh kroÔsh ( beat strength ) kai rujmik kanonikì-
Ta rujmik qarakthristik pou exgontai enai: sqetikì plto twn dÔo koruf¸n tou isto-
grmmato , lìgo tou pltou twn dÔo koruf¸n, perodoi twn dÔo koruf¸n, kai sunolikì
jroisma tou istogrmmato . O Meng , ektì apì to istìgramma kroÔsh , protenei kai th
qr sh fsmato kroÔsh ( beat spe trum ) to opoo qrhsimopoie tou Mel qasmatikoÔ sunte-
lestè gia ton upologismì tou [62℄. O Lidy protenei ta qarakthristik rujmik¸n protÔpwn
rhythm histogram
( ), to opoo perièqei plhrofora rujmoÔ an suqnotik mpnta. Tèlo
, o
H armona mpore na oriste w h qr sh tou mousikoÔ tìnou maz me ti armonikè (sugqo-
de sthn perptwsh th dutik mousik ). Antijètw , melwda orzoume w ma akolouja
qrhsimopoietai ai¸ne apì mousikolìgou gia th melèth mousik¸n dom¸n kai h qr sh th sto
Gomez knei ma anaskìphsh th perigraf kai exagwg melwda apì hqhtikì s ma sto
[34℄.
na ston mousikì tìno gia anagn¸rish mousikoÔ edou [84℄. To dinusma qarakthristik¸n
39
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
mousikoÔ tìnou ( pit h ontent feature set ) pou protenei baszetai se pollaplè teqnikè
anqneush mousikoÔ tìnou. Sthn teqnik pou protenetai, to s ma aposuntjetai se dÔo
suqnotikè mpnte , pnw kai ktw twn 1000 Hz . Upologzontai sthn sunèqeia oi peribl-
tou s mato . Sth sunèqeia dhmiourgetai èna istìgramma mousikoÔ tìnou ( pit h histogram )
goÔntai dÔo istogrmmata, to diplwmèno ( folded ), pou èqei uposte thn metatrop se MIDI ,
kai to aplwmèno ( unfolded ), pou den èqei uposte thn metatrop . Ta telik qarakthristik
UP0 : Perodo th mègisth koruf sto unfolded istìgramma, pou antistoiqe sto
IPO1 : Disthma mousik¸n tìnwn anmesa sti
dÔo uyhlìtere
korufè
tou folded
istogrmmato
, deqonta
to kÔrio tonikì disthma pou qrhsimopoietai sto kommti
tìnou [20℄.
sti mejìdou qwr epbleyh (ìpou den dnetai ierarqa, all h ierarqa prokÔptei omadopoi¸-
nta ta mousik kommtia) kai sti mejìdou me epbleyh (ìpou dnetai h ierarqa mousik¸n
eid¸n, to sÔsthma ekpaideÔetai kai sth sunèqeia elègqetai h apìdosh tou sust mato me ta
dedomèna elègqou).
40
3.4. ALG
ORIJMOI ANAGN
WRISHS
aut¸n twn mejìdwn enai ìti den uprqei o periorismì ma dedomènh ierarqa , pou mpore na
perièqei asfeie kai epikalÔyei . Epsh , kpoia kommtia polÔ apl mpore na mhn an koun
Kje mousikì kommti anaparisttai apì èna sÔnolo qarakthristik¸n, ìpw parousi-
sthke sthn enìthta 3.3. Epilègetai katìpin èna mètro omoiìthta gia na sugkrnei ta kom-
mtia metaxÔ tou . Oi algìrijmoi omadopohsh qrhsimopoioÔn ta mètra omoiìthta gia thn
To pio aplì mètro omoiìthta pou qrhsimopoietai gia na metr sei thn apìstash anmesa
sunhmitìnou ( osine similarity measure ). 'Omw , aut ta mètra omoiìthta prèpei na efar-
th katanom twn qarakthristik¸n kai èpeita qrhsimopoietai kpoio mètro omoiìthta gia
Kullba
k-Leibler
h apìklish qrhsimopoietai gia na upologsei thn apìstash anmesa se
katanomè all den sust netai gia GMMs . 'Allo mètro pou qrhsimopoietai gia sÔgkrish
katanom¸n enai h apìstash Earth Movers , pou qrhsimopoi jhke apì ton Pampalk [67℄.
Epsh
, o Shao qrhsimopohse krummèna montèla Markov hidden Markov models - HMMs
( )
stato q¸ro, ètsi ¸ste ìmoia dianÔsmata qarakthristik¸n apeikonzontai kont. Ta SOMs
enai teqnht neurwnik dktua qwr
epbleyh pou dhmiourgoÔn antistoiqe
metaxÔ dedomè-
nwn poll¸n diastsewn kai se q¸rou qamhl¸n diastsewn. Ta GHSOM apoteloÔn eidik
perptwsh twn SOM , ta opoa qrhsimopoioÔn ma ierrqik dom poll¸n epipèdwn, ìpou se
kje eppedo antistoiqe èna SOM . O Rauber qrhsimopohse GHSOMs gia na anaparast sei
41
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
Ma efarmog omadopohsh
mousik¸n kommati¸n me endiafèron enai o Qrth
tou Mo-
zart 7
, pou dhmiourg jhke apì to Teqnikì Panepist mio Biènnh
. Ston qrth omadopoioÔntai
ìla ta èrga tou W. A. Mozart se nhsi mousik¸n eid¸n, ìpw sumfwne , sonte , kon-
sèrta, tragoÔdia, k.a. Sto Sq ma 3.3 apeikonzetai o Qrth , me diakrit ta nhsi, me ti
To kÔrio meionèkthma twn mejìdwn qwr epbleyh enai ìti oi prokÔptouse omde den
onomatzontai. Genik ìmw , oi omde den apeikonzoun aparathta kpoio mousikì edo ,
all mousik kommtia me kpoia ìmoia qarakthristik. Uposthrzetai ìti h ènnoia tou
mousikoÔ edou mpore na qaje se bjo qrìnou kai na prokÔyei ma orgnwsh twn mousik¸n
Oi mèjodoi anagn¸rish mousikoÔ edou me epbleyh èqoun melethje perissìtero sthn bi-
bliografa. Se autè ti teqnikè uprqei ma bsh mousik¸n kommati¸n, h opoa prèpei
Sto pr¸to b ma, to sÔsthma ekpaideÔetai me dedomèna gia ta opoa enai gnwstì to mousikì
edo . Sto deÔtero b ma, taxinomoÔntai sto sÔsthma arqea gia ta opoa den enai gnwstì to
7 http://www.ifs.tuwien.a .at/mir/mozart/
42
3.4. ALG
ORIJMOI ANAGN
WRISHS
kajorzei thn apìfash gia to dedomèno elègqou. Dosmènou enì dianÔsmato qarakth-
gia thn ektmhsh twn paramètrwn twn Gkaousian¸n. Ta GMMs sto pedo th anagn¸-
rish mousikoÔ edou qrhsimopoioÔntai sun jw gia thn dhmiourga montèlwn qroi .
pijanofneia , gia thn eÔresh tou montèlou pou perissìtero tairizei sto dosmèno kom-
eid¸n [84℄. O Burred qrhsimopohse ma dendroeid dom apì GMMs gia th monte-
kurw sthn anagn¸rish omila , lìgw th ikanìtht tou sto qeirismì qronoseir¸n.
Qrhsimopoi jhkan apì ton S aringella [77℄ gia anagn¸rish se 7 mousik edh kai apì
idèa enai h eÔresh enì grammikoÔ metasqhmatismoÔ pou diaqwrzei ìso to dunatìn
tanom [88℄.
43
KEF
ALAIO 3. ANAGN
WRISH MOUSIKO
U E
IDOUS
Teqnht neurwnik dktua (arti ial neural networks - ANN ): qrhsimopoioÔn san
domik stoiqea teqnhtoÔ neur¸ne gia thn eplush problhmtwn. Ta pio diadedomèna
ma parallag twn MLPs gia anagn¸rish mousikoÔ edou , ta opoa periègraye w
rht qronik montelopohsh me neurwnik dktua ( expli
it time modeling with neural
networks - ETM-NN ) [82℄. Sto ETM-NN , kje krummèno
neur¸na
mpore na jewrhje
eid¸n pou qrhsimopoioÔntai sth bibliografa. Gia to sÔnolo dedomènwn GTZAN , 61.0% enai
h mègisth akrbeia tou Tzanetkh [84℄. O Lidy me SVMs anèfere 74.9% akrbeia [58℄, en¸
o Li 78.5%. Gia to sÔnolo ISMIRrhythm , h akrbeia sto [58℄ tan 84.2%, sto [36℄ 90.1%,
kai sto [25℄ 96.0%. Gia to sÔnolo dedomènwn ISMIRgenre , h akrbeia tou Lidy enai 70.4%,
tou Tzanetkh 71.3%, tou West 78.3%, kai tou Pampalk 82.3% [58℄. Gia to sÔnolo ISMIR
2005 , thn kalÔterh akrbeia eqan oi Bergstra, Casagrande, kai E
k, me 82.34% kai gia ta 2
sÔnola dedomènwn.
Genik, to pedo th anagn¸rish mousikoÔ edou enai sqetik nèo, kaj¸ oi pr¸te a-
xiìloge ergase pragmatopoi jhkan apì to 1998 kai èpeita. Parìla aut, gnwrzei meglh
njhsh kai mpore na jewrhje ìti enai to pio energì pedo ston q¸ro th epexergasa qou
( audio pro essing ), qwr na sumperilhfje h epexergasa omila . Ta sÔnola dedomènwn pou
uprqoun sthn bibliografa apoteloÔn sthn pleioyhfa tou arket realistikè peript¸sei ,
apì etaire orgnwsh /p¸lhsh mousikoÔ ulikoÔ, enai akìma anoiktì. Ta qarakthristik
pou èqoun protaje plèon kalÔptoun ti angke gia autìmata sust mata anagn¸rish , ka-
j¸ plèon h akrbeia twn susthmtwn enai megalÔterh se sqèsh me thn akrbeia anagn¸rish
44
Keflaio 4
Paragontopohsh mh arnhtik¸n
tanust¸n
4.1 Eisagwg
Kentrikì stìqo twn pedwn th mhqanik mjhsh kai anagn¸rish protÔpwn enai h eÔre-
sh katllhlh anaparstash twn dedomènwn. Shmantikì rìlo èqoun epsh kai oi teqnikè
twn twn dedomènwn, na knei pio eÔkolh thn ermhnea tou , kai na gnei kalÔtera anlush
twn paragìntwn pou sunjètoun ta dedomèna kajeaut. Uprqoun dÔo stìqoi se ma tètoia
Sto parìn keflaio, protenetai ma nèa polugrammik mèjodo anlush upoq¸rwn, h
koin¸ pnake . Sthn enìthta 4.4 anafèrontai oi teqnikè paragontopohsh mh arnhtik¸n ta-
4.5. Sthn enìthta 4.6 parousizetai o algìrijmo gia sugkekrimène apoklsei Bregman .
Sthn enìthta 4.7 parousizontai algìrijmoi NMF gia tanustè 3 diastsewn. O algìrijmo
45
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
taxinìmhsh NTF me epbleyh dnetai sthn enìthta 4.8. Tèlo , kpoia genik sumpersmata
Sth bibliografa èqoun protaje pollè teqnikè anlush upoq¸rwn. H pio diadedomènh enai
h anlush prwteuous¸n sunistws¸n ( prin ipal omponent analysis - PCA ), h opoa meta-
sqhmatzei ta dedomèna se èna nèo sÔsthma suntetagmènwn, ètsi ¸ste na megistopoietai h me-
lo mexh parìmoio me thn ICA , ètsi ¸ste oi parathroÔmene metablhtè twn dedomènwn na
pargonte . H lanjnousa shmasiologik anlush ( latent semanti analysis - LSA ) [24℄ kai
diastsewn se q¸ro qamhl¸n diastsewn kai qrhsimopoioÔntai kurw sta peda ankthsh
nkwn ( non-negative matrix fa torization - NMF ), h opoa ja analuje sto parìn keflaio,
h opoa brskei thn paragontopohsh enì pnaka, me ton periorismì ìti perièqei mh arnhtikè
timè [55, 56, 45℄. Na shmeiwje ìti h PLSA lÔnei to prìblhma th NMF qrhsimopoi¸nta
polumetablht dedomèna qrhsimopoioÔsan pnake kai teqnikè grammik lgebra gia thn
mopoioÔn tanustè kai teqnikè polugrammik lgebra gia thn pragmatopohsh twn upolo-
gism¸n. H angkh gia thn dhmiourga twn poludistatwn teqnik¸n pro lje apì ta peda th
yuqologa kai qhmea , ìpou èginan oi pr¸te apìpeire anlush kai ermhnea trisdista-
twn dedomènwn. To 1964 o Tu ker prìteine ma parallag th PCA gia anlush pragmatik¸n
46
4.2. TEQNIK
ES AN
ALUSHS UPOQ
WRWN
4.2.3 PARAFAC
de
omposition PARAFAC
) [17℄. H protjhke gia ta peda th
yuqometra
( psy
hometri-
s
hemometri
s
) kai th
qhmeiometra
( ). Basikì
skopì
th
mejìdou PARAFAC enai h
dianusmtwn:
k
X
X= aj
bj
j (4.1)
j =1
H aposÔnjesh pragmatopoietai me algìrijmo enalassìmenwn elaqstwn tetrag¸nwn ( alter-
nating least squares - ALS ). Sthn PARAFAC lÔnetai to akìloujo prìblhma beltistopohsh
:
X k
X
2
min
a;b;
Xi i i
1 2 3
aj i
( 1)
bj i
( 2) j (i3 ) (4.2)
i1 i2 i3 j =1
Basikì pleonèkthma th
PARAFAC enai ìti brskei monadik lÔsh me katllhlh epilog
tou k. Sto Sq ma 4.1 apeikonzetai h leitourga th mejìdou PARAFAC (sto sugkekrimèno
O Kroonenberg èjese pr¸to ta jemèlia gia ènan algìrijmo pou na apotele polugrammik
protenei enai basismèno sto montèlo tou Tu ker kai enai to akìloujo:
K X
X M
L X
Gi i i =
1 2 3
gi k hi l ei m
klm
1 2 3 (4.3)
47
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
ìpou G 2 R I I I
1 2 3
kai C 2 R K LM . To montèlo mpore enallaktik na grafe w
:
G =C G H E 1 2 3 (4.4)
To 1986, o Kapteyn knei ma epèktash tou montèlou 3-mode omponents analysis pou
pr¸to eqe protenei o Tu ker , se N diastsei [47℄. Gia thn eplush tou probl mato
( singular value de omposition - SVD ) [51℄ [53℄. Onomzei to montèlo w SVD uyhl txh
G = S U U N U N
1
(1)
2
(2) ( )
(4.5)
orjog¸nioi. O kentrikì tanust S enai ìlo-orjog¸nio (bl. Enìthta 2.3.7) kai uprqei
O Lathauwer apodeiknÔei ìti gia thn eÔresh tou HOSVD apaitetai upologismì SVD sta
S=G U 1
(1)
T
U 2
(2)
T
N U N T :( )
(4.7)
ApodeiknÔetai ìti h lÔsh pou dnei h HOSVD enai monadik . Epsh
, gia N = 2 o algìrijmo
enai dio
me ton SVD . H HOSVD mpore na qrhsimopoihje akìma gia ton upologismì tou
n-bajmoÔ tou G. Tèlo
, h efarmog th
HOSVD se ènan summetrikì an dÔo tanust G2
R I I I isoduname me thn anlush idiotim¸n ston G . Koin¸
, protenetai ma genikeumènh
je apì thn PARAFAC kai qrhsimopoi jhke sthn anlush FMRI . Jewre èna montèlo ìmoio
me th PARAFAC sthn (4.1), sun ènan ìro GkaousianoÔ jorÔbou. Sto montèlo PICA , o ìro
48
4.2. TEQNIK
ES AN
ALUSHS UPOQ
WRWN
A B = [a
b j jaN
bN ℄
1 1 (4.8)
Praktik ìmw to montèlo Tensor PICA den axiopoie thn plhrofora twn 3 diastsewn,
O Terzìpoulo protenei ma llh epèktash th ICA , me ttlo Multilinear ICA [86℄.
Sto montèlo th polugrammik ICA pou protenoun, pragmatopoietai ma aposÔnjesh tou
tanust dedomènwn:
G = S U U N U N
1
(1)
2
(2) ( )
(4.9)
st¸se gia thn i-ost distash. Sthn ousa ìmw , to montèlo pou protenetai den èkfrzei
anptugma tou tanust G gia kje distash, ìpou pragmatopoie ICA gia thn eÔresh twn
anexrthtwn sunistws¸n. 'Ara pli den axiopoietai h poludistath dom tou tanust , all
O Yoo protenei ma epèktash th ICA , me ttlo Multiway ICA [89℄. H teqnik baszetai
pnw sthn Multiway PCA h opoa praktik enai efarmog th PCA pnw sta anaptÔgmata
tou tanust dedomènwn, gia ìle
ti
diastsei
. Sthn Multiway ICA , pragmatopoietai ICA
sta anaptÔgmata G (2) kai G(3) tou tanust G (jewretai ìti h 1h distash enai h distash
twn dedomènwn). AkoloÔjw , pargontai oi dÔo pnake mexh gia ti lle 2 diastsei
twn dedomènwn. H mèjodo protjhke gia anlush kai epexergasa lummtwn kai èqei ta dia
meionekt mata pou èqei kai h mèjodo tou Terzìpoulou [86℄. Sto Sq ma 4.2 fanontai ta
anaptÔgmata tou arqikoÔ tanust pou qrhsimopoie o Yoo gia ton upologismì th
Multiway
ICA .
katìpin efarmog ICA sti asusqètiste metablhtè pou pargei h polugrammik PCA , ètsi
anptugma tanust .
49
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
4.3.1 Montèlo
V WH (4.10)
ton arqikì pnaka V, pou èqei w apotèlesma mia sumpiesmènh ekdoq tou arqikoÔ pnaka
dedomènwn.
O NMF baszetai sthn sumpesh topik¸n kai ìqi olistik¸n qarakthristik¸n enì s mato ,
se sÔgkrish me lle mejìdou aposÔnjesh dedomènwn, ìpw h PCA kai h ICA . Epsh
twn pinkwn. Autì ermhneÔetai ìti h plhrofora apojhkeÔetai san grammikì sunduasmì
basik¸n stoiqewn plhrofora (sunart sei bsh ), kai ìti prosjetiko sunduasmo poll¸n
50
4.3. PARAGONTOPO
IHSH MH ARNHTIK
WN PIN
AKWN
sunart sewn bsh anaparistoÔn thn plhrofora. Koin¸ o NMF baszetai sth jewra ìti
h antlhyh tou ìlou baszetai sthn antlhyh twn mer¸n tou, kti pou èrqetai se antjesh
me thn olistik je¸rhsh th PCA , ìpou h plhrofora anaparstatai san sunduasmì ìlwn
H (4.10) mpore na xanagrafte an st le
, ètsi ¸ste vi Whi , i = 1; 2:::m, ìpou vi
kai hi enai oi antstoiqe
st le
twn V kai H. Me lla lìgia, kje dinusma plhrofora
en¸ to stoiqeo hjk deqnei ton bajmì me ton opoo to dinusma plhrofora vi an kei sthn
klsh j .
antikatastsei ( update rules ) stou pnake W kai H. Gia na breje mia proseggistik
paragontopohsh tou pnaka V prèpei na oristoÔn sunart sei kìstou pou orzoun thn
X
jjA Bjj = 2
(aij bij ) 2
(4.11)
ij
h opoa èqei san ktw ìrio to mhdèn kai emfan¸ mhdenzetai ìtan A = B. 'Allo èna qr simo
X a
DKL (AjjB) = aij log ij aij + bij (4.12)
ij
bij
'Opw
kai h Eukledeia apìstash, èqei san ktw ìrio to mhdèn, kai mhdenzetai ìtan A = B.
Den enai jewretai apìstash, giat den enai summetrik , kai anafèretai w
apìklish tou A
apì to B. 'Ara jewroÔme dÔo diaforetikè ulopoi sei th mejìdou NMF san probl mata
beltistopohsh : H pr¸th enai h elaqistopohsh tou jjV WHjj se sqèsh me ta W kai H,2
kola ulopohsh kai taqÔthta gia thn lÔsh twn parapnw problhmtwn [55℄:
51
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
ptikoÔ kanìne :
hj hj
(WT V)j w w
(VHT )i
i i
(WT WH)j (WHHT )i
(4.13)
epanalhptikoÔ kanìne :
P wi vij P hj vij
hj hj P
i WH ij
( )
wi wi P
WH ij
j ( )
(4.14)
k wk j hj
ìpou = 1; 2:::r; i = 1; 2:::n; j = 1; 2:::m. 'Oson afor thn efarmog tou basikoÔ NMF, oi
pnake
W kai H mporoÔn na arqikopoihjoÔn me tuqae
mh arnhtikè
timè
, kai apodeiknÔetai
'Ena apì ta meionekt mata tou BasikoÔ NMF enai ìti oi bsei tou enai qwrik olikè , en¸
ja tan protimhtèo na tan topikè . Gia autìn ton lìgo, o Li prìteine ma llh mèjodo upo-
st¸se
, ètsi ¸ste na elaqistopoihje o arijmì
twn sunart sewn bsh
pou qreizo-
P
ntai gia na anaparast soun ton pnaka V. Koin¸
prèpei
i uii = min, ìpou uij enai
P
3. Mìno ta stoiqea pou dnoun thn pio shmantik plhrofora prèpei na krathjoÔn:
i bii =
max , ìpou bij enai to ij -ostì stoiqeo tou pnaka B.
O sunduasmì
twn parapnw periorism¸n dnei thn paraktw apìklish pou qrhsimopoietai
X v X X
DLNMF (VjjWH) = vij log ij vij + yij + uij bii ; yij = (WH)ij (4.15)
i;j
yij i;j i
(4.15). Ma topik lÔsh sthn parapnw elaqistopohsh brsketai qrhsimopoi¸nta tou
52
4.3. PARAGONTOPO
IHSH MH ARNHTIK
WN PIN
AKWN
i wi hj
wi
P
j vij P hwjihj w
wi = P ; wi = P i (4.17)
j hj i wi
O periorismì
orjogwnikìthta
moizei me ton periorismì pou epibllei h mèjodo
PCA sti
sunart sei bsh . Parìla ta topik qarakthristik tou ìmw , o LNMF den kwdikopoie
Oi Hu Zhang
kai parousasan mia llh parallag tou NMF me ìnoma Araiì
NMF sparse
(
kanìne analoga (ratio rules ). Basikì skeptikì psw apì ton Araiì NMF enai ìti en¸
h NMF mèjodo enai epituq sthn paragontopohsh pinkwn, den jètei ìmw periorismoÔ
puknìthta twn dedomènwn stou pnake . San apotèlesma, den enai se jèsh na pragmatopoi-
sei paragontopohsh se ènan pnaka V pou èqei topik arai qarakthristik sta dedomèna
tou.
i;j
yij j
ìpou hj = (h j ; h j ; :::hrj )T , j = 1; 2; :; m,
1 2 enai h antstoiqh st lh tou pnaka H, kai to
enai jetik stajer apoktoÔmenh apì empeira. W
khj kl orzetai h l-nìrma tou dianÔsmato
st lh
tou pnaka H. To prìblhma th
paragontopohsh
me SNMF orzetai w
:
Mia lÔsh gia thn parapnw elaqistopohsh me periorismoÔ dnetai apì tou paraktw ka-
nìne epanlhyh :
P
P wi
i vij wi hj
P hj
j vij wi hj
P
hj = hj P (
wi = wi )
P
i wi +
(4.20)
j hj
Gia na gnei h lÔsh monadik , apaitetai h l -nìrma tou dianÔsmato
st lh
ston pnaka W na
w X
wi = P i hj = hj wi (4.21)
i wi i
ApodeiknÔetai ìti h (4.18) den auxnei ktw apì tou
kanìne
epanlhyh
, kai h sÔgklish
53
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
H sun jh
mèjodo
gia taxinìmhsh dedomènwn stou
upoq¸rou
pou dhmiourgoÔn oi NMF
mèjodoi perigrfetai sto [7℄. V dhmiourgetai qrhsimopoi¸nta
dedomèna apì to
O pnaka
sÔnolo ekpadeush (training set). Kje st lh tou V, vj perièqei èna dinusma qarakthri-
stik¸n. H diadikasa ekpadeush ekteletai efarmìzonta ènan algìrijmo NMF ston pnaka
dedomènwn, pargonta
ètsi ton pnaka bsh
W kai ton pnaka kwdikopohsh
H.
Sthn fsh elègqou ( testing phase ), gia kje dedomèno elègqou to opoo anaparstatai me
similarity measure - CSM). To dedomèno elègqou katatssetai sthn klsh l0, ìpou:
hTtest hl
l0 = arg max
l=1;2;:::;N khtestkkhl k : (4.23)
Koin¸ , megistopoietai to sunhmtono th gwna metaxÔ twn htest kai hl . 'Ena enallaktikì
mètro mpore epsh
na qrhsimopoihje, ìpou h klsh sthn opoa an kei to vtest kajorzetai
exetzonta
kje stoiqeo tou htest :
l0 = arg max
i
hi;test (4.24)
sthn Enìthta 4.3.5 enai ìti sthn diadikasa ekpadeush den sumperilambnetai plhrofora
sqetik me diaqwrismì klsewn sta dedomèna. Epiprìsjeta, oi arqikè tuqae timè twn
pinkwn W kai H mpore na ephresoun thn sÔgklish tou algorjmou, kaj¸ h tim th
antikeimenik sunrthsh pou qrhsimopoietai apì ton ekstote NMF algìrijmo mpore na
Sto [8℄ protenetai h dhmiourga enì taxinomht me epbleyh, ìpou h diadikasa ekpa-
deush me NMF algorjmou ekteletai gia kje klsh dedomènwn xeqwrist. AkoloÔjw ,
54
4.3. PARAGONTOPO
IHSH MH ARNHTIK
WN PIN
AKWN
ìpou N enai o arijmì
twn klsewn, Vi o pnaka
dedomènwn gia thn klsh i, kai oi pnake
Wi ,Hi oi paragìmenoi mèsw NMF pnake
gia thn klsh i. O arijmì
twn sunistws¸n pou
qrhsimopoietai gia thn ekpadeush twn dedomènwn kje klsh
dnetai apì:
ni mi
ri =
ni + mi
(4.26)
ìpou ni kai mi enai oi diastsei tou pnaka Vi . 'Omw , h bsh pou orzetai apì ti st le
tou pnaka Wi den enai orjokanonik . Gia na metatrape h bsh se orjokanonik , o protei-
Wi = Qi Ri ; i = 1; 2; ; N (4.27)
ìpou o pnaka
Qi èqei diastsei
n r kai enai orjog¸nio
. O pnaka
Ri èqei diastsei
r r kai enai nw trigwnikì
. Sunep¸
, o pnaka
orjokanonik
bsh
gia kje klsh enai
o Qi kai o nèo
pnaka
kwdikopohsh
enai o:
H0i = Ri Hi ; i = 1; 2; ; N: (4.28)
Kat th dirkeia th fsh elègqou, kje dedomèno elègqou anaparstatai apì to dinu-
sma qarakthristik¸n vtest . Sth sunèqeia, to vtest problletai stou
orjokanonikoÔ
pnake
bsh
Qi gia kje klsh, kai prokÔptei to dinusma kwdikopohsh
-elègqou gia kje klsh:
0( )
i
htest = Qyi vtest; (4.29)
ìpou Qyi enai o genikeumèno
antstrofo
tou Qi . Gia kje klsh, gnetai sÔgkrish tou
0 (i)
dianuÔsmato
htest me ti
st le
tou pnaka H0i qrhsimopoi¸nta
to mètro omoiìthta
sunh-
mitìnou.
To dinusma pou megistopoie to CSM gia ton pnaka H0i qrhsimopoietai w
mètro omoiì-
thta
metaxÔ tou vtest kai th
klsh
i:
0( )
iT i 0( )
htest hj
CSMi = j max
khtest kkh0j i k
0i (4.30)
; ;:::;r
=1 2 i ( ) ( )
0( )
ìpou to hj i j -ost st lh tou pnaka H0i .
anaparist thn Tèlo
, to mègisto CSMi orzei se
poia klsh an kei to dinusma elègqou vtest :
l0 = arg i max
; ;:::;N
=1 2
fCSMig: (4.31)
55
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
0 (1)
Q 1
ht H0 1
CSM 1
0 (2)
vt Q 2
ht H0 2
CSM 2
arg
max
. . .
. . .
. . .
0(
QN ht N )
H0N CSMN
0( )
ht i = Qyi vt
Sq ma 4.3: Diadikasa elègqou qrhsimopoi¸nta
taxinomht NMF me epbleyh (ta h0t kai vt
sumbolzoun ta h0test kai vtest , antstoiqa).
O Donoho sto [26℄ jètei dÔo erwt mata pou sqetzontai me thn NMF mèjodo:
Ktw apì poie sunj ke enai h paragontopohsh mh arnhtik¸n pinkwn kal¸ orismè-
ApodeiknÔei ìti h paragontopohsh enai monadik gia dedomèna pou plhroÔn sugkekrimènou
ìti prèpei na dhmiourgetai ma bsh dedomènwn ìpou ìla ta epijumht mèrh gia anqneush
eikìnwn me basik gewmetrik sq mata pou na plhro ton periorismì (onomzetai Separable
Fa
torial Arti
ulation Families ), o Donoho deqnei ìti uprqei monadik lÔsh. Se efarmogè
pragmatikoÔ kìsmou ìmw , den enai dunatìn na kataskeuaste ma tètoia bsh dedomènwn pou
na eggutai monadik lÔsh, eidik sthn perptwsh pou h NMF pragmatopoietai diereunhtik,
O Allbright protenei dÔo nèou algorjmou giaNMF qrhsimopoi¸nta thn teqnik twn
enallassìmenwn elaqstwn tetrag¸nwn ( alternative least squares - ALS ) [1℄. O pr¸to algì-
56
4.3. PARAGONTOPO
IHSH MH ARNHTIK
WN PIN
AKWN
kaj¸ o tÔpo gia thn araiìthta protjhke apì ton Hoyer [44℄. ApodeiknÔetai ìti oi algì-
shmea sèlla ( saddle points ). 'Omw , stou dÔo parapnw algorjmou , h mh arnhtikìthta
epiblletai ( ad-ho non-negativity enfor ement ), jètonta ti arnhtikè timè twn pinkwn
W kai H se me mhdèn. Autì kajist tou algorjmou mh elkustikoÔ apì jewrhtik
skopi.
O Allbright epsh
jètei to er¸thma sqetik me thn arqikopohsh tim¸n stou
pnake
W
kai H [1℄. Enai gnwstì ìti h sÔgklish twn algorjmwn NMF exarttai apì ti
arqikè
timè
twn pinkwn. Se kje perptwsh, ma orj arqikopohsh mpore na aux sei thn taqÔthta
sÔgklish twn algorjmwn, kaj¸ kai thn akrbei tou . H tupik arqikopohsh tim¸n gia
tou algorjmou NMF enai h arqikopohsh twn pinkwn W kai H me tuqae timè sto
disthma [0; 1℄ . O Allbright uposthrzei ìti h tuqaa arqikopohsh pinkwn den odhge se
krit rio ljou anmesa ston pnaka V kai to ginìmeno WH), eidik stou algorjmou
paratjentai ston Pnaka 4.1. H arqikopohsh Kèntrou SVD apaite efarmog SVD ston
pnaka V. H Tuqaa A
ol arqikopoie kje st lh tou W me bsh ton mèso ìro p tuqawn
sthl¸n tou V. H mèjodo
Tuqaa
C qrhsimopoie thn aposÔnjesh CUR [29℄. Tèlo
, h
mèjodo Pnaka Sumpt¸sewn ( o-o uren e matrix ) qrhsimopoie ton pnaka C = VVT . Na
shmeiwje ìti oi algìrijmoi pou qrhsimopoioÔn ALS teqnikè apaitoÔn arqikopohsh mìno tou
pnaka W.
57
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
bliografa [83℄. H enopohsh aut gnetai upì to prsma ìti ìloi oi algìrijmoi prospajoÔn
na lÔsoun èna prìblhma beltistopohsh , sto opoo h sunrthsh kìstou enai ma apìklish
Bregman (bl. Enìthta 2.5). Prgmati, oi algìrijmoi pou prìteine o Lee [55℄ baszontai sthn
nìrma Frobenius qrhsimopoietai epsh apì ton Hoyer [44℄, maz me ènan epiplèon ìro pou
ekfrzei thn araiìthta tou pnaka H Sra. O onomzei thn oikogèneia algorjmwn pou prote-
jèlonta pio orj na dhl¸sei ìti oi algìrijmoi brskoun prosèggish paragontopohsh , kai
ìqi thn paragontopohsh kajeaut ). Ta probl mata pou tjentai enai dÔo: (V2 Rm
+
n ,
W 2 R mr kai H 2 R rn ):
(1) Wmin
;H>
D (WH; V) + (W) + (H)
0
(4.34)
(2) Wmin
;H>
D (V; WH) + (W) + (H)
0
(4.35)
piprìsjetou
periorismoÔ
stou
pnake
W kai H. To pr¸to prìblhma pou jètei o Lee
[55℄ gia pardeigma, qrhsimopoie thn apìklish Bregman (x) = x me (W) = 0 1
2
2
, me kai
Gia thn eplush tou probl mato 1 o Sra qrhsimopoie pollaplasiastikoÔ kanìne , qrh-
) > F (
); 8
~.
(2) G(
; ~
t = arg min
+1
G(
;
)
t (4.36)
Gia thn eÔresh epanalhptikoÔ kanìna gia thn eplush tou probl mato
1 w
pro
to h,
w
F (h) jewretai h:
X X X
F (h) = ( wij hj ) (vi ) + (vi )((Wh)i vi ) ; (4.37)
i j i
58
4.4. PROTEIN
OMENES M
EJODOI
ìpou () = 0() . Me bsh ton Orismì 4.3.1, dhmiourgetai h antstoiqh bohjhtik sunrthsh:
w h
G(h; h~ ) =
X X
ij ij j (vi ) + (vi )((Wh)i vi ) ; (4.38)
ij
ij i
ij = (wij h~ j )=( l wil h~ l ). Gia thn eÔresh epanalhptikoÔ kanìna gia to h, prèpei na
P
ìpou
hp h~ p 1
[WT (v)℄p :
[WT (Wh~ )℄p
(4.39)
wp w~p 1
[ (vT )HT ℄p :
[ (w~ T H)HT ℄p
(4.40)
O Sra sth sunèqeia kalÔptei kai peript¸sei me mh mhdenikè sunart sei poin , kaj¸
epsh peript¸sei me apoklsei ektì twn Bregman , ìpw oi apoklsei Csiszar kai Young .
Apì to 2001 kai èpeita, èginan oi pr¸te apìpeire epèktash twn algorjmwn gia efarmo-
1
g se tanustè
tri¸n parapnw diastsewn . Oi perissìtere
protsei
anafèrontai se
O Welling to 2001 prìteine ma genkeush tou algorjmou jetik paragontopohsh pin-
kwn gia tanustè N -ost txh [87℄. O algìrijmo onomsthke paragontopohsh jetik¸n
tanust¸n ( positive tensor fa torization ). Sto montèlo pou protenetai, jewretai èna tanu-
I1 I2 IN
st
V2R +
. To montèlo ekfrzei ta stoiqea tou V w
:
K
X
Vi i :::iN =
1 2
uj i uj i
1( 1) 2( 2)
ujN iN ;
( )
(4.41)
j =1
ìpou uj i
1( 1)
enai to i 1 -ostì stoiqeo tou dianÔsmato
uj .
1
O basikì
skopì
tou algorjmou
2 IN
I1 IX K
X
2
RE = Vi i :::iN
1 2
uj
1(
j
i1 ) u2(i2 ) ujN (iN ) : (4.42)
i1 i2 :::iN j =1
1 Na shmeiwje ìti ta onìmata twn metablht¸n kai twn telest¸n pou qrhsimopoioÔntai sti
proteinìmene
mejìdou den enai ta dia me aut pou qrhsimopoioÔntai sthn diatrib . Se ma apìpeira enopohsh th
59
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
stikì kanìna :
Ui ! Ui
Vi i :::ii ii :::iN M
1 2 1 +1
Ui MT M
(4.43)
PK j j j
ìpou M= j =1 u1(i1 ) u2(i2 ) : : : ui 1( ii 1)
uji +1(
j
ii+1 ) : : : uN (iN ) . Protenetai epsh
kai ma sunr-
thsh kìstou pou apotele genkeush th sunrthsh pou protjhke apì ton Lee [55℄ (pa-
mènou pnake tou zwgrfou Mondriaan , pou apoteloÔntai apì orjog¸nia se diaforetikè
Sq ma 4.4: Pnake tou Mondariaan pou qrhsimopoi jhkan gia peirmata jetik parago-
To 2005 o Lim protenei ma polugrammik parallag tou NMF , h opoa ousiastik
retai sto prìblhma th
eÔresh
elqistou k gia to opoo ma tètoia aposÔnjesh enai efikt .
SÔmfwna me ton Lim , h idanik paragontopohsh mh arnhtikoÔ tanust dnetai apì to K
ìpou:
K
X
arg min
K
jjV u
u
j
1
j
2
jj
ujN F ; uji 2 R Ii
+
: (4.44)
j =1
'Omw
, den enai pnta efikt h eÔresh tou idanikoÔ K, ra anazhtetai h mh idanik lÔsh
sthn opoa, dojènto K, proseggzei ènan tanust w jroisma exwterik¸n ginomènwn dia-
Oi Shashua kai Hazan prìteinan to 2005 ma genkeush tou algorjmou NMF gia tanu-
60
4.4. PROTEIN
OMENES M
EJODOI
2
aposÔnjesh
tanust se jroisma exwterikoÔ ginomènou dianusmtwn , parapèmponta
se
montèla orjog¸nia aposÔnjesh tanust¸n [49℄ se montèla eÔresh bajmoÔ tanust (bl.
K
X
V= uj
uj
ujN
1 2
(4.45)
j =1
K
min 1 jjV X
uj
uj
ujN jjF ; 2
uji > 0; i = 1; : : : ; N; j = 1; : : : ; K
ui 2
(4.46)
j 1 2
j =1
ìpou jjAjjF 2
enai h nìrma Frobenius 2ou bajmoÔ. H eÔresh tou pollaplasiastikoÔ epanalh-
j =1 ui(l) m= 6 i (um um )
ParathroÔme ìti to prìblhma pou epiqeire na lÔsei o Shashua enai ìmoio me to prìblhma
tou Welling [87℄, me th mình diafor ston periorismì (mh arnhtikè timè ant jetik¸n tim¸n).
O epanalhptikì kanìna pou prokÔptei enai epsh parapl sio me autìn th (4.43). Gia
ton NTF algìrijmo pou protjhke apì tonShashua pragmatopoi jhkan peirmata se bsei
eikìnwn ( Iris image data set, Swimmer library ), sti opoe eqe prosteje jìrubo .
Oi Hazan Shashua
kai sunèqisan thn èreuna sqetik me ti
NTF mejìdou
, aut th for
periorzonta
ton arijmì twn diastsewn sti
3 [39℄. Protjhke èna
algìrijmo
gia NTF
qrhsimopoi¸nta
thn sqetik entropa san mètro apìstash
, koin¸
thn apìklish Kullba
k-
Leibler . O epanalhptikì
pollaplasiastikì
kanìna
pou prokÔptei gia ènan tanust 3
diastsewn V 2 R I I I
1 2 3
enai:
up2(i2 ) up3(i3 )
PI I
2 3
i2 i3 V i i PKm
1
2 3 m m m
=1 u1(i1 ) u2(i2 ) u3(i3 )
up l up l PI I p p : (4.48)
i2 i3 u2(i2 ) u3(i3 )
1( ) 1( ) 2 3
To 2006 oi Heiler kai S hnorr prìteinan ma genkeush tou algorjmou pou eqe protenei o
Hoyer [44℄ gia NMF me periorismoÔ araiìthta [42℄. O algìrijmo protjhke gia tanustè
2 An kai oi perissìteroi apì tou proteinìmenou algorjmou qrhsimopoioÔn ton ìro paragontopohsh
tanust¸n, kat analoga me ton NMF, pio akrib
ìro
enai h aposÔnjesh tanust¸n, pou dhl¸nei ìti èna
tanust
mpore na aposunteje se ajrosmata exwterik¸n ginomènwn dianusmtwn.
61
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
3 diastsewn (me pijan efarmog thn kwdikopohsh eikìnwn). To prìblhma pou tjetai enai
ìmoio me autì pou protjhke apì tou Welling [87℄ kai Hazan [80, 39℄:
K
min 1 jjV X
uj
uj
uj jjF ; 2
uji > 0
uji 2 1 2 3
(4.49)
j =1
me th diafor ìti tjetai èna epiplèon periorismì araiìthta . O Heiler orzei thn araiìthta
ìpw thn orzei o Albright sthn (4.33) [1℄. O epiplèon periorismì enai:
smin
i 6 spar(ui) 6 smax
i : (4.50)
smin max enai pragmatiko arijmo sto disthma [0; 1℄ kai dnontai apì ton
kai si
Oi parmetroi i
qr sth, anloga me thn efarmog . O algìrijmo
onomsthke SMA sparsity maximization
(
algorithm ) kai uprqei se morf yeudok¸dika sto [42℄. ApodeiknÔetai ìti o algìrijmo
Ton IoÔnio tou 2006 o Mpoutsdh prìteine ènan algìrijmo gia thn aposÔnjesh enì
mh arnhtikoÔ tanust [15℄. O algìrijmo
onomsthke PALSIR proje
ted alternating least
(
squares with initialization and regularization ). To prìblhma pou jètei enai to:
K
min 1 jjV X
uj
uj
uj jjF ; 2
uji > 0:
ui 2
(4.51)
j 1 2 3
j =1
Gia thn eplush tou probl mato
, qrhsimopoietai o algìrijmo
ALS . Apì ta dianÔsmata uji
dhmiourgoÔntai oi pnake
Ui 2 R Ii K . Se kje epanlhyh, gia kje i = 1; 2; 3, lÔnetai
efarmog mejìdwn arqikopohsh twn tim¸n gia megalÔterh taqÔthta kai kalÔterh sÔgkli-
sh. Protenetai h qr sh arqikopoi sewn pou uprqoun gia thn mèjodo PARAFAC , kaj¸
kai polugrammikè
epektsei
twn mejìdwn arqikopohsh
tou NMF pou prìteine o Albright
[1℄. Parìla aut, den gnetai kpoia prospjeia melèth
twn mejìdwn arqikopohsh
. Na
shmeiwje ìti sto [15℄ den dnetai apìdeixh gia ton algìrijmo kai gia thn sÔgklis tou.
tik¸n tanust¸n me periorismoÔ ariìthta [21℄. O algìrijmo protenetai mìno gia tanustè
Vk = ADk Sk + Ek ; k = 1; 2; : : : ; I 3 (4.52)
tomè
). O A 2 R
I R enai o pnaka
bsh
( mexh
) pou anaparist tou
pargonte
pou
1
sunjètoun ta dedomèna. O Dk 2 R
RR enai diag¸nio
pnaka
pou èqei sthn kÔria diag¸nio
62
4.4. PROTEIN
OMENES M
EJODOI
Tèlo
, oi pnake
Ek 2 R
krufè
sunist¸se
) twn dedomènwn.
I I enai oi tomè
tou tanust
1 2
E 2 R I I I 1 2 3
pou perièqei ta ljh ton jìrubo, anloga me thn efarmog . Sto Sq ma 4.5
apì ton Ci ho ki .
Sq ma 4.5: Sqhmatik anaparstash tou montèlou pou protenetai apì ton Ci ho ki [21℄.
Dk (Vk jjASk ) =
( ) 1 X
(vitk
[AS ℄ v + ( 1)[AS ℄ ):
1
k it itk k it
( 1)
(4.53)
itk
Na shmeiwje ìti gia = 2, h (4.53) gnetai h apìstash tou Pearson. Gia = 0:5, h (4.53)
gnetai h apìstash tou Hellinger. Gia = 1, h (4.53) gnetai h apìstash tou Neyman.
Gia !1 gnetai h genikeumènh apìklish Kullba
k-Leibler (onomzetai kai I -apìklish).
Qrhsimopoi¸nta ma metasqhmatismènh ekdoq th elttwsh kat klsh ( gradient des ent ),
PI
1=
i=1 air (vitk =[ASk ℄it )
1
q=1 aqr
1
PI2 I3
p=1 (vip =[ASk ℄ip )
s 1=
rp
air air PI I (4.55)
q=1 srq
2 3
X vitk [ ASk ℄it
[ASk ℄it vitk
Dk) (Vk
(
jjASk ) = vitk
( + 1)
+ [ASk ℄it + 1 +Sk jjSk jjL ; 1 (4.56)
it
63
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
ìpou A enai parmetroi kanonikopohsh
pou elègqoun ton bajmì araiìthta
twn pinkwn
S kai A. Sthn perptwsh pou = 1 kai Sk = 0, h (4.56) ekfrzei thn nìrma Frobenius
2ou bajmoÔ. 'Otan ! 0 ekfrzetai h genikeumènh apìklish Kullba
k-Leibler. Tèlo
, ìtan
PI 1
srtk srtk
[ i=1 air (vitk =[ASk ℄it )
1
Sk ℄
PI (4.57)
PI
2 I3
[ p=1 (vip =[ASk ℄it )srp
1
A ℄
air air PI I (4.58)
O Ci ho ki epektenei thn èreuna stou NTF algorjmou sto [22℄, ìpou protenei enalla-
sthn ALS teqnikè kai sthn teqnik Alternating Interior-Point Gradient . Na shmeiwje ìti
oi algìrijmoi pou dnontai aforoÔn mìno dedomèna 3 diastsewn. Den dnontai apodexei
sqetik me thn eÔresh twn pollaplasiastik¸n kanìnwn apì ti sunart sei kìstou . Tèlo ,
na parathrhje ìti to montèlo (4.52) afenì den mpore na genikeute gia perissìtere dia-
stsei , afetèrou den mpore na gnei anagwg tou montèlou sthn perptwsh 2 diastsewn,
stìqou :
Apì ta proteinìmena montèla sthn bibliografa, protim jhke autì twn Shashua kai H-
azan [80, 39℄. To montèlo pou eqe protaje apì ton Heiler anafèretai mìno gia tanustè
3 diastsewn. Epsh , to montèlo tou Ci ho ki anafèretai kai autì mìno gia 3-distatou
tanustè , me epiplèon prìblhma ìti den gnetai anagwg tou probl mato gia tanustè 2h
txh . O lìgo pou epilèqjhke to montèlo twn Shashua kai Hazan enai giat efarmìzetai
se N -distatou tanustè kai gia N = 2 angetai sto montèlo th NMF [55℄. 'Ena epiplèon
64
4.5. NTF QRHSIMOPOI ISEIS BREGMAN
WNTAS APOKL
pleonèkthma tou montèlou enai ìti ekfrzei ènan tanust w jroisma tanust¸n 1ou baj-
tou dojènto tanust me kajorismèno bajmì. To meionèkthma th mejìdou twn Shashua kai
Hazan enai ìti sto [80℄ protjhke o algìrijmo qrhsimopoi¸nta san mètro mìno thn nìr-
maFrobenius . Antstoiqa, sto [39℄ protjhke algìrijmo qrhsimopoi¸nta san mètro thn
ekfrzoun diaforetik mètra [16℄. Qrhsimopoi jhkan gia thn eplush tou probl mato th
prosèggish mh arnhtik¸n pinkwn [83℄ kai sthn paroÔsa diatrib ja qrhsimopoihjoÔn se èna
polugrammikì peribllon gia thn eplush tou pio polÔplokou probl mato th aposÔnjesh
tanust¸n.
4.5.1 Prìblhma
tanust¸n enai h aposÔnjesh tou V se èna jroisma apì k tanustè
1ou bajmoÔ:
k
X
V= uj
uj
ujn
1 2
(4.59)
j =1
ìpou uji 2 R Ii .
+
DÔo probl mata NTF mporoÔn na dhmiourghjoÔn:
k
X
(1) min D u
u
j j
ujn ; V (4.60)
ui >
j 0
j =1
1 2
k
X
(2) min D V ; u
u
j j
ujn (4.61)
ui >
j 0
j =1
1 2
(2.68), parnoume:
k
X
2 IN
I1 IX k
X
D u
u
j
1
j
2
ujn ; V = D j
V
u1(i1 ) u2(i2 ) : : : ujN (iN ) ; i1 i2 :::iN
j
(4.62)
j =1 i1 ;i2 ;:::;iN j =1
Efarmìzonta ton genikì orismì gia apoklsei Bregman pou dnetai apì thn (2.66) sthn
(4.62):
65
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
k
X
2 IN
I1 IX k
X
D u
u
j
1
j
2
ujn ; V = u j
1(
j
i1 ) u2(i2 ) V
ujN (iN ) ; i1 i2 :::iN
j =1 i1 ;i2 ;:::;iN j =1
2 IN
I1 IX k
X
Vi i :::iN
1 2
Vi i :::iN
1 2
u j
1(
j
i1 ) u2(i2 ) ujN (iN ) Vi i :::iN
1 2
(4.63)
i1 ;i2 ;:::;iN j =1
O skopì
enai h eÔresh epanalhptikoÔ pollaplasiastikoÔ kanìna gia kje stoiqeo
k
X
Ii
X
k
X
D u
u
j
1
j
2
ujn ; V = D u
u
j
1
j
2
uji(l)
ujn ; ii =lV (4.64)
j =1 l=1 j =1
ìpou uji(l) l-ost tim tou dianÔsmato
uji
enai h kai Vii l 2 R I I Ii
=
1 2 1 Ii+1 IN upo-
k
X
F (ui l ) = D
( ) u
u
j
1
j
2
uji(l)
ujn ; ii =l V (4.65)
j =1
An efarmìsoume ton orismì twn apoklsewn Bregman ìpw
fanetai sthn (2.66) mpore
na deiqje ìti:
I1 Ii X
1 Ii+1 IN k
X
F (ui l ) =
( ) u j
1( i1 ) uji(l) ujN (iN ) Vi :::ii
1 1 lii+1 :::iN
i1 ;:::;ii 1 ;ii+1 ;:::;iN j =1
I1 Ii X
1 Ii+1 IN k
X
Vi :::ii
1 1 lii+1 :::iN u j
1( i1 ) uji(l) ujN (iN ) Vi :::ii
1 1 lii+1 :::iN (4.66)
I1 Ii X
1 Ii+1 IN k
X
uj i uji l ujN iN
Vi :::ii
1 1 lii+1 :::iN
k
X
Vi :::ii
1 1 lii+1 :::iN u j
1( i1 ) uji(l) ujN (iN ) Vi :::ii
1 1 lii+1 :::iN ; (4.67)
j =1
66
4.5. NTF QRHSIMOPOI ISEIS BREGMAN
WNTAS APOKL
ìpou to i :::ii
1 1 jii+1 :::iN orzetai w
:
uj i u~ji l ujN iN
i :::ii jii+1 :::iN = Pk m 1( 1) ( ) ( )
=1 1( 1) ( ) ( )
4.5.3 Apìdeixh
Ja apodexoume ìti h G(ui l ; u~ i l ) enai ìntw
bohjhtik sunrthsh gia thn F (ui l ). Mpo-
( ) ( ) ( )
i :::ii jii :::iN > 0. Gnwrzonta
ìti h sunrthsh () enai kurt , isqÔei h answsh tou
1 1 +1
sunrthsh
G(ui l ; u~ i l ) enai ìmoioi me tou
ìrou
th
sunrthsh
F (ui l ), ektì
tou
( ) ( ) ( )
pr¸tou ìrou. An efarmìsoume thn answsh tou Jensen ston 1o ìro th sunrthsh
G(ui l ; u~ i l ):
( ) ( )
I1 Ii X
1 Ii+1 IN k
X
uj i uji l ujN iN
i :::ii
( ) ( )
>
i1 ;:::;ii 1 ;ii+1 ;:::;iN j =1 1 1 jii+1 :::iN
I1 Ii X1 Ii+1 IN
X k
u j
1( i1 ) uji(l) ujN (iN ) : (4.69)
To deÔtero misì th answsh (4.69) enai o pr¸to ìro th sunrthsh F (ui l ).( ) 'Ara,
Gia thn paragwg enì
pollaplasiastikoÔ kanìna enhmèrwsh
gia to uji l , h bohjhtik su-
( )
67
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
I1 Ii X
1 Ii+1 IN
G
up i upi l upN iN
upi l
= i :::ii
1 1 pii+1 :::iN
1( 1)
i :::ii
( ) ( )
( ) i1 ;:::;ii 1 ;ii+1 ;:::;iN 1 1 pii+1 :::iN
up1(i1 ) : : : upi 1(ii upi p
ii+1 ) : : : uN (iN )
i :::ii
1) +1(
1 1 pii+1 :::iN
I1 Ii X
1 Ii+1 IN
Vi :::l:::iN
1
u1(i1 ) : : : upi
p
up
1(ii 1 ) i+1(ii+1 )
: : : upN (iN ) (4.70)
Antikajist¸nta
to i :::ii
1 1 pii+1 :::iN kai ektel¸nta
ti
prxei
:
I1 Ii X
1 Ii+1 IN
G
upi l
= u1(i1 ) : : : upi
p
up
1(ii 1 ) i+1(ii+1 )
: : : upN (iN )
( ) i1 ;:::;ii 1 ;ii+1 ;:::;iN
k
X upi(l)
u1(i1 ) : : : u~i(l) : : : uN (iN ) p
m m m
u~i(l)
m=1
I1 Ii X1 Ii+1 IN
V
i1 :::l:::iN
up1(i1 ) : : : upi up
1(ii 1 ) i+1(ii+1 )
: : : upN (iN ) (4.71)
H exswsh (4.71) den mpore na luje genik gia ìle ti peript¸sei apoklsewn Bregman .
An jewrhje ìti h sunrthsh () enai diaqwrsimh, dhlad ìti (xy) = (x) (y) , tìte
katal goume:
I1 Ii X
1 Ii+1 IN
upi l k
X
( )
u~pi l
u1(i1 ) : : : upi
p
up
1(ii 1 ) i+1(ii+1 )
: : : upN (iN ) um
1(i1 )
: : : u~mi(l) : : : um
N (iN )
( ) i1 ;:::;ii 1 ;ii+1 ;:::;iN m=1
I1 Ii X
1 Ii+1 IN
= Vi :::l:::iN
1
u1(i1 ) : : : upi
p
1(ii 1 )
upi+1(ii+1 ) : : : upN (iN ) (4.72)
upi(l) u~pi(l) 1
P
1 1( 1) 1( ii 1)
P
+1(
(up i : : : upi
1( 1) 1( ii 1)
upi +1( ii+1 ) : : : uN (iN ) )
p
( km=1 um1(i1 ) : : : u~mi(l) : : : umN(iN ))
(4.73)
P PI
1 Ii 1 Ii+1 IN
ìpou to sumbolzei to pl re
jroisma
i1 ;:::;ii 1 ;ii+1 ;:::;iN . O pollaplasiastikì
kanìna
68
4.6. NTF GIA SUGKEKRIM ISEIS BREGMAN
ENES APOKL
Gnwrzoume ìti (x) = x log x. 'Ara, h pargwgo
th
(x) enai (x) = log x + 1. Parath-
retai ìti (xy) = log x + log y + 1, ra h (xy) enai diaqwrsimh. Katal goume ètsi ston
epanalhptikì kanìna enhmèrwsh
gia paragontopohsh mh arnhtik¸n tanust¸n qrhsimopoi¸-
Vi1 :::l:::iN
P
(up i : : : upi ii upi p
ii+1 ) : : : uN (iN ) ) log( km=1 um :::u~m
P m
i(l) :::uN (iN )
)
exp
1( 1) 1( 1) +1(
upi(l) u~pi(l) P p
1(i1 )
ìti (xy) = xy, ra h (xy) enai diaqwrsimh. O telikì
epanalhptikì
kanìna
gia NTF
qrhsimopoi¸nta
thn nìrma Frobenius deutèrou bajmoÔ mpore na prokÔyei qrhsimopoi¸nta
thn (4.73):
um
m :::um
1)
u p :::up u p p (4.76)
( ) ( ) PI I
1 i 1 Ii+1 IN 1( i1 ) i 1(i i 1 ) i +1( ii+1 ) :::uN (iN )
i1 ;:::;ii 1 ;ii+1 ;:::;iN Vi1 :::l:::iN
up l u~p l exp PI PI
1(i1 ) 2(i2 ) 3(i3 )
(4.77)
i2 =1 i3 =1 (u2(i2 ) u3(i3 ) )
1( ) 1( ) 2 3 p p
69
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
Vi1 li3
PI
1
i1 =1
PI
3 p p
P
i3 =1 (u1(i1 ) u3(i3 ) ) log( km=1 um u~m um )
up
l u~ p
l exp PI PI
1(i1 ) 2(i2 ) 3(i3 )
(4.78)
i1 =1 i3 =1 (u1(i1 ) u3(i3 ) )
2( ) 2( ) 1 3 p p
Vi1 i2 l
PI
1
i1 =1
PI
2 p p
P
i2 =1 (u1(i1 ) u2(i2 ) ) log( km=1 um um u~m )
u p
l u~ p
l exp PI PI
1(i1 ) 2(i2 ) 3(i3 )
(4.79)
i1 =1 i2 =1 (u1(i1 ) u2(i2 ) )
3( ) 3( ) 1 2 p p
u p
u~ p
l l PI PI Pk
(4.80)
p p
i2 =1 i3 =1 (u2(i2 ) u3(i3 ) ) ( m=1 u ~m1(i1 ) um2(i2 )um3(i3 ) )
1( ) 1( ) 2 3
PI
PI p p
i1 =1 i3 =1 (u1(i1 ) u3(i3 ) ) i1 li3
1 3
V
up
u~ p
l l PI PI Pk
(4.81)
PI
21
PI
i1 =1 i2 =1 (u1(i1 ) u2(i2 ) ) i1 i2 l
p p
V
up
u~ p
l l PI PI Pk
(4.82)
p p
i1 =1 i2 =1 (u1(i1 ) u2(i2 ) ) ( m=1 u1(i1 ) u2(i2 ) u
3( ) 3( ) 1 2 m m ~m )
3(i3 )
1( ) 1( ) PI PI u p up (4.83)
2 3 2(i2 ) 3(i3 )
i2 =1 i3 =1 Vli2 i3
up1(i1 ) up3(i3 )
PI
1
PI
3
i1 =1 i3 =1 Pkm m ~m um
=1 u1(i1 ) u
up l 2( )
u~p l
2( )
PI
1
PI
3
u p up
2(i2 ) 3(i3 )
1(i1 ) 3(i3 )
(4.84)
i1 =1 i3 =1 Vi1 li3
up1(i1 ) up2(i2 )
PI
1
PI
2
i1 =1 i2 =1 Pkm m m ~m
=1 u1(i1 ) u2(i2 ) u
up l 3( )
u~p l
3( )
PI
1
PI
2
u p up
1( i ) 2( i )
3(i3 )
(4.85)
1 2
i1 =1 i2 =1 Vi1 i2 l
O taxinomht NTF me epbleyh apotele genkeush tou taxinomht NMF me epbleyh pou pou
protjhke sto [8℄. Baszetai sth logik ìti pragmatopoietai h fsh ekpadeush gia kje
pnake pou èqoun dhmiourghje kat thn ekpadeush. San mètro sÔgkrish twn probol¸n
gia kje klsh qrhsimopoietai to CSM . Ston NMF ginìtan probol twn dedomènwn elègqou
stou pnake W, stou opoou eqe upoblhje orjogwniopohsh, ètsi ¸ste ta dedomèna
tanust enai akìma anoiktì prìblhma sth bibliografa, kai analutik lÔsh uprqei mìno gia
70
4.8. TAXIN
OMHSH NTF ME EP
IBLEYH
tanust .
O algìrijmo taxinìmhsh NTF me epbleyh paratjetai sth sunèqeia. Afor mìno thn
k
X
Vi = uj
uj
uj = (U
1 2 3 i
2( ) U i ) U3( ) 3 i
1( ) (4.86)
j =1
ìpou U 1( )i enai pnaka
diastsewn I 1( )i k (ìpou I i
1( ) o arijmì
twn dedomènwn ekpa-
Vi = (U U )i UT i
2 3 1( )
(4.87)
(U U )i = Qi Ri
2 3 (4.88)
5. Gia kje klsh, pragmatopoietai sÔgkrish tou dianÔsmato hitest me kje st lh tou
me thn klsh i:
hitestT hji ( )
CSMi = max
j ; ;:::;k khi kkh i k
(4.90)
( )
=1 2
test j
ìpou hji
( )
enai h j -ost st lh tou pnaka Hi . CSMi , kajorzei thn
Tèlo
, to mègisto
ht CSM
Q 1
1
H 1
1
Q ht H CSM
Vt
2 2
2 2
arg
max
. . .
. . .
. . .
h
t CSM
Q
H
hit = Qyi Vt
Sq ma 4.6: Diadikasa elègqou qrhsimopoi¸nta ton proteinìmeno taxinomht NTF (ta dia-
4.9 Parathr sei
O proteinìmeno algìrijmo NTF upertere ènanti twn algorjmwn pou eqan protaje sthn
bibliografa, lìgw th genikìtht tou, tìso ston arijmì twn diastsewn pou qrhsimopoioÔ-
ntai, ìso kai sta mètra omoiìthta pou mporoÔn na qrhsimopoihjoÔn. O algìrijmo mpore
na qrhsimopoihje gia kwdikopohsh eikìnwn kai bnteo, gia taxinìmhsh dedomènwn, akìma kai
gia omadopohsh dedomènwn. Sthn paroÔsa diatrib ìmw , ja qrhsimopoihje sto pedo th
epexergasa qou, èna pedo sto opoo kuriarqoÔn oi monodistate teqnikè . Sugkekrimè-
na, ja qrhsimopoihje gia taxinìmhsh arqewn me bsh to mousikì edo sto opoo an koun,
ìpw ja perigrafe sto Keflaio 6. Sthn sugkekrimènh perptwsh, kje mousikì arqeo ja
anaparstatai apì ènan pnaka qarakthristik¸n ston qrìno. Diaisjhtik, h efarmog NTF
72
4.9. PARATHR
HSEIS
algorjmwn se tètoiou pnake epexhgetai me thn ènnoia ìti kje arqeo qou apoteletai
poteletai apì grammikì sunduasmì basik¸n suqnot twn ston qrìno. 'Etsi, apokttai ma
pio analutik melèth enì s mato qou, kti pou mpore na odhg sei se beltiwmèna apote-
73
KEF
ALAIO 4. PARAGONTOPO
IHSH MH ARNHTIK
WN TANUST
WN
74
Keflaio 5
Exagwg Qarakthristik¸n
5.1 Eisagwg
mènwn enai krsimh gia thn apìdosh tou taxinomht . 'Opw anafèrjhke sthn enìthta 3.3,
armonik qarakthristik.
poioÔntai perigrafe
pou protenontai eurèw
sto pedo th
genik
anagn¸rish
qou ( ge-
neral audio re
ognition - GAD ) kai sto pedo th
epexergasa
omila
( spee
h pro
essing).
O Peeters perigrfei kpoia apì ta qarakthristik qroi pou qrhsimopoioÔntai sthn pa-
roÔsa diatrib sto [70℄. Epsh , qrhsimopoioÔntai basik qronik ( temporal ) kai armonik
qarakthristik, gia ma plhrèsterh perigraf twn mousik¸n kommati¸n. Epsh , gia ìsa
prìtupo MPEG-7 Audio [63℄ [71℄. 'Ola ta qarakthristik pou qrhsimopoi jhkan, perigr-
global
fontai w
kajolik ( ), me thn ènnoia ìti upologzontai gia ìlo to dosmèno s ma
parjuro, lambnonta upìyin plhrofora apì ìla ta mousik ìrgana kai tou tragou-
distè pou akoÔgontai sto mousikì kommti tautìqrona [78, 65℄. Ta qarakthristik pou
75
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
Akolouje h dom tou kefalaou. Sthn enìthta 5.2 dnetai ma genik perigraf tou
protÔpou MPEG-7 Audio kai dnetai èmfash stou perigrafe pou uposthrzei. Analuti-
To prìblhma th paragwg kai dijesh polumesikoÔ ulikoÔ anagnwrsthke apì thn epi-
trop MPEG (Moving Pi tures Experts Group) ton IoÔlio tou 1996, ìpou apofassthke h
MPEG-7
ènarxh tou protÔpou pou enai gnwstì w
, epshma onomazìmeno w
Multimedia
Content Des
ription Interfa
e (Diepaf Perigraf
PolumesikoÔ UlikoÔ) [63℄. O rìlo
tou
MPEG-7 enai na kajorsei ènan sugkekrimèno trìpo perigraf diafìrwn tÔpwn polumesi-
k plhrofora , me skopì na katast sei pio gr gorh kai apodotik thn anaz thsh kai thn
1. Sust mata ( Systems )
3. Optikì ( Visual )
4. Hqhtikì ( Audio )
7. Sumbatìthta ( Conforman e )
To prìtupo MPEG-7 Audio apoteletai apì ergalea perigraf ( des riptors ) kai sq mata
76
5.2. TO PR
OTUPO MPEG-7
Ta ergalea qamhloÔ epipèdou qarakthrzontai apì to prìtupo MPEG-7 [63, 71℄ w pla-
sio ergasa perigraf tou qou ( Audio Des ription Framework ). 'Eqoun thn idiìthta ìti
perilambnei ta ex :
To Audio Des ription Framework perilambnei ergalea qamhloÔ epipèdou ta opoa mpo-
roÔn na qrhsimopoihjoÔn gia thn kataskeu efarmog¸n qou uyhlìterou epipèdou. Stìqo
tou sto MPEG-7 Audio enai na jèsei ma koin bsh ergalewn suggraf kai shmasio-
loga pnw sthn opoa ja mporoÔn na anaptuqjoÔn efarmogè sumbatè metaxÔ tou . H
diepaf perigraf qamhloÔ epipèdou ( LLD interfa e ) qwrzetai se 7 fasmatikè kai qronikè
kathgore :
1. Basik Basi
( ): Stigmiae
kumatomorfè
kai timè
enèrgeia
.
2. Basik Fasmatik ( Basi spe tral ): To fsma kai fasmatik qarakthristik, ìpw
to kèntro brou
tou fsmato
( spe
tral
entroid ), h diaspor tou fsmato
( spe
tral
spread ), h epipedikìthta tou fsmato
( spe
tral
atness ).
ristik se q¸ro grammik suqnìthta pou perilambnoun to kèntro brou tou f-
smato ( spe tral entroid ), kai eidik qarakthristik pou anafèrontai sti armonikè
idiìthte
twn shmtwn, ìpw
to armonikì fasmatikì kèntro brou
( harmoni
spe-
tral
entroid harmoni
spe
tral deviation
), armonik fasmatik apìklish ( ), armonik
fasmatik diaspor ( harmoni spe tral spread ) kai armonik fasmatik diakÔmansh
kthristik pou qrhsimopoioÔntai kurw gia anagn¸rish qou, all kai w anapara-
77
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
Oi LLDs enai mia seir apì aploÔ perigrafe qamhl poluplokìthta pou enai sqediasmè-
noi gia na qrhsimopoioÔntai apì to plasio ergasa tou diast mato qou ( AudioSegment ).
upokathgora tou klimak¸menou tÔpou ( S alar ) to opoo kai kajorzei thn tim th peri-
tim 10 ms
.
maxRange pou deqnoun ta pnw kai ktw ìria tou pltou tou s mato , antstoiqa.
logarijmhmènh suqnìthta .
78
5.3. QRHSIMOPOIO
UMENA QARAKTHRISTIK
A
tou fsmato isqÔo tou s mato sthn suqnìthta apì èna eppedo fsma (èna s ma
jorÔbou).
(low-dimensional features ) tou fsmato met thn probol tou se bsh meiwmènh
txh .
qhtikoÔ s mato .
13. AudioHarmoni ityType: Perigrfei ton bajmì armonikìthta enì hqhtikoÔ s -
14. TimbreDes
riptors: Efarmìzontai se olìklhrh thn kumatomorf kai ìqi se degmata
tou s mato
. Oi perigrafe
qroi
baszontai ston upologismì th
jemeli¸dou
suqnì-
Timbre-
thta
tou s mato
, kai sthn eÔresh twn armonik¸n koruf¸n tou fsmato
. Oi
ni
Spe
tralDeviation Harmoni
Spe
tralSpread Harmoni
Spe
tralVariation Spe
tra-
, , ,
lCentroid TemporalCentroid
, kai .
15. Silen eType: H siwp anafèretai w qarakthristikì enì hqhtikoÔ diast mato , sto
P [n℄ = js[n℄j 2
(5.1)
79
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
ìpou s[n℄ to arqikì s ma. H AP upologzetai w h mèsh tim th stigmiaa isqÔo gia
ma akolouja tim¸n m kou so me to hopsize kai apojhkeÔetai se ma qronoseir mèswn
tim¸n ( Mean eld of a SeriesOfS alarType ). Parèqei mia oikonomik apeikìnish tou fsmato
2. Oi suntelestè tou fsmato isqÔo pou enai ktw apì 62.5 Hz antikajist¸ntai me
ènan suntelest , me isqÔ sh me to jroism tou kai onomastik suqnìthta 31.25 Hz .
ASCx = n log2 (P
fx [n℄=1000)Px[n℄
n Px [n℄
(5.2)
thte .
me anafor sto kèntro brou tou. H exagwg tou ASS èqei w ex :
1. Upologismì
twn suntelest¸n tou fsmato
isqÔo
(ìpw
orzontai sto AP ) Px (n)
kai twn antstoiqwn suqnot twn.
s
P
n ((log2 (fx [n℄P
=1000) ASCx) Px [n℄)
2
ASSx =
n Px [n℄
(5.3)
80
5.3. QRHSIMOPOIO
UMENA QARAKTHRISTIK
A
H diaspor fsmato enai mia oikonomik perigraf th morf tou fsmato isqÔo
pou dhl¸nei an sugkentr¸netai kont sto kèntro brou tou ektenetai se ìlo to fsma.
Epitrèpei ton diaqwrismì anmesa se jìrubo tìno (hmitonoeidè s ma ma suqnìthta ).
to . To ASF katagrfei thn apìklish tou fsmato isqÔo tou s mato apì èna eppedo
fsma (èna s ma jorÔbou). Uyhl apìklish deqnei thn parousa tonik¸n sustatik¸n. H a-
nlush omoiomorfa fsmato upologzetai gia ènan arijmì zwn¸n suqnìthta . H exagwg
2. KalÔptetai èna eÔro suqnìthta metaxÔ lowEdge kai highEdge . Oi akrae suqnìthte
twn zwn¸n metasqhmatzontai se dekte twn suntelest¸n tou fsmato isqÔo . Gia
q
ih(b) il(b)+1 Qih(b)
n=1 Px [n℄
ASFx;b = Pih(b) (5.4)
1
ih(b) il(b)+1 n=1 Px [n℄
'Opou il(b) kai ih(b) enai ta kat¸tata kai an¸tata kra th
z¸nh
b. An to s ma èqei
85%-95%) th enèrgeia tou s mato . O majhmatikì tou orismì dnetai apì th sqèsh:
h
X K
X1
K 1
SRF = arg max S [t; k℄ < T H S [t; k℄ (5.5)
h=0
k=0 k=0
ìpou TH enai èna kat¸fli me sun jei
timè
0.85-0.95 (parnei timè
sthn perioq 0-1) kai
S [t; k℄ o DFT tou s[n℄ sto t-ostì plasio. Oi toniko qoi èqoun uyhlì SRF , en¸ oi krousto
qoi kai h omila èqoun qamhlè
timè
. 'Etsi to SRF metrei se pìso uyhlè
suqnìthte
sto
pou montelopoioÔn thn anjr¸pinh antlhyh suqnot twn. O upologismì tou enai w ex :
81
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
8
>
>
>
>
0 m < f [n 1℄
>
>
>
>
>
>
>
>
>
>
>
< (f [n+1℄
>
2(m f [n 1℄)
f [n 1℄)(f [n℄ f [n 1℄)
f [n 1℄ 6 m 6 f [n℄
Hn [m℄ = (5.6)
>
>
f [n℄ 6 m 6 f [n + 1℄
>
> 2(f [n+1℄ m)
>
(f [n+1℄ f [n 1℄)(f [n+1℄ f [n℄)
>
>
>
>
>
>
>
>
>
>
>
:
0 m > f [n + 1℄
F
Bmel (F ) = 1125 ln 1 +
700 : (5.7)
Nsl B (F ) Bmel (Fl )
f [n℄ = Bmel Bmel (Fl ) + n mel h
1
;
N +1
(5.8)
Fs
ìpou Fs enai h suqnìthta deigmatolhya
se Hz, Fl enai h qamhlìterh suqnìthta th
trpeza
fltrwn se Hz, kai Fh h antstoiqh uyhlìterh suqnìthta. Tupikè
timè
gia
Nsl 1
X
E [n℄ = ln jS [t; k℄j Hn[k℄ ; 0 < n 6 N;
2
(5.9)
t=0
MF CC [m℄ =
X1
E [n℄
os m
2n 1 ; 06 m<N
n=0
2N (5.10)
82
5.3. QRHSIMOPOIO
UMENA QARAKTHRISTIK
A
sto pedo tou qrìnou, kaj¸ enai o antstrofo metasqhmatismì Fourier tou fsmato
N
AC (t) =
1 X
s[n℄s[n t℄; t = 0; : : : ; N 1 (5.11)
N n=t+1
Sthn paroÔsa diatrib , krat¸ntai oi 13 pr¸toi suntelestè
th
autosusqètish
[70℄.
mèqri thn qronik stigm pou to s ma parnei th mègisth tim tou. Majhmatik orzetai w :
LAT = log (T 1 T 0)
10
(5.12)
ìpou T 0 enai o qrìno
ènarxh
tou s mato
kai T 1 o qrìno
pou to s ma parnei th mègisth
tim tou. O upologismì
tou LAT apeikonzetai sto Sq ma 5.1.
s mato . Orzetai w :
TC =
PN
n=1 js[n℄j n :
2
n js[n℄j
PN (5.13)
2
=1
83
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
N
1 X1
ZCR =
N 1 n jsign(s[n℄) sign(s[n 1℄)j
=0
(5.14)
ìpou sign() enai h sunrthsh pros mou, dhlad 1 gia tou jetikoÔ arijmoÔ kai -1 gia tou
arnhtikoÔ arijmoÔ .
O algìrijmo enai aplì kai gi> autìn to lìgo èqei qamhl upologistik poluplokìthta.
To ZCR enai èna mètro tou jorÔbou pou perièqetai se èna s ma. Genik¸
, oi armoniko qoi
èqoun qamhlè
timè
ZCR, en¸ oi krousto qoi kai h omila èqoun uyhlè
timè
.
klmaka Bark protjhke apì ton Zwi ker to 1980, se ma prospjeia montelopohsh tou
Bark orzetai w :
F F
BBark (F ) = 13 tan
1315:8 +3:5 tan 7518 : (5.15)
ìpou parmetro , ìpou sun jw = 1:0 . Sto Sq ma 5.2 apeikonzontai oi mpnte th
klmaka Bark se Hz .
Gia ton upologismì th T L, h klmaka Bark qwrzetai se 24 mpnte diou eÔrou . Gia kje
suqnotik mpnta upologzetai h enèrgei tou DFT pou antistoiqe sthn mpnta, EBark (z ),
ìpou z = 1; : : : ; 24. H Spe
i
Loudness orzetai w
:
SL(z ) = (EBark (z )) : 0 23
(5.16)
z =1
upologismì twn suntelest¸n baszetai sti klmake Bark [90℄ kai Sone [11℄. Paratjetai
84
5.3. QRHSIMOPOIO
UMENA QARAKTHRISTIK
A
3. Efarmog spe tral masking sti 24 mpnte , ìpou prokÔptei èna s ma pou perièqei ta
pou parathretai apì to fsma enì s mato , ìpou h enèrgeia tou s mato sugkentr¸netai
85
KEF
ALAIO 5. EXAGWG
H QARAKTHRISTIK
WN
s mato . O mousikì tìno ( pit h), enai h antilambanìmenh suqnìthta se èna s ma kai den
O pio aplì trìpo ektmhsh th jemeli¸dou suqnìthta dnetai apì thn sunrthsh au-
tosusqètish tou s mato [75℄. H jemeli¸dh perodo (ra kai h jemeli¸dh suqnìthta) tou
s mato dnetai apì to disthma anmesa sta dÔo mègista th sunrthsh autosusqètish .
Mpore epsh na upologiste apì ti korufè tou DFT tou s mato . Sthn paroÔsa diatrib ,
mègisth pijanofneia sto fsma tou s mato [27, 28℄. O upologismì th jemeli¸dou
K
k=0
ìpou K to m ko
tou DFT an disthma. To t pou kalÔtera exis¸nei ta dÔo mèrh dnei thn
kalÔterh ektmhsh th
jemeli¸dou
periìdou kai kat sunèpeia th
jemeli¸dou
suqnìthta
.
86
Keflaio 6
Peirmata
6.1 Eisagwg
Sto parìn keflaio perigrfetai h peiramatik diadikasa pou efarmìsthke sthn diatrib .
Sthn enìthta 6.2 parousizetai h diadikasa exagwg qarakthristik¸n apì arqea mousikoÔ
edou kai h dhmiourga tou tanust dedomènwn. H epilog twn pio katllhlwn qarakthri-
stik¸n gia taxinìmhsh dnetai sthn enìthta 6.3. Sthn enìthta 6.4 anafèrontai sunoptik oi
fa, kai h ermhnea twn apotelesmtwn dnontai sthn enìthta 6.5. Tèlo , sthn enìthta 6.6
Gia ta peirmata sthn diatrib qrhsimopoi jhke h bsh mousikoÔ edou GTZAN , pou dhmiour-
g jhke apì ton Tzanetkh [84℄. 'Opw
èqei dh anaferje sthn enìthta 3.2, h bsh GTZAN
perièqei 1000 arqea, kalÔptonta
10 mousik edh: Classi
al, Country, Dis
o, HipHop, Jazz,
Ro
k, Blues, Reggae, Pop , kai Metal . Se kje klsh an koun 100 arqea. Kje arqeo èqei
deigmatolhya sta 22.050 Hz , me 16 bits an degma. Ta arqea èqoun thn onomatologa
timè apì 000 èw 100. Profan¸ , o stìqo enai h orj taxinìmhsh twn arqewn sti pa-
rapnw 10 kathgore (me diamersei twn arqewn se dedomèna ekpadeush kai dedomèna
elègqou).
'Oson afor thn exagwg qarakthristik¸n apì ta arqea: se kje arqeo qrhsimopoietai
87
KEF
ALAIO 6. PEIR
AMATA
èna parjuro dirkeia 1 se , qwr epikalÔyei , ra kje arqeo qwrzetai se 30 tm mata.
An parjuro 1 se upologzontai 100 qarakthristik, ra to m ko tou diast mato an
tm ma enai 10 mse . Gia kje tm ma tou 1 se , upologzontai ta qarakthristik perigraf
pou analÔjhkan sthn enìthta 5.3. Gia ìla ta qarakthristik, ektì twn AC , LAT , kai
ropè
twn antstoiqwn diafor¸n pr¸th
txh
. Gia ta qarakthristik AC , LAT , kai TC
den upologzontai statistikè
ropè
kai pr¸te
diaforè
, all upologzontai autoÔsia. To
pl jo twn tim¸n pou parnei to kje qarakthristikì an qronikì disthma parousizetai
èqei diastsei 1000 187 30 , ìpou 1000 enai h distash twn arqewn, 187 h distash twn
qarakthristik¸n, kai 30 h distash tou qrìnou. Sto Sq ma 6.1 dnetai ma anaparstash
88
6.3. EPILOG
H QARAKTHRISTIK
WN
Qar/k
V
t
Dedomèna
na megistopoie ton lìgo th
diaspor
entì
twn klsewn ( inter-
lass dispersion
) pro
thn
diaspor anmesa sti klsei ( intra- lass dispersion ). O lìgo autì sumbolzetai me:
J = tr(Sw Sb );
1
(6.1)
ìpou o telest tr() dhl¸nei to qno enì pnaka, dhlad to jroisma twn stoiqewn th
diagwnou tou. O pnaka
Sw enai o pnaka
diaspor
entì
twn klsewn ( within-
lass
s
atter matrix ) kai orzetai w
:
C X
X
Sw = (vk i )(vk i)T ; (6.2)
i=1 vk 2Ci
an koun sthn i-ost klsh, kai i to dinusma mèsh
tim
gia ta qarakthristik pou an koun
sthn i-ost klsh. 'Ara o Sw enai tetragwnikì
pnaka
me diastsei
ìso me to mègejo
twn
qarakthristik¸n. O pnaka Sb enai o pnaka diaspor anmesa sti klsei (between- lass
C
X
Sb = (i )(i )T ; (6.3)
i=1
ìpou to dinusma mèsh tim gia ta qarakthristik ìlwn twn klsewn. Oi pnake dia-
spor klsewn qrhsimopoioÔntai sta plasia th grammik anlush diaqwrismoÔ pou pro-
'Estw ìti o arqikì arijmì qarakthristik¸n enai F, en¸ o epijumhtì arijmì qara-
89
KEF
ALAIO 6. PEIR
AMATA
F 0!
pou mpore na dhmiourghjoÔn enai
(F 0 F )!F ! , kajist¸nta
thn exantlhtik anaz thsh upo-
pedo antistoiqe sto pl re sÔnolo, en¸ sto qamhlìtero eppedo kje kìmbo antistoiqe
dom qrhsimopoi¸nta
anaz thsh pr¸ta se bjo
me opisjoq¸rhsh ( depth rst sear
h with
ba
ktra
king ). Perissìtere
plhrofore
gia ton algìrijmo uprqoun sto [41℄.
Sthn sugkekrimènh perptwsh, to F isoÔtai me 187, en¸ kajorsthke met apì peirma-
g jhke to anptugma V 2R
(2)
187 1000 30 , sto opoo h klsh metablletai kje 3.000 st le
tou pnaka. Ta 20 pr¸ta epilegmèna qarakthristik paratjentai ston Pnaka 6.2. Para-
throÔme ìti ta perissìtera epilegmèna qarakthristik an koun stou MF CCs, kti pou
deqnei giat qrhsimopoioÔntai eurèw sthn bibliografa. Axzei na shmeiwje kai h parousa
tou T L kai twn suntelest¸n AC . Oi suntelestè SLS , parìlo pou enai 32 ston arijmì,
twn teqnht¸n neurwnik¸n diktÔwn ( arti ial neural networks - ANNs ). ApoteloÔntai apì
eppeda upologistik¸n mondwn, tou neur¸ne , ìpou kje eppedo neur¸nwn sundèetai me to
epìmeno ( feedforward network ). Kje teqnhtì neur¸na èqei pollè eisìdou , ìpou dèqetai
wi ; i = 1; : : : ; N . Tèlo
, kje neur¸na
èqei ma èxodo, y , h opoa parnei thn tim y = f (u),
PN
ìpou u =
i xi wi kai f (u) sunrthsh, sun jw
h bhmatik h sigmoeid
sunrthsh.
T
=1
Oi MLPs sun jw apoteloÔntai apì 3 eppeda, to eppedo eisìdou ( input layer ), to endi-
meso eppedo ( hidden layer ), kai to eppedo exìdou ( output layer). Ma anaparstash enì
MLP dnetai sto Sq ma 6.2. Enai dunatìn oi MLPs me èna endimeso eppedo na apeikonsoun
H ekpadeush twn MLPs gnetai me th metabol twn bar¸n tou diktÔou, qrhsimopoi¸nta
90
6.4. EPIPR
OSJETOI TAXINOMHT
ES
Pnaka 6.2: 20 pr¸ta epilegmèna qarakthristik tou algorjmou bran h and bound .
4 Metablhtìthta 2ou MF CC
5 5o suntelest AC
9 Metablhtìthta ASS
14 Metablhtìthta 3ou MF CC
15 Metablhtìthta T L
19 Mèsh tim T L
20 3o suntelest AC
deush enai h èxodo tou diktÔou me dedomènh esodo na tautzetai me thn epijumht èxodo.
Dhmiourgetai ma sunrthsh ljou kai ta brh twn neur¸nwn diorj¸nontai, ètsi ¸ste na
elaqistopoietai h tim th sunrthsh . H diìrjwsh twn bar¸n gnetai apì thn èxodo pro
ta psw eppeda. H diìrjwsh twn bar¸n enai epanalhptik diadikasa kai qrhsimopoie ton
kanìna: wij (t + 1) = wij (t) + awij (t 1) + Æj yj (t), ìpou t o arijmì
th
epanlhyh
, wij to
bro
apì ton neur¸na i ston neur¸na j , onomzetai parmetro
mjhsh
, a enai h par-
metro orm (momentum), kai Æj to ljo ston neur¸na j , to opoo upologzetai anloga
91
KEF
ALAIO 6. PEIR
AMATA
'Exodo
Sq ma 6.2: Anaparstash enì Multilayer Per eptron me 4 eisìdou , 5 kìmbou sto endimeso
Oi mhqanè edrawn dianusmtwn ( support ve tor ma hines - SVMs ) apoteloÔn èna sÔnolo
mejìdwn pou qrhsimopoioÔntai gia taxinìmhsh kai palindrìmhsh. Enai gnwsto kai w taxi-
nomhtè mègistou perijwrou ( maximum margin lassiers ). Majanoun èna eidikì grammikì
ìrio apìfash , to upereppedo mègistou perijwrou, kti pou ti kajist anjektikè sthn
smì metaxÔ twn klsewn. Ta shmea me th mikrìterh apìstash apì to upereppedo mègistou
ra, to sÔnolo twn edrawn dianusmtwn kajorzei me monadikì trìpo to upereppedo mègistou
perijwrou.
Sthn apl perptwsh, me dedomèna pou an koun se 2 klsei kai enai grammik¸ diaqwr-
apì thn arq twn axìnwn. A
oriste d h apìstash apì to upereppedo sto kontinìtero
+
shmeo th pr¸th klsh kai d h apìstash apì to upereppedo sto kontinìtero shmeo
92
6.4. EPIPR
OSJETOI TAXINOMHT
ES
perijwrou pou enai kont sta edraa dianÔsmata th
1h
klsh
enai tow x + b = +1, en¸
to upereppedo pou enai kont sta edraa dianÔsmata th
2h
klsh
enai to w x + b = 1.
H apìstash anmesa sta dÔo upereppeda enai 2=jjwjj, ra ja prèpei na elaqistopoihje to
jjwjj. Sto Sq ma 6.3 apeikonzetai to eppedo mègistou perijwrou gia ma perptwsh grammi-
k¸
diaqwrsimwn dedomènwn, maz me ta 2 upereppeda twn klsewn. An upojèsoume ìti ta
Sq ma 6.3: Apeikìnish tou uperepipèdou mègistou perijwrou gia grammik¸ diaqwrsima
93
KEF
ALAIO 6. PEIR
AMATA
n n
1
LP = jjwjj 2
X
i yi (xi w + b) +
X
ai ;
2 i=1 i=1
(6.7)
n n
X 1X
LD =
i=1
ai
2 i;j i j yiyj (xi xj );
=1
(6.8)
O parapnw taxinomht enai grammikì , èqoun ìmw protaje kai mh grammiko algì-
stoÔn gia probl mata poll¸n klsewn. Ma apl teqnik ekpaideÔei C taxinomhtè , ìpou o
kje èna diaqwrzei thn ma klsh enanton ìlwn twn llwn [13℄.
nomhtè : NMF, LNMF, SNMF, SVM, MLP NTF , me apìklish Kullba k-Liebler NTF , me
mopoi jhke o taxinomht pou parousisthke sthn enìthta 4.3.6. Gia ton SNMF algìrijmo
94
6.5. PEIRAMATIK
A APOTEL
ESMATA
Pnaka 6.3: Apotelèsmata epituqhmènh taxinìmhsh gia pnake -tanustè 187 qarakthri-
SNMF 2 = 0:001
( ) 55.44% 61.00%
den leitourgoÔn me poludistata dedomèna, par mìno me dianÔsmata. Gia thn ekpadeush
tou
MLPs , qrhsimopoi jhke dktuo 3 epipèdwn, me rujmì mjhsh
so me 0.3, me momentum
0.2, kai me 500 epanal yei
gia ekpadeush. O arijmì
twn neur¸nwn sto endimeso eppedo
enai so me ( 187 30 + 10)=2 = 2810 . Antstoiqa, ta dedomèna tou tanust V metatrphkan
diaforetikè diamersei dedomènwn ekpadeush kai elègqou, h pr¸th me 70%-30% kai h deÔ-
ta dedomèna twn 187 qarakthristik¸n, en¸ ston Pnaka 6.4 parousizontai ta apotelèsmata
'Oson afor tou
algorjmou
NTF , h parmetro
bajmoÔ k epilèqjhke anloga me ton
algìrijmo, gia megistopohsh th
akrbeia
twn apotelesmtwn. Gia thn perptwsh twn 187
qarakthristik¸n, me 70%-30% diamèrish, to k = 65. Gia diamèrish 90%-10%, enai 62, 62,
kai 68 gia tou 3 taxinomhtè me th seir pou parousizontai. Gia ta 80 epilegmèna qara-
kthristik me 70%-30% diamèrish, enai 60, 64,kai 65, antstoiqa. Tèlo , gia thn diamèrish
95
KEF
ALAIO 6. PEIR
AMATA
qarakthristik¸n.
SNMF 2 = 0:001
( ) 57.55% 66.66%
sh sunanttai apì ton taxinomht NTF me nìrma Frobenius , me 75%. To posostì uperbanei
to 61.0% tou Tzanetkh sto [84℄, kai to 74.9% tou Lidy [58℄, all den uperbanei to 78.5%
tou Li [57℄. O taxinomht NTF genikìtera parousizetai na èqei polÔ uyhl posost se
klish Kullba k-Leibler genik èqei qamhlìterh akrbeia se ìle ti peript¸sei , me posost
kont stou taxinomhtè NMF . Epsh , o taxinomht NTF me apìstash Itakura-Saito dnei
pnw deqnei ìti h apìstash Itakura-Saito den enai katllhlh gia thn taxinìmhsh genik¸n
qarakthristik¸n kai kalÔtera na qrhsimopoietai gia thn sÔgkrish fasmtwn omila , ìpou
protjhke. Ta MLPs genikìtera èqoun akrbeia parìmoia me aut n twn SVMs . Parathretai
dnontai apì ton taxinomht NTF me nìrma Frobenius , ta SVMs kai ta MLPs .
Genikìtera fanetai h uperoq twn 80 epilegmènwn qarakthristik¸n ènanti twn 187 arqi-
k¸n qarakthristik¸n. Gia ton taxinomht NTF me nìrma Frobenius , h beltwsh se sqèsh me
ta 187 qarakthristik enai +8%, gia ta SVMs enai +5%, kai gia ta MLPs +5%. Epsh ,
fanetai ìti me diamèrish 90%-10% ta apotelèsmata enai saf¸ beltiwmèna se sqèsh me thn
diamèrish 70%-30%, kti pou enai logikì afoÔ sthn diamèrish 90%-10% dnontai perissìtera
Anaforik me tou taxinomhtè NMF , parathretai ìti usteroÔn ènanti twn taxinomht¸n
SVM, MLP , kai NTF me nìrma Frobenius . Sthn perptwsh twn 80 epilegmènwn qarakth-
akrbeia 66.66%. Parathretai sunolik ma uperoq tou taxinomht SNMF 2 ènanti tou
SNMF 1 . O taxinomht LNMF sthn perptwsh twn 187 qarakthristik¸n kai gia ti 2 dia-
96
6.6. MELLONTIK
ES KATEUJ
UNSEIS
O taxinomht NMF tèlo , parousizei akrbeia qamhlìterh apì tou taxinomhtè SNMF , me
posost sugkrinìmena me ton taxinomht NTF me apìklish Kullba k-Leibler . 'Ara, to sumpè-
rasma pou prokÔptei enai ìti oi polugrammiko algìrijmoi NTF enai saf¸ pio katllhloi
gia taxinìmhsh dedomènwn se sqèsh me tou grammikoÔ NMF algorjmou (profan¸ sthn
anmesa ston taxinomht NTF me nìrma Frobenius kai tou taxinomhtè SVM kai MLP .
H mèjodo pou qrhsimopoietai parousizetai sto [35℄, ìpou dnetai h upìjesh ìti ta ljh
deiqje ìti h beltwsh tou taxinomht NTF me nìrma Frobenius se sqèsh me tou taxinomhtè
SVM kai MLP den enai statistik shmantik me disthma empistosÔnh
95% ( = 0:05).
Antijètw
, enai statistik shmantik se sqèsh me tou
taxinomhtè
NMF kai tou
llou
2 taxinomhtè NTF . Na shmeiwje ìti h diafor 3.5% tou NTF me nìrma Frobenius kai tou
Analutik oi epidìsei twn taxinomht¸n NTF me nìrma Frobenius SVM, , kai MLP pa-
sÔgqush gia ton taxinomht NTF me nìrma Frobenius parousizetai ston Pnaka 6.5, gia
ton taxinomht SVM ston Pnaka 6.6, kai gia ton taxinomht MLP ston Pnaka 6.7. Oi st -
le twn pinkwn sÔgqush antistoiqoÔn sto mousikì edo pou katètaxe o taxinomht , en¸
oi grammè sto pragmatikì mousikì edo . Gia ton taxinomht NTF me nìrma Frobenius , ta
ljh taxinìmhsh apì thn Hiphop sth Dis o . Na shmeiwje ìti pollè forè ta ìria metaxÔ
eid¸n enai dusdikrita, ìpw sthn perptwsh Pop kai Ro k [78℄. 'Oson afor ta SVMs , oi
tssetai se Country kai Reggae. Uprqoun epsh
ljh apì to Reggae pro
Ro
k Dis
o
, ,
kai Hiphop . Akìma perissìtere ljo taxinom sei gia to Ro k uprqoun ston MLP , ìpou
poi jhkan dedomèna mousikoÔ edou pou èqoun efarmoste sthn bibliografa. Protjhke ma
nèa teqnik gia thn eplush tou probl mato anagn¸rish mousikoÔ edou , qrhsimopoi¸nta
97
KEF
ALAIO 6. PEIR
AMATA
Pnaka
6.5: Pnaka
sÔgqush
gia ton taxinomht NMF me nìrma Frobenius
, gia 80 epileg-
Edo
Blues Classi
al Country Dis
o Hiphop Jazz Metal Pop Reggae Ro
k
Blues 10 0 0 0 0 0 0 0 0 0
Classi al 0 8 1 0 0 0 0 0 1 0
Country 0 0 7 0 0 2 1 0 0 0
Dis o 1 0 0 7 0 1 0 1 0 0
Hiphop 0 0 0 3 7 0 0 0 0 0
Jazz 0 0 1 0 0 9 0 0 0 0
Metal 0 0 0 0 1 0 9 0 0 0
Pop 1 0 0 0 1 0 0 6 1 1
Reggae 0 0 1 0 2 1 0 0 6 0
Ro k 0 0 0 2 0 0 0 1 1 6
Pnaka 6.6: Pnaka sÔgqush gia ton taxinomht SVM gia 80 epilegmèna qarakthristik
me diamèrish 90%-10%.
Edo
Blues Classi
al Country Dis
o Hiphop Jazz Metal Pop Reggae Ro
k
Blues 6 1 1 0 0 0 0 0 0 0
Classi al 0 8 0 1 0 0 0 0 0 1
Country 0 0 7 0 0 0 0 0 0 2
Dis o 0 0 0 8 0 0 0 0 0 0
Hiphop 0 0 0 0 5 0 0 1 1 0
Jazz 0 1 0 1 0 10 0 0 0 0
Metal 0 0 0 2 0 0 14 0 0 0
Pop 0 0 1 0 0 0 0 6 0 0
Reggae 0 0 1 2 2 0 0 0 5 2
Ro k 1 0 3 0 0 0 0 0 3 4
98
6.6. MELLONTIK
ES KATEUJ
UNSEIS
Pnaka 6.7: Pnaka sÔgqush gia ton taxinomht MLP gia 80 epilegmèna qarakthristik
me diamèrish 90%-10%.
Edo
Blues Classi
al Country Dis
o Hiphop Jazz Metal Pop Reggae Ro
k
Blues 7 0 0 0 0 0 1 0 0 0
Classi al 0 10 0 0 0 0 0 0 0 0
Country 1 0 6 0 0 0 0 0 0 2
Dis o 0 0 0 8 0 0 0 0 0 0
Hiphop 0 0 0 0 5 0 0 1 1 0
Jazz 0 1 0 0 0 8 0 0 2 1
Metal 0 0 0 0 1 0 13 0 0 2
Pop 0 0 0 0 1 0 0 6 0 0
Reggae 0 0 0 1 3 0 0 0 7 1
Ro
k 1 0 3 0 1 0 0 0 4 2
krit rio apoklsei
Bregman . Epiprìsjeta, protjhke èna
nèo
taxinomht
gia thn mèjodo
kthristik kai sta qrhsimopoioÔmena dedomèna, gia beltwsh twn apotelesmtwn kai kalÔterh
Qr sh dedomènwn apì lle bsei : gia pardeigma oi bsei twn diagwnism¸n MIREX ,
ìpw oi bsei ISMIR 2004 kai 2005. Ektì th bsh GTZAN , oi upìloipe bsei den
èqoun qrhsimopoihje gia sugkritik apotelèsmata kai den sunstatai h qr sh tou .
fèretai kai w istìgramma rujmoÔ), to opoo mpore na qrhsimopoihje kai w bsh gia
99
KEF
ALAIO 6. PEIR
AMATA
na qrhsimopoihjoÔn teqnikè parìmoie me autè pou protenontai apì ton Albright [1℄.
ljh sthn taxinìmhsh. Sthn sugkekrimènh perptwsh bèbaia, jewr jhkan mikr tm ma-
tanom¸n.
100
Bibliografa
[3℄ J.-J. Au
outurier and F. Pa
het, \Representing musi
al genre: a state of the art," J.
New Musi
Resear
h, Vol. 32, No. 1, pp. 83-93, 2003.
[4℄ A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh, \Clustering with Bregman Diver-
gen
es,", J. Ma
hine Learning Resear
h, Vol. 6, pp. 1705-1749, 2005.
[5℄ J. G. A. Barbedo and A. Lopes, \Automati
genre
lassi
ation of musi
al signals,"
EURASIP Journal on Advan
es in Signal Pro
essing, Vol. 2007, Arti
le ID 64960, 2007.
[8℄ E. Benetos, M. Kotti, and C. Kotropoulos, \Applying supervised
lassiers based on
non-negative matrix fa
torization to musi
al instrument
lassi
ation," in Pro
. IEEE
Int. Conf. Multimedia & Expo, July 2006.
[9℄ E. Benetos, M. Kotti, and C. Kotropoulos, \Large s
ale musi
al instrument identi
a-
tion," in Pro
. 4th Sound and Musi
Computing Conferen
e, July 2007.
[10℄ G. Birkho and S. Ma
Lane, \A Survey of Modern Algebra ", New York: Ma
millan
Publishing Co., 1977.
101
BIBLIOGRAF
IA
[11℄ R. Bladon, \Modeling the judgment of vowel quality dieren
es," J. A
ousti
al So
iety
of Ameri
a, Vol. 69, No. 5, pp. 1414-1422, 1981.
[12℄ A. I. Borisenko and I. E. Taparov, \Ve
tor and Tensor Analysis with Appli
ations ",
New York: Dover Publi
ations In
., 1968.
[13℄ B.E. Boser, I.M. Guyon, and V. Vapnik, \A training algorithm for optimal margin
lassiers," in Pro
. 5th Annual Workshop Computational Learning Theory, pp. 144-
152, 1992.
[14℄ C. Bourin and P. Bondon, \EÆ
ien
y of high-order moment estimates,", IEEE Trans.
Signal Pro
essing, Vol. 46, No. 1, January 1998.
[16℄ L. M. Bregman, \The relaxation method of nding the
ommon points of
onvex sets
and its appli
ation to the solution of problems in
onvex programming," USSR Compu-
tational Mathemati
s and Mathemati
al Physi
s, Vol. 7, pp. 200-217, 1967.
[17℄ R. Bro, \Parafa
: tutorial and appli
ations,", Chemometri
s and Intelligent Laboratory
Systems, Vol. 38, pp. 149-171, 1997.
[18℄ C. J. C. Burges, \A tutorial on support ve
tor ma
hines for pattern re
ognition," Data
Mining and Knowledge Dis
overy, Vol. 2, pp. 121-167, 1998.
[19℄ J.J. Burred and A. Ler
h, \A hierar
hi
al approa
h to automati
musi
al genre
lassi-
ation," in Pro
. 6th Int. Conf. Digital Audio Ee
ts (DAFx), September 2003.
[20℄ Z. Cataltepe, Y. Yaslan, and A. Sonmez, \Musi
genre
lassi
ation using MIDI and
audio features," EURASIP Journal on Advan
es in Signal Pro
essing, Vol. 2007, Arti
le
ID 36409, 2007.
[21℄ A. Ci
ho
ki, R. Zdunek, S. Choi, R. Plemmons, and S. Amari, \Non-negative tensor
fa
torization using alpha and beta divergen
es," in Pro
. 32nd Int. Conf. A
ousti
s,
Spee
h, and Signal Pro
essing, April 2007.
[22℄ A. Ci
ho
ki, R. Zdunek, S. Choi, R. Plemmons, and S. Amari, \Novel multi-layer non-
negative tensor fa
torization with sparsity
onstraints," in Pro
. 8th Int. Conf. Adaptive
and Natural Computing Algorithms, April 2007.
102
BIBLIOGRAF
IA
[23℄ C. Cortes and V. Vapnik, \Support-Ve
tor Networks," Ma
hine Learning, Vol. 20, pp.
273-297, 1995.
[24℄ S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, \Inde-
xing by latent semanti
analysis,", J. Ameri
an So
iety for Information S
ien
e, Vol.
41, No. 6, pp. 391-407, 1990.
[25℄ S. Dixon, F. Gouyon, and G. Widmer, \Towards
hara
terisation of musi
via rhyth-
mi
patterns," in Pro
. Int. Conf. Musi
Information Retrieval (ISMIR), pp. 509-516,
O
tober 2004.
[26℄ D. Donoho, and V. Stodden, \When does non-negative matrix fa
torization give a
or-
re
t de
omposition into parts?," in S. Thrun, L. Saul, and B. S
holkopf, eds., Advan
es
in Neural Information Pro
essing Systems 16, Cambridge: MIT Press, 2004.
[27℄ B. Doval and X. Rodet, \Estimation of fundamental frequen
y of musi
al sound signals,"
in Pro
. IEEE Int. Conf. A
ousti
s, Spee
h, and Signal Pro
essing, Vol. 5, pp. 3657-3660,
April 1991.
[28℄ B. Doval and X. Rodet, \Fundamental frequen
y estimation and tra
king using maxi-
mum likelihood harmoni
mat
hing and HMMs," in Pro
. IEEE Int. Conf. A
ousti
s,
Spee
h, and Signal Pro
essing, Vol. 1, pp. 221-224, April 1993.
[29℄ P. Drineas, R. Kannan, and M. W. Mahoney, \Fast Monte Carlo algorithms for matri
es
III: Computing a
ompressed approximate matrix de
omposition,", SIAM J. Compu-
ting, Vol. 36, No. 1, pp.184-206, 2006.
[30℄ R. O. Duda, P. E. Hart, and D. G. Stork, \Pattern Classi
ation," 2nd Edition, New
York: John Wiley & Sons, November 2000.
[31℄ R. A. Fisher, \The Use of Multiple Measurements in Taxonomi
Problems," Annals of
Eugeni
s, Vol. 7, pp. 179-188, 1936.
[32℄ E. Gaussier and C. Goutte, \Relation between PLSA and NMF and impli
ations,"
in Pro
. Annual ACM Conf. Resear
h and Development in Information Retrieval, pp.
601-602, August 2005.
[33℄ G. H. Golub and C. F. Van Loan, \Matrix Computations," 3rd Ed., Baltimore MD:
Johns Hopkins University Press, 1996.
[34℄ E. Gomez, A. Klapuri, and B. Meudi
, \Melody des
ription and extra
tion in the
ontext
of musi
ontent pro
essing," J. New Musi
Resear
h, Vol. 32 No. 1, 2003.
103
BIBLIOGRAF
IA
[35℄ I. Guyon, J. Makhoul, R. S
hwartz, and V. Vapnik, \What size test set gives good error
rate estimates?,", IEEE Trans. Pattern Analysis and Ma
hine Intelligen
e, vol. 20, no.
1, pp. 52-64, January 1998.
[36℄ F. Gouyon, S. Dixon, E. Pampalk, and G. Widmer, \Evaluating rhythmi
des
riptors
for musi
al genre
lassi
ation," in Pr
. AES 25th Int. Conf., pp 196-204, June 2004.
[37℄ W. H. Greub, \Multilinear Algebra ", New York: Springer-Verlag, 1967.
[38℄ S. Haykin, \Neural networks: a
omprehensive foundation," 2nd Edition, Upper Saddle
River NJ, USA: Prenti
e Hall, 1999.
[39℄ T. Hazan, S. Polak, and A. Shashua, \Sparse image
oding using a 3D non-negative
tensor fa
torization," in Pro
. 10th IEEE Int. Conf. Computer Vision, Vol. 1, pp. 50-57,
O
tober 2005.
[40℄ N. He, J. Zhang, and S. Wang, \Combination of independent
omponent analysis and
multi-way prin
ipal
omponent analysis for bat
h pro
ess monitoring," in Pro
. 2004
IEEE Int. Conf. Systems, Man, and Cyberneti
s, pp. 530-535, O
tober 2004.
[41℄ F. van der Hedjen, R. P. W. Duin, D. de Ridder, and D. M. J. Tax, Classi
ation,
Parameter Estimation and State Estimation, London UK: Wiley, 2004.
[42℄ M. Heiler and C. S
hnorr, \Controlling sparseness in non-negative tensor fa
torization,"
in Pro
. 9th European Conf. Computer Vision, Vol. 1, pp. 56-67, May 2006.
[43℄ T. Hofmann, \Probabilisti
latent semanti
analysis," in Pro
. Fifteenth Conf. Un
er-
tainity in Artif
ial Intelligen
e, pp. 289-296, July 1999.
[44℄ P. O. Hoyer, \Non-negative matrix fa
torization with sparsness
onstraints," Journal
Ma
hine Learning Resear
h, Vol. 5, pp. 1457-1469, 2004.
[45℄ C. Hu, B. Zhang, S. Yan, Q. Yang, J. Yan, Z. Chen, and W. Ma, \Mining ratio rules
via prin
ipal sparse non-negative matrix fa
torization," in Pro
. 2004 IEEE Int. Conf.
Data Mining, 2004.
[46℄ A. Hyvarinen and E. Oja, \Independent
omponent analysis: algorithms and appli
a-
tions," Neural Networks, Vol. 13, pp. 411-430, 2000.
[47℄ A. Kapteyn, H. Neude
ker, and T. Wansbeek, \An approa
h to n-mode
omponents
analysis," Psy
hometri
a, Vol. 51, No. 2, pp. 269-275, June 1986.
[48℄ D. C. Kay, \Theory and Problems of Tensor Cal
ulus ", New York: M
Graw-Hill, 1988.
104
BIBLIOGRAF
IA
[50℄ P. M. Kroonenberg and J. De Leeuw, \Prin
ipal
omponent analysis of three-mode data
by means of alternating least squares algorithms," Psy
hometrika, Vol. 45, No.1, Mar
h
1980.
[51℄ L. De Lathauwer, \Signal Pro
essing Based on Multilinear Algebra ", Ph.D. thesis, K.U.
Leuven, E.E. Dept.-ESAT, Belgium, 1997.
[52℄ L. De Lathauwer, B. De Moor, and J. Vandewalle \An introdu
tion to independent
omponent analysis," J. Chemometri
s, Vol. 14, pp. 123-149, 2000.
[53℄ L. De Lathauwer, B. De Moor, and J. Vandewalle \A multilinear singular value de
o-
mposition," SIAM J. Matrix Analysis Appli
ations, Vol. 21, No. 4, pp. 1253-1278, April
2000.
[54℄ L. De Lathauwer, \Tensor de
ompositions and independent
omponent analysis," in
Pro
. Workshop Tensor De
ompositions, July 2004.
[55℄ D. D. Lee and H. S. Seung, \Algoritnms for non-negative matrix fa
torization," Adva-
n
es in Neural Information Pro
essing Systems, Vol. 13, pp. 556-562, 2001.
[56℄ S. Z. Li, X. Hou, H. Zhang, and Q. Cheng, \Learning spatially lo
alized, parts-based
representation," in Pro
. IEEE Conf. Computer Vision and Pattern Re
ognition, pp.
1-6, 2001.
[57℄ T. Li, M. Ogihara, and Q. Li, \A
omparative study on
ontent-based musi
genre
las-
si
ation," in Pro
. 26th Annual ACM Conf. Resear
h and Development in Information
Retrieval, pp. 282-289, July-August 2003.
[58℄ T. Lidy and A. Rauber, \Evaluation of feature extra
tors and psy
ho-a
ousti
tran-
sformations for musi
genre
lassi
ation," in Pro
. 6th Int. Conf. Musi
Information
Retrieval, pp. 34-41, September 2005.
[61℄ C. D. M. Martin, \Tensor de
ompositions workshop dis
ussion notes," Ameri
an Insti-
tute of Mathemati
s, Palo Alto CA, July 2004.
[62℄ A. Meng, P. Ahrendt, and J. Larsen, \Improving musi
genre
lassi
ation by short-time
feature integration," in Pro
. 6th Int. Symp. Musi
Information Retrieval, pp. 604-609,
September 2005.
[63℄ MPEG-7, \Information Te
hnology-Multimedia Content Des
ription Interfa
e-Part 4:
Audio," ISO/IEC JTC1/SC29/WG11 N5525, Mar
h 2003.
[64℄ F. Pa
het and D. Cazaly, \A taxonomy of musi
al genres," in Pro
. Content-Based
Multimedia Information A
ess Conf., April 2000.
[65℄ F. Pa
het, J.J. Au
outurier, A. La Burthe, A. Zils, and A. Beurive, \The
uidado musi
browser: an end-to-end ele
troni
musi
distribution system," Multimedia Tools and
Appli
ations, Spe
ial Issue on the CBMI 2003 Conf., 2004.
[66℄ E. Pampalk, A. Rauber, and D. Merkl, \Content-based organization and visualization
of musi
ar
hives," in Pro
. 10th ACM Int. Conf. Multimedia, pp. 570-579, De
ember
2002.
[67℄ E. Pampalk, A. Flexer, and G. Widmer, \Improvements of audio based musi
similarity
and genre
lassi
ation," in Pro
. 6th Int. Symp. Musi
Information Retrieval, pp. 628-
633, 2005.
[68℄ A. Papoulis, \Probability, Random Variables and Sto
hasti
Pro
esses ", New York:
M
Graw-Hill, 2nd ed., 1984.
[69℄ S. W. Park and M. Savvides \Tensor fa
torization by simultaneous estimation of mixing
fa
tors for robust fa
e re
ognition and synthesis," Le
ture Notes in Computer S
ien
e,
Vol. 4105, pp. 371-378, September 2006.
[70℄ G. Peeters, \A large set of audio features for sound des
ription (similarity and
lassi-
ation) in the CUIDADO proje
t," CUIDADO I.S.T. Proje
t Report, 2004.
[71℄ F. Pereira and R. Koenen, \Context, goals and pro
edures", in Introdu
tion to MPEG-
7, (B. S. Manjuntath, P. Salembier, and T. Sikora, Eds.), pp. 7-29, New York: Wiley,
2000.
[72℄ D. Perrott and R.O. Gjerdingen, \S
anning the dial: An exploration of fa
tors in the
identi
ation of musi
al style," Dept. Musi
, Northwestern University, Illinois, Res.
Notes, 1999.
106
BIBLIOGRAF
IA
[76℄ F. Rousseaux and A. Bonardi, \Re
on
ile art and
ulture on the Web: lessen the i-
mportan
e of instantiation so
reation
an better be
tion," in Pro
. 1st Int. Workshop
Philosophy Informati
s, April 2004.
[77℄ N. S
aringella and G. Zoia, \On the modeling of time information for automati
gen-
re re
ognition systems in audio signals," in Pro
. 6th Int. Symp. Musi
Information
Retrieval, pp. 666-671, September 2005.
[78℄ N. S
aringella, G. Zoia, and D. Mlynek, \Automati
genre
lassi
ation of musi
o-
ntent: a survey," IEEE Signal Pro
essing Mag., Vol. 23, No. 2, pp. 133-141, Mar
h
2006.
[79℄ B. S
holkopf, C. J. C. Burges, and A. J. Smola, \Advan
es in kernel methods: support
ve
tor learning," Cambridge MA, USA: MIT Press, 1999.
[80℄ A. Shashua and T. Hazan, \Non-negative tensor fa
torization with appli
ations to sta-
tisti
s and
omputer vision," in Pro
. 22nd Int. Conf. Ma
hine Learning, pp. 792-799,
August 2005.
[81℄ X. Shao, C. Xu, and M. Kankanhalli, \Unsupervised
lassi
ation of musi
al genre using
hidden Markov model," in Pro
. IEEE Int. Conf. Multimedia & Expo, pp. 2023-2026,
2004.
[82℄ H. Soltau, T. S
hultz, M. Westphal, and A. Waibel, \Re
ognition of musi
types," in
Pro
. IEEE Int. Conf. A
ousti
s, Spee
h Signal Pro
essing, Vol. II, pp. 1137-1140, 1998.
[83℄ S. Sra and I. S. Dhillon, \Nonnegative matrix approximation: algorithms and appli
a-
tions," Te
hni
al Report TR-06-27, Computer S
ien
es, University of Texas at Austin,
2006.
107
BIBLIOGRAF
IA
[84℄ G. Tzanetakis and P. Cook, \Musi
al genre
lassi
ation of audio signals," IEEE Trans.
Spee
h and Audio Pro
essing, Vol. 10, No. 5, pp. 293-302, July 2002.
[85℄ L. R. Tu
ker, \The extension of fator analysis to three-dinensional matri
es," in H.
Gulliksen, N. Frederiksen, Contributions to Mathemati
al Psy
ology, Holt, Rinehart &
Winston, N.Y., pp.109-127, 1964.
[86℄ M. A. O. Vasiles
u and D. Terzopoulos, \Multilinear independent
omponents analysis,"
in Pro
. IEEE Conf. Computer Vision and Pattern Re
ognition, Vol. 1, pp. 547-553,
June 2005.
[87℄ M. Welling and M Weber, \Positive tensor fa
torization," Pattern Re
ognition Letters,
Vol. 22, No. 12, pp. 1255-1261, O
tober 2001.
[88℄ K. West and S. Cox, \Finding an optimal segmentation for audio genre
lassi
ation,"
in Pro
. 6th Int. Symp. Musi
Information Retrieval, pp. 680-685, September 2005.
[89℄ C. K. Yoo, D. S. Lee, P. A. Vanrolleghem, \Appli
ation of multiway ICA for on-line
pro
ess monitoring of a sequen
ing bat
h rea
tor," Water Resear
h, Vol. 38, pp. 1715-
1732, 2004.
[90℄ E. Zwi
ker and E. Terhardt, \Analyti
al expressions for
riti
al-band rate and
riti
al
bandwidth as a fun
tion of frequen
y," J. A
ousti
al So
iety of Ameri
a, Vol. 68, No.
5, pp. 1523-1525, June 1980.
108
Euret rio
MPEG-7 AudioFundamentalFrequen
y , 85
Isqur Orjog¸nio
Bajmì
Tanust , 18
MPEG-7 AudioPower , 79
Isqur Orjog¸nioi Tanustè
, 15
MPEG-7 AudioSpe
trumCentroid , 80
Total Loudness , 84
Pnaka
, 6
Apoklsei
Bregman , 27
Polugrammik PCA , 47
Aposuntejeimèno
Tanust
, 14
Polustrwmatiko Per
eptrons , 90
Bajmì
Tanust , 18
Ropogenn tria 1h
Txh
, 22
Bajmwtì Ginìmeno Tanust¸n, 13
Ropogenn tria 2h
Txh
, 22
Bohjhtik Sunrthsh, 58
Rujmì
Mhdenism¸n, 84
Dianusmatikì
Q¸ro
, 5
Suntelestè
Autosusqètish
, 83
Suqnìthta apìsbesh
, 81
Epilog Qarakthristik¸n, 88
Sustol Tanust , 11
Eswterikì Ginìmeno Tanust¸n, 9
109
EURET
HRIO
Uper-summetrikì Tanust , 20
Upo-tanust , 15
110