You are on page 1of 28

Tng quan cc phng php xc nh khun

mt ngi
Phm Th Bo, Nguyn Thnh Nht, Cao Minh Thnh, Trn Anh Tun, Phan Phc Don

khun mt, [5, 6, 7, 32, 54, 95, 118,


I. GII THIU 130].
Hn mt thp k qua c rt nhiu cng trnh o Nhn dng ngi A [29, 38, 46, 55, 56, 58,
nghin cu v bi ton xc nh khun mt ngi t 60, 61] c phi l ti phm truy n hay
nh en trng, xm n nh mu nh ngy hm nay. khng? Gip c quan an ninh qun l tt
Cc nghin cu i t bi ton n gin, mi nh ch con ngi. Cng vic nhn dng c th
c mt khun mt ngi nhn thng vo thit b thu trong mi trng bnh thng cng nh
hnh v u t th thng ng trong nh en trng. trong bng ti (s dng camera hng
Cho n ngy hm nay bi ton m rng cho nh ngoi).
mu, c nhiu khun mt trong cng mt nh, c o H thng quan st, theo di [35, 35, 106] v
nhiu t th thay i trong nh. Khng nhng vy bo v. Cc h thng camera s xc nh
m cn m rng c phm vi t mi trng xung u l con ngi v theo di con ngi
quanh kh n gin (trong phng th nghim) cho xem h c vi phm g khng, v d xm
n mi trng xung quanh rt phc tp (nh trong phm khu vc khng c vo, .
t nhin) nhm p ng nhu cu tht s v rt nhiu o Lu tr (rt tin ATM, bit ai rt tin
ca con ngi. vo thi im ), hin nay c tnh trng
1. nh ngha bi ton xc nh khun mt nhng ngi b ngi khc ly mt th
ngi ATM hay mt m s PIN v nhng ngi
Xc nh khun mt ngi (Face Detection) l n cp ny i rt tin, hoc nhng ngi
mt k thut my tnh xc nh cc v tr v cc ch th i rt tin nhng li bo cho ngn
kch thc ca cc khun mt ngi trong cc nh hng l mt th v mt tin. Cc ngn hng
bt k (nh k thut s). K thut ny nhn bit cc c nhu cu khi c giao dch tin s kim tra
c trng ca khun mt v b qua nhng th khc, hay lu tr khun mt ngi rt tin sau
nh: ta nh, cy ci, c th, [105]. i chng v x l [66, 81, 98, 133].
o Th cn cc, chng minh nhn dn (Face
2. ng dng ca phng php xc nh khun
Identification) [114].
mt ngi
o iu khin vo ra: vn phng, cng ty, tr
C nhiu ng dng c v ang thit k, ti
s, my tnh, Palm, . Kt hp thm vn
ch xin a ra mt s loi ng dng sau:
tay v mng mt. Cho php nhn vin c
o H thng tng tc gia ngi v my:
ra vo ni cn thit, hay mi ngi s ng
gip nhng ngi b tt hoc khim khuyt
nhp my tnh c nhn ca mnh m khng
c th trao i. Nhng ngi dng ngn
cn nh tn ng nhp cng nh mt khu
ng tay c th giao tip vi nhng ngi
m ch cn xc nh thng qua khun mt
bnh thng. Nhng ngi b bi lit thng
[44].
qua mt s k hiu nhy mt c th biu l
o An ninh sn bay, xut nhp cnh (hin nay
nhng g h mun, . l cc bi ton
c quan xut nhp cnh M p dng).
iu b ca bn tay (hand gesture), iu b

1
Dng xc thc ngi xut nhp cnh v II. PHNG PHP XC NH KHUN MT
kim tra c phi l nhn vt khng b NGI
khng. C nhiu nghin cu tm phng php xc nh
o Tng lai s pht trin cc loi th thng khun mt ngi, t nh xm n ngy nay l nh
minh c tch hp sn c trng ca ngi mu. Ti s trnh by mt cch tng qut nht nhng
dng trn , khi bt c ngi dng khc hng gii quyt chnh cho bi ton, t nhng
dng truy cp hay x l ti cc h thng hng chnh ny nhiu tc gi thay i mt s nh
s c yu cu kim tra cc c trng bn trong c kt qu mi.
khun mt so vi th bit nay c phi l Da vo tnh cht ca cc phng php xc nh
ch th hay khng. khun mt ngi trn nh. Cc phng php ny
o Tm kim v t chc d liu lin quan n c chia lm bn [9] hng tip cn chnh. Ngoi
con ngi thng qua khun mt ngi trn bn hng ny, nhiu nghin cu c khi lin quan
nhiu h c s d liu lu tr tht ln, nh n khng nhng mt hng tip cn m c lin
internet, cc hng truyn hnh, . V d: quan nhiu hn mt hng chnh:
tm cc on video c tng thng Bush pht o Hng tip cn da trn tri thc: M ha cc
biu, tm cc phim c din vin L Lin hiu bit ca con ngi v cc loi khun mt
Kit ng, tm cc trn banh c Ronaldo ngi thnh cc lut. Thng thng cc lut
, [50, 94, 134]. m t quan h ca cc c trng.
o Hng tip cn da trn c trng khng
o Hin nay c nhiu hng tip cn xc
thay i: Mc tiu cc thut ton i tm cc
nh mt nh c phi l nh kha thn hay c trng m t cu trc khun mt ngi m
khng? Khun mt ngi c xem nh cc c trng ny s khng thay i khi t th
mt yu t xc nh cho mt hng tip khun mt, v tr t thit b thu hnh hoc
cn m c dng gn y [271, 272]. iu kin nh sng thay i.
o Hng tip cn da trn so khp mu: Dng
o ng dng trong video phone [10].
cc mu chun ca khun mt ngi (cc mu
o Phn loi trong lu tr hnh nh trong in ny c chn la v lu tr) m t cho
thoi di ng. Thng qua bi ton xc nh khun mt ngi hay cc c trng khun mt
khun mt ngi v trch c trng, ri da (cc mu ny phi chn lm sao cho tch bit
vo c trng ny sp xp lu tr, gip nhau theo tiu chun m cc tc gi nh ra
so snh). Cc mi tng quan gia d liu
ngi s dng d dng truy tm khi cn
nh a vo v cc mu dng xc nh
thit [69, 105]. khun mt ngi.
o Kim tra trng thi ngi li xe c ng gt, o Hng tip cn da trn din mo: Tri
mt tp trung hay khng, v h tr thng ngc hn vi so khp mu, cc m hnh (hay
bo khi cn thit [109]. cc mu) c hc t mt tp nh hun luyn
trc . Sau h thng (m hnh) s xc
o Phn tch cm xc trn khun mt [112].
nh khun mt ngi. Hay mt s tc gi cn
o Trong lnh vc thit k iu khin robot gi hng tip cn ny l hng tip cn theo
[42, 43, 124, 151, 236]. phng php hc.
o Hng my chp hnh Canon ng dng 1. Hng tip cn da trn tri thc
bi ton xc nh khun mt ngi vo my Trong hng tip cn ny, cc lut s ph thuc
chp hnh th h mi cho kt qu hnh rt ln vo tri thc ca nhng tc gi nghin cu v
nh p hn, nht l khun mt ngi bi ton xc nh khun mt ngi. y l hng
[277]. tip cn dng top-down. D dng xy dng cc lut
c bn m t cc c trng ca khun mt v cc
quan h tng ng. V d, mt khun mt thng c
hai mt i xng nhau qua trc thng ng gia
khun mt v c mt mi, mt ming. Cc quan h

2
ca cc c trng c th c m t nh quan h v bn, v mc khc nhau gia cc gi tr xm
khong cch v v tr. Thng thng cc tc gi s trung bnh ca phn trung tm v phn bao bn trn
trch c trng ca khun mt trc tin c c l ng k. phn gii thp nht (mc m) ca
cc ng vin, sau cc ng vin ny s c xc nh dng tm ng vin khun mt m cn tm
nh thng qua cc lut bit ng vin no l cc mc phn gii tt hn. mc hai, xem xt biu
khun mt v ng vin no khng phi khun mt. histogram ca cc ng vin loi bt ng vin
Thng p dng qu trnh xc nh gim s no khng phi l khun mt, ng thi d ra cnh
lng xc nh sai. bao xung quanh ng vin. mc cui cng, nhng
Mt vn kh phc tp khi dng hng tip ng vin no cn li s c xem xt cc c trng
cn ny l lm sao chuyn t tri thc con ngi sang ca khun mt v mt v ming. Hai ng dng
cc lut mt cc hiu qu. Nu cc lut ny qu chi mt chin lc t th n mn hay lm r dn
tit (cht ch) th khi xc nh c th xc nh thiu gim s lng tnh ton trong x l. Mc d t l
cc khun mt c trong nh, v nhng khun mt chnh xc cha cao, nhng y l tin cho nhiu
ny khng th tha mn tt c cc lut a ra. nghin cu sau ny [200].
Nhng cc lut tng qut qu th c th chng ta s Kotropoulos v Pitas [200] a mt phng
xc nh lm mt vng no khng phi l khun php tng t [191, 261] dng trn phn gii
mt m li xc nh l khun mt. V cng kh khn thp. Hai ng dng phng php chiu xc nh
m rng yu cu t bi ton xc nh cc khun cc c trng khun mt, Kanade thnh cng vi
mt c nhiu t th khc nhau. phng php chiu xc nh bin ca khun mt
[191]. Vi I(x,y) l gi tr xm ca mt im trong
nh c kch thc m x n ti v tr (x,y), cc hm
chiu nh theo phng ngang v thng ng c


n
Hnh 1: (a) nh ban u c phn gii n=1; nh ngha nh sau: HI ( x) = y=1
I ( x, y) v
(b), (c), v (d) nh c phn gii n=4, 8, v 16.
V I ( y) = x=1 I ( x, y) . Da trn biu hnh chiu
m

ngang, c hai cc tiu a phng khi hai ng xt


qu trnh thay i c ca HI, chnh l cnh
bn tri v phi ca hai bn u. Tng t vi hnh
chiu dc VI, cc cc tiu a phng cng cho ta
Hnh 2: Mt lai tri trc ca ngi nghin cu phn tch trn khun mt. bit v tr ming, nh mi, v hai mt. Cc c trng
Yang v Huang [261] dng mt phng thc ny xc nh khun mt. Hnh 3.a cho mt v
theo hng tip cn ny xc cc khun mt. H d v cch xc nh nh trn. Cch xc nh ny c
thng ca hai tc gi ny bao gm ba mc lut. t l xc nh chnh xc l 86.5% cho trng hp ch
mc cao nht, dng mt khung ca s qut trn nh c mt khun mt thng trong nh v hnh nn
v thng qua mt tp lut tm cc ng vin c th khng phc tp. Nu hnh nn phc tp th rt kh
l khun mt. mc k tip, hai ng dng mt tp tm, hnh 3.b. Nu nh c nhiu khun mt th s
lut m t tng qut hnh dng khun mt. Cn khng xc nh c, hnh 3.c.
mc cui cng li dng mt tp lut khc xem xt
mc chi tit cc c trng khun mt. Mt h
thng a phn gii c th t c dng xc
nh, hnh 1. Cc lut mc cao nht tm ng
vin nh: vng trung tm khun mt (phn ti hn Hnh 3: Phng php chiu:
trong hnh 2) c bn phn vi mt mc u c (a) nh ch c mt khun mt v hnh nn n gin;
bn, phn xung quanh bn trn ca mt khun mt (b) nh ch c mt khun mt v hnh nn phc tp;
(c) nh c nhiu khun mt
(phn sng hn trong hnh 2) c mt mc u c

3
Rodrigues v Buf [132] dng phng php chn
cc keypoint trong nhiu t l khc nhau, c bit
tc gi ch dng cc keypoint d tha da trn nhiu
phn gii. Da trn quan h hnh hc ca cc
thnh phn khun mt, hai ng nhm cc keypoint
Hnh 4: Chiu tng phn ng vin xc nh khun mt. li xc nh khun mt ngi.
Fan [82] phn on nh mu tm cnh thng Fred [1140] d trn tnh cht i xng ca
qua thut ton tng vng xc nh cc ng vin. khun mt ngi, ng xem xt cc phn b trn
Dng c tnh hnh ellipse ca khun mt ngi histogram c tnh cht gn i xng xc nh
xc nh ng vin no khun mt ngi. khun mt ngi trong nh xm n c khun mt
Kim [65] kt hp thut ton watershed cho cc chp thng.
nh c nhiu phngii cng m hnh mu da Berbar [279] kt hp m hnh mu da ngi v
ngi tm ng vin, ri xc nh khun mt xc nh cnh tm ng vin khun mt ngi. Sau
ngi trong video. T l chnh xc khong 87-94%. kt hp quan h cc c trng v phng php
Phng php ch x l cho cc frame nh ch c mt chiu cc ng vin khun mt xung hai trc: dng
khun mt v nh ny phi chp thng ch c u v v ngang xc nh ng vin no tht s l khun
vai. mt ngi.
Sahbi v Boujemaa [8] s dng mng neural hc 2. Hng tip cn da trn c trng khng
c lng cc tham s cho m hnh Gauss, mc thay i
ch tm ng vin trn sc mu da ca ngi. Sau y l hng tip cn theo kiu bottom-up. Cc
khi c ng vin, hai ng chiu ln hai trc: ng v tc gi c gng tm cc c trng khng thay i ca
ngang xc nh khun mt ngi. khun mt ngi xc nh khun mt ngi. Da
C nhiu nghin cu sau ny s dng phng trn nhn xt thc t, con ngi d dng nhn bit
php chiu xc nh khun mt ngi. Min [80] cc khun mt v cc i tng trong cc t th
dng m hnh mu da khng tham s, Baskan [76], khc nhau v iu kin nh sng khc nhau, th phi
Mateos [74], v Nicponski [45] xy dng b lc, tn ti cc thuc tnh hay c trng khng thay i.
tm ng vin khun mt, sau chiu ln hai trc C nhiu nghin cu u tin xc nh cc c trng
xc nh cc thnh phn khun mt xc nh ng khun mt ri ch ra c khun mt trong nh hay
vin c phi l khun mt hay khng. Cn khng. Cc c trng nh: lng my, mt, mi,
Mateos v Chicote [34] dng kt cu xc nh ming, v ng vin ca tc c trch bng
ng vin trong nh mu. Sau phn tch hnh dng, phng php xc nh cnh. Trn c s cc c
kch thc, thnh phn khun mt xc nh trng ny, xy dng mt m hnh thng k m t
khun mt. Khi tm c ng vin khun mt, hai quan h ca cc c trng ny v xc nh s tn ti
ng trch cc ng vin ca tng thnh phn khun ca khun mt trong nh. Mt vn ca cc thut
mt, sau chiu tng phn ny xc thc c tan theo hng tip cn c trng cn phi iu
phi l thnh phn khun mt hay khng, hnh 4. T chnh cho ph hp iu kin nh sng, nhiu, v b
l chnh xc hn 87%. che khut. i khi bng ca khun mt s to thm
Farhad v Abdolhorsein [136] dng tri thc v cnh mi, m cnh ny li r hn cnh tht s ca
histogram xc nh khun mt trong cc frame khun mt, v th nu dng cnh xc nh s gp
lin tc trong mt on video. Tng t, Hidekazu kh khn.
v Mamoru [100, 139] cng dng histogram, nhng
a) Cc c trng khun mt
hai ng dng thut gii di truyn (Genetic Algorithm Sirohey a mt phng php xc nh khun
GA) lai nh l mt phng php tm kim ngu mt t mt nh c hnh nn phc tp [240]. Phng
nhin da vo nh ca biu mu ca nh. php da trn cnh (dng phng php Candy [155]
v heuristics loi b cc cnh cn li duy nht

4
mt ng bao xung quanh khun mt. Mt hnh trng khc ca khun mt. Ging nh xy dng mt
ellipse dng bao khun mt, tch bit vng u th quan h mi node ca th tng ng nh
v hnh nn. T l chnh xc ca thut tan l 80%. cc c trng ca mt khun mt, a xc sut vo
Cng dng phng php cnh nh Sirohey, xc nh. T l xc nh chnh xc l 86%.
Chetverikov v Lerch dng mt phong php da Bn cnh tnh khang cch lin quan m t
trn blob v streak (hnh dng git nc v sc xen quan h gia cc c trng nh Leung [154, 206].
k), xc nh theo hng cc cnh [157]. Hai ng Kendall [195] v [212] dng l thuyt xc sut thng
dng hai blob ti v ba blob sng m t hai mt, k v hnh dng. Dng hm mt xc sut
hai bn g m, v mi. M hnh ny dng cc treak (Probility Density Function - PDF) qua N im c
m t hnh dng ngoi ca khun mt, lng my, trng, tng ng (xi, yi) l c trng th i vi gi s
v mi. Dng nh c phn gii thp theo bin da vo phn b Gauss c 2N-chiu. Cc tc gi p
i Laplace xc nh khun mt thng qua blob. dng phng thc cc i kh nng (Maximum-
Graf a ra mt phng php xc nh c Likelihood - ML) xc nh v tr khun mt. Mt
trng ri xc nh khun mt trong nh xm [180]. thun li ca phng php ny l cc khun mt b
Dng b lc lm ni cc bin, cc php tan hnh che khut vn c th xc nh c. Nhng phng
thi hc (morphology) c dng lm ni bt cc php khng xc nh c a khun mt trong nh.
vng c cng cao v hnh dng chc chn (nh Yow v Cipolla [265, 266] trnh by mt
mt). Thng qua histogram tm cc nh ni bt phng thc da vo c trng, dng s lng ln
xc nh cc ngng chuyn nh xm thnh hai cc du hiu t nh v c du hiu v ng cnh. u
nh nh phn. Cc thnh phn dnh nhau u xut tin dng b lc o hm Gauss th hai, xc nh
hin trong hai nh nh phn th c xem l vng cc im mu cht ti cc i a phng trong b
ca ng vin khun mt ri phn loi xem c phi l lc, ri ch ra ni c th l c trng. Giai on hai,
khun mt khng. Phng php c kim tra trn kim tra cc cnh xung quanh im mu cht v
cc nh ch c u v vai ca ngi. Tuy nhin cn nhm chng li thnh cc vng. Tiu chun nhm
vn , lm sao s dng cc php ton morphology cc cnh l gn v tng t hng v cng . o
v lm sao xc nh khun mt trn cc vng ng lng cc c tnh vng nh: chiu di cnh, cng
vin. cnh, v bin thin cng c lu trong mt
Leung trnh by mt m hnh xc sut xc vector c trng. T d liu c trng khun mt
nh khun mt trong nh c hnh nn phc tp c hun luyn, s tnh c gi tr trung bnh v
trn c s mt b xc nh c trng cc b v so ma trn hip phng sai ca mi c trng khun
khp th ngu nhin [205]. chnh l xem bi mt. Mt vng l ng vin khun mt khi khong
ton xc nh khun mt nh l bi ton tm kim cch Mahalanobis gia cc vector c trng u
vi mc tiu l tm th t cc c trng chc chn di mt ngng. Ri thng qua mng Bayes xc
ca khun mt to thnh ging nht mt mu nh ng vin c phi l khun mt khng. T l
khun mt. Dng nm c trng (hai mt, hai l chnh xc l 85% [267], tuy nhin mc sai l
mi, phn ni gia mi v ming) m t mt 28%, v ch hiu qu vi hnh khun mt c kch
khun mt. Lun tnh quan h khong cch vi cc thc 60x60 im nh. Phng php ny c dng
c trng cp (nh mt tri, mt phi), dng phn b thm vi m hnh ng vin linh hat [158, 267].
Gauss m hnh ha. Mt mu khun mt c Takacs v Wechsler trnh by mt phng php
a ra thng qua trung bnh tng ng cho mt tp da trn tch c trng vng mc v c ng theo
a hng, a t l ca b lc o hm Gauss. T dao ng nh ca mt [250]. Thut ton hot ng
mt nh, cc c trng ng vin c xc nh bng trn bn hay vng ca cc mu cht, m hnh ha
cch so khp tng im nh khi lc tng ng vi li vng mc. u tin tnh ton c lng th
vector mu (tng t mi tng quan), chn hai ng vng khun mt trn c s b lc. Giai on th hai
vin c trng ng u tm kim cho cc c

5
tinh ch trn phn gii mn hn. T l sai l c th c cc t th, v tr, t l khc nhau. T l
4.69%. chnh xc l 53%.
Han pht trin mt k thut trn c s Kruppa [21] dng sc mu ca da ngi tm
morphology trch cc on ging mt (eye- ng vin, nhng ng khng x l cho tng im nh
analogue) xc nh khun mt ngi [182]. ng theo cch thng thng, m ng dng m hnh mu
ni rng mt v lng my l c trng ni bt nht da ngi trn tng phn nh ri x l phn on
v n nh nht ca khun mt con ngi, v n rt trn . Sau khi c ng vin khun mt, ng dng
hu dng xc nh khun mt ngi. ng nh mt s c tnh v hnh dng xc nh khun mt
ngha cc on ging mt nh l cc cnh trn ngi. T l chnh xc l 85%.
ng vin ca mt. u tin, cc php tan Park dng Gaze tm ng vin gc mt, ming
morphology nh ng, ct b sai khc, v phn v tm mt [27]. ng xy dng SVM c hc
ngng trch cc im nh c gi tr cng trc xc nh cc v tr ng vin c phi l
thay i ng k. Cc im nh ny s tr thnh cc gc mt, ming, v tm mt hay khng theo vt
im nh ging mt. Sau mt tin trnh gn nhn con mt ngi.
sinh cc on ging mt. Cc on ny c Sato [67] dng quan h ng vin cm ca
dng ch dn tm kim cc vng tim nng c th khun mt. Tc gi chia lm hai trng hp: thon
l khun mt qua kt hp cc c tnh hnh hc ca di v trn xem xt. Tc gi dng GA xem xt
mt, mi, lng my, v ming. Cc vng ny s mi tng quan ca ng cong, hnh dng khun
c mt mng neural xem xt c phi l khun mt mt xc nh khun mt.
khng, ging [48]. Theo tc gi t l chnh xc l Chai v Ngan [708] xy dng phng php xc
94%. nh khun mt ngi da trn c trng v: quan
Amit a ra phng thc xc nh khun mt h hnh hc, mt , chi trong nh mu ch c
da trn hnh dng v p dng cho cc khun mt u v vai ca ng vin xc nh. Kim [47] cng
chp thng [145]. C hai giai on xc nh phn on tm ng vin khun mt, nhng xc
khun mt ngi: tp trung v phn loi chi tit. thc khun mt thng qua cc cu trc cc c trng
Lm c th t cc mnh cnh, cc mnh ny c mt, mi, ming, v ng vin ca ng vin.
trch t b xc nh cnh n gin thng qua s Jang [53] dng phn b mu da phn on
khc bit cng l qu trnh tp trung. Khi c cc tm ng vin ri dng cc c trng hnh hc xc
ng vin t qu trnh trn, dng thut ton CART nh khun mt.
[152] xy dng mt cy phn loi t cc nh Christian v Jonh [135] xy dng mt loi c
hun luyn, xem xt ng vin no l khun mt trng mi, l c trng v cong ca cc ng
ngi. trn khun mt gii quyt vn iu kin nh
Jin [90] dng cu trc hnh hc ca khun mt sng. T c trng cong ny, hai ng quay li
ngi tm ng vin khun mt trong nh xm v phng php PCA xc nh khun mt.
hnh nn khng phc tp. Mi nh ch c mt khun Juan v Narciso [111] xy dng mt khng gian
mt ngi, nhng t th iu kin nh sng, khng mu mi YCgCr lc cc vng l ng vin khun
c nh. T l chnh xc khang 94.25% v thi gian mt da trn sc thi ca mu da ngi. Sau khi c
kh nhanh. ng vin, hai ng dng cc quan h v hnh dng
Chan v Lewis [16] dng k thut lc loi khun mt, mc cn i ca cc thnh phn
bt tc ng ca nh sng, sau phn on tm khun mt xc nh khun mt ngi. Tng t,
v tr cc ng vin l con mt. T cc ng vin ny Chang v Hwang [127] cng dng mt phng thc
xy dng mng neural nh Rowley [48] xc nh nh [111], t l chnh xc hn 80% trong nh xm.
khun mt ngi. Phng php ny c th xc nh Dae v Nam [116] xem xt cc c trng khng
nhiu khun mt trong mt nh, cc khun mt ny thay i khi thay i t th ca khun mt bng
cch xem xt cc quan h hnh hc. Sau c

6
lng cc t th ca khun mt ri xy dng d liu c) Sc mu ca da
xc nh thng qua PCA. T l chnh xc l 76%. Thng thng cc nh mu khng xc nh trc
Jin [128] xy dng mt b lc xc nh ng tip trn ton b d liu nh m cc tc gi dng
vin khun mt ngi theo mu da ngi. T ng tnh cht sc mu ca da ngi (khun mt ngi)
vin ny tc gi xc nh khun mt ngi theo hnh chn ra c cc ng vin c th l khun mt
dng khun mt v cc quan h c trng v thnh ngi (lc ny d liu thu hp ng k) xc
phn khun mt, vi mt phi c chn lm gc nh khun mt ngi. Ti s trnh by chi tit v
ta xt quan h. T l chnh xc cho khun m hnh ha mu da ngi mt bi sau.
mt chp thng trn 80%. d) a c trng
b) Kt cu Gn y c nhiu nghin cu s dng cc c
Khun mt con ngi c nhng kt cu ring trng ton cc nh: mu da ngi, kch thc, v
bit m c th dng phn loi so vi cc i hnh dng tm cc ng vin khun mt, ri sau
tng khc. Augusteijn v Skufca cho rng hnh s xc nh ng vin no l khun mt thng qua
dng ca khun mt dng lm kt cu phn loi dng cc c trng cc b (chi tit) nh: mt, lng
[147], gi l kt cu ging khun mt (face-like my, mi, ming, v tc. Ty mi tc gi s s dng
texture). Tnh kt cu qua cc c trng thng k th tp c trng khc nhau [70, 186].
t th hai (SGLD) [183] trn vng c kch thc Yachida a ra mt phng php xc nh
16x16 im nh. C ba loi c trng c xem xt: khun mt ngi trong nh mu bng l thuyt logic
mu da, tc, v nhng th khc. Hai ng dng mng m [156, 259, 260]. ng dng hai m hnh m
neural v mi tng quan cascade [170] cho phn m t phn b mu da ngi v mu tc trong khng
loi c gim st cc kt cu v mt nh x c trng gian mu CIE XYZ. Nm m hnh hnh dng ca
t t chc Kohonen [199] gom nhm cc lp kt u (mt thng v bn xoay xung quanh) m t
cu khc nhau. Hai tc gi xut dng phng hnh dng ca mt trong nh. Mi m hnh hnh
php bu c khi khng quyt nh c kt cu a dng l mt mu 2-chiu bao gm cc vung c
vo l kt cu ca da hay kt cu ca tc. kch thc mxn, mi c th cha nhiu hn mt
Dai v Nakano dng m hnh SGLD xc nh im nh. Hai thuc tnh c gn cho mi l: t
khun mt ngi [165]. Thng tin mu sc c kt l mu da v t l tc, ch ra t l din tch vng da
hp vi m hnh kt cu khun mt. Hai tc gi xy (tc) trong so vi din tch ca . Mi im nh s
dng thut gii xc nh khun mt trong khng c phn loi thnh tc, khun mt, tc/khun mt,
gian mu, vi cc phn ta mu cam xc nh cc v tc/nn trn c s phn b ca m hnh, theo
vng c th l khun mt ngi. Mt thun li ca cch s c c cc vng ging khun mt v
phng php ny l c th xc nh khun mt ging tc. M hnh hnh dng ca u s c so
khng ch chp thng v c th c ru v c knh. snh vi vng ging khun mt v ging tc. Nu
Mark v Andrew [12] dng phn b mu da v tng t, vng ang xt s tr thnh ng vin khun
thut ton DoG (a Difference of Gauss) tm cc mt, sau dng cc c trng mt-lng my v
ng vin, ri xc thc bng mt h thng hc kt mi-ming xc nh ng vin no s l khun
cu ca khun mt. mt tht s.
Manian v Ross [88] dng bin i wavelet Sobottka v Pitas dng cc c trng v hnh
xy dng tp d liu kt cu ca khun mt trong dng v mu sc xc nh khun mt ngi [241].
nh xm thng qua nhiu phn gii khc nhau kt Dng mt ngng phn on trong khng gian
hp xc sut thng k xc nh khun mt ngi. mu HSV xc nh cc vng c th l mu da
Mi mu s c chn c trng. T l chnh xc l ngi (vng ging mu da ngi) [251, 252], cc
87%, t l xc nh sai l 18%. tin ng vin. Cc thnh phn dnh nhau s c xc
nh bng thut ton tng vng phn gii th.
Xem xt tin ng vin no va khp hnh dng

7
ellipse s c chn lm ng vin ca khun mt. blob. Mi vector c trng bao gm ta nh v
Sau dng cc c trng bn trong nh: mt v sc mu c chun ha,
ming, c trch ra trn c s cc vng mt v r g
X = ( x, y, , ) [218, 243]. Dng
ming s ti hn cc vng khc ca khun mt, sau r + g+ b r + g+ b
cng phn loi da trn mng neural bit vng mt thut ton to cc vng lin kt li vi nhau
ng vin no l khun mt ngi v vng no khng tng kch thc ca blob v xem xt nu ng vin
phi khun mt ngi. T l chnh xc l 85%. dng blob no tha mn hnh dng kch thc khun
Da vo mc cn xng ca cc mu khun mt th xem l khun mt.
mt ngi xc nh khun mt ngi [154]. Mt Phm vi v mu sc c Kim [197] dng
b phn loi mu da/khng phi mu da dng trong xc nh khun mt ngi. Tnh biu chnh lch
khng gian mu YES cho php lm mn cc vng k ri phn on da trn biu histogram vi gi
c ng cong khng mn, sau khi lc cc vng c thuyt cc im nh l nn s c cng su v s
th l mu da ngi. Mt mu khun mt dng lng s nhiu hn cc im nh trong i tng.
ellipse c dng xem xt mc tng t ca Dng phn b Gauss trong khng gian mu RGB
cc vng c cng mu da ngi vi mu ny thng c chun ha, c cc ng vin ri dng phn
qua khong cch Hausdorff [188]. Sau cng, xc loi xc nh cui cng ng vin no l khun
nh tm mt thng qua cc hm tnh gi tr da trn mt ngi. Cng cc tip cn ny c Darrell [84].
quan h cn i ca khun mt v v tr hai mt. Hsu c xem l ngi kh thnh cng khi xc
nh ca mi v tm ca ming c c lng qua nh khun mt ngi trong nh mu [1, 96]. ng
khong cch tm mt. Mt hn ch ca phng php xy dng mt b phn loi xc nh cc v tr ca
ny l ch xc nh trn nh chp thng khun mt, ng vin mt v ming da trn sc mu c trng
ch c duy nht mt khun mt trong nh, v xc ca mt v ming. Trn quan h v khong cch ca
nh c v tr ca c hai mt. Cng c tc gi dng hai mt v ming xc nh ng vin no s l
phng php tng t gii quyt [245]. khun mt thng qua bin i Hough c ng vin
Tri ngc vi phng php x l trn im no gn ging dng ellipse nht.
nh, mt phng php c xy dng trn cu trc, Jesorsky [270] xc nh cnh ca cc i tng
mu sc, v lin quan hnh hc c ngh trong nh ri so snh hnh dng kt hp dng
[262]. u tin dng phn on a t l [144] khong cch Hausdorff o mc tng t ca
trch cc vng ng u trong nh da vo m hnh khun mt ngi vi cc mu. Sau Kirchberg
mu da ngi theo Gauss c c cc vng c [17] ci tin dng m hnh Gen (Genetic Model)
mu cng vi mu da ngi, gom cc vng ny vo pht sinh m hnh khun mt ngi t d liu ln
trong cc vng c hnh dng ellipse. Mt vng c xn sau khi phn on trong nh xm kt hp
hnh dng ellipse c xc nh l mt khun mt khong cch Hausdorff. Mc chnh xc khang
ngi nu tn ti mt ming trong vng . Tc gi 85%.
cho bit c th xc nh cc khun mt cc hng Yen v Nithianandan [66] dng GA trch cc
khc nhau khi c thm cc c trng ph nh: ru, c trng khun mt, nh mt (lng my), mi, v
mt knh. ming. p dng hnh thi khun mt ging hnh
Kauth trnh by mt biu din dng blob trch ellipse xc nh khun mt bng GA trong nh
c trng, m c trng ny dng t t c ngha mu. Phng php ny cho php gii quyt trong
cu trc ca a ph ca nh chp t v tinh [194]. iu kin nh sng khc nhau, t th khun mt khc
Mi vector c trng ti mt im nh bao gm cc nhau.
ta ca im nh v lin quan theo cc thnh Chang [89] xem xt tnh a dng v mt ca
phn ph (hay cc thnh phn kt cu). Cc im khun mt ngi. T y ng xy dng mng
nh ny c gom nhm bng cch dng vector c wavelet tch cc (Active Wavelet Network) trch
trng c cc vng dnh lin nhau, hoc c dng cc c trng ca khun mt ri dng hai phng

8
php lm gim s chiu ca khng gian c trng l Sau tip tc tm ming v lng my xc nh
LLE (Locally Linear Embedding) v LE (Lipschitz ng vin ny c phi l khun mt ngi hay khng.
Embedding) v hc cu trc a dng ny xc nh
a) Xc nh cc mu trc
khun mt. Sakai c gng th xc nh khun mt ngi
Daidi v Irek [117] trch cc c trng ca chp thng trong nh [232]. ng dng vi mu con
khun mt bng s phn b tham s xc nh v mt, mi, ming, v ng vin khun mt m
khun mt ngi. T l chnh xc cho nh xm v hnh ha mt khun mt. Mi mu con c nh
khun mt c chp thng l 91.4%. ngha trong gii hn ca cc on thng. Cc ng
Ehsan v Jonh [125] dng tp h s Gabor thng trong nh c trch bng phng php xem
wavelet cc hng khc nhau trch cc c xt thay i gradient nhiu nht v so khp cc mu
trng ca khun mt. Sau dng entropy cc b con. u tin tm cc ng vin thng qua mi tng
xc nh khun mt trong nh xm v khun mt quan gia cc nh con v cc mu v ng vin.
c chp thng hay ta thng nhng c cc v tr Sau , so khp vi cc mu con khc. Hay ni mt
khc nhau. T l chnh xc l 94%. cch khc, giai on u xem nh l giai on s
Bao [281, 282] dng sc thi mu da ngi ch tm ng vin, giai an th hai l giai on
xc nh ng vin trong nh mu. Tc gi xy tinh ch xc nh c tn ti hay khng mt khun
dng cc lut m da vo hai loi c trng: (1) bn mt ngi. tng ny c duy tr cho n cc
ngoi v (2) bn trong. c trng bn ngoi gm: t nghin cu sau ny.
l chiu cao, din tch, chu vi, mc trn, c Craw a ra mt phng php xc nh khun
trng bn trong gm: quan h mc cn i ca mt ngi da vo cc mu v hnh dng ca cc
hai mt v ming cng nh t l khong cch vi nh c chp thng (dng v b ngoi ca hnh
khun mt. Phng php ny cho php xc nh dng khun mt) [163]. u tin dng php lc
khun mt nhiu t th, v tr, mc nghing Sobel tm cc cnh. Cc cnh ny s c nhm
khc nhau trong mi trng phc tp. c bit, tc li theo mt s rng buc. Sau , tm ng vin
gi xy dng b iu khin m tch cc khun ca u, qu trnh tng t c lp i lp li vi
mt dnh ln nhau. T l chnh xc khong 87%- mi t l khc nhau xc nh cc c trng khc
89%. nh: mt, lng my, v mi. Sau Craw m t mt
3. Hng tip cn da trn so khp mu phng thc xc nh dng mt tp c 40 mu
Trong so khp mu, cc mu chun ca khun tm cc c trng khun mt v iu khin chin
mt (thng l khun mt c chp thng) s c lc d tm [164].
xc nh trc hoc xc nh cc tham s thng qua Govindaraju ngh mt phng thc xc nh
mt hm. T mt nh a vo, tnh cc gi tr tng khun mt ngi c hai giai an pht sinh cc
quan so vi cc mu chun v ng vin khun gi thuyt khun mt v kim tra n [177, 178, 179].
mt, mt, mi v ming. Thng qua cc gi tr tng Mt m hnh khun mt c xy dng trong cc
quan ny m cc tc gi quyt nh c hay khng c giai on xc nh c trng bng cc cnh. Cc
tn ti khun mt trong nh. Hng tip cn ny c c trng c m t nh cc ng cong ca pha
li th l rt d ci t, nhng khng hiu qu khi t bn tri, ng vin tc, pha bn phi ca khun
l, t th, v hnh dng thay i ( c chng mt c chp thng. Dng php ton Marr-Hildreth
minh). Nhiu phn gii, a t l, cc mu con, v xc nh cnh. Sau dng mt b lc loi b
cc mu bin dng c xem xt thnh bt bin v cc i tng khng tham gia vo xy dng khun
t l v hnh dng. mt. Lin kt cc cp ca cc on ng vin trn
Oh [119] phn on tm ng vin khun mt, c s mc k v cc hng lin quan. Xc nh
tc gi dng cc mu mt c trc so khp vi cc gc phn on ng vin thnh cc ng
cc vng quan tm tm v tr mt trong ng vin. cong c trng. Gn nhn cc ng cong c trng
bng cch kim tra thuc tnh hnh hc v cc v tr

9
lin quan trong lng ging ca n. Ni cc cp ca mt (nh hai mt, hai m, v trn), quan h v mc
cc ng cong c trng thng qua cc cnh nu sng ca cc vng cn li thay i khng ng
cc thuc tnh ca n tng thch. So snh cc t l k. Xc nh cc cp t s ca mc sng ca mt
ca cc cp thuc tnh cho mt cnh v n ng mt s vng (mt vng ti hn hay sng hn) cho ta mt
gi tr tng ng. Nu gi tr ca mt nhm ca ba lng bt bin kh hiu qu. Cc vng c sng
ng cong c trng (vi cc nhn khc nhau) thp u c xem nh mt mu t s m l mu th
th nhm ny s tr thnh mt gi thuyt. Khi xc trong khng gian nh ca mt khun mt vi
nh khun mt trong cc bi bo th thng tin ph thch hp t dng chn nh cc c trng chnh
s c dng thm l s lng ngi trong nh ca khun mt nh hai mt, hai m, v trn. Lu gi
chn gi thuyt ti u [178] . T l chnh xc ca thay i sng ca cc vng trn khun mt trong
phng php ny l 70%, tuy nhin cc khun mt mt tp thch hp vi cc cp quan h sng hn ti
phi c chp thng v khng b che khut. hn gia cc vng nh. Mt khun mt c xc
Venkatranman v Govindaraju dng cch tip cn nh khi mt nh tha tt c cc cp sng hn ti
tng t, nhng dng wavelet trch cnh [257]. hn. tng ny xut pht t s khc bit ca
Tsukamoto trnh by mt m hnh hiu qu khi cng gia cc vng k cc b, sau ny c m
dng mu khun mt (QMF) [253, 254]. Trong rng trn c s bin i wavelet biu din cho
QMF , mi nh mu c chia thnh nhiu khi, cc xc nh ngi i b, xc nh xe hi, xc nh
c trng hiu qu c c lng cho mi khi. khun mt [222]. tng ca Sinha cn c p
Tham s ha mt mu khun mt theo: lightness v dng cho h thng th gic ca robot [151, 236].
edgeness l cc c trng trong m hnh. Sau Hnh 5 cho thy mu ni bt trong 23 quan h c
dng cc mu ( c chia thnh cc khi) tnh nh ngha. Dng cc quan h ny phn loi, c
gi tr faceness (mc l khun mt) ti mi v 11 quan h thit yu (cc mi tn mu en) v 12
tr ca nh. Mt khun mt c xc nh khi gi tr quan h xc thc (cc mi tn xm). Mi mi tn l
faceness vt mt ngng c cho trc. mt quan h. Mt quan h tha mn mu khun mt
Hnh chiu c dng nh cc mu xc nh khi t l gia hai vng vt qua mt ngng v 23
khun mt ngi [233]. Dng PCA (phn tch thnh quan h ny vt ngng th xem nh xc nh
phn chnh Principal Component Analysis - PCA) c mt khun mt.
c mt tp hnh chiu c bn t cc mu khun
mt, hnh chiu c m t nh mt mng cc bit.
Dng c trng hnh chiu ring kt hp bin i
Hough xc nh khun mt ngi. Sau mt
phng php xc nh da trn a loi mu xc
nh cc thnh phn ca khun mt c trnh by
[244]. Phng php ny nh ngha mt s gi
Hnh 5: Mt mu khun mt, c 16 vng v 23 quan h (cc mi tn).
thuyt m t cc kh nng ca cc c trng
khun mt. Vi mt khun mt s c mt tp gi Phng php so khp mu theo th t xc
thuyt, l thuyt DepsterShafer [166]. Dng mt nh khun mt ngi do Miao trnh by [214].
nhn t tin cy kim tra s tn ti hay khng ca giai on u tin, nh s c xoay t -20o n 20o
cc c trng ca khun mt, v kt hp nhn t tin vi mi bc l 5o v theo th t. Xy dng nh a
cy ny vi mt o xem xt c hay khng c phn gii, hnh 1, ri dng php tan Laplace
khun mt trong nh. xc nh cc cnh. Mt mu khun mt gm cc
Sinha dng mt tp nh cc bt bin nh trong cnh m t su thnh phn: hai lng my, hai mt,
khng gian nh m t khng gian cc mu nh mt mi, v mt ming. Sau p dng heuristic
[238, 239]. T tng chnh ca ng da vo s thay xc nh s tn ti ca khun mt trong nh, phng
i mc sng ca cc vng khc nhau ca khun php ny cho php xc nhiu khun mt, nhng kt

10
qu khng tt bng xc nh mt khun mt (chp Iwata [39] xy dng mu mi c trng gm
thng hoc xoay) trong nh xm. bn c trng theo bn hmg: ngang, bn phi pha
Wei v Lai [78] dng b lc phn on kt trn, ng, v bn tri pha trn ca khun mt chp
hp thut ton tm lng ging gn nht xc nh ng thng hoc gn thng trong nh xm. so khp
vin khun mt, t ng vin ny sau so khp vi tng phn ca mu kt hp xc sut cc lng ging.
cc mu xc nh trc bit ng vin c phi T l chnh xc ca phng php ny l gn 99%.
l khun mt hay khng. T l chnh xc l 80%. Keren [33] xy dng khi nim Antifaces xc
Darrell [84] dng phn on tm ng vin, nh khun mt ngi (tng qut cho cc i tng
dng ng vin ny xc nh khun mt ngi da 3-chiu). Da trn nhiu loi mu kt hp gi thuyt
vo mu ri theo vt chuyn ng ca ngi. phn b xc sut tm nhng i tng khng c
Dowdall dng ph ca mu da ngi xc mi tng quan tm khun mt ngi. ng cho
nh ng vin. Sau chiu cc ng vin ny so bit, phng php ny nhanh hn eigenface v SVM
sanh vi cc mu c trc xc nh ng vin no v mc chnh xc gn tng ng.
l khun mt ngi. Phng php ny ch xc nh Feris [59] dng mng wavelet th nht xc
cho khun mt chp thng v gn thng, gc quay nh ng vin khun mt khi so khp vi cc mu
khong t -10o n 10o [86]. hc trc. Sau tc gi dng mng wavelet th
Holst xy dng mt h thng t cc mu vi cc hai xc nh cc thnh phn nh mt, mi, v
c trng kp [92]: (1) thnh phn, gm: mt, mi, ming thng qua cc c trng gc cnh. T cc
v ming; (2) hnh dng khun mt, trn phn thnh phn ny xem xt tnh ha hp c quyt
gii thp. ng dng hai phng php tm kim trong nh cui cng ng vin no l khun mt ngi.
khng gian d liu ca mnh xc nh khun mt
b) Cc mu b bin dng
ngi. Yuille dng cc mu bin dng m hnh ha
cc c trng ca khun mt, m hnh ny c kh
nng linh hot cho cc c trng khun mt [268].
Trong hng tip cn ny, cc c trng khun mt
c m t bng cc mu c tham s ha. Mt
hm nng lng (gi tr) c nh ngha lin kt
cc cnh, nh, v thung lng trong nh tng
ng vi cc tham s trong mu. M hnh ny tt
nht khi ti thiu hm nng lng qua cc tham s,
Mc d kt qu tt vi mu bin dng trong theo vt
i tng trn c trng khng m hnh theo li,
mt hn ch ca hng tip cn ny l cc mu bin
Hnh 6: Phn nhm d liu khun mt
v nhm d liu khng phi khun mt. dng phi c khi to trong phm vi gn cc i
tng xc nh.
Froba v Zink lc cnh phn gii thp ri
Mt hng tip cn da trn dng gp khc
dng bin i Hough so khp mu theo hng
(snake) [193, 208] v cc mu xc nh khun
cnh xc nh hnh dng khun mt dng chp
mt [202]. u tin mt nh s c lm xon li
hnh thng dng xm. T l chnh xc trn 91%
bi mt lc lm m ri dng php ton morphology
[25].
lm ni bt cnh ln. Dng mt ng gp khc
Shu v Jain xy dng ng ngha khun mt [85].
c n im nh (gi tr n nh) tm v c lng
Ng ngha theo hnh dng v v tr cc thnh phn
cc an cong nh. Mi khun mt c xp x
khun mt. Hai ng t b ng ngha ny xy dng
bng mt ellipse v bin i Hough, ri tm mt
mt th quan h d dng so khp khi xc nh
ellipse ni tri nht. Mt tp c bn tham s m t
khun mt ngi.
nt ellipse c dng nh ng vin xc nh

11
khun mt. Vi mi ng vin, mt phng thc 4. Hng tip cn da trn din mo
tng t nh phng thc mu bin dng [268] Tri ngc vi cc phong php so khp mu
dng xc nh cc c trng mc chi tit. Nu vi cc mu c nh ngha trc bi nhng
tm thy s lng ng k cc c trng khun mt chuyn gia, cc mu trong hng tip cn ny c
v tha t l cn i th xem nh xc nh c hc t cc nh mu. Mt cc tng qut, cc phng
mt khun mt. Lam v Yan cng dng ng gp php theo hng tip cn ny p dng cc k thut
khc xc nh v tr u vi thut ton greedy theo hng xc sut thng k v my hc tm
cc tiu ha hm nng lng [203]. nhng c tnh lin quan ca khun mt v khng
Thay v dng ng gp khc th Huang v Su phi l khun mt. Cc c tnh c hc trong
[13] dng l thuyt dng chy xc nh ng hnh thi cc m hnh phn b hay cc hm bit s
vin khun mt da trn c tnh hnh hc. Hai ng nn dng c th dng cc c tnh ny xc nh
dng l thuyt tp ng mc (Level Set) loang t khun mt ngi. ng thi, bi ton gim s chiu
cc khi ng ban u c c cc khun mt thng c quan tm tng hiu qu tnh ton
ngi. cng nh hiu qu xc nh.
Lanitis m t mt phng php biu din khun C nhiu phng php p dng xc sut thng
mt ngi vi c hai thng tin: hnh dng v cng k gi quyt. Mt nh hay mt vector c trng
[204]. Bt u vi cc tp nh c hun luyn xut pht t mt nh c xem nh mt bin ngu
vi cc ng vin mu nh l ng bao mt, mi, nhin x, v bin ngu nhin c c tnh l khun mt
cm/m c gn nhn. Dng mt vector cc im hay khng phi khun mt bi cng thc tnh theo
mu m t hnh dng. Tc gi dng mt m hnh cc hm mt phn lp theo iu kin
phn b im (Point Distribution Model PDM) p(x | khuo
n mat) v p(x | kho
ng pha i khuo
n mat) .
m t vector hnh dng qua ton b cc c th. Dng C th dng phn loi Bayes hoc kh nng cc i
tip cn nh Kirby v Sirovich [198] m t cng phn loi mt ng vin l khun mt hay khng
b ngai ca hnh dng c chun ha. Mt phi l khun mt. Khng th ci t trc tip phn
PDM c hnh dng nh khun mt dng xc nh loi Bayes bi v s chiu ca x kh cao, bi v
khun mt bng m hnh hnh dng tch cc (Active p(x | khuo t) v p(x | khong phaikhuon mat)
n ma
Shape Model - ASM) tm kim v c lng v
l a phng thc, v cha th hiu nu xy dng
tr khun mt cng nh cc tham s v hnh dng.
cc dng tham s ha mt cch t nhin cho
Cc mnh ca khun mt c lm bin dng v
p(x | khuo t) v p(x | khong phaikhuon mat) .
n ma
hnh dng trung bnh ri trch cc tham s cng .
Cc tham s hnh dng v cng c dng C kh nhiu nghin cu theo hng tip cn ny
phn loi. Cootes v Taylor p dng cch tip cn quan tm xp x c tham s hay khng c tham s
ny xc nh khun mt [161]. u tin, hai ng cho p(x | khuo
n mat) v
nh ngha nt vng hnh ch nht cha cc c p(x | kho
ng pha
i khuo t) .
n ma
trng quan tm. Dng phn tch nhn t [146] lm Cc tip cn khc trong hng tip cn da trn
va cc c trng hun luyn c mt hm phn din mo l tm mt hm bit s (nh: mt phng
b. C uc cc c trng l ng vin nu o xc quyt nh, siu phng tch d liu, hm ngng)
sut vt qua mt ngng khi dng ASM. Sau khi phn bit hai lp d liu: khun mt v khng
hun luyn xong c th xc nh khun mt ngi. phi khun mt. Bnh thng, cc mu nh c
Hng tip cn theo ASM c m rng bng hai chiu vo khng gian c s chiu thp hn, ri sau
lc Kalman c lng cc tham s v hnh dng dng mt hm bit s (da trn cc o khong
v cng dng theo vt khun mt ngi cch) phn loi [255], hoc xy dng mt quyt
[169]. nh phi tuyn bng mng neural a tng [48]. Hoc
dng SVM (Support Vector Machine) v cc
phng thc kernel, chiu hon ton cc mu vo

12
khng gian c s chiu cao hn d liu b ri rc cc nh khng c khun mt th xut hin s khc
hon ton v ta c th dng mt mt phng quyt nhau cng khng t. Xc nh s c mt ca mt
nh phn loi cc mu khun mt v khng phi khun mt trong nh thng qua tt c khong cch
khun mt [220]. gia cc v tr trong nh v khng gian nh. Khong
cch ny dng xem xt c hay khng c khun
a) Eigenface
Kohonen a ra phng php dng vector mt ngi, kt qu khi tnh ton cc khong cch s
ring nhn dng khun mt [199], ng dng mt cho ta mt bn v khun mt. C th xc nh
mng neural n gin chng t kh nng ca c t cc tiu a phng ca bn ny. C
phng php ny trn cc nh c chun ha. nhiu nghin cu v xc nh khun mt, nhn dng,
Mng neural tnh mt m t ca khun mt bng v trch c trng t tng vector ring, phn r,
cch xp x cc vector ring ca ma trn tng quan v gom nhm. Sau Kim [23] pht trin cho nh
ca nh. Cc vector ring sau ny c bit n vi mu, bng cch phn on nh tm ng khng
ci tn Eigenface. gian tm kim bt i.
Kirby v Sirovich chng t cc nh c cc b) Hng tip cn da trn phn b
khun mt c th c m ha tuyn tnh bng mt Sung v Poggio pht trin mt h thng xc nh
s lng va phi cc nh c s [198]. Tnh cht khun mt ngi da trn phn b [246, 247], chng
ny da trn bin i Karhunen-Leve [176, 192, t bng cch dng phn b cc cc mu nh cng
211], m cn c gi di mt ci tn khc l PCA mt lp i tng c th c hc t cc mu
[189] v bin i Hotelling [104]. tng ny c negative v positive. H thng ca hai ng bao gm
xem l ca Pearson trnh by u tin vo nm 1901 hai thnh phn: m hnh phn b ca cc mu l
[223] v sau l Hotelling vo nm 1933 [185]. khun mt/khng phi khun mt v mt phn loi
Cho mt tp cc nh hun luyn c kch thc n x a tng da vo th gic. Mi mu l khun mt v
m c m t bi cc vector c kch thc m x m, khng phi l khun mt c chun ha v x l
cc vector c s cho mt khng gian con ti u thnh nh c kch thc 19 x 19 im nh v xem
c xc nh thng qua li bnh phng trung bnh nh mt vector hay mu c 361-chiu. Sau cc
khi chiu cc nh hun luyn vo khng gian con mu c nhm vo cc nhm, mi nhm gm su
ny. Cc tc gi gi tp cc vector c s ti u ny mu cng loi l khun mt hoc nhm khng phi
l nh ring sau gi cho n gin l vector ring l khun mt bng thut ton k-trung bnh (k-mean),
ca ma trn hip phng sai c tnh t cc nh hnh 6. Mi nhm s c m t nh mt hm
khun mt vector ha trong tp hun luyn. Nu Gauss a chiu vi mt nh trung bnh v ma trn
cho 100 nh, m mi khun mt c kch thc hip phng sai. Hnh 7 cho thy cch tnh khong
91x50 th c th ch dng 50 nh ring, trong khi cch ca hai ng. Hai o khong cch dng
vn duy tr c mt kh nng ging nhau hp l tnh khong cch gia nh a vo v tm ca
(gi c 95% tnh cht). nhm. Thnh phn khong cch u tin l khong
Turk v Pentland p dng PCA xc nh v cch Mahalanobis c chun ha gia hnh chiu
nhn dng khun mt [255]. Tng t [198], dng ca mu cn kim tra v tm ca nhm, tnh trong
PCA trn tp hun luyn nh cc khun mt sinh khng gian con c s chiu thp hn, c m t
cc nh ring (cn gi l eigenface) tm mt bng 75 vector ring ln nht. Thnh phn khong
khng gian con (khng gian khun mt) trong khng cch th hai l khong cch Euclide gia mu cn
gian nh. Cc nh khun mt c chiu vo khng kim tra v hnh chiu ca n trong khng gian con
gian con ny v c gom nhm li. Tng t cc c 75- chiu ny. Dng hai khong cch ny xc
nh khng c khun mt dng hun luyn cng nh khong cch t mu cn kim tra n tm mt
c chiu vo cng khng gian con v gom nhm nhm. T nay chng ta c th bit mu cn kim tra
li. Cc nh khi chiu vo khng gian khun mt th gn nhm no nht. Bc cui cng dng mng a
khng b thay i tnh cht c bn, trong khi chiu tng (Multilayer Perceptron Network MLP)

13
phn loi da vo 12 cp khong cch (c 12 nhm) ny uc p dng cho xc nh khun mt, m ha
khi mng ny c hun luyn trc . D dng khun mt, v nhn dng. So snh vi hng tip
chn mu khun mt hun luyn, nhng khng eigenface c in [255], phng php ny cho thy
d chn mu khng phi l khun mt hun hiu qu hn trong xc nh v nhn dng khun
luyn. Dng phng php bootstrap gi gii mt [196].
quyt vn ny. Bt u t tp nh khng phi Yang s dng mt hn hp nhiu phn tch h s
khun mt trong tp mu hun luyn hun luyn lm tiu ch xc nh khun mt. Phn tch h s
MLP. Dng b xc nh khun mt ngi xc (Factor Analysis FA) l mt phng php thng
nh mt ngi trn mt dy cc nh ngu nhin, sau k m hnh ha tnh hip bin cu trc ca d
chn cc mu khng phi khun mt ngi m b liu c s chiu cao bng cch dng m lng nh
xc nh l khun mt ngi xem nh l mu khng cc bin tim tng. FA cng tng t PCA trong vi
phi khun mt ngi mi hun luyn tip tc. kha cnh. Tuy nhin, PCA khng ging FA, khng
Phng php ny b qua vn chn mu no trong nh ngha mt m hnh mt thch hp cho d
cc mu tng tnh hiu qu, c nhiu nghin cu liu. Hn na, PCA khng hiu qu khi c nhiu
sau ny v vn ny [48, 220]. c lp trong cc c trng ca d liu. Tng hp t
[148, 150, 167, 168] cho thy cc mu c chiu t
cc lp khc nhau vo khng gian con PCA thng
c th khng hiu qu. Trong cc trng hp khi cc
mu c mt cu trc chc chn, dng PCA s cho
kt qu kh tt. Hinton dng FA nhn dng cc
con s, ng so snh FA v PCA [184]. Mt m
hnh hn hp ca cc phn tch h s c m rng
nhn dng khun mt ngi [174]. C hai nghin
cu u cho thy FA tt hn PCA. T t th, hng,
Hnh 7: (a) Khong cch gia mu cn kim tra v cc nhm; cm xc, v nh hng nh sng trn din mo ca
(b) hai thnh phn khong cch. khun mt ngi, phn b cc khun mt trong
Moghaddam v Pentland a ra mt m hnh hc khng gian nh c th c biu din tt hn bng
theo xc sut da trn c lng mt trong khng mt m hnh mt a phng thc khi mi
gian c s chiu cao bng khng gian ring [216]. phng thc gi cc c tnh chc chn ca din
Hai ng dng PCA tm khng gian con m t mo chc chn ca khun mt. H trnh by mt
tt nht mt tp cc mu khun mt ngi. Phng m hnh theo xc sut khi dng mt hn hp cc
php ny vn gi li cc mi tng quan tuyn tnh phn tch h s (Mixture of Factor Analyzer MFA)
chnh trong d liu v loi b cc th yu khc. xc nh khun mt ngi. Dng thut ton EM
Phng php ny phn r mt khng gian vector c lng cc tham s trong m hnh hn hp.
thnh hai khng gian con m hai khng gian con ny
loi tr ln nhau v cng b sung cho nhau: khng
gian con chnh (khng gian c trng) v phn b
trc giao. V th, mc tiu mt c phn r lm
hai thnh phn: mt trong khng gian chnh (da
vo cc thnh phn chnh) v phn b trc giao,
Hnh 8: Phn r mt nh khun mt vo khng gian chnh F
hnh 8. Xy dng h thng hc da vo Gauss nhiu ---
v phn b trc giao F .
bin v Gauss hn hp, h thng ny hc da trn
thng k cc c trng cc b ca mt khun mt. Phng php th hai [263] dng bit s tuyn
Dng cc mt xc sut xc nh khun mt tnh Fisher (Fishers Linear Discriminant FLD)
trn c s c lng kh nng cc i. Phngphp chiu cc mu t khng gian nh c s chiu cao
sang mt khng gian c trng c s chiu thp hn.

14
V trn c s phn tch bit s tuyn tnh, cc tc gi c) Mng Neural
xy dng phng php Fisherface [148] v Mng neural c p dng kh thnh cng trong
nhng phng php khc [249, 269] gii quyt tt cc bi ton nhn dng mu, nh: nhn k t, i
hn phng php Eigenface [255] trong nhn dng tng, robot t vn hnh. Xc nh khun mt ngi
khun mt. Khi dng FLD phn loi mu s tt c th xem l bi ton nhn dng hai loi mu, c
hn PCA khi chiu. Do , kt qu phn loi trong nhiu kin trc mng neural c trnh by. Mt
khng gian con c chiu c th kh hn cc thun li khi dng mng neural xc nh khun
phng php khc ( [213] trnh by r v kch mt l tnh kh thi ca h thng hc khi c s phc
thc tp hun luyn). Trong phng thc th hai, tp trong lp ca cc mu khun mt. Tuy nhin,
cc tc gi phn r cc mu hun luyn khun mt iu tr ngi l cc kin trc mng u tng
mt v khng phi khun mt vo trong vi lp con qut, khi p dng th phi xc nh r rng s lng
bng nh x t t chc Kohonen (Kohonens Self tng, s lng node, t l hc, , cho tng trng
Organizing Map SOM) [199]. Hnh 9 cho thy mt hp c th, hnh 10.
i din ca mi lp khun mt. T cc mu c
gn nhn li, tnh cc ma trn cc gi tr ri rc v
tnh cht mu trong lp hay gia lp, bng cch
pht sinh php chiu ti u trn c s FLD. Mi
nhm con, m hnh ha mt nh mt phng
thc Gauss vi cc tham s trong Gauss c c
lng bng phng php cc i ha kh nng Hnh 10: M hnh mng Neural theo Rowley
[167]. Qut trn ton b nh a vo bng mt ca Agui trnh by mng neural x l c th t
s ri tnh xc sut mc ph thuc lp. Dng lut [143]. Gia on u dng hai mng con song song
quyt nh da trn cc i ha kh nng xc m d liu vo l cc gi tr cng ca nh ban
nh c phi l khun mt hay khng. C hai u v cc gi tr cng ca nh c lc
phng php trong [263] c t l chnh xc l 92.3% bng thut ton lc Sobel vi ca s lc 3x3. u
cho MFA v 93.6% khi dng FLD. vo ca mng giai on hai bao gm d liu u ra
t hai mng con v cc gi tr c trng c
trch ra, nh: c trng lch chun ca cc gi tr
im nh trong mu a vo, mt t l ca s im
nh trng trn tng s im nh ( chuyn sang nh
nh phn) trong mt ca s, v c trng thit yu v
hnh hc. Mt gi tr xut ti giai on hai cho bit
c tn ti hay khng khun mt ngi trong vng
a vo. Qua kinh nghim, tc gi ch ra rng nu
cc nh cng mt kch thc th mi dng phng
php ny c.
Propp v Samal pht trin mng neural xc
Hnh 9: i din ca mi lp khun mt.
nh khun mt ngi sm nht [224]. Mng neural
Mi i din tng ng tm ca mt nhm. ca hai ng gm bn tng vi 1,024 u vo, 256
Choi [31] xy dng h thng xc nh khun mt u k tip trong tng n th nht, tm u k tip
ngi trong nh mu bng c trng ca mt ngi trong tng n th hai, v hai u ra. Tng t nh
thng qua phn on xc nh ng vin khun mng neural x l theo th t c a ra sau
mt da trn phn b mu da ca khun mt. [251]. Phng php ca Soulie [242] duyt mt nh
a vo vi mng neural c thi gian tr [258] (kch
thc ca s l 20x25 im nh) xc nh khun
mt. Dng bin i wavelet phn r nh cc phn

15
c kch thc khc nhau xc nh khun mt. Theo nh gi cc phng php dng mng
Vaillant dng mng neural dng xon xc nh neural xc nh khun mt ngi ca nhiu tc
khun mt ngi [256]. u tin to cc nh mu gi, th nghin cu ca Rowley [48, 231] c xem
khun mt v khng phi khun mt c kch thc l tt nht i vi nh xm. Mt mng a tng c
20x20. Dng mt mng neural, mng ny c dng hc cc mu khun mt v khng phi
hun luyn, tm cc v tr tng i ca cc khun t cc nh tng ng (da trn quan h
khun mt cc t l khc nhau. Ri dng mt cng , v mt khng gian ca cc im nh)
mng khc xc nh v tr chnh xc ca cc trong khi Sung [246] dng mng neural xc nh
khun mt. Mng u tin dng tm cc ng vin mt hm bit s cho mc ch phn loi mu c phi
khun mt, ri dng mng th hai xc nh ng l khun mt hay khng da vo o khong cch.
vin no that s l khun mt. Burel v Carel dng Hai ng cng dng nhiu mng neural v vi
mng neural a tng c t mu hn vi thut ton phng thc quyt nh ci thin kt qu, trong
Kohenens SOM hc cc mu khun mt v hnh khi Burel v Carel [153] dng mt mng n, v
nn, m cc mu ny c phn loi trc. Giai Vaillant [256] dng hai mng phn loi. C hai
on xc nh khun mt bao gm duyt trn mi thnh phn chnh x l: nhiu mng neural (xc
nh c bin i t nh bn u cc phn nh mu no l khun mt) v mt m un quyt
gii khc nhau. ti mi v tr v kch thc ca s nh (a ra quyt nh cui cng t nhiu kt qu
duyt, iu chnh sng. Mi ca s c xc nh). Hnh 9, thnh phn u tin ca phng
chun ha s c phn loi bng MLP. php ny l mt mng neural nhn mt vng nh c
Feraud v Bernier dng mng neural kt hp t kch thc 20x20 im nh v xut ra mt gi trc
ng [171, 172, 173]. tng da trn [201] mng trong khong t -1 n 1. Khi a vo mt nh, nu
kt hp t ng c nm tng th c th biu din mt kt qu gn -1 th ngha l mu ny khng phi l
phn tch thnh phn chnh phi tuyn. Dng mt khun mt ngi, nhng nu kt qu gn 1 th y
mng kt hp t ng xc nh cc khun mt chnh l khun mt ngi. xc nh khun mt
chp thng ri m rng bng cch xoay 60 t tri c kch thc ln hn 20x20 im nh, c chn mt
sang phi khun mt chp thng, mng ny s tn t l ri duyt ri xc nh, ri li thay i t l
dng cc trng s khi xy dng vi d liu khun (bin thin t l ny do ngi xy dng quyt nh).
mt chp thng cho cc t th mi. Hai ng cho bit Gn 1050 mu khun mt c kch thc, hng, v
kt qu cng tng t [231]. Phng php ny cng tr, v cng khc nhau dng hun luyn
dng cho LISTEN [159] v MULTRAK [149]. mng. S gn nhn cho mt, nh ca mi, gc cnh,
Lin xy dng mng neural quyt nh trn c s v tm ca ming ri dng chun ha khun mt
xc sut (Probabilistic Decision-based Neural v cng mt t l, hng, v v tr. Thnh phn th
Network PDBNN) [209]. Kin trc ca PDBNN hai l phng php trn cc xc nh chng cho
th tng t mt mng c hm trn nn tng tng nhau v a ra quyt nh. Php ton logic
t tia (Radial Basis Function RBF) vi cc lut (AND/OR) l mt quyt nh n gin nht v
hc c h tr xc sut. Thay v chuyn ton b phng php bu c c dng tng tnh hiu
nh khun mt thnh mt vector c cc gi tr cng qu. Rowley [48] a nhiu cch gii quyt bi ton
hun luyn cho mng neural, ng s trch quyt khc nhau nhng chi ph tnh ton t hn Sung
vector c trng da trn cng v thng tin v Poggio nhng t l chnh xc cao hn.
cnh trong vng khun mt c cha lng my, mt, Mt gii hn ca phng php ca Rowley [48]
v mi. Hai vector c trng c trch th a v Sung [246] l ch c th xc nh khun mt chp
vo hai PDBNN v hp nht cc kt qu c kt thng v ta thng (nghing u). Sau Rowley
qu phn loi. Trn c s 23 nh ca Sung v [49] ci tin c th xc nh khun mt b xoay
Poggio [248]. ng cho mt s kt qu so snh bng mng nh hng (Router Network), hnh 11,
vi cc mng khc [48, 248]. s thm tin trnh xc nh hng khun mt v

16
xoay v li t th chun (chp thng), tuy nhin khi ng dng 10,000,000 mu c kch thc 19x19
quay li d liu nh trn th t l chnh xc li gim im nh, h thng ca ng c t l li t hn Sung
i, ch cn khong 76.9%. v Poggio [247], nhng nhanh hn gn 30 ln. SVM
cng c th dng xc nh khun mt ngi v
ngi i b vi phn tch Wavelet [219, 221, 222].
Shihong v Masato s dng bin i wavelet
Gabor trch c trng ca khun mt cng nh
khng phi khun mt a vo cho SVM hc
[15].
Kang v Lee [14] xy dng ng dng cho robot
Hnh 11: Mt v d cho d liu vo v d liu ra ca mng nh hng.
i b vt qua con ngi v chng ngi vt da
Lee [71] pht trin tng ca Rowley [48] cho
trn xc nh khun mt. Hai ng dng phn on
xc nh khun mt trong nh mu. ng dng m
ni kt hp SVM phn loi. Tng t Kui v
hnh mu da ngi bng Gauss xc nh cc ng
Silva [22] cng xy dng ng dng cho phng thng
vin, sau loi bt nhng ng vin no khng tha
minh bng cch xc nh khun mt ngi da trn
mn tnh cht hnh dng gn ging hnh ellipse. Cui
eigenface lm d liu cho SVM hc phn loi.
cng ng dng mt mng neural c hun luyn
Bileschi v Heisele [18] dng phn gii thp
xc nh khunmt ngi. T l xc nh chnh
hc thnh phn khun mt trong nh xm vi cc
xc l 88.9%, cn t l xc nh sai l 11.1%.
khun mt chp thng hoc ta thng cho SVM
Da trn nghin cu ca Rowley [48], Hazem
xc nh khun mt. Trong khi Terrillon [20] dng
[108] ci tin tc x l tng ln ng k.
tnh cht mu da ngi tm ng vin kt hp
Kwolek [131] dng b lc Gabor trch c
SVM v cc m men Fourier-Mellin trc giao
trng, dng c trng ny hun luuyn cho mng
gii quyt. Thay v lc n gin, Shu-Fai v Kwan-
neural xon. Mng neural xon l mng neural m
Yee [72] dng QuaTree tm ng vin khun mt
mi node mi tng c th lin kt vi cc lng
ngi trong nh mu. Sau kt hp wavelet phn
ging cc b tng pha trc ca n. T l chnh xc
tch mu cho SVM hc trong nhiu t l. a phn
l 87.5%.
khi cho SVM hc, cc tc gi u dng hai lp
d) SVM khun mt v khng phi khun mt hc. Wang
Support Vector Machine (SVM) c Osuna [75] ch dng mt lp khun mt trong nh mu
[220] p dng u tin xc nh khun mt xc nh khun mt ngi. T l chnh xc khong
ngi. SVM c xem nh l mt kiu mi dng 81%. Fang v Qiu [83] kt hp SVM v thut ton
hun luyn phn loi theo hm a thc. Trong khi leo i xc nh khun mt. Zhang v Zhao [51]
hu ht cc phng php khc hun luyn phn xy dng SVM da trn histogram ca khun mt
loi (Mng Bayes, Nueral, RBF) u dng tiu ch v khng phi khun mt xc nh khun mt. T
ti thiu li hun luyn (ri ro do kinh nghim), l chnh xc khong 92% cho khun mt chp thng
trong khi SVM dng quy np (c gi l ti thiu hoc gn thng trong nh mu. Je li xy dng nhiu
ri ro cu trc), mc tiu l lm ti thiu mt bao SVM xc nh khun mt ngi theo th t quyt
bn trn trn li tng qut. Mt phn loi SVM l nh kt hp phng php bu c trong nh mu
mt phn loi tuyn tnh, dng mt siu phng [30].
tch d liu. Da trn mt kt hp c cc trng s Julien [129] xy dng mt cu trc SVM mi
ca mt tp con nh cc vector hun luyn, cc gm nhiu SVM kt ni song song vi nhau hc d
vector ny c gi l support vector. c lng liu t khng gian eigenface. T l chnh xc hn
siu phng th tng ng gii mt bi ton tuyn 93% trong nh xm vi khun mt n c chp
tnh bc hai. Osuna [220] pht trin mt phng thng.
php hiu qu hun luyn mt SVM vi t l ln
p dng cho bi ton xc nh khun mt ngi.

17
e) Mng lc tha gian c s chiu thp hn (dng PCA xy dng)
Yang xut mt phng php dng mng lc v lng t ha thnh mt tp cc mu c gii hn,
d tha (Sparse Network of Winnows SNoW) v thng k mi vng c chiu, cc thng k
[181, 230] xc nh khun mt ngi vi cc c ny c c lng t cc mu c chiu xung
trng khc nhau v biu din trong cc t th khc khng gian c s chiu nh hn, m ha din
nhau, v di iu kin nh sng khc nhau [264]. mo cc b. Khi t l kh nng ln hn t l ca cc
ng thi nghin cu phng php hc s khai tt xc sut u tin th c khun mt ngi. ng cng
nh th no khi dng cc c trng a t l. SNoW cho thy so snh gia phng php ny v [48],
l mt mng tha dng cc hm tuyn tnh v dng hng tip cn ny cho php xc nh cc khun
lc cp nht lut [210]. Phng php ny thch mt b xoay v nhn nghing. Schneiderman v
hp cho hc trong min khi cc c trng tim nng Kanade sau kt hp bin i wavelet xc nh
to cc quyt nh sai khc nhau m khng bit mc cc khun mt nhn nghing v xe hi [58].
u tin. Mt vi c tnh ca kin trc hc ny l Rickert cng dng cch chn cc c trng cc
rt him d liu c phn chung, c ch nh cc c b [229]. Cc c trng cc b c trch ra bng
trng v lin kt trong d liu, k thut quyt nh, cch p dng cc b lc a t l v nhiu phn
v cp nht lut hiu qu. T l li l 5.9%, hiu qu gii trn d liu nh a vo. Dng phng php
cng nh cc phng php khc [48, 160, 220, 237]. gom nhm d liu v mt Gauss hn hp tm
Gundimada [4] da trn kin trc SNoW xy phn b ca cc vector c trng. Sau khi hun
dng ba mng, mng th nht phn loi da trn luyn cho m hnh v tinh ch, tnh kh nng ca
phn b cng , hai mng da trn phn b mu cc vector c trng ca cc nh phn loi.
da ngi tm ng vin kt hp phng php lm Phng php ny cho kt qu tt cho xc nh
ni bt cnh. Xy dng cc mu y t th ca khun mt ngi cng nh xe hi.
khun mt, mi b phn loi s tng ng cho mt Thang [77] xc nh khun mt ngi thng qua
hng, mi hng lch nhau 10o. phn loi mng Bayes kt hp, hay cn gi l mng
f) Phn loi Bayes Bayes c cu trc nh rng (Forest-Structured
Tri ngc vi cc phng php trong [48, 220, Bayesian Network), xc nh cc bit s. Kt hp
248] da vo din mo trn ton khun mt, phng php Bagging xy dng phn loi tch
Schneiderman v Kanade m t mt phn loi naive hp nhm xc nh khun mt ngi trong nh xm.
Bayes uc lng xc sut ni cc din mo ti T l chnh xc hn 90%.
v tr cc b trn khun mt v v tr ca cc mu Nam v Rhee [110, 123] xy dng mng Bayes
khun mt ngi (cc vng con trn khun mt) hc phn loi theo ng cnh: mu da, nh sng, v
trong nhiu phn gii [73, 237]. Hai ng nhn kt cu khun mt v kt hp mng FuzzyART
mnh tnh cht din mo khun mt v tr cc b xc nh khun mt ngi trong nh. Hai tc gi ny
bi v vi vi mu v tr cc b ca mt i tng cng dng thm khong cch Mahalanobis [122] khi
s c tnh cht duy nht, cng xung quanh mu kt hp mng RBF v FuzzyART xc nh khun
mt th c bit hn v tr m. y l hai l do mt c nhiu t l khc nhau. Hai tc gi pht trin
dng phn loi naive Bayes (khng xem xt thng bng cch dng nhiu phn loi Bayes chn ng
k nhng ph thuc gia cc vng). u tinphn vin thng qua cc c trng thng tin v cng
loi ny cung cp c lng tt hn ca cc hm v kt cu ca khun mt [126]. T l chnh xc hn
mt c iu kin ca cc vng ny. Th hai, mt 87%.
phn loi Bayes cung cp mt dng hm ca theo Lee v Kim [120] dng c trng Haar wavelet
xc sut nhn thng k ca din mo v tr cc 1-chiu hun luyn cho mng Bayes xc nh
b v v tr ca n trn i tng. Ti mi t l, mt nhiu khun mt chp thng trong nh xm thng
nh khun mt ngi c phn r lm bn vng qua PDF ca cc mu khunmt ngi v mu
hnh ch nht con. Chiu cc vng ny xung khng

18
khng phi khun mt ngi. T l chnh xc l t v mt quan st c xem nh mt khi cc im
98%. nh, hnh 12a v hnh 13. p dng mt nh hng
Zhu [97] dng wavelet trch cc tham s c theo xc sut chuyn t trng thi ny sang trng
trng da vo histogram ri dng mng Bayes thi khc, hnh 12b, d liu nh c m hnh ha
c hc xc nh khun mt ngi trong nhiu bng phnb Gauss nhiu bin. Mt chui quan st
t l khc nhau. bao gm tt c gi tr cng t mi khi. Kt qu
Duy Nguyen [280] dng b lc Sobel xc xut ra cho bit quan st thuc lp no. HMM c
nh cc c trng ri dng phn loi naive Bayes dng nhn dng khun mt ngi v xc nh
nh Schneiderman v Kanade xc nh khun khun mt ngi. Samaria [235] dng nm trng
mt ngi. thi tng ng nm vng, hnh 12b m hnh ha
tin trnh xc nh khun mt ngi. ng hun
g) M hnh Markov n
Mt gi thuyt quan trng ca m hnh Markov luyn tng vng cho HMM. Mi tnh trng s ph
n (Hidden Markov Model HMM) l cc mu c trch xem xt vng tng ng a ra quyt nh
th c c tnh ha nh cc tin trnh ngu nhin ph hp. Nu kt qu xem xt cui cng vt qua
c tham s v cc tham s ny c c lng mt ngng th quan st ny s l khun mt ngi.
chnh xc, y l mt trong nhng nh ngha r
rng. Khi pht trin HMM gii quyt bi ton
nhn dng mu, phi xc nh r c bao nhiu trng
thi n u tin cho hnh thi m hnh. Sau , hun Hnh 12: M hnh Markov n:
luyn HMM hc xc sut chuyn tip gia cc trng (a) cc vector quan st hun luyn cho HMM;
thi t cc mu, m mi mu c m t nh mt (b) nm trng thi n.

chui cc quan st. Mc tiu hun luyn HMM l Samaria v Young dng HMM 1-chiu (hnh
cc i ha xc sut ca quan st t d liu hun 12) v 2-chiu (hnh 13) trch c trng khun
luyn bng cch iu chnh cc tham s trong m mt dng nhn dng khun mt [234, 235]. HMM
hnh HMM thng qua phng php phn on khai thc cu trc ca khun mt tun theo cc
Viterbi chun v cc thut ton Baum-Welch [227]. chuyn tip trng thi. T cc cng c c trng
Sau khi hun luyn xong, da vo xc sut xc quan trng nh: tc, trn, mt, mi, v ming, hai
nh mt quan st thuc lp no. ng phn tch theo t nhin t trn xung di, mi
Mt cch trc quan, c th chia mt mu khun vng c thit k thnh mt trng thi 1-chiu. Mi
mt ngi thnh nhiu vng khc nhau nh u, nh c phn on chun thnh nm vng theo th
mt, mi, ming, v cm. C th nhn dng mt mu t t trn xung di to thnh nm trng thi. Hai
khun mt ngi bng mt tin trnh xem xt cc ng dng phn on Viterbi thay cho phn on
vng quan st theo mt th t thch hp (t trn chun v cc tham s trong HMM c ti c
xung di, t tri qua phi). Thay v tin tng vo lng bng thut ton Baum-Welch. Tng t
mc chnh xc v tr l dng cho cc phng [234], Nefian v Hayes dng HMM v bin i
php da trn so khp hay da trn din mo (ni Karhunen Leve (Karhunen Leve Tranform
xut hin cc c trng nh mt v mi cn xc nh KLT) xc nh khun mt ngi v nhn dng
v tr l tt ly c ton b chi tit ca c [217]. Thay v dng cc gi tr cng th, cc
trng). Mc tiu ca hng tip cn ny l kt hp vector quan st s bao gm cc h s (dng KLT
cc vng c trng khun mt vi cc trng thi ca c) th kt qu s tt hn [234], v t l chnh xc
m hnh. Thng cc phng php da vo HMM khi dng HMM 2-chiu (hnh 13) l 90%.
s xem xt mt mu khun mt nh mt chui cc
vector quan st, vi mi vector l mt dy cc im
nh, hnh 12a v hnh 13. Trong qu trnh hun
luyn v kim tra, mt nh c qut theo mt th

19
Hammersley-Clifford, mt MRF c th c c
tnh ha tng ng bng mt phn b Gibbs v
cc tham s thng cc i ha sau khi c lng
[225]. Nh mt s la chn, cc phn khun mt
ngi v khng phi khun mt c th c c
lng qua cc histogram. Dng thng tin quan h
Kullback, tin trnh Markov cc i ha bit s trn
Hnh 13: Xc nh khun mt bng HMM cc trng thi, mi trng thi c s thng tin gia cc lp xc nh khun mt
li c nhng trng thi nh bn trong: trng thi trn c ba trng thi nh
bn trong; trng thi mt c nm trng thi nh bn trong.
ngi [160, 207].
Lew p dng thng tin quan h Kullback [162]
Rajagopalan a ra hai phng php xc sut
kt hp hm xc sut p(x) khi mu l khun mt
xc nh khun mt ngi [228]. Tng phn vi
ngi v q(x) khi mu khng phi l khun mt
[248], dng mt tp cc Gauss nhiu bin m
ngi xc nh khun mt ngi [207]. ng dng
hnh ha phn b ca khun mt ngi, phong
100 c th khun mt ngi gm chin quang cnh
php u tin trong [228] dng htng k c th t
c lng phn b ca khun mt. Dng 143,000
mc cao hn (Higher Order Statistic - HOS) c
mu khng phi khun mt c lng hm mt
lng cng . Tng t [248], cc phn b khng
xc sut (PDF) thng qua histogram. T y
bit ca khun mt v khng phi khng mt c
chn c cc im nh giu thng tin nht (Most
gom nhm bng su hm cng da trn HOS
Informative Pixel MIP) cc i ha thng tin
ca cc mu. Nh trong [246], s dng tri gic nhiu
quan h Kullback gia p(x) v q(x) (c c mt
tng phn loi, mt vector a vo x l gm
phn tch lp cc i). Phn b MIP tp trung cc
mi hai o lng khong cch gia mu nh v
vng mt v ming, nhng b qua vng mi. MIP
mi hai nhm. Tip cn ny da trn c s sinh
c dng c c cc c trng tuyn tnh dng
mt dy quan st t nh ri dnh HMM hc cc
cho phn loi v m t bng phng php ca
tham s tng ng. Kt qu ca ng cho thy c hai
Fukunage v Koontz [175]. Dng mt ca s duyt
phng php HOS v HMM u c kt qu xc nh
trn tan b nh xy dng khong cch t khng
khun mt ngi cao hn [48, 248], nhng nhiu
gian khun mt (Distance From Face Space
xc nh nhm hn.
DFFS), c nh ngha [283]. Nu DFFS n
Filareti dng c trng sc mu kt hp thng
khng gian con khun mt ngn hn khong cch
tin v su ca nh lm d liu u vo dy cho
n khng gian con khng phi khun mt, hnh 8,
HMM xc nh khun mt ngi [63]. Phng
th xem nh xc nh c khun mt trong ca
php ny cho php gii quyt vn v iu kin
s.
hnh nn, sng, che khut, t th khun mt.
Thng tin quan h Kullback cng c
Hong [121] xy dng m hnh Markov n hc
Colmenarez v Huang dng cc i ha bit s
d liu da trn cc c trng Haar-like xc nh
trn c s thng tin gia cc mu negative v
khun mt ngi. T l chnh xc l 96%.
positive ca khun mt [160]. Phn tch cc nh t
h) Hng tip cn l thuyt thng tin tp hun luyn ca mi lp (lp khun mt ngi v
Thuc tnh trong khng gian ca mu khun mt lp khng phi khun mt ngi) nh cc quan st
c th c m hnh ha qua nhiu din mo khc trong tin trnh ngu nhin v ac c tnh ha
nhau. Dng ng cnh phn on l mt phng bng hai hm xc sut. Hai ng dng mt hc cc
php hiu qu, xc nh ng cnh thng qua cc qu trnh x l Markov ri rc m hnh cc mu
im nh ln cn. L thuyt trng ngu nhin khun mt v hnh nn ri c lng m hnh xc
Markov (Markov Random Field MRF) cung cp sut tng ng. Qu trnh hc c chuyn thnh
mt tin li v cch ph hp m hnh ha cc bi tan ti u chn c tin trnh cc i bit
thc th da vo ng cnh nh cc im nh v cc s trn c s thng tin gia hai lp. Tnh t l kh
c trng c mi tng quan. Theo nh l

20
nng dng cho m hnh xc sut c hun luyn nh vi 32 mc cng hac kt cu, trong khi
ri dng xc nh khun mt ngi. [248] dng ton b t l cc gi tr cng . T l
Qian v Hang [225] trnh by mt phng php chnh xc l 90%.
dng c hai phng php trn c s quang cnh v Bernhard Froba v Andreas Ernts [25] dng cy
m hnh ha. u tin, mt thut ton dng tri thc quyt nh c nhiu nhnh cho php xc nh khun
min mc cao ca nhng g khi nhn vo th quan mt ngi nhn nghing t -60o n 60o, mi node
tm ngay gim s chiu khng gian tm kim c kh nng loi b ca s con hin hnh ang xt
(thay v tm trn ton b khng gian c trng, ch hoc phn loi vo mt trong ba lp quay. T l
cn tm trn khng gian con c nhng c trng chnh xc cho nh xm l 90%.
quan tm). Thut ton ny chn cc vng trn nh Socolinsky [91] dng phn loi CCCD (Class-
lm mc tiu khi c din mo quan tm xut hin Cover Catch Digraph) kt hp boosted tree-like
bng thut ton xc nh vng (phng php water- thng qua o cross-correlation xc nh khun
shed). Dng cc vng c chn xc nh mt ngi da trn tp mu hun luyn.
khun mt so khp mu v so khp c trng thng Ramana [112] dng cy quyt nh nh mt
qua trng ngu nhin Markov v cc i c lng cng c phn loi xem phn no s l khun mt
sau. ngi. Trong khi xy dng cy ng kt hp c
Feng v Shi [102] dng KFD (Kernel Fisher cascade tng tnh hiu qu.
Discriminant) phn tch nh c khun mt ngi
j) AdaBoost
ri hc cc c trng ny xc nh khun mt Hc vi AdaBoost l mt phn loi mnh phi
ngi. T l chnh xc trong nh xm khong 72%. tuyn phc HM(x), c xy dng t M phn loi
i) Hc theo quy np
mhm ( x)
M

Cc thut ton hc theo quy np c p dng yu [274], H M ( x) = m=1 vi x l mu cn



M

xc nh khun mt ngi. Huang dng thut m=1 m

ton C4.5 [226] xy dng cy quyt nh t cc mu phn loi, hm(x){-1,1} l M phn loi yu, m0 l

m l nhn t
M
khun mt ngi [187]. Mi mu hun luyn l mt cc h s trong , v m=1
ca s c kch thc 8x8 im nh v c m t
chun ha. Mc tiu ca Adaboost l hc mt dy
nh mt vector c 30 thuc tnh v entropy, trung
cc phn loi yu. Gi s c mt tp N mu hun
bnh, v lch chun ca cc gi tr cng ca
luyn c gn nhn {(x1,y1), , (xN,yN)}, vi yi
im nh. Mi node ca cy quyt nh s ch r
quyt nh trn mt thuc tnh n. T l chnh xc l nhn tng ng ca mu xi n . Tnh mt phn
khi xc nh khun mt c chp thng l 96%. b ca cc mu hun luyn [w1, , wN] cp nht
Duta v Jain [273] m t mt phng php hc trong sut qu trnh hc. Sau bc lp m, mu kh
khi nim khun mt ngi bng thut ton Find-S phn loi (xi,yi) c trng mi wi(m), n bc lp th
ca Mitchell [215]. Tng t [248], hai ng c (m+1), mu ny s c tm quan trng hn.
on phn b ca cc mu khun mt ngi bng Viola v Jones dng AdaBoost kt hp cascade
p(x | khuo
n mat) thng qua mt tp cc nhm theo xc nh khun mt ngi [52] vi cc c trng
Gauss v khong cch t mu khun mt n tm dng Haar wavelet-like [221, 275]. Tc x l kh
nhm nn nh hn khang cch ln nht t cc im nhanh v t l chnh xc hn 80% trn nh xm.
n tm nhm (ngha l name trong phn bao ca Schneiderman v Kanade dng wavelet trch
nhm). Sau dng thut ton Find-S hc khong c trng ri xy dng h thng hc vi Adaboost,
cch ngng. Phng php ny c vi c tnh da trn xc sut v histogram xc nh khun
ring. Th nht, khng dng cc mu khng phi l mt ngi [26]. T l chnh xc trn 90%.
khun mt ngi, trong khi [48, 248] dng c hai Chen [37] c lng tham s nh iu chnh
loi mu. Th hai, ch dng duy nht phn tm nh sng cho ph hp cho cc mu bng SVM. Sau
hun luyn. Th ba, cc vector c trng gm c cc

21
cng dng Adaboost xc nh khun mt ngi gim t l sai. Cui cng dng cascade x l
vi t th chp thng. T l chnh xc l 89.7%. khut.
Lu [58] dng phn tch bit s tuyn tnh (Linear Maeynet v Thiran [62] xc nh c trng
Discriminant Analysis LDA) trch c trng. Gauss khng ng hng v Haar-like kt hp
Sau hun luyn b phn loi yu boosting Adaboost xc nh khun mt ngi.
phn loi khun mt ngi. Tng t Song [99] cng dng AdaBoost xc
Shinji v Osamu [137] xy dng cc trng ca nh khun mt ngi kt hp mu da ngi. Ri
khun mt bng cch s dng nhiu mc phn dng cc thng tin ny theo vt cc i tng l
gii thp xc nh khun mt ngi thng qua con ngi trong phng.
Adaboost. Ishii [64] xy dng phng php BDF (Block
Jin [113] ch ra nu dng tng phng php so Difference Feature) vi cc c trng Haar-like
khp mu hay cascade ring r th mc chnh xc xc nh cc t th ca khun mt, ri dng kt qu
gn nh nhau, nhng mc xc nh sai kh cao. ny so snh vi d liu c hc trc v
Tc gi kt hp hai phng php ny gim t l xc nh c khun mt hay khng trong nh.
sai ca phng php xc nh khun mt ngi.
l) Hc vi FloatBoost
Ou [115] thy rng khi dng cascade AdaBoost Li v Zhang a ra mt khi nim mi l
xc nh khun mt ngi th thng thng dng FloatBoost [103]. Phng php ny hc da trn
thut ton greedy tm cc trng ca b phn loi phn loi boosting t l li cc tiu. Nhng
yu th khng uc ti u. Tc gi xut dng GA phng php ny cho php quay lui sau khi ti mi
thay th cch tm trn nhm tng tnh hiu qu. bc khi hc bng AdaBoost cc tiu c t l
k) Cc c trng Haar-like v phn loi vi li trc tip, cc tiu theo hm m. C hai vn
cascade gp khi dng phng php AdaBoost:
Viola v Jones dng bn loi c trng Haar- o Th nht: AdaBoost cc tiu theo hm m ti
like c bn xc nh khun mt ngi [52, 221], bin qua tp hun luyn. y l tin li, tuy
hnh 13. c trng Haar c a thch v c hai l nhin mc tiu cui cng trong cc ng dng
do: (1) phn loi mnh trong vic xc nh khun dng phn loi mu th thng l cc tiu
mt ngi hay khng phi khun mt ngi; v (2) mt gi tr trc tip (tuyn tnh) kt hp vi
c hiu qu [276] khi dng bng tng cc vng t l li. Mt phn lai mnh c hc bng
[284] hoc k thut nh y [52]. AdaBoost th gn im ti u ca ng dng
trong iu kin t l li. Vn ny khng
thy ti liu ni n c li gii.
o Th hai: AdaBoost li mt thch thc nu
dng phn lai yu hc. Hc phn loi
ti u khi dng phn loi yu cn c lng
mt khng gian c trng, iu ny l vn
Hnh 13: Bn loi c trng Haar wavelet-like.
kh, c bit khi s chiu ca khng gian
nh y II(x,y) ti v tr (x,y) l tng cc kh ln.
im nh trn v bn tri ca (x,y) [52], Mt thut ton hc yu c hiu qu v d dng
II ( x, y) =
x ' x , y ' y
I ( x, y) th rt cn thit. FloatBoost xem nh mt cu ni
gia mc tiu ca hc boosting thng thng (cc
Lienhart pht trin cc c trng Haar-like thnh i bin) v nhiu ng dng dng cc tiu t l li
mt b c trng mi [19] kt hp phn loi cascade thng qua vic kt hp phng php tm kim
xc nh khun mt ngi. Floating v AdaBoost kt hp k thut quay lui.
Lin [79] dng boost loi trng hp hc qu Tian [101] xy dng cy xc nh trn c s hc
khp ng thi s dng hun luyn tng cng tch cc bng thut ton gom nhm c-mean m da

22
trn nn tng FloatBoost. Tc gi dng cc c trng o) Hng tip cn tng hp
Harr wavelet-like p dng cho phng php ca Cc cc phng php c chia lm bn phn
mnh. Phng php ny cho php xc nh nhiu loi chnh theo bn hng tip cn. Tuy nhin, c
khun mt ngi nhiu t th khc nhau trong nh nhiu phng php khng hon ton ri vo mt
mu. trong bn hng tip cn ny m trong nhiu
hng tip cn khc nhau. V d, phng php so
khp mu dng m hnh khun mt ngi v cc
mu con trch cc c trng khun mt [163, 177,
232, 238, 269], v sau dng cc c trng ny
xc nh khun mt. Hn na phng php da trn
tri thc v phng php so khp mu khng tht s
tch bit, t c nhiu hng gii quyt dng tri
thc ca con ngi nh ngha cc mu khun
mt ngi [164, 232, 238].
Kim [24] kt hp cc c trng lng ging ca
Hnh 15: Mt v d v c php nh.
khun mt xy dng cc mu theo cc hng, sau
m) Phn loi da trn c php hay mnh dng k thut xc nh cnh EBM (Edge-like
Tu [141] da trn khi nim c php trong x l Blob Map) theo cng . ng xy dng logic m
ngn ng xy dng th c php ca nh da kt hp PCA c lng t th cc khun mt.
trn ni dung nh, hnh 15, vi chui Markov. Sau Taur v Tao [68] xy dng phn loi neuro-
khi c c cc t vng, ng dng phng php fuzzy (neuro-fuzzy classifier NEFCAR) c o
Adaboost c hun luyn trc xc nh tin cy bit nh no l khun mt ngi. Cc ng
cc i tng. vin c chn thng qua phn an mu da.
Hoc c th xem tng vng ca khun mt Chen s dng MRF kt hp hnh thi hc xc
ngi nh cc phn cu thnh mt khun mt, ri nh khun mt ngi [87]. ng dng cc b lc
xy dng cc gi thuyt xc nh khun mt top-hat, bottom-hat, v watershed phn on nh
ngi. Bastian [278] xy dng mt l thuyt tng i tng ang di chuyn nhm tm ng vin khun
qut cho cc loi i tng. mt. Sau cng dng MRF v hnh dng ellipse ca
n) Phn loi da trn loi b khun mt xc nh ng vin no l khun mt.
Elad xy dng mt phn loi da trn khi nim Garcia v Tziritas [2, 11] sau khi lc nhng
loi b ti a (Maximal Rejection Classifier MRC) vng no c mu l mu da ngi tm ng vin,
khc hn tng phn loi khc. Cc phng php sau dng thut ton trn vng to ng vin
khc tm mc chung ca mt cc th no so mn hn. Hai ng dng phn tch theo wavelet
vi cc lp chn c th vo lp no. ng chn phn r ng vin xem c cng kt cu vi khun
cch loi b nhng lp m c th ny khng c hoc mt ngi hay khng thng qua khong cch
c t mi tng quan, chi ph loi b khng cao lm Bhattacharrya. T l chnh xc khong 95%.
[28]. ng tnh PDF ca hai lp: khun mt ngi v Cooray v Connor [3] v Lee [36] dng k thut
khng phi khun mt ngi. ng xem khun mt lai trn c s trch c trng v PCA xc nh
ngi l target v khn phi khun mt ngi l khun mt ngi. Hai ng dng cc c trng v
clutter. ng tm ngng loi b theo xc sut thng mu sc ca mt v ming xc nh cc vng ny.
qua bit s tuyn tnh Fisher (FLD). ng chiu d T nay p dng l thuyt eigenface chun ha
liu xung mt vector chiu, thng qua php chiu khng gian tm kim, ng thi dng khong cch
ny xc nh khunmt ngi. gia hai mt c kt qu cui cng.
Zhang [41] kt hp m hnh mu da ngi
phn on tm ng vin khun mt. ng xy dng
mng neural nh ca Rowley [49] quay khun

23
mt sau so khp cc mu c sn. Phng php o iu kin nh, c bit l v sng v cht
ny cho xc nh cc khun mt cc t th khc lng nh, cht lng thit b thu hnh.
nhau trong nh mu, thi gian x l s gim hn v o Trc to ca my nh so vi nh.
o Kch thc khc nhau ca cc khun mt
khng gian tm kim b thu hp. Tng t ngi, v c bit l trong cng mt nh.
Haizhou [40] cng dng phng php nh th o Mu sc ca mi trng xung quanh, hay mu
nhng thay i qu trnh xc nh. ng so khp mu sc qun o ca ngi c chp ly nh.
dng tm ng vin. Sau dng mng neural o Xut hin thnh phn khun mt hay khng.
phn lai ng vin no l khun mt ngi. o Nhiu khun mt c vng da dnh ln nhau.
Li [138] dng kernel hc nh l mt nh x Cc kh khn trn chng t rng bt c phng
phi tuyn, u tin ng dng KPCA (Kernel PCA) php gii quyt (thut tan) bi tan xc nh khun
chn cc c trng v khng gian c trng mt ngi s khng th trnh khi mt s khim
hc. Sau ng dng KSVC (Kernel Support khuyt nht nh. nh gi v so snh cc
Vector Classifier) kt hp FLD phn loi u l phng php xc nh mt ngi, ngi ta thng
khun mt ngi. da trn cc tiu ch sau:
Yin v Meng [107] dng phn on nh tnh o T l xc nh chnh xc l t l s lng cc
cht mu da ngi, t y tc gi tm c v tr ca khun mt ngi c xc nh ng t h
thng khi s dng mt phng php xy
mt theo tiu chun mc cn i xem cc
dng so vi s lng khun mt ngi tht s
vng ny l cc ng ng ca khun mt. T cc ng c trong cc nh (detection rate).
vin ny, tc gi so khp vi cc mu c sn xc o S lng xc nh nhm l s lng vng
nh ng vin no l khun mt ngi. T l chnh trong nh khng phi l khun mt ngi m
xc l 85%. h thng xc nh nhm l khun mt ngi
(false positives).
Lingmin [93] v Emanuele [142] dng l thuyt
o Thi gian thc hin l thi gian my tnh
hc theo xc sut xy dng m hnh xc nh xc nh khun mt ngi trong nh (running
khun mt ngi. Trong khi Emanuele dng c time).
trng Haar wavelet xc nh c trng ri dy
cho SVM phn loi. Lingmin dng ba lp: tng IV. KT LUN
cui dng mi tng quan hnh hc cc mu, tng Bi vit ny c gng cung cp mt ci nhn tng
gia xt tnh a dng ca cc thnh phn khun mt quan cc phng php xc nh khun mt ngi v
ngi vi HMM, v tng trn cng dng m hnh mt cch phn loi cc hng tip cn t nh xm
th cho quan h ba thnh phn to thnh mt hnh n nh mu da vo gn 300 bi bo, bo co ca
tam gic (xt tnh can i). cc phng nghin cu, v lun vn trn th gii.
Xc nh khun mt ngi trong nh l mt bi
III. KH KHN V THCH THC TRONG ton hp dn, thch thc nhiu ngi nghin cu
BI TON XC NH KHUN MT NGI v tnh ng dng to ln trong thc t. Hy vng vi
Vic xc nh khun mt ngi c nhng kh bi vit ny s gip nhiu ngi c c nhng kin
khn nht nh [57] nh sau, hnh 16: thc nht nh, gip nhiu ngi khng phi mt
o Hng (pose) ca khun mt i vi my nh, nhiu thi gian khi bt u nghin cu bi ton hp
nh: nhn thng, nhn nghing hay nhn t dn ny, cng nhau vn ra bin khi tri thc ca
trn xung. Cng trong mt nh c th c th gii.
nhiu khun mt nhng t th khc nhau.
o S c mt ca cc chi tit khng phi l c
trng ring ca khun mt ngi, nh: ru
quai nn, mt knh, .
o Cc nt mt (facial expression) khc nhau trn
khun mt, nh: vui, bun, ngc nhin, .
o Mt ngi b che khut bi cc i tng
khc c trong nh.

24
[7] Eun Yi Kim, Sin Kuk Kang, Keechul Jung, and
Hang Joon Kim, Eye Mouse: Mouse
Implementation using Eye Tracking, IEEE, 2005.
[8] Hichem Sabbi and Nozha Boujemaa, Coarse to Fine
Face Detection Based on Skin Color Adaption,
Biometric Authentication, LNCS 2359, pp. 112-120,
Springer-Verlag Berlin Heidelberg, 2002.
(a) (b) (c) [9] Ming-Hsuan Yang, David J. Kriegman, and
Narendra Ahuja, Detecting Faces in Images: A
Survey, IEEE Transaction on Pattern Analysis and
Machine Intelligent, vol. 24, no. 1, 2002.
[10] Douglas Chai and King N. Ngan, Face
Segmentation Using Skin-Color map in Videophone
(d) (e) Applications, IEEE Transaction on Circuits and
Systems for Video Technology, vol. 9, no. 4, 1999.
[11] Christophe Garcia and Georgios T zirita, Face
Detection Using Quantized Skin Color Region
Merging and Wavelet Packet Analysis, IEEE
Transaction on Multimedia, vol. 1, no. 3, 1999.
[12] Mark Everingham and Andrew Zisserman,
(e) (f) Automated Person Identification in Video, CIVR
2004, LNCS 3115, pp. 289-298, Springer-Verlag
Hnh 16: Cc kh khn ca vic xc nh mt ngi: Berlin Heidelberg, 2004.
(a) hng mt nghing; (b) mt knh en v nn; (c) nh b chi bi n;
(d) my nh t pha trn v sau lng ngi b chp; [13] Fuzhen Huang and Jianbo Su, Multiple Face
(e) vng da cc khun mt dnh nhau; Contour Detection Using adaptive Flows,
(f) mu mi trng xung quanh gn vi mu da ngi; Sinobiometrics 2004, LNCS 3338, pp. 137-143,
(g) cht lng nh km. Springer-Verlag Berlin Heidelberg, 2004.
[14] Seonghoon Kang and Seong-Whan Lee, Object
Detection and Classification for Outdoor Walking
TI LIU THAM KHO Guidance System, BMCV 2002, LNCS 2525, pp.
[1] Rein-Lien Hsu, Mohamed abdel-Mottaleb, and Anil 601-610, Springer-Verlag Berlin Heidelberg, 2002.
K. Jain, Face Detection in Color Images, IEEE [15] Shihong Lao and Masato Kawade, Vision-Based
Transaction on Pattern Analysis and Machine Face Understanding Technologies and Their
Intelligent, vol. 24, no. 5, pp. 696-706, 2002. Application, Sinobiometric 2004, LNCS 3338, pp.
[2] C. Garcia, G. Zikos, and G. Tziritas, Face Detection 339-348, Springer-Verlag Berlin Heidelberg, 2004.
in Color Images using Wavete Packet Analysis, [16] Stephen C. Y. Chan and Paul H. Lewis, A Pre-filter
Proc. of IEEE International Conference on Enabling Fast Frontal Face Detection, Visual99,
Multimedia Computing and System, vol. 1, pp. 703- LNCS 1614, pp. 777-785, Springer-Verlag Berlin
708, IEEE, 1999. Heidelberg, 1999.
[3] Saman Cooray and Noel OConnor, Facial Feature [17] Klaus J. Kirehberg, Oliver Jeorsky and Robert W.
Extraction and Principal Component Analysis for Frischholz, Genetic Model Optimization for
Face Detection in Color Image, ICIAR 2004, LNCS Hausdorff Distance-Based Face Localization,
3212, pp. 741-749, Springer-Verlag Berlin Biometric Authentication, LNCS 2359, pp. 103-111,
Heidelberg, 2004. Springer-Verlag Berlin Heidelberg, 2002.
[4] Satyanadh, Li Tao, and Vijayan Asari, Face [18] Stanley M. Bileschi and Bernd Heisele, Advances
Detection Technique Based on Intesity and Skin in Component-Based Face Detection, SVM 2002,
Color Distribution, International Conference on LNCS 2388, pp. 135-143, Springer-Verlag Berlin
Image Processing, IEEE, 2004. Heidelberg, 2002.
[5] Rainer Stiefelhagen, Jie Yang, and Alex Waibel, [19] Rainer Lienhart, Alexander Kuranov, and Vadim
Tracking Eyes and monitoring Eye Gaze, Pisarevsky, Empirical Analysis of Detection
Proceedings of the Workshop on Perceptual User Cascades of Boosted Classifiers for Rapid Object
Interfaces (PUI'97), Alberta, Canada. pp. 98-100, Detection, DAGM 2003, LNCS 2781, pp. 297-304,
1997. Springer-Verlag Berlin Heidelberg, 2003.
[6] Carlos Morimoto, Dave Koons, Amon Amir, and [20] Jean-Christophe Terrillon, Mahdad N. Shirazi,
Myron Flickner, Real-Time Detection of Eyes and Daniel McReynolds, Mohamed Sadek, Yunlong
Faces, In Proc. Workshop on Perceptual User Sheng, Shigeru Akamatsu, and Kazuhiko
Interfaces, pp. 117-120, 1998. Yamamoto, Invariant Face Detection in Color
Image Using Orthogonal Fourier-Mellin Moments
and Support Vector Machines, ICAPR 2001, LNCS

25
2013, pp. 83-92, Springer-Verlag Berlin Heidelberg, Analysis and Machine Intelligence, vol. 23, no. 7,
2001. IEEE, 2001.
[21] Hannes Kruppa, Martin A. Bauer, and Bernt Schiele, [34] Gines Garcia Mateos and Cristina Vicente Chicote,
Skin Patch Detection in Real-World Images, Face Detection on Still Images Using HIT Maps,
DAGM 2002, LNCS 2449, pp. 109-116, Springer- AVBPA 2001, LNCS 2091, pp. 102-107, Springer-
Verlag Berlin Heidelberg, 2002. Verlag Berlin Heidelberg, 2001.
[22] Jia Kui and Liyanage C. De Silva, Combined Face [35] M. Castrillon Santana, J. Lorenzo Navarro, J.
Detection/Recognition System for Smart Rooms, Cabrera Gamez, F. M. Hernandez Tejera, and J.
AVBPA 2003, LNCS 2688, pp. 787-795, Springer- Mendez Rodriguez, Detection of Frontal Faces in
Verlag Berlin Heidelberg, 2003. Video Streams, Biometric Authentication, LNCS
[23] Jin Ok Kim, Sung Jin Seo, Chin Hyun Chung, Jun 2359, pp. 91-102, Springer-Verlag Berlin
Hwang, and Woonglae Lee, Face Detection by Heidelberg, 2002.
Facial Features with Color Images and Face [36] Chang-Woo Lee, Yeon-Chul Lee, Sang-Yong Bak,
Recognition Using PCA, ICCSA 2004, LNCS and Hang-Joon Kim, Real-Time Face Detection and
3043, pp. 1-8, Springer-Verlag Berlin Heidelberg, Tracking Using PCA and NN, PRICAI 2002, LNAI
2004. 2417, pp. 615, Springer-Verlag Berlin Heidelberg,
[24] YoungOul Kim, SungHo Jang, SangJin Kim, chang- 2002.
Woo Park, and Joonki Paik, Pose-Invariant Face [37] Jie Chen, Yuemin Li, Laiyun Qing, Baocai Yin, and
Detection Using Edge-Like Blob Map and Fuzzy Wen Gao, Face Samples Re-lighting for Detection
Logic, IEA/AIE 2005, LNCS 3533, pp. 695-704, Based on the Harmonic Images, PCM 2004, LNCS
Springer-Verlag Berlin Heidelberg, 2005. 3332, pp. 585-592, Springer-Verlag Berlin
[25] Bernhard Froba and Walter Zink, On the Heidelberg, 2004.
Combination of Different Template Matching [38] Hazem El-Bakry, A Rotation Invariant Algorithm
Stategies for Fast Face Detection, MCS 2001, for Recognition, Fuzzy day 2001, LNCS 2206, pp.
LNCS 2096, pp. 418-428, Springer-Verlag Berlin 284-290, Springer-Verlag Berlin Heidelberg, 2001.
Heidelberg, 2001. [39] Kenji Iwata, Hitoshi Hongo, Kazuhiko Yamamoto,
[26] Henry Schneiderman, Statistical Approach to 3D and Yoshinori Niwa, Robust Facial Parts Detection
Object Detection Applied to Faces and Cars, by Using Four Directional Features and Relaxation
Ph.D. Thesis, Carnegie Mellon University, 2000. Matching, KES 2003, LNAI 2774, pp. 882-888,
[27] Kang Ryoung Park, Gaze Detection System by Springer-Verlag Berlin Heidelberg, 2003.
Wide and Auto Pan/Tilt Narrow View Camera, [40] Ai Haizhou, Liang Luhong, and Xu Guangyou, A
DAGM 2003, LNCS 2781, pp. 76-83, Springer- General Framework for Face Detection, ICMI
Verlag Berlin Heidelberg, 2003. 2000, LNCS 1948, pp. 119-126, Springer-Verlag
[28] Michael Elad, Yacov Hel-Or, and Renato Keshet, Berlin Heidelberg, 2000.
Pattern Detection Using Maximal Rejection [41] Hongmin Zhang, Debin Zhao, Wen Gao, and Xilin
Classifier, LNCS 2059, Springer-Verlag Berlin Chen, Combining Skin Color Model and Neural
Heidelberg, 2001. Network for Rotation Invariant Face Detection,
[29] Gil Friedrich and Yehezkel Yeshurun, Seeing ICMI 2000, LNCS 1948, pp. 237-244, Springer-
People in the Dark: Face Recognition in Infrared Verlag Berlin Heidelberg, 2000.
Images, BMCV 2002, LNCS 2525, pp. 348-359, [42] Rob McCready, Real-Time Face Detection on a
Springer-Verlag Berlin Heidelberg, 2002. Configurable Hardware System, FPL 2000, LNCS
[30] Hong-Mo Je, Daijin Kim, and Sung Yang Bang, 1896, pp. 157-162, Springer-Verlag Berlin
Human Face Detection in Digital Video Using Heidelberg, 2000.
SVM Ensemble, Neural Processing Letter 17:239- [43] J. Fritsch, S. Lang, M. Kleinehagenbrock, G. A.
252, Kluwer Academic Publishers, Netherlands, Fink, and G. Sagerer, Improving Adaptive Skin
2003. Color Segmentation by Incorporating Results from
[31] Jongmoo Choi, Sanghoon Lee, Chilgee Lee, and Face Detection, In Proc. IEEE Int. Workshop on
Juneho Yi, PrimeEye: A Real-Time Face Detection Robot and Human Interactive Communication,
and Recognition System Robust to Illumination Germany, IEEE, 2002.
Changes, AVBPA 2001, LNCS 2091, pp. 360-365, [44] Raquel Montes Diez, Cristina Conde, and Enrique
Springer-Verlag Berlin Heidelberg, 2001. Cabello, Automatic Detection of the Optimal
[32] A Jonathan Howell and Hilary Buxton, Visual Acceptance Threshold in a Face Verification
Mediated Interaction Using Learnt Gestures and System, BioAW 2004, LNCS 3087, pp. 70-79,
Camera Control, GW 2001, LNAI 2298, pp. 272- Springer-Verlag Berlin Heidelberg, 2004.
284, Springer-Verlag Berlin Heidelberg, 2002. [45] Henry Nicponski, Understanding In-Plane Face
[33] Daniel Keren, Margarita Osadchy, and Craig Rotations Using Integral Projections, ICIAR 2004,
Gotsman, Antifaces: A Novel, Fast Method for LNCS 3213, pp. 633-642, Springer-Verlag Berlin
Image Detection, IEEE Transaction on Pattern Heidelberg, 2004.

26
[46] Tony W.H. Ao leong and Raymond S.T. Lee, iJade [58] Juwei Lu, K.N. Plataniotis, A.N. Venetsanopoulos,
Face Recognizer A Multi-agent Based Pose and and Stan Z. Li, Ensenble-based Discriminant
Scale Invariant Human Face Reconition System, Learning with Boosting For Face Recognition,
KES 2004, LNAI 3214, pp. 594-601, Springer- IEEE Transactions on Neural Networks, vol. 17, pp.
Verlag Berlin Heidelberg, 2004. 166-178, 2006.
[47] Jin Ok Kim, Jin Soo Kim, and Chin Hyun Chung, [59] Rogerio S. Feris, Jim Gemmell, Kentaro Toyama,
Face Region Dectection on Skin Chrominance from and Volker Kruger, Hierarchical Wavelet Networks
Color Images by Facial Features, ICADL 2004, for Facial Feature Localization, Proceedings of
LNCS 3334, pp. 646, Springer-Verlag Berlin Fifth IEEE International Conference on Automatic
Heidelberg, 2004. Face and Gesture Recognition, pp. 118-123, 2002.
[48] Henry A. Rowley, Shumeet Baluja, and Takeo [60] Baochang Zhang, Gabor-Kernel Fisher Analysis for
Kanade, Neural Network-Based Face Detection, Face Recognition, LNCS 3332, Springer-Verlag
IEEE Trans. Pattern Analysis and Machine Berlin Heidelberg, 2004.
Intelligence, vol. 20, no. 1, pp. 23-38, IEEE, 1998. [61] Paola Campadelli, Raffaella Lanzarotti, Chiara
[49] Henry A. Rowley, Shumeet Baluja, and Takeo Savazzi, A feature-based face recognition system,
Kanade, Rotation Invariant Neural Network-Based Proceedings of 12th International Conference on
Face Detection, Proc. IEEE Conf. Computer Vision Image Analysis and Processing, pp. 68-73, IEEE,
and Pattern Recognition, pp. 38-44, 1998 2003.
[50] Stephanie Jehan-Besson, Michel Barlaud, and Gilles [62] Julien Meynet, Vlad Popovici, and Jean-Philippe
Aubert, DREAMS: Deformable Region Driven by Thiran, Face Detection with Mixtures of Boosted
an Eulerian Accurate Minimization Method for Discriminant Features, Technical Report,
Image and Video Segmentation (Application to Face http://biomed.epfl.ch/index.php?module=people&PE
Detection in Color Video Sequences), ECCV 2002, OPLE_MAN_OP=view&PHPWS_MAN_ITEMS%
LNCS 2352, pp. 365-380, Springer-Verlag Berlin 5B%5D=14&show_publications=1, 2005.
Heidelberg, 2002. [63] Filareti Tsalakanidou, Sotiris Malassiotis, and
[51] Hongming Zhang and Debin Zhao, Spatial Michael G. Strintzis, Face Localization and
Histogram Features for Face Detection in Color Authentication Using Color and Depth Images,
Images, PCM 2004, LNCS 3331, pp. 377-384, IEEE Transactions on Image Processing, vol. 14, no.
Springer-Verlag Berlin Heidelberg, 2004. 2, IEEE, 2005.
[52] Paul Viola, Robust Real-Time Face Detection, [64] Yasunori Ishii, Kazuyuki Imagawa, Eiji Fukumiya,
International Journal of Computer Vision 57(2), 137- Katsuhiro Iwasa, Yasunobu Ogura, Profile Face
154, Kluwer Academic Publishers, Netherlands, Detection using Block Difference Feature For
2004. Automatic Image Annotation, International
[53] Kim-Fung Jang, Ho-Man Tang, Michael R. Lyu, and Conference on Consumer Electronics ICCE '06.
Irwin King, A Face Processing System Based on 2006 Digest of Technical Papers, IEEE, 2006.
Committee Machine: The Approach and [65] Jong-Bae Kim, Su-Woong Jung, and Hang-Joon
Experimental Results, CAIP 2003, LNCS 2756, pp. Kim, Face Detection by Integrating
614-622, Springer-Verlag Berlin Heidelberg, 2003. Multiresolution-Based Watersheds and Skin-Color
[54] James W. Davis and Serge Vals, A Perceptual User Model, IEA/AIE 2002, LNAI 2358, pp. 715-724,
Interface for Recognizing Head Gesture Springer-Verlag Berlin Heidelberg, 2002.
Acknoledgements, ACM International Conference [66] Gary G. Yen and Nethrie Nithianandan, Automatic
Proceeding Series, vol. 15, pp. 1-7, 2001. Facial Feature Extraction Using Edge Distribution
[55] Chengjun Liu and Harry Wechsler, Gabor Feature and Genetic Search, International Journal of
Based Classification Using the Enhanced Fisher Computational Intelligence and Applications, vol. 3,
Linear Discriminant Model for Face Recognition, no. 1, pp. 89-100, Imperial College Press, 2003.
IEEE Transactions on Iamge Processing, vol. 11, no. [67] Hideaki Sato, Katsuhiro Sakamoto, Yasue
4, IEEE, 2002. Mitsukura, and Norio Akamatsu, Face Edge
[56] Wenchao Zhang, Shiguang, Wen Gao, Xilin Chen, Detection System by Using the GAs, KES 2004,
and Hongming Zhang, Local Gabor Binary Pattern LNAI 3213, pp. 847-852, Springer-Verlag Berlin
Histogram Sequence (LGBPHS): A Novel Non- Heidelberg, 2004.
Statistical Model for Face Representation and [68] J.S. Taur and C.W. Tao, A New Neuro-Fuzzy
Recognition, Proceedings of the 10th IEEE Classifier with Application to On-Line Face
International Conference on Computer Vision Detection and Recognition, Journal of VLSI Signal
(ICCV05), IEEE, 2005. Processing 26, 397-409, Kluwer Academic
[57] Peter N. Belhumeur, Ongoing Challenges in Face Publishers, the Netherlands, 2000.
Recognition, Frontiers of Engineering: Reports on [69] Sang-Burm Rhee and Young-Hwan Lee, Improved
Leading-Edge Engineering from the 2005, Face Detection Algorithm in Mobile Enviroment,
Symposium (2006). ICCS 2004, LNCS 3036, pp. 638-686, Springer-
Verlag Berlin Heidelberg, 2004.

27
[70] Douglas Chai and Kim N. Ngan, Locating Facial [83] Jianzhong Fang and Guoping Qiu, Learning an
Region of a Head-and-Shoulders Color Image, Information Theoretic Transform for Object
Proc. Third Intl Conf. Automatic Face and Gesture Detection, ICAR 2004, LNCS 3211, pp. 503-510,
Recognition, pp. 124-129, 1998. Springer-Verlag Berlin Heidelberg, 2004.
[71] Bae-Ho Lee, Kwang-Hee Kim, Yonggwan Won, and [84] T. Darrell, G. Gordon, M. Harville, and J. Woodfill,
Jiseung Nam, Efficient and Automatic Faces Integrated Person Tracking Using Stereo, Color,
Detection Based on Skin-Tone and Neural Network and pattern Detection, International Journal of
Model, IEA/AIE 2002, LNAI 2358, pp. 57-66, Computer Vision 37(2), 175-185, Kluwer Academic
Springer-Verlag Berlin Heidelberg, 2002. Publishers, the Netherlands, 2000.
[72] Shu-Fai Wong and Kwan-Yee Keneth Wong, Fast
Face Detection Using QuadTree Based Color
Analysis and Support Vector Verification, ICIAR
2004, LNCS 3212, pp. 676-683, Springer-Verlag
Berlin Heidelberg, 2004.
[73] Henry Schneiderman and Takeo Kanade,
Probabilistic Modeling of Local Appearance and
Spatial Relationship for Object Recognition,
Proceedings of IEEE Computer Society Conference
on Computer Vision and Pattern Recognition, pp.
45-51, IEEE, 1998.
[74] Gines Garcia_Mateos, Alberto Ruiz, and Perdo E.
Lopez-de-Teruel, Face Detection Using Integral
Projection Models, SSPR&SPR 2002, LNCS 2396,
pp. 644-653, Springer-Verlag Berlin Heidelberg,
2002.
[75] QingHua Wang, Luis Seabra Lopes, and David M.J.
Tax, Visual object recognition through one-class
learning, Image analysis and recognition:
international conference ICIAR, vol. 1, pp. 463-470,
2004.
[76] Selin Baskan, M. Mete Bulut, Volkan Atalay,
Projection based method for segmentation of
human face and its evaluation, Pattern Recognition
Letters 23, pp. 1623-1629, Elsevier Science, 2002.
[77] Thang V. Pham. Marcel Worring, and Arnold W.M.
Smeulders, Face Detection by Aggregated Bayesian
Network Classifiers, MLDM 2001, LNAI 2123, pp.
249-262, Springer-Verlag Berlin Heidelberg, 2001.
[78] Shou-Der Wei and Shang-Hong Lai, An Efficient
Algorithm for Detecting Faces from Color Images,
PCM 2002, LNCS 2532, pp. 1177-1184, Springer-
Verlag Berlin Heidelberg, 2002.
[79] Yen-Yu Lin, Tyng-Luh Liu, and Chiou-Shann Fuh,
Fast Object Detection with Occlusions, ECCV
2004, LNCS 3021, pp. 402-413, Springer-Verlag
Berlin Heidelberg, 2004.
[80] Kyongpil Min, Junchul Chun, and Goorack Prak, A
Nonparametric Skin Color Model for Face Detection
from Color Images, PDCAT 2004, LNCS 3320, pp.
116-120, Springer-Verlag Berlin Heidelberg, 2004.
[81] Herve Abdi, A Generalized Approach for
Connectionist Auto-Associative Memories:
Interpretation, Implication & Illustration for Face
Processing, Artificial intelligence and cognitive
sciences, Manchester University Press. pp. 149-165,
Manchester, 1988.
[82] Jianping Fan, David K.Y. Yau, Ahmed K.
Elmagarmid, an Walid G. Arref, IEEE Transactions
on Image Processing, vol. 10, no. 10, IEEE, 2001.

28

You might also like