You are on page 1of 37

I HC QUC GIA THNH PH H CH MINH TRNG I HC CNG NGH THNG TIN -------------------------------

TRNH NH DUY

RT TRCH THNG TIN T KHUN MT V TI HIN LI TRN M HNH 3D

Chuyn ngnh : KHOA HC MY TNH M s : CH0601013

LUN VN THC S CNG NGH THNG TIN

NGI HNG DN KHOA HC: PGS.TS. L HOI BC

Thnh ph H Ch Minh Nm 2009

Li Cm n
(VIT SAU)

Li Cam oan
` (VIT SAU)

Mc Lc
Trang Trang Ph Ba Li Cm n..............................................................................................................1 Li Cam oan..........................................................................................................2 Mc Lc...................................................................................................................3 Danh Mc Cc K Hiu, Cc Ch Vit Tt..............................................................5 Danh Mc Cc Bng.................................................................................................6 Danh Mc Cc Hnh.................................................................................................7 M U..................................................................................................................8 Chng 1 - TNG QUAN......................................................................................12 1.1. Gii Thiu.....................................................................................................12 1.1.1. Do tim khun mt..................................................................................13 1.2. Nhng Thch Thc.......................................................................................13 1.3. Phat Biu Bai Toan.......................................................................................14 1.4. Pham Vi Tai.............................................................................................15 1.5. Nhng ng Gp.........................................................................................15 1.6. Cu Truc Lun Vn.......................................................................................15 Chng 2 - RUT TRICH THNG TIN KHUN MT..........................................16 2.1. Do Tim Khun Mt.......................................................................................16 2.2. Xac inh vung cn quan tm.........................................................................17 2.3. Rut Trich Thng Tin T Khun Mt.............................................................18 2.3.1. Nhng c Trng da trn Gn Sng (Wavelet-based Features)..........18 2.4. Haar cascades file.........................................................................................22 2.5. C s d liu anh..........................................................................................24 Chng 3 - XY DNG CHNG TRINH..........................................................25 Chng 4 - TH NGHIM.....................................................................................26 4.1. B D Liu Th Nghim..............................................................................26

Chng 5 - KT LUN..........................................................................................27 5.1. Kt Qua.........................................................................................................27 5.2. Khuyn Nghi.................................................................................................27 TAI LIU THAM KHAO.......................................................................................28

Danh Mc Cc K Hiu, Cc Ch Vit Tt


Computer vision: Thi giac may tinh coarse-to-ne: t th x n tinh vi a multi-stage approach: mt phng php gm nhiu giai on & nnk: nhng ngi khc (nnk) SVM: Support Vector Machine frontal view: pha trc multi view: nhiu gc nhn nh nhng hng xoay khc nhau state-of-the-art: tinh xao scale: ty l recall paper: a low false positive rate: ty l xac thc sai s thp A weak classifier robust integral image filters alignment: phn oan - segmentation (nh phn oan khun mt) classifier: b phn lp

Danh Mc Cc Bng

Danh Mc Cc Hnh
Hinh 1-1: M hinh cac bc xy dng...............................................................14 Hinh 2-2: Chia vung khun mt thanh nhng vung nho hn thao tac. (a) Xac inh vung mt (b) Xac inh vung ming da trn khoang cach mt (Eye Distance ED)..........................................................................................................................17 Hinh 2-3: Nhng im c trng trn khun mt....................................................18 Hinh 2-4: Nhng c tnh ca gn sng Haar .........................................................19 Hinh 2-5: Cach tao ra anh tich phn........................................................................21

M U
Vi nhng nghin cu lin quan n s tng tac gia khun mt va may tinh se giup ich rt nhiu cho nhng ngi khuyt tt, nhng ng dung an ninh, truy tim ti pham, cung nh ngay nay cang nhiu hinh anh, oan phim dai c chia se va co nhu cu truy tim thng tin tr nn cn thit, mt trong nhng hng co th tip cn la da vao nhng c trng khun mt, va bai toan rut trich c trng la nhng bc c ban cho hng nghin cu nay. Cung nh t nhng bc c ban nay, chung ta phat trin nhng nghin cu v nhn dang biu cam, nen anh, hiu cach ra hiu cua mi (lip-reading) . Ngay nay, vi s phat trin cua webcam th nhu cu ca vic ng dung thng minh vo h thng an ninh cang tr nn cp thit. iu ny giup cho h thng t ng nhn dang cc i tng mt v tr nht nh mt cch d dang. Trong mt thp k qua, vn sinh trc hoc nh mng mt, vn tay, khun mt trong lnh vc an ninh tr thnh mt trong nhng ch quan trong cua cac chnh ph va nhng nh nghin cu trn th gii. Tuy nhin, co c kt qua phn tich chinh xac cao trong sinh trc hoc nh mng mt hay vn tay, oi hoi phai co s hp tac cht che t phia i tng c phn tich. Chng han i vi vn tay, ngi cn phn tich phai tun thu cac iu kin la tay khng c t, cung nh cach quet ln h thng o phai u manh va u thi h thng mi nhn dang c. Bn canh o, vic phn tich mng mt oi hoi i tng cn phn tich phai a mt cua minh vao ung vi tri ma h thng yu cu, mt khac co c hinh anh mng mt cht lng tt thi cn phai co mt cng ngh tin tin vi chi phi cao thi h thng mi co ung d liu u vao phn tich. iu nay gy kho khn cho vic ng dung cng ngh phn tich vn tay va mng mt vao thc t. Do o, ngi ta a chu y n nhng yu t khac cua sinh trc hoc va khun mt la i tng thu hut c s quan tm cua nhiu ngi, vi cng ngh phn tich khun mt khng oi hoi cac iu kin khc khe nhng vn cho ta kt qua phn tich mc chp nhn c.

Mc khac, tng lai trong vic xy dng nhng engine tim kim hinh anh, video ap ng c nhu cu thc t tr thanh mt thach thc ln. Nu nh cac engine tim kim hin nay, d liu u vao la chui cac keyword tim kim, thi tim kim multimedia cn nhng keywork trong hinh anh (thinh thoang co ngi goi la visual words). Do o, thach thc nay lin quan n hiu ni dung hinh anh, TRECVID - high level feature extraction task (hay cn gi l concept detection) chnh l mt trong nhng nhim vu gii quyt vn ny. Nhng kt qua con kha xa, vi vy chung ta co th tp trung pham vi hep, o la con ngi, xy dng bai toan do tim/nhn dang khun mt. Vi vic rut trich thng tin t khun mt, y la bc c ban cho nhiu ng dung thc t. cung a chi ra nhng ng dung nh: nhn dang khun mt (face recognition), theo doi khun mt (face tracking), phn tich biu l khun mt, theo doi cai nhin chm chm (gaze tracking), hiu cach ra hiu cua mi (lip-reading) Cng nh trong mt kho st ca ch ra s quan trng ca vic nh v mt, y l l bc khi u cho nhiu k thut x ly khun mt. Nh chung ta bit, sai s trong giai oan inh vi nhng c tinh anh hng rt ln n ty l nhn dang ma chung ta khng th bit trc c. o la ly do tai sao cn thit at c qua trinh rut trich nhanh, t ng, va chinh xac nhng c tinh mong mun trc khi ng dung mt ky thut nhn dang khun mt. inh vi c tinh trn khun mt t ng a c th thach lu dai trong linh vc thi giac may tinh trong nhiu thp ky qua. iu nay co th c giai thich bi kha nng rng ln, mt khun mt trong mt bc hinh co th co c la nh nhng yu t nh vi tri, biu cam, t th, chiu sang va nn ln xn. S xut hin cua khun mt thay i quyt lit do biu cam, t th, chiu sang, (expression [Peter & nnk, 1997], pose [Georghiades & nnk, 2001] and illumination [Adini & nnk, 1997]) tui tac, va nhng thay i hinh tng (chng han ru, kinh). S bin i ln trong s xut hin cua khun mt lam nhng thut toan nhn dang trn nhng bc hinh va video tr thanh mt thach thc. Cn xy dng mt phng phap nhanh gii quyt nhng vn kh khn ny.

10

Va y la c hi tip tuc nghin cu trong linh vc x ly hinh anh nay. T nhng thng tin rut trich c, co th ng dung trong linh vc an ninh (chng han tim ra nhng tn trm trong h s lu tr thng tin con ngi), nen anh, va nhng tng tac gia con ngi va may tinh. Trong lun vn ny, ti tp trung vo vn rt rt trch c nhng thng tin trn khun mt, chng ti a ra mt phng php gm nhiu giai on gim thi gian d tm trong khi vn m bo t l d tm cao. u tin chng ta cn phi xem xt phng php d tm khun mt nhanh, gii hn li vng cn rt trch. Tip n, chng ta p dng mt phng php rt trch trn vng va d tm xc nh c nhng im cn quan tm trn khun mt nh mt, mi, ming. Trong giai on d tm khun mt, ti ch xem xt nhng khun mt pha trc (frontal view) v ti xut phng php Adaboost , bi phng php ny cho kt qu nhanh. V giai on sau, ti s tp trung vo xem xt mt trong hai phng php SVM, Gabor, Kalman cho qu trnh rt trch nhng thng tin trn khun mt vi chinh xac cao. ----------------------------------------------------------------------------Nen anh Nhng tng tac gia may tinh va con ngi. animation development trong game [47] For example, many car companies (currently in Japan and soon in the US and Europe) are installing cameras in the dashboard with the goal of detecting angry, drowsy, or drunk drivers. Similarly, advertisers on web portals are seeking to use facial information to determine the effect of specic billboards and logos, with the intention of dynamically changing the appearance of a website in response to users emotions regarding the

11

advertisements. Moreover, video game companies are interested in assessing the players emotions during game play to help gauge the success of their products. The extraction of facial feature point, for example, eyes, nose, mouth corners and others, is an important stage in many facial image interpretation tasks such as face verification, face expression recognition, model based image coding and head pose determination Active Shape Model (ASM) [7] proposed by Cootes

12

Chng 1 - TNG QUAN


1.1. Gii Thiu Nhng thch thc trong nhng nm qua l xy dng nhng ng dng tm kim nhng d liu a phng tin sn sng cho ngi dng. Vic x l khun mt, cng nh rt trch thng tin t khun mt ng vai tr quan trng. Bi l, hin nay vn tm kim thng tin t d liu video vn cn b ng, v mt trong nhng hng tip cn l tm kim da trn khun mt. Trong cng trnh ny, tc gi a ra nhng phng php rt trch t ng v t chc s lng ln nhng khun mt cho qu trnh lp ch mc, t ly d liu video. Hin nay, Adaboost l mt phng php ph bin cho vic d tm khun mt. N cng c OpenCV s dng. Nguyn Thnh Thi (2006), Nhn Dng Mt Ngi Dng SVM v Mng Nron, Lun Vn Thc S, DHCNTT TpHCM, TpHCM. [tm tt] kt hp gia SVM v mng nron nhn dng khun mt. Pham Th Bao & nnk (2006), Tng quan cac phng php xac inh khun mt. a phat trin mt h thng cho php t ng xc nh c nhiu khun mt cng lc vi cc thnh phn mt mi ming v t c chnh xc kh cao bi vic loai cac thanh phn gy nhiu. xy dng mt chng trnh d tm khun mt v mt da trn th vin m OpenCV. Nhng chnh xc vn khng cao. a co nhng khao sat trong linh vc nhn dang, ng va cac ng nghip a chi ra rng nhng phng php xac thc da trn mt khu hay ky hiu thi qua d pha v. Nhng phng php sinh trc hoc la mt la chon hp ly nhng cung co nhng mt han ch. Phng phap quet mng mt rt tin cy nhng ap t ngi s dung qua nhiu, chi phi mc thc hin va khng c chp nhn bi nhiu ngi. Nhn dang vn tay c chp nhn mang tinh cht xa hi, nhng khng th ng dung i vi nhng ngi khng tan thanh.

13

Ngc lai, nhn dang khun mt tng ng vi mt s thoa hip gia s chp nhn co tinh cht xa hi va tinh cy.

Trong nhiu thp ky qua, nhng tin b chinh a xut hin trong nhn dang vi nhiu h thng co kha nng at c ty l nhn dang hn 90%. Tuy nhin, trong ng canh cua th gii thc vn tn tai thach thc, bi nhng x ly khun mt co th chiu dng mt s bin i ln. Trong qua trinh rut trich thng tin khun mt, bc u tin cn phai xac inh khun mt. Trong nghin cu nay, chung ta xem xet phng phap 1.1.1. Do tim khun mt Pham Th Bao & nnk, 2006 a co khao sat tng quan v nhng phng phap xac inh khun mt. 1.2. Nhng Thch Thc Bai toan rut trich thng tin t khun mt la mt vn kho trong linh vc thi giac may tinh (computer vision) trong nhiu nm qua. iu nay co th giai thich bi nhng thay i ln cua khun mt trong mt khung canh qua nhng yu t nh: S biu cam cua khun mt (expression [Peter & nnk, 1997]): co anh hng ang k ln cac thng s cua khun mt. Chng han, cung mt khun mt mt ngi, nhng se rt khac khi ho ci hay bun, T th, goc nhin (pose [Georghiades & nnk, 2001]: nhng t th khac nhau co th se lam khut mt phn mt, mui, ming hoc thm chi khut ht. Gy kho khn cho vic rut trich nhng im c trng. Nhng thay i chiu sang illumination [Adini & nnk, 1997]): nhng bin i chiu sang do nhng thuc tinh phan xa vi da, va do tinh cht camera se anh hng n cht lng anh. Tui tac, nhng thay i hinh dang (chng han s che khut cua ru, kinh): lam bai toan tr nn phc tap hn, phai xem xet nhiu trng hp hn. phn giai khac nhau (resolution):

14

S a dang hoa cua mau da Cac khun mt dinh vao nhau trong cung 1 bc anh

Rut trich c tinh ming vi chinh xac cao, theo thi gian thc. Bi mt trong nhng ng dung cua no la oc mi (lip reading), chng han nh trong 1.3. Phat Biu Bai Toan T mt anh vi kich thc nao o, chung ta tim mt phng phap xac inh nhanh trong anh o co khun mt hay khng. Sau o tim phng phap rut trich c trng cua khun mt o va kt hp ky thut 3D tai hin nhng c ng trn m hinh 3D o.
Nhn dng khun mt (Face detection FD) Rt trch nhng c tnh (Feature extraction - FE) Ti hin trn m hnh 3D

Hinh 1-1: M hinh cac bc xy dng

Vic nh v mt cng gp kh khn nh s c mt ca knh cng l mt thch thc ln cho vic nghin cu. Chung ta cn xem xet xac inh mui trc khng? ASM co th do tim nhng im rut trich nhanh, nhng AAM thi sao? Mt iu na la xem xet bai toan mc chinh xac cao, c bit la vi thng tin ming, y co th ng dung nhiu vao nganh giai tri 3D.

15

1.4. Pham Vi Tai Trong tai nay, chung ti tp trung tim kim va xut mt thut toan hiu qua cho vic rut trich c trng anh nh mt, mui, ming. Do s phc tap cua bai toan a noi trong muc 1.2, chung ti a ra nhng gia inh va rang buc sau nhm giam phc tap cua bai toan: Anh khun mt phia trc (frontal view) iu kin anh sang binh thng

1.5. Nhng ng Gp Tim hiu mt phng phap manh cho qua trinh rut trich thng tin khun mt vi mc chinh xac cao, ng thi cung ap ng x ly nhanh theo thi gian thc (realtime). Va xy dng ng dung minh hoa thy c kha nng ng dung cua ky thut ma ti tim hiu, nghin cu.

1.6. Cu Truc Lun Vn Lun vn nay c t chc nh sau:

16

Chng 2 - RUT TRICH THNG TIN KHUN MT


Nhng thng tin khun mt c hiu nh nhng im c trng, nhng im ni bt cua khun mt nh nhng goc cua mt, nhng goc cua lng may, nhng goc va nhng im gia ngoai cua mi, nhng goc cua l mui, inh cua mui (tip of the nose), inh cua cm (tip of the chin) (xem Hinh 2-3). inh vi nhng im c trng trn khun mt la giai oan quan trong cho nhiu cng trinh lin quan n x ly anh khun mt. y, trong nhiu cng trinh xut chon mt thut toan do tim khun mt nhanh va manh, o la mt phin ban cua do tim khun mt Viola-Jones . Sau o, nghi chia vung khun mt a do tim ra thanh 20 vung cn quan tm (regions of interest) inh vi nhng im c trng nay. Chung ta xem xet tng giai oan cho qua trinh rut trich thng tin nh sau: 2.1. Do Tim Khun Mt Trong giai on nay, chng ta cn xem xt hng, t l khc nhau cua khun mt. Tim phng php do tim nhanh, manh khun mt phia trc (front face), nu tim khun mt nhiu goc nhin khac nhau (multi-view face) cang tt. Nhng phai trn tiu chi nhanh, lam tin tt cho giai oan rut trich. Phng phap xut la s dung Adaboost do tim khun mt hng phia trc (frontal) . y la phng phap xac inh khun mt nhanh va manh, a c ViolaJones a ra. Vic do tim se thc hin trn mi khung hinh theo thi gian thc, co chinh xac cao va hiu sut thc hin nhanh. Cung co phng phap khac vi tin cy cao hn bi vic do tim khun mt nhiu goc nhin , nhng kt qua cua no se khng u thng tin nh mt, mui, ming, ap ng cho giai oan rut trich.

17

2.2. Xac inh vung cn quan tm Xc nh mt trc l mt li th? Nhng nu eo kinh hay mt nhm thi chuyn gi xay ra?

Hinh 2-2: Chia vung khun mt thanh nhng vung nho hn thao tac. (a) Xac inh vung mt (b) Xac inh vung ming da trn khoang cach mt (Eye Distance ED)

Chung ta chia vung khun mt ra lam 2 phn: phn trn cha mt va phn di cha ming. Vung trn ta lai chia ra lam 2 phn theo chia ngang xac inh vi tri mt cho chinh xac. Khi chung ta bit vi tri x, y cua mt, chung ta co th xac inh c goc nghing cua khun mt va co th cn thit la xoay khun mt v thng ng vi goc nghing o tin li cho vic rut trich.

18

2.3. Rut Trich Thng Tin T Khun Mt

Hinh 2-3: Nhng im c trng trn khun mt

Thc vy, trong khao sat cua a chi ra s quan trong cua inh vi mt trc, cung nh giup c lng c ty l (scale) cua khun mt. Chung ta se xem xet bai vit nay , cung nh kt hp . a chi ra vai phng phap inh vi nhng im c tinh trn khun mt: 1. Phng phap da vao c tinh (feature based approach) 2. Phng phap da vao mu (template based approach) Chung ta cn phn bit gia nhng phng phap chi oi hoi inh vi nhng c tinh nhanh (rough) va nhng cai oi hoi anh gia nhng im trn khun mt mt cach chinh xac. [Bartlett, 2005] So sanh LDA va SVM trong vic chon c trng (feature selection)

chi ra mt phng phap rut trich nhng c trng khu vc ming. y co th m rng cho h thng nhn dang li noi mt cach t ng (automatic speech recognition ASR) rng mt s cng trinh a tht bai do chi x ly trong m thanh vi iu kin ting n. 2.3.1. Nhng c Trng da trn Gn Sng (Wavelet-based Features) According to the study of C. Papageorgiou [70], the wavelet coecients preserve all the information in the original image,

19

but the coding of the visual information diers from the pixel-based representation Theo nghin cu ca C. Papageorgiou, nhng h s gn sng (wavelet) bo tn c tt c thng tin ca hnh nh ngun, nhng vic m ho ca nhng thng tin nhn thy c khc vi vic biu din da vo im nh (pixel) trong 2 cch: lm nhng bin ca lp bn trong ti thiu, v ca lp bn ngoi ti a mt cch ng b. u tin, s khc nhau cng trung bnh gia nhng vng cc b theo nhng hng khc nhau c m ho trong mt khung a t l. Nhng rng buc trn nhng gi tr ca gn sng c th biu din nhng c tnh nhn thy c ca lp i tng: phn ng mnh t gn sng ch ra s hin din ca s khc nhau mnh m, hoc bin gii t gi sng trong bc nh, trong khi phn ng yu t gn sng ch ra khu vc tng ng. Th hai, vic s dng nhng nn tng hon thin, v d, nn tng Haar, cho php chng ta lan truyn nhng rng buc gia nhng vng ln cn v m t nhng mu phc tp. Bin i gn sng mt gn bn ln cung cp phn gii cao v dn n mt b t in phong ph, hon thin ca nhng c tnh.

2.3.1.1. Haar wavelet

Hinh 2-4: Nhng c tnh ca gn sng Haar

Nhng c trng Haar (Haar like features) la gi?

20

[OpenCVWiki]Nhng c trng Haar ma hoa s tn tai cua nhng tng phan c inh hng gia nhng vung trong anh. Mt b cua nhng c trng nay co th c s dung ma hoa nhng tng phan cua mt khun mt va nhng lin h khng gian gia chung. u tin, mt b phn lp (classifier) la mt cai thac (cascade) cua nhng b phn lp nng cao dn, c hun luyn vi vai trm nhng mu cua i tng cn quan tm (chng han khun mt, xe ), c goi la nhng mu khng inh (positive), nhng mu nay c co gian v cung mt kich thc (20x20) va nhng mu phu inh (nagetive), la nhng mu gn ging vi i tng quan tm nhng khng phai i tng o, cung co cung kich thc. Sau khi b phn lp c hun luyn, no co th thc hin trn nhng vung quan tm trong mt anh u vao. B phn lp se xut ra 1 nu vung o ging i tng quan tm va ngc lai la 0. tim kim i tng trong toan b anh, chung ta phai di chuyn mt ca s tim kim doc theo anh kim tra mi vi tri bi vic s dung b phn lp o. B phn lp phai c thit k co th d dang thay i kich thc cho vic tim kim nhng i tng quan tm vi nhng kich thc khac nhau, iu nay hiu qua hn vic thay i chinh kich thc cua anh. Va mt iu na la vic tim kim nhng i tng vi kich thc cha bit, chung ta cn co mt thut toan quet qua vai ln trn anh vi nhng ty l co gian khac nhau. Mt cai thac cua nhng b phn lp nghia la b phn lp kt qua gm vai b phn lp n gian hn thc hin trn vung quan tm cho n khi tai mt giai oan nao o, i tng bi t chi bi b phn lp cua giai oan tng ng, thi nhng giai oan sau o bi bo qua. Va tai mi giai oan cua cai thac phn lp o, b phn lp sau se phc tap hn b phn lp trc va c xy dng da trn out of basic classifiers bi vic s dung mt trong bn thut toan bu chon trong s (weighted voting) - Discrete Adaboost, Real Adaboost, Gentle Adaboost and Logitboost. Nhng b phn lp c ban la nhng b phn lp cua cy quyt inh vi it nht hai la. Nhng c tinh Haar la u vao i vi nhng b phn lp c ban. c trng a

21

s dung trong mt b phn lp c thu c m ta hinh dang, vi tri trong vung quan tm va mt ty l co gian. Anh tich phn (Integral image): [Chesnokov Yuriy, 2008] La mt anh c x ly trc nhm tng cng cho vic rut trich nhng c trng Haar. Tai mi im (i, j) trong anh gc, chung ta tinh tng gia tri tt ca nhng im anh phia bn trai va trn so vi im (i, j): I(x) = sum sum (i, j)

Hinh 2-5: Cach tao ra anh tich phn

unsigned char** pimage; unsigned int** pintegral_image; for (unsigned int i = 0; i < height; i++) { for (unsigned int j = 0; j < width; j++) { pintegral_image[i][j] = 0; for (unsigned int y = 0; y <= i; y++) for (unsigned int x = 0; x <= j; x++) pintegral_image[i][j] += pimage[y][x]; } }
Bang 2-1: oan ma cach tao ra anh tich phn

22

2.4. Haar cascades file http://alereimondo.no-ip.org/OpenCV/34 Description & References Frontal stump 20x20gentle, 20x20tree Profile Face (20x20) Human 14x28 full body, body, David Bradley 1.0 body10.zip Pedestrian Detection 19x23 lower body, 22x18 upper body Unknown Frontal eyes) eyes (both Ref. to author Old & rights is format New Frontal eyes) eyes (both Yusuf Bediz Format cascade XML frontalEyes35x16XML.zip welcome cascade frontalEyes35x16.zip David Bradley 1.0 profileFace10.zip Face 24x24, Rainer Lienhart 1.0 frontalFace10.zip Author(s) / Licence Version Haar cascade file

Converted w/HaarConv

Modesto Right 18x12 Reference paper Eye Castrilln Santana Conditions use of 1.0 REye18x12.zip

23

Modesto Left 18x12 Reference paper Eyes 22x5 detector with 7000 samples Reference paper Eyes 45x11 detector with 7000 samples Reference paper Head 22x20 shoulders and shoulers and classifier Eye pair positive computed Eye pair positive computed Eye Castrilln Santana Conditions use Modesto Castrilln Santana Conditions use Modesto Castrilln Santana Conditions use Modesto Castrilln Santana Conditions use Modesto Mouth 25x15 Reference paper Nose Castrilln Santana Conditions use Modesto 1.0 Nose25x15.zip of 1.0 Mouth25x15.zip of 1.0 HS22x20.zip of 1.0 Eyes45x11.zip of 1.0 Eyes22x5.zip of 1.0 LEye18x12.zip

Head

Reference paper

24

Castrilln 25x15 Reference paper Eye 24x12 trained to detect one Ting Shan eye (either left or right) Wall 30x30 Reference Detect eyes one at a time 20x20 clock Message Celal ELGN shameem an 1.0 WallClock.zip 1.0 EyeTingShan.zip Santana Conditions use of

1.0

oneEye22x22.zip

2.5. C s d liu anh FERET database Cohn-Kanade database The Carnegie Mellon University Pose, Illumination, and Expression database

25

Chng 3 - XY DNG CHNG TRINH


Xy dng chng trinh vi ngn ng C++, nhng cung tham khao cach lam vic trn Matlab. Code tai http://subversion.assembla.com/svn/james_research/FXFace Yu cu h thng

Microsoft Windows XP/Vista Intel Pentium 4-Class (Intel Core2 Duo recommended) 512 MB RAM (1 GB RAM recommended) 30 MB free disk space Webcam

26

Chng 4 - TH NGHIM
4.1. B D Liu Th Nghim Danh sach nhng b d liu cho vic th nghim: 1. The Facial Recognition Technology (FERET) Database 2. CMU Face Database (CMU-MIT Frontal Face Test Set - a ground truth text including information for locations of eyes, noses, and lip centers and tips, however, it does not have locations of faces expressed by rectangle regions required by the haartraining utilities as default) 3. MIT CBCL Face Data (2,429 frontal faces with few illumination variations and pose variations) 4. BANCA Database 5. The Yale Face Database 6. Essex Face Recognition Data 7. The AR Face Database 8. The UMIST Face Database (large variations in poses) 9. The PASCAL Object Recognition Database Collection 10. 2005 OTCBVS Benchmark Dataset Collection 11. Surveillance Performance EValuation Initiative (SPEVI) 12. CMU PIE Database 13. BioID face database and the FG-NET facial expression database

27

Chng 5 - KT LUN
5.1. Kt Qua

5.2. Khuyn Nghi

28

TAI LIU THAM KHAO


Keywords: face recognition, eye localization, asm, svm, facial features extraction, locating facial feature points, facial feature point detection, face detection, adaboost, floatboost, GentleBoost, gabor filter, gabor wavelet features, biometrics, geometric invariants, xac inh thanh phn khun mt, trich chon c trng. Ting Vit [1] [2] [3] [4] Trn L Hng D (2005), Pht Hin Khun Mt Da Trn c Trng Li Lm, Lun Vn Thc S, DHKHTN TpHCM, TpHCM. [pdf] Nguyn Thnh Thi (2006), Nhn Dng Mt Ngi Dng SVM v Mng Nron, Lun Vn Thc S, DHCNTT TpHCM, TpHCM. [tm tt] Nguyn Minh c (2008), VN gy ting vang ti din n bo mt Black Hat, VNExpress.net. [chi tit] Trn Anh Tun, Nguyn Thnh Nht, Cao Minh Thnh,Trn Anh Tun, Phan Phc Don, Phm Th Bo (2006), Xc inh Cc Thnh Phn Cua Mt Ngi Trong nh Mu Da Vo Logic M, Xc Xut V Thut Ton Loang, Hi Thao tai Trng H a Lat. Ting Anh [5] [6] http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.97.4966 (Precise eye localization through a) http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.96.1821 (EMoTracker: Eyes and Mouth Tracker Based on Energy Minimization Criterion) [7] [8] [code] http://www.eee.metu.edu.tr/~alatan/PAPER/MSbahtiyar.pdf (following the paper) Zeeshan Ejaz Bhatti (2008), Face and Eyes Detection Using OpenCV

29

[9]

http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.7504 Abstract---In this paper we present a way to regard the combined face detection and facial feature extraction problem as an optimization problem. This problem is of too high computational complexity to be of practical use, and we investigate different methods to reduce the complexity. Our proposed system uses techniques known from the literature (skin colour classifcation, statistical pattern matching,...) as well as a novel method based on a deformable graph and an extended Viterbi algorithm. Keywords--- Face localization, Face detection, Facial feature extraction, Model-based coding, Mpeg-4, Face Animation

[10]

R. Beveridge, B. Draper, CSU baseline results on the FERET database, CSU face identication evaluation system 5.0, Available from http://www.cs.colostate.edu/evalfacerec/algorithms/version5/CSUBaselineRe sultsV5/index.html (2003) (khao sat vai thut toan) http://code.google.com/p/facerec/source/browse/?r=23

[11]

http://arxiv.org/PS_cache/cs/pdf/0605/0605027v1.pdf (Recognition of expression variant faces using masked log-Gabor features and Principal Component Analysis)

[12] [13] [14] [15] [16]

NIST,

FERET

evaluation,

Available

from

http://www.itl.nist.gov/iad/humanid/feret/perf/eval.html (2001). http://www.kung-foo.tv/gaborapi.html#2 (Gabor API) http://infinitesushi.com/2003/04/gabor-api-updated/ http://www.personal.rdg.ac.uk/~sir02mz/CGabor/example.html (Use Gabor Class) http://www.visagetechnologies.com/downloads.html VISION) http://eyeteck.vn/nc-ht/dieu%20khien-tu%20dong/mang%20SVM %20va%20ung%20dung%20trong%20nhan%20dang%20thanh%20phan %20hon%20hop%20khi.htm (bai v SVM) (visage|SDK

30

[17] [18] [19]

http://www.face-rec.org/algorithms/EBGM/WisFelKrue99FaceRecognition-JainBook.pdf http://www.wavelet.org http://www1.cs.columbia.edu/~jebara/htmlpapers/UTHESIS/thesis.ht ml Tony S. Jebara, 3D Pose Estimation and Normalization for Face Recognition (1995)

[20] [21] [22] [23] [24]

http://face.360degreeweb.com/ access control software based on facial recognition http://www.i-secure.sg/Products/faceITSDK.aspx FaceIt ARGUS http://identix.com/ http://pages.cs.wisc.edu/~kamenick/tomoeye/final_report.html Source code tracking eye by SVM http://faceapi.com/ (performance is very good ) http://sibgrapi.sid.inpe.br/rep-/sid.inpe.br/sibgrapi@80/2007/07.18.03. 11?languagebutton=en (trang nay search bai bao) Jos Gilvan Rodrigues Maia, Fernando de Carvalho Gomes, Osvaldo de Souza - Automatic Eye Localization in Color Images (vn rt thit thc) FaceIT SDK -

[25]

http://s.i-techonline.com/Book/Face-Recognition/ISBN978-3-90261303-5-fr20.pdf Maja Pantic and Marian Stewart Bartlett. Machine Analysis of Facial Expressions. Face Recognition, ISBN 978-3-902613-03-5, pp.558, ITech, Vienna, Austria, June 2007

[26]

http://s.i-techonline.com/Book/Face-Recognition/ISBN978-3-90261303-5-fr04.pdf Vienna, Dao-Qing Dai and Hong Yan. Wavelets and Face Austria, June 2007 Recognition. Face Recognition, ISBN 978-3-902613-03-5, pp.558, I-Tech, http://citeseerx.ist.psu.edu/viewdoc/versions;jsessionid=390272219CABAE C01CD625C09B6ABC3C?doi=10.1.1.92.4220

[27]

Paul Viola, Michael Jones (2001), Rapid Object Detection using a Boosted Cascade of Simple Features, Proceedings of Computer Vision and

31

Pattern

Recognition.

http://www-cse.ucsd.edu/classes/fa01/cse291/ViolaJones.ppt http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.6.2036 (bai c ban) [28] G. Lipori, A general-to-specific Eye Locator based on Haar wavelets and Support Vector Machines, PhD thesis, discussed in March 2007. [pdf ~ 10 Mb] [29] P. Campadelli, R. Lanzarotti and G. Lipori, Automatic facial feature extraction for face recognition, Chapter at pagg. 31-58 of the book Face Recognition, edited by Kresimir Delac and Mislav Grgic, published by ITech Education and Publishing, Vienna, July 2007. [online book] [pdf] [30] http://www.cs.colostate.edu/evalfacerec/papers/cvprip02.pdf K. Baek, B. A. Draper, J. R. Beveridge, and K. She. PCA vs. ICA: A comparison on the FERET data set, presented at Joint Conference on Information compare [31] Sciences, two subspace Durham, N.C., 2002. techniques projection

http://www.cs.colostate.edu/evalfacerec/papers.html P. Campadelli, R. Lanzarotti, G. Lipori, Eye localization: a survey, in the book of The Fundamentals of Verbal and Non-verbal Communication and the Biometrical Issue, NATO Science Series, May 2007. [pdf] [32] [33] http://face.nist.gov/Documents_Overview.htm Recognition http://frvt.org/FRGC/ Danijela Vukadinovic and Maja Pantic, Fully Automatic Facial Feature Point Detection Using Gabor Feature Based Boosted Classifiers, IEEE International Conference on Systems, 2005. [pdf] [34] Faces, http://sprg.massey.ac.nz/ivcnz/Proceedings/IVCNZ_28.pdf [35] Mkinen Erno, Face Analysis Techniques for Human-Computer Interaction, 2007-12-14 Hua Gu, Guangda Su, Cheng Du, Feature Points Extraction from 2003 Overview Face

32

http://acta.uta.fi/english/teos.phtml?11022 http://www.cs.uta.fi/~etm/research/ (Gabor, face detection, ) [36] Qiong Wang, Jingyu Yang, Eye Detection in Facial Images with Unconstrained [37] Background, 2006 JPRR_Vol1_Issue1_2006_pp55_62_2006_09_25.pdf J.Cook, V. Chandran & S.Sridharan, Multi-Scale Representation for 3D Face Recognition, in IEEE Transactions on Information Forensics ans Security: Special Issue on Human Detection and Recognition, 2007 http://jamie.homelinux.org/wp-content/uploads/2007/03/cook-07-multiscalerepresentationfor3dfacerecognition.pdf (download website) [38] ISBN Publisher: Publication [39] Avatar [40] IN-TECH, date: Vienna, December Recent Advances in Face Recognition 978-953-7619-34-3 Austria 2008 Edited by: Kresimir Delac, Mislav Grgic and Marian Stewart Bartlett

URL: http://intechweb.org/book.php?id=101 Jahmar Ignacio - Constructing Individual Facial Expressions in an from Observation (Slide tao avatar) http://ml.cecs.ucf.edu/meli/MLII/files/Ignacio_Pres_Final.pdf Fei Zuo, Peter H. N. de With (2005), Real-time Face Recognition for Smart Home Applications, International Conference on Consumer Electronics (ICCE2005), vol. 51 p. 183-190, February 2005, Las Vegas, U.S.A. [41] using 2304.pdf (c ni ly do essential in many modern multimedia applications such as face [pdf] [details] [others] (mc ch ny rt hay) http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.3603 Aysegul Gunduz, Hamid Krim (2003), Facial feature extraction topological methods, Raleigh. http://www.busim.ee.boun.edu.tr/~cdemirkir/files/FacialFeatureDetection/cr

33

recognition, facial gesture recognition, and low-bit video coding. These applications play an important role in security systems, human-computer interaction, and teleconferencing) [42] in [43] Mohammad H. Mahoor, Mohamed Abdel-Mottaleb, and A-Nasser Ansari (2006), Improved Active Shape Model for Facial Feature Extraction Color Images, Journal of multimedia, Vol. 1. www.academypublisher.com/jmm/vol01/no04/jmm01042128.pdf Mark Everingham, Josef Sivic and Andrew Zisserman, Hello! My name is... Buffy Automatic Naming of Characters in TV Video, Proceedings of the 17th British Machine Vision Conference (BMVC 2006). [pdf] [details] [44] Timo Ahonen1 (2004), Face , Abdenour Hadid1 with Local and Matti Pietikinen1 Binary Patterns, Filand. Recognition

http://www.springerlink.com/content/p5d9xp9gfkex5gk9/ (Face Recognition with Local Binary Patterns-Timo.pdf) [45] Hyun-Chul Kim1 Seok-Cheol Kee2 Korea. http://www.springerlink.com/content/4828k6141231741u/#ContactOfAuthor 4 (HWJ_07C_MIRAGE.pdf) [46] L inh Duy (2006), Human Face Processing Techniques with Application to Large Scale Video Indexing, PhD Thesis, Tokyo, Japan. [pdf] [details] [blog] [47] Rudy Adipranata, Cherry G. Ballangan, Silvia Rostianingsih, Ronald Paska Ongkodjodjo , Real-Time Human Face Tracker Using Facial Feature Extraction,Surabaya, (s dung phng phap SkinColor) Indonesia (human tracker.pdf) http://fportfolio.petra.ac.id/user_files/99-015/104.pdf , Hyoung-Joon Kim1 , Wonjun Hwang2 , and Whoi-Yul Kim1 (2007), Facial Feature Point

Extraction Using the Adaptive Mean Shape in Active Shape Model, Seoul,

34

[48]

Jeremy N. Bailensona, Emmanuel D. Pontikakis, Iris B. Mauss, James J. Gross, Maria E. Jabon, Cendri A.C. Hutchersond, Clifford Nass, Oliver John (2007), Real-time classication of evoked emotions using facial feature tracking and physiological responses, USA. [pdf]

[49] [50]

Platform for developing 3-D virtual reality worlds, web. Address at http://www.worldviz.com/products/vizard/index.html. H. K. Ekenel, H. Gao, R. Stiefelhagen, "3-D Face Recognition using Local Appearance-Based Models", IEEE Transactions on Information Forensics and Security, Vol. 2, No. 3, pp. 630-635, September 2007.

[51]

J. Stallkamp, H.K. Ekenel, R. Stiefelhagen, "Video-based Face Recognition on Real-World Data", International Conference on Computer Vision (ICCV'07), Rio de Jenario, Brasil, October 2007.

[52]

H.K. Ekenel, J. Stallkamp, H. Gao, M. Fischer, R. Stiefelhagen, "Face Recognition for Smart Interactions", International Conference on Multimedia & Expo, Beijing, China, July 2007.

[53] [54]

H.K. Ekenel, M. Fischer, R. Stiefelhagen, "Face Recognition in Smart Rooms", 4th MLMI, Brno, Czech Republic, June 2007. H.K. Ekenel, M. Fischer, Q. Jin, R. Stiefelhagen, "Multi-modal Person Identification in a Smart Environment", CVPR Biometrics Workshop, Minneapolis, USA, June 2007.

[55]

H.K. Ekenel, R. Stiefelhagen, "Local Wavelet Analysis for Face Recognition", IEEE Signal Processing and Communications Applications Conference, Eskisehir, Turkey, June 2007

[56]

H.K. Ekenel, R. Stiefelhagen, "Two-class Linear Discriminant Analysis for Face Recognition", IEEE Signal Processing and Communications Applications Conference, Antalya, Turkey, June 2007.

[57]

H.K. Ekenel, R. Stiefelhagen, "An Un-awarely Collected Real World Face Database: The ISL-Door Face Database", International Conference on Computer Vision Systems, ICVS 2007, Bielefeld, Germany, March 2007.

35

[58]

Minh Hoai Nguyen, Joan Perez, Fernando De la Torre (2008), Facial Feature Detection with Optimal Pixel Reduction SVMs, 8th IEEE Intl Conference on Automatic Face and Gesture Recognition, September 2008, Amsterdam, The Netherlands. [pdf] [details]

[59] [60]

Steve Milborrow (2007), Locating Facial Features with Active Shape Models, Masters Thesis, University of Cape Town, South Africa. [pdf] Steve Milborrow and Fred Nicolls (2007), Locating Facial Features with an Extended Active Shape Model, ECCV '08: Proceedings of the 10th European Conference on Computer Vision, 5305, 504-513, Marseille, France. [pdf] [details]

[61]

Kuang-chih Lee (2005), The Extended Yale Face Database B (web). Address http://vision.ucsd.edu/~leekc/ExtYaleDatabase/download.html http://vision.ucsd.edu/~leekc/ExtYaleDatabase/ExtYaleB.html at

[62] [63]

Daniel Riccio, Jean-Luc Dugelay (2007), Geometric invariants for 2D/3D face recognition. [pdf] Deva Ramanan, Simon Baker and Sham Kakade (2007), "Leveraging Archival Video for Building Face Datasets", International Conference on Computer Vision (ICCV), Rio de Janeiro, Brazil, Oct 2007. [pdf] [details]

[64]

Stan Z. Li, Senior Member, Zhenqiu Zhang (2004), FloatBoost Learning and Statistical Face Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence. [pdf]

[65]

Stefano Arca, Paola Campadelli, Raffaella Lanzarotti (2005), A face recognition system based on automatically determined facial fiducial points, Pattern Recognition Journal. [pdf]

[66]

P. Campadelli, R. Lanzarotti, G. Lipori, E. Salvi, Face and facial feature localization, In the Proceedings of the International Conference on Image Analysis and Processing (ICIAP 2005). [pdf]

36

[67] [68] [pdf] [69] [70]

Andrea F. Abate, Michele Nappi, Daniel Riccio, Gabriele Sabatino (2007), 2D and 3D face recognition: A survey Paul Viola, Michael J. Jones (2004), Robust real-time face detection, International Journal of Computer Vision 57(2), 137154, 2004. Jake Newman (2009), Visual-Only Language Identication, Thesis at University of East Anglia. [pdf] Roland Gocke, J Bruce Millar, Alexander Zelinsky, and Jordi RobertRibes (2000), Automatic Extraction of Lip Feature Points, Proceedings of Australian Conference on Robotics and Automation (ACRA2000) [ps, demo]

[71]

Ce Zhan, Wanqing Li, Philip Ogunbona, and Farzad Safaei (2007), Real-Time Facial Feature Point Extraction, PCM 2007 (under review)

[72]

Rainer Lienhart, Jochen Maydt (2002), An Extended Set of Haar-like Features for Rapid Object Detection, in: Proceedings of the International Conference on Image Processing, vol. 1, pp. I900I903. [pdf]

[73]

Gwen Littlewort, Marian Stewart Bartlett, Ian Fasel, Joshua Susskind, Javier Movellan (2004), Dynamics of facial expression extracted automatically from video, cvprw, vol. 5, pp.80, 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 5, 2004.

You might also like