You are on page 1of 113

B GIO DC V O TO TRNG I HC BCH KHOA H NI

----------------o0o---------------

H NI 2006

NGUYN TH KIM TUYN CNG NGH THNG TIN 2004-2006

LUN VN THC S KHOA HC


NGNH: CNG NGH THNG TIN

NGHIN CU TNG QUAN V TNH TON LI V CI T M HNH TH NGHIM

NGUYN TH KIM TUYN

H NI 10-2006

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

LI CM N
Trong qu trnh tm hiu nghin cu hon thnh lun vn, ti gp khng t kh khn, nhng nhng lc nh vy, ti lun nhn c s ng vin, khch l ca thy gio, TS. Nguyn Kim Khnh. Thy gip ti rt nhiu trong qu trnh nghin cu, hng dn tn tnh trong cch thc v phng php nghin cu khoa hc cng nh h tr ti trong vic tm ti liu. c c nhng kt qu trong lun vn ny, ti xin gi li cm n su sc n thy gio, TS. Nguyn Kim Khnh khoa Cng ngh thng tin trng HBKHN. Ti cng xin gi li cm n n cc thy c v cc bn trn Trung tm tnh ton hiu nng cao, trng HBKHN. Cui cng ti xin cm n ng nghip, gia nh v cc bn ca ti nhng ngi lun bn cnh, ng vin v khch l ti c c kt qu nh ngy hm nay.

H Ni, ngy 05 thng 10 nm 2006 Tc gi

Nguyn Th Kim Tuyn


Lp Cao hc CNTT 2004-2006

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 1

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

LI CAM OAN
Ti l Nguyn Th Kim Tuyn, hc vin lp cao hc kho 2004-2006, chuyn ngnh Cng ngh thng tin. Ti xin cam oan bi lun vn "Nghin cu tng quan v tnh ton li v ci t m hnh th nghim" l do ti nghin cu, tm hiu di s hng dn ca thy gio TS. Nguyn Kim Khnh, khng phi s sao chp ca ngi khc. Ti xin chu trch nhim v li cam oan ny.

H Ni, ngy 05 thng 10 nm 2006 Tc gi

Nguyn Th Kim Tuyn


Lp Cao hc CNTT 2004-2006

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 2

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

MC LC
LI CM N ..........................................................................................................1 LI CAM OAN ....................................................................................................2 MC LC................................................................................................................3 DANH MC THUT NG ...................................................................................6 DANH MC HNH .................................................................................................8 LI NI U .........................................................................................................9 CHNG 1. TNH TON LI........................................................................10

1.1 Tng quan v Tnh ton li .............................................................. 10


1.1.1 Tnh ton li l g?..........................................................................10 1.1.2 So snh vi cc m hnh, cng ngh khc ........................................12 1.1.3 Phn loi mng li ..........................................................................13

1.2 Kin trc v thnh phn chnh ca h thng li............................... 15


1.2.1 Tng quan kin trc li...................................................................15 1.2.2 Cc thnh phn theo m hnh chc nng ..........................................18 1.2.3 Cc thnh phn theo m hnh vt l..................................................19

1.3. Cc chun cho tnh ton li ............................................................. 19


1.3.1 OGSA/OGSI l g? ...........................................................................20 1.3.2 Chun OGSI .....................................................................................20 1.3.3 Chun OGSA....................................................................................22

1.4 Cc thnh phn chnh trong m hnh chc nng ca li .................. 24


1.4.1 Bo mt.............................................................................................24
1.4.1.1 C ch bo mt trong mi trng li..............................................25 1.4.1.2 Cc chnh sch bo mt trong mi trng li.................................25 1.4.1.3 H tng an ninh mng li GSI (Grid Security Infrastructure) ........26

1.4.2 Qun l ti nguyn li ....................................................................27


1.4.2.1 Nhng thch thc trong qun l ti nguyn li ..............................27 1.4.2.2 H qun tr ti nguyn GRAM ..........................................................29

1.4.3 Qun l d liu .................................................................................30


1.4.3.1 Giao thc truyn tp tin mng li GridFTP ....................................30 1.4.3.2 Dch v nh v bn sao RLS ............................................................33

1.4.4 Lp lch trong mi trng li .........................................................36 1.4.5 Grid Portal ........................................................................................38


1.4.5.1 Cc yu cu i vi Grid Portal ........................................................39 1.4.5.2 Chuyn ti cc Job trong Grid Portal ................................................39
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 3

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

1.4.6 Gim st li ....................................................................................40


1.4.6.1 Quy trnh gim st.............................................................................41 1.4.6.2 Yu cu i vi mt h thng gim st li .....................................41 1.4.6.3 Kin trc b gim st li GMA (Grid Monitoring Architecture) ...42 1.4.6.4 Phn loi cc h thng gim st li................................................43

1.5 Kt chng.......................................................................................... 44
CHNG 2. TNG QUAN V GLOBUS .........................................................45

2.1 Tng quan kin trc chung ca GT .................................................... 45


2.1.1 Cc chc nng chnh ca GT ............................................................45 2.1.2 Cc c trng ca GT4 .....................................................................46 2.1.3 Tm lc v kin trc ca GT4 ........................................................48

2.2 Kin trc hng dch v ..................................................................... 48


2.2.1 GT4, cc h thng phn tn, cc dch v Web ..................................48 2.2.2 C s h tng v ng dng hng dch v........................................49 2.2.3 Kin trc hng dch v (Service Oriented Architecture-SOA) .......50

2.3 Kin trc GT4 ..................................................................................... 51


2.3.1 Kin trc tng quan ..........................................................................51 2.3.2 Trin khai dch v Web trn GT4 .....................................................53

2.4 Qun l thc thi trong GT4 ................................................................ 54


2.4.1 Tng quan v GT4 GRAM ...............................................................55 2.4.2 Lnh globusrun-ws ...........................................................................56 2.4.3 Cch thc hot ng ca GT4 GRAM..............................................60 2.4.4 Cu hnh v qun tr GT4 GRAM.....................................................62

2.5 Qun l d liu trong GT4 ................................................................. 63


2.5.1 Tng quan v qun l d liu trong GT4 ..........................................63 2.5.2 Di chuyn d liu..............................................................................63 2.5.3 To bn sao d liu...........................................................................64

2.6 Theo di v pht hin ......................................................................... 65


2.6.1 H thng theo di v pht hin - MDS4 ...........................................65 2.6.2 B gp (aggregator) v ngun thng tin ...........................................66 2.6.3 Ngun thng tin v vic ng k ......................................................67

2.7 Kt chng.......................................................................................... 67
CHNG 3. CC K THUT LI HIN C TRIN KHAI VIT NAM .......................................................................................................................68

3.1 Desktop Grids ..................................................................................... 68


3.1.1 Tnh ton phn tn trong cc x nghip.............................................68
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 4

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

3.1.2 nh ngha Desktop Grid..................................................................69 3.1.3 Gi tr ca li Desktop Grid ...........................................................70 3.1.4 Cc phn t k thut chnh ...............................................................70 3.1.5 Cc kha cnh thc t cn xem xt....................................................72 3.1.6 Grid Server .......................................................................................73

3.2 Cluster Grids....................................................................................... 74


3.2.1 Kin trc li Cluster .......................................................................74 3.2.2 B phn mm li cluster ca Sun ...................................................75 3.2.3 Yu cu thit k ................................................................................78 3.2.4 Phn cng mng ...............................................................................79 3.2.5 Qun l mt Cluster Grid.................................................................80

3.3 Kt ni Cluster vo Grid..................................................................... 81


3.3.1 S cn thit ca vic kt ni grid v cluster......................................82 3.3.2 Kt ni Globus-based Grid v PBS-based Cluster ............................82
3.3.2.1 GRAM...............................................................................................82 3.3.2.2 PBS....................................................................................................83 3.3.2.3 Cc yu cu i vi thnh phn kt ni ............................................87

3.4 Kt chng.......................................................................................... 90
CHNG 4. TRIN KHAI TH NGHIM......................................................91

4.1 Lp bn thit k kin trc li............................................................ 91 4.2 Ci t mt Grid ................................................................................. 95


4.2.1 Cu hnh phn cng ca li ............................................................95 4.2.2 Yu cu trc khi ci t ..................................................................96 4.2.3 Ci t cho nt chnh ........................................................................96 4.2.4 Ci t cc nt tnh ton .................................................................101 4.2.5 ng b thi gian gia cc nt trong li ......................................102 4.2.6 Cu hnh cc dch v mc li .......................................................103

4.3 Kt ni mt Cluster vo Grid ........................................................... 105


4.3.1 Cu hnh phn cng ........................................................................105 4.3.2 Cu hnh cluster-based PBS............................................................106 4.3.3 Cu hnh li da trn GT ..............................................................108

4.4 Kt chng........................................................................................ 109


KT LUN V KIN NGH.............................................................................110 TI LIU THAM KHO...................................................................................111

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 5

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

DANH MC THUT NG
Vit tt Ch gii Tnh ton li B cng c middleware h tr tnh ton li, cung cp mt s dch v trnh cng vic, qun l ti nguyn, h tng bo mt, cng nh h tr vic xy dng cc dch v li Web Service Dch v web mt kin trc pht trin bi W3C nhm cung cp cc chc nng cho ngi dng t xa Application Programming Giao din lp trnh ng dng, thng l mt tp cc hm Interface gip lp trnh vin d dng tng tc vi dch v hoc h thng. Community Dch v chng thc cng ng. Mt dch v bo mt Authorization Service trong mi trng li cho php dung ha gia chnh sch s dng ti nguyn ca cng ng ngi dng vi chnh sch s dng ti nguyn ca nhng nh cung cp Distributed Parallel H thng lu tr song song phn tn: k thut t chc Storage System mt tp cc a cng nm trn cc server kt ni vi nhau qua mng din rng, cung cp kh nng truy cp mc khi logic n nhng b d liu ln. Data Transfer Process Tin trnh qun l vic truy cp d liu thc s v truyn qua knh d liu trong kin trc GridFTP Grid Resource Allocation Dch v qun l v nh v ti nguyn li and Management Service Grid Telecontrol Protocol Giao thc iu khin li t xa File Transfer Protocol Giao thc truyn tp ni ting qua mng Globus eXtensible Giao din vo ra mc thp trong kin trc Globus Input/Ouput Grid Monitoring H thng gin st li Architecture Grid File Transfer GridFTP l m rng ca giao thc FTP, tch hp kh Protocol nng bo mt li, truyn d liu tt hn so vi FTP Grid Security C s h tng bo mt li trong kin trc ca Globus, Infrastructure h tr giy chng nhn theo chun X509 v dng h m cng khai. High Performance H thng qun l hiu qu hng trm terabyte ti Storage System petabyte c lu trn cng hoc bng t, liu thng xuyn c s dng s c lu trn a cng, cn d liu c tn sut s dng t hn s c lu trn bng t. Hypertext Transfer Giao thc truyn siu vn bn, c s dng truyn Protocol thng tin t cc my phc v www n cc trnh duyt. Lightweight Directory Giao thc c t cc k thut nh danh i tng, m Access Protocol hnh d liu, tm kim v ghi cc khon mc d liu. Logical File Name Tn logic ca mt thc th d liu trong li d liu, hm cha ni dung ca thc th d liu . Local Replica Catalogue Catalog nh v bn sao a phng, lu tr tp cc nh x bao gm hai trng: {tn logic ca thc th d liu, v tr vt l c th ca thc th } Trang 6 Tn y Grid Computing Globus Toolkit

API CAS

DPSS

DTP GRAM GTCP FTP Globus XIO GMA GridFTP GSI HPSS

HTTP LDAP LFN LRC

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

MCS MDS MPI OGSA OGSI PBS

Metadata Catalog Service Monitoring and Discovery Service Message Passing Interface Open Grid Service Architecture Open Grid Service Infrastructure Protable Batch System

Dch v siu d liu ca kin trc li d liu Globus, cho php gn cc i tng d liu vi mt s thuc tnh m t. Dch v theo di v nh v ti nguyn Giao din truyn thng ip, cch thc trao i thng tin gia cc tin trnh. Kin trc dch v li, nh ngha cc giao din chun v c ch hot ng ca dch v li H tng dch v li m L h thng phn ti v qun l ti nguyn rt mnh. c s dng ph bin trong cc h thng tnh ton song song. Cung cp kh nng khi to v lp lch cho vic thc thi v sp xp cc cng vic trn my trm. B thng dch giao thc, c nhim v qun l cc knh iu khin trong kin trc GridFTP. Dch v truyn file tin cy Lu cc thng tin ch mc cho dch v nh v bn sao, mi bn ghi bao gm {LFN, v con tr ti LRC tng ng} Dch v nh v bn sao trong kin trc li d liu Globus, cho php xc nh v tr ca cc bn sao ca thc th d liu trong li. Gi phng thc t xa, s dng trong Java khi chy RMI, cc i tng trong Java c th gi cc phng thc ca cc i tng xa ang chy trn mt my o khc. Ngn ng c t ti nguyn Giao thc t my phc v n my phc v h tr phn tn th in t Kin trc hng dch v Giao thc truy cp i tng t xa n gin, dng trong xy dng cc dch v web, lp tnh phn tn. Tp cng c h tr pht trin phn mm B mi gii ti nguyn lu tr: thc hin vic mi gii gia cc ng dng v cc ch ti nguyn lu tr, xc nh ti nguyn ph hp nht cho ng dng. Giao thc bo mt li Mi trng ngi dng Cc t chc o Ngn ng c t dch v web Framework a ra bi GT4 h tr kin trc lp trnh mi. L mt cch thc linh ng to ra cc nh dng thng tin v chia s c nh dng v d liu trn web Trang 7

PI RFT RLI RLS RMI

Protocol Interperter Reliable File Transfer Service Replica Location Index Replica Location Service Remote Method Invocation Resource Specification Language Simple Mail Transfer Protocol Service Oriented Architecture Simple Object Access Protocol Software Development Kit Storage Resource Broker Secure Socket Layer User Host Environment Virtual Organizations Web Service Deployment Descriptor Web Service Protocol Extensible Markup Language

RSL SMTP SOA SOAP SDK SRB SSL UHE VO WSDD/ WSDL WSRF XML

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

DANH MC HNH
Hnh 1-1 S pht trin t Networking n Grid Computing ...............................................11 Hnh 1-2 Kin trc phn tng li.......................................................................................15 Hnh 1-3 Cc thnh phn theo m hnh chc nng..............................................................18 Hnh 1-4 Mi quan h gia OGSA v OGSI.......................................................................21 Hnh 1-5 Bo mt mc giao vn..........................................................................................26 Hnh 1-6 Bo mt mc thng ip.......................................................................................26 Hnh 1-7 M hnh thng lng ti nguyn li ................................................................29 Hnh 1-8 Kin trc ca GridFTP .........................................................................................31 Hnh 1-9 ng ng truyn d liu DTP ............................................................................32 Hnh 1-10 Cng in t GridPortal .....................................................................................38 Hnh 1-11 Cc thnh phn ca kin trc GMA ...................................................................42 Hnh 2-1 Cc thnh phn trong kin trc dch v Web .......................................................49 Hnh 2-2 Cc thnh phn chc nng chnh trong ci t dch v Web ...............................49 Hnh 2-3 Cc thnh phn trong kin trc GT4 ....................................................................51 Hnh 2-4 GT4 Container tch hp cc dch v v cng c ..................................................53 Hnh 2-5 Bn cu hnh GT4 container.................................................................................53 Hnh 2-6 S dch chuyn trng thi .....................................................................................58 Hnh 2-7 Kin trc GRAM ..................................................................................................60 Hnh 3-1 Xp hng ca cc Desktop Grid trn 500 siu my tnh hng u.......................70 Hnh 3-2 Ba lp trong kin trc li cluster........................................................................74 Hnh 3-3 Ngn xp phn mm ca li Sun Cluster Grid...................................................75 Hnh 3-4 Lung cng vic trong Sun Grid Engine..............................................................76 Hnh 3-5 Cc thnh phn trong PBS....................................................................................85 Hnh 3-6 C ch hot ng ca PBS ...................................................................................86 Hnh 3-7 Hot ng ca Globus Scheduler Pbs ..................................................................90 Hnh 4-1 M hnh li c trin khai th nghim ............................................................96 Hnh 4-2 Topo mng trin khai kt ni Cluster vo li...................................................105

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 8

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

LI NI U
Trong nhng nm gn y rt nhiu thit b phn cng mnh phc v cho yu cu tnh ton hiu nng cao c to ra. Nhng, do nhu cu ca con ngi l khng gii hn nn h lun thy l cha , v th tnh ton li ra i nhm p ng nhu cu ny. Tuy nhin im chnh yu ca li khng phi l sc mnh tnh ton m l tnh thc tin, tnh thc tin ny th hin ch cc li tnh ton thng c to ra da trn vic tn dng cc ngun ti nguyn bnh thng, sn c m khng cn phi mua hoc to ra mt h tng phn cng mi. Do , tnh ton li ni ln nh mt phng tin tp hp ti nguyn tnh ton chi ph thp gii quyt nhng bi ton ln. Vit Nam, cng ngh li cn kh mi, ch c trin khai mt s t cc trung tm tnh ton ti vin nghin cu hoc cc trng i hc chuyn ngnh ln. V th, lun vn c vit vi mc ch nhm nghin cu l thuyt v tnh ton li, h tng cn thit cho li, xy dng mt mi trng tnh ton li phc v cho nhu cu nghin cu cc lnh vc chuyn su v li sau ny. Lun vn c chia lm 5 chng vi ni dung c th nh sau: Chng 1. Tng quan v tnh ton li, trnh by nhng vn chung nht v tnh ton li nh nh ngha, kin trc, cc thnh phn chnh Chng 2. Tng quan v Globus, trnh by chi tit v thnh phn nn tng ca tnh ton li l b cng c Globus Toolkit. Chng 3. Cc k thut li hin ang c trin khai nc ta, trnh by cc k thut li ang c mt s trung tm tnh ton nc ta trin khai nh Desktop Grid, Cluster Grid v Kt ni mt Cluster vo mt Grid thng qua PBS. Chng 4. Xy dng li th nghim, trnh by cc bc c bn thit k mt li. Ci t th nghim mt li ng thi thc hin kt ni mt cluster vo li. Phn Kt lun, trnh by tm tt kt qu t c v hng pht trin tip theo ca lun vn.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 9

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

CHNG 1. TNH TON LI


1.1 Tng quan v Tnh ton li
1.1.1 Tnh ton li l g?
nh ngha T trc n nay, mi t chc, c nhn ty theo cch quan nim v thc t xy dng h thng ca mnh m a ra nhng nh ngha khc nhau v li. Di quan im c nhn ca I.Foster v cc ng nghip th "Mt li l mt h thng c cc c trng nh ti nguyn c iu phi mt cch phi tp trung; s dng cc giao thc chun, m v a nng; cung cp cht lng dch v khng tm thng" - I. Fosters Three-Point Checklist (HPCWIRE - 22.07.2002) Cn di quan im ca mt s cng ty v lin minh pht trin li uy tn trn th gii th tnh ton li c nh ngha nh sau [2]: nh ngha ca Oracle: tnh ton li l vic lin kt nhiu my ch v thit b lu tr thnh mt siu my tnh nhm ti u ha c tnh u vit ca cc h thng my ch cng nh h thng ng dng, nh gim thiu n mc thp nht chi ph. nh ngha ca IBM: tnh ton li l mt mi trng tnh ton o. Mi trng ny cho php b tr song song, linh hot, chia s, tuyn la, tp hp cc ngun ti nguyn hn hp v mt a l, ty theo mc sn sng, hiu sut, chi ph ca cc ti nguyn tnh ton v yu cu v cht lng dch v ca ngi s dng. nh ngha ca lin minh in ton li: mi trng tnh ton li c hiu nh mt h tng kt ni h thng my tnh, h thng mng, h thng c s d liu c s hu v qun l bi nhiu t chc, c nhn nhm cung cp mi trng tnh ton o duy nht vi hiu nng cao cho ngi s dng. c ci nhn ton din v tnh ton li, lun vn xin php khng a ra mt nh ngha c th no, thay vo chng ta s xem xt khi nim tnh ton li theo mt s c im chung sau:
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 10

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Kch thc ln: theo s lng ti nguyn v khong cch a l gia chng Phn tn: c tr ng k trong truyn d liu, ti nguyn tri di trn cc vng a l khc nhau ng: cc ti nguyn c th thay i khi ng dng ang c thc hin Hn tp: kin trc v tnh cht ca cc nt li c th l hon ton khc nhau. Ti nguyn li c th l cc my n hoc mng con khc nhau Vt qua phm vi mt t chc: c nhiu trm v cc chnh sch truy nhp c th khc nhau trn cc trm, tng th li s to ra mt t chc o thng nht C ch v chnh sch an ton bo mt phc tp. C ch qun l ti nguyn a dng, phc tp Lch s ra i Qu trnh ra i ca tnh ton li c th tm tt nh sau:

Hnh 1-1 S pht trin t Networking n Grid Computing

C s h tng truyn thng ca tnh ton li l mng Internet. Sau ny WWW ra i cui nhng nm 80 to ra mt cuc cch mng trong qu trnh tnh ton v chia s thng tin trn mng. Giao thc HTTP cng vi trnh duyt ngn ng HTML cung cp mt c ch cho php lin kt cc vn bn v truy cp cc thng tin trc tuyn, d dng v bt k u. Cng ngh XML c ra i vo nhng nm 1994, y l mt chun mi cho qu trnh trao i thng tin trn mng. Cng ngh quang hc cung cp cc dch v v kh nng kt ni nhanh, rng vi chi ph hp l. Tc mng gia nhng nm 1990 ln n 2.4Bbps. Thng lng ln l nhn t chnh thc y hnh thnh cng ngh tnh ton li. Vo nhng nm 1990, khi cc my tnh, my trm v mng tc cao cng vi cc thit b hin i khc ra i lm ny sinh nhu cu gp nhm cc my tnh thnh cc cluster dng cho cng vic tnh ton tc cao. S xut hin ca cluster
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 11

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

chnh l mm mng ca h thng tnh ton ngang hng v cc li tnh ton sau ny. Li d liu c bt u nm 1999 vi Globus Tookit 2.0+. Giai on tip theo vi s ra i ca kin trc dch v li m rng (OGSA) vo nm 2001 v sn phm Globus Tookit 3.0. Giai on hin ti tnh t nhng nm 2003 n nay, c nh gi bi cc c gng chun ha cng ngh v giao thc tnh ton. Li ch ca Tnh ton li Cc li ch m tnh ton li mang li [2] bao gm: Kh nng khai thc cc ti nguyn nhn ri Cung cp kh nng x l song song Gip hp tc gia cc t chc Gip truy nhp ti nguyn khc nh ng truyn v cc phn mm t tin Gip cn bng trong s dng ti nguyn Mang li tin cy cao Phm vi ng dng Tnh ton li thng c s dng gii quyt cc bi ton khoa hc i hi kh nng tnh ton v thng lng cao nh m phng, thit k vi mch, chia s ni dung, truy nhp/thu cc phn mm/dch v t xa. Hoc cc bi ton i hi d liu ln, thi gian thc, phc v theo yu cu v cc bi ton tnh ton cng tc nh thit k cng tc, khai ph d liu, gio dc in t

1.1.2 So snh vi cc m hnh, cng ngh khc


Khi so vi cluster, ngi ta thy tng trung gian ca c hai u a ra c ch truyn thng ip cho cc ng dng song song. Nh vy, v bn cht kin trc mc cao ca mt cluster l tng t nh ca mt li. Tuy nhin gia chng vn c nhng im khc bit quan trng nh: V mt phm vi, Clusters c t trong mt phng hoc mt to nh cn Grids li c th c t phn tn trn trn cc vng a l khc nhau Clusters c mt h qun tr duy nht, ngc li Grids c cc bin qun tr ko di
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 12

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Clusters ch ch tm n cc bi ton chuyn v tnh ton hiu nng. Cn Grids gii quyt cc bi ton tnh ton phn tn v chia s ti nguyn Clusters thng bao gm cc h thng con ng nht cn Grids li l s kt hp ca nhiu dng h thng con khc nhau Clusters thng c lng ti nguyn c nh nhng ti nguyn trn Grids thay i theo yu cu thc t v kh nng dng c ca ti nguyn Tuy c im khc bit nhng vn c th coi tnh ton phn cm nh mt trng hp c bit ca tnh ton li. Trn thc t Grid khng phi l mt cuc cch mng mi m thc cht n l bc tin trong cng ngh in ton phn tn. Ging nh Web, tnh ton li gim bt phc tp khi nhiu ngi cng khai thc mt nn hot ng thng nht n ch khc s h tr lin lc. Khi so vi mng ngang hng (P2P) th tnh ton li cng cho php ngi s dng chia s file v nhiu loi ti nguyn khc na. Ni chung, vi cc cng ngh o khc, grid ging ch cho php o ha cc ngun nhn lc CNTT nhng khc bit l grid cho php o ha nhng ngun ti nguyn tn mt v v cng rng ln.

1.1.3 Phn loi mng li


Li thng c ci t nhiu dng khc theo ng dng c th hoc theo cu trc ca t chc o tham gia li hoc theo tnh cht ca ti nguyn c chia s. Sau y l mt s dng li [3]: Departmental Grids Cluster Grids: gm mt hoc nhiu h thng kt hp li nhm cung cp mt im truy xut n cho ngi dng. N ph hp cho cc ng dng yu cu hiu nng tnh ton v bng thng ln
-

Infra Grids: do IBM xut nhm nh ngha mt li ti u vic s dng ti nguyn trong mt x nghip

Enterprise Grids Enterprise Grids: c trin khai trong cc cng ty ln c chi nhnh nhiu ni trn th gii c nhu cu chia s ti nguyn

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 13

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Intra Grids: ti nguyn chia s trong cc nhm khc nhau ca mt x nghip to thnh mt intra grid Campus Grids: cho php nhiu d n hoc nhiu b phn chia s ti nguyn tnh ton theo hng cng tc.

Extraprise Grids -

Extra Grids: cho php chia s ti nguyn vi cc i tc bn ngoi. Lin kt gia cc t chc ny c thit lp bng cc dch v tin cy. Partner Grids: l cc mng li gia cc t chc, cng ty, x nghip trong cng lnh vc m c nhu cu cng tc nhm thc hin d n chung

Global Grids -

Global Grids: cho php ngi dng khai thc cc ti nguyn bn ngoi. N cung cp sc mnh ca cc ti nguyn phn tn trn khp th gii. Inter Grids: cung cp kh nng chia s ti nguyn v lu tr thng qua Web Compute grids c thnh lp nhm mc tiu chia s ti nguyn tnh ton.

Compute Grids Desktop Grids: y l mng li tp hp sc mnh tnh ton ca cc my tnh bn. Cc mng li thuc dng ny ch yu chy trn nn Windows Server Grids: mt s tp on ln thng c cc ti nguyn my ch c qun l bi b phn IT nm ti nhiu a im khc nhau. H mun to Server Grids chia s nhng ti nguyn t gi ny
-

High-Performance/Cluster Grids: mng li gm cc h thng dnh ring cho tnh ton nh cc siu my tnh hoc nhm my tnh hiu nng cao

Data Grids Ti nguyn ch yu c chia s trong mng li ny l d liu. N c xy dng nhm ti u cc thao tc hng d liu nh lu tr, truy xut, trao i. Utility Grids Utility grids l mng li bao gm cc ti nguyn tnh ton thng mi m c duy tr, qun l bi cc nh cung cp dch v.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 14

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Service Grids: cung cp truy xut ti cc ti nguyn m c th c mua bi mt t chc

1.2 Kin trc v thnh phn chnh ca h thng li


1.2.1 Tng quan kin trc li
Li c xy dng trn nn tng kin trc m v phn tng. Trong mi tng ca li, cc thnh phn chia s nhng thuc tnh chung v c b sung nhng tnh nng mi m khng lm nh hng n cc tng khc [4]. Ta c th tng hp kin trc li thnh 5 tng nh sau:

Hnh 1-2 Kin trc phn tng li

Tng ch tc (Fabric layer) Bao gm cc ti nguyn cc b phn tn trn mng, chng b rng buc bi C ch qun l ti nguyn v C ch thm tra. Ngi ta phn ti nguyn ca tng ny thnh cc nhm chnh sau: Ti nguyn tnh ton: cho php kim sot, iu khin vic thc thi cng vic Ti nguyn lu tr: dng ly v/ti ln cc tp tin, cho php c mt phn tp tin hoc chn lc d liu t tp tin xa Ti nguyn mng: l mi trng mng truyn thng Cc kho m ngun: l ni qun l tt c cc loi ti nguyn v cc phin bn ca m ngun Tng kt ni (Connectivity layer)

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 15

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Tng ny ng vai tr rt quan trng, n gm cc giao thc xc thc v truyn thng. Truyn thng bao gm vic truyn thng tin, nh tuyn v t tn. Nhng giao thc ny tng t cc giao thc IP, TCP, UDP trong b giao thc TCP/IP v cc giao thc tng ng dng nh DNS, OSPF, RSVPVn bo mt c gii quyt bng gii php xc thc nh: C ch ng nhp mt ln (Single Sign On): cho php ngi dng ch cn ng nhp vo mng li mt ln duy nht cho tt c cc truy cp cc ti nguyn c php trong tng ch tc cho n khi kt thc ng nhp C ch y quyn (Delegation, Proxy): ngi dng c th y quyn truy cp ti nguyn hp php li cho mt chng trnh trong mt khong thi gian xc nh. Chng trnh ny cng c th y quyn c iu kin mt phn cc tp quyn ca n cho chng trnh con khc C ch tch hp a gii php bo mt a phng (Integration with various local security solutions): c ch bo mt mng li phi c kh nng giao tip trong vi cc c ch bo mt a phng m khng yu cu thay th ton b cc gii php bo mt hin c, nhng cn c c ch nh x bo mt trong cc mi trng cc b khc nhau C ch quan h tin tng da trn ngi dng (User-based Trust Relationships): ngi dng c th s dng cc loi ti nguyn c c t s kt hp ca nhiu nh cung cp khc nhau Tng ti nguyn (Resource layer) Tng ny c xy dng trn nn tng sn c ca tng kt ni. Nhng giao thc trong tng ti ny s gi cc chc nng trong tng ch tc truy cp v s dng cc loi ti nguyn cc b. N c hai loi giao thc chnh: Giao thc thng tin (Information protocol): cho php ly cc thng tin v cu trc, tnh trng ca mt loi ti nguyn no trong mng li Giao thc qun l (Management protocol): dng sp xp qun l th t cc truy cp n cc ti nguyn c chia s Tng kt hp (Collective layer)
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 16

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Trong khi tng ti nguyn ch cho php truy cp n mt loi ti nguyn n th tng kt hp li cha cc giao thc v dch v cho php giao tip gia cc ti nguyn trong mng li. Tng ny bao gm cc dch v chnh nh sau: Cc dch v th mc (Directory Services): cho php tm hiu s tn ti cng nh thuc tnh ca cc nguyn nh loi ti nguyn, tnh kh dng Cc dch v cp pht chung, lp lch, mi gii (Co-allocation, Scheduling & Broker Services): cho php gi yu cu n mt hay nhiu my ch cho mc ch cp pht, lp lch v mi gii ti nguyn tng ng Cc dch v gim st v d bo (Monitoring and Diagnostic Services): cho php h thng h tr kim sot ti nguyn trong li Cc dch v nhn bn d liu (Data Replication Services): h tr vic qun l lu tr ti nguyn, gip vic truy cp ti nguyn li tr nn d dng hn Cc h h thng tr lp trnh li (Grid-enable Programming Systems): gm cc th vin lp trnh H thng qun l ti v mi trng cng tc (Workload Management System & Collaboration Framework): cung cp cc c t, qun l tnh ng b, a lung, a thnh phn trong cc tin trnh tnh ton Dch v tm kin phn mm (Software Discovery Service): h tr tm kim v la chn phn mm ci t v lm nn tng cho mng li Tng ng dng (Application layer) Tng ny bao gm cc ng dng c pht trin trn mi trng li nh: cc ng dng sinh hc, vy l, thin vn, ti chnh V nguyn tc, ngi s dng c th tng tc vi li thng qua tng ng dng mt cch trong sut m khng nhn bit c s c mt ca cc tng khc trong li.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 17

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

1.2.2 Cc thnh phn theo m hnh chc nng

Hnh 1-3 Cc thnh phn theo m hnh chc nng

Nhn t hnh trn ta thy v mt chc nng th li gm cc thnh phn sau: 1. Cng tng tc (Grid portal): l mt giao din cho php ngi dng s dng cc ng dng li, do li tr nn trong sut vi ngi dng. 2. Thnh phn bo mt (Security): l c ch m bo cc hot ng nh xc thc, cp quyn, bo mt-ton vn d liu v tnh sn sng ca d liu. 3. Chc nng an ninh nt (Node Security Function): chc nng ny chu trnh nhim xc thc v bo mt cho tng nt trong qu trnh giao tip gia n v cc thnh phn khc bn trong mng li. N ph thuc vo h iu hnh v cc h thng li c th, thng thy l c ch cp chng ch quyn truy cp. 4. B lp lch (Scheduler): l phn phi hp qu trnh thc thi ca nhiu cng vic song song. n gin, ngi s dng c th chn nt thch hp chy tc v, sau ch vic kch hot lnh nh tuyn cng vic ti nt chn. 5. Thnh phn mi gii (Broker): sau khi ngi dng c xc nhn quyn gia nhp vo mng li bi thnh phn an ninh nt, thnh phn ny s ch r ng dng ca ngi dng c s dng ti nguyn no v m bo ti nguyn c sn sng s dng theo tham s truyn vo. 6. Qun l, phn b ti nguyn (grid resource allocation manager, GRAM): cung cp dch v kch hot tng cng vic trn tng ti nguyn c th; kim tra trng thi cng vic; c kt qu khi cng vic kt thc. Cc thng tin ca thnh phn ny sau s c b lp lch s dng.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 18

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

7. Ti nguyn (Resource): ti nguyn li bao gm b x l, b lu tr, cc ng dng v cc thnh phn. 8. Qun l d liu (Data management): d liu c th nm ti nguyn, hoc l kt qu thc thi ca mt tc v no . Thnh phn qun l d liu phi m bo an ton v n nh trong qu trnh di chuyn d liu gia cc li. 9. Giao thc (Protocol): l thnh phn m bo lin kt cc thnh phn chc nng k trn c th hot ng v tng tc c vi nhau trong mng li.

1.2.3 Cc thnh phn theo m hnh vt l


Da trn ti liu [3], cc thnh phn ca li theo m hnh vt l bao gm: 1. Thnh phn mng (Networks): mng ng vai tr l c s h tng truyn s liu v cc thng tin gim st cng vic gia cc im trong mng li. Bng thng mng l mt thuc tnh rt quan trng lin quan n hiu sut li. 2. Thnh phn tnh ton (Computation): c cung cp bi cc b x l trong li, chng a dng v tc , kin trc, nn tng phn mm v lu tr. 3. Thnh phn lu tr (Storage): d liu c th c lu tr phn tn trn nhiu thit b x l hoc mt mng SAN. Mi b x l thng cung cp mt dung lng lu tr nht nh. H thng file thng c dng l NFS, DFS hoc GPFS 4. Phn mm v bn quyn (Software and License): v phng din phn mm trong mi trng tnh ton li th mc n nh ca ng dng phn mm v bn quyn phn mm l hai vt cn c quan tm nht. 5. Cc thit b c bit: mt vi nt trn li c th c nhng thit b c bit, chng hn cc thit b qun s, y t, hay cc thit b chuyn dng khc.

1.3. Cc chun cho tnh ton li


Li tnh ton thng gm mt tp cc ti nguyn khng ng nht. Mt ng dng li thng c nhiu thnh phn, dch v khc nhau. ng thi cc dch v ny li thng xuyn tng tc vi nhau. Cng nhiu dch v th s tng tc gia chng cng tng v rt d dn n tnh trng hn lon. Nu mi dch v s dng mt cch ring tng tc vi cc dch v khc th vn giao tip gia cc
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 19

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

dch v li s rt phc tp. Do , cn thit l phi c chun nh ngha giao din giao tip chung cho cc dch v ny.

1.3.1 OGSA/OGSI l g?
OGSA v OGSI [2] c Global Grid Forum (GGF) pht trin cho mc tiu chun ho. GGF nh ngha cc chun mng li trong phm vi cc trnh ng dng, cc m hnh lp trnh, qun tr d liu, bo mt, thc thi, lp lch v qun l ti nguyn. OGSA (Open Grid Services Architecture): nh ngha mt chun kin trc mi cho cc ng dng chy trn li. OGSA nh ngha dch v li l g, chng c kh nng g, v da trn nn cng ngh no. Nhng OGSA khng a ra c t chi tit v k thut cn trin khai mt dch v li. OGSI (Open Grid Services Infrastructure): nng v c t k thut cho cc khi nim c a ra trong OGSA. OGSI nh ngha cc c ch to mi, qun tr v trao i thng tin gia cc dch v li. Mt dch v li l dch v web thch ng vi tp hp cc quy c v giao din v cch p ng xc nh cch mt client tng tc vi mt li. c bit, OGSI cn nh ngha cc giao din chun v qui tc ca dch v li xy dng trn c s cc dch v web. Globus Toolkit l phn trin khai ca OGSI, n rt hu dng trong vic trin khai nhng g c c t bi. Grid Services l cc dch v li da trn nn tng Web Services v c m rng t Web Services.

1.3.2 Chun OGSI


OGSI gii thiu mt m hnh tng tc cho cc dch v li. M hnh ny cung cp mt phng thc c nh bng cch a ra cc giao din dng trong pht hin, qun l vng i, trng thi, to-hu, thng bo s kin v qun l tham chiu.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 20

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hnh 1-4 Mi quan h gia OGSA v OGSI

1. Factory: l c ch (giao din) cung cp cch to cc dch v li mi. Cc Factory c th to ra nhiu th hin tm ca mt chc nng hn ch, chng hn mt b lp lch to mt dch v m t cch thc hin mt cng vic thng thng; hoc chng c th to ra cc dch v tn ti trong thi gian di nh vic nhn bn cc b mt tp d liu c s dng lin tc. 2. Life cycle: l c ch dng ngn cc dch v li truy cp n cc ti nguyn khng c yu cu. Cc dch v li c to vi vng i xc nh. 3. State managemen: mi dch v li u c mt trng thi. OGSI xc nh mt khung biu din cc trng thi v mt c ch kim duyt hoc sa i chng. OGSI cng quy nh s cc trng thi ti thiu m mi dch v li phi c. 4. Service groups: l tp cc dch v li c ch nh cho mt mc ch ring no . 5. Notification: cc dch v tng tc vi nhau thng qua c ch trao i thng ip trn cc li triu gi dch v. Thng tin trng thi c m hnh cho cc dch v li s thay i khi h thng chy. 6. Handle Map: dng cho cc vn nhn dng. Khi cc factory c s dng to ra mt th hin mi ca dch v li, Factory tr v nh danh ca th hin mi ny. nh danh ny l s kt hp ca Grid Service Handle (GSH) v mt Grid Service Reference (GSR). GSH cung cp tham chiu n nh danh ca dch v li cn GSR c th thay i theo thi gian sng ca cc dch v li. Cc ci t ca OGSI

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 21

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

ng vai tr l phn ct li ca kin trc dch v li, OGSI cn phi c trin khai trn mt nn c h tr cc dch v Web. Thng th nh cung cp s ci t OGSI bng cch s dng trc tip cc thnh phn thc thi m m c sn ca Globus v sau tch hp n vi platform ca ring h nh Webspere, Weblogic... Mt s giao thc chun SOAP (Simple Object Access Protocol): l c ch truyn thng c lp vi cu hnh nn v cc giao thc khc. N dng XML trao i thng tin trong mi trng phn tn. N h tr nhiu kiu trao i thng tin khc nhau bao nh: m hnh trao i thng tin sau khi gi th tc t xa hoc m hnh trao i thng tin trong mt c ch hng thng bo. SOAP thng c lp v giao thc, c lp v ngn ng, c lp v cu hnh v h iu hnh, h tr cc thng bo XML SOAP. UDDI (Universal Description Discoverry and Intergation): kho cha phn lu tr ca cc dch v Web. UDDI l phn nh dng m t dch v Web chun. Mt ng k UDDI c th cha siu d liu cho bt k kiu dch v no. UDDI cho php tm kim v khai thc cc dch v Web hiu qu hn. WSDL (Web Service Definition Language): ngn ng nh ngha cc dch v web. L ngn ng cung cp cch m t giao din c bit ca cc dch v web v cc API. WSDL c th c xem nh l ti liu c vit bng XML. XML (eXtensible Markup Language): ngn ng nh du m rng. L mt siu ngn ng, s dng m t ng ngha ca i tng v m t phn cu trc bn trong mt kiu m. WSIL (WS-Inspection Language): tn dng nh dng ca XML lm tng kh nng khai thc v tp hp nhng m t ca dch v web n gin v m rng hn. WSIL l c ch n gin khai thc dch v web, n l mt nh dng XML.

1.3.3 Chun OGSA


OGSA xc nh phm vi cc nh dng dch v quan trng c yu cu h tr cho h thng mng li v cc ng dng trong lnh vc thng mi in t v khoa hc. N xc nh tp cc dch v nn tng thit yu cho cho rt nhiu trnh

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 22

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

ng dng v cc h thng; xc nh cc chc nng c yu cu mc cao i vi mi quan h tng tc gia cc dch v li. Mc tiu ca OGSA l nh ngha cc phng thc v c ch chun cho vn c bn chung ca cc h thng li, chng hn nh cch giao tip gia cc dch v, thit lp nh danh, nh quyn truy cp, pht hin ti nguyn-dch v, thng bo li, v qun l tp cc dch v Cc thnh phn c bn ca OGSA C ba thnh phn chnh ca OGSA: C s h tng dch v li m OGSI: xy dng trn cc k thut dch v web v li, OGSI nh ngha c ch to, qun l v chuyn i thng tin gia cc dch v li. Cc dch v OGSA: xy dng trn cc c ch OGSI nh ngha cc giao din v cc hnh vi kt hp cho cc chc nng khng c h tr trc tip bi OGSI nh pht hin dch v, truy xut d liu, tch hp d liu Cc m hnh OGSA: h tr cc c t giao din bng cch nh ngha cc m hnh cho cc ti nguyn chung v cc kiu dch v. Cc dch v nn (Platform services) OGSA dng thut ng platform services ch nhng dch v cung cp cc chc nng c bn. Platform services (i) cung cp cc chc nng nn dng xy dng cc dch v li khc (ii) cung cp cc chc nng chung dng trong mt s cc dch v mc cao (iii) cung cp cc chc nng c thit k dng cho cc quan h m rng. Mt chc nng c cung cp bi mt dch v nn s c m t trong mt s cc dch v mc cao. Hin ti, tp cc dch v nn ca OGSA gm: OGSI: nh ngha cc dch v li v cc c ch nn to, qun l v trao i thng tin gia cc dch v WS-Agreement: cung cp mt tp giao din h tr vic iu chnh cc chnh sch, cc tho thun mc dch v, t trc CMM (Common Management Model): cung cp mt c s h tng c th qun l c cho cc ti nguyn trong OGSA. CMM nh ngha m hnh c
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 23

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

x c s cho tt c cc ti nguyn v cc b qun l ti nguyn trong li, cng thm chc nng qun l cc mi quan h v qun l vng i OGSA Data Services (cc dch v d liu OGSA): cung cp cc chc nng c bn qun l d liu trong mt mi trng li Cc yu cu v mt chc nng -

Yu cu chc nng c bn: khm ph v mi gii; o c v tnh ton; chia s d liu; trin khai; t chc o; gim st; chnh sch Yu cu bo mt: bo mt a phn; gii php bo mt phm vi; xc thc, u quyn v cp quyn; m ho; chng thc Yu cu v qun l ti nguyn: ng nht cch cung cp, o ho ti nguyn, ti u vic s dng, c kh nng lp lch v cung cp bng thng ng, c kh nng truy cp theo l v truy cp tng tc, h tr qun l v gim st vic s dng, lp lch ng cho cc tc v, m bo cc ti nguyn c s dng nh nhau, c kh nng t trc ti nguyn, c c ch ghi li cc x l, v phi qun l c lung cng vic v phi nh gi c vic s dng ti nguyn lp ho n cho ngi dng

Cc yu cu v c tnh ca h thng: phi c kh nng chu li, pht hin c him ho, t "chm sc sc khe" ca ti nguyn, gim st, theo di s tn cng, quy ry, qun l c cc ng dng k tha, c th "h thng ho" v "t ng ho" cc hot ng chun cho b qun tr, c kh nng khi to yu cu tng tc theo tho thun gia client v server v to nhm/tp cc dch v, cho php mt s dch v c k tha v s dng li cc dch v tn ti

1.4 Cc thnh phn chnh trong m hnh chc nng ca li


1.4.1 Bo mt
Bo mt lun l mt thnh phn quan trng trong bt k h thng tnh ton no trong c mi trng li. Khi ngi s dng thc hin cng vic t xa trn h thng khc, h thng quan tm ti vic liu h thng c m bo c rng
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 24

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cng vic v d liu ca h khng b truy cp tri php. Cn nh cung cp dch v th li phi m bo ng dng li khng lm gin on cc ng dng ang chy trn my ngi dng hoc khng giao tip, truy cp vi cc d liu c nhn. 1.4.1.1 C ch bo mt trong mi trng li Cc thnh phn tham gia li li chu tc ng ca chnh sch cc b trong phm vi ca mi thc th tham gia li. gii quyt kh khn ny, c ch bo mt li cho php t chc o dng chung mt phn chnh sch vi cc t chc thc. Gii php ti chng cc chnh sch nh trn bt buc bo mt li phi m bo cc chc nng nh: h tr nhiu c ch bo mt khc nhau; khi to ng cc dch v; thit lp ng cc min chng thc tin tng. 1.4.1.2 Cc chnh sch bo mt trong mi trng li Sau y l cc chnh sch bo mt: Mi trng li bo mt a min: tp trung iu khin cc tng tc lin min, nh x hot ng lin min vi cc chnh sch bo mt a phng Hot ng li hn ch trong n min qun tr: cc hot ng a min phi tun theo chnh sch bo mt a phng trn min qun tr n Cc ch th ton cc v cc b u tn ti: ti mi min qun tr n u tn ti hai ch th trn Chng thc a phng: hot ng gia cc thc th trong cc min tin tng khc nhau i hi phi c chng thc a phng Mi i tng ton cc c nh x vo i tng cc b c coi nh chng qua chng thc a phng trn i tng cc b Tt c cc quyt nh iu khin c a ra u l cc b hay da trn c s ca i tng cc b C th dng chung tp giy chng nhn vi cc chng trnh thay mt cho cng mt tin trnh, chy trn cng mt ch th trong cng mt min tin tng
Formatted: Bullets and Numbering

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 25

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim


Formatted: Bullets and Numbering

1.4.1.3 H tng an ninh mng li GSI (Grid Security Infrastructure) GSI l c ch cho php xc thc v truyn thng an ton trn mng li. N cung cp mt s dch v nh: kh nng xc thc ln nhau, c ch ng nhp mt ln, c ch u quyn. GSI da trn cc cng ngh m kho cng khai (Public Key Infrastructure hay PKI), Chng th X.509 (Certificate), Nghi thc truyn thng bo mt (Secure Socket Layer hay SSL). Nhng chun cng nghip v bo mt trn c thm vo c ch ng nhp mt ln (SSO) v u quyn (Proxy) to nn nn tng bo mt vng chc ca mng li. Sau y l mt s c im ca GSI v cc ci t ng dng ca n. C s h tng kha cng khai Chng thc trong GSI l thao tc cung cp cho mi thc th mt tn nh danh duy nht bng cch a ra khi nim giy y quyn li, n l mt cp giy chng nhn v kha m ha (kha b mt). Trong mi trng PKI, mi thc th phi trao quyn s hu kha b mt ca mnh bo m s ton vn ca h thng. Bo mt mc thng ip v mc giao vn GSI cho php thc hin bo mt mc giao vn v mc thng ip. Nu chng ta s dng bo mt mc giao vn, ton b truyn thng c m ha. Nu s dng bo mt mc thng ip th ch ni dung ca thng ip SOAP c m ha.
Formatted: Bullets and Numbering Formatted: Bullets and Numbering

Hnh 1-5 Bo mt mc giao vn

Hnh 1-6 Bo mt mc thng ip

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 26

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

C hai mc bo mt ny u da trn kha cng khai, v do c th m bo tnh ton vn, ring t v kh nng chng thc. Thng th hi thoi an ton phi m bo ti thiu kh nng chng thc. Ton vn thng rt cn thit, nhng c th b qua. M ha c th c kch hot m bo tnh ring t. Giy y nhim Trong mi trng li, ngi s dng cn c chng thc nhiu ln trong khong thi gian tng i ngn. GSI gii quyt vn ny vi khi nim giy y nhim. Mi giy y nhim s hot ng thay mt ngi dng trong mt khong thi gian y quyn ngn hn. Giy y nhim c giy chng nhn v kha b mt ring ca n, c to ra bng cch k ln giy chng nhn di hn ca ngi dng. S y quyn Cc ng dng ca ngi dng c th thay mt h trong mi trng li. GSI cho php ngi dng y quyn giy y nhim ca mnh giao dch cc my t xa. Chng thc GSI h tr c ch cho php chuyn cc tn nh danh GSI ca ngi dng vo trong cc nh danh a phng (ti khon ca mt ngi dng Unix cc b). Vic chng thc cc nh danh GSI s chuyn v chng thc cc nh danh a phng, cng vi vic , cc chnh sch a ra cng nm trong phm vi cc b nh: quyn truy nhp file, dung lng a, tc CPU,...
Formatted: Bullets and Numbering Formatted: Bullets and Numbering Formatted: Bullets and Numbering

1.4.2 Qun l ti nguyn li


1.4.2.1 Nhng thch thc trong qun l ti nguyn li Xut pht t c trng ca ti nguyn li Li khng gi quyn iu khin tuyt i i vi ti nguyn, ta phi pht trin phng thc qun l trn cc vng khc nhau v ngun ti nguyn khng ng nht. Cc ti nguyn li l s khng ng nht, cc t chc khc nhau c cc chnh sch qun l ti nguyn khc nhau, v mc ch ca ngi dng ti nguyn

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 27

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

li khc nhau hoc thm ch c th mu thun nhau. Hu ht cc ng dng yu cu s dng ng thi nhiu ti nguyn nhiu ni khc nhau hon thnh cng vic. Mt thch thc na i vi qun l ti nguyn l vn bo mt ti nguyn. V h thng li l phn tn c v a l v t chc. Cc t chc khc nhau c cc chnh sch v bo mt khc. Mt li c a dng ngun ti nguyn, mi ti nguyn li yu cu cc mc bo mt khc nhau. nh v ti nguyn li Khi c yu cu ca ngi dng, b qun l ti nguyn s tm ti nguyn t dch v ch mc (Index Service) sau nh v ti nguyn n mt s nt c th no trong li v ti cc nt ny th ti nguyn s c lp lch s dng. Khi mt ng dng ang chy, b qun l ti nguyn cn theo di trng thi ti nguyn v thng bo tr li cho b lp lch v h thng k ton. Khi c 2 yu cu c trnh n li cng lc th c 2 s cng c x l theo quy c hot ng ca hng i: khi mt ng dng yu cu s dng ti nguyn m hin ti ti nguyn ang phc v cho mt ng dng khc th n s c xp vo hng i cho n khi ti nguyn c s dng xong v sn sng phc v. Mi trng li phn tn v a l v ti nguyn li l khng ng nht, nn nh v ng ti nguyn, ta cn phi thit k mt h thng qun l ti nguyn ph hp v phi chuyn sang hng tip cn a tng v t chc ti nguyn phi tp trung. Vn thng lng ti nguyn li Qu trnh thng lng ti nguyn li da trn cc giao thc hay cc lut trong kinh doanh chuyn i cc lnh bun bn gia ngi s dng ti nguyn v cc nh cung cp ti nguyn. Hnh 1-7 minh ha cc giao thc thng lng m c hai pha mua v bn cn trong qu trnh mc c. u tin, pha khch hng kt ni vi nh cung cp. Sau khi nhn c gi ti nguyn, c hai bn bn v mua s tin hnh thng lng. Khi thng lng thnh cng, pha khch hng s yu cu ngng kt ni v s dng ti nguyn .

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 28

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hnh 1-7 M hnh thng lng ti nguyn li

1.4.2.2 H qun tr ti nguyn GRAM GRAM l dch v c xy dng trn c ch bo mt GSI, n ng vai tr l b qun l, phn chia ti nguyn trong ton b h thng tnh ton li. Kin trc ca GRAM Kin trc bn ngoi: c th trnh mt cng vic ln mt host, ngi dng s thng qua cc API ca GRAM Client xc lp cc thng tin v ti nguyn m cng vic cn , ng thi to ra tin trnh mi. Nhng thng tin ny s c gi n Gatekeeper tng ng. Gatekeeper ny s xc thc nhng thng tin c gi n da vo c ch bo mt GSI. Nu tt c u hp l, Gatekeeper s to ra mt Jobmanager phc v cho job. Job manager s phn tch kch bn RSL do ngi s dng gi ti. Nhng kt qu phn tch c ngay lp tc c gi ti cc ngun ti nguyn cc b v tin hnh thc thi cng vic. Bn cnh , jobmanager cng s to ra cc tin trnh lm nhim v theo di v iu khin cng vic trong sut qu trnh x l. Trong lc cng vic ang thc thi hay thc thi xong, cc ngun ti nguyn cc b s phi thng xuyn cp nht thng tin ti nguyn v cho MDS. MDS sau s
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 29

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

hin th nhng thng tin ny cho php ngi dng xem xt v la chn ngun ti nguyn no thch hp vi cng vic ca mnh. Kin trc bn trong: c th thc thi mt cng vic t xa, GRAM Gatekeeper phi c chy trn mt remote computer, lng nghe mt cng c quy nh trc; cng vic s c thc thi trn remote computer . Vic thc thi bt u khi ng dng ngi dng chy trn my cc b gi job request n remote computer. Request s mang cc thng tin v lnh thc thi, lung vo, lung xut cng nh cc thng tin v tn v cng giao tip ca remote computer. Request job s c x l bi GRAM Gatekeeper, t n s to ra mt job manager tng ng m cng vic yu cu. Lc , Jobmanager s theo di tnh trng thc thi job v chu trch nhim thng bo thng tin ca job cho ngi s dng.

1.4.3 Qun l d liu


Qun l d liu l mt phn quan trng trong tnh ton li n cho php truy nhp ti nguyn trn li vi khi lng ln hng giga-bytes thm ch hng terabytes d liu. Qun l d liu phi m bo c tnh an ton v n nh trong qu trnh di chuyn d liu gia cc nt trong mng li h tr qu trnh thc thi cc cng vic trong h thng tnh ton li. 1.4.3.1 Giao thc truyn tp tin mng li GridFTP GridFTP l giao thc truyn tp tin ging nh FTP [29] hay truyn d liu nh HTTP. y l giao thc c hiu nng cao, an ton v ng tin cy nht trn mng Internet hin nay. GridFTP c cc nh chuyn mn nh gi cao v n cung cp cc tnh nng c trng ph hp vi kin trc mng li nh: Bo mt theo chun GSI trn cc knh iu khin v knh truyn d liu To lp v qun l cc knh truyn d liu song song, cho php tng tc truyn d liu ti mc k lc. Trao i tng phn tp tin d liu, c bit hiu qu vi cc tp tin d liu c dung lng cc k ln.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 30

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Trao i d liu vi s tham gia ca pha th ba. y l nghi thc cho php chuyn tp tin trc tip t my ch ti my ch khi knh iu khin nm trn my ch th ba Xc thc cc knh truyn d liu Ti s dng cc knh truyn d liu v dn truyn cc lnh iu khin M rng t FTP GridFTP bao gm mt s chc nng trong giao thc FTP m rng v c chun ha nhng t c ci t trong cc h thng hin ti. Cc chc nng khc l cc chc nng mi so vi FTP nh: C s h tng bo mt li v h tr Kerberos iu khin bi i tc th ba Truyn d liu song song; phn on v tng phn T ng thng lng vng m TCP Truyn d liu tin cy v c kh nng khi ng li

Kin trc ca dch v GridFTP

Hnh 1-8 Kin trc ca GridFTP

B thng dch giao thc PI: Server PI c nhim v qun l knh iu khin, trao i thng tin vi my khch qua knh ny. my khch lin lc c vi my ch GridFTP, server PI
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 31

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

phi chy nh mt chng trnh thng tr, lun lng nghe cng no . Hoc mt dch v khc ca h thng phi lng nghe trn cng ny, v khi nhn c yu cu th s chuyn li gi ti Server PI. Tip , Client PI v Server PI ni chuyn vi nhau qua giao thc nh sn. Trong sut qu trnh truyn thng, Server PI ch quan tm ti vic xy dng m t cho qu trnh truyn d liu. Thi in ny, n khng lin h vi DTP hoc DTP c th cha chy. Khi nhn lnh yu cu hot ng lin quan ti DTP, Server PI gi bn m t qu trnh truyn cho DTP. DTP t thc hin truyn da vo bn m t ny. Khi bn m t c gi i, Server PI ch ng vai tr l tng chuyn tip cc thng tin trng thi. Tin trnh truyn d liu DTP: Bn thn DTP c cu to bi ba mun kt hp nh sau:

Hnh 1-9 ng ng truyn d liu DTP

1. M-un truy cp d liu: chu trch nhim c/ghi d liu ti ngun/ch. Giao din truy cp gm cc lnh gi, nhn, to, xa, i tn, tnh tng, kim tra. 2. M-un x l d liu: x l d liu pha my ch: nn, co dn, ghp ni cc tp. Hin ti chc nng x l d liu c ci t cng mun truy cp d liu. 3. M-un giao thc knh d liu: m nhim vic x l knh d liu, gm cc thao tc np/gi d liu. Mt my ch c th h tr nhiu knh truyn d liu. Bo mt trong GridFTP GridFTP cung cp vic chng thc an ton knh iu khin, m bo tnh ton vn v b mt cho knh d liu. C ch bo mt ca n xy dng trn nn GSI. Phin lm vic c thit lp khi my khch khi to kt ni TCP ti cng m my ch GridFTP server ang lng nghe. u tin din ra qu trnh chng thc. y l qu trnh bt tay ba bc. My khch trnh mt giy y nhim, giy ny

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 32

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cha thng tin v ngi dng i din cho my khch gm nh danh, kha cng khai, tn nh thm quyn My ch cng phi a ra mt giy chng nhn ring c cp bi nh thm quyn m my khch tin tng. Nu qu trnh kim tra thng tin trn cc giy chng nhn ny tht bi, lin kt khng c thit lp. Ngc li, giai on xc nh thm quyn din ra: xc nh quyn hn truy cp ca my khch i vi d liu trn my ch. iu ny c thc hin bng cch nh x my khch vi mt ngi dng a phng trn my ch. Quyn truy cp ca ngi dng a phng s tng ng vi quyn truy nhp ca my khch. Thng tin nh x c lu trn my ch trong mt tp grid-mapfile. Nu cha c thng tin nh x, tin trnh truyn d liu khng c hon thnh. Mc nh, knh iu khin c m ha bo m tnh ton vn. Ci t dch v GridFTP Ci GridFTP trn nt li cung cp dch v. Nt ny c gi l my ch GridFTP. Ci GridFTP Client ci trn my khch, thc hin gi yu cu ti my ch GridFTP truy xut d liu. thc hin chc nng truyn tp iu khin bi i tc th ba, hai nt li tham gia qu trnh truyn phi c ci t GridFTP Server. 1.4.3.2 Dch v nh v bn sao RLS Mc ch to bn sao l lm gim tr truy cp, tng tnh a phng ca d liu, tng hiu nng, kh nng m rng, v tnh chu li ca cc ng dng phn tn. H thng s dng bn sao cn c k thut xc nh v tr bn sao. Yu cu i vi mt dch v nh v bn sao RLS phi tha mn cc yu cu sau: Bn sao c tnh ch c: RLS ch qun l tp khng thay i hoc thay i khng thng xuyn, c nh danh duy nht di cc phin bn khc nhau Phm vi s dng: h thng phi c kh nng tri rng trn hng trm min, qun l khong 50 triu tp lgic v 500 triu bn sao vt l

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 33

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hiu nng: h thng phi c kh nng h tr khong 1000 truy vn v 200 ln cp nht trn mt giy. Thi gian hi p trung bnh phi t hn 10 miligiy, v thi gian hi p truy vn trung bnh khng vt qu 5 giy

Bo mt: RLS quan tm nhiu nht ti bo v tnh ring t v ton vn ca thng tin tn ti v v tr d liu Tnh nht qun: RLS khng h tr khung nhn nht qun hon ton i vi tt c bn sao Tnh tin cy: li xy ra mt min khng lm nh hng ti ton b hot ng ca h thng

Kin trc ca dch v nh v bn sao Kin trc ca dch v qun l bn sao phi m bo c yu cu thc thi trn mi trng phn tn cao. Trong kin trc RLS, my ch nh v bn sao cc b cho tng min c gi l LRC Local Replica Catalog. My ch thc hin nhim v nh ch mc cc LRC. Giao din truy xut ca ngi s dng c gi l RLI Replica Location Index. Thng qua RLI, ngui s dng c th tm n cc LRC mt cch d dng. LRC phc v ngui dng cc b trong t chc, cn RLI phc v ngi s dng trn phm vi ton b li. Nh vy, trn phm vi ton li d liu, dch v RLS c trin khai di dng mt tp cc LRC phn tn ti site a phng v mt s RLI nh ch mc cho cc LRC. Kho nh v bn sao cc b LRC LRC lu gi thng tin v cc bn sao ca mt t chc c th. LRC c mt s chc nng nh: V ni dung: lu tr nh x gia tn tp lgic bt k vi tn tp vt l V truy vn: p ng c cc truy vn: Cho mt LFN, tm tp cc PFN tng ng vi LFN , Cho mt PFN, tm tp LFN tng ng vi PFN V tnh ton vn cc b: qun l tnh ton vn gia ni dung ca tn lgic vi ni dung thc s c lu trn cc h thng lu tr V bo mt: thng tin trong LRC c th lin quan n iu khin truy cp, v th h tr k thut chng thc v xc nhn khi x l yu cu t xa
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 34

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

V s lan truyn trng thi: LRC thng xuyn gi thng tin trng thi thng tin v s thay i cc nh x ti RLI, bng cch s dng thut ton lan truyn trng thi

Ch mc nh v bn sao RLI: LRC ch lu tr thng tin nh v bn sao ti cc t chc, ch phc v ngi s dng trong phm vi t chc . N khng h tr ngi dng truy vn nhiu t chc cng mt lc. Thng tin ch mc trong dch v nh v bn sao c lu di dng mt tp cc RLI, mi RLI bao gm tp bn ghi gm hai trng (LFN, con tr ti LRC). RLI c th nh ch mc cho RLI khc. Da trn k thut d tha, phn on, v trng thi mm, c th ch ra cc yu cu i vi mt nt ch mc nh v bn sao ton cc RLI nh sau: Truy cp t xa an ton: RLI phi h tr chng thc, xc nhn, tnh ton vn, tnh tin cy, v phi trin khai quyn iu khin truy cp cc b trn thng tin m n qun l Lan truyn trng thi: RLI phi c kh nng nhn thng tin m t trng thi do cc LRC gi n nh k Truy vn: RLI phi tr li truy vn ti bn sao ca mt LFN c th bng cch tr v v tr vt l ca LFN hoc thng bo rng LFN khng nm trong ch mc hin thi, trong trng hp khng tm thy Trng thi mm: RLI phi n nh thi gian ht hn i vi thng tin lu tr trong ch mc. Nu mt mc gn lin vi mt LRC khng nhn c thng tin trng thi cp nht t LRC trong khong thi gian n nh, RLI phi loi b mc Phc hi khi li xy ra: RLI khng c php cha thng tin trng thi bn vng v cc bn sao. N phi khi phc ni dung sau s c ch bng cch s dng cp nht trng thi ng t cc LRC Cc tham s c trng ca kin trc RLS c t mt phm vi rng ln kin trc ca RLS, ngi ta dng b su tham s (G, PL, PR, R, S, C). Bn tham s u tin (G, PL, PR, R) m t tnh phn
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 35

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

tn ca thng tin bn sao. Hai tham s sau nh ngha cch thng tin c gi t LRC n RLI. Phn tip theo phn tch ngha tng tham s: G: S lng RLI trong h thng PL: c trng cho kiu phn nhm tn tp lgic trong RLI PR: c trng cho kiu phn nhm khng gian tn LRC R: Ni n mc d tha trong vic nh ch mc i vi mi tn tp lgic LFN S: Tn sut v cch thc cp nht thng tin t LRC n RLI C: Phng php nn thng tin trao i gia LRC v RLI

1.4.4 Lp lch trong mi trng li


Sau khi xc nh c ti nguyn cn thit ta phi lp lch trnh cc cng vic c thc thi. Nu cc cng vic l hon ton c lp th c th khng cn b lp lch. Nhng thng th ng dng i hi cn phi d tr ti nguyn no , hoc cc cng vic cn giao tip vi nhau. Do , cn c b lp lch phi hp cc cng vic. Lp lch trong li bao gm 3 giai on chnh [16]: Khm ph ti nguyn v a ra danh sch ti nguyn c th s dng c La chn ti nguyn ph hp nht i vi yu cu cng vic Thc thi cng vic Giai on 1: khai ph ti nguyn Xc nh xem ti nguyn no kh dng i vi ngi dng hin ti. Bc 1: Tm cc ti nguyn kh dng: xc nh tp ti nguyn m ngi dng c thm quyn truy nhp ti Bc 2: Xc nh yu cu ng dng: ngi dng phi nh ra mt tp cc yu cu ti thiu thc hin cng vic lc cc ti nguyn kh dng Bc 3: Loi b nhng ti nguyn khng p ng c yu cu ti thiu ca cng vic: cn c vo danh sch cc ti nguyn m ngi dng c quyn s dng v cn c vo kt qu phn tch yu cu cng vic bc hai, ta loi b tt c nhng ti nguyn khng p ng c nhng yu cu ti thiu ca
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 36

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cng vic. n cui bc ny ngi s dng s c trong tay mt tp cc ti nguyn c th dng trin khai cng vic Giai on 2: La chn ti nguyn. Tin hnh thu thp cc thng tin lin quan ti cc yu cu cn li ca cng vic v la chn ra ti nguyn thch hp nht thc thi cng vic. Bc 1: thu thp thng tin ng: xc nh xem thng tin no sn c v ngi dng c th truy nhp n n nh th no Bc 2: La chn ti nguyn: sau khi c y thng tin v ti nguyn ngi dng s la chn nhng ti nguyn ph hp nht cho yu cu v mc ch ca h. Bc ny thng do b lp lch v qun l ti nguyn thay mt ngi dng m nhn t ng bng cch gii bi ton ti u Giai on 3: Thc thi cng vic. Tin hnh cc bc thc thi cng vic trn ti nguyn chn, gim st trng thi cng vic v gi kt qu li cho ngi s dng Bc 1: t trc ti nguyn (ty chn): c th s dng tt nht mt h thng no , mt phn hoc ton b ti nguyn phi c t trc Bc 2: trnh cng vic: sau khi chn c ti nguyn ng dng, cng vic cn phi c trnh ln ti nguyn thc hin bng cch chy mt dng lnh n hoc chy mt dy cc kch Bc 3: Cc cng vic chun b: trong bc ny pha ngi dng s lm cc cng vic cn thit ng dng c th chy c. V d: dng GridFTP chuyn cc file d liu cn thit n a im ni cng vic s chy Bc 4: Theo di tin : ty thuc vo ng dng v thi gian chy ca n m ngi dng c th mun theo di tin v c th s thay i nh ca h v vic cng vic s c thc hin u v nh th no Bc 5: Hon thnh cng vic: khi cng vic kt thc th cn phi bo cho ngi s dng bng mt hnh thc no

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 37

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Bc 6: Dn dp v kt thc: sau khi mt cng vic c thc hin xong, kt qu cng vic phi c gi li cho ngi trnh, ng thi cc file tm thi cng phi c xa i

1.4.5 Grid Portal


H thng TTL ch cung cp cho ngi s dng mt tp hp cc dch v chung v kh nng khai thc cc ngun ti nguyn phn tn. N khng cung cp cc thnh phn giao din thn thin phc v ngi s dng. V vy, i hi cn phi c mt cng c cung cp cc thnh phn giao din phc v ngi s dng. Trc thc t , mt cng giao tip h thng Gird vi tn gi Grid Portal ra i.

Grid Services

Hnh 1-10 Cng in t GridPortal

Grid Portal l cng kt ni dch v gia ngi dng v nh cung cp dch v, c pht trin nh mt phn mm trn mng Intenet cung cp cc chc nng cn thit theo hng ngi dng. Vic s dng cng ngh Portal cho php to mi trng lm vic ring bit cho tng ngi dng, ng thi tch bit cc chc nng dch v ring bit t pha my ch v ti s dng cc thnh phn chc nng ca Web. Grid Portal c hnh thnh t hai khi nim cng (portal) khc nhau: Cng chuyn cho ng dng (Application Specific Portal) cung cp mt tp con cc thao tc truy cp Grid chuyn bit t bn trong mt ng dng, t cc min c bit. V Cng chuyn cho ngi dng (User Specific Portal) cung cp cc dch v ring lin quan cc site phc v cho mt tc v truyn thng no .
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 38

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

trin khai cng ngh GridPortal, chng ta c th s dng cng c pht trin GPDK (Grid Poratl Development Kits) [30]. 1.4.5.1 Cc yu cu i vi Grid Portal 1. Cc yu cu v an ton bo mt: hin ti, khi pht trin cc Grid Portal, mt s yu cu sau lin quan ti vn bo mt nh qun l giy y nhim li, giao tip vi c s h tng bo mt GSI thng qua Portal: Ngi dng phi c mt trnh duyt Web chun tip cn cc Grid Portal Ngi dng c th truy nhp ti nhng im m h khng c sn giy chng nhn Grid Ngi dng c th lm bt c iu g thng qua Grid Portal trong phm vi cho php ca giy y nhim 2. Qun l cc file t xa: ngi dng c th truy cp cc dch v th mc v file xa v c th s dng cc cng c phn loi, lp cc bn sao d liu. 3. Qun l cng vic thc hin t xa: ngi dng c th thc thi v quan st cc cng vic thc hin t xa, ng thi c th thc thi cc cng vic phc tp theo. 4. Truy cp cc dch v thng tin li: ngi dng c th tm kim, truy vn cc yu cu thng qua cc dch v lu tr thng tin ca ton b h thng li. 5. Giao din ng dng: cho php n i nn tng phc tp ca c s h tng li. Ngi dng tng tc vi li thng qua mt giao din thn thin. 1.4.5.2 Chuyn ti cc Job trong Grid Portal Mt h tng bo mt GSI m rng t phin bn ca SSH c th tng tc vi cc jobs c ti ln h thng Grid. Tt c cc jobs c ti ln s c b qun tr GRAM gi n cc Gatekeepers. Cc thnh phn chnh ca cng c GDPK dng chuyn ti cc job l JobBean, JobSubmissionBean v JobInfoBean. Bn cnh ba thnh phn chnh, GDPK cn cung cp hai thnh phn bean ph khc c lin quan n job l JobMonitorBean v JobHistoryBean. Truyn tp tin (File Tranfer)
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN
Formatted: Bullets and Numbering

Trang 39

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Vic di chuyn tp tin hin nay c cung cp bi thnh phn FileTranferBean. N c hai dng: GSISCPTranferBean: c s dng i vi cc h thng c ci t GSISSH gip di chuyn file thng qua c ch GSI c m rng t scp GSIFPTTranferBean: ci t GSI m rng t FTP thun java v n c kh nng di chuyn file gia cc third-party khc nhau C hai dng trn u bo m an ton cho vic sao lu d liu t my ngun n my ch thng qua c ch xc thc user. H s ngi dng (User Profile) User profile hot ng da trn ba thnh phn bean sau : UserLoginBean: cung cp mt dch v la chn xc thc user trong qu trnh login vo Portal. N ch khi to username, password da vo cc file mt m lu tr tng ng trn web server to ra mt truy cp hp php UserAdminBean: cung cp kh nng sp xp vic x l tun t cho UserProfileBean v hp thc ha cc user profile c bo v bng password
-

UserProfileBean: dng duy tr cc thng tin trc y v user nh thng tin xc thc, ti nguyn tnh ton, cc job submit trong qu kh UserProfileBean c khi to cng lc vi session theo di trong sut qu trnh user thao tc trn portal

1.4.6 Gim st li
Trong mi trng li, nhu cu gim st cc ti nguyn l rt cn thit. Cc ti nguyn ca li lun trng thi ng, chng c th gia nhp vo li ri sau rt ra khi li vo bt k thi im no. Ngi dng phi c kh nng tm kim nhng ti nguyn mong mun v gim st cc ti nguyn . Ngoi vai tr cung cp thng tin cho ngi dng, h thng gim st li cn ng vai tr quan trng trong cc hot ng lp lch, nhn bn d liu, phn tch hiu nng, xy dng ng dng t iu chnh,...

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 40

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

1.4.6.1 Quy trnh gim st Quy trnh gim st cc h phn tn thng bao gm bn bc nh sau: 1. Sinh cc s kin: b cm bin tin hnh o c trn cc thc th v m ha kt qu thu c theo mt lc cho trc. 2. X l cc s kin: cc s kin c x l theo tng ng dng c th. 3. Phn phi cc s kin: cc s kin c chuyn n cc bn quan tm. 4. Trnh din cc s kin: cc s kin c x l t ti mc tru tng cao, ngi dng rt ra c kt lun v trng thi ca h thng. Giai on ny thng c thc hin bi mt ng dng ha, hin th d liu tc thi theo thi gian thc hoc ly d liu t cc kho lu tr v hin th. 1.4.6.2 Yu cu i vi mt h thng gim st li Mt h thng gim st li phi tha mn c nhng yu cu sau y: Kh nng m rng: phi hot ng tt khi s lng ti nguyn v ngi dng tng ln tr x l nh: trong li, cc s kin lin tc c sinh ra vi tc cao v s lng ln, ng thi trnh tnh trng d liu b lc hu th h thng gim st phi c tc x l d liu cao nhm t c tr nh nht t xm phm n cc ti nguyn: thao tc o c din ra thng xuyn s tiu tn ng k cc ti nguyn. H thng gim st phi gi c mc tiu th ti nguyn ca mnh mc chp nhn c H tr nhiu m hnh truyn d liu: thng tin gim st bao gm cc s kin tnh v cc s kin ng nn n i hi cc chnh sch o c khc nhau nh o nh k hay o mi khi c yu cu Kh chuyn: cc ti nguyn trong li l rt khng ng nht, bi vy cc thnh phn ca h thng gim st phi c tnh kh chuyn cao Bo mt: h thng gim st phi h tr cc dch v bo mt nh iu khin truy nhp, chng thc, vn chuyn an ton cc thng tin gim st Kh nng ng b ha cao: bn nhn cn phi bit mi ca mt s kin,

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 41

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

do h thng gim st phi c kh nng ng b ha cao gia cc thnh phn 1.4.6.3 Kin trc b gim st li GMA (Grid Monitoring Architecture) T chc li ton cu GGF (Global Grid Forum), xut mt kin trc chung cho cc h thng gim st li gi l kin trc GMA. tng ca GMA l tch bit thao tc tm kim d liu vi thao tc truyn d liu v xy dng cc thnh phn c kh nng hot ng trong theo cu trc phn cp.

Hnh 1-11 Cc thnh phn ca kin trc GMA

1. Dch v th mc (Directory Service): phc v vic m t v khm ph cc s kin. N lu tr thng tin v cc b sinh v b tiu th c kh nng tham gia tng tc. Bn chc nng ca mt dch v th mc l: Thm, Cp nht, Xa mt mc thng t th mc; Tm kim cc b sinh hay b tiu th da trn mt s tiu ch. 2. B sinh (Producer): l bt k thnh phn no s dng giao din sinh gi cc s kin ti mt b tiu th. Mt b sinh cung cp cc chc nng c bn nh: Bo tr bn ng k; Chp nhn truy vn; Chp nhn ng k; Chp nhn ngng ng k; nh v b tiu th; Thng bo; Khi to ng k; Khi to ngng ng k 3. B tiu th (Consumer): b tiu th l bt k thnh phn no ci t giao din tiu th nhn cc s kin t mt b sinh. Mt b tiu th c cc chc nng c bn nh: nh v b sinh; Khi to truy vn; Khi to ng k; Khi to ngng ng k; Bo tr bn ng k; Chp nhn thng bo; Chp nhn ng k; Chp nhn ngng ng k; nh v s s kin.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 42

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Cc tng tc vi dch v th mc Cc b sinh cng b s tn ti ca mnh nh cc mc thng tin trong dch v th mc. Nh , dch v th mc c dng nh v cc b sinh v b tiu th. Cc b tiu th s dng dch v th mc tm kim cc b sinh ph hp vi yu cu ca mnh, ngc li cc b sinh cng dng dch v th mc tm kim cc b tiu th ph hp. C b sinh ln b tiu th u c th ng vai tr khi to tng tc khi tm c i tc ph hp, khi cc thng bo iu khin v cc d liu gim st s c truyn trc tip gia ni sinh v ni nhn m khng lin quan n dch v th mc na. Cc tng tc gia B sinh v B tiu th Kin trc GMA h tr ba kiu giao tip gia b sinh v b tiu th. l: 1. Xut bn/ng k: kiu giao tip ny gm ba giai on. Giai on u tin, nt khi to (b sinh hoc b tiu th) lin h vi mt nt phc v v thng bo cc tp s kin m n quan tm. giai on hai, b sinh (nt khi to hoc nt phc v) gi cc s kin cho b tiu th. Giai on cui cng, b sinh hoc b tiu th chm dt tng tc bng cc thng bo iu khin. 2. Truy vn/tr li: trong kiu tng tc ny, nt khi to bt buc phi l mt b tiu th. Giai on mt l giai on thit lp tng t nh giai on mt ca tng tc xut bn/ng k. Tuy nhin trong giai on hai, tt c cc s kin c truyn mt ln duy nht ti b tiu th m khng c thng bo kt thc. 3. Thng bo: l kiu tng tc mt giai on, nt khi to l mt b sinh. B sinh s truyn tt c cc s kin ti b tiu th trong mt thng bo duy nht. 1.4.6.4 Phn loi cc h thng gim st li Cc h thng gim st c chia thnh bn mc nh sau: Mc 0 (Level 0): cc s kin c chuyn trc tip t b cm bin ti b tiu th theo mt trong hai ch online hoc offline. ch online, cc kt qu o c thng c truy nhp ti thng qua mt giao din web. ch offline, kt qu o c b cm bin ghi vo kho lu tr v sau c b tiu th ly ra. Mc 1 (Level 1): trong cc h thng loi ny, cc b cm bin c xy dng
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 43

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

ring v nm trn cng mt my vi cc b sinh, hoc chng c tch hp vo trong cc b sinh. Trong c hai trng hp, cc s kin c truy nhp thng qua cc API ca b sinh. Mc 2 (Level 2): so vi cc h thng mc 1, cc h thng mc 2 c thm cc thnh phn trung gian. Cc chc nng c phn b trn c b sinh v thnh phn trung gian (c th nm trn my khc) thay v ch nm trn mt b sinh duy nht. Mc 3 (Level 3): cc h thng mc ny c tnh linh hot cao nh cc thnh phn trung gian c t chc theo cu trc phn cp. Mi thnh phn trung gian thu thp v x l cc s kin t cc thnh phn trung gian hay b sinh nm mc thp hn v sau gi chng ln cc thnh phn trung gian mc cao hn. Cc h thng mc 3 rt thch hp cho mi trng li. Mt h thng tiu biu thuc loi ny l Globus MDS.

1.5 Kt chng
Chng ny tm hiu c cc khi nim, cu trc, thnh phn ca mt li tnh ton. ng thi cng gii thiu v cc chun kin trc c dng trong tnh ton li v cc dch v chnh cn cung cp trong mt liQua ta thy rng, xy dng c mt li tnh ton ch da vo h tng phn cng l khng th, im quan trng trong li l c s h tng phn mm. Hin ti, c nhiu b cng c phn mm h tr cho vic trin khai mt li, trong b cng c ni ting nht v c s dng rng ri nht hin nay l Globus Toolkit. Chi tit v b cng c ny s c trnh by Chng 2 ca lun vn.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 44

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

CHNG 2. TNG QUAN V GLOBUS


Globus l mt h tng cung cp cc cng c phn mm xy dng tnh ton mng li v cc ng dng khc da trn cng ngh Grid mt cch d dng. Cc cng c ny c gi chung l Globus Toolkit (GT). N bao gm cc module cng c, mi cng c nh ngha mt giao din phc v cp cao v cung cp nhiu ton t cp thp c th thc hin c trn nhng mi trng khc nhau. Chng ny gii thiu ch yu l v phin bn 4 ca GT (GT4), n chnh thc c ban hnh thng t nm 2005 vi nhiu chc nng mi ca dch v Web nh: cc thnh phn, tnh tng thch vi cc chun, tnh tin li, v cht lng ti liu.

2.1 Tng quan kin trc chung ca GT


2.1.1 Cc chc nng chnh ca GT
V GT c coi l cng c nn dng xy dng li tnh ton, v vy ta s nghin cu kin trc ca GT bng cch nh x tng ng cc thnh phn chc nng chnh ca li sang cc thnh phn chc nng chnh ca GT [6] bao gm: nh v v cp pht ti nguyn: cung cp k thut dng cho cc ng dng i hi ti nguyn ngay lp tc hoc ti nguyn c t trc. y l chc nng bt buc phi c v ng dng khng th xc nh ng v tr ca ti nguyn cn thit nht l khi cc ti nguyn c sn b thay i Truyn thng: cung cp k thut truyn thng c bn, cho php thc thi hiu qu vic truyn t thng tin trn din rng bao gm vic truyn thng ip, gi th tc, phn phi b nhvi tin cy cao trn mt bng thng rng Thng nht thng tin phc v: cung cp k thut ng b thng tin v cu trc v trng thi thc ca h thng metasystem. Ngoi ra, n phi c thnh phn cung cp thng tin v h tr vic iu khin yu cu, truy cp thng tin Giao din xc thc: cung cp nhng k thut c bn v xc thc dng xc nhn tnh hp l ca ngi dng v ti nguyn

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 45

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

To tin trnh hot ng: c s dng thc hin tnh ton trn ti nguyn c nh v v phn pht. N bao gm: thc thi s iu chnh, to ra mi trng thc thi, chuyn i s, kt hp vi tin trnh mi vo, qun l s kt thc v tin trnh shutdown Truy xut d liu: chu trch nhim truy xut d liu tc cao lu tr di dng cc tp tin

2.1.2 Cc c trng ca GT4


Cc c ch v giao din c chun ho Trong mi trng tnh ton li, cc thnh phn thng tng tc vi nhau qua mng. Do cch xy dng h thng phn tn l khng ng nht nn cn thit phi th hin chun ho cc c ch x l cng vic chung. Vic chun ho trong mi trng li nhm mc ch: D dng xy dng v hiu cc thnh phn ring l D lin thng gia cc ci t khc nhau ca cng mt giao din D chia s cc thnh phn v pht trin cc cng c cho php dng li GT4 chn c ch dch v Web nh ngha giao din v cu trc cc thnh phn [6]. Cc dch v Web ny cung cp c ch mm do, m da trn XML m t, pht hin, v thc thi cc dch v t xa. Giao din dch v Web c GT4 dng cho hu ht cc thnh phn chnh, do cho php s dng cc c ch dch v Web chun m t cc giao din dch v GT4. Trong mt vi nm tr li y, nhiu phn mm c pht trin h tr trin khai nhiu thnh phn h thng phn tn c ci t giao din dch v Web. GT4 dng GT4 Web Services Containers trin khai v qun l cc dch v GT4 c vit bng nhiu ngn ng nh C, Java. Tuy nhin, ta cn ch l dch v Web thng khng ch trng vo hiu nng ca h thng. C s h tng ng dng ca ngi dng thng yu cu tng trung gian ca GT4 phi cung cp cc cng c duy tr v qun l c s h tng phn cng ca Grid. Cc khi nim hng dch v v cc c ch dch v Web c dng truy xut v qun l
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 46

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cc phn t h tng thng qua cc giao din ca chng. GT4 ci t cc giao din dch v Web : qun l cc ti nguyn tnh ton v theo di qu trnh hot ng ca cc ti nguyn ny (GRAM); qun l thit b o c (GTCP); v qun l vic truyn d liu tin cy (RFT). Ngoi ra, GT4 cn cung cp dch v GridFTP nhg cha c nh ngha giao din dch v Web. Pht hin v theo di Pht hin v theo di l 2 chc nng quan trng trong li. Vic pht hin ti nguyn hoc dch v ph hp vi yu cu l cn thit, trong khi vic theo di gip duy tr hot ng ca cc ti nguyn hoc dch v trong h thng. u tin, GT4 cung cp cc c ch chun gn kt cc thuc tnh ti nguyn da trn XML vi thc th mng v truy xut cc thuc tnh ny bng c ch ko-y. Cc c ch ny l nhng ci t ca c t WSRF v WSNotification c xy dng trong mi dch v v container ca GT4. Th hai, GT4 cung cp ba dch v kt [6] nhm thu thp thuc tnh trng thi t cc ngun thng tin ng k. Ba dch v ny ci t mt bn ng k (MDS-Index), mt b lu tr (MDSArchive), v mt b lc d liu hng s kin (MDSTrigger). Cui cng, i vi chc nng pht hin v theo di, GT4 cn cung cp cc giao din trnh duyt, cc lnh, v cc giao din dch v Web ngi dng truy vn v nhn thng tin. Bo mt Bo mt l yu t rt quan trng trong bt c h thng no, nht l i vi h thng phn tn gm nhiu ti nguyn v ngi s dng nm ri rc nhiu ni nh Grid. GT4 v cc cng c lin quan cung cp nn tng bo mt hon chnh khng ch i vi truyn thng trn mng m cn i vi cc ti nguyn v ngi dng. D liu Trong li, d liu thng rt ln v nm ri rc. V vy chng cn c qun l, tch hp li truy xut nhanh chng. GT4 c rt nhiu thnh phn h tr vn ny nh GridFTP, RLS, DAI (cung cp kh nng truy xut ti d liu XML v cc c s d liu quan h) [6]. Phi hp
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 47

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

GT4 c nhiu cng c phi hp mc cao nh DAGman v MPICH-G2.

2.1.3 Tm lc v kin trc ca GT4


Tm lc kin trc ca GT4 [6] gm: Kin trc hng dch v: gm nhiu dch v hon chnh (i km cc th vin chun) tng tc vi nhau qua cc giao thc chun ho Dch v h tng: gm cc dch v truy xut, theo di, qun l v iu khin cc phn t h tng Dch v Web: tun theo cc giao thc dch v Web c s dng rng ri v cc c ch m t, pht hin, truy xut, xc thc, phn quyn dch v GT4 Containers: gm cc thnh phn to nn GT4 container ni cha cc dch v Web vit bng Java, C v Python Bo mt: cho php thit lp mt h thng bo mt cao, m, v uyn chuyn nhm bo v thng tin, xc thc v phn quyn ngi dng, i din v y quyn Chun ho: c ci t theo cc chun c chp nhn rng ri, h tr to cc thnh phn m, d dng s dng li nh cc cng c chun Cng c lin quan: GT4 l tng trung gian cung cp nn tng tt nht cho cc ng dng li Hn ch: GT4 cn nhiu hn ch v hiu nng, tnh d s dng, v sc mnh

2.2 Kin trc hng dch v


Phn ny s ni n cc khi nim c bn v dch v Web v vic s dng cng nh ci t chng trong GT4, nht l trong cc thnh phn li ca GT.

2.2.1 GT4, cc h thng phn tn, cc dch v Web


GT4 gm cc phn mm ci t c ch dch v Web xy dng cc h thng phn tn, cc thng ny lun phi i u vi cc vn nh tr mng, s c cc b, tng tranh. Ngoi ra, cc thnh phn h thng c th nm ri rc trn cc vng a l khc nhau v c qun tr ring bit.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 48

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hnh 2-1 Cc thnh phn trong kin trc dch v Web

Mt dch v Web [27] l mt h thng phn mm c thit k h tr tng tc gia my vi my (machine-to-machine) trn mng, ni cch khc cho php cc phn mm d dng tng tc vi nhau theo mt chun thng nht. Mi dch v c mt giao din c m t theo khun dng, ngn ng chun (WSDL) gip ngi lp trnh v cc phn mm hoc dch v khc d dng phn tch v hiu c dch v cung cp ci g, nh th no. Thng ip c dng trao i gia cc dch v tun theo giao thc SOAP.

2.2.2 C s h tng v ng dng hng dch v


Cc cng ngh dch v Web-GT4 c th dng xy dng c c s h tng hng dch v v ng dng hng dch v. Cc ng dng hng dch v gm nhiu dch v giao dch vi nhau. C s h tng hng dch v gm cc dch v nn tng cung cp cc c ch qun l ti nguyn phn cng v phn mm phn tn. Ci t dch v Web xy dng mt ng dng hng dch v Web, ta cn phi ci t 2 phn l: Mi trng ch (container): m nhim cc chc nng trao i vi client bng thng ip SOAP, xc nh v gi dch v x l thng ip, qun l, iu hnh cc dch v Web. Ci t dch v Web: vit on m x l thng ip gi ti.

Hnh 2-2 Cc thnh phn chc nng chnh trong ci t dch v Web
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 49

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

c t dch v Web y l cc c t dch v Web c ci t trong GT4 [6]: XML c dng trong dch v Web nh dng d liu chun, mm do v m rng. GT4 cho php XML xut d liu dng text v nh phn.
SOAP 1.2 cung cp c ch chun, m rng v hng thnh phn ng gi

v trao i thng ip XML gia nh cung cp dch v v khch hng. SOAP c lp vi nghi thc truyn thng, tuy nhin thng thng HTTP c dng nh giao thc nn tng.
WSDL 1.1 l mt ti liu XML m t dch v Web, bao gm cc qui c

gn kt chun gia SOAP vi cc giao thc nn khc.


WS-Addressing nh ngha c ch nh a ch cho dch v web v thng

ip. N nh ngha cc phn t XML xc nh cc endpoint ca dch v Web v m bo phn bit cc endpoint ny trong thng ip.
WS Resource Framework (WSRF) nh ngha c cu m v tng qut cho

vic m hnh ha v truy xut cc ti nguyn.


WS-Notification nh ngha nn cho php ph bin thng tin gia cc dch v

Web. Ngoi ra GT4 cn bao gm mt s ci t m rng khc nh: GridFTP l m rng ca FTP; DAIS tun theo c t OGSA-DAIS ca din n li ton cu [6].

2.2.3 Kin trc hng dch v (Service Oriented Architecture-SOA)


Kin trc hng dch v SOA l mt form ca kin trc cc h thng phn tn c cc c trng sau [6]: 1. Logical view: dch v l mt cch nhn logic, tru tng ca chng trnh, c s d liu, quy trnh nghip v 2. Message orientation: dch v c nh ngha theo thng ip trao i gia ngi cung cp v ngi yu cu ch khng phi v bn thn cc i tc. Cu trc bn trong ca i tc c tru tng ho trong kin trc SOA.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 50

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

3. Description orientation: dch v c m t nh siu d liu nhng d dng c x l bng my tnh. Cch thc s dng dch v v ng ngha ca n l ni dung chnh cn m t. 4. Granularity: dch v ch gm mt s t cc thao tc ng vi cc thng ip di v phc tp. 5. Network orientation: dch v c xu hng s dng mng my tnh. 6. Platform neutral: thng ip c trao i trn mt nn tng trung lp, c chun ho khun dng v c phn phi thng qua cc giao din. XML l la chn tt nht tho mn iu kin ny.

2.3 Kin trc GT4


2.3.1 Kin trc tng quan
Hnh sau m t kin trc ca GT4 bao gm cc dch v c ci t bn pha ch v cc th vin gn vi pha khch. GT4 cung cp cc thnh phn dch v Web (bn tri) v c thnh phn khng l dch v Web (bn phi) [6].

Hnh 2-3 Cc thnh phn trong kin trc GT4

Cc hnh hp trng trong phn "Client" ch cc ng dng hoc cng c truy xut n dch v GT4 hoc cc dch v c pht trin trn GT4. Cc dch v GT4 c nh ngha trc

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 51

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

GT4 cung cp cc dch v Web c giao din, l: GRAM, RFT, y quyn, dch v theo di v pht hin ti nguyn (MDS-Index, MDS-Trigger), MDSArchive, dch v phn quyn trong cng ng, dch v tch hp v truy xut d liu (OGSA-DAI) v dch v iu khin thit b t xa (GTCP - Grid TeleControl Protocol) [6]. GRAM v MDS-Index tuy li thi nhng vn c cung cp trong GT4. Ba dch v GridFTP, RLS v MyProxy (kho u nhim th trc tuyn) tuy c ci t nhng cha c giao din dch v Web y . Ngoi ra GT4 cn cung cp cc th vin lin kt vi cc c ch xc thc v y quyn khc nh MD5, LDAP, Kerberos, cng nh th vin IO m rng (XIO) cho php truy xut nhiu giao thc vn chuyn c s. Ta c th s dng SimpleCA nh l mt c ch cp chng th gn nh. GT4 Containers GT4 ci t cc dch v Web quan trng cho ng dng li, mt s trong chng c chun ho nh WSRF, WS-Notification, v mt s ch c trong Globus nh GRAM, RFT. Cc dch v ny c th kt hp vi cc thnh phn khc (my phc v Web, my SOAP) to ra nhiu loi GT4 container khc nhau. Mt GT4 container [6] l mt thng ng cha cc dch v Web vi cc c trng sau: Ci t giao thc trao i thng ip SOAP trn nn HTTP cng nh c ch bo mt mc giao vn v mc thng ip WS-Sercurity cho qu trnh truyn thng Ci t cc dch v WS-Addressing, WSRF, v WS-Notification H tr ghi chp qua Log4j m ci t giao din lp trnh Jakarta Commons Logging API nh ngha ti nguyn dch v Web (WS-Resources) vi cc thuc tnh cung cp thng tin v cc dch v c trin khai trong container v cc thuc tnh ca container nh phin bn v thi im khi ng

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 52

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

v th mt GT4 container c th cha cc dch v m giao din ca chng c nh ngha theo c t dch v Web c bn v cc dch v m giao din ca chng theo nn WSRF. Ngoi ra GT4 Java container cn c th cha cc dch v nng cp cung cp bi Globus nh GRAM, MDS, v RFT. Cc ng dng pha client c th s dng giao din ng k trong GT4 container n xc nh dch v no hin hu trong container, v truy xut giao din qun tr GT4 container thc hin cc chc nng qun tr c bn.

Hnh 2-4 GT4 Container tch hp cc dch v v cng c

GT4 h tr 3 ngn ng ci t dch v Web l Java, C, v Python:

Hnh 2-5 Bn cu hnh GT4 container

2.3.2 Trin khai dch v Web trn GT4


Trin khai mt dch v Web bao gm cc cng vic ci t v khi to vic thc thi mt dch v Web trn mt my tnh c th hay ni r hn trn mt container. Bi v mt container dch v Web c th c cu hnh cho mt mc ch c th, nn vic trin khai dch v Web c th bao hm: 1. Trin khai dch v Web vo trong mt GT4 container. 2. Ti cc th vin b sung (WSRF) vo mt non-GT4 container, sau trin khai dch v Web vo container ny.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 53

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

3. Trin khai mt GT4 container mi, sau mi trin khai dch v Web hoc 4. Trin khai GT4 container v dch v Web ng thi. Gi phn mm GT4 cung cp cc cng c d dng thc hin cng vic ny trong cc mi trng khc nhau. Chng hn nh, trin khai dch v GT4 GRAM th cn: 1. Nu GT4 Java container c ci t, ta c th trin khai dch v GRAM mt cch trc tip. 2. Nu my phc v ci t Tomcat, nhng cha ci gi Globus no th ta cn trin khai cc dch v WSRF, WS-Notification, v sau l GRAM. 3. Nu cha c g th ta nn ci t gi GT4 hon chnh gm GT4 Java container v dch v Web GRAM. Gi ny c th ci t, cu hnh, v chy d dng.

2.4 Qun l thc thi trong GT4


Cc cng vic lin quan ti qun l thc thi trong mi trng phn tn [6] thng l: to mt chng trnh chy nh mt dch v mng; phn b; chy mt chng trnh thc thi t xa; chy mt chng trnh song song trn cc my tnh phn tn; v chy mt tp cc cng vic khng thng xuyn trao i d liu vi nhau. Cc cng vic ny i hi phi cn n cc cng c qun l thc thi nh: khi to; theo di; qun l; lp lch v iu phi cc cng vic tnh ton t xa trong mi trng phn tn. GT4 cung cp gi "Qun l v cp pht ti nguyn li - GRAM" qun l thc thi. Thng thng GRAM c trin khai cng vi cc gi MyProxy v RFT nhm tng tnh bo mt, xc thc v trao i d liu. Gi GRAM gm 3 thnh phn chnh: B lp lch GRAM cho cc cm my trong mt mng cc b (Condor, OpenPBS, Torque, PBSPro, SGE, LSF). H thng cung cp cc giao din cho cc my tnh xa (OpenSSH) hoc ci t cc m hnh lp trnh song song khc nhau trong mi trng li bng cch dng GRAM phn b cc cng vic tnh ton ti cc my tnh t xa (Condor-G, DAGman, MPICHNguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 54

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

G2, GriPhyN VDS, Nimrod-G). Cui cng l B siu lp lch nh x cng vic ti cc nhm (CSF, Maui).

2.4.1 Tng quan v GT4 GRAM


ngi dng xa thc thi mt chng trnh thng qua mt dch v Web, ta cn phi nh ngha v ci t mt dch v Web gm mt phng thc gi thc thi chng trnh t xa. Tuy nhin, ci t c ta phi gii quyt cc vn : 1. State: cng vic tnh ton c th thc hin cc thao tc nhp/xut trong khi chy lm nh hng ti trng thi ca ti nguyn tnh ton hoc h thng tp tin gn vi cng vic ny. 2. User executables: ngi dng c th cung cp chng trnh ca ring h gi thc thi t xa. 3. Staging of input and output: chng trnh thc thi, d liu vo/ra c th ln, xa, v/hoc c chia s vi cc li gi thc thi khc. V th, yu cu b tr d liu vo/ra l cn thit. 4. Streaming output: mt s chng trnh thc thi cn c kh nng cung cp kt qu kp thi cho ngi dng khi ang chy. 5. Control: ngi dng i khi cn ngng mt cng vic ang thc thi. 6. Scheduler: cc ti nguyn tnh ton ln thng chu s iu khin ca mt b lp lch cp pht ti nguyn theo cc chnh sch u tin mt cch ti u v hiu nng. 7. Monitoring: i vi mt s cng vic phc tp cn c theo di qu trnh thc thi v cc thao tc can thip kp thi nh pending, suspending, staging, H thng GRAM kt hp vi cc cng c qun l c c pht trin nhm gii quyt cc yu cu trn. GRAM cho php: thc thi mt chng trnh bt k; hon thnh cc thao tc tin cy; thc hin theo di cc trng thi; qun l y nhim th; b tr tp tin; tng tc vi cc b lp lch c. Mi cng vic khi c trnh s c: mt thc th stateful l ManagedJob c to ra trn my ch tnh ton vi chu k sng ng vi cng vic; mt handle hay mt tham chiu a ch dch v Web (WS-Addressing endpoint refrence). Client
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 55

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

s nhn c handle ny v s dng n truy vn trng thi ca cng vic, hy cng vic, nhn "thng bo" khi cng vic thay i trng thi, v/hoc nhn kt qu ra. Di quan im ca ngi qun tr h thng, GRAM khng phi l mt b lp lch ti nguyn m l mt phng tin trung gian tng tc vi cc b lp lch cc b khc thng qua mt khun dng thng ip chun. Gi GRAM ca GT4 cho php lin kt vi cc b lp lch nh Condor, LSF, SGE, v PBS

2.4.2 Lnh globusrun-ws


Lnh ny bao gm cc chc nng qun l cng vic thc thi nh: gi cng vic thc thi, theo di qu trnh thc thi v qun l cc cng vic v cc chc nng tinh vi khc nh: truy xut file, y quyn, gi cng vic ch mt ln, timeout, bo mt thng ip, v phn quyn. Chc nng Basic Job submission Lnh trnh thc thi chng trnh "/bin/touch" vi tham s vo "touched_it" c g lnh nh sau:
% globusrun-ws -submit -job-command /bin/touch touched_it

Nu trnh thnh cng, ta s nhn c thng tin trng thi ca chng trnh ny khi n ang chy v khi kt thc nh sau (cc thng tin ny c hin th ngay di lnh va g):
Submitting job...Done. Job ID: uuid:c51fe35a-4fa3-11d9-9cfc-000874404099 Termination time: 12/17/2004 20:47 GMT Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done.

Nu khng thch g lnh, ta c th m t cng vic cn trnh trong mt file XML, v lnh gi thc thi cng vic ny nh sau:
% cat touch.xml <job> <executable>/bin/touch</executable> <argument>touched_it</argument>

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 56

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

</job> % globusrun-ws -submit -job-description-file touch.xml

xc nh ni chng trnh s thc thi, ta phi m t mt tham chiu ti im cui trong mt file.epr nh:
% cat gram.epr <factoryEndpoint mlns:gram=http://www.globus.org/namespaces/2004/10/gram/job xmlns:wsa="http://schemas.xmlsoap.org/ws/2004/03/addressing"> <wsa:Address> https://vizlogin.isi.edu:9000/wsrf/services/ManagedJobFactoryService </wsa:Address> <wsa:ReferenceProperties> <gram:ResourceID>PBS</gram:ResourceID> </wsa:ReferenceProperties> </factoryEndpoint> % globusrun-ws -submit -job-description-file touch.xml factory-epr-file gram.epr

Gi tr mc nh ca -factory v -factory-type ln lt l "localhost" v "fork". Trong v d trn th cng vic s c thc thi trn my hin ti bng li gi fork() ca h iu hnh UNIX.
% globusrun-ws -submit -factory localhost -factory-type fork \ -job-command /bin/touch touched_it

Tng tc vi cng vic gi C 2 cch tng tc vi cng vic sau khi gi l: tng tc Online, ngha l to v duy tr mt lin kt lin tc vi cng vic gi nhn c thng tin v trng thi cng vic cng nh ra lnh ngng gia chng; tng tc Offline hoc Batch mode, ngha l khng duy tr lin kt lin tc m tr v mt tham chiu n im cui EPR ca cng vic. Ngi dng c th yu cu cho bit trng thi cng vic khi cn. lm iu ny, ta dng ty chn -job-epr-output-file tip theo l file s cha EPR trong lnh trnh cng vic pha trn [6]. nhn c trng thi hin ti ca cng vic, ta g lnh:
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 57

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

% globusrun-ws -status -job-epr-file mj.epr

hoc chuyn sang ch online:


% globusrun-ws -monitor -job-epr-file mj.epr

hoc ngng (kill) cng vic:


% globusrun-ws -kill -job-epr-file mj.epr

Trng thi v chu k sng ca cng vic Hnh sau m t s dch chuyn trng thi trong chu k sng ca cng vic.

Hnh 2-6 S dch chuyn trng thi

Unsubmitted. Cng vic cha c gi StageIn. ang i truy xut file input Pending. B lp lch cc b vn cha lp lch cho cng vic thc thi Active. Cng vic ang thc thi Suspended. Tm ngng thc thi cng vic StageOut. Cng vic thc thi xong, ang i ghi kt qu ra CleanUp. thc thi xong, hon thnh in kt qu, thc hin dn dp Done. Hon thnh mi vic Failed. B li Trong nhiu trng hp, tp tin thc thi v d liu vo cn c a n

B tr file vo, ra, li v cc c tnh khc my ch v/hoc d liu ra phi c a ra. Vic b tr tp tin ny c c t bng cc ch th chuyn tp tin trong file m t cng vic. Nhng ch th ny tun theo c php RFT gm a ch URL ngun v ch [6]. i din tin cy

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 58

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Mt cng vic gi ti my ch GRAM nhiu khi cn c y nhim th nu mun: Thc hin cc thao tc t xa, nh gi cng vic GRAM, hay gi cc dch v Web khc. Trng hp ny cn c y nhim th cng vic job credentials. Ngi dng s cung cp u nhim th km vi yu cu gi thc hin cng vic. y nhim s ht tc dng ngay khi cng vic kt thc Thc hin cc thao tc b tr sp xp d liu. Trng hp ny cn y quyn b tr staging credentials. Ngi dng s gi y nhim th cho dch v Delegation chy trn my ch GRAM, v chng c th c dng cho nhiu yu cu thc thi cng vic GT4 cung cp cc cng c sau p dng 2 ch u quyn ny: Cu lnh globus-credential-delegate cho php to c 2 loi u nhim th ti my ch GRAM v tr v mt tham chiu EPR ti y nhim mi c to. (client c th dng lnh globus-credential-refresh lm mi li u nhim th) Sau khi c u nhim th bng cch trn, clien c th s dng u nhim th ny bng cch nhp tham chiu EPR vo tp tin m t cng vic trong cc phn t jobCredential, stagingCredential, v/hoc transferCredential S dng cc tu chn -job-delegate v -staging-delegate ca lnh gbusrun-ws yu cu to y nhim th i km vi mt cng vic. ngha ca cc tu chn ny nh sau: -job-delegate: Nu trong tp tin m t cng vic khng cha phn t jobCredential, globusrun-ws s to u nhim th v a phn t tng ng vo file m t cng vic staging-delegate: Nu trong file m t cng vic c cha cc ch th staging hoc cleanup v khng c cc phn t stagingCredential hoc transferCredential, globusrun-ws s t ng to phn t tng ng v a vo trong file m t cng vic

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 59

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

2.4.3 Cch thc hot ng ca GT4 GRAM


Phn ny trnh by cch thc hot ng ca GRAM hiu r hn v c ch hot ng ca n cng nh cch thc pht trin dch v da trn WSRF [15]. Kin trc GRAM Kin trc ca GRAM c m t nh trn hnh 2.7 bao gm:

Hnh 2-7 Kin trc GRAM

1. Virtual Host Environment Redirector: tip nhn cc thng ip SOAP (SOAP message) v gi chng n cho mi trng ngi dng . 2. Master Managed Job Factory Service MMJFS: chu trch nhim cung cp dch v chy ng dng GRAM cho cc yu cu t bn ngoi.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 60

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

3. Managed Job Factory Service MJFS: chu trch nhim to ra mt dch v qun l cng vic MJS (Managed Job Service) mi ngay khi n nhn c mt yu cu to dch v. MJFS tn ti t khi khi to mi trng ngi dng (UHE) cho n khi phin lm vic kt thc. 4. Managed Job Service MJS: l mt dch v OGSI cho php cc c t cng vic c th trnh cng vic ti h thng lp lch a phng, theo di tnh trng v a ra cc thng bo. 5. File Stream Factory Service FSFS: c coi nh l mt giao din nhn yu cu, chu trch nhim ci t mt dch v v file (File Stream Service) mi khi n nhn c yu cu to dch v. 6. File Stream Service FSS: mt dch v OGSI cho mt a ch URL ch s a file a phng va c to n a ch URL ch. 7. Resource Information Provider Service RIPS: l mt dch v khai bo cung cp thng tin v h thng lp lch ti nguyn, h thng file, h thng my ch, 8. Grid Resource Identity Mapper GRIM: l thnh phn chu trch nhim to ra giy chng nhn cho ngi dng sau khi kim tra grid-mapfile. Bo mt GRAM h tr cc mc kim tra an ton bo mt: C ch WS-Sercurity nhm xc nhn nhng u quyn gn vi yu cu v cng xc thc ngi yu cu. Phn quyn c thc hin bng mt callout cp quyn. Callout ny s truy vn file gridmap, SAML server, hay mt c ch cp quyn khc. Nu c quyn, cng vic s c thc thi v tr v mt s hiu ID cc b. Lnh tin ch sudo s c dng qun l ti nguyn cc b. Cc dch v chy trong GT4 container khng yu cu mt quyn c bit: cc thao tc cn quyn cao s c thc hin qua hm sudo. Thao tc lin quan ti d liu Cc thao tc lin quan ti vic b tr d liu vo/ra ca cng vic c GRAM giao cho dch v RFT m nhim. Tu theo yu cu, dch v RFT khi to
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 61

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

kt ni GridFTP gia my ngun v ch. Ngoi cc thao tc b tr d liu chun, GRAM cn h tr c ch cp nht d liu ra mt cch lin tc khi cng vic ang tin hnh. Giao din lp trnh GRAM client 4 PortType ca giao din lp trnh cho GRAM client nh ngha cc thao tc trn 4 WS-Resources m i din cho trng thi ca GRAM server, trng thi ca mt cng vic n l, chui u nhim th gn vi delegation factory, v mt u nhim th. GT4 cung cp cc gn kt (binding) Java, C, v Python vi 4 portType ny: PortType ManagedJobFactory: cung cp thao tc truy vn thng tin v trng thi ca GRAM server. Ngoi ra, n cn c thao tc createManagedJob trnh cng vic. Nu thnh cng, n s tr v mt WS-Resource ca ManagedJob v mt tham chiu EPR ca ManagedJob mi ny PortType ManagedJob: nh ngha thao tc trn WS-Resource ManagedJob cho php truy xut thng tin v trng thi cng vic, qun l thi gian sng ca cng vic qua WS-Resource WS-ResourceLifetime, v cho php client yu cu c thng bo khi c thay i qua WS-Resource WSBaseNotification PortType DelegationFactory: h tr cc thao tc nhm truy vn thng tin v cc dy u nhim th gn vi delegation factory. N cn nh ngha mt thao tc RequestSecurityToken nhm thc hin ti mt th bo mt ln dch v i din v tr v mt tham chiu EPR ca WS-Resource mi ny PortType Delegation: cung cp cc thao tc WS-ResourceLifeTime i vi th bo mt. Thao tc refresh ca portType ny cho php client ti ln mt u nhim th mi

2.4.4 Cu hnh v qun tr GT4 GRAM


Ta cn phi ci t mt GRAM server cho php cc dch v Web c th truy xut cc ti nguyn tnh ton. Hai bc chnh cu hnh gm:

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 62

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

1. Local scheduler interface. GRAM c th dng cc c ch, cng c c sn khi to v iu khin cc cng vic. Mc nh GRAM dng li gi fork() ca h thng. Cc cng c lp lch theo l nh Condor, SGE, PBS, v LSF phi c cu hnh cho php GRAM gi cc cng vic v theo di trng thi hot ng ca cng vic. 2. Authorization to user mapping. GRAM dng file /etc/grid-security/gridmapfile xc nh ngi dng a phng c quyn thc thi cng vic do mt yu cu ca mt ngi dng Grid gi ti.

2.5 Qun l d liu trong GT4


2.5.1 Tng quan v qun l d liu trong GT4
Cc cng c qun l d liu trong GT4 lin quan n vic lu tr, vn chuyn v qun l cc d liu phn tn trong mi trng khng ng nht. GT4 cung cp nhiu thnh phn bao gm [6]: GridFTP chu trch nhim vn chuyn d liu tin cy v hiu qu RFT qun l nhiu qu trnh vn chuyn RLS duy tr thng tin v cc bn sao d liu OGSA-DAI cho php truy xut v tch hp d liu c/bn cu trc Ngoi ra cn phi k n cc cng c kt hp khc nh: NeST gii quyt vic t ch lu d liu UberFTP cung cp client dng lnh cho GridFTP SRB cung cp giao din chung cho d liu phn tn DataCutter, STORM h tr x l pipeline cho d liu phn tn SRM Storage Resource Manager cung cp truy xut GridFTP trn nn bo mt GSI ti cc kho lu tr GridNFS ci t NFSv4 trn nn GSI pht trin bi i hc Michigan Cc thnh phn chnh ca GT4 chia thnh 2 nhm chnh: di chuyn d liu v sao lu d liu.

2.5.2 Di chuyn d liu


C 2 thnh phn lin quan n vn chuyn d liu, l GridFTP v RFT.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 63

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

GridFTP Giao thc ny trong GT4 cung cp kh nng chuyn d liu an ton, nhanh, mnh, hiu qu v cho php truyn file theo m hnh third-party. Trong m hnh truyn file truyn thng, vic truyn file ch thc hin c gia client v server. Cn trong m hnh truyn file third-party, Client c th thc hin truyn file gia cc server vi nhau. B Globus Toolkit cung cp gn nh hon chnh ci t ca giao thc ny bao gm: GridFTP globus-gridftp-server, lnh globus-url-copy v th vin lp trnh. chia s d liu ta ci t mt GridFTP server trn my ch m c th truy xut c d liu, v mt giao din lu tr d liu Data Storage Interface (DSI) ph hp vi h thng ang lu tr d liu. truy xut n d liu c chia s, ta cn mt GridFTP client. GT cung cp mt lnh client l globus-url-copy. Lnh ny cho php truy xut d liu qua nhiu giao thc (http, https, ftp, gsiftp, v file). Dch v chuyn tp tin tin cy - RFT Khi yu cu chuyn tp tin c chp nhn, chng ta c th theo di trng thi truyn qua cc phng thc m dch v RFT cung cp. GT4 cung cp mt ci t ca dch v RFT trong GT container v mt client kh n gin. Ngoi ra lp trnh vin c th dng mt s lp Java pht trin theo mun.

2.5.3 To bn sao d liu


Dch v nh v bn sao - RLS RLS l dch v cho php ngi dng hoc dch v khc ng k tp tin trong RLS khi tp tin c to ra, sau n chu trch nhim to v duy tr cc bo sao ca tp tin ny trn cc h thng lu tr phn tn. Ngi dng c th truy vn RLS server tm cc bn sao ny. RLS gm nhiu server nm ri rc lu d liu ng k. Bng cch phn tn d liu, kch thc ca h thng c th m rng v lu tr c nhiu bn sao hn h thng qun l tp trung. Trong GT4, nhim v ca RLS l duy tr cc nh x gia tn tp tin logic vi mt hoc nhiu tn tp tin vt l. Ngi dng c th yu cu danh sch cc tp vt l ca mt tn tp tin logic v ngc li. Ngoi ra, RLS cho php gn cc thuc tnh
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 64

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

hoc thng tin m t (kch thc hoc checksum) vi cc tn tp tin logic hoc vt l, v cho php truy vn theo nhng thuc tnh ny. Dch v sao d liu DRS GT4 thit k v ci t dch v DRS da trn 2 thnh phn qun l d liu ca GT l RFT v RLS. Chc nng ca DRS l m bo mt tp cc files tn ti trn mt ni lu tr. u tin, dch v DRS s truy vn RLS server xc nh v tr ca cc file mong mun trong mng li. Sau khi nh v xong, DRS s yu cu chuyn tp tin ti ni mong mun bng dch v RFT. Cui cng, DRS s ng k bn sao mi ny vi RLS server. DRS c ci t nh mt dch v Web tun theo c cu WSRF. Khi nhn mt yu cu DRS, mt WS-Resource c sinh ra lu trng thi cho mi file cn sao chp, bao gm c thao tc trn file thnh cng hoc khng.

2.6 Theo di v pht hin


Cc c ch theo di v pht hin nhm gii quyt vn lin quan n thu thp, phn tn, ch mc ha, lu tr v x l thng tin v cu hnh v trng thi ca dch v v ti nguyn. GT4 cung cp: 1. Dch v Web c bn cho cc giao din WSRF v WS-Notification. y l nn tng xy dng cc dch v theo di v pht hin, nh ngha cc thuc tnh cn thit cho cc dch v ny, v h tr ch truy xut ko/y. 2. Dch v GRAM v RFT nh ngha cc Resource Properties lm c s cho dch v theo di v pht hin. 3. Dch v ch mc ha (MDS-Index), lu tr (MDS-Archive), v phn tch d liu i vi cc s kin (MDS-Trigger). 4. Mi GT container cha dch v MDS-Index pht hin cc dch v trong container cng nh lin kt vi cc ch mc khc trn mng li.

2.6.1 H thng theo di v pht hin - MDS4


H thng theo di v pht hin MDS4 l mt thnh phn ca GT4 thc hin theo di v pht hin cc dch v, ti nguyn trong h thng phn tn. Monitoring l qu trnh thu thp thng tin v cc ti nguyn hoc dch v, nhm khc phc li v
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 65

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

theo di qu trnh hot ng. Ngi dng c th dng h thng theo di ny nhn bit ti nguyn ang cn kit tm gii php khc phc kp thi. Discovery l qu trnh tm kim ti nguyn thch hp thc hin mt cng vic, v d nh: tm mt my ch tnh ton thc thi mt job. Qu trnh ny bao gm vic tm ti nguyn thch hp v la chn ci no ph hp nht. Nhng ng dng theo di v pht hin u i hi kh nng thu thp thng tin t nhiu ngun phn tn. thc hin iu ny, MDS4 cung cp: Cc dch v kt hp dng thu thp thng tin trng thi t cc ngun c ng k Giao din da trn trnh duyt, cc lnh, v giao din dch v Web cho php ngi dng truy vn thng tin thu thp c MDS4 cung cp 3 dch v kt hp khc nhau MDS-Index h tr truy vn Xpath MDS-Trigger thc hin hnh ng do ngi dng m t (gi email hoc ghi vo log file) khi thng tin nhn c ph hp vi yu cu ngi dng t ra MDS-Archiver lu tr thng tin trong c s d liu

2.6.2 B gp (aggregator) v ngun thng tin


hiu hn v MDS4 ta s tm hiu c cu b gp-ngun thng tin (aggregator-information source). tng chnh ca c cu ny l: Cc ngun thng tin cn thit s c ng k vi mt dch v gp Vic ng k c hiu lc trong mt khong thi gian. V th b gp gn vi n s t hu nu ng k khng c lm mi B gp lin tc cp nht trng thi hin thi ca ngun thng tin ng k qua c ch truy xut theo c t ngun thng tin (information-source-specific access mechanism) Sau thng tin ny s c b gp cng b ra ngoi thng qua giao din dch v Web theo c t ca b gp (aggregator-specific Web services interface)

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 66

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Cc b gp MDS4 hot ng khc vi cc phng php ng k tnh truyn thng nh UDDI ch chng cho php ng k softstate cc ngun thng tin v lun lm ti cc gi tr ngun thng tin theo chu k.

2.6.3 Ngun thng tin v vic ng k


Vic ng k c thc hin thng qua thao tc Add ca dch v Web WSServiceGroup. C 2 ch ng k ty theo c ch dng truy xut: 1. Ch ng k tng qut: cho php nhn thng tin t mt ngun bt k. Mt ngun thng tin c ng k thng qua mt chng trnh ca ngi dng chy nh k cung cp thng tin cho b gp. Chng trnh ny c th sinh ra thng tin mt cch cc b hoc dng giao thc theo c t ngun truy xut thng tin t xa, sau chuyn i d liu non-XML thnh XML. 2. ng k qua dch v Web theo c cu WSRF: thng tin trng thi c biu din nh l cc WS-Resource properties.

2.7 Kt chng
Nh vy, Globus l b cng c rt hu dng trin khai mt li. N l h tng phn mm cung cp cc cng c xy dng tnh ton mng li v cc ng dng khc da trn cng ngh li mt cch d dng. Cc cng c ny c gi chung l Globus Toolkit, gm nhiu module cng c, mi cng c nh ngha mt giao din phc v cp cao v cung cp nhng ton t cp thp c th thc hin c trn nhiu mi trng khc nhau. Hu ht cc li ang c trin khai hin nay u c thc hin da trn nn Globus Toolkit.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 67

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

CHNG 3. CC K THUT LI HIN C TRIN KHAI VIT NAM


Trong chng ny, ta s i tm hiu chi tit v mt s k thut xy dng cc li v ang c mt s nhm nghin cu v tnh ton li ca nc ta trin khai trong thi gian gn y.

3.1 Desktop Grids


3.1.1 Tnh ton phn tn trong cc x nghip
Nhu cu tnh ton phn tn trong cc tp on lm xut hin cc li tnh ton c to ra nh kt ni nhiu my tnh chy h iu hnh Windows hoc Unix vi nhau. Ngi s dng mong i vic qun l ti nguyn v bo mt trong li cng nh kh nng s dng li c thit lp bng cch k tha h tng tnh ton phn tn sn c ca h. Tuy nhin, vi dng li tnh ton ny, mi quan tm c tp trung ch yu vo bn kha cnh l: Qun l ti nguyn-Resource Management Bo mt d liu v giao tip-Communication and Data Security Tnh khng ng nht ca cc my trong li-Machine Heterogeneity Tnh sn dng ca ti nguyn-Resource Availability Vi Desktop Grid, ngi xy dng ra n c gng khai thc nhng khc bit c bn gia mt tp cc PC c kt ni trong mt mng cng ty vi mt tp cc my PC t ti nh c kt ni bng ng Internet bao gm: Vic kt ni mng- Network Connectivity: nhiu tp on c cc my PC c kt ni vo cc mng c tc cao, t 100 Mbps hay 1Gbps. Tuy nhin, vi cc my tnh ch thnh thong mi kt ni vo mng ca tp on th khong thi gian v cht lng kt ni thng rt kh nh trc S tham gia c yu cu-Required Participation: vic tham gia vo l lc tnh ton phn tn c th l mt phn yu cu ca mt cng ty. Nh n cung cp tnh m bo cao hn v thnh phn ca li
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 68

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Bo mt v Qun tr cc PC-PC Administrator and Security: y thc s l vn quan trng trong li v nhng thng tin nhy cm c th c phn b trn cc nt tnh ton ca li Truy nhp n cc ti nguyn dng chung-Access to Shared Resource: hu nh tt c cc t chc u c kho d liu chung trn mng ni b ca h. y c th ch n gin l mt nh x a dng chung c thit lp qua mt kt ni mng hoc phc tp nh mt kho d liu vi dung lng ln n muti-terabyte v multi-tier

3.1.2 nh ngha Desktop Grid


Destop Grid l mt li c cc c im sau: Gm mt tp cc my kt ni khng lin tc v my dng chung c t tn trn mt mt mng chia Gm mt tp chnh sch c iu khin bi ngi dng m t cch mi my tham gia vo li Mt topo mng o c kim sot bi mt my server chuyn dng. Cc my trn li thng bo vi nhau thng qua mt server trung tm Mt c ch qun l ch ng i vi vic phn phi, thc thi v thu hi cng vic n v i khi li nm di s iu khin ca mt server trung tm Mc ch pha sau vic hnh thnh mt Desktop grid l kt hp nhiu ti nguyn li thnh mt ti nguyn o duy nht, d s dng v c th qun l c. Mt s nh ngha chung cho cc thnh phn trong Desktop Grid nh sau: Grid: l thut ng c dng thay cho cm t Desktop Grid Grid Server: l mt my trung tm dng iu khin v qun tr li Grid Client: l mt nt ring l, thng l mt my PC bn hoc mt my xch tay c sn Grid Client Executive: l mt thnh phn phn mm ca h tng li, c tr trn mt PC, n c th lm cho mt PC phc v nh mt Grid Client, v qun l tt c cc tng tc gia Grid Client v Grid Server

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 69

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Work Unit: l n v cng vic c Grid Server giao cho mt Grid Client. Bao gm mt phin bn c th chy trn li ca ng dng, cc ch dn thit lp mt mi trng cho ng dng trn Grid Client, d liu vo, v cc ch dn thc thi ng dng cng nh to d liu ra

3.1.3 Gi tr ca li Desktop Grid


Desktop Grid to ra c hi gom ti nguyn tnh ton rnh ca cc PC; y l mt gii php hiu qu tng sc mch tnh ton cho t chc. Mt Desktop Grid i khi c xem nh mt siu my tnh o virtual supercomputer Thc t quan nim ny cng khng sai l my. Trong hnh di, c th thy biu ca top 500 siu my tnh hng u th gii c tng kt hi thng 11 nm 2002 [3].

Hnh 3-1 Xp hng ca cc Desktop Grid trn 500 siu my tnh hng u

3.1.4 Cc phn t k thut chnh


Cc phn t k thut chnh cn c nh gi i vi mt Desktop Grid l: K thut bo mt Khng cho php ng dng phn tn truy xut n ti nguyn mng hoc ti nguyn cc b bng cch: M ho ng dng v d liu; m bo phn mi trng Grid Client cn li khng b thay i sau khi chy mt ng dng phn tn; Ngn ngi dng cc b gy phin phc n s thc thi ca ng dng phn tn ang chy

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 70

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

trn my ca h; Ngn ngi dng cc b lc li v sa i hoc xo d liu i km cc ng dng phn tn. K thut kn o Grid Client Executive phi t ng gii phng cc ti nguyn khi khng cn dng; m bo vic thao tc l trong sut vi ngi dng a phng; Ngn cc ng dng phn tn hin th hp thoi thng bo hoc hin th cc yu cu tng tc; Ngn chn hin tng gim hiu nng (hoc gy li) trn my client do vic thc hin ng dng phn tn; Khng c yu cu tng tc vi ngi dng trn Grid Client. K thut tch hp ng dng K thut tch hp ng dng yu cu: Kh nng m phng mt mi trng chun vi mt Grid Client; Tch hp mc Binary (khng bin dch li, khng kt ni li, khng truy cp n m ngun); Tch hp mt cch d dng; Bo mt tch hp, m ho cc d liu nhy cm v phi h ch bt c ng dng Win 32 no. K thut Robustness K thut Robusness phi m bo c vic: T ng ti ch nh cc n v cng vic khi cc Grid Client b xo khi li; T ng ti ch nh cc n v cng vic do li ti nguyn hoc li mng; Ngn cc ng dng khc chi phi ti nguyn Grid Client; Cung cp kh nng h tr trong sut cho tt c cc phin bn Windows khc nhau trong Grid Client. K thut co dn Mt li Grid Desktop phi c th: t ng thm, cu hnh, ng k cc Grid Client mi; tng thch vi nhiu ti nguyn khc nhau; c th cu hnh trn nhiu v tr a l. K thut qun l tp trung Chnh l vic gim st ti nguyn li t ng; Qun l v xp hng cc n v cng vic tp trung; Qun tr vi chnh sch tp trung i vi vic truy xut v s dng li; Tng thch vi cc h thng qun l IT ang tn ti; Vic khi to v

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 71

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cp nht sn phn c th c hon thnh bng cch s dng cc mi trng qun l phn mm x nghip nh SMS, WinInstall; v Trin khai, qun l client t xa.

3.1.5 Cc kha cnh thc t cn xem xt


L cc kha cnh thc t chnh ny sinh khi tch hp my bn chuyn dng trong x nghip thnh mt li. Cc ng dng Cc ng dng chy trn li bn thng c chia thnh 3 nhm l: Data Parallel: x l ctp d liu vo ln, lin tc v khng l thuc nhau Parameter Sweep: s dng phng thc tng tc to ra mt chui cc gi tr u vo nhiu Probabilisti: x l mt s th nghim s dng nhp liu ngu nhin to ra cc gi tr u vo dng nh gi mt tp cc chc nng output Phn tch kh nng phn tn ca ng dng Hiu tham s no ca ng dng l hard-code v tham s no l c th thay i c da trn cc gi tr u vo. Tip n ta phi hiu cch phn tch u vo ca mt cng vic ln thnh mt tp cc u vo nh hn c th x l theo kiu phn tn v hiu cch tp hp kt qu u ra t thnh kt qu cui cng. Xc nh tnh ph hp ca ng dng Tnh thch hp ca ng dng thng c nh gi thng qua t l Compute Intensity (CI) c tnh nh sau: CI = 4*WorkUnitDuration (giay)/(InputSize(KB) + OutputSize(KB)) Trong WorkUnitDuration l khong thi gian thc hin mt n v cng vic. Thng thng, nu mt ng dng c th li ho c CI>1.0 th thng c cho l c th thc hin x l phn tn bng gii php li bn. Nhng i khi vn c mt s ngoi l. Mi trng tnh ton Desktop Grid c coi nh mt tu chn c chi ph thp v c th c dng thay cho cc ti nguyn tnh ton mi. Trong bt c trng hp no, vic thm

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 72

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

mt Desktop Grid vo mt t chc l mt thay i ng k i vi ton b mi trng tnh ton. Vn ho u tin l vn ho ca nhn vin, anh ta phi bit c vai tr ca PC m anh ta ang s dng c tm quan trong ra sao i vi thnh cng ca c t chc khi tham gia vo li. H phi c kin thc nht nh v Grid Client Executive khng thc hin nhng hnh ng sai lm dn n gy ra li trn PC ca h hoc ly nhim virus cng nh cc chng trnh gin ip trn my

3.1.6 Grid Server


Grid Server cung cp kh nng qun l/qun tr ton b n v cng vic, gn cc n v cng vic cho Grid Client, qun l/qun tr Grid Client. Ngoi ra, n cn c th cung cp mt s chc nng khc nh: Cc thao tc mc nhm cc Client (Client Group-level Operations): khi khch thc ca li tng ln s chia nhm nh qun l Data Caching: thi gian c yu cu truyn d liu i hoc n cc Client ng mt vai tr kh quan trng trong vic tnh ton t s CI Qun tr v lp lch mc cng vic (Job-level Scheduling and Administration): h tr nhiu mc u tin cng vic khc nhau theo kh nng chn la Client hoc nhm Client iu chnh v phn tch hiu nng (Performance Tuning and Analysis): cung cp cc bo co v d liu cn thit xc nh c nhng c trng hiu nng quan trng ca mi Grid Client cng nh ca c li Bo mt (Security): mi chc nng c nh ngha ring bit trn Grid Server, mi trng ngi dng nn bao gm c bo mt mc ngi dng Cc giao din h thng (System Interfaces): h tr mt tp cc giao din khc nhau cho nhiu chc nng ngi dng v chc nng qun tr khc nhau

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 73

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

3.2 Cluster Grids


Mt li cluster l dng n gin nht ca Grid, n cung cp dch v tnh ton cho mt nhm hoc mt phng. Kin trc li dng cluster gip cc i ho kh nng s dng cc ti nguyn v tng thng lng cho cng vic ca ngi dng thng qua vic o ha ti nguyn tnh ton. Phn ny ta i tm hiu v k thut Cluster Grid vi trng hp c th l cc Cluster Grid ca Sun [13].

3.2.1 Kin trc li Cluster


Kin trc ca li cluster c chia thnh 3 lp logic [13] khng phn cp bao gm: lp truy xut, lp qun l v lp tnh ton.

Hnh 3-2 Ba lp trong kin trc li cluster

Lp truy xut Lp ny cung cp cc dch v truy xut, xc minh quyn cho ngi dng li. Cc lnh truy xut thng thng nh telnet, rlogin, ftp, ssh truy nhp vo h thng v cc dnh v da trn nn Web. Lp qun l y l lp trung gian, chu trch nhim cung cp cc dch v chnh trong li cluster nh: DRM, phn mm chun on phn cng t ng, kim sot hiu nng ca h thngTrong DRM l dch v bt buc phi c ca mt li

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 74

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

cluster, ngoi ra lp ny cn c thm mt s nhim v khc nh: Dch v file; Dch v kha bn quyn; Qun l sao lu; Dch v ci t. Lp tnh ton Lp tnh ton cung cp sc mnh tnh ton cho li cluster. Cc cng vic trnh thng qua cc lp bn trn trong kin trc ny c xp lch chy trn mt hoc nhiu nt trong lp tnh ton. Cc nt trong lp tnh ton chy cc tin trnh pha khch hoc tc t ca phn mm DRM, v cc tc t gim st sc khe ca h thng. Lp tnh ton giao tip vi lp qun l thng qua vic nhn cc cng vic, bo co trng thi hon thnh cng vic v cc chi tit tnh ton. Lp tnh ton c th l s lai ghp ca nhiu loi my phc v, nhiu nn h iu hnh khc nhau. Ngoi ra lp ny cn c s khc nhau v nhm cc chc nng trong mi nt tnh ton v c cch mi nt ny c gn vo li.

3.2.2 B phn mm li cluster ca Sun


B phn mm li cluster ca Sun bao gm mt s cc thnh phn chnh l Sun Grid Engine (SGE) software v Sun HPC ClusterTools software [13].

Hnh 3-3 Ngn xp phn mm ca li Sun Cluster Grid

Sun Grid Engine - SGE Phn mm qun l ti nguyn phn tn SGE l thnh phn ct li, n cung cp tt c cc chc nng qun l ti nguyn truyn thng, chng hn nh xp hng theo l, cn bng ti, thng k ti khon cng vic, qun l cc ti nguyn hoc ngi dng c bit, treo hoc phc hi cc cng vic v qun l ti nguyn trn ton cluster.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 75

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Host
Master

M t
X l cc yu cu n t pha user (tin trnh Qmaster), thc hin xp lch cng vic v gi chng ti cc host thc hin (tin trnh Schedd). C mt Master Host ch duy nht trong mi phn mm SGE.

Shadow master Execution

Lin tc gim st master host chnh (tin trnh Shadowd) mt cch trong sut v t ng, n s chin quyn kim sot ca master host ny trong trng hp n b li L host thch hp nht thc thi cc cng vic. Tin trnh Execd nhn cng vic t Qmaster v sinh ra mt tin trnh dn thc hin cng vic ny trn local. Thng bo thng tin ngc li cho Qmaster.

Submit Admin

Cc host ny c cu hnh trnh, gim st v qun tr cc cng vic. Khng c tin trnh ngm no c yu cu. L cc host to thay i cu hnh ca cluster, chng hn nh l thay i cc tham s DRM nh thm cc nt mi hoc user mi. Khng c tin trnh ngm no c yu cu cho cc host dng ny.

Lung cng vic Tt c cc cng vic c trnh ti SGE master v c lu trong spooling area cho n khi b xp lch quyt nh xem cng vic no c php thc hin. Phn mm SGE ni cc ti nguyn c th ti cc yu cu cng vic. Ngay khi tm c mt ti nguyn thch hp cho cng vic mi, SGE chuyn cng vic ny thnh cng vic u tin cao nht. Sau b xp lch SGE t chng vo cc my c th thc hin.

Hnh 3-4 Lung cng vic trong Sun Grid Engine

1. Cng vic trnh t mt submit host, yu cu c gi n master host.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 76

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

2. Master host xc nh host s thc thi cng vic, nh gi mc cng vic, kim tra giy php bn quyn v nh gi cc yu cu cng vic khc. 3. Master host gi cng vic ti execution host c chn. Execution host lu li cng vic trong mt job information database v bt u mt tin trnh dn, v i n hon thnh. 4. Khi cng vic c hon thnh, tin trnh dn tr v thng tin cng vic v execution host sau s thng bo vic hon thnh cng vic ti master host, ng thi xa cng vic khi job information database. Master host cp nht job accountisang database thnh hon thnh cng vic. Sun HPC ClusterTools Software Sun HPC ClusterTools software cung cp mi trng phn mm tch hp cho cc ng dng MPI. Phn mm Sun HPC ClusterTools l cc ng dng song song lai rt linh hot v an ton, n c trn gia k thut tuyn on v song song to ra cc ng dng s dng MPI giao tip gia cc tin trnh. Ngoi cc phn mm chnh trn, cn c thm cc phn mm khc nh: Solaris JumpStart v Solaris Flash Software L cng c tng tc v khi to t ng trong mi trng h iu hnh Sun. Solaris JumpStart l mt chng trnh ci t t ng mi trng h iu hnh Solaris ti bt c v tr no trn mng. Solaris Flash bt gi mt nh h thng, bao gm mi trng h iu hnh Solaris, hu ht cc ng dng, v thng tin cu hnh h thng thng qua vic s dng flarcreate command (flash tool) gip qun tr vin c th lp li cc cu hnh my ch trn nhiu my ch khc. Phn mm Qun l tp trung - SunMC L mt phn mm qun l h thng gip nh qun tr h thng thc hin vic qun l h thng t xa, iu khin v cch l phn cng khi cc li phn mm trn cc h thng Sun. N c 3 thnh phn: Console cung cp giao din ngi dng di dng GUI hoc giao din trnh duyt web. My ch SunMC gim st v iu khin cc thc th c qun l bng cch gi yu cu ti cc tc t tr ng trn cc nt c qun l. Cc tc t l cc thnh phn phn mm da trn giao thc SNMP, n
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 77

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

thu thp d liu qun l nhn danh my ch, sau gi v cho my ch v chy cc tin trnh. Phn mm ph chun kim tra tnh ph hp Phn mm SunVTS l mt cng c chun on, kim tra v ph chun phn cng Sun thng qua vic xc minh tnh lin thng v chc nng ca hu ht cc thit b phn cng trong h thng Sun.

3.2.3 Yu cu thit k
Trong phn ny, cc thng tin c trnh by i su vo cc gii thiu v cc yu cu cho mt thit kt li cluster ca Sun. Lp truy xut mc ti thiu, lp truy xut phi c th h tr vic ng nhp, h tr cc phin telnet v trnh cng vic. Cc cng vic khng tng tc c thc hin thng qua h tng hng i bng cch trnh mt mt batch shell script n gin. Nu ch mt h thng c yu cu h tr hng trm ng nhp t xa, h thng ny cn c m bo chc chn mt s ti nguyn dnh ring. Vic truy xut ca ngi qun tr vo li c cung cp thng qua phn mm Sun Grid Engine v SunMC qua mt giao din dng lnh hoc mt giao din ha. Nu cc truy nhp da trn web phi c thc hin, th my ch phi c c lng ph hp thc hin c yu cu. Lp qun l i vi cc li cluster nh, cung cp dch v ti thiu (ch c DRM), th mt b x l n l . Nu mt my a b x l c trin khai trong li cluster vi vai tr l SGE master th n c th dng cung cp cc dch v khc nh Sun MC server, NFS and backup server ng thi cng c th h tr cc tc v tnh ton. Cc phn t khc nhau trong mt li c th yu cu mt h thng file chia s l: phn mm Sun Grid Engine, th mc c nhn ca ngi dng, cp Sun HPC

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 78

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

ClusterTools 4.0 , cc my ch ng dng, my ch kim tra kho bn quyn v cc my ch ci t. Lp tnh ton Vic quyt nh chn phn cng no cho lp tnh ton ph thuc ch yu vo hiu nng cc i c th thu c trn cng lng tin b ra. Mt s im nn ch khi chn l:

- Thng lng: s lng cc nt n trong mng. Mong mun cc i ho s


cc b x l trn mi volume hn l hiu nng trn tng b x l n Tnh song song cao: ph thuc vo cc thuc tnh ca ng dng, hoc mt lng nh nt SMP ln hoc mt lng ln cc nt yu c h tr bi mt mi trng cluster thi gian chy (CRRE) s l thch hp Cc yu cu v b nh v b nh m: mt s ng dng thu c kt qu rt kh quan t cc b x l c b nh m ln. Trong trng hp nh th mc ch l nhm cc i ho tnh cn xng ca d liu ang active, d liu ny dc lu trong cc b nh m nhanh, gip cho vic truy nhp d liu tn t thi gian hn l truy nhp vo b nh lu tr thng thng

3.2.4 Phn cng mng


C 3 kiu lin kt ni c th c xem xt: cc kt ni ni tip, kt ni qua Ethernet v cc kt ni c tr c bit thp Ni tip Mt mng ni tip cho php nh qun tr h thng thc hin vic truy nhp t mn hnh console ti tt c cc my trong cluster. i vi cc mi trng ln, cho php hon thnh hu ht cc iu khin trn ton b h thng thng qua vic truy nhp t xa. Vic s dng mt my trm tp trung gip nh qun tr truy nhp ti cc cng console thng qua mng Ethernet. Ethernet Ti ca mng trong mt li cluster l do cc giao tip chuyn thng ip MPI hoc PVM ch chy. Lu lng NFS t rt nhiu ngun nh cc truy

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 79

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

nhp cc dch v li, cc file spool, cc th mc ngi dng ang c truy nhp ti thi im chy, cc giao tip Sun Grid Engine v vic truyn d liu do cc thao tc sao lu v khi to. Ti c to ra bi cc lu lng ny c x l rt tt trn mng Ethernet chun hoc cc mng Ethernet gigabit. Cc lin kt ni c bit Mng Myrinet c kt hp t nhiu giao din, kt ni v nhiu chuyn mch. i vi cc my Sun, giao din PCI c dng vi cc driver c thay i dng trong kin trc Sun, c th ly t Mycricom. Mycricom h tr mt module giao thc c th ti c (PM) cho phn mm Sun HPC ClusterTools 4.0 software. Cc PMs c s dng bi phn mm Sun HPC ClusterTools mang lu lng gia cc tin trnh khai thc tr thp v t l d liu cao ca Myrinet. Lin kt ni Myrinet c th lm gim tr thng qua vic xp th t theo ln v c th a ra lng bng thng trung bnh cao hn cho cc cluster tnh ton

3.2.5 Qun l mt Cluster Grid


Trong nhiu mi trng li cluster, mt lng ln cc h thng c cu hnh tng t nhau ang tn ti, n gip ti thiu ho thi gian qun tr bng cch ly nhng im thun tin ca cc cng c ci t v chun on t ng. SunVTS Software Phn mm SunVTS c th c dng thc hin kim tra phn cng trn cc nt mi trc khi cc nt ny c y vo li s dng. N cng c th c dng cho cc thao tc kim tra phn cng trong khi chy, hoc dng iu tra nghin cu mt nt ang thao tc. Cluster Console Manager Trong mi trng tnh ton dng compute farm environment, cc tc v yu cu vic nhp lnh nh nhau ti nhiu clients, chng hn nh ng dn ci t v cc bn update phn mm, c th c thc hin bng cch s dng b cng c Cluster Console c cung cp km vi phn mm Sun HPC ClusterTools. Phn mm The Cluster Console Manager (CCM) gip bn c th y cc cu lnh ti cc
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 80

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

nt trong li cng mt lc thng qua mt giao din ho. CCM cp n 3 m hnh thc hin: console, ctelnet v crlogin Solaris Jumpstart v Flash Software Phn mm Solaris Jumpstart thng c dng thc hin cc ci t t nht l cho lp tnh ton ca li cluster. Nu cc host mi c thm vo trong lp ny, hoc c cc nt cn c ci t li th mt mi trng Jumpstart c cu hnh tt s lm gim ng k thi gian qun l cho cc tc v kiu ny Sun Grid Engine Software Kh nng ng nhp v nh ti khon ca Sun Grid Engine cho php nh qun tr gi c vt cc cng vic tnh ton ang chy trn li cluster. Phn mm ny lu mt bn ghi cho mi cng vic c chy thng qua SGE, bao gm c nhng thng tin chi tit nh thi gian bt u chy, khong thi gian thc hin, thng tin v CPU, b nh v I/O, cc thng k ngi dng v trong trng hp SGEEE, n cn ghi li c thng tin v phng v d n.

3.3 Kt ni Cluster vo Grid


thc hin cc bi ton song song ngi ta s dng mt h thng gi l h thng tnh ton song song phn cm. l mt h thng my tnh cc b bao gm mt tp cc my tnh c lp v mt mng lin kt gia cc my tnh. Mt h thng tnh ton song song phn cm thng mang tnh cht cc, c qun l tp trung nh l mt h thng thng nht. H thng tnh ton song song l h thng c b nh phn tn, truyn thng gia cc nt trong qu trnh tnh ton thng l truyn thng ip. Cc th vin truyn thng ip dnh cho h thng b hin nay c th k n l MPI, PVM, Active Message v CMMD (Connected Machine Message...), trong MPI vi nhiu u im c th coi l s la chn tt cho cc clusster. Trong khi cc th vin v cc ngn ng lp trnh song song to cho ngi lp trnh mi trng lp trnh trn h thng tnh ton song song, th cc phn mm qun l ti nguyn v phn ti li to ra mt mi trng tnh ton hiu qu v n

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 81

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

nh. Mt s phn mm phn ti v qun l ti nguyn thng c s dng trong cc h thng tnh ton song song hin nay l: PBS (Portable Batch System), CODINE v CONDOR. y, ta chn PBS lm phn mm phn ti v qun l ti nguyn trong Cluster.

3.3.1 S cn thit ca vic kt ni grid v cluster


Trong Chng 1 ta c s so snh gia Cluster v Grid, cng nh a ra im ging v khc gia chng. C nhiu ng dng li rt cn c gii quyt trong phm vi mt t chc nh cluster ch khng cn c phn b trn mt tp ti nguyn rng phn tn v mt a l nh li. Ngoi ra cluster cn c cc c im m mt li thng khng c nh: N thuc v mt t chc, cc ti nguyn l cc my tnh ng nht, phn b trn mt gii hn a l rt hp, c lin kt vi nhau bi cc mng LAN tc cao (c vi trm Mbits Gbits) do vy c tin cy rt cao Trong cluster cc vn v an ninh khng c coi trng. Vn ch yu l lm sao c c hiu nng cao v khng cn thit phi hy sinh mt phn hiu nng v cc th tc an ninh. Trong cluster ngi ta c th lm mi th i ly hiu nng Cui cng cng cn phi ni thm rng tnh ton li khng phi l cha kha vn nng dng gii quyt mi vn . N c dng b tr ch khng phi l thay th hon ton cc cng ngh tnh ton hin ti V vy, vic kt ni li vi cluster ng vai tr cc k quan trng, n s em li mt sc mnh tnh ton v cng to ln. Vi s kt hp ny ta s tn dng c sc mnh sn c ca cc cluster v s dng tnh ton li nh l phng tin phi hp cc cluster ti cc min a l khc nhau cng gii quyt mt bi ton.

3.3.2 Kt ni Globus-based Grid v PBS-based Cluster


3.3.2.1 GRAM Trong chng 2 ta ni kh r v thnh phn GRAM trong b cng c Globus Toolkit. Do , trong mc ny ta ch tp trung m t xem GRAM ng vai
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 82

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

tr nh th no? N lm g? V lm th no? khi ta thc hin kt ni mt Cluster vo li. Ta c th coi GRAM nh l thnh phn trung gian, vi cc ng dng v dch v mc cao hn pha trn v cc c ch truy nhp, iu khin a phng pha di. C hai pha ch cn lm vic vi GRAM, bi vy s nh hng ln nhau, cc hm API, v cc giao thc cn s dng c gim i nhiu. GRAM khng cung cp kh nng lp lch cng nh mi gii ti nguyn. GRAM cng khng cung cp cc c trng v k ton v lp ha n. Nhng c trng c cung cp bi cc c ch qun l a phng khc nh h thng hng i hoc b lp lch. GRAM lm vic nh mt giao tip tru tng cho cc ngun ti nguyn khng ng nht trn li, c th n chu trch nhim: Phn b ti nguyn cho cc cng vic trnh cng vic, chy ng dng trn cc my t xa v nhn kt qu tr v Qun l trng thi cng vic v cc tin trnh Nh vy c th thy c GRAM ng vai tr nh thnh phn trung gian lm nhim v giao tip gia mi trng tnh ton ton cc vi cc thnh phn cc b c kh nng to ra cc tin trnh chy. V vy, im quan trong ca GRAM l, n phi p ng vic giao tip vi cc b qun l ti nguyn a phng chng hn nh: Condor, Fork, PBS, LSF, NQE, v.v... trong phn mm qun l ti nguyn v phn ti PBS c s dng rng ri nht. 3.3.2.2 PBS Gii thiu PBS PBS (Portable Batch System) [15] [14] l h thng phn ti v qun l ti nguyn, dng trong cc h thng tnh ton song song phn cm. Nhim v chnh ca h thng PBS l cung cp cc kh nng v khi to v lp lch cho vic thc thi cc cng vic v sp xp cc cng vic trn cc my trm. H thng cng cho php xc nh trc cc ngun ti nguyn s c s dng khi thc thi cng vic v m bo chc chn dng ng ti nguyn yu cu.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 83

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

PBS m nhn giao tip gia Server v cc my trm, my ch thng qua PBS qun l hot ng ca cc my trm. PBS bao gm ba mun chnh: PBS_Server, PBS_Scheduler, PBS_MOM. PBS cung cp rt nhiu chc nng cho cc h thng tnh ton hiu nng cao. Sau y l mt s chc nng quan trng nht ca PBS: 1. Chia s ti nguyn: cung cp mt c ch lp lch cho cc cng vic mt cch trong sut trn bt k mt h thng PBS no, bi bt k ngi s dng no c thm quyn. Cng vic c th c yu cu t mt my client bt k, cc b hay t xa. 2. Giao din ho: gip ngi s dng chuyn cc yu cu tnh ton ch l hoc ch tng tc, truy vn cc cng vic, hng i cng vic v tnh trng h thng; v theo di s tin trin ca cc cng vic. Ngoi ra PBS cng h tr giao din s dng ch dng lnh cho nhng ngi s dng chuyn nghip. 3. C ch bo mt: cho php qun tr thit lp hoc hu b vic truy cp n PBS ca mt ngi s dng, mt nhm ngi, mt my hoc mt mng no . 4. Nht k: cho php ghi li tt c cc hot ng trn ca h thng theo tng ngi, tng nhm ngi hoc tng my. 5. T ng chuyn tp: l c ch sao chp cc tp cn thit cho vic thc hin mt cng vic trn cc my trm tnh ton. Cc tp cn chuyn c th l cc tp d liu hoc cc tp thc thi. 6. H tr cc cng vic song song: cho php hot ng cng vi cc th vin lp trnh song song nh MPICH, MPI-LAM. Cc chng trnh c th c lp lch chy trn cc h a b x l hoc trn cc h thng a my tnh. 7. H tr tnh ton li: cung cp cng ngh siu tnh ton (meta-computing) v tnh ton li, bao gm vic h tr cho GGT (Globus Grid Toolkit). Vic h tr ny ch c thc hin trong phin bn thng mi PBS Pro. 8. Giao din lp trnh c PBS: cung cp ngi lp trnh c th t vit cc lnh mi cho PBS, tch hp PBS vo cc ng dng ca h hoc ci t cc c ch lp lch ring.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 84

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

9. T ng phn ti: l c ch cho php phn ti cc cng vic trn cc ti nguyn ca h thng Cluster. Cc thnh phn c bn ca PBS

Hnh 3-5 Cc thnh phn trong PBS

1. M-un qun l cng vic (Job Server): l thnh phn trung tm ca PBS. Tt c cc thnh phn khc ca PBS giao tip vi module qun l cng vic qua mt a ch IP duy nht. Chc nng ca module qun l cng vic l cung cp cc dch v, nh l nhn/to ra cc cng vic l, thay i cc cng vic, bo v cng vic khi c s c h thng, v thc hin cng vic. Module ny qun l mt hoc nhiu hng i cng vic, mt cng vic phi thuc vo mt hng i. Cc hng i c server qun l bi mt tp thuc tnh nh kiu, ti nguyn, tn,... 2. Module thc hin cng vic (PBS Mom): c chia nh thnh cc thnh phn chc nng sau: Job Executor (JE), l tin trnh ngm chu trch nhim thc thi cng vic. Khi thc hin, JE nhn mt bn copy cng vic t Server, x l cng vic, v sau c th s m nhn lun cng vic tr kt qu v cho user nu c Server yu cu. Mi mt tin trnh ngm JE chy trn mt my tnh trm trong mng; Resource Monitor (RM), l b phn chu trch nhim theo di, kim tra cc ngun ti nguyn ca h thng v bo cho b lp lch Scheduler. 3. M-un qun l ti nguyn (Job Scheduler): l thnh phn chu trch nhim xc nh xem cng vic no s chy v chy trn ti nguyn no.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 85

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hnh 3-6 C ch hot ng ca PBS

Tng tc gia cc module trn c th c m t nh sau: 1. S kin kch hot module qun l cng vic bt u mt vng lp lch. 2. Module qun l cng vic gi lnh lp lch cho b lp lch. 3. Module lp lch yu cu thng tin v ti nguyn t module qun l ti nguyn. 4. Module qun l ti nguyn tr v cc thng tin yu cu. 5. B lp lch yu cu cc thng tin t module qun l cng vic. 6. Module qun l cng vic gi thng tin trng thi ca cc cng vic ti b lp lch, b lp lch a ra chnh sch quyt nh vic thc hin cng vic. 7. B lp lch gi yu cu thc hin cng vic n module qun l cng vic. Module qun l cng vic gi cng vic n cc module thc thi cng vic thc hin. trnh cng vic trong PBS
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 86

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Ngi s dng c th thc hin lnh trc tip trn mn hnh console truyn cng vic cng nh cc yu cu ca mnh ti Server. Cc shell [14] chun ca linux c th gip ta bin dch m lnh thc hin cc job. Lnh qsub l lnh dng trnh cng vic mnh mun thc hin ln server. Cu trc lnh nh sau:
% qsub [option] mysubrun

trong option l cc la chn khi thc hin cng vic v mysubrun l cng vic cn thc hin. Cc tu chn dng m t ti nguyn; trong v d sau:
% qsub -l ncpus=16, walltime=4:0:0 mysubrun

ncpus l s CPU s dng thc hin cng vic walltime l thi gian d kin thc hin cng vic Cc cng vic cng c th c miu t trong mt file c t, trong file thc hin m t tt c cc thng s ti nguyn cn s dng, u vo, u ra Khi vic trnh cng vic trong PBS s c thc hin bng mt lnh n gin nh sau:
% qsub mysubrun

Trong mysubrun l file m t cng vic. Nh vy c th a mt cng vic t trn li xung cho PBS_Server th ta phi lm thng sut qu trnh chuyn i cng vic t file c t RSL ca Globus sang file miu t cng vic ca PBS, vn ny s c cp trong phn tip theo ca n. 3.3.2.3 Cc yu cu i vi thnh phn kt ni Coi Cluster l mt ti nguyn li nh cc ti nguyn khc, n cung cp mt giao din truy cp thng nht cho cc thnh vin li khc s dng. Globus toolkit cung cp cc modul cho php cc thnh phn ca globus giao tip c vi hu ht cc h qun l cluster ph bin nh PBS, LSF, Condor..(Chng 2). Nh vy, mt cluster tr thnh mt ti nguyn li ta s ci Globus Toolkit ln nt ch ca cluster, thm mt m-un thc hin giao tip vi nt ch ca cluster v thc hin cc cu hnh cn thit cho Globus c th giao tip c vi thnh phn qun l ti nguyn a phng ca cluster.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 87

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Thng thng trn li lun tn ti hai loi cng vic c s khc nhau r rt. Th nht, l nhng cng vic thng thng c thc hin qua dch v li, ngi s dng ch vic a ra mt b d liu u vo qua GridPortal v nhn kt qu tr v. Loi cng vic th hai, l nhng cng vic c tnh chuyn mn cao, i hi khi lng tnh ton ln, i vi nhng cng vic ny th ngi s dng phi bit r v ti nguyn s thc hin cng vic (siu my tnh hay Cluster), vit chng trnh thc hin i vi loi ti nguyn tng ng, sau trnh cng vic thng qua giao din GridPortal, li s tm ra ti nguyn tng ng da trn thi gian thc hin v s nt tnh ton, li thc hin truyn cng vic cho cluster cng vi cc tham s thc hin, cluster thc hin cng vic v sau tr li kt qu cho li sau li mi thc hin tr kt qu v cho ngi dng. Vic kt ni gia li da trn GT v cluster da trn PBS thc cht l thc hin giao tip gia thnh phn GRAM trong li v thnh phn PBS Server trn nt ch ca Cluster thng qua thnh phn Globus-scheduler-pbs.
GRAM: cung cp mt giao din chun cho vic yu cu v s dng ti

nguyn h thng t xa thc hin cc cng vic. i vi cc cng vic ch yu cu my tnh n hay siu my tnh th GRAM thc hin cng vic ngay trn my tnh . i vi nhng cng vic yu cu thc hin trn Cluster th GRAM s chuyn ni dung cng vic cho thnh phn Globus-scheduler-pbs.
Globus-scheduler-pbs: thnh phn ny c nhim v chuyn ni dung cng

vic sang dng m PBS_Server c th hiu c.


PBS_Server: l nt ch thc hin lp lch (a phng), qun l v thc hin

phn chia cng vic thc hin trn cc nt con. Nh vy c th thy vic kt ni cluster vo li khng c ngha l chng ta phi qun l n tng my n trong cluster m qun l cluster mt cch tng th. Li s qun l c cm my tnh trong cluster nh mt ti nguyn n nht thng qua b qun l ti nguyn a phng PBS. Thnh phn Globus Scheduler PBS

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 88

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Globus Toolkit c cung cp cho chng ta mt s dch v giao tip vi PBS_Server. Nhng dch v ny nm trong gi ci t scheduler-pbs-3.2src_bundle.tar.gz. Sau khi ci t xong th vic tip theo l cu hnh cho h thng. y thc s l mt vn khng n gin. Chi tit v vic cu hnh v xy dng mt shell script cu hnh t ng s c trnh by trong chng 4. Khi vic cu hnh hon tt, thc hin khi ng trnh cha (globus-start-container) th trong cc dch v ca Globus Toolkit s c thm hai dch v:
http://hostname:8080/ogsa/services/base/gram/PbsManagedJobFac toryService http://hostname:8080/ogsa/services/base/gram/MasterPbsManaged JobFactoryService

Dch v MasterPbsManagedJobFactoryService ng vai tr l mt giao din trnh cng vic, trc tip nhn yu cu cng vic t pha client. i vi tng cng vic dch v MasterPbsManagedJobFactoryService s gi n dch v PbsManagedJobFactoryService s sinh ra cc th hin (instance) tng ng trc tip lm vic vi client. Hot ng ca Globus Scheduler Pbs Nh trnh by trn, Globus Scheduler Pbs gm 4 thnh phn (mjs-pbs, mmjfs-pbs, rips-pbs-provider, globus-job-manager-pbs) gip cho vic hot ng trnh cng vic ln PBS_Server thc hin c. Giao tip gia cc thnh phn ny c m t trong hnh sau:

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 89

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Hnh 3-7 Hot ng ca Globus Scheduler Pbs

1. Ngi dng c giy chng nhn c cp bi nh thm quyn, c thc hin trn ti nguyn li (Quy nh trong Grid-mapfile) thc hin trnh cng vic vi trnh qun l cng vic MMJFS-PBS. 2. Trnh qun l cng vic MMJFS-PBS thc hin sinh ra mt th hin qun l cng vic phc v cho ngi dng MJS-PBS. 3. Trnh qun l cng vic MJS-PBS thc hin ly file c t cng vic ca ngi dng v chuyn cho thnh phn bin dch Globus-job-manager-pbs. 4. Thnh phn Globus-job-manager-pbs thc hin chuyn file c t ngi dng thnh cng vic cho PBS_Server. 5. Trnh qun l cng vic ngi dng yu cu PBS_Server thc hin cng vic. 6. PBS_Server thc hin cng vic v a kt qu tr v cho thnh phn qun l thng tin ti nguyn RIPS-PBS. 7. RIPS-PBS tr kt qu v cho trnh qun l cng vic ngi dng. 8. Trnh qun l cng vic ngi dng MJS-PBS tr kt qu v cho ngi s dng.

3.4 Kt chng
Chng ny a ra cc khi nim, kha cnh k thut v c ch trin khai, kt ni mt s dng li ang c trin khai ti cc trung tm nghin cu v tnh ton li nc ta. Phn kin thc c trnh by trong chng ny s lm tin kt hp vi Chng 1 v Chng 2 tin hnh xy dng mt li c th trong chng tip theo.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 90

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

CHNG 4. TRIN KHAI TH NGHIM


4.1 Lp bn thit k kin trc li
Vic xy dng mt h thng tnh ton li khng ch n thun l ci t cc phn mm, cc ng dng trn cc ti nguyn kt ni sn c m phi tun theo quy trnh quy hoch, thit k mt cch bi bn, chuyn nghip. V c bn, vic thit k li bao gm mt s bc chnh sau [2]: Kho st yu cu nghip v ca t chc Quy hoch h tng mng sn c La chn loi li La chn topo li La chn h tng (h tng phn mm, phn cng, cc chun m, c ch an ton, bo mt cho li) Chnh sch qun tr li.. Trin khai mt li Ngoi ra, khi thit k mt li th kin trc li cn phi tho mn c cc yu cu c bn nh: tnh an ton, tnh sn sng v hiu sut cao. Kho st yu cu Trong thit k kin trc li, bc kho st yu cu l bc rt quan trng, kt qa ca n s nh hng trc tip n tt c cc bc tip theo. Thng trong bc ny, ngi thit k phi a ra c nhng thng tin cn thit nh mc ch ca vic xy dng li l g? Li c nhng yu cu g v nghip v, h tng v ng dng? Cui cng cn phi tinh chnh v xem xt li xem nhng yu cu ny c ph hp vi h tng phn cng v phn mm cng nh ngun nhn lc sn c hay khng. Sau khi xc nh c cc yu cu t ra cho li, ng thi nhn thy rng h tng hin c l p ng xy dng li, ta phi tin hnh qui hoch li h tng mng phc v vic trin khai li c thun tin v khoa hc. La chn loi li

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 91

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Thng th tu theo yu cu nghip v ca li m ta s chn ra loi li thch hp. Nhng nhn chung, cc li c trin khai thng ri vo hai dng sau: Li tnh ton (Computational Grid): dng khi mun kt hp v tng sc mnh tnh ton ca cc h thng tnh ton phn tn. c trng c bn nht ca li tnh ton l da trn s tch hp cc n v c sc mnh tnh ton nhng kh nng lu tr khng cao. Li ch ca cc li tnh ton l gim thiu chi ph s hu v rt ngn thi gian pht trin. N c dng cho mc ch nng cao hiu sut s dng cc h thng my tnh v h thng cc ng dng sn c v dng gii cc bi ton khoa hc, cc bi ton m phng, cc bi ton nh gi c hi u Li d liu (Data Grid): ch yu dng vo vic cung cp kh nng truy cp n cc ngun d liu hn hp, phn tn v bo mt cho cc thc th tham gia li. Li d liu c th c hiu nh cc c s d liu lin hp, n gi vai tr lin kt cc d liu ri rc thnh mt c s d liu o thng nht v ngi dng c th truy cp c s d liu ny qua mt giao din duy nht. Li d liu thch hp vi cc t chc, cc doanh nghip c nhu cu m rng kh nng khai ph, s dng d liu trn phm vi rng nhm ti u ho vic s dng h tng thit b lu tr sn c v gim thiu ho s phc tp trong vic qun l d liu. La chn topo li Hin nay khi trin khai mt li, ngi thit k thng la chn mt trong s cc topo li thng dng sau [2]: Intragrid: thng c trin khai trong cc t chc hoc doanh nghip. Da trn kin trc mng LAN hoc mng Intranet dng ring ca t chc, doanh nghip . N c: Bng thng v mc sn sng cao; C ch an ton, bo mt ring r; Mi trng tc nghip c lp. Topo li dng ny rt ph hp khi trin khai cc li tnh ton hoc li d liu. Extragrid: c thit lp da trn hai hoc nhiu Intragrid. c trng ca loi topo ny l: C ch an ton, bo mt phn tn; C s tham gia ca nhiu t chc doanh nghip; Da trn mng WAN. Extragrid thch hp vi cc t chc

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 92

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

mun xy dng kt ni mng vi cc i tc ca mnh (B2B) nhm chia s ti nguyn, d liu da trn s tin tng ln nhau. Intergrid: xy dng trn mng WAN hoc Internet v c s dng bi cc cng ty cng ngh, tp on cng nghip, hoc nh sn xut cng nghip. c trng ca Intergrid l: C s tham gia ca nhiu t chc; Kt ni nhiu i tc; Kt ni nhiu mng lin kt; C ch an ninh phc tp, phn tn. La chn h tng li Khi nim h tng y bao gm c phn cng vt l v cc phn mm c s dng kt ni cc my tnh ca mt li li vi nhau. H tng ny cung cp cc dch v cho vic kt ni, m bo an ton v qun tr. H tng bo mt: gii php bo mt hay c p dng cho cc h thng tnh ton li l s dng tng la. Gii php ny bo v c cc my ch trong li trnh khi nhng tn cng t bn ngoi v to lp thm mt hng ro bo v, ngn cn nhng truy nhp khng mong mun t ngi s dng bn ngoi ln bn trong. H tng mng: phi cung cp bng thng ln cho cc ng dng v dch v trn n. H tng mng cn phi thng xuyn c bo tr, tu chnh nhm m bo hiu sut hot ng v tnh sn sng cao nht. H tng lu tr: y l mt trong nhng sc mnh v tn ca h thng tnh ton li. Khi thit k h thng, ngi thit k phi tr li c cc cu hi nh: cch thc no dng bo m an ton cho cc thit b lu tr, cch thc no dng trong nhn bn d liu, v lm sao qun l cc thit b ny mt cch hiu qu nht. Mc tiu cui cng ca vic thit k cc li l phi m bo s sn sng ca d liu cho cc ti nguyn v ngi dng khi c yu cu. La chn tp cc dch v c bn cho li Tp cc dch v ny gm cc thnh phn: B cung cp cc tnh nng bo mt; Tp cc thnh phn c bn cu GT nh Grid clients; CA; GateKeepers; MDS (GRIS/GIIS); Cc chng ch s cho vic xc thc; B lp lch cng vic c bn; Cc thit b h tng thng dng nh mng qun tr, tng la, h thng pht hin truy cp tri php; Mng cc thit b lu tr
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 93

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Qun tr li tnh ton i vi ngi qun tr li, cn phi nm r cc sn phm lin quan v cc tnh nng ca h thng tnh ton li. Cc sn phn lin quan n h thng tnh ton li bao gm: Phn mm tng trung gian; H thng gim st hiu sut li; Grid portals; Mi trng lp trnh; B lp lch; H thng pht trin v kim th li. Cc tnh nng ca h thng tnh ton li: Qun l dch v; Truyn thng dch v; Qun l chnh sch; La chn v trin khai cc phn mm trung gian, la chn cc ng dng, tr kt qu v cho cc ng dng; iu khin dch v; SLAs; Qun l hiu sut/ti khon/li; an ton v bo mt. Trin khai mt li Mt li tnh ton n gin c th c xy dng da trn mt th vin lp trnh h tr cc tnh nng tnh ton li, nh s dng cc tnh nng c cung cp bi b cng c Globus Toolkit. Tuy nhin, khi h thng c m rng v tr nn phc tp, ta cn phi tip cn theo phng thc khc. u tin l bc thip lp h tng cho mt li tnh ton n gin da trn Globus Toolkit bao gm cc ci t trn my ch v cc my trm nh: H iu hnh, cc th vin, b cng c GT; Thit lp cc dch v, n v chng thc, b qun l ti nguyn Tip theo l cc bc thit lp ng dng, dch v cho h thng tnh ton li va c ci t da trn Globus Toolkit hoc bt c b cng c no da trn chun OGSA bao gm: nh ngha cc dch v ca li bng ngn ng WSDL; Sinh m Java cho vic nh ngha WSDL; Vit m ci t pha my ch, my khch; Trin khai v th nghim cc dch v li s dng trnh duyt OGSA. Sau y l danh sch cng vic cn thc hin: D kin v quy nh ti nguyn ca li; La chn phn mm trung gian; La chn cc cng c qun tr, xy dng quy trnh qun tr i vi phn mm trung gian; Ci t, th nghim, tu chnh v pht hnh cc phn mm, ng dng ca li; Thit lp b iu khin dch v, cc cng c, quy trnh kim sot, iu khin an ninh; Thit lp cc cng c qun l li, hiu nng, h tng mang tnh tch hp.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 94

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Cc yu cu c bn ca li sau khi thit lp Tnh an ton: li sau khi trin khai phi gim thiu ri ro n mc chp nhn c, ngha l trong bn thit k cn phi vch r cc yu cu v an ton, bo mt ca h thng, ng thi phi a ra cc cng c v cc bin php nhn tng tnh an ton v bo mt cho h thng. Mc an ton v bo mt ca h thng tnh ton li cn ph thuc vo topo li v d liu cng nh ti nguyn cn bo mt. Mc bo mt cng khc nhau gia cc li c dng trong cc cng ty c tnh cht nghip v khc nhau. Bn thn cc m hnh an ton v bo mt cho mt h thng tnh ton li cng c nhng ri ro tim n. Do , khi thit k mt m hnh bo mt cho li cng cn tnh n mt vi phng n d phng. Ngoi ra, nn c tng la, h thng pht hin truy cp tri php, cc gii php phng chng virus, Tnh sn sng: c hiu mt cch n gin l phn trm thi gian m h thng p ng tr li cc yu cu ca thc th tham gia li. Trong giai on thit k, cng cn a ra mc sn sng ca h thng. Da trn mc ny, ngi thit k xc nh c ti nguyn d tha cn c ca h thng v d phng c cc phng thc x l s c hng hc khi cn thit. Cui cng, da trn nhng kin thc thu c t vic nghin cu v tnh ton li v cc cng c lin quan c trnh by trong cc chng trc. ng thi vi s gip ca thy hng dn v cc bn trn trung tm tnh ton hiu nng cao. Ti v cc bn trn trung tm tin hnh ci t th nghim mt li vi mc ch hiu su hn cc vn nghin cu ng thi to ra mt li dng phc v vic th nghim cc nghin cu tnh ton li sau ny.

4.2 Ci t mt Grid
Phn ny lit k cc bc v gii thch ngn gn qu trnh ci t h thng c s h tng cho mt li da trn b Globus Toolkit 4.

4.2.1 Cu hnh phn cng ca li


Topo li trn trung tm c b tr nh hnh 4.1, h thng bao gm: Mt nt chnh l Gridsv-172.16.14.130 v Nm nt tnh ton l: Alpha-172.16.14.182;
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 95

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Beta-172.16.14.144;

Gama-172.16.14.133;

Delta-172.16.14.175;

Omega-

172.16.14.202. Ti thi im ci t th nghim tt c cc my tnh ny u l PenIII800, v c kt ni vi nhau qua mng LAN.

Hnh 4-1 M hnh li c trin khai th nghim

4.2.2 Yu cu trc khi ci t


Cc nt li l cc my tnh PC Intel Pentium ni mng vi nhau qua Fast Ethernet, ci Linux Fedora Core 3 v tt c cc gi phn mm cn thit [5] cho vic ci t Globus Toolkit. cc nt trong mng c th nhn v lin lc c vi nhau th file /etc/hosts trn tt c cc nt li phi c cng ni dung v cha y 6 a ch IP cng nh tn ca cc my trc tip tham gia li.

4.2.3 Ci t cho nt chnh


Trc khi thc hin ci t cho nt chnh, ta to mt user "globus", y l user thc hin cng vic qun tr Globus Toolkit. Phi ng nhp vo user root (to khi ci t Fedora Core 3) thc hin lnh to user, v ch root mi c quyn ghi vo cc file h thng.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 96

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

[root]#adduser -p "globus" -g users -d /home/globus globus [root]#mkdir /usr/local/GlobusToolkit4.0.1 [root]#ln -s /usr/local/GlobusToolkit4.0.1 /usr/local/globus [root]#chown globus:users /usr/local/GlobusToolkit4.0.1

Bc 1: Ci gi gt4.0.1-x86_fc_3-binary-installer bng user globus va to:


[globus]$tar xzf /opt/gt4.0.1-x86_fc_3-binaryinstaller.tar.gz [globus]$export JAVA_HOME=/usr/local/java [globus]$export ANT_HOME=/usr/local/ant [globus]$export GLOBUS_LOCATION=/usr/local/globus [globus]$cd gt4.0.1-x86_fc_3-binary-installer [globus]$./configure --prefix=/usr/local/globus --withiodbc=/usr/local/lib [globus]$make [globus]$make install

Bc 2: Cu hnh an ton bo mt cho nt chnh bng user globus. y l bc to simpleCA, k chng thc cho host:
[globus]$$GLOBUS_LOCATION/setup/globus/setup-simple-ca

la chn cc tu chn cho lnh ny nh trong ph lc A ca [5]. Login vo user root, g lnh:
[root]#$GLOBUS_LOCATION/setup/globus_simple_ca_ebb88ce5_setup /setup-gsi [root]#source $GLOBUS_LOCATION/etc/globus-user-env.sh [root]#grid-cert-request -host `hostname`

Login vo user globus, g lnh:


[globus]$grid-ca-sign -in /etc/gridsecurity/hostcert_request.pem -out hostsigned.pem

Login vo user root, g lnh:


[root]#cp ~globus/hostsigned.pem /etc/gridsecurity/hostcert.pem [root]#cp /etc/grid-security/hostcert.pem/etc/gridsecurity/containercert.pem [root]#cp /etc/grid-security/hostkey.pem/etc/gridsecurity/containerkey.pem
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 97

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

[root]#chown globus:users container*.pem

Cp v k chng thc ngi dng "usergrid". To ngi dng c tn l usergrid. y l ngi dng s s dng cc dch v ca h thng Grid. Dng lnh to user nh vi user globus, sau to to simpleCA, k chng thc cho host nh bc 2. Cc bc tip theo l ci t v cu hnh cc dch v, lnh chi tit trong [5]. Bc 3: Ci t dch v GRAM Login vo user root, g lnh cu hnh sudo:
[root]#visudo

sau , thm 2 dng sau vo file etc/sudoers:


# Globus GRAM entries Runas_Alias GRAM_USERS = usergrid globus ALL=(GRAM_USERS)

NOPASSWD: /usr/local/globus/libexec/globus-gridmap-andexecute -g /etc/grid-security/grid-mapfile

/usr/local/globus/libexec/globus-job-manager-script.pl * globus ALL=(GRAM_USERS)

NOPASSWD: /usr/local/globus/libexec/globus-gridmap-andexecute -g /etc/grid-security/grid-mapfile

/usr/local/globus/libexec/globus-gram-local-proxy-tool *

Bc 4: Ci t GridFTP Login vo user root, dng bt k trnh son tho dng console no ca Fedora to file /etc/xinetd.d/gridftp c ni dung nh sau:
service gsiftp { instances = 100 = stream

socket_type wait user env env server server = no

= root += GLOBUS_LOCATION=/usr/local/globus += LD_LIBRARY_PATH=/usr/local/globus/lib = /usr/local/globus/sbin/globus-gridftp-

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 98

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

server_args

= -i

log_on_success+= DURATION nice disable } = 10 = no

Sau g lnh:
[root]#echo "gsiftp 2811/tcp" >> /etc/services [root]#/etc/init.d/xinetd reload [root]#netstat -an | grep 2811

Bc 5: Khi ng GT container t ng khi boot my Login vo user globus, to file $GLOBUS_LOCATION/bin/start-stop c ni dung nh sau:
#!/bin/sh set e export GLOBUS_LOCATION=/usr/local/globus export JAVA_HOME=/usr/local/java export ANT_HOME=/usr/local/ant export GLOBUS_OPTIONS="-Xms256M -Xmx512M" . $GLOBUS_LOCATION/etc/globus-user-env.sh cd $GLOBUS_LOCATION case "$1" in start) $GLOBUS_LOCATION/sbin/globus-start-containerdetached -p 8443 ;; stop) $GLOBUS_LOCATION/sbin/globus-stop-containerdetached ;; *) echo "Usage: globus {start|stop}" >&2 exit 1 ;; esac

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 99

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

exit 0 [globus]$chmod +x $GLOBUS_LOCATION/bin/start-stop

Login vo user root, to file /etc/init.d/globus c ni dung sau:


#!/bin/sh e ## BEGIN chkconfig header # chkconfig: 345 80 30 # processname: /usr/local/globus/bin/start-stop # description: sysv init script to start GT container ## END chkconfig header case "$1" in start) su - globus /usr/local/globus/bin/start-stop start ;; stop) su - globus /usr/local/globus/bin/start-stop stop ;; restart) $0 stop sleep 1 $0 start ;; *) echo "Usage: $0 {start|stop|restart}n" >&2 ;; esac exit 0 [root]#755 /etc/init.d/globus [root]#chkconfig --add globus [root]#service globus start

Bc 6: Cu hnh RFT. Cu hnh h qun tr c s d liu PostgresSQL Thm dng "host IP ca nt ta ang ci t.
[root]#/etc/init.d/postgresql restart
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

rftDatabase

"globus" "172.16.14.130" 255.255.255.0

trust" vo tp tin /var/lib/pgsql/data/pg_hba.conf, trong 172.16.14.130 l a ch

Trang 100

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

[root]#su postgres -c "createuser -P globus"

To c s d liu ban u cho RFT


[globus]$createdb rftDatabase [globus]$psql -d rftDatabase -f [globus]$$GLOBUS_LOCATION/share/globus_wsrf_rft/rft_schema.sq l

Thay i mt khu trong file $GLOBUS_LOCATION/etc/globus_wsrf_rft/jndi-config.xml thnh "globus"

4.2.4 Ci t cc nt tnh ton


Phn ny trnh by cc bc tip theo ci t c s h tng Grid ln cc nt tnh ton. Ci t ny phi thc hin lp trn tt c cc nt tnh ton c trong li. Trn cc nt tnh ton vic ci t nh to user hay ci t GT4, GridFTP, GT container, GRAM v RFT u thc hin tng t nh vi nt server. Ring ci t GSI th c cht khc bit sau: Dng user globus, sao chp file globus_simple_ca_ebb88ce5_setup0.18.tar.gz trong th mc ~globus/.globus/simpleCA trn nt chnh vo th mc home ca user globus trn nt tnh ton.
[globus]$export GLOBUS_LOCATION=/usr/local/globus [globus]$$GLOBUS_LOCATION/sbin/gpt-build globus_simple_ca_ebb88ce5_setup-0.18.tar.gz [globus]$$GLOBUS_LOCATION/sbin/gpt-postinstall

Login vo user root, g lnh:


[root]#export GLOBUS_LOCATION=/usr/local/globus [root]#source $GLOBUS_LOCATION/etc/globus-user-env.sh [root]#$GLOBUS_LOCATION/setup/globus_simple_ca_ebb88ce5_setup /setup-gsi

To yu cu cp chng thc v k chng thc host cho nt tnh ton bng user root:
[root]#grid-cert-request -host `hostname`

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 101

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

copy tp tin /etc/grid-security/hostcert_request.pem ti ngi qun tr globus trn nt CA: globus@gridsv.it.bkvn vo th mc ~globus/.globus/hosts/<hostname>, trong (<hostname>: tn ca nt tnh ton xin cp chng thc) Ngi qun tr globus@gridsv.it.bk.vn s k chng thc bng lnh
grid-ca-sign -in.globus/hosts/<hostname>/hostcert_request.pem -out hostcert.pem

sau chuyn tp tin hostcert.pem ny vo li th mc /etc/grid-security trn nt tnh ton bng quyn root.
[root]#cp ~globus/hostsigned.pem /etc/gridsecurity/hostcert.pem [root]#cp /etc/grid-security/hostcert.pem/etc/gridsecurity/containercert.pem [root]#cp /etc/grid-security/hostkey.pem/etc/gridsecurity/containerkey.pem [root]#chown globus:users container*.pem [root]#ls -l /etc/grid-security/*.pem [root]#echo ""/O=Grid/OU=GlobusTest/OU=simpleCAgridca.math.hcmuns.edu.vn/OU=math.hcmuns.edu.vn/CN=igrid" igrid" >> /etc/grid-security/grid-mapfile

To yu cu cp chng thc v k chng thc ngi dng usergrid trn nt tnh ton. Bc ny ch cn thc hin nu cho php ngi dng usergrid login vo nt tnh ton s dng li.

4.2.5 ng b thi gian gia cc nt trong li


y l bc kh quan trng bi cc chng thc u ch c hiu lc trong mt khong thi gian. Yu cu tt c cc nt u phic gi ntp-4.0.0 tr. Ci t NTP server trn nt chnh u tin thay i ni dung tp tin /etc/ntp.conf thnh:
restrict default noserve restrict 127.0.0.1 restrict 172.16.14.0 mask 255.255.255.0 server 0.pool.ntp.org

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 102

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

server server fudge

1.pool.ntp.org 2.pool.ntp.org 172.16.14.0 stratum 10

driftfile /etc/ntp/drift broadcastdelay authenticate no 0.008

Cp nht ngy gi cho nt chnh:


[root]#ntpdate -u 0.pool.ntp.org

(lp li lnh ny cho n khi gi ca nt chnh l chnh xc) Khi ng NTP server:
[root]#chkconfig ntpd on [root]#/etc/init.d/ntpd

Cp nht trng thi cho NTP server:


[root]#ptrace 0.pool.ntp.org [root]#ntptrace 1.pool.ntp.org [root]#ntptrace 2.pool.ntp.org [root]#ntpq p [root]#ntptrace localhost

Ci t NTP client trn cc nt tnh ton


[root]#ntpdate -b 172.16.14.130

(Chy nhiu ln lnh ny cho n khi thi gian ca h thng l chnh xc)
[root]#chkconfig crond on

Thm tp tin /etc/cron.hourly/ntp.client ni dung nh sau:


#!/bin/sh # Update system time every hour from time server: 172.29.3.131 ntpdate -b 172.16.14.130 [root]#chmod +x /etc/cron.hourly/ntp.client [root]#/etc/init.d/crond reload

4.2.6 Cu hnh cc dch v mc li


Sau khi ci t xong ht cc gi cn thit, ta tin hnh cu hnh cc dch v IndexService, WebMDS, MyProxy server. Thng tin v lnh cu hnh c thc hin nh trong [5].
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 103

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Cu hnh IndexService Ta s cu hnh cc nt tnh ton ng k dch v Index vi nt chnh 172.29.3.130, bng cch thm dng:
<upstream>https://gridca.math.hcmuns.edu.vn:8443/wsrf/service s/DefaultIndexService</upstream>

vo tp tin $GLOBUS_LOCATION/etc/globus_wsrf_mds_index/hierarchy.xml, sau khi ng li GT container:


[globus]$globus-stop-container-detached [globus]$sleep 5 [globus]$globus-start-container-detached

Cu hnh WebMDS y l ng dng Web cho php hin th cc thng tin MDS trong mt li. Ch ci t WebMDS trn nt chnh bng gi Tomcat jakarta-tomcat-4.1.31.tar.gz:
[root]#tar zxf /opt/jakarta-tomcat-4.1.31.tar.gz [root]#ln -s jakarta-tomcat-4.1.31 tomcat [root]#echo "export CATALINA_HOME=/usr/local/tomcat" >>/etc/profile [root]#echo "export PATH=$CATALINA_HOME/bin:$PATH" >> /etc/profile

Cu hnh v trin khai WebMDS vo Tomcat container Sa tham s "endpoint" trong file $GLOBUS_LOCATION/lib/webmds/conf/indexinfo thnh:
"https://172.29.3.130:8443/wsrf/services/DefaultIndexService" [root]#$GLOBUS_LOCATION/lib/webmds/bin/webmds-create-contextfile $CATALINA_HOME/conf/Catalina/localhost [root]#$CATALINA_HOME/bin/startup.sh

Ci t v cu hnh MyProxy server trn nt chnh Dng qun l cc chng thc ngi dng ti cc nt khc trong li.
[root]# cp $GLOBUS_LOCATION/etc/myproxy-server.config /etc

Sau , son li tp tin /etc/myproxy-server.config nh [5].

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 104

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

4.3 Kt ni mt Cluster vo Grid


Sau khi c mt li, thc hin thit lp cc mng con, mi mng con c coi l mt h thng tnh ton song song phn cm (Cluster). Thc hin kt ni cc mng con ny vo li.

4.3.1 Cu hnh phn cng


Trn trung tm trin khai 2 cluster kt ni vo h thng li c thit lp trong phn trc vi topo mng nh hnh di:

Hnh 4-2 Topo mng trin khai kt ni Cluster vo li

t ci t th nghim ny s dng hai cluster do mt nhm sinh vin thc tp trn trung tm thit lp trc , cluster th nht gm 4 my tnh c a ch IP t 172.17.14.1 n 172.17.14.4 c ci t Globus Toolkits, GlobusPbsScheduler, PBS v MPICH. S dng my c a ch 172.17.14.1 lm my ch, ng thi n c mt a ch IP dng giao tip vi bn ngoi l 172.16.14.150. Cluster th hai gm 6 my tnh c thit lp a ch IP t
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 105

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

172.17.14.10 n 172.17.14.15, trn cc my ny ci t Globus Toolkits, GlobusPbsScheduler, PBS v MPICH. cluster th hai, s dng my c a ch 172.17.14.10 lm my ch, my ch ny c mt a ch IP dng giao tip vi bn ngoi l 172.16.14.152. Bn cnh cc cluster thm mi, h thng li tnh ton vn gi nguyn ton b cc my n thit lp mc trc.

4.3.2 Cu hnh cluster-based PBS


Cng vic cu hnh cluster-based PBS c thc hin trn nt ch ca cluster. PBS cung cp cho chng ta mt lnh qun l cu hnh PBS dng cho vic cu hnh trn giao din dng lnh l lnh qmgr. Cc lnh cu hnh PBS Cc lnh cu hnh cho PBS thng c cu trc nh sau:
command server [names] [attr OP value[,...]] command queue command node [names] [attr OP value[,...]] [names] [attr OP value[,...]]

y command thc hin mt tc ng ln cc i tng server, queue, node. Cc lnh ny bao gm [14]: 1. Active: dng kch hot i tng 2. Create: to mt i tng mi nh hng i hoc nt 3. Delete: xa mt i tng hng i hoc nt 4. Help: hin th thng tin chi tit ca lnh 5. List: lit k cc thuc tnh v gi tr ca i tng 6. Print: hin th cc hng i c thit lp v cc thuc tnh ca server 7. Set: nh ngha hoc thm cc thuc tnh cho i tng, ngc li Unset Cu hnh cho i tng Server S dng cc thuc tnh ca lnh cu hnh PBS [14] nh: 1. scheduling: kch hot hoc tt b lp lch 2. max_user_run: s cng vic ti a mt ngi c th thc hin cng lc 3. acl_host_enable: cho php server s dng danh dch my trong acl_hosts
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 106

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

4. acl_hosts: danh sch cc my c th yu cu t server 5. default_queue: hng i mc nh dng khi ko c ch r tn 6. log_events: di xu k t s ghi li cc s kin xy ra trong h thng 7. mail_from: nh danh ca ngi dng m server s dng gi th 8. queue_other_job: cho php ngi dng khng s hu cng vic nhng xem c tnh trng ca n 9. resources_default: tp cc gi tr mc nh 10. scheduler_interation: khong thi gian gia cc ln lp lch 11. default_node: nt tnh ton mc nh dng khi khng ch r tn nt cu hnh cho server. Ton b vic cu hnh cho i tng Server c thc hin bng cc lnh nh trong shell script [17]. Cu hnh i tng hng i (queue) Trong PBS c hai kiu hng i khc nhau [14]: hng i nh tuyn (routing) v hng i thc thi (execution). Mt hng i nh tuyn dng chuyn cng vic t hng i ny sang mt hng i khc trong PBS Server. Cn hng i thc thi dng thc hin cng vic, cng vic tn ti trong n sut qu trnh chy. Thng thng mt PBS Server mun hot ng tt phi c c hai loi hng i ny. Cc thuc tnh ca hng i chia thnh ba nhm: Nhm thuc tnh ch p dng vi hng i thc thi Nhm thuc tnh ch p dng vi hng i nh tuyn Nhm thuc tnh p dng vi c hai kiu hng i 1. queue_type_route: l kiu hng i nh tuyn, dng phn cng vic 2. max_running: s cng vic ti a c th nm trong hng i cng lc 3. route_destination: cc hng i m cng vic s c chuyn n 4. enable: cho php nhn/khng nhn cng vic mi 5. started: cng vic c lp lch s c thc thi t hng i ny Cc thuc tnh ca hng i thc thi
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Cc thuc tnh ca hng i nh tuyn

Trang 107

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

1. queue_type_excecute: kiu hng i thc thi 2. priority: mc u tin ca hng i ny so vi cc hng i . 3. max_running: s cng vic ti a ng thi c trong hng i 4. resource_max: lng ti nguyn ln nht c th c yu cu cho mt cng vic, ngc li l resource_min 5. resource_default: ti nguyn mc nh 6. enable: cho php nhn/khng nhn cng vic mi 7. started: cng vic c lp lch s c thc thi t hng i ny Vic cu hnh i tng hng i nh tuyn v hng i thc thi c thc hin thng qua cc lnh trong shell script [17]. to ra cc loi hng i khc nhau ra ch cn thay i cc gi tr nhp cho cc thuc tnh ca lnh trong shell script dng to mt hng i thc thi c tn l small, mc u tin 100, dng thc hin cc cng vic c thi gian yu cu t hn 20 pht.
# Create and define queue small Qmgr: create queue small Qmgr: set queue small queue_type = Execution Qmgr: set queue small Priority = 100 Qmgr: set queue small max_running = 10 Qmgr: set queue small resources_max.cput = 00:20:00 Qmgr: set queue small resources_default.cput = 00:20:00 Qmgr: set queue small enabled = True Qmgr: set queue small started = True

Tt c cc shell script [17] c dng v kim th trn trung tm tnh ton hiu nng cao, v cho kt qu kh tt.

4.3.3 Cu hnh li da trn GT


Trc khi thc hin cu hnh cho li ta cn phi thc hin ci t Globus Toolkits 4.0.1 trn nt ch ca cluster, ci t gi phn mm scheduler-pbs-3.2src_bundle.tar.gz. Sau y s l mt s bc cu hnh cn thit thit lp PBS l trnh qun l cng vic cho li.
Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 108

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

% gpt-postinstall

lnh ny thc hin khi to cc cu hnh mc nh cho li.


$GLOBUS_LOCATION/setup/globus/setup-mmjfs-pbs

lnh ny thc hin cu hnh thm dch v MasterPbsManagedJobFactoryService vo trong cc dch v ca Globus Toolkit.
$GLOBUS_LOCATION/setup/globus/setup-mjs-pbs

lnh ny thc hin cu hnh thm dch v PbsManagedJobFactoryService vo trong cc dch v ca Globus Toolkit.
$GLOBUS_LOCATION/setup/globus/setup-pbs-provider

lnh ny thc hin cc cu hnh thnh phn RIPS (Resource Information Provider) trong li lm vic vi PBS, RIPS nhn cc thng tin tr v t PBS_Server v cung cp cho trnh qun l cng vic.
$GLOBUS_LOCATION/setup/globus/setup-globus-job-manager-pbs -cpu-per-node=1

lnh ny thit lp s lng cc b vi x l (CPU) trn mt nt n trong cluster, thng s ny phi ph hp vi ti nguyn tng ng trong cluster.
$GLOBUS_LOCATION/setup/globus/setup-globus-job-manager-pbs -remote-shell=rsh

lnh ny thc hin vic chuyn i remote-shell mc nh thnh rsh, thng thng cc h thng tnh ton song song phn cm thng dng rsh truy cp cc ti nguyn trong cluster.
$GLOBUS_LOCATION/setup/globus/setup-globus-job-manager-pbs validate-queues=yes

lnh ny thc hin kim tra hng i tng ng trong PBS ph vi yu cu hay khng.

4.4 Kt chng
Trong chng 4, lun vn gii thiu cc bc cn thc hin trong vic lp k hoch trin khai mt li. ng thi cng a ra mt m hnh li th nghim, bao gm c cc hng dn ci t. y l bc p dng thc t cho phn l thuyt c nghin cu trong 3 chng u tin.

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 109

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

KT LUN V KIN NGH


Kt qu thu c
Trong qu trnh tm hiu v tnh ton li v thc hin ci t mt li th nghim, lun vn thu c cc kt qu sau: Bc u lm ch cng ngh tnh ton li cn rt mi nc ta Nm bt c b cng c Globus Toolkit v vn dng vo ci t thc t Trin khai h thng li th nghim Bit cch kt ni cluster vo li, nhm tng sc mnh tnh ton cho li

Hng pht trin


C th ni vic pht trin mi trng tnh ton Mng li l mt hng nghin cu cng ngh c trin vng rt ln trong tng lai, c nhng ng dng ln lao c th tn dng ht c ngun lc ti nguyn rt a dng v phong ph hin nay. Cc vn m Grid t ra v ang c cng ng th gii gii quyt. Cc thnh qu m Gird Computing em li gip giai quyt c nhiu bi ton m trc y khng th gii quyt bng k thut ton song song. Tuy nhin cng c mt s bi ton m cng ngh Grid cha th th gii quyt c. D sao vic tn dng c ngun ti nguyn rnh ri tnh tan cc bi tan i hi lng tnh tan v ti nguyn cao lun l mt th mnh pht trin tt mi trng tnh ton mng li ny. Da trn kin thc thu c trong qu trnh lm lun vn. Trong thi gian ti, vic nghin cu tip v tnh ton li s i theo mt trong cc hng pht trin sau: Nghin cu h tng truyn thng trong mi trng li, tm hiu a ra cc gii php kt ni cc nt li vi nhau trong iu kin h tng truyn thng cha tt nh Vit Nam. V mt lp trnh th tin hnh xy dng cc mdun ring nh: qun tr ngi dng li, mi gii ti nguyn, lp lch kinh t, .

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 110

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

TI LIU THAM KHO


Sch ting Anh:
[1] Ian Foster, Carl Kesselman (eds) - The Grid: Blueprint for a New Computing Infrastructure, 1st edition, Morgan Kaufmann Publishers, San Francisco, USA (1 November 1998), ISBN: 1558604758. [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] Daniel Minoli - A Networking Approach to Grid Computing Ahmar Abbas - Grid Computing: A Practical Guide to Technology and Applications Fran Berman, Anthony J.G.Hey, Geoffrey C.Fox Grid computing: Make the Global Infrastructure a Reality Globus Books - Globus Toolkit 4 Admin Guide Globus Books - A Globus Primer 0.6 (2005) IBM Red Books Globus Toolkit 3.0 Quick Start Joshy Joseph, Craig Fellenstein - Grid Computing IBM Red Books Fundamentals of Grid Computing. IBM Red Books Introduction to Grid Computing with Globus Toolkit The Globus Alliance - How to Build a Service using GT4 Sun BluePrints Online - The Sun Infrastructure Solution for Grid Computing Sun BluePrints Online - Introduction to the Cluster Grid Proytable Batch System - OpenPBS Release 2.3 - Administrator James Patton Jones - PBS Pro 5.0 - Administrator Guide Jarek Nabryski, Jenifer M.Schopf, Jan Weglars - Grid Resource Management

Sch ting Vit:


[17] n i hc - Kt ni Globus v PBS - Nguyn c Mnh kho K45 trng HBKHN

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 111

Lun vn thc s khoa hc

Nghin cu tng quan v tnh ton li v ci t m hnh th nghim

Trang Web:
[18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] Workspace Admin Guide:http://workspace.globus.org/vm/TP1.2/doc/adminindex.html#installing-workspace-service SimpleCA: http://gdp.globus.org/gt3-tutorial/multiplehtml/ch12s02.html Website Globus Toolkits http://www.globus.org Globus Toolkit Tutorial: http://www.globusconsortium.org/tutorial/ http://www.globus.org/toolkit/docs/4.0/admin/docbock/index.html http://www.globus.org/toolkit/docs/4.0/data/gridftp/admin-index.html http://www.globus.org/toolkit/docs/development/4.1.0/admin/docbook/quick start.html#q-container ML. Levisetto - About PBS and the Grid: http://www.bo.infn.it/alice/introgrid/pbsabout/index.html Quick Tutorial for PBS: http://www.princeton.edu/~letchu/compting/scientific_computing/PBS.html PBS Userguide: http://www.doesciencegrid.org/public/pbs/homepage.html http://www.w3.org/2002/ws/ J. Postel, J. Reynolds, File Transfer Protocol, ti nguyn sn c trn web ti a ch http://www.w3.org/Protocols/rfc959/ http://doesciencegrid.org/projects/GPDK/

Nguyn Th Kim Tuyn Lp Cao hc CNTT Kho 2004-2006 - i hc BKHN

Trang 112

You might also like