
OVERVIEW OF BIG DATA

Big data is a collection of data so large and so varied that traditional data management tools cannot handle it. The size of big data keeps growing over time, and each of us contributes to it every day. A few examples show why:

- On large social networks such as Facebook or Twitter, every status or tweet we post creates a small amount of data that Facebook or Twitter has to manage. The data from one person is negligible, but a very large number of people around the world use social networks, so the total amount generated is enormous.
- Real-time search on Google is another example of big data: the amount of data Google must hold so that users can search accurately is extremely large.
- There are many other examples, such as monitoring financial conditions and currency transactions.

For data on the order of hundreds of PB, storing it with traditional tools is clearly too difficult, too expensive, and sometimes impossible.

When we talk about big data we usually mention volume, velocity, and variety. Big data is extremely diverse: it ranges from data generated automatically by machines to data created every day on the Internet and data from satellites. Big data may be structured or have no structure at all.

Why big data is increasingly important today: big data is attracting more and more attention because of its considerable advantages, and many fields need it. The following are some areas where applying big data is clearly necessary:

- Image classification: Picasa from Google is a useful tool that lets users sort and query photos and recognize faces automatically, using computer vision combined with machine learning. It is a good example of the value that applications and services can draw from big data. The tool also makes clear that big data analysis requires advanced analytics such as computer vision and machine learning methods.
- Recommender systems such as Pandora (music), Netflix (films), and Amazon (books and products) use customer data and multiple agents in an approach known as collaborative filtering. This big data service is the subject of much advanced research in machine learning and data mining. Clearly, good recommendations can increase both sales and customer satisfaction.
- Customer-based analytics can perform sentiment analysis for your customers using social network data (Facebook and Twitter) when that text data is correlated with BI (Business Intelligence) gathered from traditional customer transaction records. Sentiment analysis lets a business learn what customers want from its products, how interested they are in its products or in those of competitors, what they like and dislike, and so on.
- Machine-generated data comes from sensor networks (for example, sensors embedded in large systems such as urban transit, traffic lights, and infrastructure); from machine-to-machine data, where sensor or log data from one machine (usually in the field) is transmitted to other machines; and from log files, which are most often used in IT to debug problems and handle unexpected system faults (setting aside problems that need human intervention to recover and resume operation).
- Travel reservation systems are being improved by combining customer preferences, logistics, and prior history to make useful suggestions, which has always been a difficult task.
- Entertainment on social networks is changing the social side of broadcast television and film. Digital delivery meets the demand for content anytime and anywhere, but sharing the experience still has to happen through social networks. Even so, this makes the experience more engaging for audiences, lets audiences create content, and lets writers and artists learn more about their audiences.
- Medical diagnosis often includes rule-based expert decision support systems (DSSes), but with big data there is evidence that these systems may move out of research and become mainstream medical assistants. For example, an expert system that supports psychological evaluation of patients at risk of suicide is discussed in a research paper (see the Resources section). Part of how these systems are validated is comparison with historical data. Rather than replacing human decision making, these systems can improve it when used as a support tool.

BIG DATA AND NOSQL

As noted above, big data is not stored or managed with traditional tools. So how do we manage it? This is where NoSQL comes in. The term NoSQL is often read as "non-relational", and today it is commonly defined as "Not Only SQL". NoSQL does not use the relational data model.

MONGODB

MongoDB is a document-oriented database, not a relational database. The basic idea is to replace the rows of a relational database with documents. By allowing embedded documents and arrays, a document-oriented database can represent complex hierarchical relationships within a single record.
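To make the idea of embedded documents and arrays concrete, here is a minimal sketch using a plain Python dictionary. The blog-post shape, field names, and values are hypothetical and chosen only for illustration; no MongoDB driver is involved.

```python
# Hypothetical blog post held as one record (all names are illustrative).
post = {
    "title": "Big data and NoSQL",
    "author": {"name": "An", "email": "an@example.com"},  # embedded document
    "tags": ["bigdata", "nosql", "mongodb"],              # array of values
    "comments": [                                          # array of embedded documents
        {"user": "Binh", "text": "Nice overview"},
        {"user": "Chi", "text": "More on sharding, please"},
    ],
}

# The whole hierarchy is reachable from the single record,
# with no join against separate author or comment tables.
comment_authors = [comment["user"] for comment in post["comments"]]
print(post["author"]["name"], comment_authors)
```

In a relational design the same data would typically be spread over several tables (posts, authors, tags, comments) and reassembled with joins.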

Scaling: There are two ways to scale a database: (1) scaling up, which means moving to a single machine that can hold more data, and (2) scaling out, which means splitting the data across many machines. Scaling up is the simpler option, since it only involves replacing a device that has less storage with one that has more, but that replacement is very expensive. Data also keeps growing, so the cost of replacing hardware rises each time. In addition, for a large web application that many people want to help build, sharing all the data on a single machine becomes very difficult. Scaling out addresses what scaling up cannot: it expands data capacity and is more economical at the same time. MongoDB is designed to scale out. Its document-oriented data model lets it split data across many servers automatically. It can balance data and load across a cluster, automatically redistributing documents (that is, moving them between servers).
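The scale-out idea can be sketched in a few lines of plain Python. This is not MongoDB's actual sharding mechanism; it is a simplified illustration that assumes a hypothetical `user` field as the key and made-up server names, and it places each document by hashing that key.

```python
import hashlib
from collections import defaultdict


def pick_server(shard_key: str, servers: list) -> str:
    """Map a key to one server with a stable hash (illustrative only)."""
    digest = hashlib.md5(shard_key.encode("utf-8")).hexdigest()
    return servers[int(digest, 16) % len(servers)]


def distribute(docs: list, servers: list) -> dict:
    """Group documents by the server chosen for their 'user' key."""
    placement = defaultdict(list)
    for doc in docs:
        placement[pick_server(doc["user"], servers)].append(doc)
    return dict(placement)


# Hypothetical data: 1000 small status documents.
docs = [{"user": f"user{i}", "status": "hello"} for i in range(1000)]

two = distribute(docs, ["server-a", "server-b"])
three = distribute(docs, ["server-a", "server-b", "server-c"])

for name, placed in sorted(three.items()):
    print(name, len(placed))
```

With this naive modulo scheme, adding a third server changes the placement of many documents, which hints at why automatic redistribution matters in a system that scales out.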

As noted above, a document in MongoDB corresponds to a row in a relational database (although it can express richer semantics than a row). Likewise, a collection corresponds to a table in a relational database. Document: an ordered set of keys and their associated values. Example: {"greeting" : "Hello, world!", "foo" : 3}. Because the key/value pairs are ordered, this is a different document from {"foo" : 3, "greeting" : "Hello, world!"}. In practice the ordering is rarely important, and many programming languages use document representations that do not preserve key order or that reorder keys when needed. The point is that two documents with the same pairs in a different order are distinct, not identical.
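The effect of key order can be illustrated with Python's standard library. This is an analogy rather than MongoDB code: `OrderedDict` stands in for an order-sensitive document, and a plain `dict` stands in for a language representation that ignores order when comparing.

```python
from collections import OrderedDict

# Same key/value pairs, different key order.
a = OrderedDict([("greeting", "Hello, world!"), ("foo", 3)])
b = OrderedDict([("foo", 3), ("greeting", "Hello, world!")])

# Comparing two OrderedDicts takes key order into account.
ordered_equal = a == b

# Comparing plain dicts ignores key order.
unordered_equal = dict(a) == dict(b)

print(ordered_equal)    # False
print(unordered_equal)  # True
```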

Values in a document do not all have to be of the same type; they can be any of several data types. A key in a document is a string, and any UTF-8 character can appear in a key, with a few caveats: a key must not contain the null character; the characters . and $ have special meaning and should be used only in specific circumstances; and, together with keys that start with _, they should generally be treated as reserved. A key is unique within a document: a document cannot contain the same key twice.
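The key rules above can be written down as a small checker. `check_key` is a hypothetical helper written for this note, not part of MongoDB or any driver, and it takes the conservative reading that any key containing . or $ deserves a warning.

```python
def check_key(key) -> list:
    """Return a list of problems with a proposed document key (empty if none)."""
    if not isinstance(key, str):
        return ["key must be a string"]
    problems = []
    if "\0" in key:
        problems.append("key must not contain the null character")
    if "." in key or "$" in key:
        problems.append("'.' and '$' have special meaning; treat as reserved")
    if key.startswith("_"):
        problems.append("keys starting with '_' are treated as reserved")
    return problems


def check_document(doc: dict) -> dict:
    """Map each problematic key in doc to its list of problems."""
    report = {}
    for key in doc:
        problems = check_key(key)
        if problems:
            report[key] = problems
    return report


# Values may be of mixed types; only the keys are checked here.
report = check_document({"greeting": "Hello, world!", "foo": 3, "$bad": True})
print(report)
```

Key uniqueness needs no separate check in this sketch, because a Python dictionary cannot hold the same key twice.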

Collection: a group of documents. In MongoDB a collection is schema-free, meaning that a single collection can hold documents of many different kinds. In practice, people still avoid mixing different kinds of documents in one collection and split them into separate collections instead. The purpose is to make data access and searching easier. Imagine a collection containing many kinds of documents with different shapes: searching for a piece of information becomes harder, because instead of querying documents that share one format we have to handle several formats, which causes a lot of trouble. Databases: a group of collections is called a database.
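The cost of mixing document shapes can be seen even with an in-memory list standing in for a collection. The `type` field and the sample documents below are hypothetical, introduced only for this sketch.

```python
# Hypothetical collection that mixes two document shapes.
mixed = [
    {"type": "user", "name": "An", "age": 21},
    {"type": "post", "title": "Big data", "author": "An"},
    {"type": "user", "name": "Binh", "age": 30},
]

# Every query on the mixed collection has to filter by shape first,
# otherwise d["age"] would fail on the "post" document.
adults_from_mixed = [
    d["name"] for d in mixed if d.get("type") == "user" and d["age"] >= 18
]

# With one shape per collection the query needs no shape check.
users = [d for d in mixed if d["type"] == "user"]
adults_from_users = [d["name"] for d in users if d["age"] >= 18]

print(adults_from_mixed, adults_from_users)
```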

This write-up is still rough and mainly covers a few key points; the report itself will go into more detail.
