You are on page 1of 4

Cc bc chnh gii quyt bi ton phn lp

Phn lp d liu gm hai bc x l chnh: Bc 1: Hc (training), mc ch ca bc ny l xy dng mt m hnh xc nh mt tp cc lp d liu. M hnh ny c xy dng bng cch phn tch cc b d liu ca mt c s d liu, mi b d liu c xc nh bi gi tr ca cc thuc tnh. Gi s mi b d liu thuc v mt trong cc lp c nh ngha trc, iu ny c xc nh bi mt trong cc thuc tnh, gi l thuc tnh phn lp. Trong ng cnh ca bi ton phn lp, mi b d liu c xem nh l mt mu, mt v d, hay mt i tng. Nhng b d liu c phn tch xy dng m hnh phn lp c ly t trong tp d liu hc hay d liu hun luyn (training data set). Nhng b d liu ring l to thnh tp d liu hun luyn cn gi l nhng mu hun luyn (training samples) v c chn ngu nhin t mt kho cc mu. Bc ny c xem l hc c gim st, ngc li vi hc c gim st l hc khng c gim st (unsupervised learing), tiu biu l bi ton gom cm (clustering) trong cc lp m cc mu hun luyn thuc v l khng bit trc v s lp d liu cng khng c bit trc.

Hnh 1-1: Bc 1 - Hc xy dng m hnh phn lp M hnh c a ra sau khi phn tch xong tp d liu hun luyn thng c dng l nhng quy tc phn lp, cy quyt nh hay cc cng thc ton hc. V d, hnh 1.1 c mt c s d liu v thng tin khch hng, mt m hnh phn lp (hay lut phn lp) c xy dng sau qu trnh hc bc 1 c th xc nh nhng khch hng tin cy v nhng khch hng bnh thng ca mt ca hng. Lut phn lp ny c th c s dng phn loi cc mu d liu liu trong tng lai, cng nh n cung cp mt tri thc hu ch cha trong c s d liu. Bc 2 : Kim tra v nh gi, bc ny s dng m hnh phn lp c xy dng bc 1 vo vic phn lp.

Hnh 1-2: Bc 2 - Kim tra v nh gi u tin, nh gi chnh xc ca m hnh hay b phn lp ny, bng cch s dng mt tp cc mu c phn lp th (test) gi l b th (test set). Nhng mu ny c chn ngu nhin v c lp vi cc mu c hc bc 1 gi l mu th (test sample). chnh xc ca mt m hnh phn lp da trn b th l t l nhng mu th c phn lp ng bng m hnh phn lp . Ngha l vi mi mu th, so snh lp ng m mu th thuc v vi lp m m hnh phn lp ny d on cho mu th . Lu , nu chnh xc ca m hnh ny da trn tp d liu hun luyn, th m hnh ny c nh gi l ti u, n phn lp ng hon ton trn cc mu c hc, trong trng hp ny, m hnh hng ti s qu kht (overfitting) ca d liu. V vy phi s dng mt b d liu liu th. Nu chnh xc ca mt m hnh c xem xt c th chp nhn c th m hnh c dng phn lp cho cc b d liu hoc cc i tng trong

tng lai. V d, m hnh phn lp c xy dng trong bc 1 bng cch phn tch d liu ca cc khch hng bit, c dng d on s nh gi cc khch hng mi trong tng lai hnh 1-2.

You might also like