Professional Documents
Culture Documents
Machine Learning Cơ Bản
Machine Learning Cơ Bản
Bi2:PhnnhmccthuttonMachineLearning
General (/tags#General)
C hai cch ph bin phn nhm cc thut ton Machine learning. Mt l da trn phng thc hc (learning style), hai l da trn chc nng (function)
(ca mi thut ton).
1.Phnnhmdatrnphngthchc
Theo phng thc hc, cc thut ton Machine Learning thng c chia lm 4 nhm: Supervise learning, Unsupervised learning, Semi-supervised
lerning v Reinforcement learning. C mt s cch phn nhm khng c Semi-supervised learning hoc Reinforcement learning.
SupervisedLearning(Hccgimst)
Supervised learning l thut ton d on u ra (outcome) ca mt d liu mi (new input) da trn cc cp (input, outcome) bit t trc. Cp d
liu ny cn c gi l (data, label), tc (d liu, nhn). Supervised learning l nhm ph bin nht trong cc thut ton Machine Learning.
Mt cch ton hc, Supervised learning l khi chng ra c mt tp hp bin u vo X = {x1 , x2 , , xN } v mt tp hp nhn tng ng
Y = {y , y , , y }, trong xi , y l cc vector. Cc cp d liu bit trc (xi , y ) X Y c gi l tp training data (d liu hun luyn).
1 2 N i i
y f (xi ), i = 1, 2, , N
i
V d 1: trong nhn dng ch vit tay, ta c nh ca hng nghn v d ca mi ch s c vit bi nhiu ngi khc nhau. Chng ta a cc bc nh
ny vo trong mt thut ton v ch cho n bit mi bc nh tng ng vi ch s no. Sau khi thut ton to ra (sau khi hc) mt m hnh, tc mt
hm s m u vo l mt bc nh v u ra l mt ch s, khi nhn c mt bc nh mi m m hnh cha nhn thy bao gi, n s d on bc nh
cha ch s no.
http://machinelearningcoban.com/2016/12/27/categories/ 1/7
5/9/2017 MachineLearningcbn
V d ny kh ging vi cch hc ca con ngi khi cn nh. Ta a bng ch ci cho mt a tr v ch cho chng y l ch A, y l ch B. Sau mt
vi ln c dy th tr c th nhn bit c u l ch A, u l ch B trong mt cun sch m chng cha nhn thy bao gi.
V d 2: Thut ton d cc khun mt trong mt bc nh c pht trin t rt lu. Thi gian u, facebook s dng thut ton ny ch ra cc
khun mt trong mt bc nh v yu cu ngi dng tag friends - tc gn nhn cho mi khun mt. S lng cp d liu (khun mt, tn ngi) cng
ln, chnh xc nhng ln t ng tag tip theo s cng ln.
V d 3: Bn thn thut ton d tm cc khun mt trong 1 bc nh cng l mt thut ton Supervised learning vi training data (d liu hc) l hng
ngn cp (nh, mt ngi) v (nh, khng phi mt ngi) c a vo. Ch l d liu ny ch phn bit mt ngi v khng phi mt ngi m
khng phn bit khun mt ca nhng ngi khc nhau.
Thut ton supervised learning cn c tip tc chia nh ra thnh hai loi chnh:
Classification(Phnloi)
Mt bi ton c gi l classication nu cc label ca input data c chia thnh mt s hu hn nhm. V d: Gmail xc nh xem mt email c phi
l spam hay khng; cc hng tn dng xc nh xem mt khch hng c kh nng thanh ton n hay khng. Ba v d pha trn c chia vo loi ny.
Regression(Hiquy)
(ting Vit dch l Hi quy, ti khng thch cch dch ny v bn thn khng hiu n ngha l g)
Nu label khng c chia thnh cc nhm m l mt gi tr thc c th. V d: mt cn nh rng xm2 , c y phng ng v cch trung tm thnh ph
zkm s c gi l bao nhiu?
Gn y Microsoft c mt ng dng d on gii tnh v tui da trn khun mt (http://how-old.net/). Phn d on gii tnh c th coi l thut ton
Classication, phn d on tui c th coi l thut ton Regression. Ch rng phn d on tui cng c th coi l Classication nu ta coi tui l
mt s nguyn dng khng ln hn 150, chng ta s c 150 class (lp) khc nhau.
UnsupervisedLearning(Hckhnggimst)
Trong thut ton ny, chng ta khng bit c outcome hay nhn m ch c d liu u vo. Thut ton unsupervised learning s da vo cu trc ca
d liu thc hin mt cng vic no , v d nh phn nhm (clustering) hoc gim s chiu ca d liu (dimension reduction) thun tin trong
vic lu tr v tnh ton.
Mt cch ton hc, Unsupervised learning l khi chng ta ch c d liu vo X m khng bit nhn Y tng ng.
Nhng thut ton loi ny c gi l Unsupervised learning v khng ging nh Supervised learning, chng ta khng bit cu tr li chnh xc cho mi
d liu u vo. Ging nh khi ta hc, khng c thy c gio no ch cho ta bit l ch A hay ch B. Cm khng gim st c t tn theo ngha
ny.
Clustering(phnnhm)
Mt bi ton phn nhm ton b d liu X thnh cc nhm nh da trn s lin quan gia cc d liu trong mi nhm. V d: phn nhm khch hng
da trn hnh vi mua hng. iu ny cng ging nh vic ta a cho mt a tr rt nhiu mnh ghp vi cc hnh th v mu sc khc nhau, v d tam
gic, vung, trn vi mu xanh v , sau yu cu tr phn chng thnh tng nhm. Mc d khng cho tr bit mnh no tng ng vi hnh no
hoc mu no, nhiu kh nng chng vn c th phn loi cc mnh ghp theo mu hoc hnh dng.
Association
L bi ton khi chng ta mun khm ph ra mt quy lut da trn nhiu d liu cho trc. V d: nhng khch hng nam mua qun o thng c xu
hng mua thm ng h hoc tht lng; nhng khn gi xem phim Spider Man thng c xu hng xem thm phim Bat Man, da vo to ra mt h
thng gi khch hng (Recommendation System), thc y nhu cu mua sm.
SemiSupervisedLearning(Hcbngimst)
http://machinelearningcoban.com/2016/12/27/categories/ 2/7
5/9/2017 MachineLearningcbn
SemiSupervisedLearning(Hcbngimst)
Cc bi ton khi chng ta c mt lng ln d liu X nhng ch mt phn trong chng c gn nhn c gi l Semi-Supervised Learning. Nhng bi
ton thuc nhm ny nm gia hai nhm c nu bn trn.
ReinforcementLearning(HcCngC)
Reinforcement learning l cc bi ton gip cho mt h thng t ng xc nh hnh vi da trn hon cnh t c li ch cao nht (maximizing the
performance). Hin ti, Reinforcement learning ch yu c p dng vo L Thuyt Tr Chi (Game Theory), cc thut ton cn xc nh nc i tip
theo t c im s cao nht.
(/2016/12/27/categories/)
v tt nhin, khng th p dng thut ton tng t nh IBM Deep Blue (https://en.wikipedia.org/wiki/Deep_Blue_(chess_computer)) (IBM Deep Blue
thng con ngi trong mn c vua 20 nm trc). V c bn, AlphaGo bao gm cc thut ton thuc c Supervised learning v Reinforcement learning.
Trong phn Supervised learning, d liu t cc vn c do con ngi chi vi nhau c a vo hun luyn. Tuy nhin, mc ch cui cng ca
AlphaGo khng phi l chi nh con ngi m phi thm ch thng c con ngi. V vy, sau khi hc xong cc vn c ca con ngi, AlphaGo t chi vi
chnh n vi hng triu vn chi tm ra cc nc i mi ti u hn. Thut ton trong phn t chi ny c xp vo loi Reinforcement learning. (Xem
thm ti Google DeepMinds AlphaGo: How it works (https://www.tastehit.com/blog/google-deepmind-alphago-how-it-works/)).
V d 2: Hun luyn cho my tnh chi game Mario (https://www.youtube.com/watch?v=qv6UVOQ0F44). y l mt chng trnh th v dy my tnh
chi game Mario. Game ny n gin hn c vy v ti mt thi im, ngi chi ch phi bm mt s lng nh cc nt (di chuyn, nhy, bn n) hoc
khng cn bm nt no. ng thi, phn ng ca my cng n gin hn v lp li mi ln chi (ti thi im c th s xut hin mt chng ngi vt
c nh mt v tr c nh). u vo ca thut ton l s ca mn hnh ti thi im hin ti, nhim v ca thut ton l vi u vo , t hp phm
no nn c bm. Vic hun luyn ny c da trn im s cho vic di chuyn c bao xa trong thi gian bao lu trong game, cng xa v cng
nhanh th c im thng cng cao (im thng ny khng phi l im ca tr chi m l im do chnh ngi lp trnh to ra). Thng qua hun
luyn, thut ton s tm ra mt cch ti u ti a s im trn, qua t c mc ch cui cng l cu cng cha.
http://machinelearningcoban.com/2016/12/27/categories/ 3/7
5/9/2017 MachineLearningcbn
2.Phnnhmdatrnchcnng
C mt cch phn nhm th hai da trn chc nng ca cc thut ton. Trong phn ny, ti xin ch lit k cc thut ton. Thng tin c th s c trnh
by trong cc bi vit khc ti blog ny. Trong qu trnh vit, ti c th s thm bt mt s thut ton.
RegressionAlgorithms
1. Linear Regression (/2016/12/28/linearregression/)
2. Logistic Regression (/2017/01/27/logisticregression/#sigmoid-function)
3. Stepwise Regression
ClassificationAlgorithms
1. Linear Classier
2. Support Vector Machine (SVM)
3. Kernel SVM
4. Sparse Representation-based classication (SRC)
InstancebasedAlgorithms
1. k-Nearest Neighbor (kNN) (/2017/01/08/knn/)
2. Learning Vector Quantization (LVQ)
RegularizationAlgorithms
1. Ridge Regression
2. Least Absolute Shrinkage and Selection Operator (LASSO)
3. Least-Angle Regression (LARS)
BayesianAlgorithms
1. Naive Bayes
2. Gaussian Naive Bayes
ClusteringAlgorithms
1. k-Means clustering (/2017/01/01/kmeans/)
2. k-Medians
3. Expectation Maximization (EM)
ArtificialNeuralNetworkAlgorithms
1. Perceptron (/2017/01/21/perceptron/)
2. Softmax Regression (/2017/02/17/softmax/)
3. Multi-layer Perceptron (/2017/02/24/mlp/)
http://machinelearningcoban.com/2016/12/27/categories/ 4/7
5/9/2017 MachineLearningcbn
4. Back-Propagation (/2017/02/24/mlp/#-backpropagation)
DimensionalityReductionAlgorithms
1. Principal Component Analysis (PCA)
2. Linear Discriminant Analysis (LDA)
EnsembleAlgorithms
1. Boosting
2. AdaBoost
3. Random Forest
3.Tiliuthamkho
1. A Tour of Machine Learning Algorithms (http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/)
11Comments tiepvu
1 Login
Jointhediscussion
Laicasaane2monthsago
Mtslichnhttrongbivit:
1/"Cpdliunycncgil(data.label)"
=>Chmhayphy?
2/"mtngivkhngphimtngi"
=>mtngi
3/"gimschiucadliu(dimentionreduction)"
=>dimension
4/"sauyucutrphnchngthnhtngnhm"
=>yucu
5/"ThctchothyrtnhiuccbitonMachineLearning"
=>biton
6/"ccthuttoncnxcnhncitiptheo"
=>nci
7/"SparseRepresntationbasedclassification"
=>Representation
8/"LeastAbsoluteShringkageandSelectionOperator"
=>Shrinkage
9/"LearninVectorQuantization(LVQ)"
=>Learning
2 Reply Share
Cmnbnvcrtk.Tisalinhngli.
Reply Share
PhmHng3monthsago
SupportVectorMachine(SMV)saichnhtkabn
http://machinelearningcoban.com/2016/12/27/categories/ 5/7
5/9/2017 MachineLearningcbn
2 Reply Share
Cmnbn,mnhssasm.
Reply Share
NamVu3monthsago
LogisticRegressionlchoClassificationmanhnh?
Reply Share
Ciranhgiirtmongmanh.NvncgitnlRegressionnnanhxemnvo.
Reply Share
Dung4monthsago
AnhcthvitmtbivthuttonExpectationMaximisation(EM)dcko.EManghcmkohiuvnmy.
Reply Share
Anhsvit,nhnganhcnvitbikmeansclusteringtrc.
1 Reply Share
Phucnh4monthsago
Cccngthctonhcblimathprocessingerror,anhcthxemlikhng.
Reply Share
Choem,
Emvobngtrnhduytnovhiuhnhgi?Emthtrncctrnhduytkhccha?
AnhkimtratrnChrome,Firefox,OperatrnUbuntuvChrometrnAndroidthycngthctonkhngbli.
Reply Share
Phucnh>TiepVuHuu4monthsago
Dchoanh,
khinyemvobngsafariiosthgpli,githxembnhthngri.Clllitmthi.
Cmnanh.
1 Reply Share
ALSOONTIEPVU
KmeansClustering Bi6:Knearestneighbors
19comments4monthsago 5comments4monthsago
NguyenBinhEmthytrongbicphnbiudinmatrncenterca DaoTuanbinyphidngnci"t10klabelsidx1ubyte"bitrc
miclusterlM=[m1,m2,mN],cphichnnghil michdngn"t10kimagesidx3ubyte"thia.Nnemngh
GiithiuvMachineLearning Bi8:GradientDescent(phn2/2)
21comments4monthsago 12comments4monthsago
LuongphatNguyenChoanh,Blogcaanhththayvemrtcmn NguyentuananhEmcthcmcchtphnohmcaSGD:ti
anh.Emhchicrtnhiuttrangblogcaanh. saoJ(wxy)=1/2(x*wy)^2lyohmlivncnbc2
http://machinelearningcoban.com/2016/12/27/categories/ 6/7
5/9/2017 MachineLearningcbn
http://machinelearningcoban.com/2016/12/27/categories/ 7/7