Professional Documents
Culture Documents
Real Time Machine Learning
Real Time Machine Learning
net/publication/343062506
CITATIONS READS
0 2,576
1 author:
Saed Sayad
Rutgers, The State University of New Jersey
17 PUBLICATIONS 100 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Saed Sayad on 20 July 2020.
(Linear Models)
By
Saed Sayad
i
REAL TIME MACHINE LEARNING
No part of this book may be reproduced, stored in a retrieval system, or transmitted by any
means, electronic, mechanical, photocopying, recording, or otherwise, without written
permission from the publisher, editor, author and author department.
ii
REAL TIME MACHINE LEARNING
LINEAR MODELS
iii
REAL TIME MACHINE LEARNING
Devoted to:
iv
REAL TIME MACHINE LEARNING
v
REAL TIME MACHINE LEARNING
vi
REAL TIME MACHINE LEARNING
CONTENTS
Two Kinds of Intelligence ...................................................................... v
1.0 Introduction ....................................................................... 10
2.0 The Real Time Learning Machine ....................................... 14
2.1 The Basic Elements Table ..................................................... 18
2.2 Attribute Types ...................................................................... 20
2.2.1 Missing Values ......................................................... 21
3.0 Real Time Data Exploration ...................................................... 25
3.1 Real Time Univariate Data Exploration ................................ 26
3.1.1 Count ......................................................................... 26
3.1.2 Mean (Average)......................................................... 27
3.1.3 Variance .................................................................... 28
3.1.4 Standard Deviation ................................................. 29
3.1.5 Coefficient of Variation .......................................... 29
3.1.6 Skewness ................................................................... 30
3.1.7 Kurtosis .................................................................... 31
3.1.8 Median ...................................................................... 33
3.1.9 Mode ......................................................................... 34
3.1.10 Minimum and Maximum...................................... 35
Summary of Real Time Univariate Data Analysis .......................... 36
Real Time Univariate Data Analysis - Examples............................ 37
3.2 Real Time Bivariate Data Analysis ........................................... 40
3.2.1 Covariance ................................................................... 40
3.2.2 Linear Correlation Coefficient .................................. 41
3.2.3 Conditional Univariate Statistics ............................... 42
3.2.4 Z test ............................................................................. 45
vii
REAL TIME MACHINE LEARNING
viii
REAL TIME MACHINE LEARNING
ix
REAL TIME MACHINE LEARNING
1.0 Introduction
10
REAL TIME MACHINE LEARNING
11
REAL TIME MACHINE LEARNING
12
REAL TIME MACHINE LEARNING
Note: The previous name for this book was “Real Time Data
Mining”. However, because “Data Mining” is almost an obsolete
term the name of this book has been changed to “Real Time
Machine Learning”.
13
REAL TIME MACHINE LEARNING
14
REAL TIME MACHINE LEARNING
The tasks assigned to each of the four real time components are as
follows:
• Learner: updates (incrementally or decrementally) the
Basic Elements Table utilizing the data in real time.
• Explorer: does univariate and bivariate statistical data
analysis using the Basic Elements Table in real time.
• Modeler: constructs models using the Basic Elements Table
in real time.
• Predictor: uses the models for prediction in real time.
15
REAL TIME MACHINE LEARNING
16
REAL TIME MACHINE LEARNING
17
REAL TIME MACHINE LEARNING
BET 𝑋𝑗
𝑋𝑖 𝐵𝑖𝑗
• ∑ 𝑋𝑖 𝑋𝑗 : Sum of multiplication
2
• ∑(𝑋𝑖 𝑋𝑗 ) : Sum of squared multiplication
18
REAL TIME MACHINE LEARNING
𝑛𝑒𝑤
𝐵𝑖𝑗 ∶= 𝐵𝑖𝑗 ± 𝐵𝑖𝑗
where:
• 𝐵𝑖𝑗 = 𝐵𝑗𝑖
19
REAL TIME MACHINE LEARNING
20
REAL TIME MACHINE LEARNING
98 1 0 0 88 0 1
75 0 1 0 70 1 0
90 1 0 0 96 0 1
78 0 1 0 ? 1 0
65 0 0 1 60 1 0
Table 2.3 Final transformed dataset with three new binary attributes which are
created by discretizing Temperature.
21
REAL TIME MACHINE LEARNING
The Basic Element Table for the above sample dataset with four
basic elements is shown below. The RTLM Learner updates the
basic elements table with any new incoming data.
𝑵𝒊𝒋 𝐗𝟏 𝐗𝟐 𝐗𝟑 𝐗𝟒 𝐗𝟓 𝐗𝟔 𝐗𝟕
∑ 𝑿𝒊 , ∑ 𝑿𝒋
∑ 𝑿𝒊 𝑿𝒋 Temp. Temp.hot Temp.moderate Temp.mild Humidity Play.yes Play.no
𝑿𝟏 5 5 5 5 4 5 5
𝑿𝟐 5 5 5 4 5 5
2, 2 2, 2 2, 1 2, 314 2, 3 2, 2
Temp.hot 2 0 0 184 0 2
𝑿𝟑 5 5 4 5 5
2, 2 2, 1 1, 314 2, 3 2, 2
Temp.moderate 2 0 70 2 0
𝑿𝟒 5 4 5 5
1, 1 1, 314 1, 3 1, 2
Temp.mild 1 60 1 0
𝑿𝟓 4 4 4
𝑿𝟔 5 5
3, 3 3, 2
Play.yes 3 0
𝑿𝟕 5
2, 2
Play.no 2
22
REAL TIME MACHINE LEARNING
Numerical Variable
∑ 𝑋1 406
𝐴𝑣𝑒𝑟𝑎𝑔𝑒 (𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒) = = = 81.2
𝑁1,1 5
(∑ 𝑋1 )2
∑ 𝑋1 𝑋1 −
𝑁1
𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 (𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒) =
𝑁1
(406)2
33638 −
= 5 = 134.16
5
Categorical Variable
𝐶𝑜𝑢𝑛𝑡(𝑃𝑙𝑎𝑦. 𝑦𝑒𝑠) = ∑ 𝑋6 = 3
∑ 𝑋6 3
𝑃𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦(𝑃𝑙𝑎𝑦. 𝑦𝑒𝑠) = = = 0.6
𝑁6,6 5
23
REAL TIME MACHINE LEARNING
∑ 𝑋1 ∑ 𝑋5
∑ 𝑋1 𝑋5 −
𝑁1,5
𝐶𝑜𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒 (𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒, 𝐻𝑢𝑚𝑖𝑑𝑖𝑡𝑦) =
𝑁1,5
(328 × 314)
26414 −
= 4 = 166.5
4
∑ 𝑋1 𝑋6 218
𝐴𝑣𝑒𝑟𝑎𝑔𝑒 (𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒|𝑃𝑙𝑎𝑦. 𝑦𝑒𝑠) = = = 72.7
∑ 𝑋6 3
∑ 𝑋4 𝑋7 0
𝑃𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦(𝑇𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒. 𝑚𝑖𝑙𝑑|𝑃𝑙𝑎𝑦. 𝑛𝑜) = = = 0.0
∑ 𝑋7 2
24
REAL TIME MACHINE LEARNING
25
REAL TIME MACHINE LEARNING
3.1.1 Count
𝑁 ∶= 𝑁 ± 𝑁 𝑛𝑒𝑤 (1)
26
REAL TIME MACHINE LEARNING
∑ 𝑋 ± ∑ 𝑋 𝑛𝑒𝑤
𝑋̅ = (3)
𝑁 ± 𝑁 𝑛𝑒𝑤
where:
• 𝑁: Count (1)
27
REAL TIME MACHINE LEARNING
3.1.3 Variance
(∑ 𝑋)2
∑ 𝑋2 −
𝑆2 = 𝑁
𝑁
Now, we can write real time equations for the variance and the
standard deviation using the basic elements.
28
REAL TIME MACHINE LEARNING
𝑛𝑒𝑤 2
2 (∑ 𝑋 ± ∑ 𝑋
2 𝑛𝑒𝑤 )
(∑ 𝑋 ± ∑ 𝑋 )− 𝑛𝑒𝑤
(𝑁 ± 𝑁 )
𝑆2 = 𝑛𝑒𝑤
(5)
(𝑁 ± 𝑁 )
Note: If the number of data is less than 30 we should replace N with N-1.
𝑛𝑒𝑤 2
(∑ 𝑋 ± ∑ 𝑋 )
√ (∑ 𝑋 2 ± ∑ 𝑋 2 𝑛𝑒𝑤 )
− 𝑛𝑒𝑤
(𝑁 ± 𝑁 )
𝑆 = √𝑆 2 = 𝑛𝑒𝑤
(6)
(𝑁 ± 𝑁 )
29
REAL TIME MACHINE LEARNING
𝑆
𝐶𝑉 = × 100% (7)
𝑋̅
where:
3.1.6 Skewness
3
𝑁 𝑋 − 𝑋̅
𝑆𝑘𝑒𝑤 = ∑( )
(𝑁 − 1)(𝑁 − 2) 𝑆
3
𝑋 − 𝑋̅ 𝑋 3 − 3𝑋 2 𝑋̅ + 3𝑋𝑋̅ 2 − 𝑋̅ 3
∑( ) = ∑( )
𝑆 𝑆3
1
= (∑ 𝑋 3 − 3𝑋̅ ∑ 𝑋 2 + 3𝑋̅ 2 ∑ 𝑋 − 𝑁𝑋̅ 3 )
𝑆3
30
REAL TIME MACHINE LEARNING
where:
• 𝑁: Count (1)
• ∑ 𝑋: Sum of data (2)
• 𝑋̅: Average (3)
• 𝑆: Standard Deviation (6)
𝑁 1
𝑆𝑘𝑒𝑤 = × (∑ 𝑋 3 − 3𝑋̅ ∑ 𝑋 2 + 3𝑋̅ 2 ∑ 𝑋 − 𝑁𝑋̅ 3 ) (9)
(𝑁 − 1)(𝑁 − 2) 𝑆 3
3.1.7 Kurtosis
4
𝑁(𝑁 + 1) 𝑋 − 𝑋̅ 3(𝑁 − 1)2
𝐾𝑢𝑟𝑡 = [ ∑( ) ]−
(𝑁 − 1)(𝑁 − 2)(𝑁 − 3) 𝑆 (𝑁 − 2)(𝑁 − 3)
31
REAL TIME MACHINE LEARNING
1
= (∑ 𝑋 4 − 4𝑋̅ ∑ 𝑋 3 + 6𝑋̅ 2 ∑ 𝑋 2 − 4𝑋̅ 3 ∑ 𝑋 + 𝑁𝑋̅ 4 )
𝑆4
where:
• 𝑁: Count (1)
32
REAL TIME MACHINE LEARNING
𝑁(𝑁 + 1)
𝐾𝑢𝑟𝑡 = [
(𝑁 − 1)(𝑁 − 2)(𝑁 − 3)
1
× 4 (∑ 𝑋 4 − 4𝑋̅ ∑ 𝑋 3 + 6𝑋̅ 2 ∑ 𝑋 2 − 4𝑋̅ 3 ∑ 𝑋 + 𝑁𝑋̅ 4 )]
𝑆
3(𝑁 − 1)2
− (11)
(𝑁 − 2)(𝑁 − 3)
3.1.8 Median
The median is the middle data point where below and above it, lie
an equal number of data points. The median equation is not real
time and cannot be directly transformed to. However, by using a
discretized (binned) version of the attribute we can often have a
good estimation of the median.
33
REAL TIME MACHINE LEARNING
𝑁+1
− 𝐹𝑗−1
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿1 + [ 2 ]×ℎ (12)
𝑁𝑗
3.1.9 Mode
34
REAL TIME MACHINE LEARNING
35
REAL TIME MACHINE LEARNING
All the above univariate real time statistical equations are based on
only five basic elements. To calculate any of the univariate
quantities, we only need to save and update the following five
elements in the Basic Elements Table (BET).
𝑁 ∶= 𝑁 ± 𝑁 𝑛𝑒𝑤
36
REAL TIME MACHINE LEARNING
sepal_length
indices 𝑖 = 1 𝑎𝑛𝑑 𝑗 = 1
Count 𝑁 = 𝑁1,1 = 150
∑ 𝑋1 876.5
Mean 𝑋̅ = = = 5.843
𝑁 150
(∑ 𝑋1 )2 (876.5)2
Variance ∑ 𝑋12 − 5223.85 −
𝑆2 = 𝑁 = 150 = 0.681
𝑁 150
Standard
𝑆 = √𝑆 2 = √0.681 = 0.825
Deviation
Coefficient 𝑆 0.825
𝐶𝑉 = × 100% = × 100% = 14%
of Variation ̅
𝑋 5.843
37
REAL TIME MACHINE LEARNING
𝑁
𝑆𝑘𝑒𝑤 =
(𝑁 − 1)(𝑁 − 2)
1
× (∑ 𝑋13 − 3𝑋̅ ∑ 𝑋12 + 3𝑋̅ 2 ∑ 𝑋1
𝑆3
− 𝑁𝑋̅ 3 )
Skewness 150
=
(150 − 1)(150 − 2)
1
× (31745 − 3 × 5.843 × 5223.85
0.8253
+ 3 × 5.8432 × 876.5 − 150 × 5.8433 )
= 0.32
𝑁(𝑁 + 1)
𝐾𝑢𝑟𝑡 = [
(𝑁 − 1)(𝑁 − 2)(𝑁 − 3)
1
× (∑ 𝑋14 − 4𝑋̅ ∑ 𝑋13 + 6𝑋̅ 2 ∑ 𝑋12
𝑆4
3(𝑁 − 1)2
− 4𝑋̅ 3 ∑ 𝑋1 + 𝑁𝑋̅ 4 )] −
(𝑁 − 2)(𝑁 − 3)
150 × (150 + 1)
=[
(150 − 1) × (150 − 2) × (150 − 3)
Kurtosis 1
× (196591.7 − 4 × 5.843 × 31745
0.8254
+ 6 × 5.8432 × 5223.85
3 × (150 − 1)2
− = −0.52
(150 − 2) × (150 − 3)
where:
2
∑ 𝑋 4 = ∑(𝑋𝑖 𝑋𝑗 ) = ∑(𝑋1 𝑋1 )2 = ∑(𝑋12 )2 = ∑ 𝑋14
38
REAL TIME MACHINE LEARNING
Iris_setosa
indices 𝑖 = 13 𝑎𝑛𝑑 𝑗 = 13
∑ 𝑋13 50
Mean 𝑋̅ = 𝑃𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 = = = 0.333
𝑁 150
2 (∑ 𝑋13 )2 (50)2
∑ 𝑋13 − 50 −
𝑆2 = 𝑁 = 150 = 0.222
𝑁 150
Variance
Standard
𝑆 = √𝑆 2 = √𝑃(1 − 𝑃) = √0.222 = 0.471
Deviation
39
REAL TIME MACHINE LEARNING
3.2.1 Covariance
∑ 𝑋𝑖 ∑ 𝑋𝑗
∑ 𝑋𝑖 𝑋𝑗 −
𝑁𝑖𝑗
𝐶𝑜𝑣𝑎𝑟(𝑋𝑖 , 𝑋𝑗 ) = (13)
𝑁𝑖𝑗
where:
• 𝑁: Count (1)
40
REAL TIME MACHINE LEARNING
2 2 2 𝑛𝑒𝑤
∑(𝑋𝑖 𝑋𝑗 ) ∶= ∑(𝑋𝑖 𝑋𝑗 ) ± ∑(𝑋𝑖 𝑋𝑗 ) (15)
𝐶𝑜𝑣𝑎𝑟(𝑋𝑖 , 𝑋𝑗 )
𝑅= (16)
√𝑆𝑖2 × 𝑆𝑗2
41
REAL TIME MACHINE LEARNING
where:
2
𝑆𝑢𝑚 𝑜𝑓 𝑆𝑞𝑢𝑎𝑟𝑒𝑠(𝑋𝑖 |𝑋𝑗 = 1) = ∑ 𝑋𝑖|𝑗 = ∑(𝑋𝑖 𝑋𝑗 )2 (19)
42
REAL TIME MACHINE LEARNING
∑ 𝑋𝑖|𝑗 ∑ 𝑋𝑖 𝑋𝑗
𝑀𝑒𝑎𝑛(𝑋𝑖 |𝑋𝑗 = 1) = 𝑋̅𝑖|𝑗 = = (20)
𝑁𝑖|𝑗 ∑ 𝑋𝑗
2
(∑ 𝑋𝑖|𝑗 )
∑ 2
2
√ 𝑋𝑖|𝑗 − 𝑁
𝑖|𝑗
𝑆𝑖|𝑗 = √𝑆𝑖|𝑗 =
𝑁𝑖|𝑗
2
(∑ 𝑋𝑖 𝑋𝑗 )
√
∑(𝑋𝑖 𝑋𝑗 )2−
∑ 𝑋𝑗
= (22)
∑ 𝑋𝑗
43
REAL TIME MACHINE LEARNING
2
𝑆𝑢𝑚 𝑜𝑓 𝑆𝑞𝑢𝑎𝑟𝑒𝑠(𝑋𝑖 |𝑋𝑗 = 0) = ∑ 𝑋𝑖|𝑗̅
∑ 𝑋𝑖|𝑗̅ ∑ 𝑋𝑖 − ∑ 𝑋𝑖 𝑋𝑗
𝑀𝑒𝑎𝑛(𝑋𝑖 |𝑋𝑗 = 0) = 𝑋̅𝑖|𝑗̅ = = (26)
𝑁𝑖|𝑗̅ 𝑁𝑖𝑗 − ∑ 𝑋𝑗
44
REAL TIME MACHINE LEARNING
2
(∑ 𝑋𝑖|𝑗̅ )
∑ 2
2
√ 𝑋𝑖|𝑗̅ − 𝑁
𝑖|𝑗̅
𝑆𝑖|𝑗̅ = √𝑆𝑖|𝑗̅ =
𝑁𝑖|𝑗̅
2
(∑ 𝑋𝑖 − ∑ 𝑋𝑖 𝑋𝑗 )
(∑ 𝑋𝑖2 − ∑(𝑋𝑖 𝑋𝑗 )2 ) −
√ (𝑁𝑖,𝑗 − ∑ 𝑋𝑗 )
= (28)
(𝑁𝑖𝑗 − ∑ 𝑋𝑗 )
3.2.4 Z test
45
REAL TIME MACHINE LEARNING
𝑋̅ − 𝜇0
𝑍= (29)
𝑆
√𝑁
where:
(6 or 22)
• 𝜇0 : known average
𝑋̅1 − 𝑋̅2
𝑍= (30)
𝑆12 𝑆22
√
𝑁1 + 𝑁2
where:
46
REAL TIME MACHINE LEARNING
3.2.5 T test
𝑋̅ − 𝜇0
𝑡= (31)
𝑆
√𝑁
where:
(6 or 22)
• 𝜇0 : Known average
47
REAL TIME MACHINE LEARNING
𝑋̅1 − 𝑋̅2
𝑡= (32)
1 1
√𝑆 2 (
𝑁1 + 𝑁2 )
2
(𝑁1 − 1)𝑆12 + (𝑁2 − 1)𝑆22
𝑆 =
𝑁1 + 𝑁2 − 2
where:
3.2.6 F test
48
REAL TIME MACHINE LEARNING
(𝑁 − 1)𝑆 2
2
𝜒 = (33)
𝜎02
where:
𝑆12
𝐹= 2 (34)
𝑆2
where:
49
REAL TIME MACHINE LEARNING
ANOVA assesses whether the averages of more than two groups are
statistically different from each other, under the assumption that the
corresponding populations are normally distributed. ANOVA is
useful for comparing averages of two or more numerical attributes
or two or more conditional averages of a numerical attribute given
two or more binary attributes (two or more categories of the same
categorical attribute).
Within 𝑆𝑆𝑤
𝑆𝑆𝑊 𝑑𝑓𝑤 𝑀𝑆𝑤 =
Groups 𝑑𝑓𝑤
𝑑𝑓𝐵 = 𝑀 − 1
50
REAL TIME MACHINE LEARNING
where:
𝑑𝑓𝑤 = ∑ 𝑁𝑚 − 𝑀
𝑚=1
where:
𝑑𝑓𝑇 = ∑ 𝑁𝑚 − 1
𝑚=1
51
REAL TIME MACHINE LEARNING
𝑛
− 𝑃0
𝑍= 𝑁 (38)
√𝑃0 (1 − 𝑃0 )
𝑁
where:
• 𝑃0 : known probability
52
REAL TIME MACHINE LEARNING
𝑛1 + 𝑛2
𝑃̂ =
𝑁1 + 𝑁2
where:
53
REAL TIME MACHINE LEARNING
𝑛𝑖. 𝑛.𝑗
𝑒=
𝑛
𝑑𝑓 = (𝑟 − 1)(𝑐 − 1)
where:
54
REAL TIME MACHINE LEARNING
freedom
55
REAL TIME MACHINE LEARNING
All the above real time bivariate statistical equations are based on
only 5 basic elements. As in the case of the real time univariate
statistical analysis, to calculate the required statistical quantities we
only need to save and update these five elements in the basic
elements table (BET).
𝑁 ∶= 𝑁 ± 𝑁 𝑛𝑒𝑤
56
REAL TIME MACHINE LEARNING
2 2 2 𝑛𝑒𝑤
∑(𝑋𝑖 𝑋𝑗 ) ∶= ∑(𝑋𝑖 𝑋𝑗 ) ± ∑(𝑋𝑖 𝑋𝑗 )
directly from the above five basic elements. These equations define
when 𝑋𝑗 = 1.
2
𝑆𝑢𝑚 𝑜𝑓 𝑆𝑞𝑢𝑎𝑟𝑒𝑠(𝑋𝑖 |𝑋𝑗 = 1) = ∑ 𝑋𝑖|𝑗 = ∑(𝑋𝑖 𝑋𝑗 )2
57
REAL TIME MACHINE LEARNING
2
𝑆𝑢𝑚 𝑜𝑓 𝑆𝑞𝑢𝑎𝑟𝑒𝑠(𝑋𝑖 |𝑋𝑗 = 0) = ∑ 𝑋𝑖|𝑗̅ = ∑ 𝑋𝑖2 − ∑(𝑋𝑖 𝑋𝑗 )2
58
REAL TIME MACHINE LEARNING
59
REAL TIME MACHINE LEARNING
indices 𝑖 = 1 𝑎𝑛𝑑 𝑗 = 3
∑ 𝑋1 ∑ 𝑋3
∑ 𝑋1 𝑋3 −
𝑁1,3
𝐶𝑜𝑣𝑎𝑟(𝑋1 , 𝑋3 ) =
𝑁1,3
Covariance
876.5 × 563.8
3484.25 −
= 150 = 1.265
150
𝐶𝑜𝑣𝑎𝑟(𝑋1 , 𝑋3 ) 1.265
𝑅= = = 0.872
√𝑆12 × 𝑆32 √0.681 × 3.092
where:
Linear (∑ 𝑋1 )2 (876.5)2
∑ 𝑋12 − 5223.85 −
𝑁1,3 150 = 0.681
Correlation 𝑆12 = =
𝑁1,3 150
(∑ 𝑋3 )2 (563.8)2
∑ 𝑋32 − 2583 −
𝑁1,3 150 = 3.092
𝑆32 = =
𝑁1,3 150
60
REAL TIME MACHINE LEARNING
indices 𝑖 = 1 𝑎𝑛𝑑 𝑗 = 7, 8
where:
∑ 𝑋1 𝑋7 339.3
𝑋̅1 = 𝑋̅1|7 = = = 5.953
∑ 𝑋7 57
∑ 𝑋1 𝑋8 537.2
Z test 𝑋̅2 = 𝑋̅1|8 = = = 5.776
∑ 𝑋8 93
(∑ 𝑋1 𝑋7 )2 (339.3)2
∑(𝑋1 𝑋7 )2 − 2045.37 −
2 ∑ 𝑋7 57
𝑆12 = 𝑆1|7 = = = 0.450
∑ 𝑋7 57
(∑ 𝑋1 𝑋8 )2 (537.2)2
∑(𝑋1 𝑋8 )2 − 3178.48 −
2 ∑ 𝑋8 93
𝑆22 = 𝑆1|8 = = = 0.811
∑ 𝑋8 93
𝑁1 = 𝑁1|7 = ∑ 𝑋7 = 57
𝑁2 = 𝑁1|8 = ∑ 𝑋8 = 93
61
REAL TIME MACHINE LEARNING
𝑟 𝑐
2
(𝑛𝑖𝑗 − 𝑒𝑖𝑗 )2
𝜒 = ∑∑
𝑒𝑖𝑗
𝑖=1 𝑗=1
𝑑𝑓 = (2 − 1) × (3 − 2) = 2
𝑃(𝜒 2 , 𝑑𝑓) = 𝑃(150,2) = 0
Chi2
test 𝑛𝑖𝑗 Iris_setosa Iris_versicolor Iris_virginica
∑ 𝑋11|13 ∑ 𝑋11|15
petal_width_b1 ∑ 𝑋11|14 = 0 50
= 50 =0
50 50 50 150
62
REAL TIME MACHINE LEARNING
50 × 50 50 × 50 50 × 50
petal_width_b1 150 150 150 50
= 16.67 = 16.67 = 16.67
50 × 100 50 × 100 50 × 100
petal_width_b2 150 150 150 100
= 33.33 = 33.33 = 33.33
50 50 50 150
63
REAL TIME MACHINE LEARNING
• Naïve Bayesian
• Linear Discriminant Analysis
• Linear Support Vector Machines
64
REAL TIME MACHINE LEARNING
65
REAL TIME MACHINE LEARNING
Bayes’ Rule:
𝑃(𝑐) × 𝑃(𝑎|𝑐)
𝑃(𝑐|𝑎) =
𝑃(𝑎)
∑ 𝑋𝑎 𝑋𝑐
𝑃(𝑎|𝑐) = 𝑋̅𝑎|𝑐 = (41)
∑ 𝑋𝑐
∑ 𝑋𝑐
𝑃(𝑐) = 𝑋̅𝑐 = (42)
𝑁𝑐
66
REAL TIME MACHINE LEARNING
∑ 𝑋𝑎
𝑃(𝑎) = 𝑋̅𝑎 = (43)
𝑁𝑎
where:
If the attribute (A) is numerical the likelihood for its value (a) can be
1 (𝑎−𝑋̅ )
−
𝑃(𝑎|𝑐) = 𝑒 2𝑆 2
√2𝜋 × 𝑆 2
∑ 𝑋𝑎 𝑋𝑐
𝑋̅ = 𝑋̅𝑎|𝑐 =
∑ 𝑋𝑐
67
REAL TIME MACHINE LEARNING
(∑ 𝑋𝑎 𝑋𝑐 )2
∑(𝑋𝑎 𝑋𝑐 )2 −
2 ∑ 𝑋𝑐
𝑆 2 = 𝑆𝑎|𝑐 =
∑ 𝑋𝑐
where:
2
• 𝑆𝑎|𝑐 : Conditional Variance (21)
Attribute Contribution
Kononenko's information gain as a sum of information contributed
by each attribute (A) can offer an explanation on how values of the
predictors influence the class probability.
68
REAL TIME MACHINE LEARNING
69
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any Bayesian quantity, we
only need to use the elements of the Basic Elements Table (BET)
generated from the transformed Iris dataset.
indices 𝑖 = 5 𝑎𝑛𝑑 𝑗 = 14
∑ 𝑋14
𝑃(𝑐) = 𝑃(𝐼𝑟𝑖𝑠_𝑣𝑒𝑟𝑠𝑖𝑐𝑜𝑙𝑜𝑟 = 1) = 𝑋̅14 =
Class Prior 𝑁14,14
Probability 50
= = 0.333
150
70
REAL TIME MACHINE LEARNING
∑ 𝑋5
Attribute 𝑃(𝑎) = 𝑃(𝑠𝑒𝑝𝑎𝑙_𝑙𝑒𝑛𝑔𝑡ℎ_𝑏1 = 1) = 𝑋̅15 =
𝑁5,5
Prior
83
Probability = = 0.553
150
indices 𝑖 = 1 𝑎𝑛𝑑 𝑗 = 14
𝑃(𝑐|𝑎) = 𝑃(𝐼𝑟𝑖𝑠_𝑣𝑒𝑟𝑠𝑖𝑐𝑜𝑙𝑜𝑟 = 1|𝑠𝑝𝑒𝑎𝑙_𝑙𝑒𝑛𝑔𝑡ℎ = 6)
Posterior 𝑃(𝑐) × 𝑃(𝑎|𝑐) 0.333 × 0.69
= =
Probability 𝑃(𝑎) 0.431
= 0.533
∑ 𝑋14
𝑃(𝑐) = 𝑃(𝐼𝑟𝑖𝑠_𝑣𝑒𝑟𝑠𝑖𝑐𝑜𝑙𝑜𝑟 = 1) = 𝑋̅14 =
Class Prior 𝑁14,14
Probability 50
= = 0.333
150
71
REAL TIME MACHINE LEARNING
where:
Likelihood
∑ 𝑋1 𝑋14 296.8
𝑋̅ = 𝑋̅1|14 = = = 5.936
∑ 𝑋14 50
(∑ 𝑋1 𝑋14 )2
∑(𝑋1 𝑋14 )2 −
2 ∑ 𝑋14
𝑆 2 = 𝑆1|14 =
∑ 𝑋14
(296.8)2
1774.9 −
= 50 = 0.262
50
1 (𝑥−𝑋̅ )
−
𝑃(𝑎) = 𝑃(𝑠𝑒𝑝𝑎𝑙_𝑙𝑒𝑛𝑔𝑡ℎ = 6) = 𝑒 2𝑆 2
√2𝜋 × 𝑆 2
1 (6−5.843)
= 𝑒− 2×0.681 = 0.431
√6.28 × 0.681
Attribute
Prior
where:
Probability
∑ 𝑋 876.5
𝑋̅ = = = 5.843
𝑁 150
(∑ 𝑋)2 (876.5)2
∑ 𝑋2 − 5223.85 −
𝑆2 = 𝑁 = 150 = 0.68
𝑁 150
72
REAL TIME MACHINE LEARNING
73
REAL TIME MACHINE LEARNING
a binary attribute.
𝑍 = 𝛽1 𝑋1 + 𝛽2 𝑋2 + 𝛽3 𝑋3 + ⋯ + 𝛽𝑛 𝑋𝑛
score function.
𝜷𝑇 𝝁0 − 𝜷𝑇 𝝁1 𝑍̅0 − 𝑍1̅
𝑆(𝜷) = =
𝜷𝑇 𝑪𝜷 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 𝑜𝑓 𝑍 𝑤𝑖𝑡ℎ𝑖𝑛 𝑔𝑟𝑜𝑢𝑝𝑠
1
𝑪= (𝑁 𝑪 + 𝑁1 𝑪1 )
𝑁0 + 𝑁1 0 0
where:
74
REAL TIME MACHINE LEARNING
where:
(24)
75
REAL TIME MACHINE LEARNING
The pooled variance equation (29) is valid if 𝑁𝑖|𝑘 = 𝑁𝑗|𝑘 and two
attributes should not have uneven number of missing values.
Otherwise, it is better to use the overall covariance instead of the
pooled covariance.
∑ 𝑋𝑖 ∑ 𝑋𝑗
∑ 𝑋𝑖 𝑋𝑗 −
𝑁𝑖𝑗
𝐶𝑖𝑗 =
𝑁𝑖𝑗
where:
Now, by having the covariance matrix and given the score function,
the problem is to estimate the linear coefficients that maximize the
score which can be solved by the following equations.
𝜷 = 𝑪−1 (𝝁0 − 𝝁1 )
1 𝑃(𝑐0 )
𝜷𝑇 (X - (𝝁0 − 𝝁1 )) > − 𝑙𝑜𝑔
2 𝑃(𝑐1 )
𝑁0 𝑁1
𝑃(𝑐0 ) = 𝑃(𝑐1 ) =
𝑁0 + 𝑁1 𝑁0 + 𝑁1
76
REAL TIME MACHINE LEARNING
where:
∆2 = 𝜷𝑇 (𝝁𝟎 − 𝝁𝟏 )
1 1
𝑍𝑘 (𝑿) = − (𝑿 − 𝝁𝑘 )𝑇 𝑪−1
𝑘 (𝑿 − 𝝁𝑘 ) − ln|𝑪𝑘 | + ln𝑃(𝑐𝑘 )
2 2
Where 𝑪−1
𝑘 is the inverse covariance matrix for the class k and 𝝁𝑘 is
77
REAL TIME MACHINE LEARNING
The real time version of QDA using the Basic Elements Table is
only possible if we replace the covariance matrix for each class 𝑪𝑘
with the overall covariance matrix 𝑪.
1 1
𝑍𝑘 (𝑿) = − (𝑿 − 𝝁𝑘 )𝑇 𝑪−𝟏 (𝑿 − 𝝁𝑘 ) − ln|𝑪𝑘 | + ln𝑃(𝑐𝑘 )
2 2
78
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any LDA quantity, we only
need to use the elements of the Basic Elements Table (BET)
generated from the transformed Iris dataset.
𝝁0 5.797 1.135
𝑁0 𝑃(0)
0 0.8936 0.7414
= 100 = 0.666 𝑪0 [ ]
0.7414 0.8449
0.6846 0.5129
𝑪=[ ]
0.5129 0.5763
79
REAL TIME MACHINE LEARNING
𝐶𝑜𝑢𝑛𝑡(𝑋1|𝑋14 = 0) = 𝑁1|14
̅̅̅̅ = 𝑁1,14 − ∑ 𝑋14
𝑁0
= 150 − 50 = 100
∑ 𝑋1|14
̅̅̅̅ ∑ 𝑋1 − ∑ 𝑋1 𝑋14
𝑀𝑒𝑎𝑛(𝑋1|𝑋14 = 0) = 𝑋̅1|14
̅̅̅̅ = =
𝑁1|14
̅̅̅̅ 𝑁1,14 − ∑ 𝑋14
876.5 − 296.8
= = 5.797
100
𝝁0
∑ 𝑋4|14
̅̅̅̅ ∑ 𝑋4 − ∑ 𝑋4 𝑋14
𝑀𝑒𝑎𝑛(𝑋4|𝑋14 = 0) = 𝑋̅4|14
̅̅̅̅ = =
𝑁4|14
̅̅̅̅ 𝑁4,14 − ∑ 𝑋14
179.8 − 66.3
= = 1.135
100
∑ 𝑋1|14 ∑ 𝑋1 𝑋14 296.8
𝑀𝑒𝑎𝑛(𝑋1|𝑋14 = 1) = 𝑋̅1|14 = = =
𝑁1|14 ∑ 𝑋14 50
= 5.936
𝝁1
∑ 𝑋4|14 ∑ 𝑋4 𝑋14 66.3
𝑀𝑒𝑎𝑛(𝑋4|𝑋14 = 1) = 𝑋̅4|14 = = =
𝑁4|14 ∑ 𝑋14 50
= 1.326
80
REAL TIME MACHINE LEARNING
∑ 𝑋1|14
̅̅̅̅ ∑ 𝑋1|14
̅̅̅̅ ∑ 𝑋1|14 ∑ 𝑋1|14
∑ 𝑋1 𝑋1 − −
𝑁0 𝑁1
𝐶1,1 =
𝑁0 + 𝑁1 − 2
579.7 × 579.7 296.8 × 296.8
5223.9 − −
= 100 50 = 0.6846
100 + 50 − 2
∑ 𝑋1|14
̅̅̅̅ ∑ 𝑋4|14
̅̅̅̅ ∑ 𝑋1|14 ∑ 𝑋4|14
∑ 𝑋1 𝑋4 − −
𝑁0 𝑁1
𝐶1,4 = 𝐶4,1 =
𝑁0 + 𝑁1 − 2
𝑪 579.7 × 113.5 296.8 × 66.3
1127.7 − −
= 100 50
100 + 50 − 2
= 0.5129
∑ 𝑋4|14
̅̅̅̅ ∑ 𝑋4|14
̅̅̅̅ ∑ 𝑋4|14 ∑ 𝑋4|14
∑ 𝑋4 𝑋4 − −
𝑁0 𝑁1
𝐶4,4 =
𝑁0 + 𝑁1 − 2
113.5 × 113.5 66.3 × 66.3
302.3 − −
= 100 50
100 + 50 − 2
= 0.5763
81
REAL TIME MACHINE LEARNING
Algorithm
82
REAL TIME MACHINE LEARNING
Quadratic Programming.
83
REAL TIME MACHINE LEARNING
Figure 4.8 The slack variable 𝜉 allows some instances to fall off the
margin, but penalize them.
84
REAL TIME MACHINE LEARNING
85
REAL TIME MACHINE LEARNING
= +1, 𝑡ℎ𝑒𝑛 𝒙 ∈ 𝐴 +
𝑠𝑖𝑔𝑛(𝒙′ 𝝎 − 𝛾) {
= −1, 𝑡ℎ𝑒𝑛 𝒙 ∈ 𝐴 −
𝑬 = [𝑨 − 𝒆]
𝝎
(ii) Compute [𝛾 ] by for some positive v:
−1
𝝎 𝑰
[𝛾 ] = ( + 𝑬′𝑬) 𝑬′𝑫𝒆
𝑣
86
REAL TIME MACHINE LEARNING
(iii) Classify a new case by using the Linear Classifier and the
𝝎
solution [𝛾 ] from the step above.
87
REAL TIME MACHINE LEARNING
𝑬′𝑬
∑ 𝑋1 𝑋1 ∑ 𝑋1 𝑋𝑖 ∑ 𝑋1 𝑋𝑛 − ∑ 𝑋1
∑ 𝑋𝑖 𝑋1 ∑ 𝑋𝑖 𝑋𝑖 ∑ 𝑋𝑖 𝑋𝑛 − ∑ 𝑋𝑖
∑ 𝑋𝑛 𝑋1 ∑ 𝑋𝑛 𝑋𝑖 ∑ 𝑋𝑛 𝑋𝑛 − ∑ 𝑋𝑛
− ∑ 𝑋1 − ∑ 𝑋𝑖 − ∑ 𝑋𝑛 𝑁
𝑬′𝑫𝒆
2 ∑ 𝑋1 𝑋𝑡𝑎𝑟𝑔𝑒𝑡 − ∑ 𝑋1
2 ∑ 𝑋𝑖 𝑋𝑡𝑎𝑟𝑔𝑒𝑡 − ∑ 𝑋𝑖
2 ∑ 𝑋𝑛 𝑋𝑡𝑎𝑟𝑔𝑒𝑡 − ∑ 𝑋𝑛
𝑁𝑡𝑎𝑟𝑔𝑒𝑡 − 2 ∑ 𝑋𝑡𝑎𝑟𝑔𝑒𝑡
88
REAL TIME MACHINE LEARNING
where:
• 𝑁 : Count (1)
The above equations are valid if all attributes have the same count
(N) and also do not have uneven number of missing values.
However, in practice the maximum count can be tried in this
situation.
89
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any LSVM quantity, we
only need to use the elements of the Basic Elements Table (BET)
generated from the transformed Iris dataset.
−1 𝜔1 = −0.0606
LSVM 𝝎 𝐼 ′
[𝛾 ] = ( + 𝑬′𝑬) 𝑬 𝑫𝒆 = [ 𝜔2 = 0.0560 ]
𝑣 𝛾 = 0.0231
𝑬′𝑬
𝑬′𝑬
∑ 𝑋4 𝑋1 = 1127.7 ∑ 𝑋4 𝑋4 = 302.3 − ∑ 𝑋4 = −179.8
90
REAL TIME MACHINE LEARNING
𝑬′𝑫𝒆
𝑬′𝑫𝒆
2 ∑ 𝑋4 𝑋14 − ∑ 𝑋4 = 2 × 66.3 − 179.8 = −47.2
1 1
𝑣= = = 0.0067 ; 𝐼 = 𝑠𝑝𝑒𝑦𝑒(3)
𝑁3 150
91
REAL TIME MACHINE LEARNING
PCR)
92
REAL TIME MACHINE LEARNING
𝑌 = 𝑏0 + 𝑏1 𝑋 + 𝜀
𝐶𝑜𝑣𝑎𝑟(𝑋, 𝑌)
𝑏1 = (47)
𝑉𝑎𝑟(𝑋)
𝑏0 = 𝑌̅ − 𝑏1 𝑋̅ (48)
where:
93
REAL TIME MACHINE LEARNING
Figure 5.1 Simple Linear Regression for “petal length” and “petal
width” (see the Iris dataset in the Appendix A).
94
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any SLR quantity, we only
need to use the elements of the Basic Elements Table generated
from the Iris dataset.
petal_width (Y)
petal_length (X)
indices 𝑖 = 3 𝑎𝑛𝑑 𝑗 = 4
𝑝𝑒𝑡𝑎𝑙_𝑤𝑖𝑑𝑡ℎ = 𝑏0 + 𝑏1 × 𝑝𝑒𝑡𝑎𝑙_𝑙𝑒𝑛𝑔𝑡ℎ
SLR
𝑝𝑒𝑡𝑎𝑙_𝑤𝑖𝑑𝑡ℎ = −0.3668 + 0.4165 × 𝑝𝑒𝑡𝑎𝑙_𝑙𝑒𝑛𝑔𝑡ℎ
𝐶𝑜𝑣𝑎𝑟(𝑋3 , 𝑋4 ) 1.288
𝑏1 = = = 0.4165
𝑉𝑎𝑟(𝑋3 ) 3.092
Coefficients
𝑏0 = 𝑋̅4 − 𝑏1 𝑋̅3 = 1.1987 − 0.4165 × 3.7587
= −0.3668
∑ 𝑋3 ∑ 𝑋4
∑ 𝑋3 𝑋4 −
𝑁3,4
𝐶𝑜𝑣𝑎𝑟(𝑋3 , 𝑋4 ) =
𝑁3,4
Covariance
563.8 × 179.8
869.0 −
= 150 = 1.288
150
95
REAL TIME MACHINE LEARNING
(∑ 𝑋3 )2 (563.8)2
∑ 𝑋32 − 2583.0 −
𝑁3,4 150
Variance 𝑉𝑎𝑟(𝑋3 ) = =
𝑁3,4 150
= 3.092
∑ 𝑋3 563.8
𝑋̅3 = = = 3.7587
𝑁3,4 150
Averages
∑ 𝑋4 179.8
𝑋̅4 = = = 1.1987
𝑁3,4 150
96
REAL TIME MACHINE LEARNING
Observed data:
𝑌 = 𝑏0 + 𝑏1 𝑋1 + 𝑏2 𝑋2 + ⋯ + 𝑏𝑝 𝑋𝑝 + 𝜀
Predicted data:
𝑌′ = 𝑏0 + 𝑏1 𝑋1 + 𝑏2 𝑋2 + ⋯ + 𝑏𝑝 𝑋𝑝
Error:
𝜀 = 𝑌 − 𝑌′
𝑚𝑖𝑛𝑖𝑚𝑖𝑧𝑒 ∑(𝑌 − 𝑌 ′ )2
97
REAL TIME MACHINE LEARNING
𝑏0 = 𝑌̅ − 𝒃𝑿
̅ (50)
where:
• 𝑪−𝟏
𝑿𝑿 : Inverse covariance matrix between attributes
(independent variables)
98
REAL TIME MACHINE LEARNING
There are just three steps to build a real time MLR model using the
(independent variables).
𝑪𝑿𝑿 𝑋1 𝑋𝑗 𝑋𝑝
where:
• 𝐶𝑖𝑗 = 𝐶𝑗𝑖
99
REAL TIME MACHINE LEARNING
𝑪𝑿𝒀 𝑌
𝑋1 𝐶1𝑦
𝑋𝑖 𝐶𝑖𝑦
𝑋𝑝 𝐶𝑝𝑦
where:
−1
𝒃 = 𝑪𝑋𝑋 𝑪𝑿𝒀
where:
−1
• 𝑪𝑋𝑋 : Inverse covariance matrix between attributes
terms.
100
REAL TIME MACHINE LEARNING
Coefficient of Determination:
𝑆𝑆𝑅 𝑆𝑆𝐸
𝑅2 = = 1−
𝑆𝑆𝑇 𝑆𝑆𝑇
101
REAL TIME MACHINE LEARNING
(𝑛 − 1)𝑆𝑆𝑅
𝐹=
(𝑛 − 𝑝 − 1)𝑆𝑆𝑇
𝑑𝑓1 = 𝑁 − 1
𝑑𝑓2 = 𝑁 − 𝑝 − 1
𝑏𝑖 𝑏𝑖
𝑡(𝑁−𝑝−1) = =
𝑆𝑒 (𝑏𝑖 ) √𝑆𝑒2 (𝑿′𝑿)−1
102
REAL TIME MACHINE LEARNING
𝑆𝑆𝐸
𝑆𝑒 = √
𝑁−𝑝−1
𝑑𝑓 = 𝑁 − 𝑝 − 1
𝑿′𝑿 𝑋1 𝑋𝑗 𝑋𝑝
𝑋1 ∑ 𝑋1 𝑋1 ∑ 𝑋1 𝑋𝑗 ∑ 𝑋1 𝑋𝑝
𝑋𝑖 ∑ 𝑋𝑖 𝑋1 ∑ 𝑋𝑖 𝑋𝑗 ∑ 𝑋𝑖 𝑋𝑝
𝑋𝑝 ∑ 𝑋𝑝 𝑋1 ∑ 𝑋𝑝 𝑋𝑗 ∑ 𝑋𝑝 𝑋𝑝
where:
• 𝑝 : number of attributes
• 𝑁 : Count (1)
• 𝑿′ 𝑿 : Sum of Multiplication (14)
103
REAL TIME MACHINE LEARNING
Multicolinearity
important attributes.
104
REAL TIME MACHINE LEARNING
Model Selection
Backward Elimination
We start with a model with all selected attributes and eliminate the
least statistically significant attribute using t-test. We stop the
elimination procedure when the least significant attribute has
probability, 𝑃(𝑡, 𝑑𝑓) less than the predefined threshold (e.g., 0.05).
Forward Selection
We start with no attributes in the model, trying out the attributes one
by one and including them in the model if they are statistically
significant. In the first round of iterations, we build a simple linear
regression model for each attribute, and calculate the improvement
in the sum of squares of error for each of these resulting models
relative to the intercept only model using F-test and select the
105
REAL TIME MACHINE LEARNING
This one-term model provides a new starting model for the next
round. We stop the forward selection procedure if in any rounds the
lowest candidate P-value is not lower than the predefined
probability.
106
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any MLR quantity, we only
need to use the elements of the Basic Elements Table generated
from the Iris dataset.
𝑏0 = 𝑌̅ − 𝒃𝑿
̅ = 1.1987 − 1.4485 = −0.2498
107
REAL TIME MACHINE LEARNING
∑ 𝑋𝑖 ∑ 𝑋𝑗
∑ 𝑋𝑖 𝑋𝑗 −
𝑁𝑖𝑗
𝐶𝑜𝑣𝑎𝑟(𝑋𝑖 , 𝑋𝑗 ) =
𝑁𝑖𝑗
𝑪𝑋𝑋 𝑋1 𝑋2 𝑋3
Covariance
Matrix 𝑋1 𝐶1,1 = 0.6811 𝐶1,2 = −0.0390 𝐶1,3 = 1.2652
𝑪𝑋𝑌 𝑋4
𝑋3 𝐶3,4 = 1.2877
∑ 𝑋4 179.8
𝑌̅ = = = 1.1987
𝑁4,4 150
∑ 𝑋1 876.5
𝑋̅1 = = = 5.8433
𝑁1,1 150
Averages ∑ 𝑋2 458.1
𝑋̅2 = = = 3.0540
𝑁2,2 150
∑ 𝑋3 563.8
𝑋̅3 = = = 3.7587
𝑁3,3 150
̅ = [5.8433
𝑿 3.0540 3.7587]
108
REAL TIME MACHINE LEARNING
𝑪𝒆 = 𝝀𝒆 (52)
where:
2. 𝒆 : Eigenvectors
3. 𝝀 : Eigenvalues
the data
109
REAL TIME MACHINE LEARNING
110
REAL TIME MACHINE LEARNING
indices 𝑖, 𝑗 = 1, 2, 3, 4
PCA 𝑪𝒆 = 𝝀𝒆
∑ 𝑋𝑖 ∑ 𝑋𝑗
∑ 𝑋𝑖 𝑋𝑗 −
𝑁𝑖𝑗
𝐶𝑜𝑣𝑎𝑟(𝑋𝑖 , 𝑋𝑗 ) =
𝑁𝑖𝑗
𝑪𝑋𝑋 𝑋1 𝑋2 𝑋3 𝑋4
Covariance 𝐶1,1 𝐶1,2 𝐶1,3 𝐶1,4
Matrix
𝑋1 = 0.6811 = −0.0390 = 1.2652 = 0.5135
𝐶2,1 𝐶2,2 𝐶2,3 𝐶2,4
𝑋2 = −0.0390 = 0.1868 = −0.3196 = −0.1172
𝐶3,1 𝐶3,2 𝐶3,3 𝐶3,4
𝑋3 = 1.2652 = −0.3196 = 3.0924 = 1.2877
𝒆1 𝒆2 𝒆3 𝒆4
sepal_length 0.3616 0.6565 -0.5810 0.3173
111
REAL TIME MACHINE LEARNING
Eigenvalue % Cumulative%
112
REAL TIME MACHINE LEARNING
First the Learner builds the first Basic Elements Table (BET PCR)
from the original data and then by using the eigenvectors builds the
second Basic Elements Table (BET MLR) using Scores.
𝑺 = 𝝀𝑘 (𝐱 − 𝝁𝑘 ) (53)
where:
113
REAL TIME MACHINE LEARNING
𝒚 = 𝒙′𝝎 − 𝛾
𝑬 = [𝑨 − 𝒆]
𝝎
(ii) Compute [𝛾 ] by for some positive v:
−1
𝝎 𝑰
[𝛾 ] = ( + 𝑬′𝑬) 𝑬′𝑫𝒆
𝑣
(iii) Predict a new output by using the Linear Regression and the
𝝎
solution [𝛾 ] from the step above.
114
REAL TIME MACHINE LEARNING
𝑬′𝑬
∑ 𝑋1 𝑋1 ∑ 𝑋1 𝑋𝑖 ∑ 𝑋1 𝑋𝑛 − ∑ 𝑋1
∑ 𝑋𝑖 𝑋1 ∑ 𝑋𝑖 𝑋𝑖 ∑ 𝑋𝑖 𝑋𝑛 − ∑ 𝑋𝑖
∑ 𝑋𝑛 𝑋1 ∑ 𝑋𝑛 𝑋𝑖 ∑ 𝑋𝑛 𝑋𝑛 − ∑ 𝑋𝑛
− ∑ 𝑋1 − ∑ 𝑋𝑖 − ∑ 𝑋𝑛 𝑁
𝑬′𝑫𝒆
∑ 𝑋1 𝑋𝑡𝑎𝑟𝑔𝑒𝑡
∑ 𝑋𝑖 𝑋𝑡𝑎𝑟𝑔𝑒𝑡
∑ 𝑋𝑛 𝑋𝑡𝑎𝑟𝑔𝑒𝑡
− ∑ 𝑋𝑡𝑎𝑟𝑔𝑒𝑡
115
REAL TIME MACHINE LEARNING
where:
• 𝑁 : Count (1)
116
REAL TIME MACHINE LEARNING
All the following real time statistical quantities are based on the Iris
dataset in the Appendix A. To calculate any LSVM quantity, we
only need to use the elements of the Basic Elements Table (BET)
generated from the transformed Iris dataset.
𝜔1 = −0.2056
−1
𝝎 𝐼 𝜔 = 0.2064
RSVM [𝛾 ] = ( + 𝑬′𝑬) 𝑬′ 𝑫𝒆 = [ 2 ]
𝑣 𝜔3 = 0.5205
𝛾 = 0.1865
𝑬′𝑬
∑ 𝑋1 𝑋1 = 5223.9 ∑ 𝑋1 𝑋2 = 2671.0 ∑ 𝑋1 𝑋3 = 3484.3 − ∑ 𝑋1 = −876.5
117
REAL TIME MACHINE LEARNING
𝑬′𝑫𝒆
∑ 𝑋1 𝑋4 = 1127.7
𝑬′𝑫𝒆 ∑ 𝑋2 𝑋4 = 531.5
∑ 𝑋3 𝑋4 = 869.0
− ∑ 𝑋4 = −179.8
𝑣 = 2 ; 𝐼 = 𝑠𝑝𝑒𝑦𝑒(4)
118
REAL TIME MACHINE LEARNING
where:
• 𝑇𝑖 = 𝑃(𝑋𝑖 |𝑋𝑖−1 )
• 𝑇0 = 𝑃(𝑋1 )
119
REAL TIME MACHINE LEARNING
∑ 𝑋𝑖 𝑋𝑖−1
𝑃(𝑋𝑖−1 |𝑋𝑖 ) =
∑ 𝑋𝑖
Probability State i :
∑ 𝑋𝑖
𝑃(𝑋𝑖 ) =
𝑁𝑖
∑ 𝑋𝑖−1
𝑃(𝑋𝑖−1 ) =
𝑁𝑖−1
where:
120
REAL TIME MACHINE LEARNING
where:
• 𝑇𝑖 = 𝑃(𝑋𝑖 |𝑋𝑖−1 )
• 𝐸𝑖 = 𝑃(𝑌𝑖 |𝑋𝑖 )
• 𝑇0 = 𝑃(𝑋1 )
121
REAL TIME MACHINE LEARNING
∑ 𝑋𝑖 𝑋𝑖−1
𝑃(𝑋𝑖−1 |𝑋𝑖 ) =
∑ 𝑋𝑖
∑ 𝑋𝑖 𝑌𝑖
𝑃(𝑋𝑖 |𝑌𝑖 ) =
∑ 𝑌𝑖
Probability of State i:
∑ 𝑋𝑖
𝑃(𝑋𝑖 ) =
𝑁𝑖
Probability of Output i:
∑ 𝑌𝑖
𝑃(𝑌𝑖 ) =
𝑁𝑖
∑ 𝑋𝑖−1
𝑃(𝑋𝑖−1 ) =
𝑁𝑖−1
122
REAL TIME MACHINE LEARNING
where:
123
REAL TIME MACHINE LEARNING
124
REAL TIME MACHINE LEARNING
125
REAL TIME MACHINE LEARNING
APPENDIX A:
The Iris dataset, the transformed Iris dataset and its related Basic
Elements Table.
Iris Dataset
sepal length sepal width petal length petal width iris
5.1 3.5 1.4 0.2 Iris-setosa
4.9 3 1.4 0.2 Iris-setosa
4.7 3.2 1.3 0.2 Iris-setosa
4.6 3.1 1.5 0.2 Iris-setosa
5 3.6 1.4 0.2 Iris-setosa
5.4 3.9 1.7 0.4 Iris-setosa
4.6 3.4 1.4 0.3 Iris-setosa
5 3.4 1.5 0.2 Iris-setosa
4.4 2.9 1.4 0.2 Iris-setosa
4.9 3.1 1.5 0.1 Iris-setosa
5.4 3.7 1.5 0.2 Iris-setosa
4.8 3.4 1.6 0.2 Iris-setosa
4.8 3 1.4 0.1 Iris-setosa
4.3 3 1.1 0.1 Iris-setosa
5.8 4 1.2 0.2 Iris-setosa
5.7 4.4 1.5 0.4 Iris-setosa
5.4 3.9 1.3 0.4 Iris-setosa
5.1 3.5 1.4 0.3 Iris-setosa
5.7 3.8 1.7 0.3 Iris-setosa
5.1 3.8 1.5 0.3 Iris-setosa
5.4 3.4 1.7 0.2 Iris-setosa
5.1 3.7 1.5 0.4 Iris-setosa
4.6 3.6 1 0.2 Iris-setosa
5.1 3.3 1.7 0.5 Iris-setosa
4.8 3.4 1.9 0.2 Iris-setosa
5 3 1.6 0.2 Iris-setosa
5 3.4 1.6 0.4 Iris-setosa
5.2 3.5 1.5 0.2 Iris-setosa
126
REAL TIME MACHINE LEARNING
127
REAL TIME MACHINE LEARNING
128
REAL TIME MACHINE LEARNING
129
REAL TIME MACHINE LEARNING
130
REAL TIME MACHINE LEARNING
131
REAL TIME MACHINE LEARNING
5 3 1.6 0.2 1 0 0 1 1 0 1 0 1 0 0
132
REAL TIME MACHINE LEARNING
5 2 3.5 1 1 0 1 0 1 0 0 1 0 1 0
6 2.2 4 1 0 1 1 0 0 1 0 1 0 1 0
133
REAL TIME MACHINE LEARNING
6.7 3 5 1.7 0 1 0 1 0 1 0 1 0 1 0
5 2.3 3.3 1 1 0 1 0 1 0 0 1 0 1 0
134
REAL TIME MACHINE LEARNING
5.7 2.5 5 2 1 0 1 0 0 1 0 1 0 0 1
6 2.2 5 1.5 0 1 1 0 0 1 0 1 0 0 1
135
REAL TIME MACHINE LEARNING
6.5 3 5.2 2 0 1 0 1 0 1 0 1 0 0 1
136
REAL TIME MACHINE LEARNING
𝟐
i j 𝑿𝒊 𝑿𝒋 𝑵𝒊𝒋 ∑ 𝑿𝒊 ∑ 𝑿𝒋 ∑ 𝑿𝒊 𝑿𝒋 ∑(𝑿𝒊 𝑿𝒋 ) ∑ 𝑿𝟐𝒊 ∑ 𝑿𝟐𝒋 ∑ 𝑿𝟑𝒊
1 1 sepal_length sepal_length 150 876.5 876.5 5223.9 196591.7 5223.9 5223.9 31745.0
1 2 sepal_length sepal_width 150 876.5 458.1 2671.0 49251.8 5223.9 1427.1 31745.0
1 3 sepal_length petal_length 150 876.5 563.8 3484.3 106644.7 5223.9 2583.0 31745.0
1 4 sepal_length petal_width 150 876.5 179.8 1127.7 12633.4 5223.9 302.3 31745.0
1 5 sepal_length sepal_lengt_b1 150 876.5 83.0 433.6 2279.9 5223.9 83.0 31745.0
1 6 sepal_length sepal_lengt_b2 150 876.5 67.0 442.9 2943.9 5223.9 67.0 31745.0
1 7 sepal_length sepal_width_b1 150 876.5 57.0 339.3 2045.4 5223.9 57.0 31745.0
1 8 sepal_length sepal_width_b2 150 876.5 93.0 537.2 3178.5 5223.9 93.0 31745.0
1 9 sepal_length petal_length_b1 150 876.5 61.0 309.2 1575.5 5223.9 61.0 31745.0
1 10 sepal_length petal_length_b2 150 876.5 89.0 567.3 3648.4 5223.9 89.0 31745.0
1 11 sepal_length petal_width_b1 150 876.5 50.0 250.3 1259.1 5223.9 50.0 31745.0
1 12 sepal_length petal_width_b2 150 876.5 100.0 626.2 3964.8 5223.9 100.0 31745.0
1 13 sepal_length Iris_setosa 150 876.5 50.0 250.3 1259.1 5223.9 50.0 31745.0
1 14 sepal_length Iris_versicolor 150 876.5 50.0 296.8 1774.9 5223.9 50.0 31745.0
1 15 sepal_length Iris_virginica 150 876.5 50.0 329.4 2189.9 5223.9 50.0 31745.0
2 2 sepal_width sepal_width 150 458.1 458.1 1427.1 14682.2 1427.1 1427.1 4533.3
2 3 sepal_width petal_length 150 458.1 563.8 1673.9 22772.2 1427.1 2583.0 4533.3
2 4 sepal_width petal_width 150 458.1 179.8 531.5 2696.0 1427.1 302.3 4533.3
2 5 sepal_width sepal_lengt_b1 150 458.1 83.0 259.4 830.8 1427.1 83.0 4533.3
2 6 sepal_width sepal_lengt_b2 150 458.1 67.0 198.7 596.3 1427.1 67.0 4533.3
2 7 sepal_width sepal_width_b1 150 458.1 57.0 150.5 400.2 1427.1 57.0 4533.3
2 8 sepal_width sepal_width_b2 150 458.1 93.0 307.6 1026.8 1427.1 93.0 4533.3
2 9 sepal_width petal_length_b1 150 458.1 61.0 198.3 660.1 1427.1 61.0 4533.3
2 10 sepal_width petal_length_b2 150 458.1 89.0 259.8 767.0 1427.1 89.0 4533.3
2 11 sepal_width petal_width_b1 150 458.1 50.0 170.9 591.3 1427.1 50.0 4533.3
2 12 sepal_width petal_width_b2 150 458.1 100.0 287.2 835.8 1427.1 100.0 4533.3
2 13 sepal_width Iris_setosa 150 458.1 50.0 170.9 591.3 1427.1 50.0 4533.3
2 14 sepal_width Iris_versicolor 150 458.1 50.0 138.5 388.5 1427.1 50.0 4533.3
137
REAL TIME MACHINE LEARNING
2 15 sepal_width Iris_virginica 150 458.1 50.0 148.7 447.3 1427.1 50.0 4533.3
3 3 petal_length petal_length 150 563.8 563.8 2583.0 68227.4 2583.0 2583.0 12974.0
3 4 petal_length petal_width 150 563.8 179.8 869.0 8344.4 2583.0 302.3 12974.0
3 5 petal_length sepal_lengt_b1 150 563.8 83.0 211.2 696.1 2583.0 83.0 12974.0
3 6 petal_length sepal_lengt_b2 150 563.8 67.0 352.6 1886.9 2583.0 67.0 12974.0
3 7 petal_length sepal_width_b1 150 563.8 57.0 257.0 1217.5 2583.0 57.0 12974.0
3 8 petal_length sepal_width_b2 150 563.8 93.0 306.8 1365.5 2583.0 93.0 12974.0
3 9 petal_length petal_length_b1 150 563.8 61.0 112.6 250.6 2583.0 61.0 12974.0
3 10 petal_length petal_length_b2 150 563.8 89.0 451.2 2332.4 2583.0 89.0 12974.0
3 11 petal_length petal_width_b1 150 563.8 50.0 73.2 108.6 2583.0 50.0 12974.0
3 12 petal_length petal_width_b2 150 563.8 100.0 490.6 2474.4 2583.0 100.0 12974.0
3 13 petal_length Iris_setosa 150 563.8 50.0 73.2 108.6 2583.0 50.0 12974.0
3 14 petal_length Iris_versicolor 150 563.8 50.0 213.0 918.2 2583.0 50.0 12974.0
3 15 petal_length Iris_virginica 150 563.8 50.0 277.6 1556.2 2583.0 50.0 12974.0
4 4 petal_width petal_width 150 179.8 179.8 302.3 1108.5 302.3 302.3 563.5
4 5 petal_width sepal_lengt_b1 150 179.8 83.0 58.1 71.5 302.3 83.0 563.5
4 6 petal_width sepal_lengt_b2 150 179.8 67.0 121.7 230.8 302.3 67.0 563.5
4 7 petal_width sepal_width_b1 150 179.8 57.0 82.6 130.1 302.3 57.0 563.5
4 8 petal_width sepal_width_b2 150 179.8 93.0 97.2 172.2 302.3 93.0 563.5
4 9 petal_width petal_length_b1 150 179.8 61.0 24.4 17.3 302.3 61.0 563.5
4 10 petal_width petal_length_b2 150 179.8 89.0 155.4 285.0 302.3 89.0 563.5
4 11 petal_width petal_width_b1 150 179.8 50.0 12.2 3.5 302.3 50.0 563.5
4 12 petal_width petal_width_b2 150 179.8 100.0 167.6 298.8 302.3 100.0 563.5
4 13 petal_width Iris_setosa 150 179.8 50.0 12.2 3.5 302.3 50.0 563.5
4 14 petal_width Iris_versicolor 150 179.8 50.0 66.3 89.8 302.3 50.0 563.5
4 15 petal_width Iris_virginica 150 179.8 50.0 101.3 208.9 302.3 50.0 563.5
5 5 sepal_lengt_b1 sepal_lengt_b1 150 83.0 83.0 83.0 83.0 83.0 83.0 83.0
5 6 sepal_lengt_b1 sepal_lengt_b2 150 83.0 67.0 0.0 0.0 83.0 67.0 83.0
5 7 sepal_lengt_b1 sepal_width_b1 150 83.0 57.0 28.0 28.0 83.0 57.0 83.0
5 8 sepal_lengt_b1 sepal_width_b2 150 83.0 93.0 55.0 55.0 83.0 93.0 83.0
138
REAL TIME MACHINE LEARNING
5 9 sepal_lengt_b1 petal_length_b1 150 83.0 61.0 61.0 61.0 83.0 61.0 83.0
5 10 sepal_lengt_b1 petal_length_b2 150 83.0 89.0 22.0 22.0 83.0 89.0 83.0
5 11 sepal_lengt_b1 petal_width_b1 150 83.0 50.0 50.0 50.0 83.0 50.0 83.0
5 12 sepal_lengt_b1 petal_width_b2 150 83.0 100.0 33.0 33.0 83.0 100.0 83.0
5 13 sepal_lengt_b1 Iris_setosa 150 83.0 50.0 50.0 50.0 83.0 50.0 83.0
5 14 sepal_lengt_b1 Iris_versicolor 150 83.0 50.0 26.0 26.0 83.0 50.0 83.0
5 15 sepal_lengt_b1 Iris_virginica 150 83.0 50.0 7.0 7.0 83.0 50.0 83.0
6 6 sepal_lengt_b2 sepal_lengt_b2 150 67.0 67.0 67.0 67.0 67.0 67.0 67.0
6 7 sepal_lengt_b2 sepal_width_b1 150 67.0 57.0 29.0 29.0 67.0 57.0 67.0
6 8 sepal_lengt_b2 sepal_width_b2 150 67.0 93.0 38.0 38.0 67.0 93.0 67.0
6 9 sepal_lengt_b2 petal_length_b1 150 67.0 61.0 0.0 0.0 67.0 61.0 67.0
6 10 sepal_lengt_b2 petal_length_b2 150 67.0 89.0 67.0 67.0 67.0 89.0 67.0
6 11 sepal_lengt_b2 petal_width_b1 150 67.0 50.0 0.0 0.0 67.0 50.0 67.0
6 12 sepal_lengt_b2 petal_width_b2 150 67.0 100.0 67.0 67.0 67.0 100.0 67.0
6 13 sepal_lengt_b2 Iris_setosa 150 67.0 50.0 0.0 0.0 67.0 50.0 67.0
6 14 sepal_lengt_b2 Iris_versicolor 150 67.0 50.0 24.0 24.0 67.0 50.0 67.0
6 15 sepal_lengt_b2 Iris_virginica 150 67.0 50.0 43.0 43.0 67.0 50.0 67.0
7 7 sepal_width_b1 sepal_width_b1 150 57.0 57.0 57.0 57.0 57.0 57.0 57.0
7 8 sepal_width_b1 sepal_width_b2 150 57.0 93.0 0.0 0.0 57.0 93.0 57.0
7 9 sepal_width_b1 petal_length_b1 150 57.0 61.0 13.0 13.0 57.0 61.0 57.0
7 10 sepal_width_b1 petal_length_b2 150 57.0 89.0 44.0 44.0 57.0 89.0 57.0
7 11 sepal_width_b1 petal_width_b1 150 57.0 50.0 2.0 2.0 57.0 50.0 57.0
7 12 sepal_width_b1 petal_width_b2 150 57.0 100.0 55.0 55.0 57.0 100.0 57.0
7 13 sepal_width_b1 Iris_setosa 150 57.0 50.0 2.0 2.0 57.0 50.0 57.0
7 14 sepal_width_b1 Iris_versicolor 150 57.0 50.0 34.0 34.0 57.0 50.0 57.0
7 15 sepal_width_b1 Iris_virginica 150 57.0 50.0 21.0 21.0 57.0 50.0 57.0
8 8 sepal_width_b2 sepal_width_b2 150 93.0 93.0 93.0 93.0 93.0 93.0 93.0
8 9 sepal_width_b2 petal_length_b1 150 93.0 61.0 48.0 48.0 93.0 61.0 93.0
8 10 sepal_width_b2 petal_length_b2 150 93.0 89.0 45.0 45.0 93.0 89.0 93.0
8 11 sepal_width_b2 petal_width_b1 150 93.0 50.0 48.0 48.0 93.0 50.0 93.0
139
REAL TIME MACHINE LEARNING
8 12 sepal_width_b2 petal_width_b2 150 93.0 100.0 45.0 45.0 93.0 100.0 93.0
8 13 sepal_width_b2 Iris_setosa 150 93.0 50.0 48.0 48.0 93.0 50.0 93.0
8 14 sepal_width_b2 Iris_versicolor 150 93.0 50.0 16.0 16.0 93.0 50.0 93.0
8 15 sepal_width_b2 Iris_virginica 150 93.0 50.0 29.0 29.0 93.0 50.0 93.0
petal_length_b
9 9 petal_length_b1 150 61.0 61.0 61.0 61.0 61.0 61.0 61.0
1
petal_length_b
9 10 petal_length_b2 150 61.0 89.0 0.0 0.0 61.0 89.0 61.0
1
petal_length_b
9 11 petal_width_b1 150 61.0 50.0 50.0 50.0 61.0 50.0 61.0
1
petal_length_b
9 12 petal_width_b2 150 61.0 100.0 11.0 11.0 61.0 100.0 61.0
1
petal_length_b
9 13 Iris_setosa 150 61.0 50.0 50.0 50.0 61.0 50.0 61.0
1
petal_length_b
9 14 Iris_versicolor 150 61.0 50.0 11.0 11.0 61.0 50.0 61.0
1
petal_length_b
9 15 Iris_virginica 150 61.0 50.0 0.0 0.0 61.0 50.0 61.0
1
petal_length_b
10 10 petal_length_b2 150 89.0 89.0 89.0 89.0 89.0 89.0 89.0
2
petal_length_b
10 11 petal_width_b1 150 89.0 50.0 0.0 0.0 89.0 50.0 89.0
2
petal_length_b
10 12 petal_width_b2 150 89.0 100.0 89.0 89.0 89.0 100.0 89.0
2
petal_length_b
10 13 Iris_setosa 150 89.0 50.0 0.0 0.0 89.0 50.0 89.0
2
petal_length_b
10 14 Iris_versicolor 150 89.0 50.0 39.0 39.0 89.0 50.0 89.0
2
petal_length_b
10 15 Iris_virginica 150 89.0 50.0 50.0 50.0 89.0 50.0 89.0
2
11 11 petal_width_b1 petal_width_b1 150 50.0 50.0 50.0 50.0 50.0 50.0 50.0
11 12 petal_width_b1 petal_width_b2 150 50.0 100.0 0.0 0.0 50.0 100.0 50.0
11 13 petal_width_b1 Iris_setosa 150 50.0 50.0 50.0 50.0 50.0 50.0 50.0
11 14 petal_width_b1 Iris_versicolor 150 50.0 50.0 0.0 0.0 50.0 50.0 50.0
11 15 petal_width_b1 Iris_virginica 150 50.0 50.0 0.0 0.0 50.0 50.0 50.0
12 12 petal_width_b2 petal_width_b2 150 100.0 100.0 100.0 100.0 100.0 100.0 100.0
12 13 petal_width_b2 Iris_setosa 150 100.0 50.0 0.0 0.0 100.0 50.0 100.0
12 14 petal_width_b2 Iris_versicolor 150 100.0 50.0 50.0 50.0 100.0 50.0 100.0
12 15 petal_width_b2 Iris_virginica 150 100.0 50.0 50.0 50.0 100.0 50.0 100.0
13 13 Iris_setosa Iris_setosa 150 50.0 50.0 50.0 50.0 50.0 50.0 50.0
13 14 Iris_setosa Iris_versicolor 150 50.0 50.0 0.0 0.0 50.0 50.0 50.0
13 15 Iris_setosa Iris_virginica 150 50.0 50.0 0.0 0.0 50.0 50.0 50.0
14 14 Iris_versicolor Iris_versicolor 150 50.0 50.0 50.0 50.0 50.0 50.0 50.0
140
REAL TIME MACHINE LEARNING
14 15 Iris_versicolor Iris_virginica 150 50.0 50.0 0.0 0.0 50.0 50.0 50.0
15 15 Iris_virginica Iris_virginica 150 50.0 50.0 50.0 50.0 50.0 50.0 50.0
141
REAL TIME MACHINE LEARNING
142
REAL TIME MACHINE LEARNING
K-Means Algorithm:
1. Clusters the data into k groups where k is predefined.
2. Select k points at random as cluster centers.
3. Assign objects to their closest cluster center (c) per the
Euclidean distance function.
4. Calculate the centroid (c) or mean of all objects in each
cluster.
5. Repeat steps 2, 3 and 4 until the same points are assigned to
each cluster in consecutive rounds.
𝑁 ∶= 𝑁 ± 𝑁 𝑛𝑒𝑤
𝑛𝑒𝑤
∑ 𝑋 ∶= ∑ 𝑋 ± ∑ 𝑋
143
REAL TIME MACHINE LEARNING
Now, instead of keeping all the original training data we only keep
m (e.g., 100) centroids which can be updated in real time using BET
real time equations. For prediction, first we choose a much smaller
number of clusters (k) and then we use the saved m centroids as the
source data to build the new set of clusters.
144
REAL TIME MACHINE LEARNING
∆2 = 𝜷𝑇 (𝝁𝟎 − 𝝁𝟏 )
145
REAL TIME MACHINE LEARNING
146
REAL TIME MACHINE LEARNING
terms (page 102). You can use the 𝑅 2 to finds subsets of variables
2. Add the best next variable to the first variable one by one
147
REAL TIME MACHINE LEARNING
𝑌 = (𝑋1 + 𝑋2 )
∑ 𝑌 = ∑(𝑋1 + 𝑋2 ) = ∑ 𝑋1 + ∑ 𝑋2
∑ 𝑌 2 = ∑(𝑋1 + 𝑋2 )2 = ∑ 𝑋1 2 + ∑ 𝑋2 2 + 2 ∑ 𝑋1 𝑋2
This unique feature can play a major role when you work with data
with thousands of variables, and merging variables can make the
model much smaller without excluding any variables.
148