You are on page 1of 6

Heightcms Weightkgs - s

-independent -independent TshirtSize – (Y)


variable variable Dependent variable Dstiance lowest valuFindout what will be Tshirt Size
160 60 M 1.41 1 New Custo height
163 61 M 2.00 2 161
160 59 M 2.24 3
163 60 M 2.24 4 step1 Euclidean distance
160 64 L 3.16 5
158 59 M 3.61 Assume k=3 , class will be M
158 63 M 3.61 Assume K=5 , class will be M
163 64 L 3.61
165 61 L 4.00 if we have cagtegorical columns then w
165 62 L 4.12
158 58 M 4.24 for all distance based algorithm scaling
165 65 L 5.66 standard scaling = (X-mean)/std
168 62 L 7.07 Normalization = (X-Xmin)/(XMax-Xmin)
168 63 L 7.28
168 66 L 8.60
170 63 L 9.22
170 64 L 9.49
170 68 L 11.40

Heightcms Weightkgs - s
-independent -independent TshirtSize – (Y)
variable variable Dependent variable
160 60 M
163 61 M
160 59 M
163 60 M
160 64 L
158 59 M
158 63 M
163 64 L
165 61 L
165 62 L
158 58 M
165 65 L
168 62 L
168 63 L
168 66 L
170 63 L
170 64 L
170 68 L
164 62.33 Average
4.33 2.63 Std Dev

Standard scaling
Height-X1 Weight-X2 Shirtsize-Y dsitance find out what is the shirt size
-0.92 -0.89 M 1.28 new custom 161 61
-0.23 -0.51 M scaled -0.69 -0.51
-0.92 -1.27 M
-0.23 -0.89 M
-0.92 0.63 L
-1.39 -1.27 M
-1.39 0.25 M
-0.23 0.63 L
0.23 -0.51 L
0.23 -0.13 L
-1.39 -1.64 M
0.23 1.01 L
0.92 -0.13 L
0.92 0.25 L
0.92 1.39 L
1.39 0.25 L
1.39 0.63 L
1.39 2.15 L

Normalization Normalization = (X-Xmin)/(XMax-Xmin)


Heightcms Weightkgs - s TshirtSize – (Y)
-X1 -X2 Dependent variable X1norm X2norm Distance
160 60 M 0.17 0.20 0.13 1
163 61 M 0.42 0.30 0.17 2
160 59 M 0.17 0.10 0.22 3
163 60 M 0.42 0.20 0.19 4
160 64 L 0.17 0.60 0.31 5
158 59 M 0.00 0.10 0.32 6
158 63 M 0.00 0.50 0.32 7
163 64 L 0.42 0.60 0.34 8
165 61 L 0.58 0.30 0.33 9
165 62 L 0.58 0.40 0.35 10
158 58 M 0.00 0.00 0.39
165 65 L 0.58 0.70 0.52
168 62 L 0.83 0.40 0.59
168 63 L 0.83 0.50 0.62
168 66 L 0.83 0.80 0.77
170 63 L 1.00 0.50 0.78
170 64 L 1.00 0.60 0.81
170 68 L 1.00 1.00 1.03
158 58 min
170 68 max
hat will be Tshirt Size
weight
61

Euclidean distance

3 , class will be M
5 , class will be M

cagtegorical columns then we have to dummy encode

nce based algorithm scaling or normalization should be done


caling = (X-mean)/std
tion = (X-Xmin)/(XMax-Xmin)
161 61
0.25 0.3

K=3 class M
K=5 class M
Age Loan HousePriceIndex Agenorm loanNorm Distance New csutomer
45 80000 231 0.63 0.31 0.32 age loan
35 120000 139 0.38 0.50 0.34 48 142000
60 100000 139 1.00 0.41 0.37 0.7 0.61386139
33 150000 264 0.33 0.65 0.38
48 220000 250 0.70 1.00 0.39 K3=170
40 62000 216 0.50 0.22 0.44 K5=204.6
35 60000 256 0.38 0.21 0.52
52 18000 150 0.80 0.00 0.62
23 95000 127 0.08 0.38 0.67
25 40000 135 0.13 0.11 0.77
20 20000 267 0.00 0.01 0.92

60 220000 max
20 18000 min
Sheet3
1.41
2.00
2.24
2.24
3.16
3.61
3.61
3.61
4.00
4.12
4.24
5.66
7.07
7.28
8.60
9.22
9.49
11.40

Page 6

You might also like