You are on page 1of 29

PROJECT ON DATA MINING-RAVEENDRA BABU GADDAM

1) Exploratory Data Analysis

# Setting of Working Directory and Importing of Data Set

> setwd("C:/Users/HP/Desktop/GL-BABI")

> BankData<-read.csv("Thera Bank_Personal_Loan_Modelling-dataset-1.csv",header=TRUE


)

> str(BankData)

'data.frame': 5000 obs. of 14 variables:


$ ID : int 1 2 3 4 5 6 7 8 9 10 ...
$ Age..in.years. : int 25 45 39 35 35 37 53 50 35 34 ...
$ Experience..in.years.: int 1 19 15 9 8 13 27 24 10 9 ...
$ Income..in.K.month. : int 49 34 11 100 45 29 72 22 81 180 ...
$ ZIP.Code : int 91107 90089 94720 94112 91330 92121 91711 93943 90089 93023 ...
$ Family.members : int 4 3 1 1 4 4 2 1 3 1 ...
$ CCAvg : num 1.6 1.5 1 2.7 1 0.4 1.5 0.3 0.6 8.9 ...
$ Education : int 1 1 1 2 2 2 2 3 2 3 ...
$ Mortgage : int 0 0 0 0 0 155 0 0 104 0 ...
$ Personal.Loan : int 0 0 0 0 0 0 0 0 0 1 ...
$ Securities.Account : int 1 1 0 0 0 0 0 0 0 0 ...
$ CD.Account : int 0 0 0 0 0 0 0 0 0 0 ...
$ Online : int 0 0 0 0 0 1 1 0 1 0 ...
$ CreditCard : int 0 0 0 0 1 0 0 1 0 0 ...

The structure of the data set is Data frame; we have 14 variables and 5000 Observations. Personal Loan is a dependent variable

# Variables ID and ZIP Code is not relevant for the analysis of the Data.

# Removing of Variables from the Data Set and checking of structure of the data

> BankData<-BankData[,-c(1,5)]

> str(BankData) # Structure after removing of ID and ZIP Code from the data set.

'data.frame': 5000 obs. of 12 variables:


$ Age..in.years. : int 25 45 39 35 35 37 53 50 35 34 ...
$ Experience..in.years.: int 1 19 15 9 8 13 27 24 10 9 ...
$ Income..in.K.month. : int 49 34 11 100 45 29 72 22 81 180 ...
$ Family.members : int 4 3 1 1 4 4 2 1 3 1 ...
$ CCAvg : num 1.6 1.5 1 2.7 1 0.4 1.5 0.3 0.6 8.9 ...
$ Education : int 1 1 1 2 2 2 2 3 2 3 ...
$ Mortgage : int 0 0 0 0 0 155 0 0 104 0 ...
$ Personal.Loan : int 0 0 0 0 0 0 0 0 0 1 ...
$ Securities.Account : int 1 1 0 0 0 0 0 0 0 0 ...
$ CD.Account : int 0 0 0 0 0 0 0 0 0 0 ...
$ Online : int 0 0 0 0 0 1 1 0 1 0 ...
$ CreditCard : int 0 0 0 0 1 0 0 1 0 0 ...

# Let us fix the Column name and check the structure again.

# Fixing of Column Name.

> colnames(BankData) <- c("Age", "Experience", "Income", "Family.Members", "CCAvg", "Education", "Mortgage",
+ "Personal.Loan", "Securities", "CD.Account", "Online.Access", "CreditCard")
> str(BankData) # Structur of data after fixing of column names.

'data.frame': 5000 obs. of 12 variables:


$ Age : int 25 45 39 35 35 37 53 50 35 34 ...
$ Experience : int 1 19 15 9 8 13 27 24 10 9 ...
$ Income : int 49 34 11 100 45 29 72 22 81 180 ...
$ Family.Members: int 4 3 1 1 4 4 2 1 3 1 ...
$ CCAvg : num 1.6 1.5 1 2.7 1 0.4 1.5 0.3 0.6 8.9 ...
$ Education : int 1 1 1 2 2 2 2 3 2 3 ...
$ Mortgage : int 0 0 0 0 0 155 0 0 104 0 ...
$ Personal.Loan : int 0 0 0 0 0 0 0 0 0 1 ...
$ Securities : int 1 1 0 0 0 0 0 0 0 0 ...
$ CD.Account : int 0 0 0 0 0 0 0 0 0 0 ...
$ Online.Access : int 0 0 0 0 0 1 1 0 1 0 ...
$ CreditCard : int 0 0 0 0 1 0 0 1 0 0 ...

# converting of all class variables in Factor Variables

> BankData$Family.Members <- as.factor(BankData$Family.Members)


> BankData$Education <- as.factor(BankData$Education)
> BankData$Personal.Loan <- as.factor(BankData$Personal.Loan)
> BankData$Securities <- as.factor(BankData$Securities)
> BankData$CD.Account <- as.factor(BankData$CD.Account)
> BankData$Online.Access <- as.factor(BankData$Online.Access)
> BankData$CreditCard <- as.factor(BankData$CreditCard)

> str(BankData)

'data.frame': 5000 obs. of 12 variables:


$ Age : int 25 45 39 35 35 37 53 50 35 34 ...
$ Experience : int 1 19 15 9 8 13 27 24 10 9 ...
$ Income : int 49 34 11 100 45 29 72 22 81 180 ...
$ Family.Members: Factor w/ 4 levels "1","2","3","4": 4 3 1 1 4 4 2 1 3 1 ...
$ CCAvg : num 1.6 1.5 1 2.7 1 0.4 1.5 0.3 0.6 8.9 ...
$ Education : Factor w/ 3 levels "1","2","3": 1 1 1 2 2 2 2 3 2 3 ...
$ Mortgage : int 0 0 0 0 0 155 0 0 104 0 ...
$ Personal.Loan : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1 1 2 ...
$ Securities : Factor w/ 2 levels "0","1": 2 2 1 1 1 1 1 1 1 1 ...
$ CD.Account : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1 1 1 ...
$ Online.Access : Factor w/ 2 levels "0","1": 1 1 1 1 1 2 2 1 2 1 ...
$ CreditCard : Factor w/ 2 levels "0","1": 1 1 1 1 2 1 1 2 1 1 ...

# Checking for Missing Values

> dim(BankData)

[1] 5000 12

> colSums(is.na(BankData))

Age Experience Income Family.Members CCAvg


0 0 0 18 0
Education Mortgage Personal.Loan Securities CD.Account
0 0 0 0 0
Online.Access CreditCard
1 0

# there is 12 Variables and 5000 rows of the data.


# We have missing values in Family. Members.
# Let us remove the missing value of the and
# checking of dimensions and summary of data after removing missing values

> BankData <- na.omit(BankData)

> colSums(is.na(BankData))

Age Experience Income Family.Members CCAvg


0 0 0 0 0
Education Mortgage Personal.Loan Securities CD.Account
0 0 0 0 0
Online.Access CreditCard
0 0

> dim(BankData)

[1] 4982 12

> summary(BankData)
Age Experience Income Family.Members CCAvg
Min. :23.00 Min. :-3.0 Min. : 8.00 1:1464 Min. : 0.00
1st Qu.:35.00 1st Qu.:10.0 1st Qu.: 39.00 2:1292 1st Qu.: 0.70
Median :45.00 Median :20.0 Median : 64.00 3:1009 Median : 1.50
Mean :45.33 Mean :20.1 Mean : 73.73 4:1217 Mean : 1.94
3rd Qu.:55.00 3rd Qu.:30.0 3rd Qu.: 98.00 3rd Qu.: 2.50
Max. :67.00 Max. :43.0 Max. :224.00 Max. :10.00
Education Mortgage Personal.Loan Securities CD.Account Online.Access
1:2088 Min. : 0.00 0:4504 0:4463 0:4682 0:2013
2:1399 1st Qu.: 0.00 1: 478 1: 519 1: 300 1:2969
3:1495 Median : 0.00
Mean : 56.55
3rd Qu.:101.00
Max. :635.00
CreditCard
0:3517
1:1465

# After checking the results of the above summary,


# It is understood that there are some negative values in variable experience.
# Experience cannot be negative number. Let us replace negative numbers with zero
# And will check summary once again.

> BankData$Experience[which(BankData$Experience < 0 )] <- 0

> summary(BankData)
Age Experience Income Family.Members CCAvg
Min. :23.00 Min. : 0.00 Min. : 8.00 1:1464 Min. : 0.00
1st Qu.:35.00 1st Qu.:10.00 1st Qu.: 39.00 2:1292 1st Qu.: 0.70
Median :45.00 Median :20.00 Median : 64.00 3:1009 Median : 1.50
Mean :45.33 Mean :20.11 Mean : 73.73 4:1217 Mean : 1.94
3rd Qu.:55.00 3rd Qu.:30.00 3rd Qu.: 98.00 3rd Qu.: 2.50
Max. :67.00 Max. :43.00 Max. :224.00 Max. :10.00
Education Mortgage Personal.Loan Securities CD.Account Online.Access
1:2088 Min. : 0.00 0:4504 0:4463 0:4682 0:2013
2:1399 1st Qu.: 0.00 1: 478 1: 519 1: 300 1:2969
3:1495 Median : 0.00
Mean : 56.55
3rd Qu.:101.00
Max. :635.00
CreditCard
0:3517
1:1465
# Checking for Outliers in the data

# Plotting of Box Plot to check the outliers

boxplot(BankData)
600
100 200 300 400 500
0

Age Experience Family.Members Education Personal.Loan CD.Account CreditCard

Boxplot for Age

30 40 50 60
Boxplot for Experience

0 10 20 30 40

Boxplot for Income

0 50 100 150 200


Boxplot for CCAvg

0 2 4 6 8 10

Boxplot for Mortgage

0 100 200 300 400 500 600

boxplot(BankData$Age, horizontal = TRUE, main = "Boxplot for Age")


> boxplot(BankData$Experience, horizontal = TRUE, main = "Boxplot for Experience")
> boxplot(BankData$Income, horizontal = TRUE, main = "Boxplot for Income")
> boxplot(BankData$CCAvg, horizontal = TRUE, main = "Boxplot for CCAvg")
> boxplot(BankData$Mortgage, horizontal = TRUE, main = "Boxplot for Mortgage")

# From the above boxplots it is evident that there are few outliers in Income & CCAvg,
# but there are a lot of outliers in Mortgage.
# Correlation between the Continuous Variables

> names(BankData)

[1] "Age" "Experience" "Income" "Family.Members"


[5] "CCAvg" "Education" "Mortgage" "Personal.Loan"
[9] "Securities" "CD.Account" "Online.Access" "CreditCard"

> BankData_cont_vars<-BankData[, c(1:3,5,7)]


> corrmatrix<-cor(BankData_cont_vars)
> corrplot::corrplot(corrmatrix,method='number',type='upper',order='FPC')

Experience

Mortgage

Income
CCAvg
Age
1

Age 1 0.99 -0.01 -0.05 -0.06 0.8

0.6
Experience 1 -0.01 -0.05 -0.05 0.4

0.2
Mortgage 1 0.11 0.21 0
-0.2

CCAvg 1 0.65 -0.4

-0.6
Income 1 -0.8

-1
# there is a perfect correlation between age and experience,
# So it is better to keep only one variable.
# I am going to keep age variable only.

BankData <- BankData[, -2]

> str(BankData)

'data.frame': 4982 obs. of 11 variables:


$ Age : int 25 45 39 35 35 37 53 50 35 34 ...
$ Income : int 49 34 11 100 45 29 72 22 81 180 ...
$ Family.Members: Factor w/ 4 levels "1","2","3","4": 4 3 1 1 4 4 2 1 3 1 ...
$ CCAvg : num 1.6 1.5 1 2.7 1 0.4 1.5 0.3 0.6 8.9 ...
$ Education : Factor w/ 3 levels "1","2","3": 1 1 1 2 2 2 2 3 2 3 ...
$ Mortgage : int 0 0 0 0 0 155 0 0 104 0 ...
$ Personal.Loan : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1 1 2 ...
$ Securities : Factor w/ 2 levels "0","1": 2 2 1 1 1 1 1 1 1 1 ...
$ CD.Account : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1 1 1 ...
$ Online.Access : Factor w/ 2 levels "0","1": 1 1 1 1 1 2 2 1 2 1 ...
$ CreditCard : Factor w/ 2 levels "0","1": 1 1 1 1 2 1 1 2 1 1 ...
# Checking of Sctatterplot and Histogram

# from the scatter plot it is understood that the data is not distributed normally.
# Head and Tail of Data
head(BankData)
Age Income Family.Members CCAvg Education Mortgage Personal.Loan Securities
1 25 49 4 1.6 1 0 0 1
2 45 34 3 1.5 1 0 0 1
3 39 11 1 1.0 1 0 0 0
4 35 100 1 2.7 2 0 0 0
5 35 45 4 1.0 2 0 0 0
6 37 29 4 0.4 2 155 0 0
CD.Account Online.Access CreditCard
1 0 0 0
2 0 0 0
3 0 0 0
4 0 0 0
5 0 0 1
6 0 1 0
> tail(BankData)
Age Income Family.Members CCAvg Education Mortgage Personal.Loan Securities
4995 64 75 3 2.0 3 0 0 0
4996 29 40 1 1.9 3 0 0 0
4997 30 15 4 0.4 1 85 0 0
4998 63 24 2 0.3 3 0 0 0
4999 65 49 3 0.5 2 0 0 0
5000 28 83 3 0.8 1 0 0 0
CD.Account Online.Access CreditCard
4995 0 1 0
4996 0 1 0
4997 0 1 0
4998 0 0 0
4999 0 1 0
5000 0 1 1

# Creation of additional column which will say if a person has mortgage or not

BankData<-BankData %>% mutate(Have.Mortgage = if_else(Mortgage>0, 1, 0))


BankData$Have.Mortgage <- as.factor(BankData$Have.Mortgage)

# scaling of the continuous variables to remove the impact of outliers # and bringing of all the data under same scale

# Summary of Scaled Data

summary(BankData_scaled)

Age.V1 Income.V1 Family.Members CCAvg.V1


Min. :-1.9464324 Min. :-1.427724 1:1464 Min. :-1.109459
1st Qu.:-0.9002998 1st Qu.:-0.754399 2:1292 1st Qu.:-0.709131
Median :-0.0285226 Median :-0.211395 3:1009 Median :-0.251613
Mean : 0.0000000 Mean : 0.000000 4:1217 Mean : 0.000000
3rd Qu.: 0.8432545 3rd Qu.: 0.527091 3rd Qu.: 0.320284
Max. : 1.8893871 Max. : 3.263833 Max. : 4.609513
Education Mortgage.V1 Personal.Loan Securities CD.Account Online.Access
1:2088 Min. :-0.555703 0:4504 0:4463 0:4682 0:2013
2:1399 1st Qu.:-0.555703 1: 478 1: 519 1: 300 1:2969
3:1495 Median :-0.555703
Mean : 0.000000
3rd Qu.: 0.436863
Max. : 5.684692
CreditCard Have.Mortgage
0:3517 0:3448
1:1465 1:1534
2) Apply appropriate clustering on the data and interpret the output

# Calculation Euclidean distance

D.euc <- dist(BankData_scaled, method = "euclidean")

Note: Please note that, I have not added distance numbers here as it was showing very big amount of data.

# Performing Clustering-Hierarchical Clustering

hclust.euc <- hclust(D.euc, method = "complete")

# Plot Cluster Height

cluster.height <- hclust.euc$height

cluster.height <- sort(cluster.height, decreasing = TRUE)

plot(cluster.height, pch = 20, col = "red")

lines(cluster.height, lty = 2, lwd = 2, col = "blue")


8
cluster.height

6
4
2
0

0 1000 2000 3000 4000 5000

Index
Height Height

0 2 4 6 8 0 2 4 6 8 plot(hclust.euc)

2078
247
2763
1332
3324
2488
2140
3207
2766
2067
1016
3996
1266
2923
1707
4224
2385 723
2528
4639
4569
4331
4842
2802
2945
4280
1270 908
2759
1347
1297 1321
4317
798
474
2531
284
4406
775
360
92583
2373
2032 928
3597
45
2830 2078
2472763
1332
3324
2488
2140
2923
1707 723
3207
2766
2067
1016 2528
4639
4569
4331
3996
1266
4224
2385 4842 2583
2802
2945
4280
1270 360
9 908
2759
1347
1297 1321
4317
798
474
2531
284
4406
775
2373
2032 928
3597
45
2830
3560
2704
4656
1774
3609
2412
3010
1727
4372
39183504
26582654
1281
3400
1614
2677
2502
2223
4467
1826
2930
1679
1818
3014
322 349
4925
4409
10702476
65
18731993
1365
3459
4825
2743
1352
1104
3769
4004
2208
3809 3362
330
3567
947
1564
468
4525 23456
1780 98
4795 3560
2704
4656
1774
3609
2412
3010
3504
26582654
1281
3400
1614
2502
2223
1727
4372
39184467
1826
2930
1679
1818
3014
322 349
1352
1104
2677
3769
4925
65
4409
1070 2476
1873 1993
1365
3459
4825
2743
4004
2208
3809 3362
1564
468
4525
330
3567
947 23456
1780 98
4795
4176
1157
2310
1160
2935
2931
3001
4093
3009
678
2758
2810
3124
2314454
2643
2490
4169
2936
1155
1296
4827 3872
1656
2000
2018
1504
2181833
3863
1961
3810
1620
20923931 3933
3328
5
2978
3333690
1000
2433
1294
772
10 3651
1815
2328
19 2308
42
3764
84226
780 4176
1157
2310
1160
2935
2931
3001
4093
3009
678
2758
2643
2490454
4169
2936
2810
3124
2314
1155
48271296
2018
1504
2181 3872
1656
3933
3328
29785
333
2000
833
3863
1961
3810
1620
20923931
772
10 3690
1000
2433
1294
3651
1815
2328
19 2308
42
3764
84226
780
2847
2838
1687
4242
697
1156
23702692
1821
630
3192
3386
1512
4810
4788
34663373
1183
2050
3018
4736
4871
3733
2631
4353
1457 2977
1702
15191756
902
520
24
4030
2551
2198
2390
1404
3907933
881
4245
3022
4278
4151 3551
2887
2603
437
644
1813
769
1400 477
2150 23702847
2838
1687
4242
697
1156
2692
1821
630
3192
3386
1512
4810
4788
4871
37332050
2631
3466
4353
14573373
1183
3018
4736
3907933
881
4245
3022
4278
4151 2977
1702
15191756
902
520
4030
2551
2198
2390
1404
3551
2887
140024
2603
437
1813
769644
477
2150
2921
2293
2029
1185
4262384
2458
1109
224
3668
4564
2307
778
1440
4139
571
191
2368
418
4894
2761
25132599
3542
4374
1489
777
4044
3768
3723
32717761909
1043
148
1072
3621
2329
1141
427
3585
1952
3689
2859
285
95 3338
2336
2688
4891
718 665
2580
3
417 601 2921
2293
2029
1185
4262
1440384
2458
1109
224
3668
4564
2307
778
4139
571
191
2368
418
4894
2761
25132599
3542
4374
1489
777 7761909
1043
148
1072
3621
2329
4044
3768
3723
3271
1141
427
3585
1952
3689
2859
285
95 3338
2336
2688
4891
718 665
25803
417 601
2568
193
4916
1994
4828
1732
3814
2916
3349
1754
3262
2765
2144
3310
4637
2094
4048
17101783
4837
2587
3795
3746
4976
4558
1695
3677
3628
4590 44
2626
71
1803
2703
2241
2966
4645
1892
4330
800
1875
4142
2941
210
10012841
4587 3884 2568
193
4916
1994
4828
1732
3814
2916
3349
1754
3262
2765
2144
3310
4637
2094
4048
17101783
4837
2587
3795
3746
4976
4558
1695
3677
3628
4590 44
2626
71
1803
2703
2241
2966
4645
1892
4330
800
1875
4142
2941
210
1001 2841
4587 3884
2437
1881
910
733
3301
2781
3173
4211
3792
356
297
1213
341655
2182
488
169874
538
2269
1776
3997
2548
2570
1834
4257
19534547
1314
4684583
4643
1425
1592
1543
1985
7934920
1788
3730
2296
608
1300
16863875
324
2468
1617548
4271
41042917
3796
1118 557 2437
1881
910
733
3301
2781
3173
4211
3792
356
297
1213
3416
2570
183455
2182
488
169
4257
1953874
538
2269
1776
3997
2548
4547
1314
4684
2468
1617583
4643
1425
1592
1543
1985
7934920
1788
3730
2296
608
1300
16863875
324
548
4271
41042917
3796
1118 557
4663591
2638
4765
2400
3435
306
3912
1591
2512
469
4464
1642635
2222
1511
14243465
698
1142
3604
2226
2777
1532
4856
4739
609
3507
2711
3728
4805
4404
4723
700 3471
2340
3154
4577
1395
1816
4133
1411
2100
3438
3472
1657 60
503
2783
3296
143 397
4314
1583
458 4663591
2638
4765
2400
3435
306
14243465
698
1142
3604
2226
2777
1532
4856
4739
609
3507
3912
1591
2512
469
4464
1642
2711
3728
4805
4404
4723
700 635
2222
1511
3471
2340
3154
4577
1395
1816
4133
1411
2100
3438
3472
1657 60
503
2783
3296
143
4314397
1583
458
4253
2492
1461
3177
2080
4275
25243297
4847
4786
2391
1836
4299
1242
3489
1351
4477
2527
569
2621
573
4456
1243
1868
189
227
2303
1990
4254
1534
4801
3999
218 1500
3041
48
3634
1809
3251
1465
4259
3058
144
961
320
25753646
681
1792
677 3976
1490
615
4 13 4253
2492
1461
3177
1809
3251
1465
1242
3489
1351
4477
2575
2080
4275
25243297
3041
48
3634
4847
4786
2391
18364259
3058
144
4299961
320
2527
569
2621
573
4456
1243
1868
189
227
2303
1990
4254
1534
4801
3999
218 1500
3646
681
1792
677 3976
1490
615
4 13
4066121
2326
4471
1291
4655
3311
3954
18794329
2728
2655
3200
4896
1095
1038
1853
1394
691
3294
4298
81
4053
554
721
4588
1119
3980
1623
1188
4924
4423
3028
2185
1015
4323
1121
4794
3134
1218
4686
3823
2975
1437
4612785
1992
4165
2259
4549
1861 72
2940
2792765 4066121
2326
4471
1291
4655
3311
3954
18794329
2728
2655
3200
4896
1095
1038
1853
1394
691
3294
4298
81
4053
554
721
4588
1119
3980
1623
1188 785
1992
4924
4423
3028
2185
1015
4323
1121
4794
3134
1218
4686
3823
2975
1437
4612
18614165
2259
4549 72
2940
2792765
3594
19893852
1595
3057
1065
1785
1020
1672
2646
2257 3265
2574
3345
2681
724
2965
1730
3942
1984
4213
3819
535
2559
1391
4630
566
3670
2404
3886
3575
4443
2585
880 77
1124
2899
4288
3392
3445
4454
958
312
3285
593
3960
27971368
2175
2526
2668 730944
303
4326 3594
1785
16723852
1595
3057
1065 3265
4288
19893392
2574
3345
593 77
1124
2899
3445
4454
958
2681312
3285
3960
1020
724
2965
1730
3942
1984
4213
3819
535
2646
2257
2559
1391
4630
566
3670
2404
3886
3575
4443
2585
880 2526
2668
2797 1368
2175
730944
303
4326
4463
4306
4263
743
353
2266
1537
1236
2165
1948
2482
4219
2904
2168
1829
4410
4359
3267
3174
3150
2217
4707
3370
4550
4509
4027
3598
3182
3602
23643355
1859
1401
675
531
2382
2989
986
1477
3692
2467
4363
1136
3497
4478 294
1743
1594 4463
4306
4263
743
353
2266
1537
1236
2165
1948
2482
4219
2904
2168
1829
4410
4359
3267
3174
3150
2217
4707
3370
4550
4509
4027
3598
3182
3602
23643355
1859
1401
675
531
2382
2989
986
1477
3692
2467
4363
1136
3497
4478 294
1743
1594
4295
196
1230
2480
1486
1926
889
3372
2762
25763650
3149
2052
3361
2856
3708
4244
2994
1031
4296
129
4626
1199
1077
2769
3049
4202
3259
2187
729
774
2901
2376
4153
2086
3284 852
3530
4008
2005
250
2038
319
3562
626
2867 1650
239
1469
2995
433
920
3206
89
1643 4295
196
2480
1486
19261230
889
3372
4244
2994
1031
42963650
3149
2052
3361
2856
3708
129
2762
2576
4626
1199
1077
2769
3049
4202
3259
2187
729
774
2901
2376
4153
2086
3284 852
3530
4008
2005
250
2038
319
3562 1650
1469
2995
3206
89
1643
626
2867 239
433
920
4075
1632
4293
648
3139
3257
2012
4866
1130
4945
3453
2592
4343
2206
1495
2135
1903
1342
2795
1628
4636
2846
395
4161
4137 383
4806
3974
974
715
2445827
2604
4703
2068
763
1057
4023
4963
934
2037
199617138
560
3043
343
457
1581670
149
4268
2350
4398
3357 4075
1632
4293
648
3139
3257
2012
4866
1130
4945
3453
2592
4343
2445
4636
4161
4137 383
4806
974
715
3974
827
3043
2604
4703
2068
2206
1495
2135
1903
1342
2795
1628 457
763
1057
2846
395
934 158
4023
4963
2037
1996 17138
560
343
1670
149
4268
2350
4398
3357
1759
460917
3831
1159
2252
4829
3603
346 316
2319
1851
229574
180
4340470
4014
2827
2347 4773
1237
4615
3478
2267
1134
13303635
6
892 96 1759
460917
3831
1159
2252
4829
3603
346 316
2319
1851
229574
180
4340 4773
470
4014
2827
2347 1237
4615
3478
2267
1134
1330 3635
6
892 96
Hierarchical Clustering Dendogram

3971
3531
3735
1169
3935
3772
614
2910
4482
21944155
3488
4817
2772
1150
955
3525
3756
1564236
3051
2459
4025509
588
4168
3702
2552
1336
2065
2739
658499
2388
32228764658
3346
257
33198
1665
4611
2848
4557
4091
1275
744 081
2389 3971
3531
3735
1169
3935
3772
614
2910
4482
21944155
3488
4817
2772
1150
955
3525
3756
1564236
3051
2459
4025509
588
4168
3702
2552
1336
2065
2739
658499
2388
3222 4658
3346
876257
3
4557
4091
1275
744 3198
1665
4611
2848
081
2389
1857
991
41941833
2567
19382205
3861
4132
1239
4217
3506
3143 3929
425
3380
3440
1912
1308
4952
42073674
856
619
32
4814941
3870
4809
4442
2754
1248
3911 37
4490
3056
3759
3166
3415
757
634
3938966
1112
3619
840
2332 650
1723
362
3050
17083059
2865 3568 1857
991
4194
42171833
2567
19382205
3861
4132
1239
425
3380
3440
1912
1308
4952
4207
4814941
3870
4809
4442
3938 3929
3674
856
619
32
2754
3506
3143 37
4490
1248
3056
3759
3166
3415
757
3911
634
966
1112
3619
840
2332
1708 650
1723
362
3050
3059
2865 3568
238
4662
4403
2043
14301466
3070
2460
125
4946
2327
26224554
2089
5812441
1590
2962
17712839
4748
3427
896
3127
2424
2679
2282
578
2103
2807
4647
2560
3757 928
978
2081
2091
3377
4971
3318
4664
4776
2596
3815
616 117
3054
3925
899
4270 4
1110
4964
2529
1170 00 238
4662
4403
26221466
3070
2460
125
4946
2327
4554
2089
581
2424
2043
1430 2441
1590
2962
17712839
4748
3427
896
3127
2679
2282
578
2103
2807
4647
4776
2596 928
9782081
2091
3377
4971
3318
2560
3757
3815 3054
3925
4664
899
4270
616 117
1110
4964
2529
1170 00
3928
1189
2717
2573
4783
2819
3432
3447
1542
2624
516
4619
1003263
3657
3487
1197
4698
2845
4682
4011
3277
2957
1574
44022874
1904 1
4035688
2418
3936
4294
1573
1880 568
1055
1090
1007
184 964
53
1388
3846
1098
2636
2237
1866
884
4848
206
3461 522
4287
1285 47 3928
1189
2717
2573
4783
2819
3432
3447
1542
2624
516
4619
1003263
4035
1007
184
3657
3487
1197
4698
2845
4682
4011
3277
2957
1574
44022874
1904 1688
2418
3936
4294
1573
1880
4848
206
3461
4287
4568
1055
1090
964
53
1388
3846
1098
2636
2237
1866
884
522
1285 47
3787
4965
2452
1036
4923
3165
2514
3132
1889
3233
4342025
3748
2254
3799
4523
3838
1649
4582
274
557
165 673
1273
4455
2333
2774
11172662
1021
3099
1152416
3073
3529
863
3541
811
794683754
1515
2393
2015
2697 809
4113
1059
1278
15703131
3026
2352
3375 3787
11172025
3099
1152
4965
2452
1036
4923
3165
2514
3132
1889
3233
434 3748
2393
2015673
1273
4455
2333
3073
3529
2254
3799
4523
3838
1649
4582
274
557
165863
3541
811
794 3754
1515
2697
2774
2662
1021
416
68 809
4113
1059
1278
1570
33753131
3026
2352
1885
2270
4002
175
850
3413
1709
4761
2384
2879
3592
3591
3138
2275
1751
4496
1951
190
2938
2816
3825
1241
1374
4473
1706
2027
171562
2002
3172
851
3097
2554
1946
1069
4606
819
3574
2411
4575
1755
625
33603105
4791
3636
3348
2386 1885
2270
4002
175
850
3413
1709
4761
2384
2879
3592
3591
3138
2275
1751
4496
1951
190
2938
2816
3825
1241
137462
2002
4473
1706
2027
1715
3172
851
3097
2554
1946
1069
4606
819
3574
2411
4575
1755
625
33603105
4791
3636
3348
2386
4792
1220
818
447
91
2157
2934
4089
3991
3658
1660176
44373226
3095
4904
164
1968
1084
3382
2955
4855
2519
701
838
3824
3406
1025
2745
2808
1609 08 4792
1220
818
447
91
2157
1084 176
44373226
3095
4904
164
1968
3382
2955
4855
2519
2934
4089
3991
3658
1660
701
838
3824
3406
1025
2745
2808
1609 08

rect.hclust(hclust.euc, k = 4, border = "red")


4937
1227
3444
432
3709
1531
3327
1778
3797
2132
2060
773
4769
2701
1380
3739
420
3296
3151
4822
2453
1436
254
4627
4367
4063
3790
2479
865
3851
3188
4823
3659
2225
2255 4
3281
1174
4379
1175
11333812
2138
575
141
3518
3129
4446
202636523
1
333450251
775 4937
1227
3444
432
3709
1531
3327
1778
3797
2132
3518
3129
44461174
4379
1175
1133
2026
2060
773
4769
2701
1380
3739
420
3296
3151
4822
2453
1436
254
4627
4367
4063
3790
2479
865
3851
3188
4823
3659
2225
2255 4
3281
3812
2138
575
141
36523
1
3334 50251
775
3881
3503
1194
4958
381
372
3910
2417
1235
4535
3135
782
878
3442
2707
2842
1223
489
4397
1343
3429
3367680
3110
2733
1481
3082
3475
154
3142
1795
4479
1562
8323072
817
1131
1085
2473
1713
4186
3501
2325
11353004
3614
2317
716
18 3881
3503
1194
4958
381
372
3910
2417
1235
4535
3135
782
878
3442
2707
2842
1223
489
680
3110
2733
1481
3082
3475
154
4397
1343
3429
3367
3142
1795
4479
1562
832
2325
1135 3072
817
1131
1085
2473
1713
4186
3501
3004
3614
2317
716
18
3397
4338
1392
859
2963
4518
3160
3252
3218
1333
983
1918
3499
4581
4147
281
2750 3590
2586
835
1261
4580
1646
4084
41
214
1067
3523
3248
829
421
308
3889
2927
3957
1758
2998
331
2279
3737
4749
1274
621
116
3035
2953
4292
408
482
3755
4785
155
205
104 3397
4338
1392
859
2963
4518
3160
3252
3218
1333
983
1918
3499
4581
4147
281
1067
3523
32483590
2586
835
1261
4580
1646
4084
41
214
829
421
308
3889
2927
3957
1758
2998
331
2279
3737
4749
1274
621
2750116
3035
2953
4292
408
482
3755
4785
155
205
104
4906
4902
4450
2908
4711
4193
2419
2817
4227
2690
443
1167
4672
4653
4815
1383
3244
1746
1319
3508
2618
3770
3103
4065
2357
4507
2073
728
587
565
602
2020
2358
2852
450
199
3153
3956
394
2667
2377
4755
2263370
2999
2250
2990
2056
1633
2993
4982
1446
389270
4665
3662 4906
4902
4450
2908
4711
4193
2419
2817
4227
2690
443
1167
4672
4653
4815
1383
3244
1746
1319
3508
2618
3770
3103
4065
2357
4507
2073
728
587
565
602
2020
2358
2852
450
199
3153
3956
394
2667
2377
4755
2263370
4665
3662
2999
2250
2990
2056
1633
2993
4982
1446
389270
3215
1886
3656
160
153
703
3303
1449
4883
2420
2261
1177
2639
4086
3190
921
4780
4092
3205
3161
2918
3972
2042
1205
4708
1764
2948
3794
2495
3422
17501812
518
3358
1393
1379
870
204
3669
1551424
3802
2200
561
3264766
355
295
3424
3164
1357
40833374
1282
4620 3215
1886
3656
160
153
703
3303
1449
4883
2420
2261
1177
2639
4086
3190
921
1551
3802
2200
561
326
4780
4092
3205
3161
2918
3972
2042
1205
4708
1764
2948
3794
2495
3422 1812
518
3358
1393
1379
870
204
3669424
4766
355
1750
295
3424
3164
1357
40833374
1282
4620
4475
3587
2770
1280
4181
4469
1494
1083
40704890
1163
1507
3793
3538
1882
2414
3641
1842
1200
3713
1307
584
505
4498
4465
4935
3492
792 4447
3835
2256
334
4449
4504
2242
4300
936
465
805
34
2746
1187
2873
2096
867
1299
3981
2108
4097
3136
1830
4143 2583 33 4475
3587
2770
1280
4181
1882
24144890
1163
1507
3793
3538
4300
936
4654447
3835
2256
334
4469
1494
3641
1842
1200
4449
4504
2242
805
1083
4070
3713
1307
584
505
4498
4465
4935
3492
792
3981
210834
2746
1187
2873
2096
867
1299
4097
3136
1830
4143 2583 33
1042
3204
1779
4382
2849
3399
2547
3319
1662
1522
4150
451390
984
937
3439
1808
45524835
4975
1802
3516
2440
4779
2886
3697
3645
273
552
3483
825
2516
897
2581
2594
4396
1195235
3917
246
4297
2487 4105
4939
4499
3329
4174
413
3615
2689
3550
1530
919
1363 1042
3204
1779
4382
2849
3399
2547
3319
1662
1522
4150
477990
984
937
3439
1808
45524835
4975
1802
3516
2440
4939
4499
3329
4174
413
3615
4513
2886
3697
3645
273
552
3483
825
2516
897
2581
2594
4396
1195235
3917
246
4297
24872689
3550
1530
919 4105
1363
3025
4760
4349
1976
4134
2311
42581738
3612
3572
4005
2156
407178
3094
1012
4267
396
491
1116
3952
1827
3696 272
1262
4123
3791
3841 2775
1876
2499
2179
3005
2974
611
3167
2491
288
1245
904
2760
4956
871
4010
1256
3543
1856
579
1028 3025
4760
4349
1976
4123
3791
3841
4134
2311
42581738
3612
3572
407
1116
3952
1827
3696 178
3094
1012
4267
396
4005
2156 272
1262
2775
1876
2499
2179
3005
2974
491
611
3167
2491
288
1245
904
2760
4956
871
4010
1256
3543
1856
579
1028
3753
310
508
1800525
3379
4566
4857
452
1106
3934
2535
4568
2969
1517
4282
4579
2425
731570
4034
3122
913
2506
1076
3806
4073
4967
3273
25284
2361
166
88
4753
4572
2378
3476
3180
943 3784
1692
42602305
2675
3383
382
3044
1963
4556
3842
4077 2264
643 2674 3753
310
508
1800525
3379
4566
4857
452
1106
4967570
4034
3122
913
3934
2535
4568
29692506
1076
3806
4073
3273
25284
2361
166
4282
4579
2425
73188
4753
4572
2378
3476
3180
1517
943
42602305
2675
3383
382
3044
1963
4556
3842
4077 3784
1692
2264
643 2674
4107 4090
4214
2543
3578
33532961
2738
1408
1323
1916
1454
3264212
1287
3930
3235
22
1832
4419
2245
2650
2457
965
1334
3384
4928
2494
1071
226
1114
2163
2920
3418
1910
4059
2826
3455
3584
2991 1728
4693
4594
4265
3398
3736
3477
2461
4285
755
618 209
9892369
3967 4107
32644090
2961
2738
1408
1323
1916
1454
212
3930
3235
22
3384
49281287
1832
4419
2245
2650
2457
965
1334
2494
1071
1910226
4214
2543
3578
3353
1114
2163
2920
3418
4059
2826
3455
3584
2991 1728
4693
4594
4265
3398
3736
3477
2461
4285
755
618 209
9892369
3967
1668
4136
2354
3871
669
3479
2648
4515
2387
4629
2556
4771
2164
4427
3589
4784
687
2450
1298
1763
2142
1825 3862
521
3202
4514
4195
1002
2537
4886
1701
2202
3033
2251
2564
286
2088
1463
1375
3678
1773
2363
3776
1420
3808
139
124 231
3169
446
4036 1139
52344 1668
4136
2354
3871
669
3479
2648
4515
2387
4629
2556
4771
2164
4427
3589
4784
6873862
521
3202
4514
4195
1002
2537
2450
1298
1763
2142
1825
2251
2564
286
2088
1463
1375446
4886
1701
2202
3033 231
3169
4036
3678
1773
2363
3776
1420
3808
139
124 1139
52344
1355
914
3350
3121
2155
41004171
1268
2801
369
1192
3378
4122
4602
2824
2699
4650
4361
2929
1086
2919
2546
122
3707
3491
2381
3599823866
3029
2115
4802
645
3078
26084750
2153
702
530
2725
4101
1864
4283
1612
3939
3758
347385
3159
2609
506
229 9 3716
4328 1355
3350
3121
2155
914
4171
1268
2801
369
1192
3378
4122
4602
2824
2699
4650
4361
3078
2608823
2153
702
530
4100
2929
1086
2546
122 866
3029
2115
4802
645
4750
2725
4101
29191864
4283
1612
3707
3491
2381
3599
3939
3758
347385
3159
2609
506
229 9 3716
4328
624
422
2047
4862
1939
3414
3146
2232
1790
16614752
1421
4979
4172
2183
3699
4775
1597
954
821
2812
1735
3816
3027
2063
287
4900
2075
4625
3408
1193
4364
3803
1841
192
410
1444
2220365
710
2493
268
4117
2832
3747485
4173
2148
1044 549
1206
202 195
6
1396 624
422
2047
4862
1939
3414
3146
3699
22324752
1421
4979
4172
2183
4775
1597
954
821
2812
1735
3816
3027
2063
268
4117
2832
3747
287
4900
2075
4625
3408
1193
4364
3803
1841
192
410
1790
1661
1444
2220365
710
2493
485
4173
2148
1044
202 549
1206
195
61396
3731
3187
2751
2112
24361962
664
3883598
2744
1276
3813
2642
2825
546
31752321
3495
110
4932
4716
893
1339
563
795
2829
1619
2718
1006
4362
1849
426
1865
2818
600
4543
3876
3905
3626
3867
1549
128
18978259
30609
87
3293
4 96
830 3731
3187
2751
2112
24361962
664
3883
2818598
2744
1276
3813
2642
2825
546
31752321
3495
110
4932
4716
893
1339
563
795
2829
1619
2718
1006
4362
1849
426
1865
600
4543
3876
3905
3626
3867
1549
128
1897 8
259
3060
3293
4 9
87 96
830
1289
1264
1896
1417
639
1843
1010
4488
19754719
2197
1862
174
3833
3322
3366
1225
39
1513
4313
4746
1945
336
1641762
1229
3549
4418
1527
3102
2900
1725
1838
2805 1476
4813
276
3067
1587
1520
2231 4740
534
313
497
1547
1509
4344
3873
541
245
2079957
540888 1289
1264
1896
1417
639
3366
19754719
2197
1862
174
3833
3322
1225
39
1843
1010
44881513
4313
4746
1945
336
1641762
1229
3549
4418
1527
3102
29001476
4813
276
1725
1838
2805 3067
1587
1520
2231 4740
534
313
4971547
1509
4344
3873
541
245
2079 957
540888
2120
1965
1414
1033
2098
4652
31062013
3485
4576
4539
4559
4112
2863
2276
4516
40813359
3111
3640
2127
4069
4859
3552
2959
4140
1032
2523
1580
806
3745 57
3982
590
3332
3315
601
3467
311
1824 2
878
2258
2221
1398
2169 253
1313
4910
3085
2799
2613
1775 29
3316
2423 2120
1965
1414
1033
22762098
4652
31062013
3485
4576
45393359
3111
3640
2127
4559
4112
2863 4069
4859
3552
2959
4140
1032
4516
4081 57
3982
2523
1580
806
3745
3467
311
1824 590
3332
3315
601 2
878
2258
2221
1398
2169
4910
3085
2799
2613
1775 253
1313
3316
242329
3947
1722
4820
3532
2118
4324
2764
4834
2822 2306
2283
2806
4583
594
3486
637
135
2727
4667
4831
1438
4121
3617
3333 146
4231
4222
2048
413069 1789
1154
4574
4689
1663
2979
4429
1705
2285
2248
401
11583949
3410
4289
953
405
766 47352893
656
3308
4392
1472 907 3947
1722
4820
3532
2118
4324
2764
48342306
2283
2806
4583
2822594
3486
637
135
2727
4667
4831
1438
1158
4121
3617
3333
4231
4222
2048
413069 1789
1154
4574
4689
1663
2979
146
4429
1705
2285
2248
401
3410
4289
953
405
766 3949
4735 2893
656
33084392
1472 907
3859
2383
3236
551
473
337
4377
4394
2166
976
1328
1097
48003365
1060
1311
875
4654
3340
1462
3157
2983
487
3919
4724
3534
4039
150596
2696
1971
1689
3781
3644
4878
3113
3002
2694
4468
4080
1647 671
654
3760
3266
2630
2481275
1582
909 27
1062 737861 3859
2383
3236
551
473
337
4377
4394
2166
976
1328
1097
48003365
1060
1311
875
4654
3340
1462
3157
2983
487
3919
4724
3534
4039
150596
2696
1971
1689
3781
3644
4878
3113
3002
2694
4468
4080
1647
3266
2630
2481 671
654
3760
275
1582
909 27
1062 737861
4345
4649
1932
4434
2607
869
2549
3953
2478
2635
1335
484
3897
2876
1724
1370
1198
3458
3718
4528
4369
4849
3335
2956
694
3404
2273
2147
2682
4428
1621
4115
4391
4022
3866
3387
3848
35791482
1111
4542
580
4212
15413
2705
159
949
289 778 4345
4649
1932
4434
2607
869
2549
3953
2478
2635
1335
484
3897
2876
1724
1370
1198
3458
3718
4528
4369
4849
3335
2956
694
3404
2273
2147
2682
4428
1621
4115
4391
4022
3866
3387
3848
35791482
1111
4542
580
4212
15413
2705
159
949
289 778
4218
1096
4685
3230
4880
1516
4308
2432
2600
1402
3096
1410
4885
3468
4040
2903
3117
4050
4415
2484
1967
4321
761
3744
2297364
3261
4907
1099
1664
2045
4709
4351
3237
789
3544
1943
10754492
2647
905
646
2831
3583
1770
735
21014003
4474
1214
21973
998 725
1058 4218
1096
4685
3230
4880
1516
4308
2432
2600
1402
3096
1410
4885
3468
4040
2903
3117
4050
4415
2484
1967
4321
761
3744
2297364
3261
4907
1099
1664
2045
4709
4351
3237
789
3544
1943
10754492
2647
905
646
2831
3583
1770
735
21014003
4474
1214
21973
998 725
1058
3030
2664
3829
4302
4017
4221
2177
3047
3042
1455
4697
3616
2287
4899
4000
3685
1922
2907
3452
16813703
2006
3062
167
786
3970
1793
4701
179
3104
527
4741
596
213
689
1814
1066
1629
2109846
1483
3512
2472
291
1478
1184
2228
4079
1966
1091
479 1
3521
3309 126 782 3030
2664
3829
4302
4017
4221
2177
3047
3042
1455
4697
3616
2287
4899
4000
3685
1922
29073703
2006
3062
167
786
3452
3970
1793
4701
3104
179
1681
527
4741
596
213
689
846
1483
3512
2472
291
1478
1184
1814
1066
1629
2109
2228
4079
1966
1091
479 1
3521
3309 126 782
1210
4729
2180
2033
4284
3700
1553
3821
22463843
2666
3443
4497
3170
813
529
2888
1898
4816
4893
249
2610
1950
1447
2397
4950
523
4444
4228
4614
2584
496
1324
985 1140
1356
4879
2732
1895
1182
411
3779
2192
1977
3727
1409
1567
4041
3555
1484
2563
1700
2926
16451589 1210
4729
2180
2033
4284
2888
1898
4816
37003843
2666
3443
4497
3170
813
529
4893
249
2610
1950
1447
2397
4950
523
4444
4228
4614
2584
1553
3821
2246496
1324
985
411 1140
1356
4879
2732
1895
1182
3779
2192
1977
3727
1409
1567
4041
3555
1484
2563
1700
2926
16451589
3013
2880
2003
3524
4966
38323344
3618
2024
3385
1523
445
183
2683
2593
636
3571
726
2035
3622
312898
3774
2062
1137
4320
3548
2170
4536
3189
211
1626
3238
2970
4170
1957
4197
4175335
142
293
2981
1305
4790
1120
4608732
3903
4095
1786
4679
759 6
1271 3 3013
2880
2003
3524
3548
21703344
3618
2024
3385
1523
445
183
2683
2593
636
3571
726
2035
3622
312898
3774
2062
1137
4320
4536
3189
211
4966
3832
1626
3238
2970
4170
1957
4197
4175
4608 335
142
293
2981
1305
4790
1120
732
3903
4095
1786
4679
759 6
1271 3
3773
2538
2203
3295
1349
4604
1566
1637
839
4266
28554567
107
1337
3765
1277
4006
3088
4586
1286
3978
2713
4216
2598
1852
1224
4120
3048
2644613
4200
2637
214651
2715
2158
1372660
2651 847
1731
1585
963
357
244
2044
2021848
185
4887
29851622
828 26
5481 3773
2538
2203
3295
1349
4604
1566
1637
8394567
107
1337
3765
1277
4006
3088
4586
1286
3978
2713
4216
2598
4266
2855
1852
1224
4120
3048
264
2637
2146 4613
4200
51
2715
2158
1372660
2651 847
1731
1585
963
357
244
2044
2021
2985848
185
4887
1622
828 26
5481
3433
1204
2313
2009
1960
1493455
606
4838
4413
1257
2097
1867
2661
2660
1576
3250
3820
1448328
4191
3038
4110
1123
18742652
2058
3196
1081
4915
4633
4947
4717
2579
4668
2968
1442
999
717
3805
3600 64
187
304
1524
4833 102 17 3433
1204
2313
2009
1960
1493455
606
4838
4413
1257
2097
1867
2661
2660
1576
3250
3820
1448328
4191
3038
4110
1123
18742652
2058
3196
1081
4915
4633
4947
4717
2579
4668
2968 64
187
1442
999
717
3805
3600 304
1524
4833 102 17
4334
3675
2462
2196
1211
4670
4085
4824
2791
4111
1844
193371
607
3090
895
3979
3269
4897
2207
3036
834
3940
2557
2534
4184
1246
4234
16661536
1734
3673
1546
19443964
994
2510
2884
3247
2489
2360
2395
4309
340
2186
2530
4310
553
194
467 36
393
318
215
4551 68
644 4334
3675
2462
2196
1211
4670
4085
4184
1246
4234
1666
4824
2791
4111
1844
193371
607
3090
895
3979
3269
4897
2207
3036
834
3940
2557
2534
1734
3673
1546
2510 1536
3964
994
2884
19443247
2489
2360
2395
4309
340
2186
2530
4310
553
194
467 15
236
318
455139 68
3644
2741
2405
466
428
368
3798
1233
1149
3927
4782
1539
4503
2522
1635
931
3946
3564
1369
868
3801
2454
4114
1973
3828
451
2486
1215 605
539
4407
4347
2337
3258
111647
4208
2964
1056 993
472
1202
1378
4681
2016
1675
2113604
2640 537 2741
2405
466
428
368
3798
1233
1149
3927
4782
1539
4503
2522
1635
931
3946
3564
1369
868
3801
2454
4114
1973
3828
451
2486
1215
3258 605
539
4407
4347
2337
111
4208
2964
1056 993
647472
1202
1378
4681
2016
1675
2113 604
2640 537

D.euc
D.euc
4164
2747
25823006
4399
2399
1364
2617
4016
1581519
4738
1405
4532 2085
3148
1923
4683
2973
2465
2134
3633
3522
3643
2463
2402
4854
3015
2485
2007
1947
2117
49683546
4230
3069
2076
4970
4843
4770
282 4332
4256
3112
173 623
4360 82 4164
2747
25823006
4399
2399
1364
2617
4016
1581519
4738
1405
4532 2085
3148
1923
4683
2973
2465
2134
3633
3522
3643
2463
2402
4854
3015
2485
2007
1947
2117
49683546
4230
3069
2076
4970
4843
4770
282 4332
4256
3112
173 6623
4360 82
4487
2906
2782
2623
4936
25
2398
2116
1828
1100
754
1676
240
4281661
3511
1027
764
3420
3580
1717
3080
359
1360
2338
2290
612
4152
1079
363
2286
1733
4864
3209
1051
4578
2798
2742
2143
1510
550
3450900
820
1586
3108
431
940
62199
1753
17291435
4389
251 4487
2906
2782
2623
4936
25
2398
2116
1828
1100
754
1676
240
4281661
3511
1027
764
3420
3580
1717
3080
359
1360
3209 92
2338
2290
612
4152
1079
363
2286
1733
4864
1051
4578
2798
2742
2143 900
820
1586
1510
550
3450
3108
431
940
1753
17291435
2199
4389
251
2416
4021
3632
4955
4430
3540339
2649
399
2723
1893858
3786
1253
3017
3331
1208
2878
1560
2240
3505
1045
2234
3857
3120
685 392
100
704
1064
4459
3901
3520
1958
3071
2004
4903 4393
3437
511
2392
942
2429
4335
4177
1080
46
4796
4441
3610
4922
1739
4103
2663 944 2416
4021
3632
4955
4430
3540
1893339
2649
399
2723
858
3786
1253
3017
3331
3071
2004100
704
1064
4459
3901
3520
1958
4903
1208
2878
1560 3
4393
3437
511
2392
2240942
3505
4335
4177
1080
104546
2234
3857
3120
685 2429
4796
4441
3610
4922
1739
4103
2663 944
4355
1822
2152
4726
4673
4405
2633
890
1899
3436
2216
4119
18312714
2247
1073
3811
1767
970
367
4390
3272
2755
1453
2036
1685
3016
1529
2083
2602
4598
4102
1575
4777
1908
1361
796
221 672
3998
4593
1871
4301
1667 52850
883
4141
1554
261
4269 14
388
66
4426
3125225 4355
1822
2152
4726
4673
4405
2633
890
1899
16852714
2247
1073
3811
1767
970
367
4390
3272
2755
1453
2036
3016
1529
2083
2602
3436
2216
4119
1831 883
4141
4598
4102
1575
4777
1908
1361
796
221 672
3998
4593
1871
4301
1667 52850
1554
261
4269 14
388
66
31254426
225
1935
1625
2960
2605
4597
2497
3704 4485
3965
4973
3740
4436
28434819
1453705
4882
2860
1306
2210
4610
11464037
3045
2789
2566
102
1029
79
3734
2597
1791
3193705
2800
1741
4158
2128
3156
4913
4290
38531636
1431
3533
1863
2614
990
668 3012
1550
6413874
2053
1376 44361935
1625
28432960
145
11464485
4973
3740
1306
2210
4610
2566
102 3965
4819
3705
4882
2860
4037
3045
2789
1029
79
2605
4597
24973734
2597
1791
3193705
2800
1741
4158
2128
3704
3853 3156
4913
4290
1636
1431
3533
1863
2614
990
668 3012
1550
6413874
2053
1376
2885
4071
3304
1384
430
3563
3074
4638
2107
1872
1212
2868
1691371
1544
1316
975
35
2214
3565
2749
4720
1902
750
3855
2343
2151
657
4553
3672
2611
1452
4199
147
3682
1900
4799
3240
4721
2545
132
3031 2885
4071
3304
1384
430
3563
3074371
1544
1316
975
35
2214
3565
2749
4720
1902
750
3855
2343
2151
657
4553
3672
4638
2107
1872
1212
2868
1691
2611
1452
4199
147
3682
1900
4799
3240
4721
2545
132
3031

hclust (*, "complete")


hclust (*, "complete")

4522
1108
2330
1870
879
708
4012
3724
3717
536
510
3133
4538
2980
1618
4229
1998
37014530
2684
2359
3945
4731
3968
1593
3364
2040
3536
1805
2872
4562
3688
4688
363886
4508
3982
1429
172
448967
1696
3977
267
515
2415
3019
3227
4747
3299
2532
3197
348 3347428 4522
1108
2330
1870
879
708
4012
3724
3717
536
510
3133
4538
2980
1618
4229
1998
37014530
2684
235986
4508
3982
1429
172
4489
3945
4731
3968
2040
3536
1805
2872
4562
3688
4688
3638 67
1696
3977
267
515
1593
3364
2415
3019
3227
4747
3299
2532
3197
348 3
347 428
4772
1082
412
325
2469
4908
822
960
73970
3307
2149
1561
758
2615
3401
2653
1745
1203
1353
979
3053
3253
2851
3234
2084 1823
2339
2971456
2756
3751
2518
2262
3490
4223
37802099
3130
4154
11014378
4109
3089
415 713
3260
4149
1017 2501
4591366
1607243377 4772
1082
412
325
2469
4908
822
960
73970
3307
2149
1561
758
2615
3401
2653
1745
1203
1353
979
2971456
2756
3751
2518
3053
3253
2851
3234
2084 1823
2339
2262
3490
4223
37802099
3130
4154
11014378
4109
3089
415 713
3260 2501
1366
1607
4149
1017
459 243377
3611
475
2346
4867
4570
4350
37612366
1186
1301
4239
3804
4368
3098
3336
1678
3691
3425
2986
2932
1040
663406
4131 1978
586
3925
2
659 988
71 3611
475
2346
4867
45702366
1186
1301
4239
3804
4368
3098
3336
1678
4350
3761
3691
4061978
586
23
659925
988
71
Cluster Dendrogram
3425
2986
2932
1040
663
4131
Cluster Dendrogram

3623
4680
2322
2274
1228
4869
3630
4157
4826
2577
134
3844
2672
4376
2159
4047
3232
3762
2520804
14
784
864
4839
3221
4921
2686
749
997
824
387
3055
1035
4961
3807
555
4303
4648
1653
2508
742
1326
1283
1290
49574305
1
872
2835 563 3623
4680
2322
2274
1228
4869
3630
4157
4826
2577
134
4047
3232
3762
2520804
14
3844
2672
4376
2159
784
864
4839
3221
4921
2686
749
997
824
387
3055
1035
4961
3807
555
4303
4648
1653
2508
742
1326
1283
1290
49574305
1
872
2835 563
3288
3352
1460
4743
2071
1555
1179
46691997
1924
2595
109
2500
2218
4671
3011
653
3836
3629
1627
1023
2055
901
951
277
2427
3742
2620
3449
3985
4336
41894261
1480
2933
3989
32491877
4486
4607
3152
1798
3211
2676
3987
4424
995
2031
1094
4919
4138
2550
1956 2302 11 3288
3352
1460
4743
2071
1555
1179
46691997
1924
2595
109
2500
2218
4671
3011
653
3836
3629
1627
1023
2055
901
951
277
2427
3742
2620
3449
3985
4336
4189
4607
31524261
1480
2933
39891877
4486
1798
3249
3211
2676
3987
4424
995
2031
1094
4919
4138
2550
1956 2302 11
4728
1265
215
2840
280
113
1584
4977
3368
309
1367
2105
2409
1907
4873
1386
4432
292
4687
3305
3326
3932
440
21
4054
4872
4225
1820
3586
444
2171
1434
3951
3224
3729
3101
1310
3008
1787
3394
787
398
3061
2139
2786
2471
2201
3093
1577
4045
2866
9591475
290 4728
1265
215
2840
280
113
1584
4977
3368
309
1367
2105
2409
1907
4873
1386
4432
292
4687
3305
3326
3932
440
21
4054
4872
4225
1820
3586
444
2171
1434
3951
3224
3729
3101
1310
3008
1787
3394
787
398
3061
2139
2786
2471
2201
3093
1577
4045
2866
9591475
290
1416
1602
4425
3885
4545
16904797
3402
4128
3732
3771
1502
826
2422
32142708
1894
3306
3086
815
2833
4484
3091
4204
3163
2289
1699
748
935
3984
4236
1039
972
292837340
2949
4322
2348
4255917
3868
4352
3941
1331
3396
1598
4860
3052
1533 1416
1602
4425
3885
4545
16904797
3402
4128
3732
3771
1502
826
2422
32142708
1894
3306
3086
815
2833
4484
3091
4204
3163
2289
1699
748
935
3984
4236
1039
972
292837340
2949
4322
2348
4255917
3868
4352
3941
1331
3396
1598
4860
3052
1533
3782
946
3426
812
799
152
2466
2612
3298
4033
797
4055
1345
4502
1252
1399841
1354
3325
45003493
2571
4082
3631
4206
4315
1371
3342
1720
631
379
2051
574
2768
1937
1506
2219
2193
4445
3722
2870
4365
4015
3627
4642
3185
1499
1468
4621
1942
1638
1302
4145
939 3782
946
3426
812
799
152
2466
2612
3298
4033
797
4055
1345
4502
1252
4315
1371841
1354
3325
45003493
2571
4082
3631
4206
3342
1720
631
379
1399
2051
574
2768
1937
1506
2219
2193
4445
3722
2870
4365
4015
3627
4642
3185
1499
1468
4621
1942
1638
1302
4145
939
4235
2882
4571
1068
2154
228
3116
2095
4850
4373
2292
4599
2281
4713
814
603
278
4116
3620
1295
1498
2066
3183
2110
3596
2312
3498
911
3686
4519
3283
4190
4127
2590
4694
1765
4470
127
350 26
94
831
4601
4781
500
695
449
3743
4666
2204185
2589
3092
652
119 4235
2882
4571
1068
2154
228
3116
2095
4850
4373
2292
4599
2281
4713
814
603
278
4116
3620
1295
1498
2066
3183
2110
3596
2312
3498
911
3686
4519
3283
4190
4127
2590
4694
1765
4470
127
350 26
94
831
4601
4781
500
695
449
3743
4666
2204185
2589
3092
652
119
3714
3661
1419
2565
1441
112
4160
3254
4237
962
790
269
2443
2260
4675
4960
3637
498
2992
3032
3566
409
572
4501
2922
2925 4078
1176
1165
3653
4927
4370
4243
59
2167
1845
1132
4438
495
493
3517
1433
4763
341
80
1972
1927
2627
2629
4431
2401
1459
4616
3115
4774 3714
3661
1419
2565
1441
112
4160
3254
4237
962
790
269
2443
2260
4675
4960
3637
498
2992
3032
3566
409
572
1165
3653
49274078
1176
4370
4243
59
2167
1845
1132
4438
495
493
4501
3517
1433
4763
341
80
2922
2925
1972
1927
2627
2629
4431
2401
1459
4616
3115
4774
4311
170
1526
4031
3290
4042
3184
27944144
783
305
2857
2511
1087
3767
4951
4273
3915
133
414
4821
4480
1934
1034
1548
1608
140345
4248
4057
1760
1267
517
2106
640
1503
4026
4789
1565
2298
4858
3040
4493
2243
1683
2753
2426
894
4631
2820
1528
2914
486
9483854 4311
170
1526
4031
3290
1608
14034144
783
305
2857
2511
1087
3767
4951
4273
3915
133
414
4821
4480
1934
1034
1548
45
4248
4057
1760
1267
517
4042
3184
2794
2106
640
1503
4026
4789
1565
2298
4858
3040
4493
2243
1683
2753
2426
894
4631
2820
1528
2914
486
9483854
4524
2133
2724
2719
1799
891
2784
1508
200
1445
266
3681
3519
2421230
4725
448
719
2230
1284
3906
157
2891
1604
4058
3462
4074
4247
1381
3667
781
366
391
3347
2712 1677
969
4412
4605
4874
628
2379
2011
4676
1698
768
314 7136
2023
3969
3888
543
56 236 53 4524
2133
2724
2719
1799
891
2784
1508
200
1445
266
3681
3519
2421230
4725
448
719
2230
1284
3906
157
2891
1604
4058
3462
4074
4676
1698 1677
969
4247
1381
3667
781
3664412
391
3347
2712
4605
4874
628
2379
2011
768
314
543
56 7136
2023
3969
3888 236 53
2897
2811
4712
1654
740
279
4007
2680
4981
699
2946
3738
2687
1846
1688720
3879
3593
1921
3144
2304
3721
4381
2344
544
323
2877
2272
1680
567
2288
1726
1047
2434
1891
2864
2533
3513
4357
3766
1207
745
3509
3514
4505
4205
1074
99 2897
2811
4712
1654
740
279
4007
2680
4981
699
2946
3738
2687
1846
1688720
3879
3593
1921
3144
2304
3721
4381
2344
544
323
2877
2272
1680
567
2288
1726
1047
2434
1891
2864
2533
3513
4357
3766
1207
745
3509
3514
4505
4205
1074
99
4692
3126
4520
3441
4333
4179
3992
2367
4020
23551669
4798
5592061
662
1491
3880
1373
4918
3664
2767547
2507
2022
4187
1041
1596
123
3830
13401693
950
4401
3750
4808
186
4745
2017
120
771
2942
14432249
2972
4546
1954
3419
301
2374
4241
403
727
3446
2752 711 4692
3126
4520
3441
4333
4179
3992
2367
4020
2355
3830
13401669
4798
559
4401
3750
48082061
662
1491
3880
1373
4918
3664
2767547
2507
2022
4187
1041
1596
1231693
950
186
4745
2017
120
771
2942
14432249
2972
4546
1954
3419
301
2374
4241
403
727
3446
2752 711
2375
1855
3655
655
390
980
4695
4098
4458
2291
2947
3990
3948
4863
1222
1143
3916
3000
801
3547
4067
1196
3923
836
967
3850
2136
4264
2475
28924452
3726
1883
674
886
3219
4387
3958
248
632
3140
1178
4099
1982 4
4691
2121
1406
2008
3671
1749
260
46402 096
168 2375
1855
3655
655
390
980
4695
4098
4458
2291
2947
3990
3948
4863
1222
1143
3916
3000
801
3547
4067
1196
3923
836
967
3850
2136
4264
2475
2892
32194452
3726
1883
674
886
4387
3958
248
632
3140
1178
4099
1982 4
4691
2121
1406
2008
3671
1749
260
46402168096
1742
3314
2191
2700
393
4635
4767
2912
1078
2090
207
25362720
4705
4540
1807
3654
1906
4660
2498
779
4481
3282
478
163
1769
1322
4348
2517
3576
1019
3920
4884
923
392
1579
2951
2837 3199
1839
1125
4865
2324
1615
12
1092
753
1311103
3896 161
811122
747
582 1742
3314
2191
2700
393
4635
4767
2912
1078
2090
207
25362720
4705
4540
1807
3654
1906
4660
2498
779
4481
32823199
1839
1125
4865
2324
478
163
1769
1322
4348
2517
35761615
1103
3896
1019
3920
4884
923
392
1579
2951
283712
1092
753
131 161
81 1122
747
582
1166
3448
3339
374
358
223
4124
2895
4088
877
679
577
4595
3607
2299
1063
3109
885
4596
3351
4622
4541
3741
1014
1648
1545
4807
4733
36934764
1651
4159
1423
380
1312930
4702
3330
3191
217
2069807
2301
1974
2411148
1819
3502
2268
4641
2898
17362896
1492 844 1166
3448
3339
374
358
223
4124
2895
4088
877
679
577
4595
3607
2299
1063
3109
885
4596
3351
4622
4541
3741
1014
1648
1545
4807
4733
36934764
1651
4159
1423
380
1312930
4702
3330
3191
217
2069807
2301
1974
2411148
1819
3502
2268
4641
2898
17362896
1492 844
3239
1172
3118
2659
4892
2601
3625
3274
4188
707
4632
2448
3155
3064
2351
3195 1061
3223
2656
3100 3924
3213
2161
1181
4754
1556
2294
4832 1784
4944
3569
556232
2902
4395
1168
568242
4043
4076
3263
2456
2362
4917
2407
156
988 1
3119 887 3239
1172
3118
2659
4892
3100
2601
3625
3274
4188
707
4632
2448
48321061
3223
2656
3924
3213
2161
1181
4754
1556
22941784
4944
31553569
232
2902
4395
556
1168
568
4076
3064
2351
31953263
2456
2362
4917
2407
156
988 242
4043
1
3119 887
2093
4940
3065
882
3554
2057
3680
1721
2734
19641138
3395
3321
2028
3890
1263
4696
4491
2737
1450
1279
3570
738
3995
3217
4453
3687
4876
2665
1878
37104870
3023
3066
2233
3908
2300
1631
589
4778
17962823
1673
3123
4400
3904
2771
121678
4742
1658 2541
342
4252 2093
4940
3065
882
3554
2057
4696
4491
3680
1721
4453
36871138
3395
3321
2028
3890
1263
2737
1450
1279
3570
738
3995
3217
4876
2734
19642665
1878
37104870
3023
3066
2233
3908
2300
1631
4400
3904589
4778
17962823
1673
3123
2771
121678
4742
1658 2541
342
4252
4949
4457
3789
1603
105
1479
4506
3178
4845
4472
2731
3676
2331
3834
3865
3898
3381
2439
4617
585562
3882
1251
4659
1535
4930
4009
3922
1191
3895
1599
810
4510
2365
4032
25043606
1569
3168
4354
13153
2129
4440
1415
3228
3909
4861
1115
2702
2320
162 826
23
83 42
840 4949
4457
3789
1603
105
1191
3895
1479562
3882
1251
4659
1535
4930
4009
3922
1599
810
4510
2365
4032
25043606
1569
3168
4354
1315
4506
3178
4845
4472
2731
3676
2331
3834
3865
3898
3381
2439
4617
585 3
2129
4440
1415
3228
3909
4861
1115
2702
2320
162 826
8323 42
840
1748
4386
1329
3162
1107
816
2515
4156
1949
4905
4804
2451
1640
11531413
1030
2010
1009
2082
741
4710
3210
4385
1970
3300
2861
3317
4512
4342
843
1711
1317
2836
1914
4727438
4383
3891
666
2778
109361
208 915
1048
299 3
3663
1704
2213
1428
2588
378
385 43 1748
4386
1329
3162
1107
816
2515
4156
1949
4905
4804
2451
1640
1153
4385
1970
3300
2861
2991413
1030
2010
1009
2082
741
4710
3210
1048
3317
4512
4342
843
1711
1317438
4383
3891
66661
208
2836
1914
4727
2778
1093 915
3
3663
1704
2213
1428
2588
378
385 43
4836
4931
2408
2862
3046
4933
3114
1521203
3421
441
2710
1385
4495
4106
453
3800462
4325
3986
3341
2540
2396
1171
4312
22783537
476
49
3545
1412
3877
830
2815
2505
4307
4068
3434
1955
1341
3390
3063
4812
3892513
106
2844
2124
3484
1162
352
4948
1981
2539 532 4836
4931
2408
2862
3046
4933
3114
1521203
3421
441
2710
1385
4495
4106
453
3800462
4325
3986
3341
2540
2396
11713537
476
49
3545
1412
3877
830
4312
2278
2815
2505
513
106
2844
2124
3484
1162
352
4307
40684948
3434
1955
1341
3390
3063
4812
3892
1981
2539 532
3393
1288
667
338
4337845
620
3539
1422
2137
1470
558
686
2722
1995
3720
1439
1011
3323
2041
20391644
1304
2561
2271
1258
4959
2943
1309
4146
1052
3561
2779
3553
2049
4411
2430
4388
39504461
3937
3581
4356
2114
4926
1991
857
3648
3527 3393
1288
667
338
4959
4337845
620
3539
1422
2137
1470
558
686
2722
1995
3720
1439
1011
3323
2041
20391644
1304
2561
2271
1258
2943
1309
4146
1052
3561
2779
3553
3581
2049
4411
2430
4388
39504461
3937
4356
2114
4926
1991
857
3648
3527
3216
1249
3719
2345
4573
1325
4730
1049
2074
2721
463
2628
4624
4751
323154
4972
2070
2442
924
1127
4978
1929
2572
4585
354
3921
4366
1714
2162
3186
3864
237231
1917
429
3245
2483
4969
4125
512
507
4358
1986
2145
4462
3268
1941
177782
3902
1126
791
987 3216
1249
3719
2345
4573
1325
4730
1049
2074
2721
463
2628
4624
392154
4972
2070
2442
924
1127
4978
1929
2572
4585
354
4366
1714
2162
3186
4751
3231
3864
2372
412531
1917
429
3245
2483
4969
512
507
4358
1986
2145
4462
3268
1941
177782
3902
1126
791
987
3208
2706
3649
2470
3858
4196
2976
4327
803
4661
130
1050
1255
3818
1969
3481
1487
4049
3827
492
140
3763
2174
4414
1247
1984291
4768
1231
4433
4565
1652
11132380
4408
610
4715
2952
3959
2323
197
1105
4757
1451
3286
1346
4274
4511
3225
1190
9181781 461
8258 3208
2706
3649
2470
3858
4196
2976
4327
803
4661
130
1231
1050
1255
3818
1969
3481
1487
4049
3827
4565
1652
1113
4408
492
140
3763
2174
4414
1247
1984291
4768
4433
2380
610
4715
2952
3959
2323
197
1105
4757
1451
3286
1346
4274
4511
3225
1190
9181781 461
8258
3856
3176
2641
3962
1913
651
3695
4064
3024
2569
265
977
4941
2883
3302
3460
4108
3417
1703
2229
1054
3470
1719
2046
361
3021
1382
2435
3660
4182
1920
4019
1089
4592
1610
4167
3431
3212
3725
4618
3354
2793
2729
3343
4341
2353
3683
4762
3955
3913
102283
2111
3608
302 3856
3176
2641
3962
1913
651
3695
4064
3024
2569
265
977
4941
2883
3302
3460
4108
3417
1703
2229
1054
3470
1719
2046
361
3021
1382
2435
3660
4182
1920
4019
1089
4592
1610
4167
3431
3212
3725
461883
2111
3608
3354
2793
2729
3343
4341
2353
3683
4762
3955
3913
1022
302
4634
1488
2780
4135
1925
542
1613
1611
3966
22842477
4803
4166
101
1571
4466
2634
1244
4678
34742813
2126
2173
3291
2671
3510
3276
2215
2619
4024
1762
2265
1634
2803
1514
1151 307
3558
1292
2685
2209
2356
706
956
4422
1747
3087
2371
2019 1144
4704
3500
182
4877 345 4634
1488
2780
4135
1925
542
1613
1611
4422
17472477
4803
4166
101
3966
2284
3087
2371
2019
1571
4466
2634
1244
4678
34742813
2126
2173
3291
2671
3510
3276
2215
2619
4024
1762
2265
1634
2803
1514
1151 307
3558
1292
2685
2209
2356
706
956 1144
4704
3500
182
4877 345
2553
4060
2394
222
3141
4046
2542
4706
3900
3039
2875
4094
3356
26912776
4162
1848
1601237
1173
4840
2077
4953
4909
3457
1639
4589
3389
4192
18401605
2072
6223496
642
4929
2905
2709
329
3869
3837
3411
2871
1129
1682
912
4898
1201
4163
2693436
4013
4304 973
4737 2553
4060
2394
222
3141
4046
2542
4706
3900
1848
1601
33892776
4162
237
1173
4840
2077
4953
4909
3457
1639
4589
41921605
2072
622
3039
2875
4094
3356
2691
1840
2709
3293496
642
4929
2905
3869
3837
3411
2871
1129
1682
912
4898
1201
4163
2693436
4013
4304 973
4737
3158
1538
2235
3535
2562
4052
4371
3973
1999
2277
4718
1418
4911
4699
2406
30
460
4889
3961
1518
2280 3405
1559
3003
2238692
2698
4974
2555
2184 952
1315
3839
2984
4962
2625
3595
2447
1269
3514851
2997
3588
29377 810
854
760
190112 3158
1538
2235
3535
2562
4052
4371
3973
1999
2277
4718
1418
4911
4699
2406
30
460
2238
4889
3961
1518
2280
2555
2184 3405
1559
3003
692
2698
4974
4962
2625
3595
2447
1269
351 952
1315
760
3839
2984
4851
2997
3588
2937 810
854
7190112
3137
2525
3899
2954
592
1766
2790138
4758
1389
1232
693
576
2474
1884
2673
2591
533
188
2735
2190
1496
3020
3083
670
1768
3242
3292
4657
922
4714
1752
1426
4901
4118
1238
1088
2939
1456404
1905
1552
504
837
3605
4942
3822
4844
2315
2172
4881
4732
617
4875 3137
2525
3899
2954
592
1766
1768
3242138
4758
1389
1232
693
576
2474
1884
2673
2591
533
188
2735
2190
1496
3020
3083
670
3292
4657
922
4714
2790
1752
1426
4901
4118
404
1905
1552
504
837
3605
1238
1088
2939
14564942
3822
4844
2315
2172
4881
4732
617
4875
4183
2645
4203
2064
4600
3409
4451
1888
4238
23162125
1037
2730
1712
2669
4759
2944177
3171
1407
2909
684
115
3694
2606
3712
2958
3388
2087 690
2736
2889
1740
3423
3639
1272
2834
3494
2785
3279
1259
2853
439
2130
1697
3107
2211
3391
2828
4276
1558
3241
927 1600
802 4183
2645
4203
2064
3388
20872125
1037
2730
1712
2669
4759
2944177
3171
1407
2909
684
115
3694
2606
3712
2958
3423
3639
1272
4600
3409
4238
2316 690
2736
2889
1740
2834
3494
2785
3279
4451
18881259
2853
439
2130
1697
3107
2211
3391
2828
4276
1558
3241
927 1600
802
2212
2913
2804
4561
1397
1377
4651
4240
2911
1959
1988 971
903
722
3255
2342
376
28
4934
4476
2924
2176
216
1344
3034
1046
3289
1164
2881
3684
4521
1217
1147
2224
2950
2787
402
3246
3894
332
233
4943
4811
4584
2503
4914
1931
442
3847
4954
1980
3893
1987
4555 321 2212
2913
2804
4561
4934
4476
2924
1217
1147
2224
2950
2787
1397
4240
2911
1959 971
903
722
3255
2342
376
28
2176
216
1344
3034
1046
3289
1164
2881
3684
4521
402
3246
3894
332
233
1377
46514943
4811
1988
4584
2503
4914
1931
442
3847
4954
1980
3893
1987
4555 321
3528
3363
3179
1432
4215
613
4384
1606
1915
1254
3613
2413
2349
4220
2131
1209
2318
1630
2796
2616
3147
3860
4841
4416
4734
1471
1250
2632
490
3788
2428
627
375
4180
4674
4062 3220
3817
16
2788
2188
1464
3994
1919
2670
1161
734
137
3079
4316
2119
3181
1940
849 9 29 3528
3363
3179
1432
4215
613
4384
1606
1915
1254
3613
2413
2349
4220
2131
1209
2318
1630
2796
2616
3147
3860
4841
4416
4734
1471
1250
2632
490
3788
2428
627
375
4180
4674
4062
2788
2188
1464
3994
1919
2670
1161
734
137
3079
4316
2119
31813220
3817
16
1940
849 9 29
1362
3749
2030
714
2821
3482
2054
2438
2915
3983
3559
2449
39753430
1234
118
4560
649
502
1911
1674
4051
3706
2178
1358
1804
1737
2034
4938
633
2982
1835
3320 4
4623
3407
4339
1260
4210
2748
1757
4852
3337
1303
2403
1458
4494
2464 788
1588
906
386
75 464
528 83 1362
3749
2030
714
2821
3482
2054
2438
2915
3983
3559
4560
18043430
1234
118
649
502
1911
1674
4051
3706
2178
1358
1737
2449
3975
2982
1835
3320
3337
1303
2403
1458
4494
24644623
3407
2034
4938
633
4339
1260
4210
2748
1757
4852 788
1588
906
386
475464
528 83
4980
4001
3412
1671
4201
1338
4529
1320
4286
2854
4249
3376
3243
2059
1694
1390
3515
2894
4527
3203
3526
3075
501
2987
2814
3679
2001
4346
1427
4460
3752
1858
4912
494 97
4690
3785
1026
2809
1145
4435
4421
2869
2623469
752
87 7 4980
4001
3412
1671
4201
1338
4529
1320
4286
2854
4249
3376
3243
2059
1694
1390
3515
2894
4527
3203
3526
3075
501
2987
2814
3679
2001
4346
1427
4460
3752
1858
4912
494
4690
3785
1026
2809
1145
4435
4421
2869
262 97
3469
752
87 7
1616
4756
1860
4646
2678
1501
638
4517
4056
2967
4420
1933
3624
4533
1240
4531
3463
1847
926
2558
1327
3914
2104
4129
1930
1744
12263926
48681578
3076
2341
1497
3068
3777
234
932
2509
4448
2335
1219
770
3256
2521
12931103
597
4818
2244
13594888
4793
24554700
3993 1616
4756
1860
4646
2678
1501
638
4517
4056
2967
4420
1933
3624
4533
1240
4531
3463
1847
926
2558
1327
3914
2104
4129
1930
1744
12263926
48681578
3076
2341
1497
3068
3777
234
932
2509
4448
2335
1219
770
3256
2521
12931103
597
4818
2244
13594888
4793
24554700
3993
4722
1540
1018
1936
1869
4537
1004
2236
1387
1572
3642
1718
1983
4744
3480
862629
4279
2431
4029
3666
2446 3451
4526
709
3582
2123
3084
1005
746
4591
4483
3698
2160 435
4439
4375
1806
4250
151
4318
1772
5244544
2773
4853
3270
2716
981
808
4028
201
1180 683
3 715 4722
1540
1018
1936
1869
4537
1004
4029
2236
1387
1572
4318
1772
524
3642
1718
1983
4744
3480
862629
4279
2431
4526
709
3582
3666
24462123
3084
1005
746
4591
4483
3698
2160 3451
435
4439
4375
1806
4250
151
4544
2773
4853
3270
2716
981
808
4028
201
1180 3683715
4038
3369
2996
4380
855
853
2195
1850
3287
3077
2410
4603
1761
4548
2858
1817
1221
3573
2239
3665
1024
4895
1128
4846
2309
2890
2695
1557
4198
1013
3783
3556
1854
3711
3278
4277
1811
4178
1979
873
4272
2014
2757
1318
4319
3007
2334
3037
1348
3229
4563
255
300423
898 4038
3369
2996
4380
855
853
2195
1850
3287
3077
2410
4603
1761
4548
2858
1817
1221
3573
2239
3665
1024
4895
1128
4846
2309
2890
2695
1557
4198
1013
3783
3556
1854
3711
3278
4277
1811
4178
1979
873
4272
2014
2757
1318
4319
3007
2334
3037
1348
3229
4563
255
300 423
898
1801
1473
3312
1716
4644
176
1890
4677
4061
1684
2122
419
3464
2726
3403
2578
3647
1797
3194
1525
3887
4148
595
150
1053
756
256
767
599
545
2988
1659
24441485
1467
3145
2740
4209
916
4232
3845
3454
4126
3849
3943
1008
4018471
676
1624 2141
4072
408720
751 1801
1473
3312
1716
4644
176
1890
4677
4061
1684
2122
419
3464
2726
3403
2578
3647
1797
3194
1525
3887
4148
595
150
1053
756
256
767
599
545
2988
1659
1485
1467
3145
2740
4209
916
4232
2444 471
676
3845
3454
4126
3849
3943
1008
4018
1624 2141
4072
4087 20
751
4787
3201
4417
3275
2496
3963
2227
4628
1837
3280
3577
2657
2544
2204992
480
327
2189
114
938
4534
4233
1350
860
4246
1794
1655
2102
3313
1474 4787
3201
4417
3275
2496
3963
2227
4628
1837
3280
3577
2657
2189
114992
480
327
938
2544
2204
4534
4233
1350
860
4246
1794
1655
2102
3313
1474
Optimal Number of Clusters

fviz_nbclust(BankData_scaled[, c(1, 2, 4, 6)], pam, method = "wss", diss = D.euc) + geom_vline(xintercept = 4, linetype = 2) + labs(subtitle =
"Elbow method")
Optimal number of clusters
Elbow method
35000
Total Within Sum of Square

30000

25000

20000

1 2 3 4 5 6 7 8 9 10
Number of clusters k
Profiling of customers based on the suggested number of clusters as per the above mentioned graph

pam_fit_4 <- pam(D.euc, diss = TRUE, k = 4)

pam_results_4 <- BankData %>% mutate(cluster_id = pam_fit_4$clustering) %>% group_by(cluster_id) %>% do(overall_summary = summary(.))

pam_results_4$overall_summary
[[1]]
Age Income Family.Members CCAvg Education
Min. :23.00 Min. : 8.00 1: 0 Min. :0.000 1:449
1st Qu.:33.00 1st Qu.: 35.00 2: 0 1st Qu.:0.600 2:490
Median :41.00 Median : 54.00 3: 306 Median :1.300 3:416
Mean :42.27 Mean : 57.42 4:1049 Mean :1.465
3rd Qu.:52.00 3rd Qu.: 78.00 3rd Qu.:2.100
Max. :67.00 Max. :182.00 Max. :6.600
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 0.00 0:1284 0:1199 0:1299 0:515 0:972
1st Qu.: 0.00 1: 71 1: 156 1: 56 1:840 1:383
Median : 0.00
Mean : 23.46
3rd Qu.: 0.00
Max. :215.00
Have.Mortgage cluster_id
0:1091 Min. :1
1: 264 1st Qu.:1
Median :1
Mean :1
3rd Qu.:1
Max. :1
[[2]]
Age Income Family.Members CCAvg Education
Min. :23.00 Min. : 8.00 1:755 Min. :0.000 1:524
1st Qu.:41.00 1st Qu.: 25.00 2:577 1st Qu.:0.400 2:531
Median :51.00 Median : 41.00 3:397 Median :1.000 3:677
Mean :49.35 Mean : 45.61 4: 3 Mean :1.081
3rd Qu.:58.00 3rd Qu.: 62.00 3rd Qu.:1.600
Max. :67.00 Max. :141.00 Max. :4.500
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 0.00 0:1715 0:1561 0:1670 0: 656 0:1207
1st Qu.: 0.00 1: 17 1: 171 1: 62 1:1076 1: 525
Median : 0.00
Mean : 17.33
3rd Qu.: 0.00
Max. :192.00
Have.Mortgage cluster_id
0:1454 Min. :2
1: 278 1st Qu.:2
Median :2
Mean :2
3rd Qu.:2
Max. :2

[[3]]
Age Income Family.Members CCAvg Education
Min. :23.00 Min. : 55.0 1:424 Min. : 0.000 1:737
1st Qu.:35.00 1st Qu.:108.0 2:433 1st Qu.: 2.500 2:147
Median :44.00 Median :132.0 3:102 Median : 3.600 3:117
Mean :44.39 Mean :132.5 4: 42 Mean : 3.912
3rd Qu.:54.00 3rd Qu.:158.0 3rd Qu.: 5.400
Max. :67.00 Max. :224.0 Max. :10.000
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 0.00 0:746 0:890 0:895 0:382 0:706
1st Qu.: 0.00 1:255 1:111 1:106 1:619 1:295
Median : 0.00
Mean : 14.38
3rd Qu.: 0.00
Max. :306.00
Have.Mortgage cluster_id
0:903 Min. :3
1: 98 1st Qu.:3
Median :3
Mean :3
3rd Qu.:3
Max. :3
[[4]]
Age Income Family.Members CCAvg Education
Min. :23.00 Min. : 9.00 1:285 Min. :0.000 1:378
1st Qu.:34.00 1st Qu.: 54.00 2:282 1st Qu.:0.900 2:231
Median :42.00 Median : 79.00 3:204 Median :1.800 3:285
Mean :43.22 Mean : 87.12 4:123 Mean :2.115
3rd Qu.:52.00 3rd Qu.:112.00 3rd Qu.:2.700
Max. :67.00 Max. :205.00 Max. :8.800
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 76.0 0:759 0:813 0:818 0:460 0:632
1st Qu.:150.0 1:135 1: 81 1: 76 1:434 1:262
Median :209.0
Mean :229.9
3rd Qu.:284.8
Max. :635.0
Have.Mortgage cluster_id
0: 0 Min. :4
1:894 1st Qu.:4
Median :4
Mean :4
3rd Qu.:4
Max. :4

Assigning of Cluster groups to Observations

groups <- cutree(hclust.euc, k = 4)

> BankData <- cbind(BankData, groups)


> group1 <- BankData %>% filter(groups == 1)
> group2 <- BankData %>% filter(groups == 2)
> group3 <- BankData %>% filter(groups == 3)
> group4 <- BankData %>% filter(groups == 4)

> summary(group1)
Age Income Family.Members CCAvg Education
Min. :23.0 Min. : 8.00 1:1048 Min. :0.000 1:1576
1st Qu.:36.0 1st Qu.: 33.00 2: 934 1st Qu.:0.500 2:1186
Median :46.0 Median : 53.00 3: 892 Median :1.300 3:1219
Mean :45.8 Mean : 59.09 4:1107 Mean :1.351
3rd Qu.:56.0 3rd Qu.: 80.00 3rd Qu.:2.000
Max. :67.0 Max. :204.00 Max. :4.900
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 0.00 0:3880 0:3564 0:3822 0:1610 0:2802
1st Qu.: 0.00 1: 101 1: 417 1: 159 1:2371 1:1179
Median : 0.00
Mean : 38.08
3rd Qu.: 82.00
Max. :383.00
Have.Mortgage groups
0:2916 Min. :1
1:1065 1st Qu.:1
Median :1
Mean :1
3rd Qu.:1
Max. :1
summary(group2)
Age Income Family.Members CCAvg Education
Min. :23.0 Min. : 48.0 1:254 Min. : 1.20 1:322
1st Qu.:34.0 1st Qu.:112.0 2:237 1st Qu.: 3.80 2:149
Median :43.0 Median :138.0 3: 78 Median : 4.90 3:168
Mean :43.2 Mean :137.1 4: 70 Mean : 5.11
3rd Qu.:52.0 3rd Qu.:164.0 3rd Qu.: 6.50
Max. :65.0 Max. :224.0 Max. :10.00
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 0.00 0:367 0:568 0:548 0:245 0:461
1st Qu.: 0.00 1:272 1: 71 1: 91 1:394 1:178
Median : 0.00
Mean : 30.26
3rd Qu.: 0.00
Max. :400.00
Have.Mortgage groups
0:532 Min. :2
1:107 1st Qu.:2
Median :2
Mean :2
3rd Qu.:2
Max. :2
> summary(group3)
Age Income Family.Members CCAvg Education
Min. :24.00 Min. : 39.0 1:119 Min. :0.000 1:96
1st Qu.:35.00 1st Qu.: 78.5 2: 54 1st Qu.:0.800 2:38
Median :44.00 Median : 93.0 3: 17 Median :1.900 3:81
Mean :43.23 Mean :103.6 4: 25 Mean :2.117
3rd Qu.:50.00 3rd Qu.:123.0 3rd Qu.:2.900
Max. :65.00 Max. :195.0 Max. :5.700
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. : 75.0 0:167 0:195 0:189 0: 95 0:145
1st Qu.:186.0 1: 48 1: 20 1: 26 1:120 1: 70
Median :240.0
Mean :238.3
3rd Qu.:300.5
Max. :431.0
Have.Mortgage groups
0: 0 Min. :3
1:215 1st Qu.:3
Median :3
Mean :3
3rd Qu.:3
Max. :3
> summary(group4)
Age Income Family.Members CCAvg Education
Min. :25.00 Min. : 81.0 1:43 Min. :0.000 1:94
1st Qu.:34.50 1st Qu.:128.0 2:67 1st Qu.:1.950 2:26
Median :45.00 Median :153.0 3:22 Median :3.400 3:27
Mean :44.71 Mean :150.9 4:15 Mean :3.858
3rd Qu.:54.00 3rd Qu.:175.0 3rd Qu.:6.050
Max. :65.00 Max. :202.0 Max. :9.000
Mortgage Personal.Loan Securities CD.Account Online.Access CreditCard
Min. :193.0 0:90 0:136 0:123 0:63 0:109
1st Qu.:339.0 1:57 1: 11 1: 24 1:84 1: 38
Median :397.0
Mean :405.2
3rd Qu.:459.5
Max. :635.0
Have.Mortgage groups
0: 0 Min. :4
1:147 1st Qu.:4
Median :4
Mean :4
3rd Qu.:4
Max. :4
Plotting of Clusters:

tsne_obj_4 <- Rtsne(D.euc, is_distance = TRUE)


tsne_data_4 <- tsne_obj_4$Y %>% data.frame() %>% setNames(c("X", "Y")) %>% mutate(cluster = factor(pam_fit_4$clustering))
ggplot(aes(x = X, y = Y), data = tsne_data_4) + geom_point(aes(color = cluster))
60

30

cluster
1
0
Y

2
3
4

-30

-60
-25 0 25 50
X
Interpretation of Results: As per the elbow method, the optimal numbers of clusters came as 4 clusters. Hierarchical clustering groups similar
types of objects into one cluster.

K Means Clustering:

> kclust_4 <- kmeans(BankData_scaled, 4)

> attributes(kclust_4)

$names
[1] "cluster" "centers" "totss" "withinss" "tot.withinss"
[6] "betweenss" "size" "iter" "ifault"

$class
[1] "kmeans"

Aggregation of Clusters

aggregate(BankData[, c(1,2,4,6)], by=list(cluster=kclust_4$cluster), mean)


cluster Age Income CCAvg Mortgage
1 1 42.67176 57.07357 1.433797 46.37634
2 2 37.12032 64.11676 1.363957 42.93137
3 3 44.76667 146.52184 4.673402 120.61149
4 4 57.09158 54.84383 1.255233 38.15699

# Cluster Size

kclust_4$size

[1] 1767 1122 870 1223

# Add the K Means group to the data


BankData <- cbind(BankData, kgroup = kclust_4$cluster)

# Visualizing Clusters

fviz_cluster(kclust_4, BankData[, c(1,2,4,6)])

Cluster plot
24202951
4395
3119 2707 3812
28373784
2456
882 4379
153
2250 223
577
1166
518 44983615
4972
310
680
3896
3146
3414 103
3475
2442 4696
3858
163 4613281
2 4271 4104 3875 4550
591 786 179
1058
88 3970
33833273 4034
1085
643 2969
2093
4697479 3934
2535
2407
141
299914661
342
3135358
374
4552
4779 4465
3367
1174
4566 489
570
3136
20703429
2516
392
3753 4491
3739 154
35274926
4257149 1859 2217 4003
3735
2940 219
47413806
2522608
3458
1681
4351
3073 4883
1713
1886
3448
725 2261
3064
4917 4568
4065
4755 3268
4291
1255
3797
3652
2177
2148
913 4884
2358
2380
36563217
60240883920
4857
3902
162
4855
508 2183
4535
2746
4513
2517
4086 1421
4251
1769
924
1379
46654132138
4097 2849
27794356
1223
866
2264
44921633
2351 4653
2263 3616
2287
4221 2232 1802
1175 1052
1991
31102114
857
1300
1425
4268 2638
7634765
4663 1342 2295 4014 1314 1330
4323
3150 4612
27971015
2482
73 416
2259
167
1922
1478
1370
846
3026
664
724
593
3044
1963
2101
3452
2472
4967
166
2305
2907
601
344
527
84
2675
4924
1838213
2506
4369 160
4528
245
2744
530
1664
4815
3169
2642
1076
3339
1455
568
1383
703
4073
575
30423065
3492
1153
28732039
1746
2378
556897
4327
1231
4982
4693235
1235
4940
3103
4076
4672
2056 4507
4499
1206584
792
2952
4978
3329
4935
4005
3793
4068 1640
3155
4307
3030
512
988
4348
1949
2132
1830
3956
3941106
3190
3153
686
4959
2731
791 1187
1579
3720
2886
2628
4752
1644
21373570
1322
2842
3483
4397
1168
1279
753
1951343
3865
2852
3379
4521086
420
1609
273
478 1615
1019
923
3877
3950
3699
2519
2479
2417
3082
4388
12
54
3576
3439
2721
3442
782
3981
4382
1481
3351
49754146
4643324 548 19534547
2400
158346
4871
2206 4963
3598 180
743 3650
3960 1948 1169
1570 4165
2825
3062 2865
1021
2436
2103
11311095
4125
1184
689
4701
3512 1446
1692
3476
1793
2805
596
1122 259
1943
789
1075 231
4899
3703
1975
3122
4077270
3770
4107
4260
3180
3495 1966
9364036
4501864
446
1917
2020507
3612
996
1050
2357
4488
199
2664
3047 552
1929
46243881
4337
867381
372
429
1470
1009
2451
43023829
2156
4948 2832
3909
4174
1412
2320
4472738
2929 4063
1092
585 2268
2060
3693
131
4804
4440 4768
3561
4433
1413
322833
1819
3191
1492
3741
2179
4117
1738
3910
1648
43542108
1764
3892
47331325
1249
3216
4411
4730
1974
3790
2581
3142
4573
2430
4506
4905
878
3937
457 9021756 1543
1776
1834 2267
4478
3733
2252
2827 353
535
892 1159
4359 3772
11211214
1896
15132165
1984
3143
1268 1218
4896
3531 3568
41713718
3398 957
2006
4956
3696
11701623
3237 1248 2662
2546
4000
3263
2109 1059
2228
2647
3842
505 1309
645
3833
364
3468
382
1083
1878
3508
4017 823
2665
3104 2690
42673827
2082
2943
2377
2242
443 2096
2439
130
410134
396
3029 1126
1002
1735
4617
1048
1324
35722702
3834
4156
3685
3763
354
1973502
20633995
3848
1750
1439
1530 2074
830
2440
1115
865 4349
4979
2278
326
4702
773 2733
4807
2157
1423915
1312217
2301
3178
1778 50
1133
3709
2918
4596
1702 1628
24681686 793
46843557
2269
2887 1495
2570 583
2319
1592 1197
2347 31823267
74
534 2168
2155
37161829
313
3124
2000 1313
1723
4213
1119
1188 540
3467
1430
2043
1198
3252
4588 4969
1887
3378
4579
943
4079735
23624070
1138 2699
4147
1307
465
4556 1319
2667
3215
2656
156
669
1629 408
2750
2073
2145492
5254504
1315
2955
2648 803
1812
987
1107
2515
3381
1356
1393
268
287
4059
1139
2586
2572 4461
2976
3550 3579
620
1012
710
246
2331
476 3078272
1363
3094
1976
3129
4541
4627
2495 164
3518
3151
4367
4446
842
1876 807 532
4812
2396
3188
3581
2813
2049
3648
3553
4845 4823
2917
4048
1617 1985
3730 4703 3831 3935
1672 899
4463
3031020
4443
3942
4219
1134
3487
3506 4647
1537 4173
4423
1136 614
38231276
598
4313
2091
2754
21813059
4285
8711192
2674
389
1264546
1641
124
12293218
3954 3350
2998
2965
1730
3897
1420
3549 2927
4814
3538
2618
2993
3713
370
1483
1612
18142419
3248
1995
48852372
3195
4328
3491
38894462
82
2154
2990
1066 2361
4049
728
587
2639
140
2556
1842 2601
3424
1748
29203898
3821
4515
3245
1125
41
24834358
3523308
1494
1177
835
295 8932246
2812
3747
3816
1113
3818
3422
4227
3662
31002174
1553
624 3882
3802
3186
3162
3676
4386
3986
1415
4595
3358
278 204 4457
3794
4710 1981
4760
2450
17011298
3296
3319
1299
4861
4312
42971662
4708
4143
1205
561
4822 4396
3851
2042
2540 3972
4633205
3719
1531
520 29412050
4920 22961743 4409
1851 3355
2135
2364
2631 3567 4410
37573670 3497
4611 6884363
3867
4027
2904
2404 1236
3863 3813
31344698
579
3873
3131
1824 128
755833
914
887
983
646
2185
31663599
4326
484
174
2425
1355
1289 1770
22
29004581
3364450
3583
3244
1061
1556
1258
38704709
1261
3908
37072505
291
2961
497
4785
2279
43001646
1800
29532908
1120
19181114
4890
4932
521010
4408
4022 565707
4469
1638
3641228289
2484
3109921
2381
4188 985
2365
422
1422
4292
1986
1030
3864
3455
28952898
4196
816521
1380
4325
2162
679
3479 2047
2028
355
1499
4041 3327
3607
3503
3866
1396 4172
4514
3545
1777
1014198
1551
1194
1839
2808
380
52487
870 3840
3516
4150
241
3025
919
3341 3334
2069
3645
901171
34813161
3330
4159 254
3959
24531195 2345
1042
4764
2255
3226
1463 1808
3399
104398 2068
3386
2846 2445
4023
4030
3328 3796
4637
675
1926
2560
4615 3603
2551 3028
4295
4829
2559
1477 1003
4549
2490 369
25853056
2728
696
2655
3174
3760 94132004010
1256
4740
3415
1417
3736362 1333
3121
4686 2147
4
3375
4783
2760 3544
1792
762905
2609
4282
3744
4753
4572
3102
4474
4066
4907
2549499
2776
4608
43214428
2635
3404 3261
1784
2369
2697
1843
3484
331
1271
1517
1335
731
1856
3239
513 1163
6182271
2722
4642
805
1172 4629
4802
2561
939
4754
2352
2015 2077
41003321
1955 1449
1044 845
2775
1941
1167
1063
3669
1329
558
106 667
192
2200
825
1251
3027
821 4769
2026
432
1436
3510
1247 307
22251487
1049
4965
3789 2547
4166
3095
3105
2938
28164437
2745
1 1788
538
4558
1512 635
4810 1237
4788 3602
2604 2966
1650
19611866
2350
1963466
827 2810
3872
2221 2529 1547
4509
1154 4794
46199744283
4820
3776
1862 2321
4971
1211879
2831
2774
3808
1585
4214 276
46504344
2326
759
2197 4361
4580
3590
2817
1099
2297
3187
236315623543
1906
3035
4622
694
1140
44183185
3500
4953
3223 1668
4784
4585
2387
482
4235
4391
4909
4840
1848 3408
1597
4193
4366
3382
2164
9
1408 4599 2115
949549
4212
4632
4565 3587
43734124
4750
3625
410283
702
562
4195
4900
2300
246146024715
3697
3680
1419
424 2594
3775603
1651
1736
1969
1193
4835 2671
1025
338
2737
2215 3787
4780
3291
4904
4571 3917
818
178
4092
4414
4958
4601
94 2499
1262
3659
1841
29483354
3683
651
4032
14821117 2209
1779
1374
2092 3795 395
1710 3478
10703692
2094 3764
4964
2433 4468
4554 2460
3318
1296
2166 4222
2048
3819
23142169
48272738
14722423
1476
3532
2624 1375
2819
4471 3264
4329
2013
1291
39034479
22732481
2478
747
1278 989
3060
3957
139
2956
973
2815
23933521
600998 3649
4084
4345
34994122
1631
1302
2414
1601 2844
885
3117
2882 4659
3922
159
4415
1652
4625
4510 4751
4308
4050
1096
22813303
3231
2075
2292
1895
954 1200
3498
3274
877
3661
930
4775
2701
2432
3208
3566
365 4850
2421599
3687
126
814
368619131763
3276
15221244
3406
3474
43 4678
3714
2161
4939
2110112 4762
1613
17063118
1611
3
956
35294473
447
1220
2088 1571
4105
4587
4142608 3001 29311118
2308
3933 2795
1594
1230 1993
2688
884 1519
333
2807
2626 2765
4946
2694 2257
3974
2327 4866
4773
37591708 4630
294594
2258
4655
4707
566 1568
3971 1725
2285 2991
1189
551
2725 2630
3170
1033 1582
1067
3294
214 1454
2902
3953
832817
2045
4944
182
345
3826 1967
334
12871323
4771
2553
222 1274
2470
4449
3160
4660
2299
2397 1127
3921
3800
2751
580
3524218
2066
47901714
2770
2542
4046
523
2801
3727
2732
2129
3434
1341
498
3537
47061541
3210
49 4447
1304
31
3554
2323
610
4960
536 2689
2493
2173
3658
4713
2371
984
3254
1749
4160 4106
1069
1790
1468
3606 2537
3955
2634
4466
4341
2353
1661
3116
4453
2095
6 15691709
3204
22652685
3362 43313551
4636 169
2847
1156 1457
4343 2643 7182382
2754662
654 3370
4394 125
3723 1849
1394
650 3584
1527
677
2876
3859 3657
4906
4649
3755
4197
202
7951795
26002818
2498
3072
205 4338
869
281
2120
3387
3862
1392
28724190
1593
3141
3900
4489 267
3945
35201589
411
1409
4879
261910
3063
2202
2706 2057
4870
4135937
3636
1721
1968844 1970
4495
3558
1022
1068 108
2251
2111
2312
3856
1441
3393 741
2408
1084 3087
1751
4511 4641
4892706
1634
3373 3635 3173
3416 343 1174891242989
1401
4340
3262
2761 17 2144
3346
934
3149
2646
4306
4080
1647
3310423070
2467
861
1098
1141 312
3575
4263766952266
4719
3311
4121 785
1561
880
3925
4735
1032
1724
516
15493678
2118
72
81 1773
2573
691
1515
3883
4704
4298
615
122
4813
3366
1853
3634
1939
3331
4265
48492354
116
1865
3758
4360
829
3335
1402
4902
3578
4518
155
5063485
1349
3939
3230
4711
1758
1621
1520
1893 4675
3569
1151
4538
1514
2033
2192
4427
4685
2868
421
1245
18
3791
2980 2391
4043
2153
4364
2394
1410
2260
687
3067
1182
1977
582
2167
4847
4749
1507
3096
2031
3202
1011
1882
2079
1080
1898
2888 1111
3982
1094
2650
2826
1435
3596
1148
4519
1433
2590
1064
40784876
4365
2705
2963
1280
4862
4060
3183
4620
3779
3390269
3418
4012
2903
3395
3837
1173
2004
911
4475
4950
127
1479
3071 962
4694
4930
1535
3283
4237
463374 3323
2625
302
632
4083
1191 3913
2823
1295
1545
3046
790
4009
3539
4099
2008
3097
1450
4618
4791
25654274
61
4803 4019
918
3343
16603663
4886
3033
2554
3695
265
4792
2504
863
1747
2555
62
3140
1946
91
1178
3431
32122356
3444
4766101
3966
1089
2019
1093
2284
4544
2659
4422
58 2182
1213
488 3809
55 2237
3542
2921 1057
2329
2599294400 986
21983127
2848
4161 38 3936 47
3284
2799 4266
13913617
3050 639
3878 69 4746
2332
2613
2985
4490
3843
3316 4830
1038 3871
2543
4593266
293
2919 1006
554
3457
4497
1158
3947
3938104
1932
10284123
4865
1691
3322 85
2124
3654
3737
4115
3928
4910
1676
212 3111
3605
3952240
3199
1836
3353
4186
3133
4284 2151
1516
107
3364
1863
3359
2040
3835 2539
3477
3517
4181
4127
12634880
4243
2443
761
1567
1162
2323637
1498
4040
1765
3384 3164
3924
4424
3428
4185
3717
4903 8103671
3348
2041
4575
2641
3722
350
2315
1227 2411
3176
4342
1428
1603 2729
2010
2142
2793
1825
1288
44701703
838
4931 1
2386
208
378 2447
33604252
1920
3413
4322
3725
3825
1781 3172
3225
4592
704
3099 2619
1951
2836
1241
2348
1183 3997
3324 171
4795
1821 2935
947
6973562293 1903 1109
470 2424 2845
3815 3257
1803
3814
2916
4645 4272859
2376
2696
4837 337
3085
35853333
7571722
3236 2163
426
3251
4259
3626
4859 563
616
1062
909
4053
2491
2718 3967
3731
39
7642473
3004
3710
2107
4638965
3855
2607341
3282
1827
2589
4621
586859
1135
4716
4090
1210
2666
2233
2112
4652
22314434
589
1384
1965
4928
4138
942
2611
779 3895
2240
4162
2550
4542
4177
2939
4459
462
704 4949
695
1925
1357
4728
105
3890 280
215496
4781
831 485
1269
2896
3574
2324
3713213
2
625
1523539
4937
1755
118125 286
3417
38032934
4089
260
2477
3608 3595
1382
1610
2213
385
2126
4836
4029
3824 3317
2778
851
1914 3706 cluster
13472328 4211 2037
1892
23104182548 4736
19094270
6303357 2703
41371789 4454
531
71 1994
4231
473
4176
4035 3886
2901
23831992
1656
1505
3271 1689 4481 2457
2881212
3159
840
715
3437 3247
2317
1459
4679
2684
4175
407
1942
1807
3841
2958
3051 1731
43701809
4801
1045
3533
14142184501
621
3397
4145
2325
172
4562
1958 2824
1956
4919
13341488
2850
4393
1336
3756
3998
31681389
40154445
4732 4875
1658
2558
1798
2905 617
3962 2027
4923
4024
1982
1715
44963493
2002
2937
76
83
843 1699
17622588
3114
1707 780 2528 874
2781
4590 776297 1759
4293
3043 1732
4828
2679 1110
2867
721
4289
737
3002 1901
1971 401
3745
637
509 20073235
1112
22032098
1872
144
1947 3930
2343
2981
35014134
611 1871
2682
23119903589
572
3778
3167
3688
4071
3968 1305
657
4116
3620
6932541
4576
1091 3869
1544
1618
995
2448
35361282
3701
4229
1429
4737
4614 4228
467
4503
3112 3588
2275
1216
4167
502
4606
3021
4051
1521
4255 794
Dim2 (24.8%)

2437
910 19 4242
4976454 2488
384
2692 11852370
148
417 2568 2418
4878
3113
3621
224136514155
1783
1398 3445
44
1665
3365
1155 3619
316
9532899
2012
4806
2855
976
4800
27 4362
4377
49453307
3857
2822 110 1116
4200
1828
3234 2829
415
1605
4095
227 2116
1962
1832
3614
1243
758 2621
4163
3058
4258 3712
4539
3295
2614
2401
904
1639
2256
2117
3999
3473 982
685 576671452
4763
2922
3032
4553
80
1129
3987
2294 373
23
47584256
3627
2709 4782
2780
4444
4634 1054
1327
2254
3891
2172
898
433241922270
2435
2564
1385
3421
709
4203
4487 441
3591
4933
3991
4640
4383799
434
701
2184
2405
453
45514523
819
3868
3838
3302 464
3165
2879
2710
190 1406
2289
4383
3233
2862 1190
2803
4385
299
2759 4639 4842 1695778 4894
2531 3018 4093
4374
4353
2576 3054
2587 671 578
2018
3139
2086 4609238
4658
4403 3377
4916
4130 3980
405
1438
25763
3175
150914651619
2248
3005
2084
1897
3432
2276
896 3905
2974
1607
1667
491
4776
1444 3443
249
3106
4136
1916
3189
1990
1339
2538
2303
2180 541
1564
2359
1998
1782
1587 1208
32901696
4091
4893
2245
3023 1456
1361
1766
3901 147
12382219
3653
4786
4635
681
4877
515
3304
237
86
984778
4530
1805
4508
35
4419 3411
3914
4936
41994589
4094
2584
3505
2474
3084
592229
3389
3314
329
3066
4742
4927 4512
1575
2511
19874277183
1453
3138
3698
2046
4108
203
670
3091
3463
2870
2771 1964
4182
3667
3749
2883
3163
258
2514
3342
850
3300
4064
3927 1144
3926 a
0 379233013884
45693207
3746
1160
723
1687
2838 224
1996
2100 2368 3022
4169 3689
2636
193
458 1157
964
12947774664
1754
3349 3911
4153
4613
590 3846
3534 13
3297
4516
8883427
10553644
4081 1663
2863
3640
3410 682
2610
1137
1337
4254 4103
3344
1225
4583 1552
3015
1626
1786
732
2220229
2552
4729
135
36391728
4031
4431
2925225
4731
3977
4816 1934
668
4335
2170
142
442
4304
726
3069
837 271
3211
2064 2193
1165
542 622
4898
2024
4832
1087
3847
2691 746
3822
495
493
1840
2875
30391176
1201 3571
1822
642
3460 3798
1073
2191
1005
4841 4531
449
1354
2392
46071260
274
3941
4438
1811 165
1649
926
1292
2861
4591
4941
3132
361
972
1889
3137 1533
649
1152
3660
3024
3963
4484 4074
3984
4727
1346
1720977
8
391 2536
4744 1
3192 1643 1489
2458
1365 2222 603768
3668 2513
9781813
4564 966
4717
634 3345
1466
3839 1203
1745
3293 623
2717
1945
263
3081
3876 2979
4426
3546
4016 2459
1927
1351
2149
3120
2485
2596 4854
3496
4456
4799189
1566708
9551912
2127
3019
806
1950
209
1740 716
4536
3309
2806 1209
4759
37002413
49523238
4220
636
4480
2627 2676
490
1854
605
1037
3270
4013
4869
4240
3548
2190 3618
2741
4770
1548
37623854
3356
2318
2757 1685
4119
1451
811
3152
4929
4720
445
3684623
500
2700 1944
1847
2438
261
4310
256 1149
4582
4412
4400
2227
557
3541
3777
1331
1039
1884
4844 379
2030
3592
2623 3480
3888
1345
1036
1885
946
1690
6662912
40611078
207
2830 2977 1815 4226 1573
42623530
1001
20294139 3198
644
33723259
4044 1368
4294
2957 1060
1130 1328 2983
2339
2526487
285 4429
4140 1202
2016
2501
1447
3949
404
1150 3489
4170
839
925
1027 1100
4966
9123832
2575
813
45572693
1242
1868
226
42991108
4522
529
7541538
2913
1272
4111
2629
1905
4376
4477 750 2785
2035
49704968
3622
2772
4796
4811
4651
321
2503
3473128 2645
3642
1718
2992
1105
1508
3092
181
4705
1734
547430 638
2784
4092072
2857
2911
3524
3893
971 436
3227
2062
2530
20762714
4901 4657
3682
4352
32321983
2360
1845553
652
3214
1316
855
1240
119
42364178
3083
3385 4761
4198
1381
1387
2160
3298
4204
4355
1899
2525 78
2186 3904
1772
1719
37822452
2569
4055
2341
1578
3470
340
4183 574
1252
2734
4318
23841317
1399168
46771711
4767
1497
4526
1074
4209
2090
68
2726
4082 a 2
2923 3560800 4253
3677
3628 928 21871657 2792 1000
769
2592 1952
4654875
1097
3157
3340 2306 1490
3260
1462
1311 46813089
4378
1957
660
2698 4567
1972
4288 3031
3240
3646
1581
20801637
847
1534 4604
2398
4667
4442
2494
2672
1844
29241239
132 1739
2125
3729
3672 3279
4230
4109
3767
2214
510
3638 997
2527 952
1318
1308690
1088
1937
4934
3299
1082
4559161
2764 3811
2970
588752
3750 2735
3711
2602
4756
483
3197
2131
412
7
1767
2591
355670 4895
799
3783
38602426
4047
1503
30202673
2349
348
2742
4843 4390
4533
3280
4416
2616
2796 2593
2821
466
2790
24442773
528
29151496
4860
12324247
3983
2236
48681744
4502
2466
3559
862 4483
1226
3482631
1572
1796175
1673
393
1911
4206969
3145
1371
2051524118
3052
2928683
3464
2140 4244 1440
571
3009 2936 4075 184 979
4947 2962
4543
2843 3919 3222
647 4974
3552
4851
2388
36114207
4112 1426
4301
4817
796
3061
2139
42391228
3702
1560
448
221
3519
4319
864 1935
2804
4273
1445
3724
879 1301
787
2055
4584
659
2176
4401
2532
3732
24153402
1988
475
1596
1491
4537
2885
325
1888 282
3278
2036
3774 4881
1959
387
4238114
2152
3726
152
2632 1506
2178
2429
3147
4616
1768
138188
2446
2422
756 1233
2612
797
1979
4038
2083 562122
4224 2580733
2385 1881
4330 1016
284
2766 2480
2841
2005
772 2978
1620 678
191
1072
503
1504
3447349 2307
889
2762
560
1826
1388
4577
765 3315
1090
89 4039
2622
3265
2282
648
96
1857
1353
145
3781
4287
2975
2653
243
77
4962
1771 1705
1065
146
3177
3401
1306
1595
4413
1536
3833392
1823
425
2469
822
456
3488
4215
1775
960
2889
3041
569
4149
3525
3090
3446
2910
2752
2274
3773
4275
92342
61 211
186
37
2253
320
4688
2523
2959
9514324
809
236
1101
2121
3486
4132
5043000
2159
3704
2871
1143 7841416
376
4320
4942
4834
2489
3179
12221275
2465
3844
4863
3101
20992524
1900
2205
802
1752
3804
4208
66
2736
2891
3388
3853
4673
3221
1040
2471
3305
4808
2932
177
1071
1310
1475
1377
4540173
2230
2921860
3394
2133
1883
2366
301
559
4451
4714
2960
1029
4025
4821
3425
3528663
3093
2878
2330
3885
1870 28
2421
4646
216
23462298
1978
3586
4524
4747
4839
749 1669
3932
4102
2286
1350
1799
4026
4493
3899398
783
4309
2316
3249
1004
2798
4264
1034
1682
1013
1894
4874
2768
3965
2782 812
4578
4853
2949
3184
4666
3743
567
3565
2724
2972
9501697
3016
883
2877
220
916
1132
428
3144
2933
3074 4560
2695
2683
2954
2522
4439
19214057
2794
2520
3563 3994
748
517
16302755
2216
3292
1598
3631
935
1674
767
388
4272
3123 2753
3068
1742
2967
4002
4757
2678
2720
3436
20
1930
4129
917
2890
1616
2054
419
194
1716
1831
2104841
894
1837
4505
770
1338
2740932
2897
3076
2811
234740
3286
2034788 a 3
1270 9452373 2583 27041583 1486
306 1542 4464
1395
1590 2441
53
1031 581 774
4724 2984 1500
2081 3196
1989
305748
1785 447657
594
672
4922
2210
1938
3555
613
2337
573
1712 1693
3158619
4889
940
2663
895 3643
4021
2087
2730
339
3681
3916
277
15803363
4961
4687
2986
40 4128
45
21
3836
21301907
4605
1003324045
1931
38071362
1787
3255
2906
3115
32
3242
511 922
1501
2136
906
959
49142786
4246
4486938
3313
3407
2003
359
891 2106
628
2272
4210
305
2687
3426
4033
1654
150
3738
1554
3993
3272 1684
4250
599
2247
351
19403287
592
3513
3396
366
4500
2712
42052367
4333
3624
278878175
386
3515
4628676
2864
11421511698
210
1875
319
1816
1780
2758 2603
1007
4245 4682 1874
4110
3036
3601 1123 34382893
397
1670
4848 834
626
22832194
2756
4482
2713
1124
311 2518
15763045
1461
4838
2997
4593
604 2968
3610
2507
2727
876
4555
2534
2065
2234
3130
4943
1484
3269
722
2973
4574 2389
2545
48192944
4118
4441
658
1378 3861
1602
47981046
1502
1636
2866
201
1700
2510
2134
1017
3990
3423
2113
3034
200
440
4725
4570
1186
4772970
3923
2708
255
2881
4157
4787
134
3008
3951
4951
719 714
2719
414
1284
1794
3086
1397
4872
4954
2563 2201
742
133
3613
1474
4131
640
2606
2723
1056
14
1577
4042
4957
2212
4248
1820
2880
1980
3326
1434
2171
4303
3786
4058
3352
2014
120 595
4375
4671
2892
170
3721
1586
4371
3077
3915
1217
4311
47771147
1432
4545
1683
2243
886 1919
4018
1528
1221
2828
927
4631
3194
2224
4144
1460
2017
1443 4911
136
2749
4598
975
2304
4141
3715
1529
38284460
2395
1024
712
1525
28202189
4644
3462
4380
1539 137
1680
4432
3273347
4676
4774
2914 1128
2288
4712
494
1890
3666
1485
533 4315
2123
3514
4056
4517
948
2854
4674
2011 545
1053
4249
1473 a 4
2032 2067 27112208 24921424
1294856 3604
477 4011
70020581632
2557
1257
4234
4925
991
4633 1833
3471
2971
2574
4915
64 318
26601286
3053
1081
36
2399 3976
39784668
4334
3674944
856335
34404241
4973
3740
3380 34504347
4350
3630
3623
4691
1344
3246
2355
4680
2567
3880
3011
4189
3901367
4020
994 4918
3673
4722
1146
3171
32883850
1340 2249
2767
711
1869
4168
3655
2577
3547
903
2833
1760
266
4028 102
2235
367
1253
406
39061403
4797
2950
3098
1604
4867
3007
290 4368
1936
2022
2463
1232102
3224
4387
3887
3771853
3040
1008 4563
2670
486 1103
4726
1358
1902
471 992
3312
3509
4232
4864
2403
16
323
544
4072
2455 1512509
4690
2571
1726
2434
2988
1973
3946
2244
1051
993 3325
4793
1026
44204803526
4062
2188
1801 3647
3075
42863243
2344
3126
3337
12071797
1891
2059
-1 13329082078 3690
1043 4133 143
4739
3728
469 2769
1591
35074805 3361
4372
4202 4467
3038
1437 2262
1666
437 187
4037
2340 3308
744 4223
4069
455
3006 111
4882
244 848
4831 2617
403
3258
4738 1645
3632
3664
36754085
3433
4217
4194 1373
1366
4154
3830607
26693694
4809
1290 2964
1791
3894
2280
1855
1518
3684
4087
3201
3989
3013
4955
2840
2322
4721 826
2605
4921
1246 684
4054
1540444
1546 873
4826
1608
736
815
1655
2218
3080860
4187 4114
2508
157
4858
824
4745
2942
302001
674
4873
2475 543
1625
3451
1804
3256
2858
16353079
4548
2204
2946627
3412
2578
14673403
734
4148
99279
2521 1933
1698
4529
4448
2406
929
7203785
4339
1526
1458
49123203
1320
2987
4007
699 3441
4692
1737
3220
4001
1303
2023
2428
4179
4527
633 3975176
1297
564 3435 2777
1077 3562
2783
3472 3918
3453 3338
3912
3504
2681 433
4191
729
481 713
3940
2637
1285
1622
4586
3609 512615
4120
1550
19044670
3594
3048
3751 2085
2044
1405
3148
304 1023
1204
2053
3748
1211
3988
739
3371 3522
4689
4610
1407
115
980
1717
2462 3780
2313 2597
4700
3275
3125
2238
3761
1283
1326
1480
2926
3332 1606
4521
25822375
233
19152334
1196
4225
3985
2800
4425 1267
3948
2105 230
2061
771
1041 2791
1677
4417
32894824
849
2500
2566
854 1671
1565
3943
109
3001161
1386
1954
4789
1817
1180
17292449
2141
1179
4316
3241
4648 3577
4669
1924
4126
375
2409
3679 1624
2748
2814
4520
1359
2309 1219
550
858
309 1659
1215
2657
12934603
49384180
1250 3752
1047
2809
4980
1145
3376
4852
3931 247 40087984004 1532
2994
1642 2654
4723 828 958
4404
2175
4006 2677
1852 963
2146
6733852
18612302
3820
3277 1867
20972661
3171524
1372
4261
4833 1364
3580 4455
4269
968
2196
3742999
1442
2620 3633
4436
34494485
4532
3003
4430 655
4384
1678
872
1254
1584
4561
2686
653
2909 2884
967
431
79 804
4421
262
2649
514
3540 3845
3055
1558555
2374
4305
1653
1557
3629
1035
1164
34692544
4407
3593
2454
629
4846 868
3209
3801
2595
751
2869
2464 4888
1234 2290
1761
2533
3419
1688 4357
2379
3788
26807453430
3969
5014494
4201
17572894
4825298 2226
609
2512 1199 2839
3461 26683285
1818 2038
47484402
3250 3929
1448
2009
2579 2715
328
1873
357
330
7302874
18802207
4336
264 2497
4908
1960
4683
3961
3600 1559
3964
2834
2644
641
4399 836
4096
4977
3979
2402
7171265
3107
3229
537
3193801
3494
1259
526
19083108
4695 4021555
1563
3582
3391
1348
808
2562
3219
399
27872481369
981
4546 3665
439
3454
2211820
30371997
2716423
2277
87
1427 2143
2119890
597
3849
1360
3879
2853 1464 2195
931
3564
4534
3817
4981
20713573 2633
1510
2239
1846
4405
2996 1858
4233
3992
768
1390
3181 1806
314
1694 1835
2982
2802 3810 2390
4656 1104
4314 2856
3049 3010 4392 2502
2223 4164 2851
2598
1493
11 1741
1810
924897
3490
1877 4184
16274067
3306
3336
3958
315113
3368 2416
1018 3734 612
1999
2410
2486 3369
4734 472
43812338
3766
253
662 1588
97
2743 1928 933 3154
881 3465
4296 4626 1727
2052 852 30142089
1411
3754
2150 322
656
4216 26522427
907
2025
2651
1224
2476
1102
1675
2158377
4913
4389
3705
4113
93
3805
1431
606
3765
4290
705
661
2747
3253 152835
1418
2947
4458
4718
3409
4600
3156
3691
4098
2291
16003405
4279
1753
435
3511 3944
4452
2496
901
4276
1079 727
4743
363
760
1733451
4052
3017
3535
3973
2431
1471
4346 900
2335
4818
4152 4435
3320
177444062995 3459 2412
4151 185
2021
2930
250 3206
1679 920 519 1923 4597
34202199
4281
2640 4699
251 1850
775 360 2394317 4278 1574 3708
4887 2789
1277
2658 3012
30881376 692
2333
2739 460
4280399613211266 1404 140065 2860 3874 2128 4158
-2 474296 1469
206
4525 2336665 1281
3456 1273
3769 522
2763 3597 1614 3400
2945 3907 1352
468

-6 -4 -2 0 2
Dim1 (43.1%)

clusplot(BankData_scaled,kclust_4$cluster,color=TRUE,lable=2,lines=1)

CLUSPLOT( BankData_scaled )
4
2
Component 2

0
-2
-4

-8 -6 -4 -2 0 2

Component 1
These two components explain 33.6 % of the point variability.
3. Build appropriate models on both the test and train data  (CART & Random Forest). Interpret all the model
outputs and do the necessary modifications wherever eligible (such as pruning)

Splitting of Data into Train and Test:

> set.seed(100)

> index <- sample.split(BankData$Personal.Loan, SplitRatio = 0.7)

> CART_train <- subset(BankData, index == T)

> CART_test <- subset(BankData, index == F)

> dim(CART_train)

[1] 3488 12

> dim(CART_test)

[1] 1494 12

Building of CART Model and Plotting of Tree.

> tree=rpart(formula=Personal.Loan~.,data=CART_train,method="class",minbucket=10,cp=0)

> rpart.plot((tree))

0
0.10
100%

yes Income < 102 no

0 0
0.01 0.36
76% 24%

CCAvg < 3 Education = 1

0 0 1
0.22 0.11 0.80
4% 15% 9%

CD.Account = 0 Family.Members = 1,2 Income < 117

0 1 0
0.18 0.86 0.33
4% 2% 3%

Income < 93 Income < 114 CCAvg < 2.5

0
0.39
1%

CCAvg >= 4.3

0 0 0 1 1 0 0 1 0 1 1
0.00 0.11 0.00 0.60 0.75 0.00 0.40 1.00 0.13 0.76 1.00
72% 3% 0% 1% 0% 13% 0% 1% 2% 1% 6%
Printing and plotting of CP Tree:

> printcp(tree)

Classification tree:
rpart(formula = Personal.Loan ~ ., data = CART_train, method = "class",
minbucket = 10, cp = 0)

Variables actually used in tree construction:


[1] CCAvg CD.Account Education Family.Members Income

Root node error: 335/3488 = 0.096044

n= 3488

CP nsplit rel error xerror xstd


1 0.2746269 0 1.00000 1.00000 0.051946
2 0.1373134 2 0.45075 0.45970 0.036217
3 0.0925373 3 0.31343 0.32537 0.030674
4 0.0447761 4 0.22090 0.24179 0.026552
5 0.0089552 5 0.17612 0.21194 0.024895
6 0.0059701 8 0.14925 0.19104 0.023660
7 0.0000000 10 0.13731 0.18209 0.023109

> plotcp(tree)

E
size of tree

1 3 4 5 6 9 11
1.0
X-val Relative Error

0.8
0.6
0.4
0.2

Inf 0.19 0.11 0.064 0.02 0.0073 0

cp

After looking at the above CP table it is understood that the minimum xerror is at end node, so pruning is not required.

Using of the CART Tree for Prediction for Train and Test Data

> CART_train$CART.Pred = predict(tree, data = CART_train, type = "class")

> CART_train$CART.Score = predict(tree, data = CART_train, type = "prob")[, "1"]

> CART_test$CART.Pred = predict(tree, CART_test, type = "class")

> CART_test$CART.Score = predict(tree, CART_test, type = "prob")[, "1"]


Building of Random Forecast Model:

> set.seed(100)

> indexrf <- sample.split(BankData$Personal.Loan, SplitRatio = 0.7)

> RF_train <- subset(BankData, index == T)

> RF_test <- subset(BankData, index == F)

> dim(RF_train)

[1] 3488 12

> dim(RF_test)

[1] 1494 12

Rndforest = randomForest(Personal.Loan ~., data = RF_train, ntree = 501, mtry = 10, nodesize = 10, importance=TRUE)

> print(Rndforest)

Call:
randomForest(formula = Personal.Loan ~ ., data = RF_train, ntree = 501, mtry = 10, nodesize = 10, importance = TRUE)
Type of random forest: classification
Number of trees: 501
No. of variables tried at each split: 10

OOB estimate of error rate: 1.52%


Confusion matrix:
0 1 class.error
0 3139 14 0.004440216
1 39 296 0.116417910

plot(Rndforest)

Rndforest
0.00 0.02 0.04 0.06 0.08 0.10 0.12
Error

0 100 200 300 400 500

trees
plot(Rndforest$err.rate)

0.011
0.009
0

0.007
0.005

0.015 0.016 0.017 0.018 0.019 0.020

OOB

Tuning of Random Forest model to get the best mtry

set.seed(100)

> tRndForest=tuneRF(x=RF_train[,-c(7)],y=RF_train$Personal.Loan,mtryStart = 4,stepFactor = 1.5,ntreeTry = 51,improve =


0.0001,nodesize=10,trace=TRUE,plot=TRUE,doBest = TRUE,importance=TRUE)

mtry = 4 OOB error = 1.58%


Searching left ...
mtry = 3 OOB error = 1.49%
0.05454545 1e-04
mtry = 2 OOB error = 2.52%
-0.6923077 1e-04
Searching right ...
mtry = 6 OOB error = 1.35%
0.09615385 1e-04
mtry = 9 OOB error = 1.55%
-0.1489362 1e-04
0.022
OOB Error

0.018
0.014

2 3 4 6 9

mtry

Based on the above OOB error rate and above graph, it is clearly evident that the best number of mtry is 6.

Building of Refined Random Forest Model

> Rndforest = randomForest(Personal.Loan ~., data = RF_train, ntree = 350, mtry = 6, nodesize = 10,importance=TRUE)
> print(Rndforest)

Call:
randomForest(formula = Personal.Loan ~ ., data = RF_train, ntree = 350, mtry = 6, nodesize = 10, importance = TRUE)
Type of random forest: classification
Number of trees: 350
No. of variables tried at each split: 6

OOB estimate of error rate: 1.35%


Confusion matrix:
0 1 class.error
0 3144 9 0.002854424
1 38 297 0.113432836
plot(Rndforest)

Rndforest
0.15
0.10
Error

0.05
0.00

0 50 100 150 200 250 300 350

trees

Prediction on Train and Test data based on refined random forest

RF_train$RF.Pred = predict(Rndforest, data = RF_train, type = "class")


> RF_train$RF.Score = predict(Rndforest, data = RF_train, type = "prob")[, "1"]
> RF_test$RF.Pred = predict(Rndforest, RF_test, type = "class")
> RF_test$RF.Score = predict(Rndforest, RF_test, type = "prob")[, "1"]
4. Check the performance of all the models that you have built (test and train). Use all the model
performance measures you have learned so far. Share your remarks on which model performs the best .

Performance Evaluation for CART:

# Confusion matrix

CART_CM_train = table(CART_train$Personal.Loan, CART_train$CART.Pred)


> CART_CM_test = table(CART_test$Personal.Loan, CART_test$CART.Pred)

> ## Error Rate

> cart_tr_cm_err <- (CART_CM_train[1, 2] + CART_CM_train[2, 1]) / nrow(CART_train)


> cart_tst_cm_err <- (CART_CM_test[1, 2] + CART_CM_test[2, 1]) / nrow(CART_test)

> ##Accuracy

> cart_tr_cm_acc <- (CART_CM_train[1, 1] + CART_CM_train[2, 2]) / nrow(CART_train)


> cart_tst_cm_acc <- (CART_CM_test[1, 1] + CART_CM_test[2, 2]) / nrow(CART_test)

ROC Curve and AUC Value

> predobjtrain = prediction(CART_train$CART.Score, CART_train$Personal.Loan)


> preftrain = performance(predobjtrain, "tpr", "fpr")

plot(preftrain, main = "ROC Curve for Train Data (CART)")

ROC Curve for Train Data (CART)


1.0
0.8
True positive rate

0.6
0.4
0.2
0.0

0.0 0.2 0.4 0.6 0.8 1.0

False positive rate


> predobjtest = prediction(CART_test$CART.Score, CART_test$Personal.Loan)
> preftest = performance(predobjtest, "tpr", "fpr")
> plot(preftest, main = "ROC Curve for Test Data (CART)")

ROC Curve for Test Data (CART)


1.0
0.8
True positive rate

0.6
0.4
0.2
0.0

0.0 0.2 0.4 0.6 0.8 1.0

False positive rate

> auctrain = performance(predobjtrain, "auc")


> cart_tr_auc <- as.numeric(auctrain@y.values)
> auctest = performance(predobjtest, "auc")
> cart_tst_auc <- as.numeric(auctest@y.values)

> #KS Value

> cart_tr_ks <- max(preftrain@y.values[[1]] - preftrain@x.values[[1]])


> cart_tst_ks <- max(preftest@y.values[[1]] - preftest@x.values[[1]])
>
> #Gini Value

> cart_tr_gini <- ineq(CART_train$CART.Score, "gini")


> cart_tst_gini <- ineq(CART_test$CART.Score, "gini")
>
> #Concordance Table

> cart_tr_con <- Concordance(actuals = CART_train$Personal.Loan, predictedScores = CART_train$CART.Score)


> cart_tst_con <- Concordance(actuals = CART_test$Personal.Loan, predictedScores = CART_test$CART.Score)
Performance Measure Parameters (Random Forest)

>
> ###Confusion Matrix
>
> ## Random Forest Model Confusion Matrix
>
> RF_CM_train = table(RF_train$Personal.Loan, RF_train$RF.Pred)
>
> RF_CM_test = table(RF_test$Personal.Loan, RF_test$RF.Pred)
>
> ## Error Rate
>
> RF_tr_cm_err <- (RF_CM_train[1, 2] + RF_CM_train[2, 1]) / nrow(RF_train)
> RF_tst_cm_err <- (RF_CM_test[1, 2] + RF_CM_test[2, 1]) / nrow(RF_test)
>
> ##Accuracy
>
> RF_tr_cm_acc <- (RF_CM_train[1, 1] + RF_CM_train[2, 2]) / nrow(RF_train)
> RF_tst_cm_acc <- (RF_CM_test[1, 1] + RF_CM_test[2, 2]) / nrow(RF_test)
>
> #ROC Curve and AUC Value
>
> predobjtrain = prediction(RF_train$RF.Score, RF_train$Personal.Loan)
> preftrain = performance(predobjtrain, "tpr", "fpr")
>
> plot(preftrain, main = "ROC Curve for Train Data (Random Forest)")

ROC Curve for Train Data (Random Forest)


1.0
0.8
True positive rate

0.6
0.4
0.2
0.0

0.0 0.2 0.4 0.6 0.8 1.0

False positive rate


ROC Curve for Test Data

> predobjtest = prediction(RF_test$RF.Score, RF_test$Personal.Loan)


> preftest = performance(predobjtest, "tpr", "fpr")
> plot(preftest, main = "ROC Curve for Test Data (Random Forest)")

ROC Curve for Test Data (Random Forest)


1.0
0.8
True positive rate

0.6
0.4
0.2
0.0

0.0 0.2 0.4 0.6 0.8 1.0

False positive rate

> auctrain = performance(predobjtrain, "auc")


> RF_tr_auc <- as.numeric(auctrain@y.values)
> auctest = performance(predobjtest, "auc")
> RF_tst_auc <- as.numeric(auctest@y.values)
>
> #KS Value
>
> RF_tr_ks <- max(preftrain@y.values[[1]] - preftrain@x.values[[1]])
> RF_tst_ks <- max(preftest@y.values[[1]] - preftest@x.values[[1]])
>
> #Gini Value
>
> RF_tr_gini <- ineq(RF_train$RF.Score, "gini")
> RF_tst_gini <- ineq(RF_test$RF.Score, "gini")
>
> #Concordance Table
>
> RF_tr_con <- Concordance(actuals = RF_train$Personal.Loan, predictedScores = RF_train$RF.Score)
> RF_tst_con <- Concordance(actuals = RF_test$Personal.Loan, predictedScores = RF_test$RF.Score)

Combining all the evaluation results

> result <- list(Model = c('CART', 'CART', 'Random Forest', 'Random Forest'), Dataset = c('Train','Test'))

> result <- as.data.frame(result)

> result <- list(Model = c('CART', 'CART', 'Random Forest', 'Random Forest'), Dataset = c('Train', 'Test', 'Train', 'Test'), Classification.Error =
c(cart_tr_cm_err, cart_tst_cm_err, RF_tr_cm_err, RF_tst_cm_err), Classification.Accuracy = c(cart_tr_cm_acc, cart_tst_cm_acc, RF_tr_cm_acc,
RF_tst_cm_acc), AUC = c(cart_tr_auc, cart_tst_auc, RF_tr_auc, RF_tst_auc), KS = c(cart_tr_ks, cart_tst_ks, RF_tr_ks, RF_tst_ks), Gini =
c(cart_tr_gini, cart_tst_gini, RF_tr_gini, RF_tst_gini), Concordance = c(cart_tr_con$Concordance, cart_tst_con$Concordance,
RF_tr_con$Concordance, RF_tst_con$Concordance), Discordance = c(cart_tr_con$Discordance, cart_tst_con$Discordance,
RF_tr_con$Discordance, RF_tst_con$Discordance), Tied = c(cart_tr_con$Tied, cart_tst_con$Tied, RF_tr_con$Tied, RF_tst_con$Tied), Pairs =
c(cart_tr_con$Pairs, cart_tst_con$Pairs, RF_tr_con$Pairs, RF_tst_con$Pairs))
Comparing Training Data from Both Models

> result <- as.data.frame(result)

> print(result[c(1,3), ])

Model Dataset Classification.Error Classification.Accuracy AUC


1 CART Train 0.01318807 0.9868119 0.9969349
3 Random Forest Train 0.01347477 0.9865252 0.9976081
KS Gini Concordance Discordance Tied Pairs
1 0.9385272 0.8984150 0.9956014 0.004398559 1.301043e-17 1056255
3 0.9540679 0.8980679 0.9976047 0.002395255 -5.074066e-17 1056255

Comparing Test Data from Both Models

> print(result[c(2,4), ])

Model Dataset Classification.Error Classification.Accuracy AUC


2 CART Test 0.01338688 0.9866131 0.9847950
4 Random Forest Test 0.01338688 0.9866131 0.9982401
KS Gini Concordance Discordance Tied Pairs
2 0.9188894 0.8950450 0.9753304 0.024669631 2.775558e-17 193193
4 0.9626591 0.8926719 0.9982090 0.001790955 -4.163336e-17 193193

After comparing the results of both training and test data, it is understood that our models are good because there is no much difference in
accuracy or performance rate of each model when comparing training data Vs Test Data.

Comparing Train & Test for CART

> print(result[1:2, ])

Model Dataset Classification.Error Classification.Accuracy AUC KS


1 CART Train 0.01318807 0.9868119 0.9969349 0.9385272
2 CART Test 0.01338688 0.9866131 0.9847950 0.9188894
Gini Concordance Discordance Tied Pairs
1 0.898415 0.9956014 0.004398559 1.301043e-17 1056255
2 0.895045 0.9753304 0.024669631 2.775558e-17 193193

Comparing Train & Test for Random forest


print(result[3:4, ])

Model Dataset Classification.Error Classification.Accuracy AUC


3 Random Forest Train 0.01347477 0.9865252 0.9976081
4 Random Forest Test 0.01338688 0.9866131 0.9982401
KS Gini Concordance Discordance Tied Pairs
3 0.9540679 0.8980679 0.9976047 0.002395255 -5.074066e-17 1056255
4 0.9626591 0.8926719 0.9982090 0.001790955 -4.163336e-17 193193
Comparing everything together

print(result)

Model Dataset Classification.Error Classification.Accuracy AUC


1 CART Train 0.01318807 0.9868119 0.9969349
2 CART Test 0.01338688 0.9866131 0.9847950
3 Random Forest Train 0.01347477 0.9865252 0.9976081
4 Random Forest Test 0.01338688 0.9866131 0.9982401
KS Gini Concordance Discordance Tied Pairs
1 0.9385272 0.8984150 0.9956014 0.004398559 1.301043e-17 1056255
2 0.9188894 0.8950450 0.9753304 0.024669631 2.775558e-17 193193
3 0.9540679 0.8980679 0.9976047 0.002395255 -5.074066e-17 1056255
4 0.9626591 0.8926719 0.9982090 0.001790955 -4.163336e-17 193193

After checking above results Random forest model is giving better accuracy under maximum evaluation models for test data. i.e for Confusion
matrix CART-98.66% Vs Random Forest-98.66%, AUC CART-98.47% Vs Random Forest-99,82%, KS CART-91.88% Vs Random Forest-96.26%, GIN-
89.50% Vs Random Forest-89.27% and Concordance CART-97.5% Vs Random Forest-99.82%.

Random Forest Model performs the best on both training and test data.

You might also like