Professional Documents
Culture Documents
CLUSTERING
CONCEPT
5 CLUSTERS
CONCEPT
4 CLUSTERS
3 CLUSTERS
CONCEPT
2 CLUSTERS
1 CLUSTER
Dendrogram
A binary tree that shows how clusters are
merged/split hierarchically
Each node on the tree is a cluster; each leaf node is a
singleton cluster
10
Dendrogram
A clustering of the data objects is obtained by
cutting the dendrogram at the desired level, then
each connected component forms a cluster
11
Dendrogram
A clustering of the data objects is obtained by
cutting the dendrogram at the desired level, then
each connected component forms a cluster
12
How to Merge Clusters?
How to measure the distance between clusters?
Single-link
Complete-link
Distance?
Average-link
Centroid distance
13
How to Define Inter-Cluster Distance
mi,mj are the means
of Ci, Cj,
17
Hierarchical Clustering: Comparison
Single-link Complete-link
5
1 4 1
3
2 5
5 5
2 1 2
2 3 6 3 6
3
1
4 4
4
Agglomerative approach
Initialization:
Each object is a cluster
Iteration:
a ab Merge two clusters which are
b abcde most similar to each other;
Until all objects are merged
c
cde into a single cluster
d
de
e
19
Hierarchical Clustering
20
CONCEPT- FINAL DENDOGRAM
21
CONCEPT
• We don't want clusters with distances
greater than 3. so that leaves us with only the 3
• Then we draw a threshold line at 3 clusters below the threshold line
22
CONCEPT
23
OPTIMAL NUMBER OF CLUSTERS
24
THRESHOLD
younger customers that don’t spend much, older customers with less
spending also, and the mid-segment that spends a lot
25