Professional Documents
Culture Documents
CLUSTERING
The goal is that the objects within a group be similar (or related) to one
another and different from (or unrelated to) the objects in other groups.
The greater the similarity (or homogeneity) within a group and the greater
the difference between groups, the better or more distinct the clustering"
BTW - The words "groups" and "clusters" mean the same thing.
https://www.daveondata.com/tdwi-nashville-2023 2
K-MEANS CLUSTERING
Introducing K-Means
https://www.daveondata.com/tdwi-nashville-2023 3
K-MEANS CLUSTERING
A Contrived Example
https://www.daveondata.com/tdwi-nashville-2023 4
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 5
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 6
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 7
K-MEANS CLUSTERING
Centroids are moved based on the average values of the data points within
the cluster.
https://www.daveondata.com/tdwi-nashville-2023 8
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 9
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 10
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 11
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 12
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 13
K-MEANS CLUSTERING
The k-means algorithm stops when no data points are assigned to a new
cluster.
https://www.daveondata.com/tdwi-nashville-2023 14
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 15
K-MEANS CLUSTERING
https://www.daveondata.com/tdwi-nashville-2023 16