
Week-3: Clustering: An Introduction and Analysis

Sherry Thomas (21f3001449), Vivek Sivaramakrishnan (21f2000045)

Contents

Introduction to Clustering
K-means Clustering (Lloyd’s Algorithm)
    The Algorithm
Convergence of K-means Algorithm
Nature of Clusters Produced by K-means
Smart Initialization - K-means++
Choice of K
Acknowledgments

Abstract
The week commences with an introduction to the concept of clustering and a comprehensive examination of the K-means algorithm, a central technique within the topic. The week also examines the limitations of the K-means approach and offers potential remedies for them.

Introduction to Clustering
Clustering represents an essential method in unsupervised machine learning aimed at grouping similar objects into clusters, thereby revealing inherent structures within the data for exploratory analysis or serving as a preprocessing step for subsequent algorithms.
Our primary objective is to partition a set of 𝑛 datapoints into 𝑘 clusters.
Notation:
$$\mathbf{X} = \{\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n\}, \qquad \mathbf{x}_i \in \mathbb{R}^d$$
$$S = \{\mathbf{z} : \mathbf{z} \in \{1, 2, \ldots, k\}^n\}$$

$$\mu_k = \frac{\sum_{i=1}^{n} \mathbf{x}_i \cdot \mathbb{1}(z_i = k)}{\sum_{i=1}^{n} \mathbb{1}(z_i = k)}$$

Where:
• $\mathbf{x}_i$ denotes the $i^{th}$ datapoint.
• $z_i$ denotes the cluster indicator of $\mathbf{x}_i$.
• $\mu_{z_i}$ denotes the mean of the cluster with indicator $z_i$.
• $S$ represents the set of all possible cluster assignments. It is important to note that $S$ is finite, with $|S| = k^n$.
Goal:
$$\min_{\mathbf{z} \in S} \sum_{i=1}^{n} \|\mathbf{x}_i - \mu_{z_i}\|^2$$

However, solving this optimization problem exactly is NP-hard, which necessitates considering alternative approaches that approximate its solution under practical computational constraints.
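
To make the notation concrete, here is a minimal Python sketch (using NumPy; the function and variable names are illustrative, not from the lecture) that computes the cluster means $\mu_k$ and evaluates the objective for a given assignment $\mathbf{z}$:

```python
import numpy as np

def cluster_means(X, z, k):
    """Compute mu_k for each cluster, as defined above.

    X: (n, d) array of datapoints; z: (n,) array of cluster
    indicators in {0, ..., k-1} (0-indexed for convenience).
    Assumes every cluster is non-empty.
    """
    return np.stack([X[z == j].mean(axis=0) for j in range(k)])

def kmeans_objective(X, z, k):
    """Sum of squared distances from each point to its cluster mean."""
    mu = cluster_means(X, z, k)
    return np.sum(np.linalg.norm(X - mu[z], axis=1) ** 2)

# Example: six points in R^2 with a hand-picked assignment into k = 2 clusters.
X = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
              [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]])
z = np.array([0, 0, 0, 1, 1, 1])
print(kmeans_objective(X, z, k=2))
```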

K-means Clustering (Lloyd’s Algorithm)


Lloyd’s Algorithm, popularly known as the k-means algorithm, offers a widely used and straightforward clustering method that partitions a dataset into $k$ predetermined clusters by iteratively assigning each point to its nearest cluster centroid and recomputing the centroids as cluster means.

The Algorithm
The algorithm proceeds as follows:
Step 1: Initialization: Randomly pick $k$ datapoints from the dataset as the initial cluster centers.
Step 2: Reassignment Step:
$$z_i^t = \underset{k}{\arg\min}\ \|\mathbf{x}_i - \mu_k^t\|_2^2 \quad \forall i$$

Step 3: Compute Means:


$$\mu_k^{t+1} = \frac{\sum_{i=1}^{n} \mathbf{x}_i \cdot \mathbb{1}(z_i^t = k)}{\sum_{i=1}^{n} \mathbb{1}(z_i^t = k)} \quad \forall k$$

Step 4: Loop until Convergence: Repeat steps 2 and 3 until the cluster
assignments do not change.
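
A compact Python sketch of these four steps (NumPy only; the identifier lloyds_kmeans and the other names are our own, not from the lecture; cluster labels are 0-indexed):

```python
import numpy as np

def lloyds_kmeans(X, k, rng=np.random.default_rng(0)):
    """Lloyd's algorithm: returns (assignments z, centroids mu)."""
    n = X.shape[0]
    # Step 1: pick k distinct datapoints as the initial centers.
    mu = X[rng.choice(n, size=k, replace=False)].astype(float)
    z = np.full(n, -1)
    while True:
        # Step 2: reassign each point to its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - mu[None, :, :], axis=2)
        z_new = dists.argmin(axis=1)
        # Step 4: stop once the assignments no longer change.
        if np.array_equal(z_new, z):
            return z, mu
        z = z_new
        # Step 3: recompute each centroid as the mean of its cluster.
        for j in range(k):
            if np.any(z == j):  # guard against an empty cluster
                mu[j] = X[z == j].mean(axis=0)
```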

Convergence of K-means Algorithm
The convergence of the K-means algorithm with respect to the objective is established by considering the following points:
• The set of all possible cluster assignments $S$ is finite.
• The objective function value strictly decreases with every iteration of Lloyd’s Algorithm in which the assignment changes:

$$F(z_1^{t+1}, z_2^{t+1}, \ldots, z_n^{t+1}) < F(z_1^t, z_2^t, \ldots, z_n^t)$$

Moreover, a smarter initialization of K-means can improve the likelihood of converging to a good cluster assignment in fewer iterations. Although the final assignment may not represent the global optimum of the objective function, it is practically satisfactory, as the objective function strictly decreases with each reassignment. Since the number of possible assignments is finite ($k^n$) and no assignment can recur once the objective has decreased past it, convergence of the algorithm is guaranteed.
Alternate Explanation: The convergence of the K-means algorithm is ascertained by its iterative nature: each step minimizes the sum of squared distances between points and their cluster centroids, and for a fixed assignment this sum is a convex function of the means, minimized exactly by the cluster averages. Under mild assumptions regarding the initial cluster means, the algorithm reaches its convergence point, making it a dependable tool for clustering.
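
This monotone decrease is easy to observe empirically. The standalone sketch below (synthetic two-blob data; all names are illustrative) records the objective after every full iteration of Lloyd’s algorithm; the printed trace never increases:

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic data: two well-separated Gaussian blobs of 100 points each.
X = np.vstack([rng.normal(-4.0, 1.0, size=(100, 2)),
               rng.normal(+4.0, 1.0, size=(100, 2))])

k = 2
mu = X[rng.choice(len(X), size=k, replace=False)].copy()
z = np.full(len(X), -1)
trace = []
while True:
    z_new = np.linalg.norm(X[:, None, :] - mu[None, :, :], axis=2).argmin(axis=1)
    if np.array_equal(z_new, z):
        break
    z = z_new
    for j in range(k):
        if np.any(z == j):
            mu[j] = X[z == j].mean(axis=0)
    # Objective after each reassignment + mean update.
    trace.append(float(np.sum((X - mu[z]) ** 2)))

print(trace)  # non-increasing; strictly decreasing while assignments change
```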

Nature of Clusters Produced by K-means


Let 𝜇1 and 𝜇2 denote the centroids of the clusters 𝐶1 and 𝐶2 , respectively.
For 𝐶1 ,
$$\|\mathbf{x} - \mu_1\|^2 \le \|\mathbf{x} - \mu_2\|^2$$
$$\implies \|\mathbf{x}\|^2 - 2\mathbf{x}^T\mu_1 + \|\mu_1\|^2 \le \|\mathbf{x}\|^2 - 2\mathbf{x}^T\mu_2 + \|\mu_2\|^2$$
$$\therefore\ \mathbf{x}^T(\mu_2 - \mu_1) \le \frac{\|\mu_2\|^2 - \|\mu_1\|^2}{2} \quad \forall \mathbf{x} \in C_1$$

The above inequality takes the form $\mathbf{x}^T\mathbf{w} \le c$, which defines a half-space bounded by a linear separator. Each cluster region is therefore an intersection of half-spaces, and the resulting partition of the space is commonly known as a Voronoi Partition.
However, the standard k-means algorithm may not perform effectively when the underlying clusters in the dataset possess a non-linear structure. In such cases, alternative methods like Kernel K-means or Spectral Clustering can be employed to enhance clustering accuracy. Nonetheless, a detailed exploration of these methods is beyond the scope of this course.
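
The half-space characterization can be checked numerically. In the sketch below (made-up centroids, purely illustrative), the nearest-centroid rule agrees with the linear test $\mathbf{x}^T(\mu_2 - \mu_1) \le (\|\mu_2\|^2 - \|\mu_1\|^2)/2$ on random points:

```python
import numpy as np

rng = np.random.default_rng(0)
mu1, mu2 = np.array([0.0, 0.0]), np.array([3.0, 1.0])  # illustrative centroids

for _ in range(1000):
    x = rng.normal(scale=3.0, size=2)
    nearer_mu1 = np.sum((x - mu1) ** 2) <= np.sum((x - mu2) ** 2)
    # Equivalent linear (half-space) test derived above:
    in_halfspace = x @ (mu2 - mu1) <= (np.sum(mu2**2) - np.sum(mu1**2)) / 2
    assert nearer_mu1 == in_halfspace
print("nearest-centroid rule matches the half-space test on 1000 points")
```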

Figure 1: Voronoi Partition

Smart Initialization - K-means++


The idea behind K-means++ is to select initial centroids that are spread far apart from one another.
• Step 1: Randomly select $\mu_1^0$ from the dataset.
• Step 2: For $l \in \{2, 3, \ldots, k\}$, choose $\mu_l^0$ from the dataset with probability proportional to the score $S(\mathbf{x}_i)$, where $S$ is defined as follows:

$$S(\mathbf{x}_i) = \min_{j \in \{1, 2, \ldots, l-1\}} \|\mathbf{x}_i - \mu_j^0\|^2 \quad \forall \mathbf{x}_i \in \mathbf{X}$$

The probabilistic aspect of the algorithm provides an approximation guarantee, in expectation, on the quality of the resulting K-means solution. The guarantee is given by:

$$\mathbb{E}\left[\sum_{i=1}^{n} \|\mathbf{x}_i - \mu_{z_i}\|^2\right] \le O(\log k) \left[\min_{\{z_1, z_2, \ldots, z_n\}} \sum_{i=1}^{n} \|\mathbf{x}_i - \mu_{z_i}\|^2\right]$$

where $O(\log k)$ denotes a constant of order $\log k$.


• Step 3: Once the centroids are determined, we proceed with our usual
Lloyd’s Algorithm.
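
A minimal Python sketch of this seeding procedure (NumPy; the identifiers are our own): the score $S(\mathbf{x}_i)$ is the squared distance to the nearest centroid chosen so far, and the next centroid is sampled with probability proportional to it:

```python
import numpy as np

def kmeans_pp_init(X, k, rng=np.random.default_rng(0)):
    """K-means++ seeding: returns a (k, d) array of initial centroids."""
    n = X.shape[0]
    # Step 1: pick the first centroid uniformly at random from the dataset.
    centroids = [X[rng.integers(n)]]
    for _ in range(1, k):
        # Score S(x_i): squared distance to the nearest centroid so far.
        d2 = np.min([np.sum((X - c) ** 2, axis=1) for c in centroids], axis=0)
        # Step 2: sample the next centroid proportionally to the score.
        centroids.append(X[rng.choice(n, p=d2 / d2.sum())])
    # Step 3: these become the initial means for Lloyd's Algorithm.
    return np.stack(centroids)
```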

Choice of K
A prerequisite for K-means is determining the number of clusters, denoted as 𝑘.
However, what if the value of 𝑘 is unknown?
If we were to choose $k$ equal to $n$, every datapoint could serve as its own cluster center, and the objective would collapse to zero:

$$F(z_1, z_2, \ldots, z_n) = \sum_{i=1}^{n} \|\mathbf{x}_i - \mu_{z_i}\|^2 = 0$$

However, having as many clusters as datapoints is undesirable, so we minimize the objective while penalizing large values of $k$:

$$\underset{k}{\arg\min} \left[\sum_{i=1}^{n} \|\mathbf{x}_i - \mu_{z_i}\|^2 + \text{Penalty}(k)\right]$$

Two common criteria for choosing such a penalty are:
• Akaike Information Criterion: $\left[2K - 2\ln(\mathcal{L}(\hat{\theta}^*))\right]$
• Bayesian Information Criterion: $\left[K\ln(n) - 2\ln(\mathcal{L}(\hat{\theta}^*))\right]$
However, detailed elaboration of these criteria is beyond the scope of this course.
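
As a simple illustration of the penalized objective above (not of AIC/BIC themselves, which require a likelihood model), the sketch below sweeps over $k$ and picks the value minimizing objective $+$ Penalty($k$). It assumes the lloyds_kmeans sketch from earlier is in scope, and the linear penalty with weight penalty_weight is a made-up choice for demonstration:

```python
import numpy as np

def choose_k(X, k_max, penalty_weight=50.0):
    """Pick k minimizing objective(k) + penalty_weight * k (illustrative)."""
    scores = {}
    for k in range(1, k_max + 1):
        z, mu = lloyds_kmeans(X, k)  # sketch defined earlier in these notes
        objective = np.sum(np.linalg.norm(X - mu[z], axis=1) ** 2)
        scores[k] = objective + penalty_weight * k  # Penalty(k) = lambda * k
    best_k = min(scores, key=scores.get)
    return best_k, scores
```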

Acknowledgments
Professor Arun Rajkumar: The content, including the concepts and nota-
tions presented in this document, has been sourced from his slides and lectures.
His expertise and educational materials have greatly contributed to the devel-
opment of this document.
