K-Means Clustering Implementation Guide

The document discusses implementing K-Means clustering. It shows how to identify clusters in 1D and 2D data using scikit-learn KMeans. It generates scatter plots to visualize clustering for different numbers of clusters on randomly generated data.

Uploaded by

Arslan Mansoori

EXPERIMENT 9

Aim: Implementation of K-Means Clustering

COURSE OUTCOMES

CO4 Evaluate machine learning model’s performance and apply learning strategy to
improve the performance of supervised and unsupervised learning model.

CO5 Develop a suitable model for supervised and unsupervised learning algorithm and
optimize the model on the expected accuracy.

K Means Clustering
In this model, data is divided into clusters by assigning each point to the cluster whose mean (centroid) is nearest to it.
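The nearest-mean idea can be sketched in plain NumPy. This is a minimal illustration, not the scikit-learn implementation: the function name kmeans_sketch, the random choice of starting centroids, and the fixed iteration cap are all assumptions made for this example.

```python
import numpy as np

def kmeans_sketch(points, k, n_iter=100, seed=0):
    """Hypothetical helper: assign points to the nearest mean, then recompute the means."""
    rng = np.random.default_rng(seed)
    points = np.asarray(points, dtype=float)
    # Start from k distinct data points chosen at random (an assumption of this sketch).
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared distance from every point to every centroid, shape (n_points, k).
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # New centroid = mean of its assigned points; keep the old one if a cluster is empty.
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

data = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
                 91, 92, 93, 94, 95, 96, 97, 98, 99, 100]).reshape(-1, 1)
labels, centroids = kmeans_sketch(data, k=2)
print(labels)
print(np.sort(centroids.ravel()))
```

On this data the two means settle at 5.5 and 95.5, the averages of the low and high groups. Which group receives label 0 depends on the random start.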
1. Identify 2 groups in 1D Array
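from sklearn.cluster import KMeans
import numpy as np

data = np.array([1,2,3,4,5,6,7,8,9,10,91,92,93,94,95,96,97,98,99,100])

# KMeans expects a 2D array of shape (n_samples, n_features), so reshape the 1D data
kmeans = KMeans(n_clusters=2).fit(data.reshape(-1,1))
kmeans.predict(data.reshape(-1,1))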
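The 1D array is passed through reshape(-1,1) because scikit-learn estimators expect a 2D input with one row per sample and one column per feature. A small sketch of what that reshape does (the variable name column is only illustrative):

```python
import numpy as np

data = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
print(data.shape)            # (10,)

# -1 lets NumPy infer the number of rows; 1 fixes a single feature column.
column = data.reshape(-1, 1)
print(column.shape)          # (10, 1)
print(column[:3].tolist())   # [[1], [2], [3]]
```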

1. Identify 5 groups in 1D Array

from sklearn.cluster import KMeans
import numpy as np
data = np.array([101, 107, 106, 199, 204, 205, 207, 306, 310, 312, 312, 314, 317, 318, 380, 377,
                 379, 382, 466, 469, 471, 472, 557, 559, 562, 566, 569])

# Reshape to (n_samples, 1) because KMeans expects 2D input
kmeans = KMeans(n_clusters=5).fit(data.reshape(-1,1))
kmeans.predict(data.reshape(-1,1))

2. Identify 2 groups in 2 D Array

from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)
# Predict the cluster of new points that were not in the training data
kmeans.predict([[0, 0], [12, 3]])
kmeans.predict([[11,11], [8, 9]])
kmeans.predict([[2,20], [4, 4]])
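Explanation:
The training points are:
1 2
1 4
1 0
10 2
10 4
10 0
The first three points (x = 1) form one cluster and the last three (x = 10) form the other.
The answer for the first predict call is [1,0]:
[0,0] is assigned to cluster 1 (the x = 1 group)
[12,3] is assigned to cluster 0 (the x = 10 group)
The label numbers themselves are arbitrary; what matters is which points share a label.

Similarly, [11,11] and [8,9] are both nearest to the x = 10 group, so the result must be [0,0].

And [2,20] and [4,4] are both nearest to the x = 1 group, so the result must be [1,1].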
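These predictions can be checked by hand, since predicting a new point only means finding the nearest centroid. The sketch below uses plain NumPy rather than scikit-learn. The helper name nearest_centroid is hypothetical, and the centroid order is chosen only to match the label numbers quoted above.

```python
import numpy as np

X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])

# Centroids are the means of the two groups. Index 0 is given to the x = 10
# group only to match the labels quoted above; the numbering is arbitrary.
centroids = np.array([X[3:].mean(axis=0), X[:3].mean(axis=0)])

def nearest_centroid(queries):
    """Return the index of the closest centroid for each query point."""
    queries = np.asarray(queries, dtype=float)
    dists = ((queries[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

print(centroids.tolist())                              # [[10.0, 2.0], [1.0, 2.0]]
print(nearest_centroid([[0, 0], [12, 3]]).tolist())    # [1, 0]
print(nearest_centroid([[11, 11], [8, 9]]).tolist())   # [0, 0]
print(nearest_centroid([[2, 20], [4, 4]]).tolist())    # [1, 1]
```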

3. Plotting K means cluster for 2D Group for 2 Clusters

from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
kmeans = KMeans(n_clusters=2, random_state=0)
# fit_predict fits the model and returns the cluster label of each training point
y_predict = kmeans.fit_predict(X)
#kmeans.predict([[0, 0], [12, 3]])

import matplotlib.pyplot as mtp

# points of the first cluster (label 0)
mtp.scatter(X[y_predict == 0, 0], X[y_predict == 0, 1], s = 100, c = 'blue', label = 'Cluster 1')

# points of the second cluster (label 1)
mtp.scatter(X[y_predict == 1, 0], X[y_predict == 1, 1], s = 100, c = 'green', label = 'Cluster 2')
mtp.xlim(0,10)
mtp.ylim(0,10)
mtp.legend()  # display the 'Cluster 1' / 'Cluster 2' labels
mtp.show()

4. Plot a scatter Chart for 300 random numbers

%matplotlib inline
import matplotlib.pyplot as plt
import seaborn as sns; sns.set() # for plot styling
import numpy as np
from sklearn.datasets import make_blobs
X, y_true = make_blobs(n_samples=300, centers=4,
                       cluster_std=0.60, random_state=0)
plt.scatter(X[:, 0], X[:, 1], s=50);
# scatter() plots one dot for each observation. It needs two arrays of the same length,
# one for the x-axis values and one for the y-axis values.
# In X[:, 0], the : takes every element along that dimension (all rows); 0 selects the first column.
# s sets the size of the marker.
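The column slicing described in these comments can be tried on a tiny array. This is an illustrative sketch with made-up values; the labels array is hypothetical and only shows how the same indexing combines with a boolean mask to pick out one cluster's points.

```python
import numpy as np

X = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])

xs = X[:, 0]   # all rows, column 0 -> x-axis values
ys = X[:, 1]   # all rows, column 1 -> y-axis values
print(xs.tolist())   # [1.0, 3.0, 5.0]
print(ys.tolist())   # [2.0, 4.0, 6.0]

# A boolean mask in the row position keeps only the matching rows.
labels = np.array([0, 1, 0])
print(X[labels == 0, 0].tolist())   # [1.0, 5.0]
```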

Looking at this chart, we can pick out 4 distinct clusters by eye.
The k-means algorithm finds them automatically, and in Scikit-Learn it follows the usual estimator API:

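from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=4)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)
# colour each point by its predicted cluster
plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')
# overlay the cluster centres as large, semi-transparent black dots
centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);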

5. Plot a scatter chart for the same 300 points, increasing the number of clusters to 5
from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=5)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);

Figure 30: 5 Clusters

6. Plot a scatter chart for the same 300 points, increasing the number of clusters to 6

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=6)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);

Figure 31: 6 Clusters

Similarly do the same for 7 Clusters and 8 Clusters
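Repeating the fit for several cluster counts is also the basis of the elbow method listed in the viva questions: record the within-cluster sum of squared distances for each k (scikit-learn exposes this as the inertia_ attribute of a fitted KMeans) and look for the k where the curve stops dropping sharply. The sketch below is a rough plain-NumPy illustration under stated assumptions: lloyd_inertia is a hypothetical helper using a single random start, and the four blob centres are made up as a stand-in for the make_blobs data.

```python
import numpy as np

def lloyd_inertia(points, k, n_iter=50, seed=0):
    """Hypothetical helper: run a basic k-means loop and return the final inertia."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Recompute each mean; keep the old centroid if its cluster is empty.
        centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
    # Inertia: sum of squared distances from each point to its nearest centroid.
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).sum()

# Synthetic stand-in for the 300-point data set: four Gaussian blobs
# (centres and spread are assumptions for this example).
rng = np.random.default_rng(0)
blob_centres = np.array([[0, 0], [6, 0], [0, 6], [6, 6]], dtype=float)
X = np.vstack([c + 0.6 * rng.standard_normal((75, 2)) for c in blob_centres])

inertias = {k: lloyd_inertia(X, k) for k in range(1, 9)}
for k, value in inertias.items():
    print(k, round(value, 1))
```

With well-separated groups the inertia typically falls steeply up to the natural number of clusters and flattens afterwards. Because this sketch uses one random start per k, an individual run can land in a poorer local optimum, which is why library implementations usually try several initialisations.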

Figure 32: 7 Clusters

Figure 33: 12 Clusters

Viva Questions
1. What is the main difference between k-Means and k-Nearest Neighbours?
2. How is Entropy used as a Clustering Validation Measure?
3. How to determine k using the Elbow Method?
4. What is the difference between Classical k-Means and Spherical k-Means?
5. What is the difference between k-Means and k-Medians and when would you use one
over another?
