Welcome to Scribd!

Skip carousel

Simple Case Study of Implementing K Means Clustering On The IRIS Dataset

Uploaded by

gargwork1990

0% found this document useful (0 votes)

17 views4 pages

Code and text on case study based on K means clustering approach for Machine Learning

Original Title

K Means Cluster

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Code and text on case study based on K means clustering approach for Machine Learning

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

17 views4 pages

Simple Case Study of Implementing K Means Clustering On The IRIS Dataset

Uploaded by

gargwork1990

Code and text on case study based on K means clustering approach for Machine Learning

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

Simple Case Study of Implementing K Means Clustering

on the IRIS Dataset

Import tools and libraries:

from time import time

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

from sklearn import metrics

from sklearn.cluster import KMeans

from sklearn.datasets import load_digits

from sklearn.decomposition import PCA

from sklearn.preprocessing import scale

Create a function to extract a cluster with k labels:

def get_cluster_metric(y_train, km_labels_):

print("Homogeneity: %0.3f" % metrics.homogeneity_score(y_train, km_labels_))

print("Completeness: %0.3f" % metrics.completeness_score(y_train, km_labels_))

print("V-measure: %0.3f" % metrics.v_measure_score(y_train, km_labels_))

print()

Generate hypothetical data for practice:

np.random.seed(42)

digits = load_digits()

data = scale(digits.data)

n_samples, n_features = data.shape

n_digits = len(np.unique(digits.target))

labels = digits.target

sample_size = 300

print("n_digits: %d, \t n_samples %d, \t n_features %d"

% (n_digits, n_samples, n_features))

Output:

n_digits: 10, n_samples 1797, n_features 64

labels.shape

Output: (1797, )

Loading the inbuilt IRIS dataset in Python:

from sklearn.datasets import load_iris

Algorithm to extract the clusters and compute sum of squared errors:

y = labels

sse = {}

accuracy = []

for k in range(1, 20):

kmeans = KMeans(n_clusters=k, max_iter=1000).fit(data)

sse[k] = kmeans.inertia_ # Inertia: Sum of distances of samples to their closest cluster center

labels_pred = kmeans.labels_

# print(labels_pred.shape)

# check how many of the samples were correctly labeled

correct_labels = sum(labels == labels_pred)

accuracy.append(correct_labels/float(y.size))

# print("Result: %d out of %d samples were correctly labeled. when k = %d " % (correct_labels,

y.size,k))
print("correct %.02f percent classification at k = %d" % (correct_labels/float(y.size) * 100 ,k))

get_cluster_metric(y, kmeans.labels_)

Visualisation:
#No. of clusters v/s SSE

plt.figure()

plt.plot(list(sse.keys()), list(sse.values()))

plt.xlabel("Number of cluster")

plt.ylabel("SSE")

plt.show()

#No. of clusters v/s accuracy

plt.figure()

plt.plot(range(1, 20,1),accuracy)

plt.xlabel("Number of cluster")

plt.ylabel("accuracy")

plt.show()

Machine Learning Hands-On
Document18 pages
Machine Learning Hands-On
Vivek JD
100% (1)
Machine Learning LAB: Practical-1
Document24 pages
Machine Learning LAB: Practical-1
Tsering Jhakree
100% (1)
Machine Learning With SQL
Document12 pages
Machine Learning With SQL
prince krish
100% (1)
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
ML2 Practical List
Document80 pages
ML2 Practical List
Yash Amin
No ratings yet
21BEC505 Exp2
Document7 pages
21BEC505 Exp2
jay
No ratings yet
ML Lab Programs
Document23 pages
ML Lab Programs
Roopa 18-19-36
No ratings yet
Soft Sensor Code
Document4 pages
Soft Sensor Code
Marvin Martins
No ratings yet
Soft Sensor Code
Document4 pages
Soft Sensor Code
Marvin Martins
No ratings yet
Exp 4
Document10 pages
Exp 4
jay
No ratings yet
2.3 Aiml Rishit
Document7 pages
2.3 Aiml Rishit
heex.pros
No ratings yet
Machine
Document45 pages
Machine
Gagan Sharma
100% (1)
DM Slip Solutions
Document24 pages
DM Slip Solutions
09.Khadija Gharatkar
100% (1)
ML Ex8
Document2 pages
ML Ex8
yefigoh133
No ratings yet
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
Document22 pages
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
Ryan anak Gaybristi
No ratings yet
Tous Les Algo de ML
Document7 pages
Tous Les Algo de ML
Jadlaoui Asma
No ratings yet
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Document1 page
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Tayub khan.A
No ratings yet
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
Document7 pages
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
Kartik Katekar
No ratings yet
Classification Algorithm Python Code 1567761638
Document4 pages
Classification Algorithm Python Code 1567761638
Awanit Kumar
No ratings yet
Assignment #1: K Nearest Neighbor Classifier: Name: Srikanth Mujjiga (Roll No: 2015-50-831
Document8 pages
Assignment #1: K Nearest Neighbor Classifier: Name: Srikanth Mujjiga (Roll No: 2015-50-831
srikanth.mujjiga
No ratings yet
Is Lab Aman Agarwal PDF
Document8 pages
Is Lab Aman Agarwal PDF
Aman Bansal
No ratings yet
Pythonfile
Document36 pages
Pythonfile
collection58209
No ratings yet
Mlaifile1 3
Document27 pages
Mlaifile1 3
Krishna kumar
No ratings yet
K Means
Document4 pages
K Means
mohamed mohsen
No ratings yet
Assignment 4 On Clustering Techniques
Document2 pages
Assignment 4 On Clustering Techniques
06–Yash Bhusal
No ratings yet
Numpy NP Sklearn - Cluster Sklearn Sklearn - Datasets Sklearn - Preprocessing
Document1 page
Numpy NP Sklearn - Cluster Sklearn Sklearn - Datasets Sklearn - Preprocessing
Swappy Boi
No ratings yet
Image Feature Extraction Based On PCA
Document5 pages
Image Feature Extraction Based On PCA
Sanjana Kuril
No ratings yet
20MIS1025 - DecisionTree - Ipynb - Colaboratory
Document4 pages
20MIS1025 - DecisionTree - Ipynb - Colaboratory
Sandip Das
No ratings yet
NguyenTrungThinh BT3.3
Document5 pages
NguyenTrungThinh BT3.3
Nguyen Trung Thinh
No ratings yet
ML With Python Practical
Document22 pages
ML With Python Practical
n58648017
No ratings yet
Codes
Document6 pages
Codes
Vamshi Krishna
No ratings yet
Home Work
Document12 pages
Home Work
sandeepssn47
No ratings yet
Project
Document17 pages
Project
mohamed mohsen
No ratings yet
Machinelearning - Alisya Athirah Binti Mohd Huzzainny (Updated)
Document26 pages
Machinelearning - Alisya Athirah Binti Mohd Huzzainny (Updated)
Alisya Athirah
No ratings yet
K Means Algorithm
Document6 pages
K Means Algorithm
Asir Mosaddek Sakib
No ratings yet
01 249212 012 10129792044 11122022 112910pm
Document8 pages
01 249212 012 10129792044 11122022 112910pm
Safi ullah
No ratings yet
Big Data Merged
Document7 pages
Big Data Merged
Ingame Id
No ratings yet
16BCB0126 VL2018195002535 Pe003
Document40 pages
16BCB0126 VL2018195002535 Pe003
Mohit
No ratings yet
Assignment 2
Document1 page
Assignment 2
estebandgono
No ratings yet
Support Vector Machine A) Classification
Document3 pages
Support Vector Machine A) Classification
4NM20IS003 ABHISHEK A
No ratings yet
Decision Tree
Document3 pages
Decision Tree
saba
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
Document22 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
Ahm Tharwat
No ratings yet
Enc Encoded C GjaZn0H7s4d8TxwCe 3b4pjo24ZJ0okwUOhgcEyM H RQ1n30CeFMvO9vFiyLlyNQ
Document10 pages
Enc Encoded C GjaZn0H7s4d8TxwCe 3b4pjo24ZJ0okwUOhgcEyM H RQ1n30CeFMvO9vFiyLlyNQ
Pranav vignesh
No ratings yet
Advance AI and ML LAB
Document16 pages
Advance AI and ML LAB
Priyanka Priya
No ratings yet
Seguridad ML
Document7 pages
Seguridad ML
andres python
No ratings yet
Exp 6
Document6 pages
Exp 6
jay
No ratings yet
ML p4
Document2 pages
ML p4
Nathon Mine
No ratings yet
ML Lab
Document7 pages
ML Lab
Sharan Patil
No ratings yet
Exp2 - Data Visualization and Cleaning and Feature Selection
Document13 pages
Exp2 - Data Visualization and Cleaning and Feature Selection
mnbatrawi
No ratings yet
Online Payment Fraud Detection Using Machine Learning
Document2 pages
Online Payment Fraud Detection Using Machine Learning
Dev Ranjan Raut
No ratings yet
Solution First Point ML-HW4
Document6 pages
Solution First Point ML-HW4
Juan Sebastian Otálora Montenegro
100% (1)
Prob13: 1 EE16A Homework 13
Document23 pages
Prob13: 1 EE16A Homework 13
Michael ARK
No ratings yet
Machine Learning
Document54 pages
Machine Learning
Jacob
No ratings yet
Ai Combined Update
Document274 pages
Ai Combined Update
John Doe
No ratings yet
Efficient Python Tricks and Tools For Data Scientists
Document20 pages
Efficient Python Tricks and Tools For Data Scientists
Javier Velandia
100% (1)
Efficient Python Tricks and Tools For Data Scientists - by Khuyen Tran
Document20 pages
Efficient Python Tricks and Tools For Data Scientists - by Khuyen Tran
Khagen
No ratings yet
Is Lab 7
Document7 pages
Is Lab 7
Aman Bansal
No ratings yet
Is Lab 7
Document7 pages
Is Lab 7
Aman Bansal
No ratings yet
2 Interface Python With Mysql - Programs
Document2 pages
2 Interface Python With Mysql - Programs
varaprasadpgtitjnv
No ratings yet
Chapter08 - Intro To DL For Computer Vision
Document10 pages
Chapter08 - Intro To DL For Computer Vision
Jas Lim
No ratings yet