
Assignment – 5

Name – Vedant Modi


Reg. no.- 20BCE2126
Course – Machine Learning
Lab – L21+L22
Question: Use the wine dataset from the sklearn library and try to form
clusters of wine behaviour using the Malic Acid and Proline features.
Drop the other features for simplicity.
- Create a scatter plot of the above-mentioned features of the wine dataset.
- Figure out if any pre-processing such as scaling would help here.
- Draw an elbow plot and from that figure out an optimal value of k.
import pandas as pd
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans
from matplotlib import pyplot as plt

# Load the wine dataset


wine = load_wine()

# Create a DataFrame with only the Malic Acid and Proline features
# (malic_acid is column 1 and proline is column 12 of wine.data)
data = pd.DataFrame(wine.data[:, [1, 12]], columns=['Malic acid', 'Proline'])

# Create a scatter plot of the Malic Acid and Proline features


plt.scatter(data['Malic acid'], data['Proline'])
plt.xlabel('Malic acid')
plt.ylabel('Proline')
plt.show()

# Standardize the data


scaler = StandardScaler()
scaled_data = scaler.fit_transform(data)
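Scaling is needed here because Proline is measured in the hundreds to thousands while Malic acid sits roughly between 1 and 6, so the Euclidean distances used by k-means would be dominated almost entirely by Proline. A quick sanity check (a small sketch, not part of the original code) confirms the scale gap and verifies that each standardized column ends up with mean ≈ 0 and standard deviation ≈ 1:

```python
import numpy as np
import pandas as pd
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler

wine = load_wine()
data = pd.DataFrame(wine.data[:, [1, 12]], columns=['Malic acid', 'Proline'])

# Before scaling: Proline values are hundreds of times larger than
# Malic acid, so it would dominate the k-means distance computation
print(data.describe().loc[['mean', 'std']])

# After StandardScaler, each column has mean ~0 and std ~1
scaled = StandardScaler().fit_transform(data)
print(np.allclose(scaled.mean(axis=0), 0, atol=1e-9))  # True
print(np.allclose(scaled.std(axis=0), 1))              # True
```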

# Compute the sum of squared distances for a range of k values


k_values = range(1, 11)
sse = []
for k in k_values:
    # n_init=10 pins the current default explicitly (and silences the
    # FutureWarning about its default changing); random_state makes
    # the runs reproducible
    kmeans = KMeans(n_clusters=k, n_init=10, random_state=42)
    kmeans.fit(scaled_data)
    sse.append(kmeans.inertia_)

# Plot the elbow method
plt.plot(k_values, sse)
plt.xlabel('Number of clusters (k)')
plt.ylabel('Sum of squared distances')
plt.show()
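The elbow plot can be ambiguous when the bend is gentle. One common complementary check (an addition beyond the assignment brief, not part of the original code) is the silhouette score, which measures how well-separated the clusters are; the k with the highest score is a reasonable candidate and can be compared against the elbow's suggestion:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_wine
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

wine = load_wine()
scaled_data = StandardScaler().fit_transform(wine.data[:, [1, 12]])

# Silhouette needs at least 2 clusters; higher scores mean
# tighter, better-separated clusters (range is -1 to 1)
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10,
                    random_state=42).fit_predict(scaled_data)
    print(k, round(silhouette_score(scaled_data, labels), 3))
```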

# Determine the optimal number of clusters using the elbow method:
# the SSE curve flattens noticeably after k = 3
optimal_k = 3

# Perform clustering with the optimal number of clusters


kmeans = KMeans(n_clusters=optimal_k, n_init=10, random_state=42)
kmeans.fit(scaled_data)


# Assign cluster labels to each data point


data['cluster'] = kmeans.labels_

# Print the results


print('Cluster labels:')
print(data['cluster'].value_counts())

Cluster labels:
0 65
2 57
1 56
Name: cluster, dtype: int64
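A natural closing step (an addition beyond what the report shows) is to redraw the scatter plot coloured by cluster label, with the centroids mapped back from standardized units to the original feature scale via `inverse_transform` so they land on the same axes as the data:

```python
import pandas as pd
from matplotlib import pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler

wine = load_wine()
data = pd.DataFrame(wine.data[:, [1, 12]], columns=['Malic acid', 'Proline'])

scaler = StandardScaler()
scaled_data = scaler.fit_transform(data)
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42).fit(scaled_data)

# Colour each point by its cluster label and mark the centroids,
# converting the centroids back to the original units for plotting
centroids = scaler.inverse_transform(kmeans.cluster_centers_)
plt.scatter(data['Malic acid'], data['Proline'], c=kmeans.labels_)
plt.scatter(centroids[:, 0], centroids[:, 1], marker='*', s=200, c='red')
plt.xlabel('Malic acid')
plt.ylabel('Proline')
plt.show()
```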
