Welcome to Scribd!

Click Here To Browse The K-Means Clustering Code in Google Colab

Uploaded by

0% found this document useful (0 votes)

27 views1 page

The document discusses two assignments: 1) Experimenting with K-means clustering on the Iris dataset using different numbers of clusters from 1 to 10, plotting the errors, and determining the optimal number of clusters. 2) Enhancing a spam classifier code from using bag-of-words representation to TF-IDF (Term Frequency - Inverse Document Frequency) as features instead of word count. The document provides links to code examples and explanations of K-means clustering, TF-IDF, and relevant Scikit-learn modules.

Original Description:

Original Title

3_Week3_Assignment

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

27 views1 page

Click Here To Browse The K-Means Clustering Code in Google Colab

Uploaded by

Giridhar Reddy

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

Assignment

1) Study the code showing K-means clustering using the Iris dataset. The number of
clusters is chosen to be 5.

Click here to browse the K-means clustering code in Google Colab.

a) Experiment within different values of number of clusters (say from 1 to 10) and store
the error in a list.
(Hint: Error = [] Error.append(model_kmeans.inertia_))
The K-means algorithm aims to choose centroids that minimise the inertia, or within-
cluster sum-of-squares criterion (https://scikit-learn.org/stable/modules/clustering.html)
b) Plot a graph where X axis represents the number of clusters and Y axis represents
the error. What is the optimal value of the number of clusters?
View this video to understand the graph that you have plotted.

2) Study the code for a simple spam classifier using Bag of Words representation (each
feature is basically the frequency of a particular word in the document)

Click here to browse the code in Colab.

Now enhance the code to use Term Frequency — Inverse Document Frequency (TF-
IDF) as feature instead of word count.

Hint: Please refer the examples by visting the links below:

Explanation of TF-IDF
SK-learn page of TfidfVectorizer

AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Google Interview Questions
Document9 pages
Google Interview Questions
radz143
No ratings yet
Exam18 ICSE Sample Paper Computer Applications PDF
Document7 pages
Exam18 ICSE Sample Paper Computer Applications PDF
Yash Dubey
No ratings yet
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet
C# Interview Questions You'll Most Likely Be Asked
From Everand
C# Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
What Is Matlab
Document3 pages
What Is Matlab
raghgk2012
No ratings yet
Dbms University Paper 18-19
Document2 pages
Dbms University Paper 18-19
kritikasaini1712
No ratings yet
ISCL - Wintersemester 2007 - IR - Midterm Exam
Document6 pages
ISCL - Wintersemester 2007 - IR - Midterm Exam
Pulkit Mehndiratta
No ratings yet
MCS 031
Document5 pages
MCS 031
rajivkk
No ratings yet
Eee & Ece-Cpl-Set-I
Document2 pages
Eee & Ece-Cpl-Set-I
kisnamohan
No ratings yet
DBMS NCS 502 2017 18
Document2 pages
DBMS NCS 502 2017 18
apurvbjp
No ratings yet
MCS 201
Document4 pages
MCS 201
Bedodipti Choudhary
No ratings yet
PPL 2017 Aktu Paper
Document2 pages
PPL 2017 Aktu Paper
Shivanand Pal
No ratings yet
CS 322 Assignment 2 UBC 2015
Document3 pages
CS 322 Assignment 2 UBC 2015
cauliflowerpower
No ratings yet
Answer All Questions, Each Carries 3 Marks: Reg No.: - Name
Document2 pages
Answer All Questions, Each Carries 3 Marks: Reg No.: - Name
Mooo Point
No ratings yet
HW 1
Document4 pages
HW 1
Anonymous gUySMcpSq
No ratings yet
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
Document2 pages
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
aditi
No ratings yet
9A05402 Object Oriented Programming
Document4 pages
9A05402 Object Oriented Programming
sivabharathamurthy
No ratings yet
VLSI Design NOV 17
Document2 pages
VLSI Design NOV 17
Saurabh Bhise
No ratings yet
Oot
Document19 pages
Oot
Priyanka R Shah
No ratings yet
UEE1304 Answer Key
Document20 pages
UEE1304 Answer Key
zentill
No ratings yet
Question Paper Code:: (10×2 20 Marks)
Document4 pages
Question Paper Code:: (10×2 20 Marks)
Nallasivam Munnur
No ratings yet
Model Answer-17332 (PR - Test - 50 - 1) - FIRST
Document21 pages
Model Answer-17332 (PR - Test - 50 - 1) - FIRST
Rohit Parsode
No ratings yet
CSE 331 Final
Document2 pages
CSE 331 Final
BD Entertainment
No ratings yet
Assign1 Ans
Document31 pages
Assign1 Ans
Navin Andrew Prince
100% (1)
ServiceNow Interview Questioner
Document3 pages
ServiceNow Interview Questioner
rajendergrr
No ratings yet
B) Define:-1) Tag 2) Tag 3) Tag
Document1 page
B) Define:-1) Tag 2) Tag 3) Tag
Fusion x
No ratings yet
Ec2202 - Data Structures and Object Oriented Programming in C++ Model Question Paper Total Marks: 100 Part A - (10 X 2 20 Marks) Answer ALL Questions
Document6 pages
Ec2202 - Data Structures and Object Oriented Programming in C++ Model Question Paper Total Marks: 100 Part A - (10 X 2 20 Marks) Answer ALL Questions
knk14091991
No ratings yet
M.SC (Computer Science) 2008 Pattern
Document48 pages
M.SC (Computer Science) 2008 Pattern
Temp
No ratings yet
DBMS
Document20 pages
DBMS
Sravanti Bagchi
No ratings yet
Cst0 Computer Science Tripos Part I
Document11 pages
Cst0 Computer Science Tripos Part I
Epics Godfrey
No ratings yet
Time A/Lorted: 3 Hours: Maulana Abul Kalam Azad University Csl8.Tf - CH (N) Ioddisem - 313443/'2Q22 - 202311oos
Document1 page
Time A/Lorted: 3 Hours: Maulana Abul Kalam Azad University Csl8.Tf - CH (N) Ioddisem - 313443/'2Q22 - 202311oos
Abshishek Ghosh
No ratings yet
Attachment 3
Document4 pages
Attachment 3
Talha Tahir
No ratings yet
Homework Exercise 4: Statistical Learning, Fall 2020-21
Document3 pages
Homework Exercise 4: Statistical Learning, Fall 2020-21
Elinor Rahamim
No ratings yet
M.SC (Computer Science) 2008 2011 Pattern
Document45 pages
M.SC (Computer Science) 2008 2011 Pattern
Temp
No ratings yet
Btech 1 Sem Programming For Problem Solving kcs101 2021
Document1 page
Btech 1 Sem Programming For Problem Solving kcs101 2021
rishabhchauhan2266
No ratings yet
Bachelor of Computer Application (B.C.A.) Semester-I (C.B.S.) Examination "C" Programming Paper-II
Document2 pages
Bachelor of Computer Application (B.C.A.) Semester-I (C.B.S.) Examination "C" Programming Paper-II
Saurabh Raut
No ratings yet
Question Paper Code:: (10×2 20 Marks)
Document3 pages
Question Paper Code:: (10×2 20 Marks)
Sinduja Baskaran
No ratings yet
Advance Digital Design Using Verilog Nec024r
Document2 pages
Advance Digital Design Using Verilog Nec024r
Manjeet Singh
No ratings yet
Java Question Paper 1
Document5 pages
Java Question Paper 1
usha
No ratings yet
Assignment Four: CSPS: Question One
Document2 pages
Assignment Four: CSPS: Question One
Cindy San
No ratings yet
Lab 2 Questions
Document11 pages
Lab 2 Questions
mac one
No ratings yet
A514469624 - 63640 - 1 - 2022 - Cse228 K21MD Ca1
Document12 pages
A514469624 - 63640 - 1 - 2022 - Cse228 K21MD Ca1
Narendra Reddy
No ratings yet
Lab (I)
Document3 pages
Lab (I)
anand_sesham
No ratings yet
Computer Applications 2020
Document6 pages
Computer Applications 2020
Jeethesh
No ratings yet
Introduction To Programming Sample Question Paper
Document35 pages
Introduction To Programming Sample Question Paper
Cs
No ratings yet
Data Structures rcs305 2020
Document2 pages
Data Structures rcs305 2020
Shivanshu Kumar Upadhyay
No ratings yet
Paper For Aptech Dism-Unsolved
Document16 pages
Paper For Aptech Dism-Unsolved
Hussain Baloch
No ratings yet
Python
Document2 pages
Python
Aditi Kokane
No ratings yet
MCS 024june 12 PDF
Document3 pages
MCS 024june 12 PDF
Technical Sach
No ratings yet
Using Categorical Data With One Hot Encoding - Kaggle PDF
Document4 pages
Using Categorical Data With One Hot Encoding - Kaggle PDF
Mathias Mbizvo
No ratings yet
Java Programming Sample Paper 5th Semester MSBTE Diploma
Document2 pages
Java Programming Sample Paper 5th Semester MSBTE Diploma
Sanjay Dudani
100% (7)
Database Management System Ncs 502
Document2 pages
Database Management System Ncs 502
compiler&automata
No ratings yet
Unit Iv - Syntax Directed Translation & Run Time Environment
Document8 pages
Unit Iv - Syntax Directed Translation & Run Time Environment
shailesh waran
No ratings yet
Computer Programming
Document4 pages
Computer Programming
andhracolleges
No ratings yet
JPR Sample Question Paper - 1
Document2 pages
JPR Sample Question Paper - 1
api-3728136
No ratings yet
X10310 (CS8251)
Document2 pages
X10310 (CS8251)
5052 - UTHRA .T
No ratings yet
MTP Solution
Document19 pages
MTP Solution
Vanshika Garg
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Fitting A Model: Generalization, Overfitting, Underfitting
Document1 page
Fitting A Model: Generalization, Overfitting, Underfitting
Giridhar Reddy
No ratings yet
Movie Recommender Systems: By: BATCH-1 (Group-1) Karam Shraddha Rai Mayank Binny Jeshan
Document6 pages
Movie Recommender Systems: By: BATCH-1 (Group-1) Karam Shraddha Rai Mayank Binny Jeshan
Giridhar Reddy
No ratings yet
3 Week4 Network of Perceptron
Document1 page
3 Week4 Network of Perceptron
Giridhar Reddy
No ratings yet
Key ML Terminology: Labels
Document1 page
Key ML Terminology: Labels
Giridhar Reddy
No ratings yet
ML Week1 Tools Services PDF
Document1 page
ML Week1 Tools Services PDF
Giridhar Reddy
No ratings yet
Interesting Use Cases of AI (
Document10 pages
Interesting Use Cases of AI (
Giridhar Reddy
No ratings yet
ML Week1 Learning Styles PDF
Document2 pages
ML Week1 Learning Styles PDF
Giridhar Reddy
No ratings yet
ML Week - 1 Definitions PDF
Document1 page
ML Week - 1 Definitions PDF
Giridhar Reddy
No ratings yet