Welcome to Scribd!

7-4 Clustering

Uploaded by

0% found this document useful (0 votes)

6 views3 pages

This document discusses clustering data from three different CSV files. The first file contains citizen location data that appears to have 3 clusters. The second file adds voting data for citizens and appears to have 4 clusters. The third file contains customer age and product purchase data to cluster with a sample output using 7 clusters shown. The document instructs clustering the data, normalizing as needed, and plotting the results.

Original Description:

Original Title

7-4 clustering (1)

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

6 views3 pages

7-4 Clustering

Uploaded by

piet

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

Machine Learning: Clustering

1) The file ‘4-1 citizens.csv’ contains information on the location of citizens in a neighborhood.
The file contains the x and y coordinates of where the citizen lives.

Looking at the data, it appears there are 3 living areas in the neighborhood.

Cluster the citizens based on their location and draw the clustered citizens in a scatterplot.
The result should look like the graph below (but feel free to use different colors).
2) The file ‘4-2 citizens.csv’ contains data for another year. It contains both the x and y
coordinate of where the citizen lives and the party they voted for in the previous election.
You see a representation of the data in the graph below.

Based on the observation, we feel that we can split up the data into 4 clusters. Do the
clustering (remember to normalize) and draw the resulting graph.

The result should look like the graph below. (Colors may again look different.)
3) The file ‘4-3 customers.csv’ contains information on customers. It shows customers of
various ages buying your product A, B, C, or D. The graph below represents the data in the
file.

Cluster the data (remember to normalize). Plot the result. Try various settings for the
number of clusters and see what happens. You can easily plot the data in the same manner
as shown above, by adding an extra column to the data that contains the x coordinate on a
scatterplot. A customer has the value 1 for x, if the product is A; 2, if the value is B, …

The graph below shows what your result could look like for 7 clusters.

QMM1001 Module 3 (1) Applied Activity
Document2 pages
QMM1001 Module 3 (1) Applied Activity
IP Rana
No ratings yet
Simple Data Science (R)
From Everand
Simple Data Science (R)
Narayana Nemani
Rating: 5 out of 5 stars
5/5 (1)
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
Document10 pages
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
Gopi Balakrishnan
No ratings yet
Diskusi 7 BING4102
Document8 pages
Diskusi 7 BING4102
istiqomah
100% (1)
Cube Transport Software Guide
Document46 pages
Cube Transport Software Guide
imegha89
No ratings yet
Ch-4 Plotting Data Using Matplotlib
Document32 pages
Ch-4 Plotting Data Using Matplotlib
seemabhatia392
No ratings yet
7-2 K Nearest Neighbors
Document2 pages
7-2 K Nearest Neighbors
piet
No ratings yet
Visualization
Document19 pages
Visualization
Deva Hema D
No ratings yet
L17-18 PPT IVSem
Document38 pages
L17-18 PPT IVSem
Rohit Tiwari
No ratings yet
IT Skill Lab-2
Document23 pages
IT Skill Lab-2
minu kumari
No ratings yet
Math Project Rubric
Document2 pages
Math Project Rubric
api-254133636
No ratings yet
Saveetha Institute of Medical and Technical Sciences: Unit V Plotting and Regression Analysis in R
Document63 pages
Saveetha Institute of Medical and Technical Sciences: Unit V Plotting and Regression Analysis in R
Muzakir Laikh Khan
No ratings yet
Unit5&6 Mba
Document12 pages
Unit5&6 Mba
mahih16237
No ratings yet
Gis Datastruct
Document3 pages
Gis Datastruct
Aqsa Bilal
No ratings yet
Charts in Excel
Document12 pages
Charts in Excel
poorna
100% (1)
Data Visualization
Document28 pages
Data Visualization
vsy9926
No ratings yet
Data Visualization
Document14 pages
Data Visualization
Ihsan baust
No ratings yet
Performance Task #4 - Data Visualization (MMW)
Document2 pages
Performance Task #4 - Data Visualization (MMW)
GAILE MEIZTY MOSADA
No ratings yet
COMP2501 - Assignment - 1 - Questions - RMD 2
Document7 pages
COMP2501 - Assignment - 1 - Questions - RMD 2
yanaa
No ratings yet
Data Analysis Graphs
Document9 pages
Data Analysis Graphs
sid
No ratings yet
Prac - 6
Document7 pages
Prac - 6
Eklavya Sudan
No ratings yet
Unit 13 Data Presentation and Descriptiv
Document35 pages
Unit 13 Data Presentation and Descriptiv
lemesa
No ratings yet
IGNOU Material - Statistics
Document183 pages
IGNOU Material - Statistics
abhishek singh
No ratings yet
Ict Notes
Document8 pages
Ict Notes
VANSHIKA AGARWAL
No ratings yet
07 Scatterplot Barplot Piechart
Document15 pages
07 Scatterplot Barplot Piechart
the killerboy
100% (1)
Aim: Perform Data Visualization On Datasets
Document5 pages
Aim: Perform Data Visualization On Datasets
Natnael Tamirat
No ratings yet
Course3 Notes
Document44 pages
Course3 Notes
Stefano Pentury
No ratings yet
Brunel Documentation
Document36 pages
Brunel Documentation
tarek gabrr
No ratings yet
Gracy File Report
Document18 pages
Gracy File Report
ajay kumar
No ratings yet
CH 5 Collecting and Displaying Data
Document9 pages
CH 5 Collecting and Displaying Data
Catalin Sapariuc
No ratings yet
Matlab III: Graphics and Data Analysis: Updated: August 2012
Document39 pages
Matlab III: Graphics and Data Analysis: Updated: August 2012
wan ismail ibrahim
No ratings yet
Tableau
Document4 pages
Tableau
shaukat JALAL
No ratings yet
Nodexl For Network Analysis Demo/Hands-On at Nicar 2012, ST Louis, Feb 24 Peter Aldhous, San Francisco Bureau Chief
Document14 pages
Nodexl For Network Analysis Demo/Hands-On at Nicar 2012, ST Louis, Feb 24 Peter Aldhous, San Francisco Bureau Chief
helmioemry
No ratings yet
Lab 7 Spatial Selection AP
Document13 pages
Lab 7 Spatial Selection AP
acwriters123
No ratings yet
Lab Assignment 19
Document5 pages
Lab Assignment 19
yash choudhary
No ratings yet
Bertin - Matrix Theory of Graphics - IDJ - Ocr
Document12 pages
Bertin - Matrix Theory of Graphics - IDJ - Ocr
Ricardo Cunha Lima
No ratings yet
Sant Rawool Maharaj, Mhavidyalay
Document14 pages
Sant Rawool Maharaj, Mhavidyalay
anamica
No ratings yet
Unit 18
Document10 pages
Unit 18
mohamed ahmed Hamada
No ratings yet
Midterm Module 1a
Document14 pages
Midterm Module 1a
Ma. Lourdes “Ria” Villanueva
No ratings yet
Introduction To Tecplot 10-1
Document19 pages
Introduction To Tecplot 10-1
snehil2789
No ratings yet
Q1) Solve Any Five A) What Is The Difference Between Inferential and Descriptive Statistics? Sample
Document6 pages
Q1) Solve Any Five A) What Is The Difference Between Inferential and Descriptive Statistics? Sample
Amar Nath Babar
No ratings yet
06 Plots Export Plots
Document17 pages
06 Plots Export Plots
the killerboy
100% (1)
Assignment 05
Document2 pages
Assignment 05
deens logo
No ratings yet
R Assignment 1
Document3 pages
R Assignment 1
Sudarshan Kumar
No ratings yet
In Line
Document9 pages
In Line
AARUSH SABOO
No ratings yet
Declaration:: MGNM801 Business Analytics-I
Document12 pages
Declaration:: MGNM801 Business Analytics-I
akash hossain
No ratings yet
Visualization
Document27 pages
Visualization
sbapoorvaa1
No ratings yet
Kinds of Graphs2
Document31 pages
Kinds of Graphs2
Xandra Lee
No ratings yet
Data Analytics Using SAS For Economists: Problem Sheet - 1
Document2 pages
Data Analytics Using SAS For Economists: Problem Sheet - 1
Ankit Dangi
No ratings yet
Business Report Project Machine Learning Rupesh Kumar DSBA-A5-21C-2021
Document77 pages
Business Report Project Machine Learning Rupesh Kumar DSBA-A5-21C-2021
Rupesh Gaur
100% (1)
Understanding Queries in Report Studio
Document4 pages
Understanding Queries in Report Studio
Ranjith Joseph
No ratings yet
Module 4 - Exercises
Document2 pages
Module 4 - Exercises
dfer43
No ratings yet
Correspondance Analysis
Document16 pages
Correspondance Analysis
Sooraj Muralee
No ratings yet
How Do You Make A Histogram
Document4 pages
How Do You Make A Histogram
api-162641823
No ratings yet
Arcmap: Ahmad Mokhtari
Document31 pages
Arcmap: Ahmad Mokhtari
ripal
100% (1)
Apunts BLOC 1 Estadística
Document15 pages
Apunts BLOC 1 Estadística
Mayssae Essabbar
No ratings yet
GIS Lab
Document12 pages
GIS Lab
Sravan Kumar
No ratings yet
Tableau Ans.
Document25 pages
Tableau Ans.
shubham chatterjee
No ratings yet
Geographic Information System - Notes
Document18 pages
Geographic Information System - Notes
36-Rumaisa Ravi
No ratings yet
G. Arrays: by LT Col Tom Schorsch
Document13 pages
G. Arrays: by LT Col Tom Schorsch
pirulitochampion
No ratings yet
6-2 List Search and Sort
Document2 pages
6-2 List Search and Sort
piet
No ratings yet
7-2 K Nearest Neighbors
Document2 pages
7-2 K Nearest Neighbors
piet
No ratings yet
6-3 Graph Search and Shortest Path
Document2 pages
6-3 Graph Search and Shortest Path
piet
No ratings yet
6-4 Knapsack and Bin Packing
Document1 page
6-4 Knapsack and Bin Packing
piet
No ratings yet