Ex - No.5 Clustering

An international catalog company wants to cluster its customers based on common attributes like customer information, products purchased, and sales data without predefined labels. The document describes using WEKA's SimpleKMeans clustering algorithm on a dataset of customers from the US to generate 5 clusters. It details selecting the clustering algorithm, setting the number of clusters to 5, running the algorithm, and visualizing the results to represent the clusters in a graph.

Original Description:

Clustering lab in weka

Original Title

Ex.No.5 Clustering

Uploaded by

Amit kumar

Ex.No. 5
10.05.2021
CLUSTERING
4.1 Clustering Data:
WEKA contains “clusterers” for finding groups of similar instances in a dataset. The
clustering schemes available in WEKA include k-Means, EM, Cobweb, X-means, and
FarthestFirst. Clusters can be visualized and compared to “true” clusters (if given).
If the clustering scheme produces a probability distribution, evaluation is based on
log likelihood.

An international online catalog company wishes to group its customers based on common
features. Company management does not have any predefined labels for these groups. Based
on the outcome of the grouping, they will target marketing and advertising campaigns to the
different groups. The information they have about the customers includes Customer ID,
Customer Name, Customer Count, Product Sold, Sales Channel, Units Sold, and Date Sold.
For our exercise we will use the part of the database that covers customers in the US.
Depending on the type of products sold, not all attributes are important. For example,
suppose the to know the det...
In the ‘Preprocess’ window, click the ‘Open file…’ button and select the “customers.csv” file.
Then click the ‘Cluster’ tab at the top of the WEKA Explorer window.
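Outside the Explorer, the same preparation step can be sketched in Python. This is only an illustration: the sample rows, the column spellings, and the choice to ignore the identifier columns are assumptions, since the real contents of customers.csv are not shown in this exercise.

```python
import csv
import io

# Hypothetical sample rows; the real customers.csv is not shown in this exercise.
SAMPLE_CSV = """CustomerID,CustomerName,CustomerCount,ProductSold,SalesChannel,UnitsSold,DateSold
1001,Acme Corp,3,Books,Online,12,2021-01-15
1002,Beta LLC,1,Toys,Retail,4,2021-02-03
1003,Gamma Inc,5,Books,Online,30,2021-02-20
"""

# Assumption: columns that only identify a customer are left out of clustering.
IGNORED = {"CustomerID", "CustomerName"}


def load_features(text):
    """Return (attribute names, rows) with the identifier columns dropped."""
    reader = csv.DictReader(io.StringIO(text))
    names = [n for n in reader.fieldnames if n not in IGNORED]
    rows = [[record[n] for n in names] for record in reader]
    return names, rows


names, rows = load_features(SAMPLE_CSV)
print(names)
print(rows[0])
```

In the Explorer, a comparable effect is obtained by removing the unwanted attributes in the ‘Preprocess’ window before switching to the ‘Cluster’ tab.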
4.2 Choosing Clustering Scheme:
In the ‘Clusterer’ box click on the ‘Choose’ button. In the pull-down menu select WEKA →
Clusterers, and select the cluster scheme ‘SimpleKMeans’. Some implementations of K-means
only allow numerical values for attributes; WEKA's SimpleKMeans handles a mixture of
categorical and numerical attributes, so we do not need to use a filter.
Once the clustering algorithm is chosen, click on the algorithm name next to the ‘Choose’
button (or right-click it and select ‘Show properties…’); the
“weka.gui.GenericObjectEditor” window comes up on the screen. Set the value in the
“numClusters” box to 5 (instead of the default 2) because you have five clusters in your
data file. Leave the value of ‘seed’ as is. The seed value is used in generating a random
number, which is used for making the initial assignment of instances to clusters. Note that,
in general, K-means is quite sensitive to how clusters are initially assigned. Thus, it is
often necessary to try different seed values and evaluate the results.
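The sensitivity to the seed can be illustrated with a minimal k-means sketch in Python. This is not WEKA's implementation: the toy points are hypothetical, the function names are invented for this example, and the sketch assumes that the seed selects which instances serve as the starting centroids. The within-cluster sum of squared errors is printed so that runs with different seeds can be compared.

```python
import random

# Hypothetical 2-D toy data; not taken from customers.csv.
POINTS = [
    (1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
    (5.0, 5.0), (5.2, 4.9), (4.8, 5.1),
    (9.0, 1.0), (9.1, 1.2), (8.9, 0.9),
]


def squared_distance(p, q):
    """Squared Euclidean distance between two numeric tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, seed, max_iter=100):
    """Plain k-means; returns (centroids, assignments)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # the seed decides the starting centroids
    assignments = None
    for _ in range(max_iter):
        # Assign every point to its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # converged: no point changed cluster
        assignments = new_assignments
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, assignments


def within_cluster_sse(points, centroids, assignments):
    """Sum of squared distances from each point to its cluster centroid."""
    return sum(squared_distance(p, centroids[a]) for p, a in zip(points, assignments))


for seed in (1, 2, 3):
    centroids, assignments = kmeans(POINTS, k=3, seed=seed)
    print(seed, round(within_cluster_sse(POINTS, centroids, assignments), 3))
```

If the printed error differs between seeds, the runs converged to different clusterings, which is why trying several seed values is worthwhile.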
4.3 Setting Test Options:
Before you run the clustering algorithm, you need to choose the ‘Cluster mode’. Click on
the ‘Classes to clusters evaluation’ radio-button in the ‘Cluster mode’ box and select the
attribute to be treated as the class in the pull-down box below.
Once the options have been specified, you can run the clustering algorithm. Click on the
‘Start’ button to execute the algorithm.
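The idea behind comparing clusters with classes can be sketched in Python as follows. This is a simplified illustration rather than WEKA's exact procedure: it labels each cluster with its majority class and counts the instances that disagree. The cluster assignments and class labels are hypothetical, and the function name is invented for this example.

```python
from collections import Counter


def classes_to_clusters(cluster_ids, class_labels):
    """Label each cluster with its majority class and count the mismatches."""
    by_cluster = {}
    for cluster, label in zip(cluster_ids, class_labels):
        by_cluster.setdefault(cluster, Counter())[label] += 1
    # Majority class per cluster.
    mapping = {c: counts.most_common(1)[0][0] for c, counts in by_cluster.items()}
    # Instances whose class differs from the class assigned to their cluster.
    errors = sum(
        1 for c, label in zip(cluster_ids, class_labels) if mapping[c] != label
    )
    return mapping, errors


# Hypothetical cluster assignments and class labels for nine instances.
clusters = [0, 0, 0, 1, 1, 1, 2, 2, 2]
labels = ["A", "A", "B", "B", "B", "B", "C", "C", "A"]

mapping, errors = classes_to_clusters(clusters, labels)
print(mapping)
print(f"Incorrectly clustered instances: {errors} of {len(labels)}")
```

The class attribute plays no part in forming the clusters; it is used only afterwards to judge how well the clusters line up with the classes.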
4.4 Visualization of Results

Another way of representing the results of clustering is through visualization. Right-click on
the entry in the ‘Result list’ and select ‘Visualize cluster assignments’ in the pull-down
window.

This brings up the ‘Weka Clusterer Visualize’ window.

On the ‘Weka Clusterer Visualize’ window, beneath the X-axis selector there is a dropdown
list, ‘Colour’, for choosing the color scheme. This allows you to choose the color of points
based on the attribute selected. Below the plot area, there is a legend that describes what
values the colors correspond to.
