
Clustering is a popular technique in data analysis used to group similar objects or observations into distinct categories or clusters. Two commonly used clustering algorithms are hierarchical and K-means clustering. Both of these techniques have their own strengths and weaknesses, and their selection largely depends on the nature of the data being analyzed and the research question.

Hierarchical clustering, in its common agglomerative form, is a bottom-up approach: each observation starts as its own cluster, and the closest clusters are progressively merged into larger ones based on their similarity or dissimilarity. The results of hierarchical clustering can be visualized using a dendrogram, which shows the clustering hierarchy and the distances at which clusters merge. The main advantage of hierarchical clustering is that it does not require the number of clusters to be pre-specified, and it can be used to identify nested clusters within the data. However, it is computationally intensive, and the results can be sensitive to the choice of distance metric and linkage method used.
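
To make the merging process concrete, here is a minimal sketch of agglomerative clustering in plain Python, using Euclidean distance and single linkage on a small made-up dataset (the data, function names, and linkage choice are illustrative assumptions, not part of the text above):

```python
import math

def agglomerative(points, n_clusters):
    """Bottom-up clustering: start with one cluster per point, then
    repeatedly merge the two closest clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]  # each point is its own cluster
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = min(math.dist(points[a], points[b])
                        for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge the closest pair
        del clusters[j]
    return clusters

# Toy data: two tight groups and one outlier.
data = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.2, 4.9), (9.0, 0.1)]
print(agglomerative(data, 2))  # → [[0, 1], [2, 3, 4]]
```

The nested pairwise search is what makes the method computationally intensive, as noted above, and swapping the `min` for a `max` (complete linkage) or a mean (average linkage) can change the resulting clusters.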

On the other hand, K-means clustering is a partitional approach, which divides the data directly into a pre-specified number of clusters based on the similarity of observations. The algorithm assigns each observation to the cluster whose centroid is nearest, then recomputes the centroids; the process is repeated until the centroids no longer change, indicating convergence. The main advantage of K-means clustering is that it is computationally efficient and can handle large datasets with a large number of observations. However, it requires the number of clusters to be pre-specified and can be sensitive to the choice of the initial centroid positions.
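
The assign-then-update loop described above (Lloyd's algorithm) can be sketched in plain Python as follows; the toy data, seeding rule, and function names are illustrative assumptions (production code would use a smarter initialization such as k-means++):

```python
import math

def kmeans(points, k, iters=100):
    """Lloyd's algorithm: alternate assigning points to the nearest
    centroid and recomputing centroids until assignments stabilize."""
    # Naive seeding: take the first k points as initial centroids.
    centroids = [points[i] for i in range(k)]
    assignment = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:
            break  # converged: assignments no longer change
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members)
                                     for dim in zip(*members))
    return assignment, centroids

data = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.2, 4.9), (5.1, 5.2)]
labels, cents = kmeans(data, 2)
print(labels)  # → [0, 0, 1, 1, 1]
```

Each iteration touches every point once, which is why K-means scales well; the seeding line is where sensitivity to initial centroid positions enters.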

Both hierarchical and K-means clustering have their own strengths and limitations,
and the choice of algorithm depends on the specific research question and data
characteristics. Hierarchical clustering is useful when the number of clusters is not
known in advance, and nested clusters are of interest. K-means clustering is
appropriate when the number of clusters is known in advance and computational
efficiency is important.

In conclusion, hierarchical and K-means clustering are popular techniques for grouping similar objects or observations into distinct clusters. Both have their own strengths and limitations, and the selection of the algorithm depends on the nature of the data being analyzed and the research question. By using these techniques, analysts can gain insights into the structure of the data and identify patterns and relationships that may not be immediately apparent.
