
Clustering

Created: 2023-11-12 20:49


Lecture No.: 6

1. Clustering is also known as unsupervised learning.


2. Organizing data into classes such that there is:
1. High Intra-Class Similarity
2. Low Inter-Class Similarity
3. Each Clustering problem is based on some kind of distance between points
4. Distance is a measure of dissimilarity. The intuitions behind the distance-measure properties are:
1. Symmetry
2. Constancy of Self-Similarity
3. Positivity
4. Triangle Inequality
5. Euclidean Distance:
1. L2 Norm: Square root of the sum of the squared differences between x and y
2. L1 Norm: Sum of the absolute differences in each dimension (Manhattan distance)
6. Non-Euclidean Distance:
1. Jaccard Distance: 1 − (|Intersection| / |Union|) of the two sets
2. Cosine Distance: based on the angle between vectors P and Q, where cos θ = (P · Q) / (|P| |Q|) (a code sketch of these distance measures follows below)
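A minimal sketch of these distance measures in Python (NumPy for the vector math; the function and variable names are illustrative, not from the lecture):

```python
import numpy as np

def euclidean_l2(x, y):
    # L2 norm: square root of the sum of squared differences
    return np.sqrt(np.sum((x - y) ** 2))

def manhattan_l1(x, y):
    # L1 norm: sum of absolute differences in each dimension
    return np.sum(np.abs(x - y))

def jaccard_distance(a, b):
    # 1 - |intersection| / |union| for two sets
    a, b = set(a), set(b)
    return 1 - len(a & b) / len(a | b)

def cosine_distance(p, q):
    # 1 - cos(theta), where cos(theta) = (p . q) / (|p| |q|)
    cos_sim = np.dot(p, q) / (np.linalg.norm(p) * np.linalg.norm(q))
    return 1 - cos_sim

x, y = np.array([1.0, 2.0, 3.0]), np.array([4.0, 0.0, 3.0])
print(euclidean_l2(x, y), manhattan_l1(x, y))
print(jaccard_distance({1, 2, 3}, {2, 3, 4}))   # 0.5
print(cosine_distance(x, y))
```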

7. Types of Clustering:
1. Partitional Algorithms
1. Centroid Based Method
2. Most Common Algorithm: K-Means Clustering
3. Algorithm:
1. Decide the number of centroids, i.e., K
2. Initialize the K cluster centroids
3. Assign each point to the cluster whose centroid is closest to it
4. Re-calculate each centroid by averaging all the points assigned to it
5. If the centroids do not change between two successive iterations, the algorithm has converged (a code sketch of this loop appears after item 7)
2. Hierarchical Algorithms
1. Bottom Up or Hierarchical Agglomerative Clustering:
1. Start with each item in its own cluster and repeatedly find the best pair of clusters to merge into a new cluster
2. Doesn't require us to prespecify the number of clusters
3. Algorithm:
1. Compute all pairwise pattern-pattern similarity coefficients
2. Place each of the n patterns into a class of its own
3. Merge the two most similar clusters into one. Re-compute the inter-cluster similarity scores with respect to the new cluster.
4. Repeat the previous step until k clusters are left (k can be 1)
2. Top Down or Divisive:
1. Starting with all data in a single cluster, consider every possible way to divide
the cluster into two.
2. Cluster is split using a flat clustering algorithm
3. More Complex as a flat clustering algorithm is required as a subroutine.
4. More Efficient: Linear in the number of patterns & Clusters
5. More Accurate: Considers the global distribution, whereas bottom-up methods make merge decisions based only on local information
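A minimal NumPy sketch of the k-means loop described in item 7 (random initialization from the data points and a fixed iteration cap are assumptions, not part of the lecture):

```python
import numpy as np

def k_means(points, k, max_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Steps 1-2: choose K initial centroids from the data points
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(max_iters):
        # Step 3: assign each point to its nearest centroid
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 4: recompute each centroid as the average of its assigned points
        new_centroids = np.array([
            points[labels == c].mean(axis=0) if np.any(labels == c) else centroids[c]
            for c in range(k)
        ])
        # Step 5: stop when the centroids no longer change between iterations
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

data = np.vstack([np.random.randn(50, 2), np.random.randn(50, 2) + 5.0])
labels, centers = k_means(data, k=2)
print(centers)
```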
8. Properties of a Clustering Algorithm:
1. Scalability
2. Ability to deal with different types of data
3. Minimal requirements for Domain Knowledge
4. Able to deal with noise and outliers
5. Insensitive to order of inputs
6. Incorporation of user specified constraints
7. Interpretability & Usability
9. Computing the Distance Matrix (inter-cluster distances used when merging clusters; see the code sketch below):
1. Min Distance (single linkage)
2. Max Distance (complete linkage)
3. Group Average (average linkage)
4. Ward's Method: Increase in squared error when two clusters are merged
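A hedged sketch of bottom-up (agglomerative) clustering with these inter-cluster distance options, using SciPy's hierarchical-clustering routines (the toy data and the choice of Ward's method are illustrative):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Toy 2-D data: two loose groups
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (20, 2)), rng.normal(6, 1, (20, 2))])

# 'single' = min distance, 'complete' = max distance,
# 'average' = group average, 'ward' = Ward's method
Z = linkage(X, method="ward")

# Cut the merge tree so that k = 2 clusters remain
labels = fcluster(Z, t=2, criterion="maxclust")
print(labels)
```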
10. k-Means Method: A Partitional Clustering Approach
1. Strength:
1. Efficient: O(tkn), where t = number of iterations, k = number of clusters, n = number of data objects
2. Often terminates at a local optimum
2. Weakness:
1. Applicable only when a mean is defined; a problem for categorical data
2. Need to prespecify the number of clusters
3. Unable to handle noisy data
4. Not suitable for clusters with non-convex shapes
11. Birch Algorithm:
1. Use an in-memory R-tree to store the points as they are clustered
2. Insert points one at a time into the tree, merging a new point into an existing cluster if it is closer than an allowable threshold
3. If there are more leaf nodes than fit in memory, merge existing clusters that are close
to each other
4. At the end of the first pass, we get a large number of clusters at the leaves of the R-tree (a usage sketch follows).
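For practical use, scikit-learn provides a Birch estimator (internally built on a clustering-feature tree rather than raw points); a minimal usage sketch with illustrative parameter values:

```python
import numpy as np
from sklearn.cluster import Birch

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (100, 2)), rng.normal(4, 0.5, (100, 2))])

# threshold plays the role of the allowable merge distance described above;
# n_clusters controls an optional final global clustering of the leaf clusters
model = Birch(threshold=0.5, branching_factor=50, n_clusters=2)
labels = model.fit_predict(X)
print(np.bincount(labels))
```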
12. Applications of Clustering:
1. Identification of Cancer Cells
2. Search Engines
3. Customer Segmentation
4. Biology: Different Species Classification
5. Land Use: GIS (identifying areas of similar land use)
