Mean Shift Algorithm in Density Estimation

Mean shift is an algorithm used for locating maxima and modes of density functions. It is an iterative procedure that takes an initial estimate and shifts it towards the mean of nearby points within a given kernel. This shift in the mean continues iteratively until the mean value converges. Mean shift has applications in cluster analysis, tracking objects in video, and image smoothing. It is a non-parametric technique that does not assume a particular shape for clusters of data.

Mean shift

Mean shift is a non-parametric feature-space mathematical analysis technique for locating the maxima of a
density function, a so-called mode-seeking algorithm.[1] Application domains include cluster analysis in
computer vision and image processing.[2]

History
The mean shift procedure is usually credited to work by Fukunaga and Hostetler in 1975.[3] It is, however,
reminiscent of earlier work by Schnell in 1964.[4]

Overview
Mean shift is a procedure for locating the maxima (the modes) of a density function given discrete data
sampled from that function.[1] This is an iterative method, and we start with an initial estimate $x$. Let a
kernel function $K(x_i - x)$ be given. This function determines the weight of nearby points for
re-estimation of the mean. Typically a Gaussian kernel on the distance to the current estimate is used,
$K(x_i - x) = e^{-c\|x_i - x\|^2}$. The weighted mean of the density in the window determined by $K$ is

$$m(x) = \frac{\sum_{x_i \in N(x)} K(x_i - x)\, x_i}{\sum_{x_i \in N(x)} K(x_i - x)}$$

where $N(x)$ is the neighborhood of $x$, a set of points for which $K(x_i - x) \neq 0$.

The difference $m(x) - x$ is called mean shift in Fukunaga and Hostetler.[3] The mean-shift algorithm now
sets $x \leftarrow m(x)$, and repeats the estimation until $m(x)$ converges.
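
The iteration described above can be sketched in a few lines of plain Python. This is a minimal illustration rather than a reference implementation. It assumes a Gaussian kernel of the form exp(-c * ||x_i - x||^2) with an arbitrary scale constant `c`, treats every sample as part of the neighborhood (the Gaussian weight is never exactly zero in exact arithmetic), and uses hypothetical helper names and made-up sample points.

```python
import math

def gaussian_weight(xi, x, c=1.0):
    """Gaussian kernel on the squared distance between sample xi and estimate x."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(xi, x))
    return math.exp(-c * sq_dist)

def weighted_mean(x, points, c=1.0):
    """Kernel-weighted mean m(x) of the samples around the current estimate x."""
    weights = [gaussian_weight(p, x, c) for p in points]
    total = sum(weights)
    return tuple(
        sum(w * p[d] for w, p in zip(weights, points)) / total
        for d in range(len(x))
    )

def mean_shift(x, points, c=1.0, tol=1e-8, max_iter=1000):
    """Repeat x <- m(x) until the shift m(x) - x is shorter than tol."""
    for _ in range(max_iter):
        m = weighted_mean(x, points, c)
        shift = math.sqrt(sum((a - b) ** 2 for a, b in zip(m, x)))
        x = m
        if shift < tol:
            break
    return x

# Made-up 2-D samples forming two groups, near (0, 0) and near (5, 5).
points = [(0.0, 0.0), (0.2, 0.1), (-0.1, 0.2),
          (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]

print(mean_shift((1.0, 1.0), points))  # climbs to the mode of the first group
print(mean_shift((4.0, 4.0), points))  # climbs to the mode of the second group
```

Which mode is reached depends only on the starting estimate. A start point very far from every sample could make all weights underflow to zero, which this sketch does not guard against.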

Although the mean shift algorithm has been widely used in many applications, a rigorous proof of the
convergence of the algorithm using a general kernel in a high-dimensional space is still not known.[5]
Aliyari Ghassabeh showed the convergence of the mean shift algorithm in one dimension with a
differentiable, convex, and strictly decreasing profile function.[6] However, the one-dimensional case has
limited real-world applications. The convergence of the algorithm in higher dimensions has also been
proved for the case of a finite number of stationary (or isolated) points.[5][7] However, sufficient
conditions for a general kernel function to have finitely many stationary (or isolated) points have not
been provided.

Gaussian mean shift is an expectation–maximization algorithm.[8]

Details
Let data be a finite set $S$ embedded in the $n$-dimensional Euclidean space, $X$. Let $K$ be a flat kernel that is
the characteristic function of the $\lambda$-ball in $X$,

$$K(x) = \begin{cases} 1 & \text{if } \|x\| \le \lambda \\ 0 & \text{if } \|x\| > \lambda \end{cases}$$

In each iteration of the algorithm, $s \leftarrow m(s)$ is performed for all $s \in S$ simultaneously. The first question,
then, is how to estimate the density function given a sparse set of samples. One of the simplest approaches
is to just smooth the data, e.g., by convolving it with a fixed kernel of width $h$,

$$f(x) = \sum_i K(x - x_i) = \sum_i k\left(\frac{\|x - x_i\|^2}{h^2}\right)$$

where $x_i$ are the input samples and $k(r)$ is the kernel function (or Parzen window). $h$ is the only parameter
in the algorithm and is called the bandwidth. This approach is known as kernel density estimation or the
Parzen window technique. Once we have computed $f(x)$ from the equation above, we can find its local
maxima using gradient ascent or some other optimization technique. The problem with this "brute force"
approach is that, for higher dimensions, it becomes computationally prohibitive to evaluate $f(x)$ over the
complete search space. Instead, mean shift uses a variant of what is known in the optimization literature as
multiple restart gradient descent. Starting at some guess for a local maximum, $y_k$, which can be a random
input data point $x_1$, mean shift computes the gradient of the density estimate $f(x)$ at $y_k$ and takes an uphill
step in that direction.[9]
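
To make the "brute force" approach concrete, the sketch below evaluates an unnormalized kernel density estimate of the form f(x) = sum_i k(|x - x_i|^2 / h^2) on a regular 1-D grid and counts its peaks. The Gaussian profile, the grid resolution, the sample values, and the function names are all assumptions chosen for illustration.

```python
import math

def gaussian_profile(r):
    """Assumed Gaussian profile k(r) = exp(-r / 2) on a scaled squared distance r."""
    return math.exp(-0.5 * r)

def density_estimate(x, samples, h, profile=gaussian_profile):
    """Unnormalized estimate f(x) = sum_i k(|x - x_i|^2 / h^2) for 1-D samples."""
    return sum(profile((x - xi) ** 2 / h ** 2) for xi in samples)

def count_grid_maxima(samples, h, lo, hi, steps=400):
    """Brute force: evaluate f on a regular grid and count strict interior peaks."""
    xs = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    f = [density_estimate(x, samples, h) for x in xs]
    return sum(1 for i in range(1, steps) if f[i - 1] < f[i] > f[i + 1])

# Made-up 1-D samples forming two groups, around 1 and around 4.
samples = [1.0, 1.2, 0.9, 1.1, 4.0, 4.2, 3.9, 4.1]

for h in (0.5, 3.0):
    print(h, count_grid_maxima(samples, h, lo=-1.0, hi=6.0))
```

With these samples the small bandwidth keeps the two groups as separate peaks, while the large one merges them into a single mode. The grid has 401 evaluation points in one dimension; the same resolution in d dimensions would need 401^d evaluations, which is the cost that mean shift avoids by climbing from individual start points.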

Types of kernels
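Kernel definition: Let $X$ be the $n$-dimensional Euclidean space, $\mathbb{R}^n$. The norm of $x$ is a non-negative
number, $\|x\| = \sqrt{x^\top x} \ge 0$. A function $K: X \to \mathbb{R}$ is said to be a kernel if there exists a profile,
$k: [0, \infty) \to \mathbb{R}$, such that

$$K(x) = k(\|x\|^2)$$

and

k is non-negative.
k is non-increasing: $k(a) \ge k(b)$ if $a < b$.
k is piecewise continuous and $\int_0^\infty k(r)\,dr < \infty$.

The two most frequently used kernel profiles for mean shift are:

Flat kernel

$$k(x) = \begin{cases} 1 & \text{if } x \le \lambda \\ 0 & \text{if } x > \lambda \end{cases}$$

Gaussian kernel

$$k(x) = e^{-\frac{x}{2\sigma^2}}$$

where the standard deviation parameter $\sigma$ works as the bandwidth parameter, $h$.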
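
The two profiles can be written down directly. The sketch below assumes a flat profile that equals 1 up to a threshold `lam` and 0 beyond it, and a Gaussian profile exp(-r / (2 * sigma^2)); the function names and default parameter values are hypothetical. It also spot-checks the non-negativity and non-increasing conditions on a small grid, which is a sanity check rather than a proof.

```python
import math

def flat_profile(r, lam=1.0):
    """Flat profile: 1 while r <= lam, 0 beyond it."""
    return 1.0 if r <= lam else 0.0

def gaussian_profile(r, sigma=1.0):
    """Gaussian profile: exp(-r / (2 * sigma^2))."""
    return math.exp(-r / (2.0 * sigma ** 2))

def kernel(x, profile):
    """Radially symmetric kernel K(x) = k(||x||^2) built from a profile k."""
    return profile(sum(v * v for v in x))

def looks_like_profile(profile, grid):
    """Spot-check on a grid: values are non-negative and never increase."""
    values = [profile(r) for r in grid]
    non_negative = all(v >= 0.0 for v in values)
    non_increasing = all(a >= b for a, b in zip(values, values[1:]))
    return non_negative and non_increasing

grid = [0.1 * i for i in range(100)]
print(kernel((0.3, 0.4), flat_profile))      # squared norm 0.25, inside the window
print(kernel((3.0, 4.0), flat_profile))      # squared norm 25, outside the window
print(kernel((0.0, 0.0), gaussian_profile))  # largest Gaussian weight, at the center
print(looks_like_profile(flat_profile, grid), looks_like_profile(gaussian_profile, grid))
```

The flat profile gives every point inside the window the same weight, whereas the Gaussian profile weights points by distance and never drops to exactly zero.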

Applications

Clustering
Consider a set of points in two-dimensional space. Assume a circular window centered at $C$ and having
radius $r$ as the kernel. Mean shift is a hill-climbing algorithm that shifts this kernel iteratively
to a higher-density region until convergence. Every shift is defined by a mean shift vector. The mean shift
vector always points in the direction of the maximum increase in the density. At every iteration the
kernel is shifted to the centroid, or mean, of the points within it. The method of calculating this mean
depends on the choice of the kernel. If a Gaussian kernel is chosen instead of a flat kernel, then
every point is first assigned a weight that decays exponentially as the distance from the kernel's
center increases. At convergence, there is no direction in which a shift can accommodate more points
inside the kernel.
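
The clustering procedure with a flat circular window can be sketched as follows. One window is started at every data point, and points whose windows converge to nearly the same location receive the same label. The merge tolerance (half the radius by default), the function names, and the sample points are assumptions made for this illustration; they are not part of the description above. The sketch needs Python 3.8 or later for `math.dist`.

```python
import math

def window_mean(center, points, radius):
    """Centroid of the points that fall inside the circular window."""
    inside = [p for p in points if math.dist(p, center) <= radius]
    return tuple(sum(p[d] for p in inside) / len(inside)
                 for d in range(len(center)))

def seek_mode(start, points, radius, tol=1e-9, max_iter=500):
    """Shift the window to the centroid of its contents until it stops moving."""
    center = start
    for _ in range(max_iter):
        new_center = window_mean(center, points, radius)
        if math.dist(new_center, center) < tol:
            return new_center
        center = new_center
    return center

def cluster(points, radius, merge_tol=None):
    """Label each point by the mode that its window converges to."""
    merge_tol = radius / 2 if merge_tol is None else merge_tol  # assumed tolerance
    modes, labels = [], []
    for p in points:
        mode = seek_mode(p, points, radius)
        for idx, known in enumerate(modes):
            if math.dist(mode, known) <= merge_tol:
                labels.append(idx)
                break
        else:
            modes.append(mode)
            labels.append(len(modes) - 1)
    return modes, labels

# Made-up 2-D points forming two well-separated groups.
points = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.2),
          (5.0, 5.0), (5.2, 4.9), (4.8, 5.1), (5.1, 5.2)]
modes, labels = cluster(points, radius=1.0)
print(modes)
print(labels)
```

Because every window starts on a data point, it always contains at least one point, and the number of clusters is never specified in advance; it follows from the radius.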

Tracking

The mean shift algorithm can be used for visual tracking. The simplest such algorithm would create a
confidence map in the new image based on the color histogram of the object in the previous image, and use
mean shift to find the peak of a confidence map near the object's old position. The confidence map is a
probability density function on the new image, assigning each pixel of the new image a probability, which
is the probability of the pixel color occurring in the object in the previous image. A few algorithms, such as
kernel-based object tracking,[10] ensemble tracking,[11] and CAMshift,[12][13] expand on this idea.
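
The simplest tracker described above can be sketched on a toy example. The code below is a simplified illustration under several assumptions: images are small grids of integer color labels standing in for histogram bins, the window is a fixed-size box with uniform weights, and all names (`color_histogram`, `confidence_map`, `track`, `make_frame`) are hypothetical. It is not the method of any of the cited trackers.

```python
import math

def color_histogram(image, box):
    """Normalized histogram of color labels inside box = (row0, col0, height, width)."""
    r0, c0, h, w = box
    counts = {}
    for r in range(r0, r0 + h):
        for c in range(c0, c0 + w):
            counts[image[r][c]] = counts.get(image[r][c], 0) + 1
    return {color: n / (h * w) for color, n in counts.items()}

def confidence_map(image, hist):
    """Each pixel gets the probability of its color under the object histogram."""
    return [[hist.get(color, 0.0) for color in row] for row in image]

def track(conf, box, max_iter=20):
    """Move the box to the confidence-weighted centroid until it stops moving."""
    r0, c0, h, w = box
    rows, cols = len(conf), len(conf[0])
    for _ in range(max_iter):
        total = sum_r = sum_c = 0.0
        for r in range(r0, r0 + h):
            for c in range(c0, c0 + w):
                total += conf[r][c]
                sum_r += conf[r][c] * r
                sum_c += conf[r][c] * c
        if total == 0.0:
            break  # no evidence inside the window; keep the last position
        # Convert the centroid to a top-left corner, rounding half up.
        new_r0 = math.floor(sum_r / total - (h - 1) / 2 + 0.5)
        new_c0 = math.floor(sum_c / total - (w - 1) / 2 + 0.5)
        new_r0 = min(max(new_r0, 0), rows - h)
        new_c0 = min(max(new_c0, 0), cols - w)
        if (new_r0, new_c0) == (r0, c0):
            break
        r0, c0 = new_r0, new_c0
    return (r0, c0, h, w)

def make_frame(top, left, size=8, obj=3):
    """Synthetic frame: color label 1 on an obj x obj square, label 0 elsewhere."""
    return [[1 if top <= r < top + obj and left <= c < left + obj else 0
             for c in range(size)] for r in range(size)]

previous, current = make_frame(1, 1), make_frame(2, 3)
old_box = (1, 1, 3, 3)                        # object location in the previous frame
hist = color_histogram(previous, old_box)     # object color model
conf = confidence_map(current, hist)          # confidence map on the new frame
print(track(conf, old_box))                   # box after mean shift on the new frame
```

The search only works when the old window still overlaps the object in the new frame; if the object moves farther than that between frames, the window sees no evidence and stays where it was.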

Smoothing

Let $x_i$ and $z_i$, $i = 1, \ldots, n$, be the $d$-dimensional input and filtered image pixels in the joint spatial-range
domain. For each pixel,

Initialize $j = 1$ and $y_{i,1} = x_i$.
Compute $y_{i,j+1}$ according to $m(\cdot)$ until convergence, $y = y_{i,c}$.
Assign $z_i = (x_i^s, y_{i,c}^r)$. The superscripts s and r denote the spatial and range components of
a vector, respectively. The assignment specifies that the filtered data at the spatial location
$x_i^s$ will have the range component of the point of convergence $y_{i,c}^r$.
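
The three steps above can be sketched for a 1-D gray-level signal, where the spatial component of a pixel is its index and the range component is its intensity. The sketch assumes flat kernels with separate spatial and range bandwidths, here called `hs` and `hr`; those names, the function name, and the sample signal are illustrative assumptions.

```python
def mean_shift_filter(signal, hs, hr, tol=1e-6, max_iter=100):
    """Filter a 1-D signal in the joint spatial-range domain with flat kernels."""
    filtered = []
    for i, v in enumerate(signal):
        ys, yr = float(i), float(v)  # initialize the iterate at the pixel itself
        for _ in range(max_iter):
            # Pixels close to the current iterate in both position and intensity.
            near = [(j, u) for j, u in enumerate(signal)
                    if abs(j - ys) <= hs and abs(u - yr) <= hr]
            if not near:
                break  # defensive guard: keep the last iterate
            new_s = sum(j for j, _ in near) / len(near)
            new_r = sum(u for _, u in near) / len(near)
            moved = abs(new_s - ys) + abs(new_r - yr)
            ys, yr = new_s, new_r
            if moved < tol:
                break
        # Keep the pixel's own position, take the converged range value.
        filtered.append(yr)
    return filtered

# Made-up noisy step edge: values near 10 on the left, near 50 on the right.
signal = [10, 12, 11, 9, 10, 50, 52, 49, 51, 50]
print(mean_shift_filter(signal, hs=3, hr=10))
```

Because pixels on the other side of the step differ by more than `hr` in intensity, they never enter the window, so the noise on each side is averaged away while the edge itself stays sharp.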

Strengths
1. Mean shift is an application-independent tool suitable for real data analysis.
2. It does not assume any predefined shape for data clusters.
3. It is capable of handling arbitrary feature spaces.
4. The procedure relies on the choice of a single parameter: the bandwidth.
5. The bandwidth (window size) $h$ has a physical meaning, unlike the number of clusters in k-means.

Weaknesses
1. The selection of a window size is not trivial.
2. An inappropriate window size can cause modes to be merged, or generate additional "shallow" modes.
3. It often requires an adaptive window size.

Availability
Variants of the algorithm can be found in machine learning and image processing packages:

ELKI. Java data mining tool with many clustering algorithms.
ImageJ. Image filtering using the mean shift filter.
mlpack. Efficient dual-tree algorithm-based implementation.
OpenCV. Contains a mean-shift implementation via the cvMeanShift method.
Orfeo toolbox. A C++ implementation.
scikit-learn. NumPy/Python implementation that uses a ball tree for efficient lookup of neighboring points.

See also
DBSCAN
OPTICS algorithm
Kernel density estimation (KDE)
Kernel (statistics)

References
1. Cheng, Yizong (August 1995). "Mean Shift, Mode Seeking, and Clustering". IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (8): 790–799. CiteSeerX 10.1.1.510.1222 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.510.1222). doi:10.1109/34.400568 (https://doi.org/10.1109%2F34.400568).
2. Comaniciu, Dorin; Peter Meer (May 2002). "Mean Shift: A Robust Approach Toward Feature Space Analysis". IEEE Transactions on Pattern Analysis and Machine Intelligence. 24 (5): 603–619. CiteSeerX 10.1.1.160.3832 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.160.3832). doi:10.1109/34.1000236 (https://doi.org/10.1109%2F34.1000236). S2CID 691081 (https://api.semanticscholar.org/CorpusID:691081).
3. Fukunaga, Keinosuke; Larry D. Hostetler (January 1975). "The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition". IEEE Transactions on Information Theory. 21 (1): 32–40. doi:10.1109/TIT.1975.1055330 (https://doi.org/10.1109%2FTIT.1975.1055330).
4. Schnell, P. (1964). "Eine Methode zur Auffindung von Gruppen" (https://onlinelibrary.wiley.com/doi/10.1002/bimj.19640060105). Biometrische Zeitschrift (in German). 6 (1): 47–48. doi:10.1002/bimj.19640060105 (https://doi.org/10.1002%2Fbimj.19640060105).
5. Aliyari Ghassabeh, Youness (2015-03-01). "A sufficient condition for the convergence of the mean shift algorithm with Gaussian kernel" (https://doi.org/10.1016%2Fj.jmva.2014.11.009). Journal of Multivariate Analysis. 135: 1–10. doi:10.1016/j.jmva.2014.11.009 (https://doi.org/10.1016%2Fj.jmva.2014.11.009).
6. Aliyari Ghassabeh, Youness (2013-09-01). "On the convergence of the mean shift algorithm in the one-dimensional space". Pattern Recognition Letters. 34 (12): 1423–1427. arXiv:1407.2961 (https://arxiv.org/abs/1407.2961). Bibcode:2013PaReL..34.1423A (https://ui.adsabs.harvard.edu/abs/2013PaReL..34.1423A). doi:10.1016/j.patrec.2013.05.004 (https://doi.org/10.1016%2Fj.patrec.2013.05.004). S2CID 10233475 (https://api.semanticscholar.org/CorpusID:10233475).
7. Li, Xiangru; Hu, Zhanyi; Wu, Fuchao (2007-06-01). "A note on the convergence of the mean shift". Pattern Recognition. 40 (6): 1756–1762. Bibcode:2007PatRe..40.1756L (https://ui.adsabs.harvard.edu/abs/2007PatRe..40.1756L). doi:10.1016/j.patcog.2006.10.016 (https://doi.org/10.1016%2Fj.patcog.2006.10.016).
8. Carreira-Perpinan, Miguel A. (May 2007). "Gaussian Mean-Shift Is an EM Algorithm". IEEE Transactions on Pattern Analysis and Machine Intelligence. 29 (5): 767–776. doi:10.1109/tpami.2007.1057 (https://doi.org/10.1109%2Ftpami.2007.1057). ISSN 0162-8828 (https://www.worldcat.org/issn/0162-8828). PMID 17356198 (https://pubmed.ncbi.nlm.nih.gov/17356198). S2CID 6694308 (https://api.semanticscholar.org/CorpusID:6694308).
9. Richard Szeliski, Computer Vision, Algorithms and Applications, Springer, 2011
10. Comaniciu, Dorin; Visvanathan Ramesh; Peter Meer (May 2003). "Kernel-based Object Tracking". IEEE Transactions on Pattern Analysis and Machine Intelligence. 25 (5): 564–575. CiteSeerX 10.1.1.8.7474 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.8.7474). doi:10.1109/tpami.2003.1195991 (https://doi.org/10.1109%2Ftpami.2003.1195991). S2CID 823678 (https://api.semanticscholar.org/CorpusID:823678).
11. Avidan, Shai (2005). "Ensemble Tracking". 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 2. San Diego, California: IEEE. pp. 494–501. doi:10.1109/CVPR.2005.144 (https://doi.org/10.1109%2FCVPR.2005.144). ISBN 978-0-7695-2372-9. PMID 17170479 (https://pubmed.ncbi.nlm.nih.gov/17170479). S2CID 1638397 (https://api.semanticscholar.org/CorpusID:1638397).
12. Gary Bradski (1998) Computer Vision Face Tracking For Use in a Perceptual User Interface (http://download.intel.com/technology/itj/q21998/pdf/camshift.pdf) Archived (https://web.archive.org/web/20120417121810/http://download.intel.com/technology/itj/q21998/pdf/camshift.pdf) 2012-04-17 at the Wayback Machine, Intel Technology Journal, No. Q2.
13. Emami, Ebrahim (2013). "Online failure detection and correction for CAMShift tracking algorithm". 2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP). Vol. 2. IEEE. pp. 180–183. doi:10.1109/IranianMVIP.2013.6779974 (https://doi.org/10.1109%2FIranianMVIP.2013.6779974). ISBN 978-1-4673-6184-2. S2CID 15864761 (https://api.semanticscholar.org/CorpusID:15864761).

Retrieved from "https://en.wikipedia.org/w/index.php?title=Mean_shift&oldid=1166895929"
