Professional Documents
Culture Documents
Similarity
Robert F. Murphy
Cytometry Development
Workshop 2000
General Multivariate
Dataset
We are given values of p variables
for n independent observations
Construct an n x p matrix M
consisting of vectors X1 through Xn
each of length p
Multivariate Sample
Mean
M(i, j)
I( j) =
i=1
or
matrixnotation
Xi
i=1
I=
vectornotation
Multivariate Variance
( j) =
(M(i, j) I( j))
i=1
n1
matrixnotation
Multivariate Variance
or
2
(Xi I)
i=1
n1
vectornotation
Covariance Matrix
cov( j,k) =
i=1
n1
Covariance Matrix
cov( j, j) = ( j)
Univariate Distance
M(i, j) M(l, j)
Univariate z-score
Distance
M(i, j) M(l, j)
( j)
Bivariate Euclidean
Distance
Multivariate Euclidean
Distance
j=1
showsthe
50%contour
ofa
hypothetical
population.
PointsAandBhavesimilarEuclideandistancesfromthemean,
butpointBisclearlymoredifferentfromthepopulationthan
pointA.
Mahalanobis Distance
-1