Professional Documents
Culture Documents
1.2.1 Sample mean vector and dispersion matrix from a data matrix
In this section you shall learn how to compute the sample statistics of multivariate data
through some examples.
Example 1.1 : Measurements were taken on ten flea beetle Halticus. These are the thorax
length X 1 (in microns) and elytra length X 2 (in 0.01 mm).
Subject
Characteristic 1 2 3 4 5 6 7 8 9 10
Thorax 180 192 217 221 171 192 213 192 170 201
length( X 1 )
Elytra 245 260 276 299 239 262 278 255 244 276
Length( X 2 )
Compute the sample mean vector and the sample dispersion matrix.
Solution: We have
Example 1.2: Eight men each received a certain drug. The recorded changes in blood
sugar and blood pressure(systolic and diastolic) are listed in the table below:
1
Subject
Characteristic 1 2 3 4 5 6 7 8
Blood sugar( X 1 ) 30 90 -10 10 30 60 0 40
Blood -8 7 -2 0 -2 0 -2 1
pressure(systolic-
X2 )
Blood -1 6 4 2 5 3 4 2
pressure(diastolic-
X3 )
Compute the sample mean vector and the sample dispersion matrix.
Solution: We have
Data Matrix
2
n
∑X
1
Xi = ik , i = 1, 2, ..., p
n k =1
⎛ n n
⎞
⎜
1 n
∑ ∑X X ik jk ⎟
the components of S are computed by sij = ⎜ X ik X jk −
n ⎜ k =1
∑ k =1
n
k =1 ⎟,
⎟
⎜⎜ ⎟⎟
⎝ ⎠
X i, j = 1, 2, ..., p
⎛ n n
⎞
⎜
1 ⎜ n
∑ ∑X X ik jk ⎟
sij =
n − 1 ⎜ k =1
∑
X ik X jk − k =1
n
k =1 ⎟ , i, j = 1, 2, ..., p
⎟
⎜⎜ ⎟⎟
⎝ ⎠
The vector X is the sample mean vector while S and Su are called sample variance-
covariance matrices or dispersion matrices. In Example 1.1 p = 2 and n = 10 while in
Example 1.2 p = 3 and n = 8
⎛1 ⎞
⎜s 0 ... 0 ⎟
⎜ 1 ⎟
⎜ 1 ⎟
⎜0 ... 0 ⎟
D − 12
=⎜
s2 ⎟ = diag ⎛⎜ 1 , 1
, ...,
1 ⎞
⎟
⎜ . . ... . ⎟ ⎜ s1 s2 sp ⎟
⎜ ⎟ ⎝ ⎠
⎜ . . ... . ⎟
⎜ 1 ⎟
⎜0 0 ... ⎟
⎜ s p ⎟⎠
⎝
Example 1.3 Compute the sample correlation matrix for the data in Example 1.1
3
Solution:
⎛ 1 0.95 ⎞
R=⎜ ⎟
⎝ 0.95 1 ⎠
1.5 Exercise
X1 3 4 2 6 8 2 5
X2 5 5.5 4 7 10 5 7.5
(a) Calculate the sample means X 1 , X 2 ,the sample variances S11 , S 22 the and
sample covariance S12 .
(b) Use the results of part (i) to form the mean vector X and the sample correlation
matrix ℜ .
(c) Plot the graph of X 1 against X 2
1.5.2. A morning newspaper lists the following used-car prices for a particular model of a
car with age( X 1 ) measured in years and selling price measured in millions(Ksh)
X1 3 5 5 7 7 7 8 9 10 11
X2 2.3 1.9 1.0 0.70 0.30 1.00 1.05 0.45 0.70 0.30
4
(d) Use the quantities computed in part(c) to form the sample mean vector X , the
sample variance-covariance matrix S and the correlation matrix ℜ .
Compute X , S , and ℜ .
1.5.4. The 10 largest industrial corporations in KENAI yield the following data in
millions of dollars
Company X 1 =sales X 2 =profits X 3 =assets
Summary
In this Lesson we have considered data matrix as we give some example of multivariate
data in real life situations. In particular we have:
5
1. Evaluated a sample mean vector and dispersion matrix from a data matrix
2. Evaluated the sample correlation matrix.
References