Professional Documents
Culture Documents
Result 5.2.
is, approximately, a x;-vo random variable. Here the degrees of freedom are v - v0
= ( dimension of e) -
(dimension of 8 o ) · •
Statistical tests are compared on the basis of their power, which is defined as the
curve or surface whose height is P[ test rejects
8. H0 I 8], H0
evaluated at each parameter
vector Power measures the ability of a test to reject when it is not true. In the
8 80 H0
rare situation where = is completely specified under and the alternative H1
8 81
consists of the single specified value = , the likelihood ratio test has the highest
power among all tests with the same significance level a = P[ test rejects = 60] .
H0 I 8
(8
In many single-parameter cases has one component), the likelihood ratio test is uni
formly most powerful against all alternatives to one side of (} = H0: 00 •
In other
cases, this property holds approximately for large samples.
We shall not give the technical details required for discussing the optimal prop
erties of likelihood ratio tests in the multivariate situation. The general import of
these properties, for our purposes, is that they have the highest possible (average)
power when the sample size is large.
To obtain our primary method for making inferences from a sample, we need to ex
tend the concept of a univariate confidence interval to a multivariate confidence re
gion. Let be a vector of unknown population parameters and @ be the set of all
8 8. 8
possible values of A confidence region is a region of likely values. This region
is determined by the data, and for the moment, we shall denote it by R(X) , where
X = [ X , X 2 , . . . , X n ] ' is the data matrix.
1
The region R(X) is said to be a 100 ( 1 a)% confidence region if, before the
-
sample is selected,
P[R(X) will cover the true = 1 a
8] - (5-17)
This probability is calculated under the true, but unknown, value of
IL
The confidence region for the mean of a p-dimensional normal population
8.
IL) S_1 IL) - 1)p
is available from (5-6) . Before the sample is selected,
P n(X -[ _ , _
] (n
( X - < (n p) Fp,n - p (a) = 1 - a
_
squared distance is larger than [p(n - 1 )/(n - p ) ] Fp, n - p (a) , /L o is not in the confi
dence region. Since this is analogous to testing H0 : IL = /L o versus H1 : IL # IL o [see
(5-7)], we see that the confidence region of (5-18) consists of all p0 vectors for which
the T2 -test would not reject H0 in favor of H1 at significance level a.
For p > 4, we cannot graph the joint confidence region for p. However, we can
calculate the axes of the confidence ellipsoid and their relative lengths. These are
determined from the eigenvalues Ai and eigenvectors e i of S. As in (4-7), the direc
tions and lengths of the axes of
x p ) < cz -
p(n - 1 )
( - ) ' s-1 ( -
nX-p -
(n p) Fp n - p
_ , (a)
The ratios of the A/s will help identify relative amounts of elongation along pairs of axes.
x ' S
=
.603203.018 -163..0117391 ] .0146 '
=
JL2 J
- JL2
2(4041) F2,4o( .05)
<
or, since F2 ,40( . 0 5) = 3. 2
JL1 ) 2
3, - JL2 ) 2
42(203.018) ( .564 - 42(2-00.84(228)163.(3.691)03 ( .564 - JLl ) ( .603 - 6.62
+
JL2 ) <
To see whether IL ' = [ . 5 62, . 5 89 J is in the confidence region, we compute
42(203.018) ( .564 - -.562)84(2163.42(391)200.( .526428)-( ..650362)-( .6.50389)-2 .589) = 1.30 6.62
+
<
We conclude that IL '
IL =
[ ..558962] = [. 5 62, .
. . 5 89
would not be reJected In favor of H1 :
J is in the region. Equivalently, a test of H0:
[ ..556289 ]
IL # at the a = . 05
level of significance.
x' = [
The joint confidence ellipsoid is plotted in Figure
.564, .603p], ( n - 1 ) The center is at
and the half-lengths of the major and minor axes are given by
5. 1 .
v'Ar n( n p_ ) Fp, n - p (a) =
2(
Y.026
4 1)
42(40) (3.23) .064 =
and
vx;
pn ((nn - p1))
_ Fp, n - p (a) =
2(42(441)0) (3.23) .018
\!.002 =
2� )pn((nn - p1))
and minor axes. This ratio is
\!A,
-
Fp, n - p (a)
.65
.60
.55
The length of the major axis is 3.6 times the length of the minor axis. •
z = a' x
224 Chapter 5 I nferences about a Mean Vector
and s 2 = a' Sa z
z
t= / --- (5-20)
Sz Vn Va'Sa
and leads to the statement
or Va'Sa Va'Sa
X
a - tn _ 1 ( a/2)
' - Vn < ' < X +
a IL a tn - l ( a/2) Vn (5-21 )
' -
Asativielmyulstmaneous al l f o rconf
all
i d
choiencec es r eofgi o
a. n iIst gi
s ev en
ems by r et h
as e o s e t
nabl of e
2 at 'oIL val
expect u es s
t hu ch
at tthheat t
cons i st r
a e l
nt
toped�_1 (afj2)or manyin (5-2choi2) wicesl beof a.replaced by a larger2 valu2e, c , when statements are devel
Cons
termination of i d er i n g t h e val u es of a f o r whi c h t < c , we ar e nat u r a l y l e d t o t h e de
Section 5.4 Confidence Reg ions and S i m u lta neous Compa r isons of Com ponent Means 225
maxt - max
a
2 _
a
n(a' ( x -
a ' Sa
JL)) 2
Using the maxiJL))m2 izatio[n lemma (2-50)JL))wi2th] , ) , and we get
n(a' (x - (a' (x -
x == a, d == (x - IL B == S,
maxa a' Sa
= nmax a a ' Sa
= n ( x - p, ) S 1 ( x - p, ) =
_ _ _
T2 (5-23)
with the maximum occurring for proportional to . a s-1 ( x - IL )
(X a' -
p(n - 1 )
n(n p) Fp, n - p
_ (a)a' Sa , a'X +
p(n - 1 )
n(n p) Fp,n - p
_
(a)a' Sa
)
wil contain with probability a' IL 1 - a.
Proof. Fr o m ( 5 - 2 3) ,
2T = n(X - p, ) ' S - 1 (X - p, ) < 2c implies a' Sa
) 2 c2
n( a' x - a' IL
<
for every or a,
fa'Sa < a' IL < a' x + fa'Sa
a' x -c -y ---;:;- c -y -;;--n-
fthorateverwil ycontaChoos
a. i n g
a' IL
c 2 == p(n - 1 ) Fp,n - p (a)j(n - p)
in for all with probability P[T2 c2] .
a, 1 - a ==
[ s e e ( 5 - 6 ) ]<
gi v es i n t e r v al s •
2TheT -intsIeutrccesviaslsconveni e nt
,ssivinecechoitheccovert o rae f
ge e rpr t o
o t h
babi e l i s
t i
y mi ul
s t
deta neous
e r m i n i
ed n t e
by r vtal
h es of
di s t Res
r i b u
ut i l
o tn 5.
of3 T as
2 •
a' [0, 0, for these T2-inte[rv,al0,s allow us to concl[0, ude that and so on through
. . . , 1J
a' == 1 . . . , OJ, a' == 1, . . . , O J ,
� �
==
\j�
p(n - 1) p(n - 1)
---;,;- < JL 1 < X1 +
_ _
X1 - (a) , (a)
p) Fp, n - p p) Fp n - p
x2 - � x2 �
_ _
(n (n
_ p(n - 1) (S:;:; p(n - 1)
\j ---;,;- < JL2 < +
_
(a) (a)
p) p, n - p
(n _ F (n _
p) Fp,n - p
0
0
0
0
0
0
-
(5-24)
X P - \j (n p) Fp,n - p (a) \j(S;;
- I p(n - 1 ) I p(n - 1 ) (S;;
PP
----;; P \j (n p) \j n;
----;
_ < JL p < X + Fp,n - p (a) _
( 5 - 2 5)
The s i m ul t a neous T 2 confidence intervals are ideal for "data snooping." The
confbinatidioencens ofcoefthefcomponent
icient 1 - as remaithatnmers unchanged
JLi i t i n s p ect f
i o
o r
n any choi c e of a,
based upon an examination of
s o l i n ear com
the data can be es t i m at e d.
mentsInaboutaddition, accorbeldionngig[toS·ntg· hteoS·rtehs]ue-lstlas[mplinX·Supple-mean-ementcenterewed el1)canlipsiensclude the state
( JLi , JLk )
SA,
n[ xi - JLi , xk - JLk ] ll k
sik skk
l
_
lJLl·
xk - i-Lk J <
p(n -
n-P
Fp n-p(a) (5-26)
'
and stTheil maisimntulaitnaneous the confT2idconfenceidcoefenceficinietnterv(al1 s-foa)r tfhoer itnhdie vwholiduale scomponent et of statements of s. a
=
fN44
+
2(41) 3.23 \j�) or ( .516, .612)
(N44
= +
I n Fi g ur e 5. 2 , we have
5.el3li.pThese on95%thesaxesimultofaneousthe component r e dr a wn
intervals armeans t h e 95% conf i d ence el l i
e shown. as shadows, or projections, of thisp s e f r o m Exampl e
II
---------,- - - - - - - - - - - - - - - - - -
I I
I
�-
---�--�� pl
.516 .612
x == S ==
3, 48
p
n _
P = _
= =
and we obtain the simult�aneous confi d ence s t a t
5691.87 34 527.74 �5691.87 34 e ment s [ s e e ( 5 - 2 4) ]
527. 7 4 - v'8.29 < f.L l < + v'8.29
or 504.45 551.03 �126.05 < I-L l <
_
+
_
_
± _ ·
X <
Anthe alcomponent
ternative apprs J.Liooneach tato thaetconsime,trasuctsiuoggesn oftconfed byidence(5-2i1)ntewirvtalhs is to cons[0, . i.d, er0, a'
0,p variables andwherleeads to t1.heThiintserapprvals oach ignores the covariance structure of the
ai , . . . , OJ ai =
=
�
V)
11 3
r-
N
0
V)
111
\£')
500 522 544 N
JLi ,
Alwe dothnotoughknowpriowhatr to sampl t o as s i
e n
r g,
t , i t
n h e i
genert h i
a n
l ,t e rv
about al has
t h e pr
pr o
o babi
babi lliittyy
1-a
of
all i nof
t e rcover
v al s i n
con g
taininTog thseihedr resospmeectliivgeht on tAshe prweoblhaveem,poiconsntiedderoutth,ethspisecipraolbabicaseliwherty is note the obser
JL/S. 1 - a.
vations have a joint normal distribution and0 0 lTl l
I=
0
0 0
Sivarnceiablthee, andobsseorvon,atiotnshe onprotducthe firrsutlvare foirablinedependent
are independent of t h os e on t h e s e cond
fore tP[healsalmplt-inteeirsvsalesleictn e5-d.27) contain the events can be applied, and
( JL/S J ( 1 - a) ( 1 - a) · · · ( 1 - a)
be
( 1 - a) P
If .95 and 6, this probability is .95)6 .74.
1 -a= p = ( =
-
Section 5.4 Confidence Reg ions and S i m u lta neous Com pa risons of Com ponent Means 231
To guar a nt e e a pr o babi l i t y of 1 a t h at al l of t h e s t a t e ment s about t h e compo
nentrate tmeans hol d s i m ul t a neous
-Forinter1val-s; just .how95, nmuch15,wiandder depends
a == ==
ly , t h e i n
p
di v
==
i d
4,
ual
t hone mulbothtipplianders n,ofas well asinon(51-2-4) and
i n t e r v al s mus t be
�
wi d er t h an t h e s
a.
e pa
(5-27) are )p( n - 1)
(n p _ ) Fp, n - p ( . O S) =
4( 11
1 4) (3.36) 4.14 ==
andtervals ar( .e025)100(4.2.14145,- 2.re1s45)pect/2.iv1el45y. Cons93%equent
tn - l ==
== wi d er l yt ,
h i n
an t h
t i
h soscas e e
der t h i e
v s
ed i mf ul
r o tma neous
t h e onei n
at-a-tiTablme temet5.3 hgiod.ves some critical distance multipliers for one-at-a-time 2t-intervals
comput e d accor d i n g t o ( 5 - 2 1) , as wel l as t h e cor r e s p ondi
Innasgenerp inacrl,etasheeswianddthdecrof theeasTes-ifnotrerfvixaleds, rpelasatinveintcro ethaseets-.intervals, increases for fixed
2 n g s i m ul t a neous T - i n t e r v al s .
TABLE 5.3 CRITICAL DISTANCE M U LTI PLI ERS FOR O N E-AT-A-TI M E
t- I NTERVALS A N D T2- I NTE RVALS FOR S E LECTED n AN D p ( 1 - a == .95 )
n tn - 1 ( .025) p == 4 p 10 ==
IsOfunltttehn,eseatstIeiftntuthatieoinonumber
5.3.
nsis riet sitsrposicteofsdibtslopeeciatosmfdoiealdlbetcomponent
m
number
ter thanoftihnmeans die sviimdualultaconfneousidenceintersvtalatsement
JLi or l i n ear combi n of
at iRe
o ns
s.
a' IL = a 1 JL 1 + a 2 JL2 + ··· + a p JLp i s s m al l , s i m ul t a neous conf i d ence 2 i n t e r v al s can be
tidevelterisnatdeveloivpede metotpedhathodfarroefmoshraormulprteortbabiip(mleorcompar
l i
e preciisseo)nsthians calthleedsimthule taneous T -intervals.becaus
t y i n equal i t y car r y i n g Bonferroni method,
t h at name.
The ale
Suppos e t h at , pr i o r t o t h e col l e ct i o n of dat a , conf i d ence s t a t e ment s about m
== 1 - a
m terms
Thermakeeftohreef, owil othwianngoveral confstateiment
m == p
dences:level greater than or equal to we can 1 - a,
5 29 )
(-
Section 5.4 Confidence Reg ions and S i m u ltaneous Com pa risons of Component Means 233
The s t a t e ment s i n (5-29) can be compar e d wi
poiarenoft tnth_e1 (saj2amep)srtreuplctaucesre.v'(n - 1 )pFp,n-p(a)j(n - p) , but otherwise the intervals t h t h os e i n The per c ent
(5-24). a ge
Example 5 . 6 (Co nstructi ng Bonferro ni simu ltaneous confidence i nterva ls
and co mparing them with T2 -i nterva ls)
Letshalusl obtretauirnntthoethseimmiulctraoneous wave oven rBonf adiatieornodatni aconfin Exampl
95% i d ence e isn t e r and
v al s f o r We
t h 5.3
e 5.4.
snotureinmentg tJLh1ats winth=JLai2 =and = We= make use of= the resutoltsgetin Example
means , and , of
42
t h e f o ur
.05/2, i 1 , 2.
t h
t4 1 ( .05/2 ( 2 ) )
r o ot s of t h e door - c l o s e
t4 1 ( .0125 )
d and door - o pen
2.327,
mea 5.3,
:X1 ± t4 1 ( .0125 ) Iff = .564 ± 2.327 -J¥ or .521 < I-L l < .607
x 2 ( =2
- ± t4 1 .0125 ) \jrs;
---;:;- .603 ± 2.327 \jfN46
� or JL2 .560 < < .646
FiFiggururee shalowsong twiheth theTcorsirmesulponditaneousng confBonf
5.4
5.2,
95% idenceerinotniervin2altesrfvoalr sJL. 1 For, JL2 feachrom
95%
component mean, t h e Bonf e r o ni i n t e r v al f a l s wi t h i n t h e T - i n t e r v al . Cons e
iquent
sintconterelsyat,ientdhedonle rineycttihnaengulthreectcomponent
aarngulUoinart)rreegigioonnfoformrmededbybyththeetwtwooTBonf
means, the Bonfer oni intervals provide more 2 - i n t e errvalonis. Iinftweervarales
.651 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - -
.646
- - - - - - -
- - .....,
I
Bonferroni
.56 0 .....
. 555 -
_ _ _ _ _ _ _ _ _
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - r-
516 . 07 .612
pl
. .521 6
�- -���,----�--�,-
We s e e f r o m
valfidences wheninmterv=alp.s needed t h e Tabl e 5.4 t h at t h e Bonf e r o ni met h od pr
Becausfeotrhieynfearreence,easyweto applwil yofandtenprapploviydesitmheulrtealneous o vi d es s h or t e r i n t
ativelyts-hinortetrconvals e r
based on the Bonfer oni method.
5.5 LARG E SAM PLE I N FERENCES ABOUT A POPU LATI ON M EAN VECTOR
P[ 1L ) 1L ) <
=
_:_
( 5 31)
_
(5-7),
Compar
we s e e t i
h n
atg t
t h
h e
e t
t e
e s
s t
t si tna Res
t i s t i u
c sl t 5.
have 4 wit ht h
e t h
same e cors t rrue s
ctpuondi
r e , butn g tnormal
h e c r i t i t
c h
a eory
l val u t e
es s t
ari n
e
disafmeererntesu. lAt inclosisteuratexami n at i o n, however 2 , r e veal s t h at bot h t e s t s yi e l d es s e nt i a l y t h e
ectly ffroormn tlharegfearctieolnstahtatiwherve( ntoe-p.th1e)p(SxeeFp-,tTablne-sp(t ofaes)Resj(andn u-lt p)inanditsheapprx�appendi
dilyrequal 3
5.4
4
o(apr) iaratee.apprThiosxifoml atowse
x.)
Result 5.5. Let X 1 , X 2 ,
mean IL and positive definite covariance I. If n - p is large, ... , X n be a r a ndom s a mpl e f r o m a popul a t i o n wi t h
a' X ± � \jfa'S8
-----;;---
and,ellipsiens addition, for all pairs = the sample mean-centered ( JLi , JLk ) , i, k 1, 2, . . . , p,
n[ xi - JLi , xk - I-Lk ]
sii sik
sik skk
[ J -1 [ - J xi - JL i
xk - JLk
_
<
2
Xp (a) contain •
( JLi , JLk )
x2
l = harmony 28.1
26.6
5.76
5.85
x3x4 ==mettempoer 35.4
34.2
3.82
5.12
Xx56 == phrbalaancesing 23.6
22.0
3.76
3.93
x7 = style
Source: Data courtesy of V. Sell.
22.7 4.03
Section 5 . 5 Large Sa m p l e Inferences about a Pop u l ation Mean Vector 237
Let
meanFrcomponent us cons t r
s u ct 90% s i m ul t a
JLi , i = 1, 2, . . . , 7.
neous conf i d ence i n t e r v al s f o r t h e i n di v i d ual
X; ±
ov' m Res�'ult simultaneouswhere confidence limitsThusare, giwivtenh ap-by
x� ( .lO)
5.5,
i = 1 , 2, . . . , 7,
90%
x� ( .10 ) = 12.02 .
proximately confidence, 90%
28.1 ± VT2.02
96
� contains or JL 1 26 . 06 < i-L l < 30.14
5.85
12.02 V%
26.6 ± VT2.02 contains or JL2 24.53 < JL2 < 28.67
VT2.02 3.82
35.4 ± 12.02 V% contains or JL3 34.05 < JL3 < 36.75
VT2.02 5.12
34.2 ± 12.02 V% contains or JL4 32.39 < JL4 < 36.01
VT2.02 3.76
23.6 ± 12.02 V% contains or JLs 22.27 < JLs < 24.93
VT2.02 3.93
22.0 ± 12.02 V% contains or JL6 20.61 < JL6 < 23.39
4.03
22.7 ± VT2.02
12.02 V% contains or JL7 21.27 < JL7 < 24.13
When t
vidual means are (a) {%h e s a mpl e s i z e i s l a r g e, t h e one- a t - a - t i m e conf i d ence i n t e r v al s f o r i n di
X; z --::
\j < f.L; < X; + z
(a) {%--:: \j i 1 , 2, . . . ' p
a a
=
-
2 2
wher e z( /2 ) i s t h e upper
TheindivBonfidualermeansoni simtaulketatneous 100( /2
c o nf i )
d t h
ence per i c
n ent
t e r vi l
ale soff o t
r hteh s
e t a ndar
m = p
d nor
s t a t m
e al
ment di s
s t r i
about b ut i o
t n.
h e
( a ) {%--:: ( a ) {%--::
h e s a me f o r m , but us e t h e modi fi e d per c ent i l e z( aj2p ) t o gi v e
X; - z
2p
\j < f.L; < X; + z
2 p \j i = 1 , 2, . . . ' p
X5
x4 == phrmetaesring 33.18
22.85
35.22
24.35
32.79
22.57
35.61
24.63
32.24
22.16
36.16
25.04
xx6 == balstylaence
7
21.21
21.89
22.79
23.51
20.92
21.59
23.08
23.81
20.50
21.16
23.50
24.24
Tovariimatprion.oveWhenthe quala manuf ity of goods and s e r v i c es , dat a need t o be exami n ed f o r caus e s of
wepabiarlietiemonis andtosrtianbig lactityivofititaehctseofuprriaonscesgeprrvs.oiccesWhen e, dats isaacontshprouloincesduousbes iscollystpraleblctoeducie,dthteongvarevaliteiamsutatioenortihswhenecapro
duced by common caus e s t h at ar
of varTheiatiopurn. pose of any control chart is to identify occurrences of of e al w ays pr e s e nt , and no one caus e i s a ma j or s o ur c e
special causes
varindiiacattioenathneedat comefor farotmimoutelysridepaie ofr,thbute ustuheyal prcanocesalss.oThessuggese caust imeprs ofovement variatiosntooftthene
monprocesfArso. mcontContspreciorloacharllcharcaust tteysspimakeofcalvarlythiconsaetivaron.isitastofiondatvisaibplleoandt edalinlotwimonee ortdoerdiandstinguihorshizontcomal
lesin.esOne, callusedefcontul control rlioml charits, thtatis itnhdie cat-ceharthet amount
X ( r e ad X-
of
bar var chari a t i o
t n
.
)
due
To cr t o
e atcommon
e an X - c caus
har t,
1.
2.
PlCroetattheeandindiplviodtualtheobscenterevratlinioens orthseasmplample means x, e mean i nof t i m
al leofor t dh er
e .
obs e r v at i o ns .
3. CalculaUpper te and plcontot rtohlelcontimit roUCLl limits given by3 standard deviation
( ) = x + ( )
Lower cont r o l l i m i t ( LCL ) = x - 3 s( t a ndar d devi a t i o n )
Thethe obsstaendarrvatdiodevins beiatinognplinottheed.contForroslinliglmeitobss isetrhveatesiotnsim, atit eisdofsttaendarn thed sdeviamplateiosntanof
dardevidadevitionaistitohn.e sIafmpltheemeans of s u bs a mpl e s of s i z e m arVm .
e pl o t e d, t h en t h e s t a ndar d
andassumiminngusnorthrmeealsltyandardistsrtdiabndardeviutedadtdatdeviionsa,aaroftioefnachosldisevlyiednsedisgonalbythiatngthanerTheeoutis -contaoverf-crontoylslmrimoallitlobsschance, of plus
e r v a
tion-that is, an observation suggesting a special cause of variation.
Example 5.8 (Creating a u nivariate co ntro l chart)
ThetivitiMadi s o n, Wi s c ons i n , pol i c e depar t m ent r e gul a r l y moni t o r s many of i t s ac
dattotaaleonfsoasrf1ivpar2epaydit fofpereanrentiongoi
kiods,nordsngaboutofqualoverihaltytiimmf apreyearhourovement s. . Eacprhogrobsam.ervTablatione r5.e8prgievseesnttshea
We exami n e t h e s t a bi l i t y of t h e l e gal appear a nces
puttheesamer calculasationAlgivsoes, the sa3558.mple sSitanndarce inddidevividualatiovaln uises wil be 607,plot anded, thies
x1 =
over t i m e hour s . A com
x1
x1. � =
LCL = x1 - 3 (�) =
Thein Fidatgurae, 5.al5ongonwipageth th240.e centerline and control limits, are plot ed as an -chart 3558 - 3 607
( ) = 1737 X
240 Chapter 5 I nferences about a Mean Vector
5500
Legal Appearances Overtime Hours
UCL 5379 =
4500
�
::3
ca
>
::3
x1
ca 3500
"C1 = 3558
·>
�
�
2500
LCL = 1737
1 500
0 5 10 15
Observation Number
Figure 5.5 The X -ch a rt for x1 = legal a ppea ra nces ove rtime h o u rs.
Section 5.6 M u ltivariate Q u a l ity Control Charts 241
1, .
We assume that X X2, _!_ , Xn are i_!_ndependently_!_distributed_!_as By_!_Result 4.8, Np ( JL , I ) .
X. - X ( - ) X. - X - - X . - X. - - X
. .
n
1
1 1 n 1 n 1- 1 n 1+
1
=
n n
· · · · · ·
has
and
Cov ( Xj - X) ( - ) _
= 1
1
n I
2 + ( n - 1 )n -2 I =
(n
n
-1)
I
Each Xj - X has a normal distribution but, Xj - X is not independent of the sam
pl(Xje covar i a nce
1 mat r i x S. However t o set control limits we approximate that
- X ) ' S- (Xj - X ) has a chi-square distribution.
Ellipse Format Chart. The el l i p s e f o r m at char t f o r a
ichars theactmoreriestiinctsuonitivteheofjtthheunichart artse, butplotitsedappras aopaiachris(xjlim1 ,ixjted2).toThetwo variaqualblesi.tyTheellitpwsoe bi v ar i a t e cont r o l r e gi o n
95%
consists of all x that satisfy(x - x)'S-1 (x - x) < x�( .05 ) (5-32)
Example 5.9 {An e l l i pse format chart for overti me hou rs)
Letcharusactreerfiesrtitcos Exampl e 5.8 and cr e at e a qual i t y el l i p s e f o r t h e pai r of over t i m e
culation gives (legal appearances, extraordinary event) hours. A computer cal
242 Chapter 5 I nfe rences a bout a Mean Vector
x- = [ 3558
1478
] and = [ S
]
367,884.7 -72,093.8
-72,093.8 1,399,053.1
We i l u s t r a t e
consists of all that satisfy
x
t h e qual i t y el l i p s e f o r m at char t us i n g t h e el l
99%i p s e , whi c h
(x - x ) ' S - 1 (x - x ) x� ( .01 ) <
S1 1
----
S1 1S22 S22
= ( 367844.7 X 1399053.1 )
367844.7 X 1399053.1 - ( -72093.8) 2
X
( (x 1 - 3558 ) 2
367844.7
- 2( - 72093 · 8 )
(x 1 - 3558) ( x2 - 1478) (x2 - 1478) 2
367844.7 X 1399053.1
+
)
1399053.1
< 9.Zl
This ellipse format chart is graphed, along with the pairs of data, in Figure 5.6.
• -+-
0
0
0
V)
0
0
0
s
�
•.:;j
�
� •
0
> •
•
•
•
=:
•
� +.
j:.Ll
>
0 •
0
�
0 • • •
-
•
�
=:
•
• •
�
0
.b
� 0
j:.Ll
K
0
0
0
1500 2500 3500 4500 5500 Figure 5.6 The q u a l ity control 99%
e l l i pse fo r legal a ppea ra nces a n d
Appearances Overtime extraord i n a ry event ove rtime.
6000
4000
�
::3
3000
ca
> 2000
::3
ca
"'0
· s; 1 000
�
�
�
0
- 1000
- 2000 LCL = - 207 1
- 3000
0 5 10 15
Observation Number
Was t h er e a s p eci
ttihmeeUnithatteids outStatseidsebombed a l caus e of t h e s i n gl e poi n t f o r ext r a or
the uppera fcontoreirgonl lcapiimittainl,FiandgursetudentDurs at iMadi
5.7?
d i n ar y event
ng thissopern wereiod,over
prweekotesperting.iod.A Almathjoough,rity ofbythites extverryaordefdiinniartioyn,overexttriamoredwasinaryusovered intimtheatoccurfours
onltainyswhentabilitsyp. ecial events occur and is therefore unpredictable, it stil has a cer •
cise 5.e225.6. 9We. take = .01 to be consistent with the ellipse format chart
iExern Exampl a
The
orin dExampl T - c har t i n Fi g ur e 5. 8 r e veal s t h at t h e pai r ( l e gal appear a nces , ext r a
inary evente 5.9),hourconfsirfmors perthatiotdhi1s iiss dueout ofto contthe lraorlg. eFurvaltuheerofinvesexttriagoratdioinn,arasy
12
event overtime during that period. II
10
N 6
N
•
2
•
•
•
0
0 2 4 6 8 10 12 14 16
Period
Figure 5.8 The T2 -ch a rt for legal appea ra n ces h o u rs and ext raord i n a ry eve nt hou rs, a = . 0 1 .
When t h e mul t i v ar i
sBonfhouledrbeonidetineterrmvialnsediswhifrequent a nt e T 2 - c har t s i g nal s t h at t h e j t h uni t i s out of cont r o
ch varlyiachosbles earneforerstphonsis puriblpeos. e. modiThe ktfiehdvarregiiaoblnebasis outed
A
l
on
,
it
of
xk
control if does not ltine i1n( .t0h05/e inp)terval tn 1 ( .005/p)
1
( xk -
- vs;:;; ' xk +
- YS;;;; )
where p is the total number of measured variables.
Exa m p l e 5 . 1 1 {Contro l of roboti c welders-more than T2 needed)
Thetubeasyokessembltoy aoftuabe.drivTheeshafintputfors anto tauthe oautmobiomatle eredquiwelredsintghemachicirclenweles musdingt of
be
contof goodrol equald toitbey. wiIntorhindercerttoacontin operrolatthinegprliomcesitsswher, oneepraomachi
ces s ne prneeroducesmeasweluredds
engi
four critical variables: X1 =Voltage (volts)
XX32 == FeedCurresntpeed((amps)in/min)
X 4 = ( i n er t ) Gas
Table 5.9 gives the values of these variables at 5-second intervals. fl o w ( c fm )
Section 5.6 M u ltivariate Q u a l ity Control Charts 245
12- •
•
� • •
('l 8-
• •
• •
6- • •
• • •
••
• • •• •
•
4-
• • • •
••
• • • •
2-
• • •• •
•
•
••
I I I I I
0-
4.05
•
4.00
0
,.-.
••
�
� •
• . a• a ••• a
1/.l
�
3 . 95
• •• a. • • • ••
OJ)
s::=
._.,
3 . 90
3 . 85
1
UCL = 4. 005
4.00
1:>< 3 . 95 Mean = 3 . 95 1
3 . 90 LCL = 3 . 896
0 10 20 30 40
Figure 5 . 1 1 The u n ivariate X-c h a rt
Case for l n (gas flow) .
I n t h i s exampl e , a s h i f t i n a s i n gl
almost masked (with 95% limits), by being combined into a single T -value. e var i a bl e was mas k ed by 99% 2 l i m i t s ,•
or
Contro l Reg ions for Future I nd ividual Observations
aproces
futures obsis setravblatei,owen is expect
t a ke t e
h ed t o
obsl i e
e rivs cal
at i l
o e
nsd atforecast,
o be i n or prediction, region.
dependent l y di s t r i b ut I
e fd t h
ase
iNtop(ringIqual) . Becaus
JL,
ity, we egitvheestheerebasgioinsc diarsteriofbutmorionetgener heoryaasl imResporutltance5.6. than just for mon
Result 5.6. Let X 1 , X 2 , . . , X
let X be a2future observation f,rom1 the same distribution. Then- ) n be i n dependent l y di s t r i b ut e d as Np ( JL, I ) , and
n - - (n 1 p
T = (X -X) s- (X - X) is distributed as n-p
n+ 1 n - p FP'
and a ( - a)% p-dim-ensio1nal pre-diction ellipsoid is given by all x satisfying
100 1
(n 2 - 1 )p
X- X
( ) ' s- ( ) X- X p, n - p a )
(
n(n p) F
<
_
elthleipeisoNotgienvect
d. eItthiatsorcentsthofeeprreeddiSiatnctceitohne rinegiitioanl sianmplReseumean
S.
lt 5.6 foranda futitusreaxesobsarereveddetvalermueinxieds byan
x,
1 )p
-1 - ( 2
J 1
[ -'
P (X - X ) S (X - X ) < (
nn n- _
p ) p,n - p
F (a) = -a
beftionorelelianypse newis 1 observations are taken, the probability that wil fal in the predic
- a.
X
Keep i n mi n d t h at t h e cur r e nt obs e r v at i o ns mus
used tBaso detederonmiResne contult 5.ro6l, weregiobtonsaifnorthfuettuwreo obscharertvs atfoirofnsut. ure observations. t be s t a bl e bef o r e t h ey c a n be
Co ntrol E l l i pse for Futu re Observations
p
With 2, the 95% prediction ellipse in Resn2ul-t 5.16)2specializes to
=
, _1
_ (
_
(x - x ) (x - x) n n 2)
S < ( _
F2 ,n - 2 ( .05 ) (5-34)
Anytrol elfulitpusree. observation xis declared to be out of control if it fal s out of the con
Example 5 . 1 2 {A Co ntro l e l l i pse for futu re overti me hours)
Ieventn Examploverteim5.9e,hourwe checked s . Let ' s t h
us e
e sttha bi
es l
e i t y
dat ofa l
t e
o gal
det appear
e r m i n a
e nces
a cont and r o l ext
r e r
gi a
o or
n df i
o n
r ar
fuy
ture paiFrrosmofExampl values.e 5.9 and Figure 5.6, we find that the pair of values for pe
reliolidps11e. werAlleofoutthofe poicontntrsoarl. eWethenremoved i n cont t
r h
o i
l s
, poi
s o tnhtey and can det s e
e rrvme itnoeddett h e
e r new
m i n e99%
t h e
p
Fi95%gurpreAny5.edi12cftualiotounngrreeobswigiothnerjtvhuatestiiondefnitifianal15ledinfgsotiranblteheobs2.eleliThirpvsatesiiocontsnsre. garroldeledlipasse sitsasblhowne or iinn
=
0
0
0
�
•
0
0 •
0 •
N
� •
•
•
a
�
>
'€
0
� •
> 0
� -+-
� 0
0 •
>. • •
......
�
�
•
0 0
�
C\S 0 •
;....
l:i lr) •
>< •
�
0
0
lr)
I
1 500 2500 3500 4500 5500 Figure 5 . 1 2 The 95% co ntrol e l l i pse
for future legal a ppeara nces a n d
Appearances Overtime ext raord i n a ry event overtime.
where 1 �
X j=1 xj
=
= -
n
£.J
scommon As wi
amples ccovar l be des
an beicombi c rni b
ed edt oi n
giSect
v e a isoi n
n 6.
gl e4 ,
es tthi e
m s
ataempl
( c ale l covar
e d i a nces
S pooled in f r
Chapt o m et rh e
6)n
ofs u
t hb
e
ance This pooled estimate is
I.
1
S ( S 1 S2 S n )
= -
n
+ + ··· +
Her
Furof frtheedom. e( nm
( nm - n
- n ) S i s i n dependent of each
er, Notic)Se itshdiatswetribaruteedesastima Wiatinshgart irnatndom X j and, t
math err e
i xf o r
wi et,hof t
nm - n
h ei r mean
degr X.
e es
anylargeginumber
ven periofod.degrTheseese ofestfirmeedom. I e r n al l y f r o m
ators arConse combiequentnedlyt,o give a single estimator with t h e dat a col l e ct e d in
a
(5-35)
is distributed as (nm - n)p
( nm - n - p + 1 )
Fp, nm - n - p+1
multivariate observations, theInellanipsanale foormgousat charfashtiofonrtpaio ourrs ofdisscuusbssaiompln one means
Ellipse Format Chart.
, _1 ( n - 1 ) ( m - 1 )2
individiuals
( )S ( )
_
x-x
= _
x-x
=
<
m ( nm - n - 1 ) F2, nm - n- 1 ( .05) (5-36)
althoughSubsthample rigehts -corhandresspiondide isnusg utoalpoily apprnts outoximsidateeofd asthe�cont.05)rjol .ellipse should
x( m
be
carmeasefuulred.y checked f o r changes i n t h e behavi o r
The interested reader i2s refer ed to for additional discussion. of t
[9]
h e qual i t y char a ct e r i s t i c s bei n g
we plot the quantToity constrTyuct a T -Xcharj -tXwi)t'hS-s1u(bsXjampl- Xe)data and characteristics,
T2-Chart.
= m(
p
for j = 1 , 2, . . . , n,
where the (n - 1 ) (m - 1 p ) Fp, nm- n- p+1 ( .05)
UCL =
( nm - n - p + 1 )
The UCLValuiess ofofteTjn apprthat oexceedximatetdheasUCL.0cor5) when
x�(
r e s p ond inst l
o a r
potg e.e nt i a l y out - o f - c ont r o lor
special cause variation, which should be checked. (See [9] .)
Section 5.6 M u ltivariate Q u a l ity Control Charts 251
Contro l Reg ions for Future Subsample Observations
is distributed as ( nm - n)p
( nm - n - p + 1 ) Fp, nm -n - p +l
Control Ellipse for Future Subsample Means.
ture subsample mean for = characteristics is defined by the set of all such that
p 2
The pr e di c t i o n el l i p
x
s e f o r a f u
, _1 _ (n + 1 ) (m - 1 ) 2
(x - x) S (x - x) F2, nm - n- 1 ( .05 ) (5-37)
_ = =
<
m ( nm - n - 1 )
where, again, the right-hand side is usually approximated as .05 . x�( ) /m
causaOfretesepn,ondent
ofsoamebrcomponent
etakdown i nstofhearvectecorodrinobsg equiervatpmention arore unavai becaus laeblofe.thThie unwis mayl inoccurgnessbe of
e quent
tsiictualattieocnshniwherquesehavedata beenare midevels ingoatpedrandom-t i n fe r e nces may be s e r i
for thesehcasat ies,scas. Howevero us l y bi a s e d.
, we To ardat
e able , no
e t o s t
t a
r t
e i s
at
anism regenerA
sponsiablleapprfor tohache mifsorincomput g valuesinisg maxiinflmuumencedlikebyels iinhthoodwhie valchesutethiofme chance
not tatheesvarfromech
iambliens.
complete data isconsgivenistsbyofDemps t e r , Lai r d , and Rubi n [ 5 ] . Thei r t e chni q ue, cal l e d t h e
the and Givanen isstoteemerpsati:vesetcalimatculeatioofn itnhvole unknown
EM algorithm,
prediction estimation
ving two steps. We call them
tshtaeticontsticsr. ibution of any mis ing observation to the (complparetea-dmetatae)rssu, prf iceidientct
1. Prediction step.
r--..1
(J
2. Estimation step. Us e t h e pr e di c t e d s u f i c i e nt
tTheimatcale ofcultahteioparn cyclametesefrrso.m one step to the other, until the revised estimates do s t a t i s t i c s t o comput e a r e vi s e d es
not diWhen f er apprtheeciobsableyrvfatroimonsthXe 1es, tXim2,at...e,obtXnaarineeda irnandom the presviamplouseitferroamtioan.p-variate
nordatamsalufpopul a t i o n, t h e
icient statistics [see (4-21)] n pr e di c t i o n-es t i m at i o n al g or i t h m i s bas e d on t h e compl e t e
T1 j�=l Xj
= = nX
and n
T2 j�=l XjXj
= = (n - l)S + nXX '
Iandn thvaris casiance-JL e, the algandorithmreprspoectceedsivelyas-arfoleounknown
I,
ws: We assandumemusthatt betheespopultimataetid.on mean
Prediction step. For each vect o r xj wi t h mi s i n g val u es , l e t x ) 1) denote the
mixj' s- inxjg(1component s and x ) 2 ) denote those components which are available. Thus,
- [ 1) , xj(2)' ] .
Gi v en es t i m at e ji
s and I f r o m t h e es t i m at i o n s t e p, us e t h e mean of t h e condi
( l
tional normal distribution of x ), given x ), to estimate the mis ing values. That ( 2 is, 1