Professional Documents
Culture Documents
GROUP 3 MEMBERS
S/N NAME REG.NO
1 GHALI A NUHU UG14/STAT/1080
2 FAIZ AHMAD A UG14/STAT/1004
3 ALIYU YAHAYA UG14/STAT/1094
4 KAMAL ALHAJI YARO UG12/STAT/1032
5 BASHIR SUNUSI UG15/STAT/2006
6 SAGIR ABDURRASHID UG15/STAT/2013
7 JA’AFAR ABUBAKAR UG14/STAT/1024
8 IBRAHIM USMAN UG14/STAT/1032
9 KAMALUDDEEN BELLO UG15/STAT/2010
10 IBRAHIM ABDULLAHI UG14/STAT/1041
11 USMAN ADAM YERO UG14/STAT/1067
12 SHAMSUDDEEN IBRAHIM MUSTAPHA UG15/STAT/2018
13 LUCKY ANNICHE UG11/STAT/1275
14 NAJEEB SABO WADA UG15/STAT/2012
15 ABBAS RABIU UG15/STAT/2008
16 BASHIR MAGAJI UG14/STAT/1021
17 ANAS SHUAIBU ADAMU UG13/STAT/1074
18 UMAR BELLO MUSTAPHA UG13/STAT/1002
19 LUKMAN AHMED UG14/STAT/1109
20 SULAIMAN LAWAN UG14/STAT/1105
21 JUNAIDU SALE UG15/STAT/2015
22 CHIKA VIVEIN EBERE UG13/STAT/1004
6
INTRODUCTION
MULTIVARIATE STATISTICS is a subdivision of statistics encompassing the simultaneous
observation and analyses of more than one outcome variable, its application is multivariate
analyses
HYPOTHESIS TESTING
Is a statistical method that’s is used in making statistical decision using experimental
data, hypothesis testing is basically on assumption that we make about the population
parameter, its sometimes called confirmatory data analyses, is a hypothesis that’s testable on
the basic of observing a process that is modeled via a set of random variable.
R-PACKAGE
R is an extensible, powerful and wide, used scripting language and environment for
graphics and statistical data analysis. The R-language was developed at bell laborites in 1970s by
john chambers and colleagues, the source code of R is initially writing in Fortran, C and R with
growing population in recent years, R is gradually becoming the leading language in statistics
METHODOLOGY
The given data were analyzed with the R-Package using appropriate test statistic,
Hoteling’s T2 distribution.
HOTELLING T2 DISTRIBUTION
the distribution of a set of statistics which are natural generalization of the statistics underlying
THE DISTRIBUTION OF T2
between the multivariate means of different population, where test for univariate problems
6
would make used of T-test, the distribution is named for HAROLD HOTTELING, who developed
DEFINITION
If the d vector with (px1) dimension follows a normal distribution i.e. NP(0, I) and M is
pxp symmetric matrix with m degrees of freedom, with a wishart distribution i.e. W P(IPXP, M)
m−p+ 1
X~T2P,M then X~Fp,m-p+1
pm
Let NP(µ, ∑) be an p-variate normal distribution mean µ and known covariance matrix ∑,
let X1,X2,….,Xn ~ NP(µ, ∑) be an independent and identical distributed random variables, which
may be represented as px1 column vectors. With mean X́ , and covariance matrix ∑ it can
show that
DEFINITION
The covariance matrix ∑ used above is often unknown; here we used instead the sample
covariance,
6
It can be shown that ∑=(n-1)Sn which it follows a wishart distribution with (n-1) degree of
freedom,
T2= ( X́ −µ )ISn-1 ( X́ −µ )
p ( n−p )
T2~ T2p,n-1= Fp,n-p
n− p
Where Fp,n-p is the F distribution in order to calculate p-value, note that the distribution
n− p
T2 ~Fp,n-p
p ( n−p )
Then used the quantity of the left hand side to evaluate the p-value corresponding to
We compare,
n− p
¿
T2 and p ( n−p ) Fn,n-p
¿
Then we accept the null hypothesis if the calculated p-value is greater, and reject the
PROCEDURES
library(ICSNP)
aa=colMeans(Group3)
bb=cov(Group3)
N=20
P=3
cc=HotellingsT2(Group3)
#Hypothesis test:Ho:mu=[4,50,10],Ha:mu<>[4,50,10]
mu.Ho<-c(4,50,10)
T.sq<-N*t(aa-mu.Ho)%*%solve(bb)%*%(aa-mu.Ho)
test.stat=(N-P)/(P*(N-1))*T.sq
Crit.val=qf(0.95,P,N-P)
p.value=1-pf((N-P)/(P*(N-1))*T.sq,P,N-P)
round(data.frame(T.sq,test.stat,Crit.val,p.value),3)
6
RESULT
library(ICSNP)
> aa=colMeans(Group3)
> bb=cov(Group3)
> N=20
> P=3
> cc=HotellingsT2(Group3)
> mu.Ho<-c(4,50,10)
> T.sq<-N*t(aa-mu.Ho)%*%solve(bb)%*%(aa-mu.Ho)
> test.stat=(N-P)/(P*(N-1))*T.sq
> Crit.val=qf(0.95,P,N-P)
> p.value=1-pf((N-P)/(P*(N-1))*T.sq,P,N-P)
6
> round(data.frame(T.sq,test.stat,Crit.val,p.value),3)
> Group3
> aa
> bb
>N
6
[1] 20
>P
[1] 3
> cc
data: Group3
> mu.Ho
[1] 4 50 10
> T.sq
[,1]
[1,] 12.02739
6
> test.stat
[,1]
[1,] 3.587117
> crit.val
> Crit.val
[1] 3.196777
> p.value
[,1]
[1,] 0.03564193
> round(data.frame(T.sq,test.stat,Crit.val,p.value),3)
Since the p.value=0.036 is less than α=0.95 then we reject the null hypothesis with
respect to the alternative one, i.e. and commence that Ho is not equal to [4,50,10]
REFRENCE