Professional Documents
Culture Documents
GROUP ASSIGNMENT
BY
Name ID No
Samuel Lemma PGP/1062/14
Selam Kassahun PGP/1060/14
DEMISSE AMBAW PGP/1063/14
Siraj Shekmohammed PGP/1066/14
September 2022
Haramaya University, Haramaya
PART ONE
1) Data: Twins.JMP (Cochran, pp. 198) In 193 pairs of Swedish twins (2), 56 were of type MM
(both male), 72 of type MF (one male, one female), and 65 of type FF. On the hypothesis that a
twin is equally likely to be a boy or a girl and that the sexes of the two members of a twin pair
are determined independently, the probabilities of MM, MF, & FF pairs are %, %, & %,
respectively. Compute the values of the chi-square the significance probability.
Solution 1
Distributions
Twins
ff mf mm
Frequencies
N Missing 0
3 Levels Test Probabilities
Conclusion
Since our Prob>Chisq value (i.e., 0.3679) is greater than = 0.05, we fail to reject H0 &
conclude that the data came from equal probability.
Solution 2
Distributions
Gender
MM FM FF
Frequencies
MM 56 0.29016
FM 72 0.37306
FF 65 0.33679
N Missing0
3 Levels
Test Probabilities
Level Estim Prob Hypoth Prob
MM 0.29016 0.25000
FM 0.37306 0.50000
FF 0.33679 0.25000
Hypothesis:
H0: µ1=0.25, µ2=0.5, µ3=0.25 (the data came from a population having a 1/4:1/2:1/4
ratio of 56 MM, 72 of type MF, and 65 of type FF;
HA: the data came from a population not having a 1/4:1/2:1/4 ratio of 56 MM, 72 of type
MF, and 65 of type FF.
Conclusion
Since our Prob>Chisq value (i.e., 0.0013*) is less than = 0.05, we reject H0 & conclude that
the data came from a population not having a 1:1:1 ratio of 56 MM, 72 of type MF, and 65 of
type FF.
2) Flowers. jmp
In an experiment of yellow vs. green flowers, we expect a 3:1 ratio. We can test if what we
observe is statistically different from 3:1. Let's say we raise 100 flowers & observe the following
colors:
Hypothesis
H0: yellow = 0.75; green = 0.25 (the data came from a population having a 3:1 ratio of yellow
to green flowers);
HA: the data came from a population not having a 3:1 ratio of yellow to green flowers
Solution
Distributions
Seed color
Green Yellow
Frequencies
Green 16 0.16000
Yellow 84 0.84000
Total 100 1.00000
N Missing0
2 Levels
Test Probabilities
Decision
Since our Prob>Chisq value (i.e., 0.0377) is less than = 0.05, we reject H0 & conclude that the
data came from a population not having a 3:1 ratio of yellow to green flowers.
Solution
Distributions 4 Levels
seed color Test Probabilities
Frequencies
N Missing 0
Test ChiSquare DF Prob>Chisq
Likelihood Ratio 300.8662 3 <.0001*
Pearson 360.9724 3 <.0001*
Hypothesis
HO: µ1= 9/16, µ2=3/16, µ3=3/16, µ4 = 1/16
HA: the data came from a population not having a 9:3:3:1 ratio (i.e. at least
one of the proportions above is not correct).
Conclusion
Since our Prob>Chisq value (i.e., <.0001*) is less than = 0.05, we reject H0 & conclude
that the data came from a population not having a 9:3:3:1 ratio of yellow-smooth to
yellow-wrinkled to green-smooth to green-wrinkled seeds.
F1 F2 F3 F4 Total
773 231 238 59 1301
Solution
Distributions Level Count Prob
F4 59 0.04535
maize type
Total 1301 1.00000
N Missing0
4 LevelsTest Probabilities
Frequencies
Hypothesis
HO: µ1= 9/16, µ2=3/16, µ3=3/16, µ4 = 1/16 (the data came from a population having a
9:3:3:1 ratio)
HA: the data came from a population not having a 9:3:3:1 ratio (i.e., at least
one of the proportions above is not correct).
Conclusion
Since our Prob>Chisq value (i.e., 0.0259*) is less than = 0.05, we reject H0 & conclude
that the data came from a population not having a 9:3:3:1 ratio.
5) Data: Answers2.JMP People sometimes say that “b” and “c” answers occur most frequently
on multiple choice tests. To see if there is any evidence of this, use the answers below from the
verbal section of a real SAT. (This SAT exam was selected randomly from The College Board,
10 SATs, New York: College Entrance Examination Board, 1988).
Solution Frequencies
Distributions
Level Count Prob
A 12 0.14118
B 22 0.25882
C 19 0.22353
D 17 0.20000
E 15 0.17647
Total 85 1.00000
N Missing0
A B C D E 5 Levels
Test Probabilities
Level Estim Prob Hypoth Prob
A 0.14118 0.2
B 0.25882 0.2
C 0.22353 0.2
D 0.20000 0.2
E 0.17647 0.2
Test ChiSqua DF Prob>Chi
re sq
Likelihood 3.4568 4 0.4845
Ratio
Pearson 3.4118 4 0.4914
Hypothesis
H0: 1 = 2 = 3 = 4 = 5 = 0.2 (i.e., the populations have categories of equal
proportion);
HA: at least one i differs from that given under H0.
Conclusion
Since our Prob>Chisq value (i.e., 0.4914) is greater than = 0.05, we fail to reject H0 &
conclude that there is no evidence to say the data differ from that given by Ho.
6) Data: R-Number.JMP One crude check of a random number generator is to generate, say 100
random numbers, and check that 10% are b/n 0 and 1, 10% are b/n 1 and 2, 10% are b/n 2 and 3,
etc. People are very bad at picking random numbers themselves. To show that let’s try this
experiment. Everyone in class should pick a number, randomly, b/n 1 and 10 (inclusive). We’ll
then survey the class and record the distribution of chosen numbers in the table below. Test that
these numbers are randomly uniformly distributed across the 10 cells (e.g., 10% in each cell).
Random nr. 1 2 3 4 5 6 7 8 9 10 Total
Obs. 2 4 6 4 1 7 24 10 10 4 72
Frequency
Solution Frequencies
Distributions
random numbers Level Count Prob
1 2 0.02778
2 4 0.05556
3 6 0.08333
4 4 0.05556
5 1 0.01389
6 7 0.09722
7 24 0.33333
8 10 0.13889
9 10 0.13889
10 4 0.05556
1 2 3 4 5 6 7 8 9 10 Total 72 1.00000
N Missing0 10 Levels
Test Probabilities Level Estim Prob Hypoth Prob
5 0.01389 0.1
Level Estim Prob Hypoth Prob 6 0.09722 0.1
1 0.02778 0.1 7 0.33333 0.1
2 0.05556 0.1 8 0.13889 0.1
3 0.08333 0.1 9 0.13889 0.1
4 0.05556 0.1 10 0.05556 0.1
Hypothesis
Ho: 1 = 2 = 3 = 4 = 5 = 6 = 7 = 8 = 9 = 10 = 0.1(i.e., the populations have
categories of equal proportion);
HA: at least one i differs from that given under H0.
Conclusion
Since Prob>Chisq value (i.e., <.0001*) is less than = 0.05, we reject H0 & conclude that
the data came from a population not having a µ1 =10%, µ2 =10%, µ3 =10%, µ4 =10%,
µ5 =10%, µ6 = 10%, µ7 =10%, µ8 =10%, µ9 =10%, µ10 =10%, ratio.
7) Data: Seed-Color2.JMP Mendel performed experiments with peas to test his genetic theory. He
predicted that 9/16 of his peas would be Round-Yellow peas, 3/16 would be Round-Green peas,
3/16 would be Wrinkled-Yellow peas, & 1/16 would be Wrinkled- Green peas. His data is shown
Round yellow peas 315
Ho:
to the1right.
= 9/16,
Do 2
the=data
3/16, 3 = 3/16,
support 4 = 1/16;
the theory? HA: at least one EL differs from the others
Solution: Round green peas 108
Wrinkled yellow peas 101
Wrinkled green peas 32
Solution
Distributions
Seed color
Frequencies
N Missing0 4 Levels
Test Probabilities
Hypothesis
Ho: 1 = 9/16, 2 = 3/16, 3 = 3/16, 4 = 1/16 (the data came from a population having
a 9:3:3:1 ratio of Round-Yellow peas, Round-Green peas, Wrinkled-Yellow peas, &
wrinkled green peas);
HA: the data came from a population not having a 9:3:3:1 ratio of Round-Yellow peas,
Round-Green peas, Wrinkled-Yellow peas, & wrinkled-green peas
CONCLUSION
Since our Prob>Chisq value (i.e., <.0001*) is less than = 0.05, we reject H0 & conclude
that the data came from a population not having a 9:3:3:1 ratio of Round-Yellow peas,
Round-Green peas, Wrinkled-Yellow peas, & wrinkled green peas
8) Data: M&M.JMP According to the M&M website, the color distribution in milk chocolate is
30% brown, 20% yellow, 20% red, 10% green, 10% blue, & 10% orange. Get together with a
partnerColor
and combine your packages
Brownof M&Ms
Yellow to Red
test theGreen
hypothesis
Blue thatOrange
this is Total
the color
Obs. Frequency 18 18 19 14 7 16 92
Solution
Distributions Level Count Prob
green 14 0.15217
color
orange 16 0.17391
red 19 0.20652
yellow 18 0.19565
Total 92 1.00000
N Missing0 6 Levels
blue brown green orange red yellow
Freque
ncies
Level Count Prob
blue 7 0.07609
brown 18 0.19565
Test Probabilities
Level Estim Prob Hypoth Prob
blue 0.07609 0.1
brown 0.19565 0.3
green 0.15217 0.1
orange 0.17391 0.1
red 0.20652 0.2
yellow 0.19565 0.2
Test ChiSquare DF Prob>Chisq
Likelihood Ratio 10.6783 5 0.0581
Pearson 11.4239 5 0.0436*
Hypothesis
Ho: 1 = 30% brown, 2 = 20% yellow, 3 = 20% red, 4=10% green, 5=10% blue,
6=10% orange (the data came from a population having a 3:2:2:1:1:1 ratio of brown,
yellow, red, green, blue and orange);
HA: the data came from a population not having a 3:2:2: 1:1:1 ratio of brown, yellow,
red, green, blue, and orange.
Conclusion
Since our Prob>Chisq value (i.e., 0.0436*) is less than = 0.05, we reject H0 & conclude
that the M&M color distribution in milk chocolate does not come from a 3:2:2: 1:1:1
ratio of brown, yellow, red, green, blue and orange.
9) In rice, the green leafhopper is suspected to differ in feeding preference b/n an already
diseased plant and a healthy plant. The researcher, therefore, encloses a prescribed number of
green leafhoppers in a cage that holds an equal number of healthy and diseased rice plants. After
2 hours of caging, he then counts the number of insects found on diseased and on healthy plants.
Of 239 insects confined, 67 were found on the healthy plants and 172 on the diseased plants.
Does the observed ratio of 67:172 deviate significantly from the hypothesized no-preference
ratio 0f 1:1?
Solution
Distributions
plant status
diseasd health
Frequencies
N Missing0
2 LevelsTest Probabilities
Hypothesis
Ho: µ1 = µ2 = 1:1 (i.e., the populations have categories of equal proportion);
HA: at least one i differs from that given under H0
Conclusion
Since our Prob>Chisq Pearson value (i.e., <.0001*) is less than = 0.05, we reject H0 &
conclude that the observed ratio of 67:172 deviate significantly from the hypothesized
no-preference ratio 0f 1:1 (the populations have no categories of equal proportion)
PART TWO
1) Descriptive statistics
40 50 60 70 80 90 100 110
Compare Distributions
Show Distribution AICc BIC -2*LogLikelihood
Quantiles
Summary Statistics
Mean 73.9625
Std Dev 14.023889
Std Err Mean 1.5679185
Upper 95% Mean 77.083364
Lower 95% Mean 70.841636
N 80
Variance 196.66946
Skewness 0.069106
Kurtosis -0.418091
CV 18.96081
Minimum 42
Maximum 108
Measures
-2*LogLikelihood 648.55212
AICc 652.70797
BIC 657.31618
Goodness-of-Fit Test
W Prob<W
Shapiro-Wilk 0.9880738 0.6695
A2 Simulated p-
Value
Anderson-Darling 0.4274881 0.3024
minimum…………………….............................................................42
maximum…………………………………………………………...108
mean………………………………………………………………….73.9625
variance……………………………………………………………...196.66946
coefficient of variation………………………………………………...18.96081
N……………………………………………………………………....80
B) Test whether the above data is normally distributed or not.
Hypothesis
Ho: the data is from the normal distribution
HA: the data is not from the normal distribution
Conclusion
Since our Shapiro-Wilk is (i’e. 0.6695) is greater than >0.05. so, we fail to reject the null
hypothesis and there is no evidence to say that the data is from a normal distribution.
Distributions
amt
Quantiles
100.0% maximum 2.6
99.5% 2.6
97.5% 2.6
90.0% 2.6
75.0% quartile 2.525
50.0% median 2.35
25.0% quartile 2.05
10.0% 1.9
2.5% 1.9
0.5% 1.9
0.0% minimum 1.9
Summary Statistics
Mean 2.3
Std Dev 0.2607681
Std Err Mean 0.1064581
Upper 95% Mean 2.5736593
Lower 95% Mean 2.0263407
N 6
Test Mean
Hypothesized Value 2
Actual Estimate 2.3
DF 5
Std Dev 0.26077
t Test
Test Statistic 2.8180
Prob > |t| 0.0372*
Prob > t 0.0186*
Prob < t 0.9814
Hypothesis
Ho µ=2.0
Ha µ≠2.0
Conclusion
P-value =0.0372 leads to the decision of rejecting Ho & we conclude that the mean of the new
variety is not equal to 2.0.
The mean yield (2.3) is significantly higher than the hypothesized mean (2.0).
yield
8 8.5 9 9.5
Quantiles
Summary Statistics
Mean 8.6
N 6
Test Mean
Hypothesized Value 8
DF 5
t Test
Ha µ≠8.0
Conclusion
P-value =0.0648 leads to the decision of rejecting Ho & we conclude that the mean of this year
yield is not equal to 8.0.
The mean yield (8.6) is significantly higher than the hypothesized mean (8.0).
diff
-1 0 1 2 3
Quantiles
Mean 1.525
Std Dev 1.5163396
Std Err Mean 0.536107
Upper 95% Mean 2.7926916
Lower 95% Mean 0.2573084
N 8
Test Mean
Hypothesized Value 0
Actual Estimate 1.525
DF 7
Std Dev 1.51634
t Test
Hypothesis
H o: 1 = 0
H a: 1 0
Conclusion
P-value=0.0249 suggests that the population means are not equal and we reject Ho.
Thus, we would conclude that the two varieties are not equal. (Mean of the difference (1.5)
shows a significant increase)
3.2
2.8
2.6
2.4
2.2
cont +grth
oilseed
Oneway Anova
Summary of Fit
Rsquare 0.058055
Adj Rsquare 0.005725
Pooled t Test
+grth-cont
Assuming equal variances
Difference 0.17000 t Ratio 1.053283
Std Err Dif 0.16140 DF 18
Upper CL Dif 0.50909 Prob > |t| 0.3061
Lower CL Dif -0.16909 Prob > t 0.1531
Confidence 0.95 Prob < t 0.8469
Analysis of Variance
Our hypothesis
H o: 1 = 2
H a: 1 2
Conclusion
Rsquare 0.343353
Pooled t-Test
standard variety-new variety
Assuming equal variances
C. Total 15 1.1893750
Hypothesis
Ho: µ1 = µ2
Ha: µ1 ≠ µ2(µ1 = new variety, µ2 = standard variety).
Conclusion
Since (Prob > ) p-value is 0.0171*. Suggests that the varieties means are not equal.
Thus, we would conclude that the mean of new variety and standard variety are not equal.
(Mean of new variety is significantly higher than that of standard variety)
4.5
3.5
A B
varity
Oneway Anova
Summary of Fit
Rsquare 0.276017
Pooled t Test
B-A
Assuming equal variances
Analysis of Variance
C. 18 6.5968421
Total
Hypothesis
Ho:1=2
H a: 1 2
Conclusion
P-value=0.0209 suggests that the population means are not equal.
Thus, we would conclude that variety A and variety B are not equal. (Mean of B (4.7) is
significantly higher than that of A (4.0))
7. Chi-square test
In an experiment of purple vs. green seeds, we might expect a 3:1 ratio. We can test if what we
observe is statistically different from 3:1. Let's say we raise 100 flowers and observe the
following colors:
Seed color Yellow Green Total
Observed 112 46 158
frequency
Write H0 & HA.
Do the analysis & make your decision?
Solution
Distributions
seed colar
yellow green
Frequencies
green 46 0.29114
N Missing0
2 Levels
Test Probabilities
Level Estim Prob Hypoth Prob
Hypothesis
H0: yellow = 0.75; green = 0.25, in other words H0: = the data came from a population
having a 3:1 ratio of yellow to green flowers;
HA: the data came from a population not having a 3:1 ratio of yellow to green flowers
Conclusion and interpretation
Since our Prob>Chisq value (i.e., 0.2324) is greater than = 0.05, we fail to reject H0 &
conclude that we don’t have evidence data came from a population having a 3:1 ratio of
yellow to green flowers.
8) Chi-square test
A parental cross is expected to produce progeny in categories Brown, Yellow, Red & Green
with a ratio of 9:3:3:1. The observed frequencies of the progeny were as given below:
sed colour
Frequencies
green 8 0.03200
red 50 0.20000
yellow 42 0.16800
N Missing0
4 Levels
Test Probabilities
Conclusion
Since our Prob>Chisq Pearson p-value (i.e., 0.1674) is greater than = 0.05, we fail to
reject H0 & conclude that we don’t have evidence that the data differ from a population
having a ratio of 9:3:3:1.
9) Chi-square test
Seed colour
Petal Dark- Brown Light- White
colour brown brown
Purple 21 32 13 4
Mauve 55 102 70 8
White 14 59 112 10
Write H0 & HA.
Do the analysis & make your decision?
Do correspondence analysis if necessary!
Solution
Hypothesis
H0: there is no relationship between Seed color & Petal color (independent);
HA: there is a relationship between Seed color & Petal color (Dependent).
Contingency Analysis of seed color by petal color
Mosaic Plot
1.00 W
0.75 LB
0.50
0.25
DB
0.00
purple mauve white
petal color
Freq: freq
Contingency Table
petal color by seed color
Count DB B LB W Total
Total %
Col %
Row %
purple 21 32 13 4 70
4.20 6.40 2.60 0.80 14.00
23.33 16.58 6.67 18.18
30.00 45.71 18.57 5.71
mauve 55 102 70 8 235
11.00 20.40 14.00 1.60 47.00
61.11 52.85 35.90 36.36
23.40 43.40 29.79 3.40
white 14 59 112 10 195
2.80 11.80 22.40 2.00 39.00
15.56 30.57 57.44 45.45
7.18 30.26 57.44 5.13
Total 90 193 195 22 500
18.00 38.60 39.00 4.40
Tests
N DF -LogLike RSquare (U)
500 6 30.679975 0.0520
Test ChiSquare Prob>ChiSq
Likelihood Ratio 61.360 <.0001*
Pearson 58.575 <.0001*
Correspondence Analysis
0.6
DB
purple
0.4
mauve
0.2 B
c1 0.0
W
-0.2
LB
white
-0.4
-0.6
-0.6 -0.4 -0.2 0.0 0.2 0.4 0.6
c2
Based on the result from correspondence analysis, Dark-brown and Purple have a high
relationship followed by Brown and Mauve. On the other hand, Light-brown and White have a
low relationship with others.
0.75
0.50
no of germinate
0.25
0.00
A B C D E
storage type
Freq: count
Contingency Table
storage type By type
Correspondence Analysis
0.6
no of not germinate
0.4
C
0.2
B
c1 0.0 D
no of germinate
A
-0.2 E
-0.4
-0.6
-0.6 -0.4 -0.2 0.0 0.2 0.4 0.6
c2
Hypothesis
H0: there is no relationship between germination status & storage method (independent);
HA: there is a relationship between germination status & storage method (Dependent).
Decision
Since the Pearson Prob>Chisq value (i.e., 0.0007*) is less than = 0.05, we reject H0 &
conclude that germination & storage type are dependent.
From the correspondence analysis we get the following facts of relationships. It looks like
number of germinated seed prefer the storage type A, D, E, B. far more than any other
storage type C. Also, the storage type C & the number of not germinated seed more
related than the other four storage type.
0.75
rooting
typ 0.50
0.25
not rooting
0
A B
hormon
Freq: frequency
Contingency Table
hormon By typ
Tests
not rooting
0.4
A
0.2
c1 0
rooting
-0.2 B
-0.4
Hormone typ
typ c1
rooting -0.1435
hormon c1
A 0.2738
B -0.2191
Hypothesis
Decision
Since the Pearson Prob>Chisq value (i.e. 0.0201*) is less than = 0.05, we reject H0 &
conclude that hormone and rooting are dependent.
From the correspondence analysis we get the following facts of relationships. It looks like
rooting prefer the hormone type B (B hormone) far more than any other hormone A (A
hormone).
Frequencies
Level Count Prob
Spotted 20 0.05000
N Missing 0 3
Levels
Test Probabilities
Level Estim Prob Hypoth Prob
Brown 0.60000 0.48000
Spotted 0.05000 0.16000
White 0.35000 0.36000
Since our Prob>Chisq value (i.e., <.0001*) is less than = 0.05, we reject are & conclude that
the proportions in color is not that of the sample expected (not given by the 9:12:4 ratio of
brown, spotted, and white.
0.75
B
0.50
0.25
A
0.00
mastiti no mastiti
mastities
Freq: count
Contingency Table
mastities By group
Count A B C Total
Total %
Col %
Row %
mastiti 36 29 10 75
12.00 9.67 3.33 25.00
37.50 21.97 13.89
48.00 38.67 13.33
no mastiti 60 103 62 225
20.00 34.33 20.67 75.00
62.50 78.03 86.11
26.67 45.78 27.56
Total 96 132 72 300
32.00 44.00 24.00
Tests
0.4 mastiti
0.2
c1 0.0
B
no mastiti
-0.2
C
-0.4
mastitis group
Hypothesis
H0: there is no relationship between number of cow & treatment groups (independent);
HA: there is a relationship between number of cow & treatment groups (Dependent).
Decision
Since the Pearson Prob>ChiSq value = 0.0012* is less than α= 0.05, we reject Ho &
conclude that there is a relationship between number of cow & treatment groups
(dependent).
Based on the result of Contingency Table from correspondence analysis, number of cows
that have mastitis is highly significant in treatment group A.
14) One sample t test
Solution
Ho: μ = 4000kg
Ha: μ ≠ 4000kg
Distributions
milk yield
Summary Statistics
Mean 3800
Std Dev 500.15553
N 10
Test Mean
z Test
Test Statistic -1.2649
Distributions
-2 0 2 4 6 8 10 12
Quantiles
100.0% maximum 9
99.5% 9
97.5% 9
90.0% 9
75.0% quartile 6.5
50.0% median 2
25.0% quartile 0
10.0% -1
2.5% -1
0.5% -1
0.0% minimum -1
Summary Statistics
Mean 3.1111111
Std Dev 3.6552854
Std Err Mean 1.2184285
Upper 95% Mean 5.9208122
Lower 95% Mean 0.3014101
N 9
Test Mean
Hypothesized Value 0
Actual Estimate 3.11111
DF 8
Std Dev 3.65529
t Test
Test Statistic 2.5534
Prob > |t| 0.0340*
Prob > t 0.0170*
Prob < t 0.9830
Hypothesis
Ho:1=2 2
H a: 1 2
Conclusion
Solution
Oneway Analysis of FREQUENCY By GROUP
140
130
120
110
A B
GROUP
Oneway Anova
Summary of Fit
Rsquare 0.203236
Pooled t Test
B-A
Assuming equal variances
-10 -5 0 5 10
Analysis of Variance
Hypothesis
• Ho: µ1 = µ2
• Ha: µ1 ≠ µ2(µ1 = group A measurement, µ2 = Group B measurement).
Conclusion
Since (Prob > F ) p-value is 0.0604, so we fail to reject the null hypothesis.
We don’t have evidence that the group measurement means are equal.
PART THREE
Solution
DAT set 1
Response yield
Whole Model
Actual by Predicted Plot
18
16
14
12
10
4
4 6 8 10 12 14 16 18
yield Predicted RMSE=2.811 RSq=0.07
PValue=0.5968
Summary of Fit
RSquare 0.074132
Analysis of Variance
Effect Tests
Source Nparm DF Sum of Mean Square F Ratio Prob > F
Squares
TRT
Leverage Plot
15
10
Response yield
Whole Model
Actual by Predicted Plot
8
1
1 2 3 4 5 6 7 8
yield Predicted RMSE=1.4048 RSq=0.23
PValue=0.0564
Effect Summary
Summary of Fit
RSquare 0.225568
Analysis of Variance
Effect Tests
0
4.0 4.5 5.0 5.5 6.0
TRT Leverage, P=0.0564
Data set 3
Response yield
Whole Model
Actual by Predicted Plot
30
25
20
15
10
10 15 20 25 30
yield Predicted RMSE=4.2162 RSq=0.58
PValue=<.0001
-5
-10
10 15 20 25 30
yield Predicted
Summary of Fit
RSquare 0.579513
RSquare Adj 0.531457
Root Mean Square Error 4.216234
Mean of Response 21.14975
Observations (or Sum Wgts) 40
Analysis of Variance
TRT
Leverage Plot
30
25
20
15
10
5
14 16 18 20 22 24 26 28
TRT Leverage, P<.0001
Mean[i]-Mean[j] 1 2 3 4 5
Std Err Dif
Lower CL Dif
Upper CL Dif
1 0 10.4938 5.2725 -1.315 8.58125
0 2.10812 2.10812 2.10812 2.10812
0 4.43278 -0.7885 -7.376 2.52028
0 16.5547 11.3335 4.74597 14.6422
2 -10.494 0 -5.2213 -11.809 -1.9125
2.10812 0 2.10812 2.10812 2.10812
-16.555 0 -11.282 -17.87 -7.9735
-4.4328 0 0.83972 -5.7478 4.14847
3 -5.2725 5.22125 0 -6.5875 3.30875
2.10812 2.10812 0 2.10812 2.10812
-11.333 -0.8397 0 -12.648 -2.7522
0.78847 11.2822 0 -0.5265 9.36972
4 1.315 11.8088 6.5875 0 9.89625
2.10812 2.10812 2.10812 0 2.10812
-4.746 5.74778 0.52653 0 3.83528
7.37597 17.8697 12.6485 0 15.9572
5 -8.5813 1.9125 -3.3088 -9.8963 0
2.10812 2.10812 2.10812 2.10812 0
-14.642 -4.1485 -9.3697 -15.957 0
-2.5203 7.97347 2.75222 -3.8353 0
Level Least Sq
Mean
4 A 27.071250
1 A B 25.756250
3 B C 20.483750
5 C 17.175000
2 C 15.262500
Data set 4
Response yield
Whole Model
Actual by Predicted Plot
80
70
60
50
40
40 50 60 70 80
yield Predicted RMSE=6.8385 RSq=0.45
PValue=0.0002
Summary of Fit
RSquare 0.452224
RSquare Adj 0.389621
Root Mean Square Error 6.838489
Mean of Response 61.42225
Observations (or Sum Wgts) 40
Analysis of Variance
TRT
Leverage Plot
80
70
60
50
40
55 60 65 70
TRT Leverage, P=0.0002
Level Least Sq
Mean
3 A 70.120000
2 A B 64.473750
4 A B C 62.290000
5 B C 56.455000
1 C 53.772500
For the data sets given above answer the following questions:
1. What is (are) the factor(s) in this experiment?
We have one (1) treatment factor for all data sets given above.
2. What are the levels of the factor(s) in this experiment?
We have five (5) level for the above four data set given.
3. Is there block effect(s), if yes give the block(s)?
There is no block effect in the CRD design f experiment, this means no block
effect for all four data sets. Because the design is CRD.
4. State the null & alternative hypotheses!
H0: µ1 = µ2 = µ3 = µ4 = µ5 (or all means are equal)
HA: not all the means are equal = at least one mean is different from the rest
5. Undertake ANOVA using JMP & based on the output you get interpret
your results and give your decision & recommendation?
Result and interpretation
In Data set 1;
Since Prob > F value (i.e., 0.5968) p values greater than α = 0.05, we fail to reject the null
hypothesis and conclude that we don’t have evidence that the mean between the treatment
1, 2, 3, 4, and 5 is not equal.
In Data set 2;
Since Prob > F value (i.e., 0.0564) p values greater than α = 0.05, we fail to reject the null
hypothesis and conclude that we don’t have evidence that the mean between the treatment
1, 2, 3, 4, and 5 are not equal.
In Data set 3;
Since Prob > F value (i.e., <.0001*) p values less than α = 0.05, we reject null hypothesis
and conclude that the mean between the treatment 1, 2, 3, 4, 5 are not equal.
Based on the result from the multiple comparison procedure (MCP), in this group, treatment
four (4) is high significance followed by treatment one (1) while treatments two (2) and five
(5) are low significance.
In Data set 4;
Since Prob > F value (i.e., 0.0002*) p values less than α = 0.05, we reject null hypothesis
and conclude that the mean between the treatment 1, 2, 3, 4, 5 are not equal.
Based on the result from the multiple comparison procedure (MCP), in this group, treatment
three (3) is high significance followed by treatments two (2) and four (4) while treatments
one (1) and five (5) are low significance.
6. Which group do you recommend as best? Why?
Option 1:
Based on the significant difference, we recommend Data Set 3 and Data Set 4 because they
have a significant difference (p-value. <.0001* and 0.0002*) Data Set 3 and Data Set 4
respectively, while Data Set 1 and Data Set 2 have no significant difference.
Option 2:
Based on the significance level, we recommend Data Set 3 because it has a highly
significant difference (p-value <.0001*) than Data Set 4 which has (p-value of 0.0002*).
7. Give the compare and contrast of this model using the relative efficiency value
in comparison to RCBD/LSD depending on the presence of block(s)?
Relative efficiency to compare RCBD with LSD.
Solution
Data set 1
Given:
MScolumns = 5.535866
Treatment = 5
MSerror = 7.90174
RE = 5.535866 + (5 – 1)* 7.90174/ 5 (7.90174) = 0.94069
Data set 2
Given:
MScolumns = 5.029791
Treatment = 5
MSerror = 1.97355
RE = 5.029791 + (5 – 1)* 1.97355/ 5 (1.97355) = 1.30972
Data set 3
Given:
MScolumns = 214.3718
Treatment = 5
MSerror = 17.777
RE = 214.3718+ (5 – 1)* 17.777/ 5 (17.777) = 3.2117
Data set 4
Given:
MScolumns = 337.8155
Treatment = 5
MSerror = 46.765
RE = 337.8155+ (5 – 1)* 46.765/ 5 (46.765) = 2.2447
The interpretation is that: Based on the following rule decision can be made.
o If RE >1, then RCBD is more efficient
than CRD
o If RE =1, then both RCBD & CRD are
equally efficient
o If RE <1, then CRD is better
So in data set 2, 3, and 4 has the higher efficiency percent, means that the result we
get for those data set ( 2, 3 and 4) is greater than one ( RE = >1). This leads as to
the decision of RCBD is efficient than that of LSD.