Professional Documents
Culture Documents
Instructions:
Name: ______________________________________
Signature: ___________________________________
Total 50
1
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
ID:___________________________
Question 1 (2 marks) At the start of their midterm, a student sees an F-statistic of 5.22.
From a data structure of three groups and eight observations from each group, what is the
range of the corresponding p-value?
Question 3 (2 marks) Is the following statement true or false? Defend your answer
either way in one or two sentences. Simply an answer of “true” or “false” will not
receive any credit. “It is possible to make causal inferences in an experiment.”
2
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
Question 6 (15 marks total) If Darwinism has taught us anything, it’s that science is best
understood by analyzing superior beings; since aliens allegedly don’t exist, let’s take a
look at varsity athletes. Keeping it simple, though, let’s analyse their heights. The tables
below summarize: summary statistics of varsity teams (measured in inches); the ANOVA
output; selected linear combinations. Assume all assumptions hold.
Contrast Coefficients
Type
Contrast 1 2 3 4 5 6
1 1 -1 0 1 -1 0
2 0 1 -1 0 1 -1
3 -1 0 1 -1 0 1
4 1 1 1 0 -3 0
5 2 -1 0 0 -1 0
Contrast Tests
Value of
Contrast Contrast Std. Error t df Sig. (2-tailed)
Height Assume equal 1 9.770 1.379 7.085 109 .000
variances
2 -9.880 1.301 -7.593 109 .000
3 0.110 1.489 .074 109 .941
4 -4.120 2.113 -1.950 109 .054
5 4.340 1.741 2.493 109 .014
a) (2 marks) If you had to fully analyse “unplanned comparisons” upon the groups, how
many unique pairings of the groups will you need? Also, if the experiment-wise
confidence level is 98.5%, what are the corresponding individual confidence levels?
3
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
The following ANOVA output compares average height by grouping the varsity teams by
gender (Pandas are female, Golden Bears are male):
ANOVA
Height
Sum of
Squares df Mean Square F Sig.
Between Groups 645.6 1 645.6 44.24 .000
Within Groups 1649.0 113 14.6
Total 2294.6 114
b) (3 marks) Suppose we want to test if a model, where all teams have potentially
different mean heights, is significantly better than this “gender” model. List appropriate
null and alternative hypotheses for such a test, as well as identify SSR and df for the
respective models.
c) Are the volleyball players taller than the basketball players? Carry out an
appropriate test to answer this question.
i) (2 marks) First, define a linear combination of means contrasting the average
heights of volleyball players and basketball players. Fill in the blanks below for
coefficients in your contrast.
ii) (2 marks) Give the estimate and standard error for this contrast.
g = ________________________________
S.E.(g) = ________________________________
iii) (3 marks) What are the test statistic and exact p-value for the test to answer the
question listed at the start of part c)?
iv) (3 marks) Make a decision and state your conclusion in plain English.
4
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
Question 7 (14 marks total) Numerous CEOs at large corporations have recently
become concerned with how much REM (rapid eye movement) sleep they’ve been
having and the average price of their corporation’s stock in the following week.
Consulting an “a-maize-ing” Cobb (yeah, he sounds pretty corny), the data was
surprisingly easy to obtain for the two variables. Time of REM sleep (x) was measured
in minutes and average stock price in the following week (y) was measured in $US.
Assume all regression model assumptions hold.
Sum of
Model Squares df Mean Square F Sig.
1 Regression 1310.957 1 1310.957 66.487 .000a
Residual 453.501 23 19.717
Total 1764.458 24
a Predictors: (Constant), Sleep
b Dependent Variable: StockPrice
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
a) (2 marks) Predict the average stock price of a company in the following week if the
REM sleep of the CEO is 1 hour and 40 minutes.
Interpreting the correlation value, the variables have a (strong, moderate, weak)
relationship that has a (positive, negative) association.
5
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
c) (7 marks) Test if the simple linear regression model is significantly better than the
“one-mean” model. Carry out a full analysis in detail.
d) (3 marks) Calculate a 99% confidence interval for the intercept of the simple linear
regression model above.
6
Statistics 252 – Midterm Exam – Paul Cartledge – Winter 2011
Based on statistical evidence, does light in Daub River have a higher wavelength?
c) (4 marks) Suppose S.E.(Estimate) = 7.482. Calculate the test statistic, state the
distribution of the test statistic, and determine the range of the p-value.
d) (2 marks) Using part c), make a decision and state a conclusion in plain English.