Final Exam with Solutions

April 24

SURNAME

Winter, 2008

GIVEN NAMES

Time: 2 Hours

I.D.

STUDENT IDENTIFICATION

Each candidate must sign the Seating List confirming presence at the examination. All candidates for final examinations are required to place their University of Calgary student I.D. cards on their desks for the duration of the examination. (Students writing mid-term tests can also be asked to provide identity proof.) Students without an I.D. card who can produce an acceptable alternative I.D., e.g., one with a printed name and photograph, are allowed to write the examination.

A student without acceptable I.D. will be required to complete an Identification Form. The form indicates that there is no guarantee that the examination paper will be graded if any discrepancies in identification are discovered after verification with the student's file. A student who refuses to produce identification or who refuses to complete and sign the Identification Form is not permitted to write the examination.

EXAMINATION RULES

1. Students late in arriving will not normally be admitted after one-half hour of the examination time has passed.
2. No candidate will be permitted to leave the examination room until one-half hour has elapsed after the opening of the examination, nor during the last 15 minutes of the examination. All candidates remaining during the last 15 minutes of the examination period must remain at their desks until their papers have been collected by an invigilator.
3. All enquiries and requests must be addressed to supervisors only.
4. Candidates are strictly cautioned against:
   (a) speaking to other candidates or communicating with them under any circumstances whatsoever;
   (b) bringing into the examination room any textbook, notebook or memoranda not authorized by the examiner;
   (c) making use of calculators and/or portable computing machines not authorized by the instructor;
   (d) leaving answer papers exposed to view;
   (e) attempting to read other students' examination papers.
   The penalty for violation of these rules is suspension or expulsion or such other penalty as may be determined.
5. Candidates are requested to write on both sides of the page, unless the examiner has asked that the left half page be reserved for rough drafts or calculations.
6. Discarded matter is to be struck out and not removed by mutilation of the examination answer book.
7. Candidates are cautioned against writing in their answer books any matter extraneous to the actual answering of the question set.
8. The candidate is to write his/her name on each answer book as directed and is to number each book.
9. A candidate must report to a supervisor before leaving the examination room.
10. Answer books must be handed to the supervisor-in-charge promptly when the signal is given. Failure to comply with this regulation will be cause for rejection of an answer paper.
11. If a student becomes ill or receives word of domestic affliction during the course of an examination, he/she should report at once to the Supervisor, hand in the unfinished paper and request that it be cancelled. Thereafter, if illness is the cause, the student must go directly to University Health Services so that any subsequent application for a deferred examination may be supported by a medical certificate. An application for Deferred Final Examinations must be submitted to the Registrar by the date specified in the University Calendar. Should a student write an examination, hand in the paper for marking, and later report extenuating circumstances to support a request for cancellation of the paper and for another examination, such request will be denied.
12. SMOKING DURING EXAMINATIONS IS STRICTLY PROHIBITED.

Stat 217 (L01, L02 and L06), Final Exam, Winter 2008

Introduction

This is a closed book exam. A two-sided, 8.5” x 11” formula sheet is allowed. A non-programmable and non-graphing calculator is allowed. A standard normal z-table, a t-table, a χ2-table and F-tables are provided.

This examination is scheduled for a 2-hour period. It has 38 multiple-choice questions. All questions are equally weighted. Blank answers and incorrect answers are scored as zero credit, while correct answers are worth 1 point each.

Blacken the course number (STAT 217) and section number (L01, L02 or L06), your name and ID on the corresponding space on the answer sheet with a No. 2 pencil.

Read each question and its lettered answers. When you have decided which answer is correct, blacken the corresponding space on the answer sheet with a No. 2 pencil. Make no stray marks; they may count against you. For questions involving numerical answers, carry your computation to as many significant digits as shown in the available answers and select the best answer that you think to be closest to the correct answer.

Hand in the exam papers with the answer sheet.

Table 1: Correct answers to the multiple-choice questions

Question  Your      Question  Your      Question  Your
number    answer    number    answer    number    answer
   1        C          16       C          31       B
   2        D          17       B          32       A
   3        C          18       C          33       D
   4        A          19       B          34       B
   5        C          20       B          35       C
   6        B          21       C          36       D
   7        D          22       A          37       C
   8        A          23       C          38       B
   9        B          24       B
  10        A          25       B
  11        A          26       A
  12        C          27       C
  13        B          28       B
  14        A          29       C
  15        D          30       A

1. The heights (in inches) of males in the U.S. are believed to be normally distributed. The average height of a random sample of 25 American adult males is found to be x̄ = 69.72 inches with a standard deviation of s = 4.15. What is the standard error of x̄?
A) 0.17
B) 0.69
C) 0.83
D) 2.04

2. The hypotheses H0: µ = 350 versus Ha: µ < 350 are examined using a sample of size n = 20. The one-sample t statistic has the value t = −1.68. What do we know about the P-value of this test?
A) P-value < 0.01
B) 0.01 < P-value < 0.025
C) 0.025 < P-value < 0.05
D) P-value > 0.05

CAUTION: the next 3 questions refer to the following information:
You wish to compare the prices of apartments in two neighboring towns. You take a simple random sample of 12 apartments in town A and calculate the average price of these apartments. You repeat this for 15 apartments in town B. Let µ1 represent the true average price of apartments in town A and µ2 the average price in town B.

3. What would be the hypotheses for this problem?
A) H0: µ1 = µ2 versus Ha: µ1 < µ2
B) H0: µ1 = µ2 versus Ha: µ1 > µ2
C) H0: µ1 = µ2 versus Ha: µ1 ≠ µ2
D) None of the above.

4. If we were to use the unpooled t test, what would be the conservative estimate for the degrees of freedom?
A) 11
B) 12
C) 14
D) 25

5. The t statistic for comparing the mean prices is 2.1. Suppose we were to use the unpooled t test with the conservative estimate for the degrees of freedom. What can we say about the value of the P-value?
A) P-value < 0.01
B) 0.01 < P-value < 0.05
C) 0.05 < P-value < 0.10
D) P-value > 0.10

CAUTION: the next 3 questions refer to the following information:
A political analyst was curious if younger adults were becoming more conservative. He decided to see if the mean age of registered Republicans was lower than that of registered Democrats. He selected a simple random sample of 128 registered Republicans from a list of registered Republicans and determined the mean age to be x̄1 = 39 years with a standard deviation s1 = 8 years. He also selected an independent simple random sample of 200 registered Democrats from a list of registered Democrats and determined the mean age to be x̄2 = 40 years with a standard deviation s2 = 10 years. Let µ1 and µ2 represent the mean ages of the populations of all registered Republicans and Democrats, respectively. Let σ1 and σ2 be the corresponding population standard deviations.

6. Suppose it is reasonable to assume that the distributions of age in the populations of registered Republicans and of registered Democrats are normal with the same standard deviation. What is a 90% confidence interval for µ1 − µ2?
A) −1 ± 1.66 years
B) −1 ± 1.74 years
C) −1 ± 1.98 years
D) 1 ± 2.76 years

7. Suppose the political analyst had wished to test the hypotheses H0: µ1 = µ2 versus Ha: µ1 < µ2. What can we say about the value of the P-value?
A) P-value < 0.025
B) 0.025 < P-value < 0.05
C) 0.05 < P-value < 0.10
D) P-value > 0.10

8. Suppose we wish to test the hypothesis that the groups are equivalent in how variable their ages are. To do this, we wish to test the hypotheses H0: σ1 = σ2 versus Ha: σ1 ≠ σ2. What can we say about the value of the P-value?
A) P-value < 0.05
B) 0.05 < P-value < 0.10
C) 0.10 < P-value < 0.20
D) P-value > 0.20

CAUTION: the next 2 questions refer to the following information:
A sociologist is studying the effect of having children within the first two years of marriage on the divorce rate. Using hospital birth records, she selects a simple random sample of 200 couples who had children within the first two years of marriage. Following up on these couples, she finds that 80 couples are divorced within five years. Let p = the population proportion of couples who had children within the first two years of marriage and are divorced within five years.

9. What is a 90% plus four confidence interval for the proportion of couples who had children within the first two years of marriage and are divorced within five years?
A) 0.402 ± 0.035
B) 0.402 ± 0.056
C) 0.402 ± 0.067
D) 0.402 ± 0.088

10. Suppose the sociologist wishes to test H0: p = 1/3 versus Ha: p ≠ 1/3. At the 10% significance level, are the data statistically significant for testing these hypotheses?
A) Yes
B) No
C) This cannot be determined from the information given.

CAUTION: the next 4 questions refer to the following information:
A simple random sample of 100 college students was interviewed. They were asked what size pizza they usually order and what their favorite topping is. The results are presented below.

                     Topping
Size      Pepperoni   Veggie   Cheese   Total
Small         18        11        6       35
Medium        14        12        7       33
Large          3         9       20       32
Total         35        32       33      100

11. What would be the null hypothesis for a chi-square test based on these data?
A) Pizza topping and pizza size are independent.
B) The average pizza size is the same for pepperoni, veggie, and cheese pizzas.
C) The distribution of pizza topping is the same for small, medium, and large pizzas.
D) The distribution of pizza size is different for the three different pizza toppings.

12. Under the null hypothesis that there is no association between pizza size and pizza topping, what is the value of the expected count for a small pepperoni pizza?
A) 11.67
B) 12
C) 12.25
D) 13

13. What are the appropriate degrees of freedom for the chi-square statistic?
A) 3
B) 4
C) 6
D) 8

14. What is the contribution to the chi-square statistic from the cell of a large cheese pizza?
A) 8.4
B) 9.3
C) 20.4
D) 52.8

CAUTION: the next 3 questions refer to the following information:
A particular paperback mystery book is published with a choice of three different pictures on the cover: a photograph of the actor playing the main character in the movie version of the book, a drawing of the mansion where the story in the book takes place, or an embossed graphic of the murder weapon. A certain bookstore keeps copies of this book with each of the pictures on the cover on its racks. To test the hypothesis that sales of this book are equally divided among the three choices, a simple random sample of 120 purchases of this book is obtained. The numbers are displayed in the table below.

Picture on the cover:   Photograph   Drawing   Embossed graphic   Total
                            31          47            42           120

15. Under the null hypothesis that sales are equally divided among the three different covers, what is the value of the expected count for sales of the book with the photograph of the actor on the cover?
A) 0.258
B) 0.333
C) 31
D) 40

16. What is the value of the chi-square statistic?
A) 1.225
B) 3
C) 3.35
D) 40

17. At the 10% significance level, do we reject the null hypothesis?
A) Yes
B) No
C) This cannot be determined from the information given.

CAUTION: the next 3 questions refer to the following information:
The data referred to in this question were collected on 41 employees of a large company. The company is trying to predict the current salary of its employees from their starting salary (both expressed in thousands of dollars). The Minitab regression output is given below as well as some summary measures:

Descriptive Statistics
                   Minimum   Maximum     Mean     Variance
Starting Salary     10.95     44.10    17.9818     57.472
Current Salary      19.65    100.00    36.2543    305.297

Model Summary
R (or r)    R Square    Adjusted R Square
.913195     .833926     .829667

ANOVA
Source        df   Sum of Squares   Mean Square      F       p-value
Regression     1     10183.816       10183.816    195.835     .000
Error         39      2028.084          52.002
Total         40     12211.900

Coefficients
Predictor          Coefficients   Std. Error      t       p-value
Constant             -1.59266      2.92961      -.544      .590
Starting Salary       2.10473       .15040     13.994      .000
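The regression output above can be cross-checked with a few lines of arithmetic. The following Python sketch recomputes the derived quantities (mean squares, F, R Square, t ratios) from the printed sums of squares and coefficients. The variable names are illustrative, and the numbers are the values as read from the output here, so agreement is only expected up to rounding.

```python
import math

# Values as read from the Minitab output (salaries in thousands of dollars).
ss_regression, ss_error, ss_total = 10183.816, 2028.084, 12211.900
df_regression, df_error = 1, 39
slope, slope_se = 2.10473, 0.15040
intercept, intercept_se = -1.59266, 2.92961

# Mean squares and the ANOVA F statistic.
ms_regression = ss_regression / df_regression
ms_error = ss_error / df_error
f_stat = ms_regression / ms_error

# R Square is the share of total variation explained by the regression.
r_squared = ss_regression / ss_total
r = math.sqrt(r_squared)  # positive root, since the slope is positive

# Each t ratio is the coefficient divided by its standard error.
t_slope = slope / slope_se
t_intercept = intercept / intercept_se

print(f"MSE = {ms_error:.3f}, F = {f_stat:.3f}")
print(f"R Square = {r_squared:.6f}, r = {r:.6f}")
print(f"t(slope) = {t_slope:.3f}, t(constant) = {t_intercept:.3f}")
```

In simple linear regression the F statistic equals the square of the slope's t ratio, which gives one more consistency check on the table.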

18. What is an (approximate) 95% confidence interval for the slope β1?
A) (-7.57, 4.39)
B) (-4.52, 1.34)
C) (1.80, 2.41)
D) (1.95, 2.26)

19. Suppose we wish to test the hypotheses H0: β1 = 2 versus Ha: β1 ≠ 2. Together with an insignificant constant in this model, this would imply that the employees currently earn about twice as much as their starting salary. At the 5% significance level, would we reject the null hypothesis?
A) Yes
B) No
C) This cannot be determined from the information given.

20. John Doe works for this company. He started with a salary of $15,300. Predict his current salary with a 90% interval. Express the interval in the appropriate units.
A) ($15,683, $45,537)
B) ($18,205, $43,014)
C) ($28,580, $32,640)
D) ($31,516, $32,885)

CAUTION: the next 4 questions refer to the following information:
A researcher is investigating possible explanations for deaths in traffic accidents. He examined data from 1991 for each of the 50 states plus Washington, D.C. The data included information on the following variables:

Deaths     The number of deaths in traffic accidents
Income     The average income per family
Children   The number of children (in multiples of 100,000) between the ages of 1 and 14 in the state

As part of his investigation he ran the following multiple regression model

Deaths = β0 + β1 (Children) + β2 (Income) + εi,

where the deviations εi were assumed to be independent and normally distributed with mean 0 and standard deviation σ. This model was fit to the data using the method of least-squares. The following results were obtained from statistical software.

Source   Sum of Squares   df
Model       48362278       2
Error        3042063      48

Variable   Coefficient   Standard Error
Constant     593.829        204.114
Children      90.629          3.305
Income        -0.039          0.015

21. Suppose we wish to test the hypotheses H0: β1 = β2 = 0 versus Ha: at least one of the βj is not 0 using the ANOVA F-test. What is the value of the F statistic?
A) 12.45
B) 17.3
C) 381.5
D) 402.5

22. What can we say about the P-value for the ANOVA F-test?
A) P-value < 0.002
B) 0.002 < P-value < 0.01
C) 0.01 < P-value < 0.02
D) P-value > 0.02

23. What is a 95% confidence interval for β1, the coefficient of the variable Children?
A) 90.629 ± 5.55
B) 90.629 ± 3.305
C) 90.629 ± 6.68
D) 90.629 ± 4.65

24. What proportion of the variation in the variable Deaths is explained by the explanatory variables Children and Income?
A) 0.380
B) 0.941
C) 0.980
D) 0.840

CAUTION: the next 4 questions refer to the following information:
A store owner wishes to compare the average amount of money high school and college students spend on CDs. He randomly selects ten students from three different student populations: high school students, undergraduate students, and graduate students. The statistical assumptions required to perform a one-way ANOVA to compare the means of these three groups are reasonable based on the data. A partially completed ANOVA table is provided below.

Source   Sum of Squares   DF   Mean Square   F   p-value
Groups
Error         3240
Total         4450

25. What are the degrees of freedom for the F statistic?
A) 2 and 10
B) 2 and 27
C) 3 and 27
D) 3 and 30

26. What is the value of the pooled standard deviation?
A) 10.95
B) 24.60
C) 56.92
D) 120

27. What is the value of the F statistic?
A) 3.36
B) 3.73
C) 5.04
D) 5.99

28. At the 1% significance level, would we reject the null hypothesis?
A) Yes
B) No
C) This cannot be determined from the information given.

CAUTION: the next 3 questions refer to the following information:
Twenty high school-aged students are randomly selected from three different school districts: a district in the city, a district in the suburbs, and a district in a rural area. Each group of twenty students consisted of 10 boys and 10 girls. Each of the students was asked what price they paid for their last haircut. The two-way ANOVA output is shown below in an incomplete form.

Source       Sum of Squares   DF   Mean Square      F       p-value
Sex             2674.005       1     2674.005     67.379     0.000
Region           703.583       2      351.791      8.864     0.000
Sex*Region       231.803              115.901      2.920
Error           2143.037               39.686
Total           5752.427      59

29. What is the value of the sum of squares for the model?
A) 2143.037
B) 3377.588
C) 3609.391
D) 5752.427
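The two ANOVA tables above can be completed with simple arithmetic. The Python sketch below fills in the missing entries. It rests on one loudly stated assumption: in the one-way table, 3240 is read as the Error sum of squares and 4450 as the Total, which is the reading consistent with the answer key. All variable names are illustrative.

```python
import math

# One-way ANOVA (CD spending): 3 groups of 10 students each.
# ASSUMPTION: 3240 is the Error sum of squares and 4450 is the Total.
n_groups, n_per_group = 3, 10
n_total = n_groups * n_per_group
ss_error_1w, ss_total_1w = 3240.0, 4450.0

df_groups = n_groups - 1                 # numerator df
df_error_1w = n_total - n_groups         # denominator df
ss_groups = ss_total_1w - ss_error_1w    # sums of squares add up to the total
ms_groups = ss_groups / df_groups
ms_error_1w = ss_error_1w / df_error_1w
pooled_sd = math.sqrt(ms_error_1w)       # pooled SD is the root of the MSE
f_one_way = ms_groups / ms_error_1w

# Two-way ANOVA (haircut prices): 2 sexes x 3 regions, 60 students in total.
ss = {"Sex": 2674.005, "Region": 703.583, "Sex*Region": 231.803, "Error": 2143.037}
ss_total_2w, df_total_2w = 5752.427, 59

df_sex, df_region = 2 - 1, 3 - 1
df_interaction = df_sex * df_region
df_error_2w = df_total_2w - df_sex - df_region - df_interaction
ss_model = ss_total_2w - ss["Error"]     # model SS = everything except error
f_interaction = (ss["Sex*Region"] / df_interaction) / (ss["Error"] / df_error_2w)

print(df_groups, df_error_1w, round(pooled_sd, 2), round(f_one_way, 2))
print(df_interaction, df_error_2w, round(ss_model, 3), round(f_interaction, 3))
```

Under these assumptions the recomputed mean squares and F ratio for the interaction agree with the entries printed in the two-way table, which supports the reading of the table used here.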

30. The degrees of freedom for the interaction term and the error term are missing from the table. What are their values?
A) 2 and 54
B) 6 and 54
C) 2 and 56
D) 6 and 56

31. Is the interaction effect statistically significant at the 5% significance level?
A) Yes
B) No
C) This cannot be determined from the information given.

CAUTION: the next 4 questions refer to the following information:
Does income increase when you get married? To investigate this, a college student has randomly selected people from his college town and collected income data for these people. They were categorized according to their marital status. The following results (in thousands of dollars) were obtained:

Marital status   Income ($1000)
Married          136   116    82   166    72
Not married       33    78    76   125    80

The data are to be analyzed using the Wilcoxon rank sum test.

32. The null hypothesis is that income has the same distribution for married and unmarried people. What is the appropriate alternative hypothesis?
A) The distribution of income for married people has values that are systematically larger than for unmarried people.
B) The distribution of income for unmarried people has values that are systematically larger than for married people.
C) The distribution of income for one of the two groups of people has values that are systematically larger than for the other group.
D) None of the above.
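The ranking step behind the Wilcoxon rank sum test can be sketched for the income data above. This Python example pools the ten incomes, assigns ranks, and computes the rank sum W together with its mean and standard deviation under the null hypothesis. W is taken here as the sum of the ranks of the married group, which is an assumption about which sample the statistic is based on; the names are illustrative.

```python
import math

# Incomes in thousands of dollars, as given in the table.
married = [136, 116, 82, 166, 72]
not_married = [33, 78, 76, 125, 80]

# Rank all ten incomes together, smallest = rank 1 (no ties in these data).
combined = sorted(married + not_married)
rank = {value: position for position, value in enumerate(combined, start=1)}

# ASSUMPTION: W is the sum of the ranks in the married group.
w = sum(rank[x] for x in married)

# Mean and standard deviation of W when both distributions are the same.
n1, n2 = len(married), len(not_married)
n = n1 + n2
mean_w = n1 * (n + 1) / 2
sd_w = math.sqrt(n1 * n2 * (n + 1) / 12)

print(f"rank of 116: {rank[116]}")
print(f"W = {w}, mean = {mean_w}, sd = {sd_w:.2f}")
```

Because the two rank sums always add up to n(n + 1)/2 = 55, the rank sum for the unmarried group follows directly as 55 − W.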

L01.9 D) 38.79 C) 22.Winter 2008 Page 15 of 15 33. and September. L02 and L06 .Final Exam .Stat 217 .67 35.000 receive? A) 3 B) 4 C) 5 D) 7 34. May. The resulting data were grouped by month of birth: January. Arrange the ten observations in order and assign ranks. What is the observed value of W ? A) 6 B) 21 C) 34 D) 55 CAUTION: the next 3 questions refer to the following information: At what age do babies learn to crawl? Does it take longer to learn in the winter when babies are often bundled in clothes that restrict their movement? Data were collected from parents who brought their babies into the University of Denver Infant Study Center to participate in one of a number of experiments between 1988 and 1991. Parents reported the birth month and the age at which their child was ﬁrst able to creep or crawl a distance of four feet within one minute. What rank does the observation with an income of $116. . Birth month January May September Crawling age (month) 26 29 33 27 29 31 31 34 37 The data is to be analyzed with the Kruskal-Wallis test. What is the standard deviation of W under the null hypothesis that the two distributions are the same? A) 4.56 B) 4.
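The Kruskal-Wallis statistic for the crawling data above can be sketched in plain Python. The layout of the nine values is ambiguous in this copy, so the grouping is an assumption: the sketch tries both a row-wise reading (three consecutive values per month) and a column-wise reading, and for these particular numbers both give the same H. The formula used is the basic one without a correction for ties, and the function names are illustrative.

```python
def average_ranks(values):
    """Map each distinct value to its rank, averaging the ranks of ties."""
    ordered = sorted(values)
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and ordered[j] == ordered[i]:
            j += 1
        # 0-based positions i..j-1 share one value; their 1-based ranks are i+1..j.
        ranks[ordered[i]] = (i + 1 + j) / 2
        i = j
    return ranks


def kruskal_wallis_h(groups):
    """H = 12 / (N (N + 1)) * sum(R_i^2 / n_i) - 3 (N + 1), no tie correction."""
    pooled = [x for group in groups for x in group]
    n = len(pooled)
    ranks = average_ranks(pooled)
    weighted = sum(sum(ranks[x] for x in group) ** 2 / len(group) for group in groups)
    return 12 / (n * (n + 1)) * weighted - 3 * (n + 1)


# ASSUMPTION: two possible readings of the flattened data table.
row_wise = [[26, 29, 33], [27, 29, 31], [31, 34, 37]]      # Jan, May, Sep
column_wise = [[26, 27, 31], [29, 29, 34], [33, 31, 37]]   # Jan, May, Sep

h_rows = kruskal_wallis_h(row_wise)
h_cols = kruskal_wallis_h(column_wise)
print(f"H (row-wise) = {h_rows:.2f}, H (column-wise) = {h_cols:.2f}")
```

With three groups, H is compared with a chi-square distribution on 2 degrees of freedom, whose upper 5% point is about 5.99.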

36. The null hypothesis is that the crawling ages have the same distribution in all groups. What is the alternative hypothesis?
A) Not all three mean crawling ages are equal.
B) The mean crawling age in September is higher than the other two months.
C) The crawling times in September are systematically higher than in the other two months.
D) None of the above.

37. What is the value of the Kruskal-Wallis statistic H for these data?
A) H = 2.77
B) H = 3.10
C) H = 3.76
D) H = 5.99

38. At the 5% significance level, would we reject the null hypothesis?
A) Yes
B) No
C) This cannot be determined from the information given.

Have a nice summer!

