You are on page 1of 2

# Chapter 3

AP Practice Quiz
Date ___________________

Name ________________________________________________________

1. A simple random sample of 50 families produced these statistics: number of children in family: x = 2.1, sx = 1.4 annual gross income: y = 34,250, sy = 10,540 r = 0.75 The linear regression equation relating these variables, based on these data, is income = 5,646(number of children) + 22,392 income = 34,250 + 0.0001(number of children) income = 0.0001(number of children) 1.312 number of children = 5,646(income) + 22,392 The equation cannot be determined from the given information. 2. A study using a simple random sample of 40 college students recorded their hours of part-time work per week and grade point average and found that the correlation coefficient between the variables was 0.43. If the resulting linear regression is GPA = 3.75 0.05 (number of hours) which of these is not a correct statement? The average GPA of students who dont work is approximately 3.75. If the correlation coefficient were 0.60, the slope of the regression equation would be approximately 0.07. Students who work 40 hours per week have a mean GPA of approximately 1.75. The value of the correlation coefficient and steepness of the regression lines are not related. All of these are correct statements. 3. If the correlation coefficient of a bivariate set of data {(x, y)} is r, which of these are tue? The variables x and y are linearly related. The correlation coefficient of the set {(y, x)} is also r.

The correlation coefficient of the set {(x, ay)} is a r. The correlation coefficient of the set {(ax, ay)} is a r. None of these are true. 4. Which of these is a correct conclusion based on the displayed residual plot produced from a least squares line that is shifted off-center?

If you use the line to predict y from x, the predictions will tend to be too small. If you use the line to predict y from x, the predictions will tend to be too large. It is not appropriate to fit a line to these data because there is clearly no correlation between the variables. The variables y and x do not appear to be linearly related. None of these choices is correct. 5. In this scatterplot of y versus x, the least squares regression line is graphed. Which of points AE has the largest residual?

A B C D E

Chapter 3

## AP Practice Quiz (continued)

6. Students scores on two exams in a statistics course are given here, along with a scatterplot with regression line and a residual plot. The regression equation is Exam 2 = 51.0 + 0.430(Exam 1) and the correlation, r, is 0.756.
Exam 1 Exam 2 80 52 87 95 67 71 97 96 88 100 88 86 81 61 97 88 83 87 92 75 78 97 85 93 93 86 85 81 73 92 Exam 1 96 78 93 92 91 96 69 76 91 98 83 96 95 80 Exam 2 99 90 88 92 93 92 73 87 91 97 89 83 97 86

a. Is there a point that is more influential than the others on the slope of the regression line? How can you tell from the scatterplot? From the residual plot? b. How will the slope change if the scores for this one influential point are removed from the data set? How will the correlation change? Calculate the slope and correlation for the revised data to check your estimate. c. Construct a residual plot for the revised data. Does a linear model fit the data well? d. Refer to the scatterplot of Exam 2 versus Exam 1 scores. Does this plot illustrate regression to the mean? Explain your reasoning.