Professional Documents
Culture Documents
Full Download Test Bank For Statistics The Art and Science of Learning From Data 4Th Edition Alan Agresti PDF
Full Download Test Bank For Statistics The Art and Science of Learning From Data 4Th Edition Alan Agresti PDF
Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
Income Group
Low Middle High
Workaholic HBP NBP HBP NBP HBP NBP
Yes 25 75 23 87 26 134
No 25 80 18 72 9 51
Consider each income group separately and find the proportion with high blood pressure among
the workaholics and the proportion with high blood pressure among the non-workaholics.
Then determine the proportion of workaholics overall who have high blood pressure and the
proportion of non-workaholics overall who have high blood pressure (ignoring information about
income group). Explain how these data satisfy Simpson's paradox. How would you explain what is
responsible for this result?
Smoking Status
Smoker Nonsmoker
Likes Risky Things 45 46
Doesnʹt Like Risky Things 36 153
a. Create a side-by-side bar graph that compares smoking status with respect to risky
things status.
b. Summarize the results of the side-by-side bar graph.
c. Describe how the graph would look if there was not an association between smoking
and unhealthy behaviors among Mexican -American male adolescents.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
^
3) A regression line for predicting Internet usage (%) for 39 countries is y = -3.61 + 1.55x, where x is the per 3)
capita GDP, in thousands of dollars, and y is Internet usage. Interpret the residual for one of the 39 countries
with per capita GDP of $15,000 and actual Internet use of 20 percent.
A) The actual Internet usage for this country is 3.25% lower than expected from the regression equation.
B) The actual Internet usage for this country is 3.6% higher than expected from the regression equation.
C) The actual Internet usage for this country is 3.25% higher than expected from the regression equation.
D) The actual Internet usage for this country is 0.36% higher than expected from the regression equation.
E) The actual Internet usage for this country is 0.36% lower than expected from the regression equation.
1
4) The relationship between the number of games won by a minor league baseball team and the average 4)
attendance at their home games is analyzed. A regression to predict the average attendance from the number
of games won has an r = 0.73. Interpret this statistic.
A) Positive, fairly strong linear relationship. 53.29% of the variation in average attendance is explained
by the number of games won.
B) Negative, fairly strong linear relationship. 53.29% of the variation in average attendance is explained
by the number of games won.
C) Positive, weak linear relationship. 7.29% of the variation in average attendance is explained by the
number of games won.
D) Positive, fairly strong linear relationship. 73% of the variation in average attendance is explained by
the number of games won.
E) No association
5) A teacher recorded for each of five students their score on a quiz (x) and their score on a project (y). A 5)
scatter plot of the points and the corresponding regression line are shown below.
y
21
18
15
Score on Project
12
9
6
3
2 4 6 8 10 12 14 16 18 20 x
Score on Quiz
^
6) A regression line for predicting the selling prices of homes in Chicago is y = 168 + 102x, where x is the 6)
square footage of the house. Interpret the residual for a house with 1800 square feet that recently sold for
$200,000.
A) The house sold for $16,400 less than was to be expected from the regression equation.
B) The house sold for $16,064 more than was to be expected from the regression equation.
C) The house sold for $16,232 more than was to be expected from the regression equation.
D) The house sold for $16,400 more than was to be expected from the regression equation.
E) The house sold for $16,232 less than was to be expected from the regression equation.
2
7) Almost all of the acidity of soda pop comes from the phosphoric acid which is added to give them a sharper 7)
flavor. Is there an association between the pH of the soda and the amount of phosphoric acid (in grams)? The
correlation between pH and phosphoric acid is -0.991. Describe the association.
A) Strong linear association in a positive direction
B) Very strong linear association in a negative direction
C) Weak linear association in a negative direction
D) Weak linear association in a positive direction
E) No evidence of association
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
8) A large manufacturer hires many handicapped workers and keeps track of both their type of 8)
handicap and their level of performance.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
9) A researcher recorded for each of a number of recent high school and college graduates the number of years 9)
of education and their annual salary when they started their first job. The scatter plot is shown below
together with the regression equation. The point (14, 4) was excluded when obtaining the regression
equation.
y
48
Salary (Thousands of Dollars)
32
16
4 8 12 16 20 x
Years of Education
10) For 14 baseball teams, the correlation with number of wins in the regular season is 0.51 for shutouts, 0.61 for 10)
hits made, -0.70 for runs allowed and -0.56 for homeruns allowed. Which variable has the weakest linear
association with number of wins?
A) homeruns allowed B) shutouts
C) hits made D) runs allowed
3
11) A researcher records for each participant in a clinical trial the amount of alcohol they consume per 11)
month and their cholesterol level. Indicate whether each variable is categorical or quantitative.
A) alcohol: quantitative; cholesterol level: quantitative
B) alcohol: categorical; cholesterol level: categorical
C) alcohol: quantitative; cholesterol level: categorical
D) alcohol: categorical; cholesterol level: quantitative
12) In order for a data point to be considered influential, which of the following must hold? 12)
I) the point has a large residual for the regression line fitted including that point
II) its x value must be relatively low or high compared to the rest of the data
III) the point has a large residual for the regression line fitted without using that data point
A) II only B) I and II C) III only D) II and III E) I only
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
14) When two explanatory variables are both associated with a response variable but are also associated 14)
with each other, there is said to be ____________________.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
17) The advantage of a side-by-side bar graph is that it allows for easy comparison of the explanatory 17)
variable groups with respect to values on the response variable.
A) False B) True
4
19) If a positive association exists between two quantitative variables, 19)
A) the movement of x does not affect the movement of y.
B) none of these.
C) y tends to decrease as x increases.
D) y tends to decrease as x decreases.
E) y tends to increase as x decreases.
x x
C) D)
y y
x x
5
Determine the type of association apparent in the following scatterplot.
21) 21)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
6
The following scatterplot shows the percentage of the vote a presidential candidate received in an election according to the voter's
income level based on an exit poll of voters. The income levels 1-8 correspond to the following income classes: 1=Under $15,000;
2=$15-30,000; 3=$30-50,000; 4=$50-75,000; 5=$75-100,000; 6=$100-150,000; 7=$150-200,000; 8=$200,000 or more.
24) Which of the following describes the linear association between income level and percentage of the 24)
vote won by the candidate?
A) strong, negative
B) weak, negative
C) no linear association
D) strong, positive
E) weak, positive
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
7
Answer Key
Testname: CHAPTER 3 FORM B TEST
1) For low income: 25% high blood pressure for workaholics versus 23.8% high blood pressure for non-workaholics
For middle income: 20.9% high blood pressure for workaholics versus 20% high blood pressure for non-workaholics
For high income: 16.3% high blood pressure for workaholics versus 15% high blood pressure for non-workaholics
All income groups combined: 20% high blood pressure for workaholics.versus 20.4% for non-workaholics
Within each income group, the percentage with high blood pressure is higher for workaholics. However, when income groups
are combined the percentage with high blood pressure is higher for non-workaholics.
b. The graph clearly shows that the proportion of Mexican-American male adolescents that likes risky things is higher
for those Mexican-American male adolescents who are smokers. The proportion of Mexican-American male
adolescent smokers that likes risky things is almost two and a half times as high as the proportion of
Mexican-American male adolescent nonsmokers that likes risky things; c. If there was not an association, the two bars
in the ʺlikes risky thingsʺ group would be the same height and the two bars in the ʺdoesnʹt like risky thingsʺ group
would be the same height.
3) D
4) A
Copyright © 2017 Pearson Education, Inc.
8
Answer Key
Testname: CHAPTER 3 FORM B TEST
5) C
6) C
7) B
8) a. The two variables are type of handicap and level of performance;
b. response variable = level of performance, explanatory variable = type of handicap
9) D
10) B
11) A
12) D
13) Simpson's paradox
14) confounding
15) B
16) B
17) B
18) E
19) D
20) A
21) D
22)
9
Test Bank for Statistics: The Art and Science of Learning from Data, 4th Edition, Alan Agres
Answer Key
Testname: CHAPTER 3 FORM B TEST
25)
10