You are on page 1of 5

MDM 4U Two-Variable Analysis Assignment

Multiple Choice
Identify the choice that best completes the statement or answers the question.

____ 1. What is the independent variable in a correlational study of amounts of sunlight and the heights of tomato
plants?
a. the types of tomato plants
b. the heights of the tomato plants
c. the angle of the sun
d. the numbers of hours of sunlight

____ 2. Which set of data would probably show a strong positive linear correlation?
a. marks on a history test and the heights of the students
b. the number of defective light bulbs produced and the time of the day when they were
manufactured
c. the colour of cars sold and the annual income of the car buyers
d. the height of corn in a field and the amount of precipitation during the growing season

____ 3. A set of data with a correlation coefficient of –0.55 has a


a. strong negative linear correlation
b. moderate negative linear correlation
c. weak negative linear correlation
d. little or no linear correlation

____ 4. If a relationship has a strong, negative, linear correlation, the correlation coefficient that would be appropriate
is
a. –1.0 c. –0.45
b. –0.92 d. –0.32

____ 5. Which value of r would be appropriate for the scatter plot shown?

a. 0.2 c. 0.7
b. –0.6 d. 0.4

____ 6. Which value of r would be appropriate for the scatter plot shown?
a. –0.9 c. –0.5
b. 0.9 d. 0.5

____ 7. Using a linear-regression equation to predict values outside the range of the data is an example of
a. extrapolation c. least-squares fit
b. residuals d. interpolation

____ 8. Which graph shows the most appropriate line of best fit for the given scatter plot?
a. c.

b. d.

____ 9. The scatter plot shown includes an outlier in the upper left corner of the graph. The line of best fit is shown.

How would the line of best fit be affected if the outlier were removed?
a. The slope would decrease and remain positive.
b. The slope would decrease and become negative.
c. The slope would increase.
d. The slope would be unchanged.
Short Answer

10. Identify the likely outlier in this set of data.


X 17 103 93 54 44 33 26
Y 22 118 19 64 53 40 31

11. The two scatter plots shown generate the same line of best fit. Which will have the more reliable estimates?
Explain.

12. List the type of correlation and causal relationship that you would expect to find for each pair of variables.
a) the price of gasoline at the pump, the current world price of crude oil
b) the fish population in a lake, the number of cottages around the lake
c) the humidex rating (an index based on air temperature and humidity), the number of respiratory ailments
reported
d) the stock price of a telephone company, the cost of car insurance
e) parents’ educational level, their children’s success in school

13. Five players on the Statsville football team compare the widths of their hands and the numbers of passes they
caught during the last two seasons.

Palm Width (cm) 8 9.5 10.5 11 12


Passes Caught 40 39 44 45 51

a) Find the linear correlation coefficient for these data.


b) Does this coefficient prove that players with wider hands have an advantage in catching football passes?
Why or why not?

Problem

14. A study was done with a group of university students to determine if there was a correlation between the
amounts of sleep they got and their academic performance. The table lists some data from the study.
Student A B C D E F G H I J K L
Hours of Sleep 6.0 6.5 7.0 6.5 8.5 8.0 9.0 8.5 7.0 7.5 6.5 7.5
Average Mark 62 58 66 71 76 82 76 75 70 68 56 77

a) Make a scatter plot of these data.


b) Determine the correlation coefficient.
c) What would you conclude about the relationship between the two variables?

15. A manufacturer of flexible seals for industrial equipment tests samples of its seals at a variety of temperatures
and collects the following data.
Temperature (°C) 16 5 9 12 7 10
Seal Failures 3 12 8 6 4 7

a) Draw a scatter plot for these data.


b) Identify any outlier(s) and explain your choice(s).
c) Perform a linear-regression analysis on the data to determine a line of best fit for all the data points.
Determine the correlation coefficient for this data.
d) Remove the outlier(s) and repeat the analysis of part c).
e) Compare the equations of the two lines of best fit and the correlation coefficients from parts b) and c).
f) Why does an outlier have such a great effect on the analysis of these data

You might also like