101 views

Uploaded by Joanna

- vitruvian man
- kumpulan jurnal.pdf
- Export-Led Growth vs. Domestic Demand-Led Growth:
- Analysis of Missing Value Estimation Algoithms for Data Farming
- The Influence of After Sale Services on the Customer Loyalty (Marketing)
- COTRAN_3
- Syllabus Bcom 3 Sem
- 35
- Reverse Code
- 22 Linear Fit Post
- Working Condition Sewing Floor
- 12-Coorelation
- 01 Appendix
- multiple regression
- Eco Trics
- A level Mathematics Practice Paper G – Statistics and Mechanics mark scheme.docx
- Wald Press
- correlationalresearchdesign-111125101237-phpapp01
- Regression
- Introduction to Correlation and Regression Analysis

You are on page 1of 12

Chap 7-9

Practice Test

SECTION I

Number of Questions 42

Percent of Total Grade 50

Directions: Solve each of the following problems, using the available space (or extra paper) for

scratchwork. Decide which is the best of the choices given and place that letter on the ScanTron

sheet. No credit will be given for anything written on these pages for this part of the test. Do not

spend too much time on any one problem.

1. In a statistics course, a linear regression equation was computed to predict the final exam score from

the score on the first test. The equation was y = 10 + 0.9x where y is the final exam score and x

is the score on the first test. Carla scored 95 on the first test. What is the predicted value of her

score on the final exam?

A. 95

B. 85.5

C. 90

D. 95.5

E. none of these

2. Refer to the previous problem. On the final exam, Carla scored 98. What is the value of her residual?

A. 98

B. 2.5

C. -2.5

D. 0

E. none of these

3. A study of the fuel economy for various automobiles plotted the fuel

consumption (in liters of gasoline used per 100 kilometers traveled)

versus speed (in kilometers per hour). A Least Squares Regression

Line (LSRL) was fit to the data. Here is the residual plot from this

least squares fit. What does the pattern of the residuals tell you

about the linear model?

A. The evidence is inconclusive.

B. The residual plot confirms the linearity of the fuel economy data.

C. The residual plot does not confirm the linearity of the data.

D. The residual plot clearly contradicts the linearity of the data.

E. None of the above.

4. All but one of the following statements contains a blunder. Which statement is correct?

A. There is a correlation of 0.54 between the position a football player plays and his weight.

B. The correlation between planting rate and yield of corn was found to be r = 0.23.

C. The correlation between the gas milage of a car and its weight is r = 0.71 mpg.

D. We found a high correlation (r = 1.09) between the height and age of children.

E. We found a correlation of r = -0.63 between gender and political party preference.

A. near -1

B. clearly negative, but not near -1

C. near 0

D. clearly positive, but not near 1

E. near 1

6. In the scatterplot in the previous question, if each x-value were decreased by one unit and the y-values

remained the same, then the correlation r would

A. decrease by 1 unit

B. decrease slightly

C. increase slightly

D. stay the same

E. you cant tell without knowing the data values

A. those factors unexplained by the data

B. the difference between the observed responses and the values predicted by the regression line

C. those data points which were recorded after the formal investigation was completed

D. possible models unexplored by the investigator

E. none of these

A. the slope of the LSRL

B. the intercept of the LSRL

C. the extent to which cause and effect is present in the data

D. the percent of the variation in the values of y that is explained by least squares regression on

the other

E. the strength of the linear association between the variables

I. Correlation and regression require explanatory and response variables.

II. Scatterplots require that both variables be quantitative.

( )

A. I and II only

C. II and III only

D. I, II, and III

E. none of these

10. A local community college announces the correlation between college entrance exam grades and

scholastic achievement was found to be -1.08. On the basis of this, you would tell the college that

A. the entrance exam is a good predictor of success.

B. the exam is a poor predictor of success.

C. students who do best on this exam will be poor students.

D. students at this school are underachieving.

E. the college should hire a new statistician.

11. A researcher finds that the correlation between the personality traits greed and superciliousness is

-0.40. What percentage of the variation in greed can be explained by the relationship with

superciliousness?

A. 0%

B. 16%

C. 20%

D. 40%

E. 60%

12. Suppose the following information was collected, where x = diameter of tree trunk in inches,

and y = tree height in feet.

4

2

8

6

10

6

x

18

22

30

If the LSRL equation is y = -3.6 + 3.1x, what is your estimate of the average height of all trees

having a trunk diameter of 7 inches?

A. 18.1

B. 19.1

C. 20.1

D. 21.1

E. 22.1

13. Suppose we fit the LSRL to a set of data. What is true if a plot of the residuals shows a curved

pattern?

A. A straight line is not a good model for the data.

B. The correlation must be 0.

C. The correlation must be positive.

D. Outliers must be present.

E. The LSRL might or might not be a good model for the data, depending on the extent of

the curve.

A. least squares regression line

B. correlation coefficient

C. both the least squares line and the correlation coefficient

D. neither the least squares line nor the correlation coefficient

E. it depends

15. A study found correlation r = 0.61 between the sex of a worker and his or her income. You conclude

that

A. women earn more than men on average.

B. women earn less than men on average.

C. an arithmetic mistake was made; this is not a possible value of r.

D. there may be a lurking variable that is causing a moderately strong relationship.

E. there is a moderately strong, positive, linear relationship between sex and income.

16. A copy machine dealer has data on the number x of copy machines at each of 89 customer locations

and the number y of service calls in a month at each location. Summary calculations give

x = 8.4 , s x = 2.1 , y = 14.2 , s y = 3.8 , and r = 0.86 . What is the slope of the LSRL of

number of service calls on number of copiers?

A. 0.86

B. 1.56

C. 0.48

D. none of these

E. you cant tell from the information given

17. In the setting of the previous problem, about what percent of the variation in the number of service

calls is explained by the linear relation between number of service calls and the number of

machines?

A. 86%

B. 93%

C. 74%

D. none of these

E. you cant tell from the information given

18. If Data Set A of (x , y) data has correlation coefficient r = 0.65, and a second Data Set B has

correlation r = -0.65, then

A. the points in A exhibit a stronger linear association than B.

B. the points in B exhibit a stronger linear association than A.

C. neither A or B has a very strong linear association.

D. you cant tell which data set has a stronger linear association without seeing the data or

seeing the scatterplots.

E. A and B are actually the same data set.

19. There is a linear relationship between the number of chirps made by the striped ground cricket and

the air temperature. A least squares fit of some data collected by a biologist give the model

y = 25.2 + 3.3x (when x is between 9 and 25), where x is the number of chirps per minute

and y is the estimated temperature in degrees Fahrenheit. What is the estimated increase in

temperature that corresponds to an increase in 5 chirps per minute?

A. 3.3F

B. 16.5F

C. 25.2F

D. 28.5F

E. 41.7F

20. The equation of the LSRL for a certain set of data is y = 1.3 + 0.73x . What is the residual for the

point (4 , 7)?

A. 2.78

B. 3.00

C. 4.00

D. 4.22

E. 7.00

21. A set of data relates the amount of annual salary raise and the performance rating an employee

receives. The least squares regression equation is y = 1400 + 2000x where y is the estimated

raise and x is the performance rating. Which of the following statements is not correct?

A. For each increase of one point in performance rating, the raise will increase on average

by $2,000.

B. This equation produces predicted raises with an average error of 0.

C. A rating of 0 will yield a predicted raise of $1,400.

D. The correlation for the data is positive.

E. All of the above are true.

22. Suppose we fit the LSRL to a set of data. What do we call any individual points with unusually large

values of the residuals?

A. response variables

B. the slope

C. outliers

D. correlations

E. none of these

23. The effect of removing the right-most point (near the positive x-axis)

in the scatterplot shown would be that

A. the slope of the LSRL will increase; r will increase.

B. the slope of the LSRL will increase; r will decrease.

C. the slope of the LSRL will decrease; r will increase.

D. the slope of the LSRL will decrease; r will decrease.

E. there will be no change.

24. If removing an observation from a data set would have a marked change on the position of the LSRL

fit to the data, what is the point called?

A. robust

B. a residual

C. a response

D. influential

E. none of these

I. Two quantitative variables that are strongly associated will have a correlation near 1.

II. Regression requires an explanatory-response relationship, while correlation does not.

III. Even though the correlation between two variables may be high, in order to use the LSRL to

predict, there needs to be an explanatory-response relationship between x and y.

A. I and II only

D. I, II, and III

C. II and III only

E. none of these gives the complete set of true responses

26. Suppose the correlation between two variables x and y is due to the fact that both are responding to

changes in some unobserved third variable. What is this due to?

A. cause and effect between x and y

B. the effect of a lurking variable

C. extrapolation

D. common sense

E. none of these

27. Suppose a straight line is fit to data having response variable y and explanatory variable x. Predicting

values of y for values of x outside the range of the observed data is called

A. correlation

B. causation

C. extrapolation

D. sampling

E. none of these

I. High correlation does not necessarily imply causation.

II. A lurking variable is a name given to variables that cannot be identified or explained.

III. Successful prediction requires a cause and effect relationship.

A. I only

C. III only

B. II only

D. I and II only

E. none of these

29. There is a positive association between the number of local river drownings and ice cream sales at

lunch in the cafeteria. This is an example of an association likely caused by

A. lurking variable

B. cause and effect relationship

C. a confounding factor

D. common response

E. none of these

30. If the correlation between body weight and annual income were high and positive, we could conclude

that

A. high incomes cause people to eat more food.

B. low incomes cause people to eat less food.

C. high-income people tend to spend a greater proportion of their income on food than

low-income people, on average.

D. high-income people tend to be heavier than low-income people, on average.

E. high incomes cause people to gain weight.

and sepal width for two varieties of an exotic tropical plant.

Variety A and Variety B are represented by xs and os,

respectively, in the following plot:

A. Considering Variety A alone, there is a negative correlation between sepal length and

sepal width.

B. Considering Variety B alone, the least squares regression line for predicting sepal

length from sepal width has a negative slope.

C. Considering both varieties together, there is a positive correlation between sepal length

and sepal width.

D. Considering each variety separately, there is a positive correlation between sepal length

and sepal width.

E. Considering both varieties together, the least squares regression line for predicting

sepal length from sepal width has a positive slope.

I. Removal of an influential point changes the regression line.

II. Data points that are outliers in the horizontal direction are more likely to be influential than

points that are outliers in the vertical direction.

III. Influential points have large residuals.

A. I only

C. I and III

B. I and II

D. II and III

33. The model strength

(in pounds) from its diameter (in inches). According to this model, how much force should a rope

one-half inch in diameter be able to withstand?

A. 4.7 lbs

B. 16 lbs

C. 22 lbs

D. 256 lbs

E. 484 lbs

2

34. One of the three flavors of unusual points is the type that confirms the pattern, but is an extreme

data point. These points

I. have high leverage and large residuals.

II. have negligible effect on the value of R 2 .

III. have no impact on the slope of the line of best fit, but greatly change its y-intercept.

A. II only

C. I and II

B. III only

D. all of these are true

I. it will help us see whether our model is appropriate.

II. it might show a pattern in the data that was hard to see in the original scatterplot.

III. it will clearly identify influential points.

A. I only

C. I and II only

B. II only

D. I and III only

I. Random scatter in the residuals indicates a model with high predictive power.

II. If two variables are very strongly associated, then the correlation between them will be near

+1.0 or 1.0.

III. The higher the correlation between two variables, the more likely the association is based on

cause and effect.

A. none of these

C. II only

B. I only

D. I and II only

removed from the data set, then what will happen to the

slope of the line of best fit, b, and to the correlation, r?

A. both will increase

B. both will decrease

C. b will increase, and r will decrease

D. b will decrease, and r will increase

E. both will remain the same

38. All but one of the statements below contain a mistake. Which one could be true?

A. The correlation between height and weight is 0.568 inches per pound.

B. The correlation between weight and length of foot is 0.488.

C. The correlation between the breed of a dog and its weight is 0.435.

D. The correlation between gender and age is -0.171.

E. If the correlation between blood alcohol level and reaction time is 0.73, then the correlation

between reaction time and blood alcohol level is -0.73.

39. A regression analysis of students college Grade Point Averages and their high school GPAs

found R 2 = 0.311 . Which of these is true?

I. High school GPA accounts for 31.1% of college GPA.

II. 31.1% of college GPAs can be correctly predicted with this model.

III. 31.1% of the variance in college GPA can be accounted for by this model.

A. I only

C. III only

B. II only

D. I and II

E. none

40. When using midterm exam scores to predict a students final grade in a class, the student would

prefer to have a

A. positive residual, because that means the students final grade is higher than we would

predict with this model.

B. positive residual, because that means the students final grade is lower than we would

predict with this model.

C. residual equal to zero, because that means the students final grade is exactly what we

would predict with the model.

D. negative residual, because that means the students final grade is higher than we would

predict with this model.

E. negative residual, because that means the students final grade is lower than we would

predict with the model.

41. Using the scatterplot shown and the stray point, which of

the following is true?

I. The stray point has little leverage, but a large residual.

II. The stray point is not influential.

III. If the stray point were removed from the data, the

correlation would become stronger and become

more negative because scatter would decrease.

A. I only

C. II and III only

D. all of these are true

A. dont extrapolate much, if at all.

B. extrapolate only within the data you have been given.

C. extrapolate only up to five points past your last data point, but no further.

D. there are no clear guidelines on extrapolating, so do as you please.

E. never, never, never extrapolate.

SECTION II

Questions 43-57

Percent of Total Grade 50

Directions: Show all of your work. Indicate clearly the methods you use, because you will be

graded on the correctness of your methods as well as on the accuracy of your results and

explanations.

Assembly Line.

Your new job at Panasony is to do the final assembly of camcorders.

As you learn how, you get faster. The company tells you that you will

qualify for a raise if, after 13 weeks, your assembly time averages under

20 minutes. The data shows your average assembly time during each

of your first 10 weeks.

Week

Time (min)

1

2

3

4

5

6

7

8

9

10

43

39

35

33

32

30

30

28

26

25

43. ______________________

44. ________________

45. You want to predict whether or not you will quality for that raise. Would is be appropriate to use a

linear model? Defend your position.

Associations.

For each pair of variables, indicate what association you expect. Use P for positive, N for negative,

C for curved, or O for none.

46. Power level setting of a microwaveandnumber of minutes it takes to boil water 46. __________

you mowed your lawn that month

47. __________

minutes it takes the person to do a crossword puzzle

48. __________

suntan lotion in Minnesota during that week

49. __________

50. Length of a students hairandnumber of credits the student earned last year

50. __________

A couple of years ago a local newspaper published research results claiming a positive association

between the number of years high school children had taken instrumental music lessons and their

performances in school as measured by their GPA.

51. A group of parents then went to the school board demanding more funding for music programs as a

way to improve student chances for academic success in high school. As a statistician, do you

agree or disagree with their reasoning? Briefly explain.

Crawling.

Researchers at the University of Denver Infant Study Center investigated whether babies take longer to

learn to crawl in cold months (when they are often bundled in clothes that restrict their movement) than in

warmer months. The study sought an association between babies first crawling age (in weeks) and the

average temperature during the month they first try to crawl (about 6 months after birth). Between 1988

and 1991 parents reported the birth month and age at which their child was first able to creep or crawl a

distance of four feet in one minute. Data were collected on 208 boys and 206 girls. The graph below

plots average crawling ages (in weeks) against the mean temperatures when the babies were 6 months old.

The researchers found a correlation of r = -0.70 and their scatterplot was:

- vitruvian manUploaded byapi-312549818
- kumpulan jurnal.pdfUploaded byyuda
- Export-Led Growth vs. Domestic Demand-Led Growth:Uploaded byRuby Ojha
- Analysis of Missing Value Estimation Algoithms for Data FarmingUploaded byResearch Cell: An International Journal of Engineering Sciences
- The Influence of After Sale Services on the Customer Loyalty (Marketing)Uploaded byMuhammad Nawaz Khan Abbasi
- COTRAN_3Uploaded byulastuna2001
- Syllabus Bcom 3 SemUploaded bymahesh5050
- 35Uploaded byMuhammad Arif
- Reverse CodeUploaded byMiana Yana
- 22 Linear Fit PostUploaded by555568984948
- Working Condition Sewing FloorUploaded byAnonymous wA6NGuyklD
- 12-CoorelationUploaded bydranu3
- 01 AppendixUploaded byArun Sethuram
- multiple regressionUploaded byarmailgm
- Eco TricsUploaded byakshay patri
- A level Mathematics Practice Paper G – Statistics and Mechanics mark scheme.docxUploaded byZaka Ahmed
- Wald PressUploaded byKumar Abhishek
- correlationalresearchdesign-111125101237-phpapp01Uploaded byJapeth Purisima
- RegressionUploaded byluispedro1985
- Introduction to Correlation and Regression AnalysisUploaded byshoaib
- MA Assignment 2_G7Uploaded byVijeta Gour
- TInspire CoreUploaded bythor1s
- j NakamuraUploaded byBhoomika Shetty M
- Forecasting PPT NotesUploaded bykunwar
- Measuring the Impact of Foreign Institutional Investments on S&P CNX Nifty - A Pragmatic StudyUploaded bychitkarashelly
- Least Squares Fitting of a Straight LineUploaded byNigel Gray
- Regression CorrUploaded byChristopher H. Clarke
- AFC SyllabusUploaded byM Mumtaz
- Estatística AdsorçãoUploaded byErivaldo Reis
- 67309sipos&bizoiUploaded byAnonymous p52JDZOd

- Graficas en Economia - Capt 1a - EconomiaUploaded byCarlos Soto-Santoni
- analyzing wwi e-portfolioUploaded byapi-289959242
- Excel ManualUploaded byalinds16
- Chapter 09Uploaded byAtin Zai
- software quality management plan.docxUploaded byselinasimpson1501
- 9 Seven Tools of QualityUploaded byAman Chhabra
- 57732_Chapter_8Uploaded byJerin
- visualization using r qplotUploaded byapi-355102227
- Business StastasicUploaded byPraveen Kumar Singh
- Math 13Uploaded byKilliam Wettler
- Regression and MulticollinearityUploaded byMalkah Noor
- Descriptive ResearchUploaded byDiana Jane Lagundino
- Statistics Laerd Com Spss Tutorials Linear Regression UsingUploaded byabcxyz7799
- 7 Basic Quality Control Tools for SuccesUploaded byCarlos Oliver Montejano
- Notes - Correlation & RegressionUploaded byTeo Liang Wei
- model of a process-based quality management system.docxUploaded byselinasimpson1501
- technology mini lesson planUploaded byapi-268604495
- Be Awesome in Ggplot2Uploaded byShengyun Zhan
- chapter 10 part 1Uploaded byapi-232613595
- KPI Measurement Monitoring and Analysis GuideUploaded bymdavange
- Quantitative Research samplesUploaded byKamil Irmansyah
- gapminder assignmentUploaded byapi-282791615
- applying quality management in healthcare.docxUploaded byselinasimpson1601
- Basic principles of graphing data.PDFUploaded byKevin Campbell
- AgainstAllOdds_StudentGuide_Unit30Uploaded byHacı Osman
- 2965_01 5540F Paper 2 Foundation Tier November 2008Uploaded bystretfordhigh
- Tools and Techniques of Quality ManagementUploaded byZahid Nazir
- Relationship Between Product Quality and Customer SatisfactionUploaded byCobraCMM
- Regression Analysis.pdfUploaded byandrew myintmyat
- The Right GraphUploaded byClerenda Mcgrady