
5. A student scored 70 percent on a test, and was in the 80th percentile. Explain these two
numbers.
The 70 percent means that the student earned 70% of the total points available. The 80th
percentile describes the student's standing relative to the others who took the test: about
80% of them scored below this student's 70%, and about 20% scored at or above it.
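The distinction can be sketched in a few lines of Python. The class scores below are hypothetical (they are not part of the problem), and the percentile rank is taken here as the percentage of scores strictly below the given score, which is one common convention among several.

```python
def percent_score(points_earned, points_possible):
    """Share of the available points that the student earned, as a percentage."""
    return 100.0 * points_earned / points_possible


def percentile_rank(score, all_scores):
    """Percentage of scores in the group that fall strictly below `score`."""
    below = sum(1 for s in all_scores if s < score)
    return 100.0 * below / len(all_scores)


# Hypothetical class of 10 test scores (percent correct); illustrative only.
class_scores = [40, 45, 50, 55, 58, 60, 65, 68, 70, 85]

student_percent = percent_score(35, 50)           # 35 of 50 points earned
student_rank = percentile_rank(70, class_scores)  # standing within the class

print(f"percent score: {student_percent}, percentile rank: {student_rank}")
```

The first number depends only on the student's own paper; the second depends on how everyone else did.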
6. An animal trainer obtained the following data (Table A) in a study of reaction time of
dogs (in seconds) to a specific stimulus. He then selected another group of dogs that were
much older than the first group and measured their reaction time (data in Table B) to the
same stimulus.

Class limits (s)    Table A frequency    Table B frequency
2.3-2.9             10                   1
3.0-3.6             12                   3
3.7-4.3              6                   4
4.4-5.0              8                   16
5.1-5.7              4                   14
5.8-6.4              2                   4

Find the variance and standard deviation for the two distributions above. Compare the
variation of the data sets. Decide if one data set is more variable than the other.

Please see the attached excel sheet for calculations.


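Using the class midpoints (2.6, 3.3, 4.0, 4.7, 5.4, 6.1) as the value of every observation
in a class, with n = 42 in each table, the sample variance and standard deviation are:
Table A:
Variance = 1.0950
Standard Deviation = 1.0464
Table B:
Variance = 0.6343
Standard Deviation = 0.7964
The data set in Table A is more variable, as it has the larger variance and standard
deviation.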
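The grouped-data calculation can be sketched as follows. This assumes that each class midpoint stands in for every observation in its class and that the sample (n - 1) formula is wanted; the function name is illustrative. The printed values are worth comparing against the spreadsheet figures.

```python
import math


def grouped_sample_variance(class_limits, freqs):
    """Sample variance of grouped data, using class midpoints as the values."""
    midpoints = [(lo + hi) / 2 for lo, hi in class_limits]
    n = sum(freqs)
    mean = sum(f * m for f, m in zip(freqs, midpoints)) / n
    sum_sq_dev = sum(f * (m - mean) ** 2 for f, m in zip(freqs, midpoints))
    return sum_sq_dev / (n - 1)


# Class limits and frequencies as given in the problem statement.
classes = [(2.3, 2.9), (3.0, 3.6), (3.7, 4.3), (4.4, 5.0), (5.1, 5.7), (5.8, 6.4)]
freq_a = [10, 12, 6, 8, 4, 2]
freq_b = [1, 3, 4, 16, 14, 4]

var_a = grouped_sample_variance(classes, freq_a)
var_b = grouped_sample_variance(classes, freq_b)

print(f"Table A: variance = {var_a:.4f}, sd = {math.sqrt(var_a):.4f}")
print(f"Table B: variance = {var_b:.4f}, sd = {math.sqrt(var_b):.4f}")
```

Dividing by n instead of n - 1 would give the population variance; the ordering of the two tables is the same either way.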

7. Answer the following:


a. What is the relationship between the sign of the correlation coefficient and the slope of
the regression line?

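They always have the same sign. The slope of the regression line is b = r(s_y / s_x), where
s_x and s_y are the standard deviations of X and Y; because standard deviations are positive,
b and r share a sign. Equivalently, the correlation coefficient is the slope of the
regression line when both the X and Y variables have been converted to z-scores.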
b. As the value of the correlation coefficient increases from 0 to 1, or decreases from 0 to
-1, how do the points of the scatter plot fit the regression line?
When r=0.0 the points scatter widely about the plot, the majority falling roughly in
the shape of a circle. As the linear relationship increases, the circle becomes more
and more elliptical in shape until the limiting case is reached (r=1.00 or r=-1.00) and
all the points fall on a straight line.
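8. Describe in your own words the line of best fit.
This is the line that comes closest to the points in a scatter diagram taken as a whole. It
is the least-squares line: the one that minimizes the sum of the squared vertical distances
between the points and the line.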
9. In your own words, state the primary purpose of:
a. Linear correlation analysis
In linear correlation analysis, we are attempting to measure the strength and direction of
the linear relationship between two variables. The closer the correlation coefficient is to
1 or -1, the stronger the linear relationship between the two variables under study.
b. Regression analysis
In regression analysis, we are attempting to find the best-fitting line, from which we
can predict the value of the dependent variable given the value of the independent
variable.
10. You are given the following data.

Number of Absences    Final Grade
0                     96
1                     91
2                     78
2                     83
3                     75
3                     62
4                     70
5                     68
6                     56
a. Create a scatter plot for the data.

b. Find the correlation coefficient for the data.


r = -0.91 (please see the attached Excel sheet). The coefficient is negative because final
grades fall as the number of absences rises.
c. Find the equation for the regression line for the data, and predict the final grade of a
student who misses 3.5 days.

y = -6.2846x + 93.6
y = -6.2846(3.5) + 93.6
= 71.6
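The correlation coefficient and the regression line for parts b and c can be checked with a short least-squares sketch. The helper name `linear_fit` is illustrative; the formulas are the standard ones based on sums of squared deviations.

```python
import math

# Data from the problem statement.
absences = [0, 1, 2, 2, 3, 3, 4, 5, 6]
grades = [96, 91, 78, 83, 75, 62, 70, 68, 56]


def linear_fit(xs, ys):
    """Return (r, slope, intercept) for the least-squares line of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    r = sxy / math.sqrt(sxx * syy)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return r, slope, intercept


r, slope, intercept = linear_fit(absences, grades)
predicted = slope * 3.5 + intercept  # predicted grade for 3.5 absences

print(f"r = {r:.2f}")
print(f"y = {slope:.4f}x + {intercept:.1f}")
print(f"predicted grade at 3.5 absences = {predicted:.1f}")
```

Note that r and the slope come out with the same sign, as described in question 7a.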
11. Explain the difference between a discrete and a continuous random variable. Give two
examples of each.

A discrete random variable can assume only a countable number of separated values, and is
usually the result of counting something. For example, if there are 100 employees, then the
count of the number absent on Monday can only be 0, 1, 2, 3, . . . , 100. A second example
is the number of aces in a hand of four cards, which can only be 0, 1, 2, 3, or 4.
A continuous random variable can assume any of an infinitely large number of values within
an interval, and is usually the result of measuring something. For example, the times of
commercial flights between Atlanta and Los Angeles are 4.67 hours, 5.13 hours, and so on;
the random variable is the number of hours. A second example is the reaction time of a dog
to a stimulus, measured in seconds.
12. Determine whether each of the distributions given below represent a probability
distribution. Justify your answer.
a)
x       1     2     3     4
P(x)    1/8   1/8   3/8   1/8
No, because the probabilities sum to 6/8 = 0.75, which is less than 1.
b)
x       3     6     8
P(x)    0.2   0     1
No, because the probabilities sum to 1.2, which is greater than 1.
c)
x       20    30    40    50
P(x)    0.3   0.2   0.1   0.4
Yes, because every probability is between 0 and 1 and the probabilities sum to exactly 1.
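The two conditions (each probability between 0 and 1, and a total of exactly 1) can be checked mechanically. This is a minimal sketch; the function name is illustrative, and `Fraction` is used so that the sums are exact rather than subject to floating-point rounding.

```python
from fractions import Fraction


def is_probability_distribution(probs):
    """True if every probability lies in [0, 1] and the probabilities sum to 1."""
    in_range = all(0 <= p <= 1 for p in probs)
    return in_range and sum(probs) == 1


# The three P(x) rows from the problem, entered as exact fractions.
dist_a = [Fraction(1, 8), Fraction(1, 8), Fraction(3, 8), Fraction(1, 8)]
dist_b = [Fraction("0.2"), Fraction(0), Fraction(1)]
dist_c = [Fraction("0.3"), Fraction("0.2"), Fraction("0.1"), Fraction("0.4")]

for label, dist in [("a", dist_a), ("b", dist_b), ("c", dist_c)]:
    print(label, "sum =", sum(dist), "valid =", is_probability_distribution(dist))
```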
13. From a standard deck of 52 cards, four cards are selected one at a time. Let x
represent the number of aces drawn in a set of 4 cards.
a. If this experiment is completed without replacement, explain why x is not a binomial
random variable.
This is because, without replacement, the outcome of one trial affects the following
trials: the probability of drawing an ace is 4/52 on the first draw, but 3/51 or 4/51 on the
second, depending on the first card. A binomial distribution requires trials that are
independent and have the same probability of success, so x is not a binomial random variable.
b. If this experiment is completed with replacement, explain why x is a binomial random
variable.
This is because, with replacement, the deck is restored before every draw, so the outcome
of one trial does not affect any other trial and the probability of an ace stays at 4/52.
With a fixed number of trials (4), two outcomes per trial (ace or not an ace), a constant
probability of success, and independent trials, x meets the conditions for a binomial
random variable.
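The effect of replacement can be seen by computing both distributions for the number of aces. This is a sketch under the assumption that the without-replacement case is modelled by the hypergeometric distribution, which the original answer does not name; the function names are illustrative.

```python
from math import comb


def p_aces_without_replacement(k, draws=4, aces=4, deck=52):
    """Hypergeometric probability of exactly k aces when cards are not replaced."""
    return comb(aces, k) * comb(deck - aces, draws - k) / comb(deck, draws)


def p_aces_with_replacement(k, draws=4, p=4 / 52):
    """Binomial probability of exactly k aces when each card is replaced."""
    return comb(draws, k) * p ** k * (1 - p) ** (draws - k)


for k in range(5):
    without = p_aces_without_replacement(k)
    with_repl = p_aces_with_replacement(k)
    print(f"k = {k}: without = {without:.6f}, with = {with_repl:.6f}")
```

The two columns are close but not equal, which reflects the changing probability of an ace from draw to draw when cards are not returned to the deck.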
14. Consider a binomial distribution with 15 identical trials and a probability of success of
0.5. Find the probability that x = 2 using the binomial tables.

P(X=2) = 0.0032
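The table value can be cross-checked against the binomial formula P(X = k) = C(n, k) p^k (1 - p)^(n - k). A minimal sketch, with an illustrative function name:

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)


prob = binomial_pmf(2, 15, 0.5)  # C(15, 2) * 0.5**15 = 105 / 32768
print(round(prob, 4))  # 0.0032
```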
