Professional Documents
Culture Documents
CHAPER -FIVE
ELEMENTARY PROBABILITY
Probability is the chance of an outcome of an experiment.
It is the measure of how likely an out is occur.
5.1. Definitions of Basic Probability Terms
Experiment: any process which generates well defined out comes.
Outcome: The result of a single trial of an experiment.
Sample space (S): is a set of all possible outcome of an experiment.
Event: is a sub set of sample space. Remark: If S (sample space) has n members then there are exactly 2n subsets or events.
Equally Likely Events: events which have the same chance of occurrence.
Complement of an event: the component of an event A means non-occurrence of A and is denoted by Ac contains those points of the sample space
which doesn’t belong to A.
Elementary event: an event having only a single element or sample event.
Mutually Exclusive (ME) Event: two events that can’t occur simultaneously (which cannot happen at the same time).
Independent Event: two events are independent if the occurrence of one event does not affect the occurrence or non-occurrence of the other event.
Otherwise, they are dependent events.
Example: - what is sample space for the following experiment
1. Toss a dice one time
2. Toss a dice two times.
5.2. Counting Rules
In order to calculate probabilities, we have to know the number of elements of an event and the number of elements of the sample space. In order to
determine the number of outcomes, one can use the following rules of counting:
Addition rule
Multiplication rule
Permutation and combination rule.
Example: A student goes to the nearest snack to have a breakfast. He can take tea, coffee, or milk with bread, cake and sandwich. How many
possibilities does he have?
Solutions
Tea bread, cake and sandwich
Coffee bread, cake and sandwich
Milk bread, cake and sandwich
There are 9 possibilities
The Multiplication rule:
If a choice consists of K steps which the 1 st can be made in n1 ways, the 2nd can be made n2 ways…, the kth can be made in nk ways, then the whole
choice can be made in (n1*n2*…….*nk) ways.
Example: - the digits 0, 1, 2, 3 and 4 are to be used in 4 digit identification card, how many different cards are possible. If
Introduction to statistics
a. Repetitions are permitted,
b. Repetition are not allowed
Solutions: a) there are 4- steps:
Selecting the 1st digit, this can be made in 5 ways.
Selecting the 3rd digit, this can be made in 5 ways.
Selecting the 2nd digit, this can be made in 5 ways.
Selecting the 4th digit, this can be made in 5 ways.
Permutation: Is an arrangement of n objects in a specified order. The number of permutation of n distinct objects taken all together is n! Where n!
=n*(n-1)*(n-2)*….*3*2*1
• The arrangement of n objects in a specified order using r objects at a time is called the permutation of n objects taken r objects at time.
The formula is given as:-
The numbers of permutations of n objects in which k1 are alike k2 are alike …..Etc
Introduction to statistics
I.e. in all the enumeration methods introduced so far all the objects under consideration are assumed to be different. However, there are times when
some of the objects under considerations are similar. That is, out of k objects k 1 are first kind, k2 are of second kind ,… kn are of nth kind, where k1+k2+
…kn = k . Then the number of permutation of these objects is given by:-
nPr = _____n!_______
k1!*k2!*…..*kn!
Example:-1
1. Suppose we have a letters A, B, C, D. how many permutation are there taking all the four?
2. How many permutations are there two letters at a time?
3. How many different permutations can be made from the letters in the word “CORRECTION”?
Exercise: -2
Six different statistics books, seven different physics books, and 3 different economics books are arranged on a shelf. How many different
arrangements are possible if;
a) There is no restriction?
b) The books in each particular subject must all stand together?
c) Only statistics books must stand together?
Example: - Given the letters A, B, C, and D list the permutation and combination for selecting two letters.
Solutions:
Permutation: Combination
AB BA CA DA AB BC
AC BC CB DB AC BD
AD BD CD DC AD DC
Note: In permutation AB is different BA. But in combination AB is the same as BA.
B/c in a combination grouping pays no attention to order.
Combination rule:-
The number of combinations of r objects selected from n objects is given as follow:-
Introduction to statistics
b) If none of the defective clock is included: this is equivalent to zero defective and three non defective, which can be done in: 2Co *13 C3 = 286
ways.
c) If only one of the defective clock is included: this is equivalent to one defective and two non defective, which can be done in; 2 C1 *13 C2 =156
ways.
d) if two of the defective clock is included: this is equivalent to two defective and one non defective, which can be done in : 2C2 *13C1 =13 ways.
Introduction to statistics
Exercise1: out of 5 mathematicians and 7 statisticians a committee consisting of 2 mathematicians and 3 statisticians is to be formed. In how many
ways this can be done if
a) There is no restriction
b) One particular statistician should be included.
c) Two particular mathematicians cannot be included on the committee.
2. If 3 books are picked at random from a shelf containing 5 novels, 3 books of poems, and
a dictionary, in how many ways this can be done if:
a).There is no restriction.
b).The dictionary is selected.
c). 2 novels and 1 book of poems are selected.
The classical approach of measuring probability:
Definition: if a random experiment with N equally likely out comes is conducted and out of theses N A outcomes are favorable to the event A, then the
probability that event A occur denoted P (A) is defined as:
N ¿ n( A)
P (A) = A = No . of out comes favorable ¿ A total no . of out comes =
N n (S)
Example:- A fair die tossed once. What is the probability of getting:-
a) Number 4
b) An odd number
c) An even number
d) Number 8?
Solutions: First identify the sample space, say S= {1, 2, 3, 4, 5, 6} N=n(S) =6
n( A) 1
Let A be the event of number 4: A= {4} NA = n(A) =1 P(A) = =
n (S) 6
n( A) 3
Let A be the event of odd number: A= {1, 3, 5}: NA = n (A) =3 P (A) = = = 0.5
n (S) 6
n( A) 3
Let A be the event of even number: A= {2, 4, 6}: NA = n (A) =3 P (A) = = = 0.5
n (S) 6
n( A)
Let A be the event of number 8: A= Ø: NA = n (A) = 0 P (A) = = 0/6 = 0
n (S)
Example:- A box of 80 candles consists of 30 defective and 50 non defective candles. If 10 of these candles are selected at random, what is the
probability: if?
Introduction to statistics
a) All will be defective
b) 6 will be non defective
c) All will be non defective
Solutions: - Total selection = 80C10 = NA=n(S)
a) Let A be the event that all will be defective; total ways in which A occur
n( A) 30C 10∗50 C
30C10 *50C0 =NA= n (A) P (A) = = 0.00001825
0
n (S) 80 C 10
b). Let A be the event that 6 will be non defective; total ways in which A occur:-
n( A) 30C 4∗50C
30C4 *50C6 =NA= n (A) P (A) = = = 0.265
6
n (S) 80 C10
c). Let A be the event that all will be non defective; total ways in which A occur:-
n( A) 30C 0∗50 C
30C0*50C10 =NA= n (A) P (A) = = = 0.00624
10
n (S) 80 C 10
Exercise: 1 what is the probability that a waitress will refuse to serve alcoholic beverages to only three minors if she randomly check’s the I.D’s of five
students from among ten students of which four are not of legal age?
Exercise: 2 if 3 books are picked at random from a shelf containing 5 novels, 3 books of poems, and a dictionary, what is the probability that: if
a) The dictionary is selected?
b) 2 novels and 1 book of poems are selected?
Conditional probability:
The conditional probability of an event A gives that B has already occurred, denoted P(A/B) is
Introduction to statistics
P (A/B) = P (A ∩ B) , P (B) ≠ 0
P (B)
Solution: let A= the event that a student will get a scholar ship and B= the event that a student will graduate.
Given: P (A) = 0.25, P (B) = 0.75, P (A n B) = 0.2
Required P (B/A) =?
P (B/A) = P(A∩ B) = 0.2 = 0.8
P (A) 0.25
Ex.2. If the probability that a research project will be well planned is 0.6 and the probability that it will be well planned and well executed is 0.54, what is the
probability that it will be well executed given that it will be well planned?
Solution: Let A= event that a research project will be well planned & B= event that a research project will be well executed
Required P (B/A) =?
Solution: let A= the first ball drawn is black and B= second drawn is black.
Required P (A ∩ B) =?
CHAPTER - SIX
RANDOM VARIABLE AND PROBABILITY DISTRIBUTIONS
A random variable is 1
Random variables are two types: Discrete and continues random variables:-
Discrete random variable: are variables which can assume only a specific number of values. They have values that can be counted.
Examples: - tossing a coin n times and count the number of heads. Number of children in a family. Number of cars accidents per week. Number of
defective items in a given company.
Continuous random variable:- are variables that can assume all values between any two given values.
Example: height student in a certain college. Mark of a student. Life time of light bulbs. Length of time required to complete a given training.
A probability distribution: consists of a value of random variable can assume and the corresponding probabilities of the values.
Example: consider the experiment tossing a coin three times. Let X be the number of heads. Construct the probability distribution of X.
Solution: first identify the possible value that X can assume. Calculate the probability of each possible distinct value of X and express in the form of
frequency distribution.
X=x 0 1 2 3
P(X=x) 1/8 3/8 3/8 1/8
1
Refer in chapter one
Introduction to statistics
c).The possibility of each outcome does not change from trial to trial.
d).The trials are independent, thus we must sample with replacement.
Example of binomial experiments:
Tossing a coin 20 times to see how many tails occur
Asking 200 people if they watch BBC news.
Registering a newly produced product as defective or non defective.
Notation:
X ~Bin (n, p)
Definition: Let P = probability of success & q= 1-p = the probability of failure. Then the probability of getting X success is n trials becomes:-
P(X=x) = nCx pxqn-x, x= 0,1,2,3…,n
Ex.1. what is the probability of getting three heads by tossing a fair coin four times?
Solution: let X be the number of heads in tossing a fair coin four times. X ~Bin (n=4, p=0.5)
P(X=x) = nCx pxqn-x, x= 0, 1,2,3,4 (X=3) = 4C3 (0.5)3 * (0.5)1=0.25
Ex.2: suppose that an examination consists of six true and false questions, and assume that a student has no knowledge of the subject matter. The
probability that the student will guess the correct answer to the first question is 30%. Likewise, the probability of guessing each of the remaining
questions correctly is also 30%.
a) What is the probability of getting more than three correct answers?
b) What is the probability of getting at least two correct answer?
c) What is the probability of getting at most three correct answers?
d) What is the probability of getting less than five correct answers?
Solution: Let the number of correct answers that the student gets.
X ~Bin (n=6, p=0.3)
a) P(X > 3) =?
P(X=x) = nCx pxqn-x , x= 0, 1,2,3,4, 5, 6
x 6-x
6Cx (0.3) * (0.7)
Poisson distribution:
A random variable X is said to be have a Poisson distribution if its probability distribution is given by
P(X=x) = x e- , X=0, 1, 2… where = the average number.
X!
Notation:-X ~ Po ( )
The Poisson distribution is used to as a distribution of rare events, such as: Number of misprints, Natural disasters like earth quick, accidents, etc…
The process that given gives rise to such events are called Poisson process.
Examples: if 1.6 accidents can be expected an intersection on any given day, what is the probability that there will be 3 accidents on any given day?
Solution: let X= the number of accidents, X~ Po (),
Exercise:- on average five smokers pass a certain street corner every ten minutes, what is the probability that during a given 10 minutes the number of
smokers passing will be a) 6 or fewer b) 7 or more c) exactly 8.
F Normal distribution
Properties of normal distribution:
1. It is bell shaped.
Introduction to statistics
2. It is symmetrical about its mean and it is Mesokurtic. Mean = median = mode =
3. It is asymptotic to the x- axis.
4. It is a continuous distribution.
5. Total area under the curve sums to 1.
6. It is unimodal. i.e. values mound up only in the center of the curve.
7. Mean = µ and variance= ð2 are the two parameters of a normal distribution.
Æ Given a normal distributed random variable X with mean µ and standard deviation .
P (a < X< b) = (a-X- b- (a- b-
Examples: Find the area under the standard normal distribution which lies:
10. A normal distribution has mean 62.4. Find its standard deviation if 20.05% of the area under the normal curve lies to the right of 72.9.
Solution:
= P(X > 72.9) = 0.2005
= P (X
P (Z >
P (Z > P (0 < Z <
and from the table P (0<Z< 0.84) = 0.2995 10.5 = 0.84 =>
A random variable has a distribution with =5. Find its mean if the probability that the
random variable will assume a value less than 52.5 is 0.6915.
Solution:
P (Z < z) = P (Z< 52.5-Pz) = 0.6915- 0.50 = 0.1915
5
But From the table => P (0<Z<0.5) = 0.1915 z = 52.5-
5
Exercise: Out of a large group of men, 5 % are less than 60 inches in height and 40 % are between 60 & 65 inches. Assuming a normal distribution,
find the mean and standard deviation of heights.
Introduction to statistics
CHAPTER- SEVEN
Sampling and Sampling Distribution
ÆProbability (Random) Sampling: - is a method of sampling where all items in a population have a pre-assigned non- zero probability to be chosen or
selected in the sample. E.g.
@ Simple random sample (SRS)
@ Stratified sampling
@ Systematic sampling
@ Cluster sampling
Random Sampling
Introduction to statistics
Sampling in which the data is collected using chance methods or random numbers.
Systematic Sampling
Sampling in which data is obtained by selecting every kth object.
Stratified Sampling
Sampling in which the population is divided into groups (called strata) according to some characteristic. Each of these strata is
then sampled using one of the other sampling techniques.
Cluster Sampling
Sampling in which the population is divided into groups (usually geographically). Some of these groups are randomly selected,
and then all of the elements in those groups are selected.
ÆNon Probability (Non Random Sampling):- is a method of sampling where the sample to be chosen (selected) depends on personal decision or
subjective judgment of the investigator. E.g.
@ Judgment Sampling
@ Convenience Sampling
@ Quota Sampling
Convenience Sampling
Sampling in which data is which is readily available is used.
If a sample of size n is drawn from a population of size N in such a way that every possible sample of size n has the same probability of being
selected, then the sample is called simple random sample and the method of sampling is called simple random sampling (SRS). i.e. SRS: is a
sampling technique in which every item of the population has equal chance of being included in the sample.
CHAPTER- EIGHT
ESTIMATION AND HYPOTHESIS TESTING
Statistical Inference: is the process of making (giving) conclusion about the population of interest based on the information (data) obtained from the
sample.
Two ways of making inference in statistics are:-
F Estimation:
F Hypothesis Testing:
Estimation: - this is one ways of making inference about the population parameter. There are two ways of making estimation.
@ Point Estimation
@ Interval Estimation (confidence interval)
Point Estimation: is a procedure that results in a single value as an estimate for a parameter.
Interval Estimation (CI):- is the procedure that results in the interval of values as an estimate for a parameter.
?It deals with identifying the upper and lower limits of a parameter.
Definitions:
Estimator: is a rule or a formula used to estimate a parameter.
Estimate: is a specific value of an estimator.
1
Example: x = ∑ xi is an estimator. x = 20 is an estimate.
n
Point estimate: A single value used to estimate a parameter.
Interval Estimate: a range of values used to estimate a parameter.
Point estimation of
n
1
The point estimation is x = ∑ xi
n i=1
Example:- given the values 5,9,6,4,7,11 are samples of a given population, the point estimate for becomes:-
Introduction to statistics
n
1 1
x= ∑
n i=1
xi = (5+9+...+11) = 7
6
Remark: - if the variable of interest is continuous, the probability that estimator exactly estimates a population parameter is zero. Hence, we will not
usually estimate population parameter with a point estimator. so we go for interval estimation.
Case I: if sample size (n) is large (n > 30) or if the population is normal with known variance.
x ~N (n) Z = x - ~ N(0, 1)
√ n
Hence, X~ N ( , ) 2
The (1-) 100% confidence interval for is given by: - ¿L , x u¿ = [ x – Z/2 x + Z/2
√ n√ n
P ¿L, < x u¿ = 1 -
With Where (1 - is referred to as confidence coefficient.
= ( x L , x u¿ = [ 30.35 , 33.65]
Interpretation: - we are 95 % confidence that the unknown population mean will lie in the interval
Introduction to statistics
[30.35, 33.65] i.e. there is only 5% chance that is out of the CI [30.35, 33.65].
Exercise; Construct and interpret the 90%, 98% and 98%. CI for of the above example.
Example:- A drug company is testing a new drug which is supported to reduce blood pressure. From six people who are used as subject, it is found
that the average drop in blood pressure is 2.28 points, with a standard deviation of 0.95 points. Construct a 95 % confidence interval for the mean
change in blood pressure.
Given: n=6 (n<30) => small, x = 2.28, s = 0.95 (2 = unknown)
0.95= 1- => =0.05 => /2=0.025
t/2 n-1 = t0.025 5 =2.571
If the sample size, n is sufficiently large (n >30), then we go for the central limit theorem and then...
X~ N ( , 2/n ), so then (1-)100% confidence interval for becomes:
CASES:
Case I : When sampling is from a normally distributed with known known
The test statistics is Z = x -
√ n
After specifying we have the following regions (critical and acceptance):
Step 6: Decision making: since ¿ t cal∨¿ = 0.76 is not greater than t0.005(9) = 3.2498 we do not reject Ho at 1 % level of significance.
step 7 : conclusion: at 1 % level of significance, we have no evidence that the average height content of containers is different from 10 liters, based on
the given sample data.
Example 2:
The mean life time of a sample of 16 fluorescent light bulbs produced by a company is computed to be 1570 hours. The population standard deviation
is 120 hours. Suppose the hypothesized value for the population mean is 1600 hours. Can we conclude that the mean life times of light bulbs are
decreasing?
Use =0.05 and assume the normality of the population.
Solution:
Introduction to statistics
Given: n= 16 (small), x = 1570, 120, = 0.05, =1600
Step1: identify the appropriate hypothesis: Ho: = =1600 vs H1: < < 1600.
Step 2: specify Given =0.05.
Step 3. Select an appropriate test statistics: Z- statistic is appropriate. Because 2 is known.
Step 4: identify the critical region: Reject Ho at level of significance Zcal < - Z i.e
Zcal < -Z0.05 = -1.645
Step 5: calculate the test statistic: Zcal = x -
/√ n 120/√ 16
Step 6: Decision making: since Zcal = -1.0 is not less than – Z0.05 = -1.645 we do not reject Ho at 5 % level of significance.
Step 7: Conclusion: at 5 % level of significance, we have no evidence to say that the mean life time of light bulbs is decreasing, based on the given
sample data.
Example 3: The mean weekly sale of chocolate bar in a candy store was 146.3 bars per store. After the advertizing campaign the mean weekly sales in
22 stores for a typical week increased to 157.7 and showed a standard deviation of 17.2. Is the evidence conclusive that advertizing was successful
at 5 % level of significance?
Solution: Given: = 146.3, n = 22(small), x = 157.7, S= 17.2 ( = unknown) , normality, = 0.05
Step1: Identify the appropriate hypothesis Ho: = =146.3 vs. H1: > 146.3
Step 2: specify Given =0.05.
Step 3. Select an appropriate test statistics: t- statistic is appropriate. Because 2 is unknown and n is small
Step 4: identify the critical region: Reject Ho at level of significance tcal >tn = t0.05(21) = 1.721
Step 5: calculate the test statistic: tcal = x -
s/√ n 17.2/√ 22
Step 6: Decision making: since tcal = 2.02 is greater than t0.05(21) = 1.721 we reject Ho at 5 % level of significance.
step7: Conclusion: at 5% level of significance, we have strong evidence that advertizing the chocolate was successful in bringing an increase in mean
weekly sale, based on the given data.
Exercise: It is known in a pharmacological experiment that rats feed with a particular diet over a certain period gain an average of 40gms in weight. A
new diet was tried on a sample of 20 rats yielding a weight gain 43 gms with variance 7 gms. Test the hypothesis that the new diet is an
improvement, assume normality.
CHAPTER - NINE
Here we will discuss in the method employed to determine if there exists any relationship between two variables and express the r/p numerically.
Examples: we may need to know whether there exists a r/p between:
Income & Expenditure Fertilizer & plant yield
Age & Blood pressure Height & weight
Correlation Analysis: is a statistical technique that can be used to describe the degree to which one variable is linearly related to other variable. I.e.
correlation: is the degree (strength) of linear r/p b/n two variables (say X & Y). Two variable X & Y are said to be highly correlated if they have a
strong r/p. i.e. ↑ X => ↑ Y, ↓ X => ↓Y or ↓X => ↑Y or vice versa.
If higher (lower) values of one variable (say X) is a accompanied by higher (lower) of the other variable (say Y), then we say that there exists a
positive (direct) correlation b/n X & Y. i.e. ↑ X => ↑ Y, ↓ X => ↓Y. Example: The greater the radius of a circle, the greater will be the circumstance.
If lower (higher) values of one variable (say X) is a accompanied by lower (higher) of the other variable (say Y), then we say that there exists a
negative (indirect) correlation b/n X & Y. i.e. ↓X => ↑Y or ↑ X => ↓Y. Example: saving Vs expenditure.
The simple correlation coefficient (Pearsonian’s correlation coefficient): is a measure used to determine the degree of correlation b/n two or more
variables.
∑ XY −n X Y
r = n ∑ XY – ¿¿ ¿ or
√[∑ X 2−n X 2 ] ¿ ¿ ¿
Remark: Always this r lies between -1 and 1 inclusively and it is also symmetric.
Interpretation of r
F Perfect positive linear r/p (if r = 1)
F Perfect negative linear r/p (if r = -1)
F No linear relationship (if r = 0)
F Some positive linear r/p (if r is b/n 0 and 1)
Introduction to statistics
F Some negative linear r/p (if r is b/n 0 and -1)
Example: compute the value of Pearson’s correlation coefficient based on the study of Age (X) and Blood pressure(Y) of a person.
Interpretation: there is strong positive linear r/p b/n age and blood pressure.
9.2 Simple Linear Regression (Regression of Y on X)
Regression:- Is the a r/p existing b/n a dependent (effect) variable and given independent (cause ) variable(s). i.e. by regression the dependent
variable (Y) is given as a function of the independent variable or variables(X).
Simple Linear Regression: is a regression where there is a linear relation b/n dependent variable (Y) and independent variables.
Example: Study Hours’ (X= independent variable= cause) versus
Grade obtained(Y= dependent variable= effect)
Two variables X and Y are said to be linearly related if their relationship can be expressed by simple linear model:
Y = x
......, referred as regression of Y on X
Where: Y = dependent variable.., X = independent var.
= intercept of the regression line, = slope of regression line,
= error (random disturbance term)
Here (intercept) and (slope) are the parameters. Also they are known as regression coefficients.
Introduction to statistics
There are different methods of estimating the parameters and . The most commonly applied method is the ordinary least square (OLS) method.
The OLS method tries to estimate the parameters by minimizing the sum of squares of the error term ∑ e2
Y = x is estimated by Y^ = a + bX , where: a and b are OLS estimates of and respectively.
Where a & b are the least squares estimates of the regression line of Y on X.
Examples: the following hypothetical data set are shows income and monthly food expenditure of household in hundreds of birr. Then,
Ü a). Fit the least squares regression line to the given data.
Ü b). Calculate a simple correlation coefficient (r)
Ü c). Predict the food expenditure for 800 birr (8).
Income=X Expenditure = Y XY X2 Y2
3.8 3.1 11.78 14.44 9.61
4.5 3.6 16.2 20.25 12.96
2.5 2.3 5.75 6.25 5.29
4.8 3.7 17.76 23.04 13.69
n = 11, X = 5.95 and Y = 4.03…………
7.7 4.6 35.42 59.29 21.16 a). Y^ = a + bX
5.0 4.1 20.5 25 16.81
n ∑ XY −∑ X ∑ Y 11 ( 294.97 )− (65.45 ) (44.43)
12.6 6.5 81.9 158.76 42.25 b= n ∑ X −¿ ¿ ¿
2 = 11 ( 472.19 )−(65.45)2
= 0.38
8.5 5.1 43.35 77.25 26.01
5.5 4 22 30.25 16 and
7.1 4.1 29.11 50.41 16.81
3.5 3.2 11.2 12.25 10.24 a = Y −b X = 4.03-(0.38) (5.95) = 1.79
Therefore, Y^ = a + bX = 1.79+0.38X
∑ X = 65.5 ∑ Y = 44.3 ∑ XY =¿ ¿ ∑ X 2 = 472.19 ∑ Y 2= 190.83 Interpretation: wherever income (X) is zero, the expenditure on
294.97
food will be birr 1.79(179) and for every birr increase in income
38% of it will be spent on food.
Introduction to statistics
b). r = n ∑ XY −¿ ¿ ¿ = 0.9829
c). When income X= 8 => food expenditure Y^ = a + bX= 1.79 + 0.38X = 1.79 + 0.38(8) = 4.83
Coefficient of Determination( r2): Is a measure of the proportion of the total variation in Y that is explained by its r/p with X. That is, to know how
far the regression equation has been able to explain the variation in Y we use r2
i.e. r2 = measure the goodness of fit of the regression line.
Interpretation: If r2 close to 1 => r/p b/n X & Y is well explained by the regression equation (line). => Good fit
If r2 close to zero=> r/p b/n X & Y is not well explained by the regression equation (line) => not good (bad) fit.
Example: Compute the coefficient of determination (r2) for the above data.
We already calculate r = 0.9829 => r2 =0.967 = 0.97