Stat - Prob 11 - Q4 - SLM - WK8
Statistics and Probability – Grade 11
Alternative Delivery Mode
Quarter 4 – Module 8: Exploring Regression Analysis
Republic Act 8293, Section 176 states that: No copyright shall subsist in any
work of the Government of the Philippines. However, prior approval of the
government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other
things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this book are owned by their respective copyright
holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and
authors do not represent nor claim ownership over them.
Office Address: 0050 Lino Chatto Drive Barangay Cogon, Tagbilaran City,
Bohol
Telefax: (038) 501 – 7550
Tel Nos. (038) 412 – 4938; (038) 411-2544; (038) 501 – 7550
E-mail Address: depedbohol@deped.gov.ph
Learning Competencies:
Identifies the independent and dependent variables. (M11/12SP-IVi-1)
Calculates the slope and y-intercept of the regression line. (M11/12SP-IVi-3)
Interprets the calculated slope and y-intercept of the regression line. (M11/12SP-IVi-4)
Predicts the value of the dependent variable given the value of the independent variable. (M11/12SP-IVj-1)
Solves problems involving regression analysis. (M11/12SP-IVj-2)
What is it…
Examples:
Identify the dependent and independent variables in each of the following pairs of variables:
If two variables are significantly correlated, we can predict the value of the dependent variable when we know the value of the independent variable. This process is called regression analysis.
The regression line Y = bX + a is also called the line prediction equation because we use it to predict the dependent variable Y when the independent variable X is known. Because the analysis considers only the vertical (Y) distances of the points from the line, the line cannot be used to predict X from Y.
Examples:
For the given regression line, predict the dependent variable Y, given the value of the independent variable X.
Y = 0.3X + 2.5; X = 6
Solution:
Y = 0.3X + 2.5
Y = 0.3(6) + 2.5
Y = 1.8 + 2.5
Y = 4.3
So, from the regression line Y = 0.3X + 2.5, the value of Y when X = 6 is 4.3.
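The substitution in this example can also be checked with a short script. The following is a minimal Python sketch; the function name `predict` is an illustrative choice and is not part of the module.

```python
def predict(b, a, x):
    """Return the predicted Y on the regression line Y = bX + a."""
    return b * x + a

# Regression line from the example: Y = 0.3X + 2.5, evaluated at X = 6
y = predict(b=0.3, a=2.5, x=6)
print(round(y, 2))  # 4.3
```

The same function works for any regression line once its slope b and y-intercept a are known.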
What’s More…
A. For each of the following pairs of variables, identify the independent and the dependent variables:
Lesson 2: The Slope and Y-Intercept of the Regression Line
What is it…
The y-intercept of the regression line is the value of the dependent variable Y when the independent variable X is 0. The slope of the regression line is the change in the dependent variable Y for each one-unit change in the independent variable X.
The equation Y = bX + a is the equation of the regression line, where a is the y-intercept and b is the slope of the regression line.
Examples:
Give the values of the slope b and y-intercept a in the following equation of the
regression line:
1. Y = 1.4X + 3.5     Answer: b = 1.4 (the slope) and a = 3.5 (the y-intercept)
$$ b = \frac{n(\sum XY) - (\sum X)(\sum Y)}{n(\sum X^2) - (\sum X)^2} $$
Examples:
Compute the values of the slope b and the y-intercept a from the given data. Interpret the result.
X    Y    X²    Y²    XY
1    1    1     1     1
1    2    1     4     2
2    4    4     16    8
3    2    9     4     6
4    4    16    16    16
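From the table, n = 5, ΣX = 11, ΣY = 13, ΣX² = 31, and ΣXY = 33.
Substitute these values into the formula for the slope:
$$ b = \frac{5(33) - (11)(13)}{5(31) - (11)^2} = \frac{22}{34} \approx 0.65 $$
Compute the y-intercept using the formula
$$ a = \frac{(\sum Y)(\sum X^2) - (\sum X)(\sum XY)}{n(\sum X^2) - (\sum X)^2} = \frac{(13)(31) - (11)(33)}{5(31) - (11)^2} = \frac{40}{34} \approx 1.18 $$
So the slope b is 0.65, which means that for every increase of 1 in X there is an increase of 0.65 in Y. The y-intercept a is 1.18, which means that when X is 0, Y is 1.18.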
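The slope and y-intercept formulas of this lesson can be sketched in Python as follows. The function name `slope_intercept` and the sample data are hypothetical; the data were chosen to lie exactly on the line Y = 2X + 1 so the result is easy to verify by hand, and they are not taken from the module.

```python
def slope_intercept(xs, ys):
    """Compute slope b and y-intercept a of Y = bX + a from paired data,
    using the sum formulas given in this lesson."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_x2 = sum(x * x for x in xs)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    denom = n * sum_x2 - sum_x ** 2  # zero only if all X values are equal
    b = (n * sum_xy - sum_x * sum_y) / denom
    a = (sum_y * sum_x2 - sum_x * sum_xy) / denom
    return b, a

# Hypothetical data lying exactly on Y = 2X + 1
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
b, a = slope_intercept(xs, ys)
print(b, a)  # 2.0 1.0
```

Both formulas share the same denominator, so the sketch computes it once and reuses it.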
What’s More…
A. Give the values of the slope b and the y-intercept a in each of the following equations of the regression line.
1. Y= 0.3X + 2.5
2. Y= 3.5X + 1.67
3. Y= 2.6X + 0.56
4. Y= -3.2X + 6.7
B. Find the slope b and y-intercept a in the given data below. Interpret the result.
X    Y    X²    Y²    XY
1    2    1     4     2
2    4    4     16    8
3    3    9     9     9
4    3    16    9     12
5    5    25    25    25
What is it…
Example 1.
The following data show the number of absences and the number of quizzes missed by five students. If there is a significant relationship between the two variables, predict the number of quizzes missed by a student who was absent for 6 days.
Student    Number of absences (X)    Number of quizzes missed (Y)
1          1                         1
2          1                         2
3          2                         4
4          3                         2
5          4                         4
Solution:
Steps and Solution
1. Identify the dependent and independent variables.
Here, the dependent variable is the number of missed quizzes while the independent variable is the number of absences.
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns and find the following:
X    Y    X²    Y²    XY
1    1    1     1     1
1    2    1     4     2
2    4    4     16    8
3    2    9     4     6
4    4    16    16    16
ΣX = 11, ΣY = 13, ΣX² = 31, ΣY² = 41, ΣXY = 33
$$ r = \frac{5(33) - (11)(13)}{\sqrt{[5(31) - (11)^2][5(41) - (13)^2]}} = \frac{22}{\sqrt{(34)(36)}} $$
r = 0.629
3. Test the significance of r using the formula
$$ t = r\sqrt{\frac{n-2}{1-r^2}} = 0.629\sqrt{\frac{5-2}{1-(0.629)^2}} $$
t = 1.401
4. Compare the computed t-value to the critical value.
Using df = n - 2 = 5 - 2 = 3, α = 0.05, two-tailed test, we find from the table that the critical value of t is 3.182.
5. Make a decision.
Since the computed t = 1.401 is less than the critical t = 3.182, we accept the null hypothesis. So, there is no significant relationship between the two variables.
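Steps 2 to 5 can be sketched in Python as follows. The function names and the sample data are hypothetical and are not taken from the module; the critical value 3.182 is the one the module reads from the t-table for df = 3, α = 0.05, two-tailed.

```python
import math

def pearson_r(xs, ys):
    """Correlation coefficient r computed from the sum formula."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    num = n * sum_xy - sum_x * sum_y
    den = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return num / den

def t_statistic(r, n):
    """t = r * sqrt((n - 2) / (1 - r^2)); undefined when |r| = 1."""
    return r * math.sqrt((n - 2) / (1 - r ** 2))

# Hypothetical data for five pairs (n = 5, so df = 3)
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = pearson_r(xs, ys)
t = t_statistic(r, len(xs))
critical_t = 3.182  # from the t-table: df = 3, alpha = 0.05, two-tailed
significant = abs(t) > critical_t
print(round(r, 3), round(t, 3), significant)  # 0.775 2.121 False
```

For these hypothetical data the computed t is below the critical value, so the relationship is not significant and no regression line would be formed.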
Example 2. The following data show the population of the Philippines from 2005 to
2012. If there is a significant relationship between the two variables, find the
regression line and predict the population in 2014.
Year (X)    Population (Y, in millions)
2005        85.26
2006        86.97
2007        88.71
2008        90.46
2009        91.02
2010        92.6
2011        94.18
2012        95.77
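Steps and Solution
1. Identify the dependent and independent variables.
Here, the dependent variable is the population while the independent variable is the year.
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns, find ΣX, ΣY, ΣX², ΣY², and ΣXY, and substitute them in the formula:
ΣX = 16068, ΣY = 724.97, ΣX² = 32272620, ΣY² = 65786.69, ΣXY = 1456163
$$ r = \frac{8(1456163) - (16068)(724.97)}{\sqrt{[8(32272620) - (16068)^2][8(65786.69) - (724.97)^2]}} $$
r = 0.99
3. Test the significance of r using the formula
$$ t = r\sqrt{\frac{n-2}{1-r^2}} = 0.99\sqrt{\frac{8-2}{1-(0.99)^2}} $$
t = 17.19
4. Compare the computed t-value to the critical value.
Using df = n - 2 = 8 - 2 = 6, α = 0.05, two-tailed test, we find from the table that the critical value of t is 2.447.
5. Make a decision.
Since the computed t = 17.19 is greater than the critical t = 2.447, we reject the null hypothesis. So, there is a significant relationship between the two variables.
7. Compute the values of a and b in the regression equation Y = bX + a using the following formulas
$$ a = \frac{(\sum Y)(\sum X^2) - (\sum X)(\sum XY)}{n(\sum X^2) - (\sum X)^2} = \frac{(724.97)(32272620) - (16068)(1456163)}{8(32272620) - (16068)^2} $$
a = -2814.77
$$ b = \frac{n(\sum XY) - (\sum X)(\sum Y)}{n(\sum X^2) - (\sum X)^2} = \frac{8(1456163) - (16068)(724.97)}{8(32272620) - (16068)^2} $$
b = 1.447
8. Form the regression equation.
Y = bX + a
Y = 1.447X - 2814.77
9. Predict the population in 2014.
Y = 1.447X - 2814.77
Y = 1.447(2014) - 2814.77
Y = 99.488
So the predicted population in 2014 is about 99.49 million.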
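The arithmetic in this example can be reproduced from the sums used in the substitutions above. This is a minimal Python sketch; the variable names are illustrative, and rounding the coefficients before predicting (b to three decimal places, a to two) is an assumption made to mirror the rounding in the worked solution.

```python
# Sums taken from the substitutions in the worked solution
n = 8
sum_x = 16068
sum_y = 724.97
sum_x2 = 32272620
sum_xy = 1456163

denom = n * sum_x2 - sum_x ** 2
b = (n * sum_xy - sum_x * sum_y) / denom
a = (sum_y * sum_x2 - sum_x * sum_xy) / denom

# Round the coefficients the way the worked solution reports them
b_rounded = round(b, 3)
a_rounded = round(a, 2)

# Predict the population (in millions) for the year 2014
y_2014 = b_rounded * 2014 + a_rounded
print(b_rounded, a_rounded, round(y_2014, 3))  # 1.447 -2814.77 99.488
```

Because X is a large number (a year), the prediction is sensitive to how many decimal places of b are kept, so the same rounding should be carried through every step.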
What’s More…
A. The following data show the weight and the average daily allowance of five Grade 11 students. If there is a significant relationship between the two variables, predict the daily allowance of a student whose weight is 44 kilograms. Show your solution on the answer sheet provided.
Student    Weight in kilograms (X)    Average daily allowance (Y)
1          39                         30
2          40                         20
3          41                         25
4          42                         20
5          43                         30
B. The table shows the income of a company for the past 8 years. Answer as directed.
Income (in million pesos)    2.2    2.4    2.9    2.5    3.1    3.3    3.6    3.9
a. Find the regression line that will predict the income of the company for a given year.
Assessment
Directions: Choose the letter that corresponds to the correct answer. Write your answer on the answer sheet provided.
1. It is a statistical tool that measures the relationship between variables
A. Correlation analysis C. dependent variable
B. Regression analysis D. independent variable
2. Identify the independent variable: amount of fertilizer and the height of a plant
A. amount of fertilizer C. plant growth
B. height of a plant D. none of these
3. Which is the correct independent variable (x) and dependent variable (y)?
A. x- number of study hours and y- grades
B. x- income and y- number of working hours
C. x- body weight and y- food intake
D. none of these
9. Find the regression line that will predict the average mileage per liter of the car.
A. y= 1.39x + 19.83 C. y= 19.83x + 1.39
B. y= -1.39x + 19.83 D. y= -19.83x – 1.39
Performance Task # 4
Make a portfolio of all your activity/answer sheets in Quarter 4 using a white folder. Arrange your answer sheets according to module number, with Module 1 at the top and Module 8 at the bottom. A Table of Contents is already made for you to fill out; put it on the topmost part of your compilation.
Reference
The t-table
Source: https://jimgrange.wordpress.com/Statistics Tables:Where do the Numbers
Come From?
TABLE OF CONTENTS
DATE    MODULE / LESSON    TITLE    SCORE
Module 1
Lesson 1
Lesson 2
Lesson 3
Module 2
Lesson 1
Lesson 2
Lesson 3
Module 3
Lesson 1
Lesson 2
Lesson 3
Module 4
Lesson 1
Lesson 2
Lesson 3
Module 5
Lesson 1
Lesson 2
Lesson 3
Module 6
Lesson 1
Lesson 2
Lesson 3
Module 7
Lesson 1
Lesson 2
Lesson 3
Module 8
Lesson 1
Lesson 2
Lesson 3
TOTAL SCORE
Answer Sheet
Name:
Grade & Section: Score:
Quarter 4 – Module 8
Lesson 1
A.
Independent Dependent
1.
2.
3.
4.
5.
B.
1. 2.
3. 4.
Lesson 2
A.
1. 3.
2. 4.
B.
Lesson 3
A.
Steps and Solution
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns and find the following:
∑X =      ∑Y =      ∑X² =      ∑Y² =      ∑XY =
r =
r =
3. Test the significance of r using the formula:
$$ t = r\sqrt{\frac{n-2}{1-r^2}} $$
4. Compare the computed t-value to the critical value.
5. Make a decision.
B.
Steps and Solution
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns, find ∑X, ∑Y, ∑X², ∑Y², and ∑XY, and substitute them in the formula:
X    Y    X²    Y²    XY
r =
r =
3. Test the significance of r using the formula:
$$ t = r\sqrt{\frac{n-2}{1-r^2}} $$
4. Compare the computed t-value to the critical value.
5. Make a decision.
7. Compute the values of a and b in the regression equation Y = bX + a using the following formulas
$$ a = \frac{(\sum Y)(\sum X^2) - (\sum X)(\sum XY)}{n(\sum X^2) - (\sum X)^2} $$
$$ b = \frac{n(\sum XY) - (\sum X)(\sum Y)}{n(\sum X^2) - (\sum X)^2} $$
8. Form the regression equation.
Assessment
1.
2.
3.
4.
5. 6.
7.
8.
9.
10.
Answer Key
Lesson 1
A.
Independent Dependent
B.
1. y= -54.9 2. y= -9.3 3. y= 9.66 4. y= 29.67
Lesson 2
A.
1. b = 0.3, a = 2.5
2. b = 3.5, a = 1.67
3. b = 2.6, a = 0.56
4. b = -3.2, a = 6.7
B. b = 0.5, a = 1.9
Lesson 3
A.
Steps and Solution
1. Identify the dependent and independent variables.
Here, the dependent variable is the average daily allowance of the student while the independent variable is the weight in kilograms.
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns and find the following:
∑X = 205, ∑Y = 125, ∑X² = 8415, ∑Y² = 3225, ∑XY = 5125
r = 0
3. Test the significance of r using the formula
$$ t = r\sqrt{\frac{n-2}{1-r^2}} = 0\sqrt{\frac{5-2}{1-(0)^2}} $$
t = 0
4. Compare the computed t-value to the critical value.
Using df = n - 2 = 5 - 2 = 3, α = 0.05, two-tailed test, we find from the table that the critical value of t is 3.182.
5. Make a decision.
Since the computed t = 0 is less than the critical t = 3.182, we accept the null hypothesis. So, there is no significant relationship between the two variables.
B.
Steps and Solution
1. Identify the dependent and independent variables.
Here, the dependent variable is the income of the company while the independent variable is the particular year.
2. Compute the correlation coefficient (r) using the formula
$$ r = \frac{n\sum XY - \sum X \cdot \sum Y}{\sqrt{[n\sum X^2 - (\sum X)^2][n\sum Y^2 - (\sum Y)^2]}} $$
Let us put the data in columns, find ∑X, ∑Y, ∑X², ∑Y², and ∑XY, and substitute them in the formula:
X       Y      X²           Y²       XY
2006    2.2    4,024,036    4.84     4413.2
...
2013    3.6    4,052,169    12.96    7246.8
...
$$ r = \frac{8(48053.3) - (16081)(23.9)}{\sqrt{[8(32324879) - (16081)^2][8(73.93) - (23.9)^2]}} $$
r = 0.93
3. Test the significance of r using the formula
$$ t = r\sqrt{\frac{n-2}{1-r^2}} = 0.93\sqrt{\frac{8-2}{1-(0.93)^2}} $$
t = 6.198
4. Compare the computed t-value to the critical value.
Using df = n - 2 = 8 - 2 = 6, α = 0.05, two-tailed test, we find from the table that the critical value of t is 2.447.
5. Make a decision.
Since the computed t = 6.198 is greater than the critical t = 2.447, we reject the null hypothesis. So, there is a significant relationship between the two variables.
7. Compute the values of a and b in the regression equation Y = bX + a using the following formulas
Using the values obtained in Step 2, we have the following:
$$ a = \frac{(\sum Y)(\sum X^2) - (\sum X)(\sum XY)}{n(\sum X^2) - (\sum X)^2} = \frac{(23.9)(32324879) - (16081)(48053.3)}{8(32324879) - (16081)^2} $$
a = -383.25
$$ b = \frac{n(\sum XY) - (\sum X)(\sum Y)}{n(\sum X^2) - (\sum X)^2} = \frac{8(48053.3) - (16081)(23.9)}{8(32324879) - (16081)^2} $$
b = 0.192
8. Form the regression equation.
Substitute the values of a and b in the equation
Y = bX + a
Y = 0.192X - 383.25
9. Predict the income of the company in 2020.
Find the value of Y when X = 2020 in the regression equation
Y = 0.192X - 383.25
Y = 0.192(2020) - 383.25
Y = 4.59
Assessment
1. B 6. B
2. A 7. A
3. A 8. D
4. C 9. B
5. C 10. A
1. A 6. B 11. C
2. B 7. C 12. D
3. C 8. C 13. D
4. D 9. A 14. A
5. A 10. B 15. B