MODULE NO. 10
Title: Simple Regression Analysis
TOPIC OUTLINE:
1. Concept of simple regression analysis.
2. Computation of slope and y-intercept of the regression line or
regression equation.
3. Interpreting the slope and y-intercept of the regression line.
4. Developing the regression line and regression equation.
5. Predicting the value of the dependent variable using the
regression equation when the independent variable is known.
LEARNING OUTCOMES:
At the end of the lesson, the students will be able to:
1. Discuss the concept of simple regression analysis.
2. Compute the slope and y-intercept of the regression line or
regression equation.
3. Interpret the slope and y-intercept of the regression line.
4. Develop the regression line or regression equation.
5. Predict the value of the dependent variable using the regression
equation when the independent variable is known.
6. Investigate the linear relationship between two variables and
obtain a regression equation to describe the relationship.
Overview:
When Francis Galton was working with large observational studies
on humans in the mid-to-late 1800s, he noticed a regression toward
the mean when he compared the heights of fathers and their first
sons. That is how regression got its name.
TOPIC PRESENTATION
To find the “line of best fit” using regression analysis, consider the
regression line equation:
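$$y = a + bx$$

$$a = \bar{y} - b\bar{x}$$

$$b = \frac{n\sum xy - \sum x \sum y}{n\sum x^2 - \left(\sum x\right)^2}$$

Where:

$$\bar{x} = \frac{\sum x}{n} \quad \text{(mean of distribution x)}$$

$$\bar{y} = \frac{\sum y}{n} \quad \text{(mean of distribution y)}$$

$$n = \text{number of respondents}$$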
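The formulas above can be sketched in code as follows. This is a minimal illustration rather than part of the module: the function name `fit_line` and the four sample points (chosen to lie exactly on y = 1 + 2x) are assumptions made only for this example.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have the same length (at least 2)")

    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)

    denom = n * sum_x2 - sum_x ** 2
    if denom == 0:
        raise ValueError("all x values are identical; the slope is undefined")

    b = (n * sum_xy - sum_x * sum_y) / denom   # slope
    a = sum_y / n - b * (sum_x / n)            # y-intercept: a = y_bar - b * x_bar
    return a, b


# Illustrative points only (not from the module); they lie on y = 1 + 2x.
a, b = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(a, b)  # prints: 1.0 2.0
```

Because the sample points fall exactly on a line, the function recovers that line's intercept and slope, which is a quick way to check the formulas.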
Example:
Fifteen randomly selected students were asked how many hours they
spent studying their lessons (X) before they took the Advanced
Statistics test. Their scores (Y) are listed in the table below:
STUDENT   X (Number of Hours Spent in Studying)   Y (Scores)
A          0                                      50
B          1                                      55
C          2                                      60
D          3                                      61
E          4                                      63
F          5                                      65
G          6                                      68
H          7                                      70
I          8                                      72
J          9                                      74
K         10                                      76
L         11                                      78
M         12                                      80
N         13                                      83
O         14                                      85
Question No. 1: Using the data above, determine the equation of
the regression line.
X (No. of Hours Spent)   Y (Scores)   XY       X²      Y²
  0                       50              0       0     2,500
  1                       55             55       1     3,025
  2                       60            120       4     3,600
  3                       61            183       9     3,721
  4                       63            252      16     3,969
  5                       65            325      25     4,225
  6                       68            408      36     4,624
  7                       70            490      49     4,900
  8                       72            576      64     5,184
  9                       74            666      81     5,476
 10                       76            760     100     5,776
 11                       78            858     121     6,084
 12                       80            960     144     6,400
 13                       83          1,079     169     6,889
 14                       85          1,190     196     7,225
Total: ∑X = 105   ∑Y = 1,040   ∑XY = 7,922   ∑X² = 1,015   ∑Y² = 73,598
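The column sums that the substitution step relies on can be checked with a short script. This is only a sketch; the variable names (`hours`, `scores`, `sum_xy`, and so on) are chosen here for illustration, and ∑Y² is included for completeness even though the slope and intercept formulas do not use it.

```python
# Data from the example: hours studied (X) and test scores (Y).
hours = list(range(15))  # 0, 1, ..., 14
scores = [50, 55, 60, 61, 63, 65, 68, 70, 72, 74, 76, 78, 80, 83, 85]

n = len(hours)
sum_x = sum(hours)
sum_y = sum(scores)
sum_xy = sum(x * y for x, y in zip(hours, scores))
sum_x2 = sum(x * x for x in hours)
sum_y2 = sum(y * y for y in scores)

print(n, sum_x, sum_y, sum_xy, sum_x2, sum_y2)
# prints: 15 105 1040 7922 1015 73598
```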
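Solving for b:
Formula:

$$b = \frac{n\sum xy - \sum x \sum y}{n\sum x^2 - \left(\sum x\right)^2}$$

Substitution:

$$b = \frac{15(7922) - (105)(1040)}{15(1015) - (105)^2}$$

$$b = \frac{118{,}830 - 109{,}200}{15{,}225 - 11{,}025}$$

$$b = \frac{9{,}630}{4{,}200}$$

$$b \approx 2.29$$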
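Solving for a:
Formula:

$$a = \bar{y} - b\bar{x}$$

Substitution:

$$a = \frac{1040}{15} - (2.29)\,\frac{105}{15}$$

$$a = 69.33 - 16.03$$

$$a = 53.30$$

Therefore, the equation of the regression line is

$$y = 53.30 + 2.29x$$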
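As a check on the arithmetic above, the slope and intercept can be recomputed from the column sums. The sketch below follows the same order as the hand computation, rounding b to two decimal places before solving for a; the names `b_exact` and `a_exact` are introduced here only to show the values without that intermediate rounding.

```python
# Column sums taken from the worked example.
n, sum_x, sum_y, sum_xy, sum_x2 = 15, 105, 1040, 7922, 1015

b_exact = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
b = round(b_exact, 2)                       # rounded first, as in the hand computation
a = round(sum_y / n - b * (sum_x / n), 2)   # a = y_bar - b * x_bar

# Same intercept formula, but without rounding b beforehand.
a_exact = sum_y / n - b_exact * (sum_x / n)

print(f"b = {b:.2f}, a = {a:.2f}")
print(f"unrounded: b = {b_exact:.4f}, a = {a_exact:.4f}")
# prints:
# b = 2.29, a = 53.30
# unrounded: b = 2.2929, a = 53.2833
```

The small difference between 53.30 and 53.28 comes only from rounding b before computing a; either value rounds to a predicted score of 53 at X = 0.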
Question No. 2: Predict the score of a student who studied for
a) 15 hours
b) 17 hours
Solution for letter a:

$$y = 53.30 + 2.29(15)$$

$$y = 53.30 + 34.35$$

$$y = 87.65 \text{ or } 88$$

a) If the student studied for 15 hours before taking the
Advanced Statistics test, the predicted score is 88.
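The same substitution can be applied to both parts of Question No. 2. The sketch below assumes the rounded coefficients a = 53.30 and b = 2.29 from the computation above; the function name `predict_score` is hypothetical and used only for illustration.

```python
def predict_score(hours, a=53.30, b=2.29):
    """Predicted score from the fitted line y = a + b * hours."""
    return a + b * hours


for h in (15, 17):
    y = predict_score(h)
    print(f"{h} hours -> {y:.2f} (about {round(y)})")
# prints:
# 15 hours -> 87.65 (about 88)
# 17 hours -> 92.23 (about 92)
```

Both 15 and 17 hours lie beyond the observed range of 0 to 14 hours, so these predictions assume that the linear trend continues past the data.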
The y-intercept is a = 53.30. This implies that a student who did not
study at all before taking the test is predicted to get a score of
about 53.30, or 53. In general, the y-intercept (a) represents the
predicted value of Y when X = 0, provided that X = 0 falls within the
scope of the regression model. Otherwise, the y-intercept (a) has no
meaningful interpretation.