a = Σy/n - b(Σx/n)
Example 1
A car engineer wants to study the relationship between a car's speed (km per hour) and
its fuel consumption (km per liter) for the new models. The cars were driven in a multi-speed
test and the results were recorded in the table below. Find the linear regression equation of
fuel consumption on the car's speed.
Speed (km/h), x 30 40 50 60 70 80 90 100 110
Fuel consumption, y 39 38 36 32 28 27 25 23 22
The regression line has the form y = a + bx.
Interpretation of the regression coefficient
y = a + bx
‘a’ is the y-intercept, the value of y when x = 0.
• When x = 0, y = a.
• We can state that when x = 0, the expected value of y is a.
‘b’ is the slope, the average change in y for each one-unit increase in x.
a = 46.331
We can state that when the car's speed is 0 km/h, the expected fuel
consumption is 46.331 km/l.
b = -0.2333
We can state that, on average, the fuel consumption decreases by
0.2333 km/l for each additional 1 km/h of the car's speed.
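The coefficients for Example 1 can be checked with a short calculation. The following is a minimal sketch in Python, assuming the standard least-squares formulas for the slope and intercept; the function name `fit_line` and the variable names are illustrative and do not come from the slides.

```python
# Least-squares fit of y = a + b*x for the Example 1 table.
speed = [30, 40, 50, 60, 70, 80, 90, 100, 110]   # x, km/h
fuel = [39, 38, 36, 32, 28, 27, 25, 23, 22]      # y, km/l


def fit_line(x, y):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_xx = sum(xi * xi for xi in x)
    # Slope: b = (n*Sxy - Sx*Sy) / (n*Sxx - Sx^2)
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)
    # Intercept: a = mean(y) - b * mean(x)
    a = sum_y / n - b * sum_x / n
    return a, b


a, b = fit_line(speed, fuel)
print(f"a = {a:.3f}, b = {b:.4f}")
```

With unrounded arithmetic the intercept comes out at about 46.333. The value 46.331 quoted in this section is what results when b is first rounded to -0.2333 and then substituted into the formula for a.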
From Example 2 – Study hours and examination score
The regression line is
Prediction using regression line
● The fitted regression line is used to predict or estimate the value of y
when a value of x is given.
● The stronger the correlation between the variables (a strong correlation,
|r| > 0.7), the more accurate the prediction.
Extrapolation and interpolation
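● Extrapolation means using the regression line to find the value of
y when the value of x is outside the range of the observations.
Extrapolation makes the forecast less accurate and less reliable.
● Interpolation means using the regression line to find the value of
y when the value of x is within the range of the observations.
An interpolation estimate is considered reliable.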
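The fuel consumption is 31.17 km/l if the car's speed is 65 km/h, since
y = 46.331 - 0.2333(65) = 31.17. This is an interpolation estimate, because 65 km/h
lies within the observed range of 30 to 110 km/h, so it is reliable.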
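The prediction and the interpolation check can be sketched together. This is an illustrative Python snippet, assuming the coefficients quoted for Example 1 (a = 46.331, b = -0.2333) and the observed speed range of 30 to 110 km/h; the helper name `predict` and the 150 km/h test speed are hypothetical.

```python
# Predict fuel consumption from speed and label the estimate.
a, b = 46.331, -0.2333      # coefficients quoted for Example 1
x_min, x_max = 30, 110      # observed speed range in the table, km/h


def predict(x):
    """Return (predicted y, 'interpolation' or 'extrapolation')."""
    y = a + b * x
    kind = "interpolation" if x_min <= x <= x_max else "extrapolation"
    return y, kind


y65, kind65 = predict(65)        # speed inside the observed range
y150, kind150 = predict(150)     # hypothetical speed outside the range
print(f"65 km/h:  {y65:.2f} km/l ({kind65})")
print(f"150 km/h: {y150:.2f} km/l ({kind150})")
```

The label only reports whether x falls inside the observed range. It does not measure how accurate the estimate is, which also depends on the strength of the correlation.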
From Example 2 – Study hours and examination score
The regression line is
Predict the examination score if the study time is 37 hours. Is the estimate
reliable? Explain your answer.
Study hours, x 10 34 23 27 32 18 22 25
Examination score, y 51 84 70 88 92 65 75 77
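One way to work through this question is to fit the line to the table and then check whether 37 hours lies inside the observed range. The sketch below is in Python and assumes the standard least-squares formulas. The coefficients are computed from the table, so their rounding may differ from the regression line given for Example 2, and the names `fit_line` and `predicted_score` are illustrative.

```python
# Fit y = a + b*x to the Example 2 table and predict the score at 37 hours.
hours = [10, 34, 23, 27, 32, 18, 22, 25]     # x, study hours
score = [51, 84, 70, 88, 92, 65, 75, 77]     # y, examination score


def fit_line(x, y):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_xx = sum(xi * xi for xi in x)
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)
    a = sum_y / n - b * sum_x / n
    return a, b


a, b = fit_line(hours, score)
x_new = 37
predicted_score = a + b * x_new

# 37 hours is compared with the smallest and largest observed study hours.
is_extrapolation = not (min(hours) <= x_new <= max(hours))
print(f"a = {a:.2f}, b = {b:.4f}")
print(f"predicted score at {x_new} hours = {predicted_score:.2f}")
print("extrapolation" if is_extrapolation else "interpolation")
```

Because the observed study hours run from 10 to 34, a value of 37 falls outside the range, so the sketch reports an extrapolation.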
Exercise 2 (Correlation and regression line using Excel)
The following table presents the number of pages in a book versus the price of the
book for 10 books in a lecturer's room.
Number of pages 500 190 240 300 350 410 490 100 550 540
Price of book (RM) 50 25 50 75 50 40 45 32 60 55
a. State the dependent and independent variables.
b. Using Excel, draw a scatter diagram for the above data.
c. Find and interpret the Pearson’s Product Moment Correlation Coefficient.
d. Calculate and interpret the coefficient of determination.
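The exercise asks for Excel, where the built-in functions CORREL and RSQ return these two quantities. As a cross-check outside Excel, parts c and d can be sketched in Python as follows. The function name `pearson_r` is illustrative, and treating the number of pages as x and the price as y is an assumption made here (the value of r is the same either way).

```python
from math import sqrt

# Exercise 2 data: number of pages and price (RM) for 10 books.
pages = [500, 190, 240, 300, 350, 410, 490, 100, 550, 540]   # x (assumed)
price = [50, 25, 50, 75, 50, 40, 45, 32, 60, 55]             # y (assumed)


def pearson_r(x, y):
    """Pearson's product moment correlation coefficient of x and y."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    s_xx = sum((xi - mean_x) ** 2 for xi in x)
    s_yy = sum((yi - mean_y) ** 2 for yi in y)
    return s_xy / sqrt(s_xx * s_yy)


r = pearson_r(pages, price)
r_squared = r ** 2          # coefficient of determination
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
```

The coefficient of determination is the square of r, and it is read as the proportion of the variation in y that is explained by the linear relationship with x.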