Professional Documents
Culture Documents
Regression
“Regression is the measure of the average
relationship between two or more variables in
terms of the original units of the data”.
“Regression analysis is an attempt to establish
the nature of the relationship between variables-
that is to study the functional relationship
between the variables and thereby provides a
mechanism for prediction or forecasting”.
Regression Analysis is a statistical device with the help of
which we are able to estimate the unknown values of one
variable from known values of another variable. The
variable which is used to predict the another variable is
called independent variable (explanatory variable) and, the
variable we are trying to predict is called dependent variable
(explained variable).
The dependent variable is denoted by X and the independent
variable is denoted by Y.
The analysis used in regression is called simple linear
regression analysis. It is called simple because three is only
one predictor (independent variable). It is called linear
because, it is assumed that there is linear relationship
between independent variable and dependent variable.
Types of Regression
There are two types of regression. They are linear
regression and multiple regression.
Linear Regression:
It is a type of regression which uses one independent
variable to explain and/or predict the dependent
variable.
Multiple Regression:
It is a type of regression which uses two or more
independent variable to explain and/or predict the
dependent variable.
Regression Lines:
Regression line is a graphic technique to show the functional
relationship between the two variables X and Y. It is a line which
shows the average relationship between two variables X and Y.
If there is perfect positive correlation between 2 variables, then the
two regression lines are winding each other and to give one line.
There would be two regression lines when there is no perfect
correlation between two variables. The nearer the two regression
lines to each other, the higher is the degree of correlation and the
farther the regression lines from each other, the lesser is the degree of
correlation.
Properties of Regression lines:-
1. The two regression lines cut each other at the point of average of
X and average of Y ( i.e X and Y )
2. When r = 1, the two regression lines coincide each other and give
one line.
3. When r = 0, the two regression lines are mutually perpendicular.
Regression Equations (Estimating Equations)
y β0 β1x ε
Variable
y y β0 β1x ε
Observed Value
of y for xi
εi Slope = β1
Predicted Value Random Error
of y for xi
for this x value
Intercept = β0
xi x
Estimated Regression Model
The sample regression line provides an estimate of
the population regression line
ŷ i b0 b1x variable
b1
( x x )( y y )
and b0 y b1 x
(x x) 2
algebraic equivalent:
xy x y
b1 n
x 2
( x ) 2
n
Regression Equation of Y on X:-
b1
( x x )( y y )
b0 y b1 x
(x x) 2
Regression Equation of X on Y:-
b1
( x x )( y y )
b0 x b1 y
( y y) 2
Interpretation of the
Slope and the Intercept
350
Slope
300
250
= 0.10977
200
150
100
50
Intercept 0
= 98.248
0 500 1000 1500 2000 2500 3000
Square Feet
Regression coefficients of Y on X
Regression equation using mean and
regression coefficient
Regression equation of Y on X
Regression equation of X on Y
Ans:
For the data given below
Average S.D.
Production(in units) 35 10
Capacity 85 8
utilisation(%)
Coefficient of correlation=0.6
Obtain the two regression equations.
Estimate the production when the capacity utilisation is
70 per cent.
Properties of the regression coefficients
(1) The coefficient correlation is the geometric mean
of the two regression coefficients