You are on page 1of 17

LOGISTIC

REGRESSION

Sachin Nandakar
LOGISTIC REGRESSION
It is a regression method used to find
the probability of an event which is a
binary outcome.

In other words,
It is a regression analysis to conduct
when the dependent variable is
dichotomous(binary).
EXAMPLE

 How does the probability of placement


(Selected / Not-selected) change for every
additional percentage score in examination?

 Do change in body fat intake & age have an


influence on the probability of having a
heart attack (Yes / No)?
MAJOR ASSUMPTIONS &
UNDERSTANDINGS
 The dependent variable should be dichotomous in
nature (Ex: Pass/Fail, Present/Absent)
 There should be no high correlations (multi-
colinearity) among the predictors(independent
variables).
 It requires large sample size because maximum
likelihood estimates are less powerful at low sample
sizes.
 The regression doesn’t require linear relationship
between dependent and independent variables.
RANGE, ODDS & LOGIT
FUNCTION
Here the value of Y-dependent variable ranges
from 0 to 1 and it can be represented by following
equation
PLACEMENT DATA

Placement
Percentage
(0-Not Selected
Score
1-Selected)
42 0
48 1
54 1
65 1
40 0
52 0
60 1
58 1
40 0
SCATTER PLOT

BEST FIT ?
PROBABILITY
WHAT ARE THE ODDS?
ODDS RATIO
BRING BERNOULLI
PREDICTIVE EQUATION
After consideration of training datasets, we can
get the values bo & b1 –co-efficient of regression
& assuming the probability value is 0.76 for the
placement score of 55, then the outcome can be
approximated as 1 i.e selected.
LINEAR v/s LOGISTIC
Linear Regression Logistic Regression

Variable Type Continuous Dependent Categorical Dependent


Variable Variable

Estimation Least-Square Estimation Maximum Likelihood


Method Estimation

Equation y = ax + b y = b0 + b1x1 + b2x2 +….

Best fit line Straight Line Curve

Output Predicted Integer Value Predicted Binary Value (


0 or 1 )

You might also like