Chapter 10 – Logistic Regression
Data Mining for Business Analytics in RapidMiner
Shmueli, Bruce, Deokar & Patel
The contributions of Sambit Tripathi are gratefully acknowledged
© Galit Shmueli, Peter Bruce and Amit Deokar 2023
Logistic Regression
●Extends the idea of linear regression to situations where the outcome variable is categorical
●Widely used, particularly where a structured model is useful to explain (=profiling) or to predict
●We focus on binary classification, i.e. Y=0 or Y=1, for which we need probabilities, P(Y=1) or P(Y=0)
●The standard linear model doesn't work for probabilities, which are bounded by 0 and 1
●We work instead with odds and a function of them, the logit
The Logit
Goal: Find a function of the predictor variables that relates them to a 0/1 outcome
●Instead of Y as the outcome variable (as in linear regression), we use a function of Y called the logit
●The logit can be modeled as a linear function of the predictors
●Model parameters are fit iteratively via maximum likelihood estimation
●The logit can be mapped back to a probability, which, in turn, can be mapped to a class
Step 1: Logistic Response Function
p = probability of belonging to class 1
We need to relate p to the predictors with a function that guarantees 0 ≤ p ≤ 1. The standard linear function does not:
$$p = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q \qquad (q = \text{number of predictors})$$
The fix: use the logistic response function (eq. 10.2 in the textbook):
$$p = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q)}}$$
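As a minimal Python sketch (the book itself works in RapidMiner operators), the logistic response function keeps any linear score inside (0, 1); the coefficients here are made up purely for illustration:

```python
import numpy as np

def logistic(x, beta0, beta1):
    # Logistic response function (eq. 10.2) for a single predictor:
    # maps the linear score beta0 + beta1*x into the interval (0, 1)
    return 1.0 / (1.0 + np.exp(-(beta0 + beta1 * x)))

scores = np.linspace(-10, 10, 5)
print(logistic(scores, 0.0, 1.0))  # every value lies strictly between 0 and 1
```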
Step 2: The Odds
The odds of an event are defined as (eq. 10.3):
$$\text{Odds} = \frac{p}{1 - p} \qquad (p = \text{probability of the event})$$
Or, given the odds of an event, the probability of the event can be computed by (eq. 10.4):
$$p = \frac{\text{Odds}}{1 + \text{Odds}}$$
We can also relate the odds to the predictors (eq. 10.5); to get this result, substitute eq. 10.2 into eq. 10.4 and solve for the odds:
$$\text{Odds} = e^{\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q}$$
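A small sketch of eqs. 10.3 and 10.4 as Python helpers, with an arbitrary p = 0.8 chosen for illustration:

```python
def prob_to_odds(p):
    return p / (1.0 - p)        # eq. 10.3

def odds_to_prob(odds):
    return odds / (1.0 + odds)  # eq. 10.4

print(prob_to_odds(0.8))   # 4.0 -- "4 to 1" odds
print(odds_to_prob(4.0))   # 0.8 -- converts back to the original probability
```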
Step 3: Take the Log of Both Sides
This gives us the logit (eq. 10.6):
$$\log(\text{Odds}) = \text{logit} = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q$$
Logit, cont.
So the logit is a linear function of the predictors x1, x2, …, xq
●It takes values from −infinity to +infinity
Review the relationship between logit, odds, and probability:
[Figure: Odds (a) and logit (b) as functions of p]
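A matplotlib sketch that reproduces the kind of two-panel plot shown here, odds and logit as functions of p:

```python
import numpy as np
import matplotlib.pyplot as plt

p = np.linspace(0.01, 0.99, 200)
odds = p / (1 - p)      # ranges from ~0 up toward +infinity
logit = np.log(odds)    # ranges from -infinity to +infinity

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
ax1.plot(p, odds)
ax1.set(title="(a) Odds", xlabel="p", ylabel="odds")
ax2.plot(p, logit)
ax2.set(title="(b) Logit", xlabel="p", ylabel="logit")
plt.tight_layout()
plt.show()
```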
Example: Personal Loan Offer (UniversalBank.csv)
Outcome variable: accept bank loan (0/1)
Predictors: demographic info, and info about the customer's bank relationship
Single Predictor Model
Modeling loan acceptance on income (x)
Fitted coefficients (more later): b0 = −6.3525, b1 = 0.0392
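A quick worked check of these coefficients (Income is recorded in $000s in UniversalBank.csv, so income = 100 means $100K):

```python
import math

b0, b1 = -6.3525, 0.0392   # fitted intercept and Income coefficient
income = 100               # $100K

logit = b0 + b1 * income   # -2.4325
odds = math.exp(logit)     # ~0.088
p = odds / (1 + odds)      # ~0.08 -- low estimated probability of acceptance
print(round(p, 3))
```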
Seeing the Relationship
[Figure: fitted probability of loan acceptance as a function of income]
Last Step: Classify
The model produces an estimated probability of being a "1"
●Convert to a classification by establishing a cutoff level
●If the estimated probability > cutoff, classify as "1"
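A minimal sketch of the cutoff step, using made-up estimated probabilities:

```python
import numpy as np

probs = np.array([0.03, 0.41, 0.62, 0.88])  # estimated P(Y=1) for four records
cutoff = 0.5
classes = (probs > cutoff).astype(int)      # classify as "1" when prob > cutoff
print(classes)                              # [0 0 1 1]
```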
Ways to Determine Cutoff
●0.50 is a popular initial choice
●Additional considerations (see Chapter 5):
 ● Maximize classification accuracy
 ● Maximize sensitivity (subject to a minimum level of specificity)
 ● Minimize false positives (subject to a maximum false negative rate)
 ● Minimize expected cost of misclassification (need to specify costs)
Example, cont.
●Estimates of the β's are derived through an iterative process called maximum likelihood estimation
●Let's now include all 12 predictors in the model
Data Prep
• Select Attributes: keep the 11 predictor columns (dummy coding of Education later yields 12 model predictors)
• Map: replace negative values of Experience
• Numerical to Polynominal & Nominal to Numerical: convert Education, Securities Account, CD Account, Online & CreditCard to categorical and apply dummy coding
• Numerical to Binominal: Personal Loan
• Map: Personal Loan "Acceptor" as the positive class
• Set Role: Personal Loan with target (label) role & ID with id role
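A rough pandas equivalent of these operators, assuming the standard UniversalBank.csv column names (adjust to your copy of the file); clipping negatives to zero is one simple choice for the Experience fix:

```python
import pandas as pd

df = pd.read_csv("UniversalBank.csv")
df = df.drop(columns=["ZIP Code"])                 # keep ID, target, and the 11 predictors
df["Experience"] = df["Experience"].clip(lower=0)  # replace negative Experience values
df = pd.get_dummies(df, columns=["Education"],     # dummy-code Education (3 levels);
                    drop_first=True)               # 0/1 columns can stay numeric here
y = (df.pop("Personal Loan") == 1).astype(int)     # 1 = "Acceptor" (positive class)
ids = df.pop("ID")                                 # set ID aside as an identifier
X = df
```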
Fitting and Applying Model
• Data Preprocessing: the preprocessing steps above
• Split Data: 60% training, 40% holdout
• Logistic Regression: model estimation
• Apply Model: score the holdout data
• Create Lift Charts
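A scikit-learn sketch of the same flow; this mirrors, rather than reproduces, the RapidMiner process, and uses the X and y built in the prep sketch above:

```python
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

# 60% training / 40% holdout, mirroring the Split Data operator
X_train, X_hold, y_train, y_hold = train_test_split(
    X, y, train_size=0.6, random_state=1)

model = LogisticRegression(penalty=None, max_iter=1000)  # plain MLE fit
model.fit(X_train, y_train)                              # (penalty='none' on older sklearn)
probs = model.predict_proba(X_hold)[:, 1]                # P(accept) for holdout records
```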
Personal Loan Offer: Estimated Model
Logit(Personal Loan = Yes) = −12.293 − 0.038 Age + 0.048 Experience + 0.060 Income + 0.606 Family + 0.186 CCAvg + 0.000 Mortgage − 1.328 Securities Account + 3.514 CD Account − 0.610 Online − 0.799 CreditCard + 3.972 Education.2 + 4.101 Education.3
● Interpretation:
 ○ Older age, holding a securities account, banking online, and owning a credit card are associated with lower acceptance
 ○ Graduate/professional education, higher income, more family members, higher credit card spending, more professional experience, and having a CD (certificate of deposit) account are associated with a higher probability of accepting the offer
● Interpreting in terms of odds:
 ○ CD Account: e^3.514 = 33.58, i.e., the odds that a customer with a CD account accepts the offer are 33.58 times the odds for a customer without one
Converting to Probability
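The conversion from the model's logit output to a probability follows from eq. 10.4 with Odds = e^logit:
$$\hat{p} = \frac{e^{\text{logit}}}{1 + e^{\text{logit}}} = \frac{1}{1 + e^{-\text{logit}}}$$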
Interpreting Odds, Probability
For predictive classification, we typically use the probability with a cutoff value
For explanatory purposes, odds have a useful interpretation:
●If we increase x1 by one unit, holding x2, x3, …, xq constant, then
●e^b1 is the factor by which the odds of belonging to class 1 are multiplied (the odds ratio)
Loan Example:
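As a sketch using the single-predictor loan model fitted earlier, a one-unit increase in income multiplies the odds by e^b1:

```python
import math

b0, b1 = -6.3525, 0.0392               # single-predictor loan model from earlier

def odds(income):
    return math.exp(b0 + b1 * income)  # eq. 10.5 with one predictor

print(odds(101) / odds(100))           # ~1.040: odds ratio for +1 unit of income
print(math.exp(b1))                    # e^b1 -- the same factor
```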
Evaluating Classification Performance
Performance measures: confusion matrix and % of misclassifications
More useful in this example: gains (lift) charts (the terms are sometimes used interchangeably)
[Figure: lift (gains) chart]
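A small sketch of how cumulative gains are computed, with toy labels and probabilities:

```python
import numpy as np

def cumulative_gains(y_true, probs):
    # Rank records by estimated P(Y=1), then track the cumulative share
    # of all actual "1"s captured as we move down the ranked list
    order = np.argsort(probs)[::-1]
    hits = np.cumsum(np.asarray(y_true)[order])
    return hits / hits[-1]

gains = cumulative_gains([0, 1, 0, 0, 1], [0.1, 0.9, 0.3, 0.2, 0.7])
print(gains)  # [0.5 1. 1. 1. 1.] -- the top 2 ranked records capture both "1"s
```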
Multicollinearity
Problem: as in linear regression, if one predictor is a linear combination of other predictor(s), model estimation will fail
●Note that in such a case, we have at least one redundant predictor
Solution: remove extreme redundancies (by dropping predictors via variable selection, or by data reduction methods such as PCA)
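A minimal numpy sketch of the problem: when one column is an exact linear combination of others, the predictor matrix loses rank, which is what breaks estimation:

```python
import numpy as np

rng = np.random.default_rng(1)
x1, x2 = rng.normal(size=100), rng.normal(size=100)
X = np.column_stack([x1, x2, x1 + x2])  # third predictor is redundant

# Rank below the number of columns signals at least one redundant predictor
print(np.linalg.matrix_rank(X), "of", X.shape[1], "columns are independent")
```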
Variable Selection
This is the same issue as in linear regression:
●The number of correlated predictors can grow when we create derived variables such as interaction terms (e.g. Income × Family) to capture more complex relationships
●Problem: overly complex models run the risk of overfitting
●Solution: reduce the number of variables via automated selection of variable subsets (as with linear regression)
●See Chapter 6
P-values for Predictors
●Test the null hypothesis that a coefficient equals 0
●Useful for deciding whether to include a variable in the model
●Important in profiling tasks, but less important in predictive classification
Multi-Class Classification
Extending the logistic regression classifier (see the sketch after this list):
• Option 1:
 • Transform the multi-class variable into multiple dummy variables and build a logistic model for each dummy variable
 • Once probabilities for each class are computed, the "winning" class is the one with the maximum probability
• Option 2:
 • Some software tools have built-in multinomial logistic regression implementations, which can be used directly
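A scikit-learn sketch of both options, using the iris data purely as a stand-in three-class example:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

X, y = load_iris(return_X_y=True)

# Option 1: one binary logistic model per class; predict the max-probability class
ovr = OneVsRestClassifier(LogisticRegression(max_iter=1000)).fit(X, y)
print(ovr.predict(X[:3]))

# Option 2: a built-in multinomial implementation (sklearn's default behavior here)
multi = LogisticRegression(max_iter=1000).fit(X, y)
print(multi.predict(X[:3]))
```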
Summary
●Logistic regression is similar to linear regression, except that it is used with a categorical response
●It can be used for explanatory tasks (=profiling) or predictive tasks (=classification)
●The predictors are related to the response Y via a nonlinear function called the logit
●As in linear regression, the number of predictors can be reduced via variable selection
●Logistic regression can be generalized to more than two classes