Logistic Regression Guide

This document provides an overview of logistic regression. It begins with a review of simple and multiple linear regression. It then introduces the logistic function and how it is used to model dichotomous outcome variables. It describes how to interpret the coefficients in logistic regression models when predictors are continuous, dichotomous, or have multiple categories. It provides examples of using logistic regression to model relationships between disease status and risk factors.

Uploaded by

Mohamed Med

Introduction to Logistic Regression: Briefly introduces the concept of logistic regression and its purpose within statistical modeling.
Outline of Regression Concepts: Provides an outline of the topics covered, including simple and multiple logistic regression methods and examples.
Simple Linear Regression: Explains the basics of modeling a numeric response with a single predictor in linear regression.
Multiple Linear Regression: Describes the use of multiple predictors in determining the numeric response, emphasizing linear combinations.
General Linear Models: Discusses the family of regression models and introduces logistic regression within this context.
Introduction to Logistic Regression: Defines logistic regression and its applicability to categorical and continuous predictor variables.
Logistic Regression Applied: Coronary Heart Disease: Uses an example of logistic regression to analyze the relationship between age and signs of heart disease.
Logistic Function: Visualizes the logistic function, essential for understanding logistic regression probability estimates.
Logit Transformation: Introduces the logit transformation used to link predictor variables with dichotomous outcome variables in logistic regression.
Dichotomous Predictor Analysis: Explores the application of logistic regression to dichotomous predictor variables, explaining odds ratio and transformations.
Examples: Age and Cervical Cancer: Provides examples of logistic regression used to study the association between age at first pregnancy and cervical cancer risk.
Example: Smoking and Low Birth Weight: Examines how logistic regression models the relationship between smoking during pregnancy and low birth weight outcomes.
Putting it All Together: Summarizes how diverse predictor types and their logistic interpretations integrate into comprehensive regression modeling.
Summary of Logistic Regression: Concludes with key points on logistic regression, emphasizing odds ratios and predictor interpretation.

Logistic Regression

Outline
Review of simple and multiple regression
Simple Logistic Regression
The logistic function
Interpretation of coefficients
continuous predictor (X)
dichotomous categorical predictor (X)
categorical predictor with three or more levels (X)

Multiple Logistic Regression

Examples

Simple Linear Regression

Model the mean of a numeric response Y as a
function of a single predictor X, i.e.

$E(Y|X) = \beta_0 + \beta_1 f(X)$

Here f(X) is any function of X, e.g.
f(X) = X: $E(Y|X) = \beta_0 + \beta_1 X$ (line)
f(X) = ln(X): $E(Y|X) = \beta_0 + \beta_1 \ln(X)$ (curved)
The key is that E(Y|X) is linear in the
parameters $\beta_0$ and $\beta_1$ but not necessarily in X.

Simple Linear Regression

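$\hat{y} = \hat{E}(Y|X) = \hat{\beta}_0 + \hat{\beta}_1 x$

$\hat{\beta}_0$ = Estimated Intercept
= $\hat{y}$-value at x = 0
Interpretable only if x = 0 is a
value of particular interest.

$\hat{\beta}_1$ = Estimated Slope
= change in $\hat{y}$ for every
unit increase in x
= estimated change
in the mean of Y for
a unit change in X.
Always interpretable!

[Figure: fitted line; a w-unit increase in x corresponds to a change of $\hat{\beta}_1 w$ units in $\hat{y}$.]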

Multiple Linear Regression

We model the mean of a numeric response
as a linear combination of the predictors
themselves or of some functions based on
the predictors, i.e.

$E(Y|X) = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \cdots + \beta_p X_p$
Here the terms in the model are the predictors.

$E(Y|X) = \beta_0 + \beta_1 f_1(X) + \beta_2 f_2(X) + \cdots + \beta_k f_k(X)$
Here the terms in the model are k different functions of the p predictors.

Multiple Linear Regression

For the classic multiple regression model
$E(Y|X) = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \cdots + \beta_p X_p$
the regression coefficients ($\beta_i$) represent the
estimated change in the mean of the
response Y associated with a unit change
in Xi while the other predictors are held
constant.
They measure the association between Y and
Xi adjusted for the other predictors in the
model.

General Linear Models

Family of regression models

Response          Model Type
Continuous        Linear regression
Counts            Poisson regression
Survival times    Cox model
Binomial          Logistic regression

Uses
Control for potentially confounding factors
Model building, risk prediction

Logistic Regression
Models the relationship between a set of variables $X_i$

dichotomous (yes/no, smoker/nonsmoker, ...)
categorical (social class, race, ...)
continuous (age, weight, gestational age, ...)

and a
dichotomous categorical response variable Y,
e.g. Success/Failure, Remission/No Remission,
Survived/Died, CHD/No CHD, Low Birth Weight/Normal
Birth Weight, etc.

Logistic Regression
Example: Coronary Heart Disease (CD) and Age
In this study, sampled individuals were examined for
signs of CD (present = 1 / absent = 0), and the
potential relationship between this outcome and their
age (yrs.) was considered.

This is a portion of the raw data for the 100 subjects who
participated in the study.

Logistic Regression
How can we analyze these data?

Non-pooled t-test

The mean age of the individuals with some signs of
coronary heart disease is 51.28 years vs. 39.18 years for
individuals without signs (t = 5.95, p < .0001).

Logistic Regression
Simple Linear Regression?

$\hat{E}(CD|Age) = -0.54 + 0.02 \cdot Age$

e.g. for an individual 50 years of age
$\hat{E}(CD|Age = 50) = -0.54 + 0.02(50) = 0.46$ ??
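To see why the straight-line fit deserves its question marks, the short sketch below evaluates the fitted line at a few ages. It uses the rounded coefficients shown on this slide; the function name and the chosen ages are illustrative assumptions, not part of the original analysis.

```python
# Rounded coefficients of the fitted straight line from this slide.
INTERCEPT = -0.54
SLOPE = 0.02

def linear_estimate(age):
    """Fitted mean of the 0/1 CD indicator under the straight-line model."""
    return INTERCEPT + SLOPE * age

# Ages chosen only for illustration.
for age in (20, 50, 80):
    print(age, round(linear_estimate(age), 2))
```

With these coefficients the fitted value is negative at age 20 and exceeds 1 at age 80, so it cannot be read as a proportion across the whole age range.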

Smooth Regression Estimate?

The smooth regression estimate is S-shaped, but what does the
estimated mean value represent?
Answer: P(CD|Age)!!!!

Logistic Regression
We can group individuals into age classes and look
at the percentage/proportion showing signs of
coronary heart disease.

Notice the S-shape to the estimated proportions vs. age.

Logistic Function

[Figure: the logistic curve, P(Success|X) plotted against X.]

$P(\text{"Success"}|X) = \dfrac{e^{\beta_0 + \beta_1 X}}{1 + e^{\beta_0 + \beta_1 X}}$
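A minimal sketch of the logistic function in Python may help make the S-shape concrete. The function name and the coefficient values below are hypothetical, chosen only for illustration.

```python
import math

def logistic(x, beta0, beta1):
    """P(Success | X = x) under the logistic model."""
    eta = beta0 + beta1 * x               # linear predictor
    return 1.0 / (1.0 + math.exp(-eta))   # algebraically equal to e^eta / (1 + e^eta)

# Hypothetical coefficients, used only to trace the curve.
for x in (-6, -2, 0, 2, 6):
    print(x, round(logistic(x, beta0=0.0, beta1=1.0), 3))
```

Whatever the coefficients, the returned value stays strictly between 0 and 1, which is what makes the function suitable for modelling a probability.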

Logit Transformation
The logistic regression model is given by

$P(Y|X) = \dfrac{e^{\beta_0 + \beta_1 X}}{1 + e^{\beta_0 + \beta_1 X}}$

which is equivalent to

$\ln\left(\dfrac{P(Y|X)}{1 - P(Y|X)}\right) = \beta_0 + \beta_1 X$

This is called the
Logit Transformation
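The equivalence of the two forms can be checked in a few lines of algebra. In the sketch below, P is shorthand for P(Y|X).

```latex
\begin{align*}
P &= \frac{e^{\beta_0+\beta_1 X}}{1+e^{\beta_0+\beta_1 X}}
\quad\Longrightarrow\quad
1-P = \frac{1}{1+e^{\beta_0+\beta_1 X}} \\
\frac{P}{1-P} &= e^{\beta_0+\beta_1 X} \\
\ln\!\left(\frac{P}{1-P}\right) &= \beta_0+\beta_1 X
\end{align*}
```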

Dichotomous Predictor
Consider a dichotomous predictor (X) which
represents the presence of risk (1 = present, 0 = absent)

$\dfrac{P}{1 - P} = e^{\beta_0 + \beta_1 X}$

Odds for Disease with Risk Present:
$\dfrac{P(Y=1|X=1)}{1 - P(Y=1|X=1)} = e^{\beta_0 + \beta_1}$

Odds for Disease with Risk Absent:
$\dfrac{P(Y=1|X=0)}{1 - P(Y=1|X=0)} = e^{\beta_0}$

Therefore the odds ratio (OR) is

$OR = \dfrac{\text{Odds for Disease with Risk Present}}{\text{Odds for Disease with Risk Absent}} = \dfrac{e^{\beta_0 + \beta_1}}{e^{\beta_0}} = e^{\beta_1}$

Dichotomous Predictor
Therefore, for the odds ratio associated with
risk presence we have $OR = e^{\beta_1}$

Taking the natural logarithm we have

$\ln(OR) = \beta_1$

thus the estimated regression coefficient
associated with a 0-1 coded dichotomous
predictor is the natural log of the OR
associated with risk presence!!!

Logit is Directly Related to Odds

The logistic model can be written

$\ln\left(\dfrac{P(Y|X)}{1 - P(Y|X)}\right) = \ln\left(\dfrac{P}{1 - P}\right) = \beta_0 + \beta_1 X$

This implies that the odds for success can
be expressed as

$\dfrac{P}{1 - P} = e^{\beta_0 + \beta_1 X}$

This relationship is the key to interpreting the
coefficients in a logistic regression model!!

Dichotomous Predictor (+1/-1 coding)

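Consider a dichotomous predictor (X) which
represents the presence of risk (+1 = present, -1 = absent)

$\dfrac{P}{1 - P} = e^{\beta_0 + \beta_1 X}$

Odds for Disease with Risk Present:
$\dfrac{P(Y=1|X=1)}{1 - P(Y=1|X=1)} = e^{\beta_0 + \beta_1}$

Odds for Disease with Risk Absent:
$\dfrac{P(Y=1|X=-1)}{1 - P(Y=1|X=-1)} = e^{\beta_0 - \beta_1}$

Therefore the odds ratio (OR) is

$OR = \dfrac{\text{Odds for Disease with Risk Present}}{\text{Odds for Disease with Risk Absent}} = \dfrac{e^{\beta_0 + \beta_1}}{e^{\beta_0 - \beta_1}} = e^{2\beta_1}$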

Dichotomous Predictor
Therefore, for the odds ratio associated with
risk presence we have $OR = e^{2\beta_1}$

Taking the natural logarithm we have

$\ln(OR) = 2\beta_1$

thus twice the estimated regression
coefficient associated with a +1 / -1 coded
dichotomous predictor is the natural log of
the OR associated with risk presence!!!
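The effect of the coding on the odds ratio can be checked numerically. The sketch below computes the odds at each level of a dichotomous predictor and takes their ratio, once for 0/1 coding and once for +1/-1 coding. The coefficient values and function names are hypothetical and serve only as an illustration.

```python
import math

def odds(beta0, beta1, x):
    """Odds of disease, P/(1-P) = exp(beta0 + beta1*x)."""
    return math.exp(beta0 + beta1 * x)

def odds_ratio(beta0, beta1, present, absent):
    """Odds with risk present divided by odds with risk absent."""
    return odds(beta0, beta1, present) / odds(beta0, beta1, absent)

# Hypothetical estimates, for illustration only.
b0, b1 = -1.0, 0.5

or_01 = odds_ratio(b0, b1, present=1, absent=0)    # 0/1 coding
or_pm = odds_ratio(b0, b1, present=1, absent=-1)   # +1/-1 coding

print(round(or_01, 3), round(math.exp(b1), 3))      # the two values agree: e^{b1}
print(round(or_pm, 3), round(math.exp(2 * b1), 3))  # the two values agree: e^{2*b1}
```

The intercept cancels in the ratio, so only the slope and the distance between the two coded values (1 unit versus 2 units) matter.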

Example: Age at 1st Pregnancy and Cervical Cancer

Use Fit Model Y = Disease Status
X = Risk Factor Status
When the response Y is a
dichotomous categorical
variable, the Personality box
will automatically change to
Nominal Logistic, i.e. Logistic
Regression will be used.
Remember that when a
dichotomous categorical
predictor is used, JMP uses
+1/-1 coding. If you want,
you can code it as 0-1
and treat it as numeric.

Example: Age at 1st Pregnancy and Cervical Cancer

$\hat{\beta}_0 = 2.183$
$\hat{\beta}_1 = 0.607$

Thus the estimated odds ratio is

$\ln(OR) = 2\hat{\beta}_1 = 2(0.607) = 1.214$

$OR = e^{1.214} = 3.37$
Women whose first pregnancy is at or before age 25
have 3.37 times the odds of developing cervical cancer
compared with women whose 1st pregnancy occurs after age 25.

Example: Age at 1st Pregnancy and Cervical Cancer

$\hat{\beta}_0 = 2.183$, $\hat{\beta}_1 = 0.607$

[JMP output: the estimated Odds Ratio for disease associated with risk presence ("Risk Present").]

Example: Smoking and Low Birth Weight

Use Fit Model Y = Low Birth Weight (Low, Norm)
X = Smoking Status (Cig, NoCig)

$\hat{\beta}_1 = 0.335$

$OR = e^{2\hat{\beta}_1} = e^{0.670} = 1.954$

We estimate that women who smoke
during pregnancy have 1.95 times
the odds of having a child with low
birth weight compared with women who do not
smoke cigarettes during pregnancy.

Example: Smoking and Low Birth Weight

$\hat{\beta}_1 = 0.335$

$OR = e^{2\hat{\beta}_1} = e^{0.670} = 1.954$

Find a 95% CI for OR

1st: Find a 95% CI for $\beta_1$

$\hat{\beta}_1 \pm 1.96\,SE(\hat{\beta}_1) = 0.335 \pm 1.96(0.013) = 0.335 \pm 0.025 = (0.310, 0.360)$ = (LCL, UCL)

2nd: Compute the CI for OR = $(e^{2\,LCL}, e^{2\,UCL})$

$(e^{2(0.310)}, e^{2(0.360)}) = (1.86, 2.05)$

We estimate that the odds for having a low birth weight infant are between 1.86
and 2.05 times higher for smokers than non-smokers, with 95% confidence.
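The two-step interval calculation can be sketched in Python as follows. The estimate 0.335 and standard error 0.013 come from this slide; the function name and the `multiplier` parameter are hypothetical conveniences introduced here.

```python
import math

def or_confidence_interval(beta1, se, z=1.96, multiplier=2):
    """CI for the OR, obtained by exponentiating the CI for beta1.

    multiplier=2 matches +1/-1 coding (OR = e^{2*beta1});
    multiplier=1 would match 0/1 coding (OR = e^{beta1}).
    """
    lcl = beta1 - z * se   # lower confidence limit for beta1
    ucl = beta1 + z * se   # upper confidence limit for beta1
    return math.exp(multiplier * lcl), math.exp(multiplier * ucl)

low, high = or_confidence_interval(beta1=0.335, se=0.013)
print(round(low, 2), round(high, 2))
```

Because the code carries the unrounded margin 1.96(0.013) through the calculation, its limits may differ from the slide's hand-rounded values in the last reported digit.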

Example: Smoking and Low Birth Weight

We might want to adjust for other potential
confounding factors in our analysis of the
risk associated with smoking during
pregnancy. This is accomplished by simply
adding these covariates to our model.
Multiple Logistic Regression Model

$\ln\left(\dfrac{p}{1 - p}\right) = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \cdots + \beta_p X_p$

Before looking at some multiple logistic regression examples we need to
look at how continuous predictors and categorical variables with 3 or more levels
are handled in these models and how the associated ORs are calculated.

Example 2: Signs of CD and Age

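Fit Model Y = CD (CD if signs present, No otherwise)
X = Age (years)

$\ln\left(\dfrac{\hat{p}}{1 - \hat{p}}\right) = -5.31 + 0.111 \cdot Age$

$\text{Odds} = \dfrac{\hat{p}}{1 - \hat{p}} = e^{-5.31 + 0.111 \cdot Age}$

Consider the risk associated with a c year increase in age.

$\text{Odds Ratio (OR)} = \dfrac{\text{Odds for Age} = x + c}{\text{Odds for Age} = x} = \dfrac{e^{\beta_0 + \beta_1 (x + c)}}{e^{\beta_0 + \beta_1 x}} = e^{c\beta_1}$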

Example 2: Signs of CD and Age

For example, consider a 10 year increase in
age and find the associated OR for showing
signs of CD, i.e. c = 10
$OR = e^{c\hat{\beta}_1} = e^{10(0.111)} = 3.03$
Thus we estimate that the odds for exhibiting
signs of CD increase threefold for each 10
years of age. Similar calculations could be
done for other increments as well.
For example, for a c = 1 year increase
$OR = e^{\hat{\beta}_1} = e^{0.111} = 1.12$, or a 12% increase in
odds per year
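The odds ratio for any increment c follows the same formula, so a small helper makes it easy to try other increments. This is a sketch using the age coefficient 0.111 from the fitted model; the function name is a hypothetical choice.

```python
import math

BETA1_AGE = 0.111  # estimated age coefficient from the fitted model

def odds_ratio_for_increase(c, beta1=BETA1_AGE):
    """OR for a c-unit increase in a continuous predictor: e^{c*beta1}."""
    return math.exp(c * beta1)

for c in (1, 5, 10):
    print(c, round(odds_ratio_for_increase(c), 2))
```

The per-year odds ratio compounds multiplicatively: raising the 1-year OR to the 10th power reproduces the 10-year OR.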

Example 2: Signs of CD and Age

Can we assume that the increase in risk
associated with a c unit increase is
constant throughout one's life?
Is the increase going from 20 to 30 years of
age the same as going from 50 to 60
years?
If that assumption is not reasonable, then
one must be careful when discussing risk
associated with a continuous predictor.

Example 3: Race and Low Birth Weight

Race[Black] = +1 for race = black, 0 for race = other, -1 for race = white

Race[Other] = +1 for race = other, 0 for race = black, -1 for race = white

Calculate the odds for low birth weight for each race (Low, Norm)

White Infants (reference group, missing in parameters)
$e^{-2.198 + 0.410(-1) - 0.089(-1)} = e^{-2.198 - 0.410 + 0.089} = 0.0805$

Black Infants
$e^{-2.198 + 0.410(1) - 0.089(0)} = 0.167$

Other Infants
$e^{-2.198 + 0.410(0) - 0.089(1)} = 0.102$

OR for Blacks vs. Whites
= .167/.0805 = 2.075
OR for Others vs. Whites
= .102/.0805 = 1.267
OR for Blacks vs. Others
= .167/.102 = 1.637
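The odds and odds ratios for a three-level predictor can be reproduced with a short script. This is a sketch under stated assumptions: the signs of the three estimates are inferred from the odds the slide reports (0.0805, 0.167, 0.102), and the dictionary and function names are hypothetical.

```python
import math

# Estimates as read from the slide: intercept, Race[Black], Race[Other].
# Signs are inferred from the reported odds, not read directly.
B0, B_BLACK, B_OTHER = -2.198, 0.410, -0.089

# Effect coding: white is the level coded -1 on both indicator columns.
CODING = {
    "white": (-1, -1),
    "black": (1, 0),
    "other": (0, 1),
}

def odds_low_birth_weight(race):
    """Odds of low birth weight for the given race level."""
    black, other = CODING[race]
    return math.exp(B0 + B_BLACK * black + B_OTHER * other)

odds = {race: odds_low_birth_weight(race) for race in CODING}
print({race: round(value, 4) for race, value in odds.items()})
print("black vs white:", round(odds["black"] / odds["white"], 3))
print("other vs white:", round(odds["other"] / odds["white"], 3))
print("black vs other:", round(odds["black"] / odds["other"], 3))
```

Ratios computed from unrounded odds can differ slightly in the third decimal place from ratios of the rounded odds shown on the slide.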

Example 3: Race and Low Birth Weight

Finding these directly using the estimated
parameters is cumbersome. JMP will
compute the Odds Ratio for each possible
comparison and their reciprocals in case
those are of interest as well.
Odds Ratio column is odds for
Low for Level 1 vs. Level 2.
Reciprocal is odds for Low for
Level 2 vs. Level 1. These are
the easiest to interpret here as
they represent increased risk.

Putting it all together

Now that we have seen how to interpret
each of the variable types in a logistic
model we can consider multiple logistic
regression models with all these variable
types included in the model.
We can then look at risk associated with
certain factors adjusted for the other
covariates included in the model.

Example 3: Smoking and Low Birth Weight

Consider again the risk associated with smoking,
but this time adjusting for the potential confounding
effects of education level and age of the mother &
father, race of the child, total number of prior
pregnancies, number of children born alive that are
now dead, and gestational age of the infant.
Several terms are not
statistically significant, and
we could consider using
backwards elimination to
simplify the model.

Example 3: Race and Low Birth Weight

None of the mother- and father-related
covariates entered into the
final model.
Adjusting for the included
covariates, we find smoking is
statistically significant (p < .0001).

Adjusting for the included covariates, we find the
odds ratio for low birth weight associated with
smoking during pregnancy is 2.142.

Odds ratios for the other factors in the model can be computed
as well, all of which can be prefaced by the "adjusting for"
statement.

Summary
In logistic regression the response (Y) is a
dichotomous categorical variable.

The parameter estimates, once exponentiated, give the odds ratios
associated with the variables in the model.
These odds ratios are adjusted for the other
variables in the model.
One can also calculate P(Y|X) if that is of
interest, e.g. given the demographics of the
mother, what is the estimated probability of
her having a child with low birth weight?
