Binary Logistic Regression Schueppert 2009 PDF
Anja Schüppert
a.schueppert@rug.nl
Linear regression:
Univariate
One independent variable, one (continuous) dependent variable.
X1 predicts Y.
Linear regression:
Multivariate
Several independent variables, one (continuous) dependent variable.
X1, ..., Xn predict Y.
Assumption
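$e^{\mathrm{logit}(p)} = \frac{p}{1-p}$
$e^{\mathrm{logit}(p)}\,(1-p) = p = e^{\mathrm{logit}(p)} - p\,e^{\mathrm{logit}(p)}$
$p + p\,e^{\mathrm{logit}(p)} = e^{\mathrm{logit}(p)}$
$p\,(1 + e^{\mathrm{logit}(p)}) = e^{\mathrm{logit}(p)}$
$p = \frac{1}{1 + e^{-\mathrm{logit}(p)}}$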
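The derivation can be checked numerically. The following is a minimal sketch in Python; the function names `logit` and `inv_logit` are illustrative and do not come from the slides.

```python
import math

def logit(p):
    """Log-odds of a probability p, with 0 < p < 1."""
    return math.log(p / (1.0 - p))

def inv_logit(z):
    """Inverse of logit: p = 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))

# Round trip: applying inv_logit to logit(p) should give p back.
for p in (0.1, 0.5, 0.9):
    z = logit(p)
    print(p, round(z, 4), round(inv_logit(z), 4))
```

A probability of 0.5 corresponds to a logit of 0; probabilities above 0.5 give positive logits, probabilities below 0.5 give negative ones.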
Binary logistic regression:
Univariate
One independent variable, one categorical dependent variable.
$P(Y) = \frac{1}{1 + e^{-(b_0 + b_1 x_1)}}$
P: probability of Y occurring
e: base of the natural logarithm (≈ 2.7182818284)
b0: intercept (on the y-axis)
b1: gradient (slope) of the line
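To illustrate how the univariate model turns a predictor value into a probability, here is a small Python sketch. The coefficient values `b0 = 2.0` and `b1 = -0.04` are made up for illustration and are not estimates from the study.

```python
import math

def predict_probability(x1, b0, b1):
    """P(Y) = 1 / (1 + e^-(b0 + b1*x1)) for a single predictor x1."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * x1)))

# Hypothetical coefficients (assumption, not taken from the slides).
b0, b1 = 2.0, -0.04

# With a negative b1, the predicted probability falls as x1 grows.
for x1 in (0, 50, 100):
    print(x1, round(predict_probability(x1, b0, b1), 3))
```

With these values the predicted probability is exactly 0.5 where b0 + b1·x1 = 0, which is at x1 = 50.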
http://data.princeton.edu/wws509/notes/c3s1.html
Binary logistic regression:
Multivariate
Several independent variables, one categorical dependent variable.
P: probability of Y occurring
e: base of the natural logarithm
b0: intercept (on the y-axis)
b1: gradient (slope) of the line
bn: regression coefficient of Xn
X1: predictor variable
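The symbols listed above fit the standard multivariate form of the logistic model. The equation below is a reconstruction from those definitions and from the univariate case, on the assumption that the slide followed the usual textbook notation:

```latex
P(Y) = \frac{1}{1 + e^{-(b_0 + b_1 X_1 + b_2 X_2 + \dots + b_n X_n)}}
```

With n = 1 this reduces to the univariate model.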
50 x 12 = 600 scores
Independent variables: Examples
Phonetic distance: måne/måne, hund/hund, apa/abe
[Figure: Swedish (Sw.) vs. Danish (Da.), scale 0% / 50% / 100%]
Swedish: bäbis vs. äpple; Danish: baby vs. æble
Difficult sounds: Danish sounds that have been shown to be significantly more
difficult to decode for Swedes (Schüppert & Gooskens, in prep.): […]
Data
Data Entry
Data Entry: Block 1
Data Entry: Block 2
Output: Block 1 (Phonetic distance)
Improvement through added variable phonetic distance.
Improvement is significant: predictor phonetic distance contributes to the model.
-2LL: amount of unexplained variance.
$R^2_{CS} = 0.12$, $R^2_N = 0.17$
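The two R² values reported with this output look like the Cox & Snell and Nagelkerke pseudo-R² measures that SPSS prints; that reading of the subscripts CS and N is an assumption. Both can be computed from the -2LL of the baseline (intercept-only) model and of the fitted model. The sketch below uses made-up -2LL values, since the baseline -2LL is not given in the slides.

```python
import math

def cox_snell_r2(dev_null, dev_model, n):
    """Cox & Snell R^2 from the -2LL of the baseline and fitted models."""
    return 1.0 - math.exp(-(dev_null - dev_model) / n)

def nagelkerke_r2(dev_null, dev_model, n):
    """Nagelkerke R^2: Cox & Snell R^2 rescaled so its maximum is 1."""
    max_r2 = 1.0 - math.exp(-dev_null / n)
    return cox_snell_r2(dev_null, dev_model, n) / max_r2

# Hypothetical values (assumption): baseline -2LL, model -2LL, sample size.
dev_null, dev_model, n = 700.0, 640.0, 600

print(round(cox_snell_r2(dev_null, dev_model, n), 3))
print(round(nagelkerke_r2(dev_null, dev_model, n), 3))
```

Nagelkerke's value is always at least as large as Cox & Snell's, which matches the pattern of the two reported figures.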
Output: Block 1 (Phonetic distance)
Exp(B) < 1: indicates that phonetic distance correlates negatively with intelligibility.
Model predicts the correct value in 75% of the cases.
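Exp(B) is the factor by which the odds of the outcome change when the predictor increases by one unit, so a negative coefficient gives Exp(B) < 1. The Python sketch below shows this with hypothetical coefficients (`b0 = 2.0`, `b1 = -0.04`); they are not the values from the SPSS output.

```python
import math

def odds(p):
    """Odds corresponding to a probability p."""
    return p / (1.0 - p)

def predict_probability(x1, b0, b1):
    """P(Y) = 1 / (1 + e^-(b0 + b1*x1))."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * x1)))

# Hypothetical coefficients (assumption, not from the slides).
b0, b1 = 2.0, -0.04

exp_b = math.exp(b1)  # what SPSS reports as Exp(B)

# Ratio of the odds at x1 = 11 to the odds at x1 = 10 (one-unit increase).
ratio = odds(predict_probability(11, b0, b1)) / odds(predict_probability(10, b0, b1))

print(round(exp_b, 4), round(ratio, 4))
```

The two printed numbers agree: each additional unit of the predictor multiplies the odds by Exp(B), here a value below 1.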
Model has further improved through added variable(s).
Significant value: indicates that one or both of the new predictors improve the new model.
-2LL: amount of unexplained variance is reduced from 513.09 to 498.34.
$R^2_{CS} = 0.14$, $R^2_N = 0.21$ (Block 1: $R^2_{CS} = 0.12$, $R^2_N = 0.17$)
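The drop in -2LL between the two blocks can be tested against a chi-square distribution. The sketch below uses the two -2LL values reported here and assumes 2 degrees of freedom, on the assumption that each of the two added predictors enters the model as a single term. For 2 degrees of freedom the upper-tail chi-square probability has the closed form e^(-x/2), so no statistics library is needed.

```python
import math

# -2LL values as reported for Block 1 and Block 2.
dev_block1 = 513.09
dev_block2 = 498.34

# Change in -2LL when the Block 2 predictors are added.
chi2_change = dev_block1 - dev_block2

# Assumption: two added predictors, one degree of freedom each.
df = 2

# Upper-tail probability of a chi-square variable with df = 2: exp(-x / 2).
p_value = math.exp(-chi2_change / 2.0)

print(round(chi2_change, 2), df, p_value)
```

Under these assumptions the improvement of 14.75 is significant at p < .001.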
Output: Block 2
(Phonetic distance, Toneme, Difficult Sounds)
Significant value indicates that variable difficult sounds improves the model. Exp(B) < 1 indicates a negative correlation.
Non-significant value indicates that variable toneme does not improve the model.
Together, phonetic distance and the number of difficult sounds account for 14%
to 21% of the variance.
Cohen, J., P. Cohen, S.G. West, L.S. Aiken. 2003. Applied Multiple
Regression/Correlation Analysis for the Behavioral Sciences. London:
Lawrence Erlbaum.
Field, A. 2005. Discovering Statistics Using SPSS. London: Sage.
Rietveld, T., R. van Hout. 1993. Statistical Techniques for the Study of
Language and Language Behaviour. Berlin: Mouton de Gruyter.
Schüppert, A. & Gooskens, Ch. In prep. Easy sounds, difficult sounds.
Evidence from a word comprehension task with Swedish children listening
to Danish.
Sieben, I. Logistische regressie analyse: een handleiding.
http://scm.ou.nl/Manual/logisticregreskun.html
Tabachnick, B.G., L.S. Fidell. 2007. Using Multivariate Statistics. Boston:
Pearson.