You are on page 1of 146

ID Age Av_Pages/Week Pages/Week

1 18-24 63.16 12
2 18-24 1.45 35
3 18-24 98.84 116
4 18-24 28.13 37
5 18-24 6.95 8
6 18-24 16.66 17
7 18-24 27.11 32
8 18-24 14.09 37
9 18-24 12.29 6
10 18-24 13.59 29
11 25-29 54.84 27
12 25-29 34.76 5
13 25-29 115.74 109
14 25-29 51.08 25
15 25-29 30.78 13
16 25-29 51.11 16
17 25-29 31.53 25
18 25-29 38.48 2
19 25-29 73.90 58
20 25-29 4.95 15
21 30-39 39.71 25
22 30-39 60.56 34
23 30-39 24.10 4
24 30-39 93.38 80
25 30-39 23.11 2
26 30-39 40.72 45
27 30-39 19.43 20
28 30-39 80.84 52
29 30-39 24.86 35
30 30-39 34.28 30
31 40-49 40.47 14
32 40-49 12.72 24
33 40-49 16.72 13
34 40-49 16.74 30
35 40-49 28.36 31
36 40-49 33.25 44
37 40-49 17.52 23
38 40-49 16.09 30
39 40-49 18.46 34
40 40-49 8.33 20
41 50-59 92.88 48
42 50-59 50.17 24
43 50-59 55.88 64
44 50-59 7.75 29
45 50-59 8.72 16
46 50-59 26.32 20
47 50-59 20.83 9
48 50-59 3.33 24
49 50-59 91.60 80
50 50-59 17.27 21
51 60+ 76.01 117
52 60+ 99.10 118
53 60+ 4.53 6
54 60+ 12.24 1
55 60+ 26.61 11
56 60+ 51.27 52
57 60+ 9.32 29
58 60+ 28.66 5
59 60+ 114.47 123
60 60+ 11.36 11
Renewed

0
1
1
1
1
0
0
1
1
0
1
1
1
1
0
1
1
0
1
0
1
1
1
1
0
1
0
1
1
1
0
0
0
0
0
1
0
0
0
0
1
1
1
1
0
1
1
0
1
1
1
1
1
0
0
1
1
0
1
0
XLSTAT 2021.3.1.12345 - Logistic regression - Start time: 22/09/2021 at 14:52:18 / End time: 22/09/2021 at 14:52:20
Response variable(s): Workbook = demoLOGbin-EN.xls / Sheet = Data / Range = 'Data'!$E:$E / 60 rows and 1 column
X / Quantitative: Workbook = demoLOGbin-EN.xls / Sheet = Data / Range = 'Data'!$C:$D / 60 rows and 2 columns
X / Qualitative: Workbook = demoLOGbin-EN.xls / Sheet = Data / Range = 'Data'!$B:$B / 60 rows and 1 column
Response variable(s): Binary
Model: Logit
Convergence: 0,000001
Iterations: 100
Confidence interval (%): 80
Tolerance: 0,001
Cutpoint: 0,5

Summary statistics (Quantitative data):


Obs.
Obs. with
Observati without Std.
Variable missing Minimum Maximum Mean
ons missing deviation
data
Av_Pages/ 60 0 data 60 1.450 115.740 37.124 30.074
Pages/Wee 60 0 60 1.000 123.000 33.700 30.620

Summary statistics (Qualitative data):

Categorie Frequenci
Variable Counts %
s es
Renewed 0 24 24 40.000
1 36 36 60.000
Age 18-24 10 10 16.667
25-29 10 10 16.667
30-39 10 10 16.667
40-49 10 10 16.667
50-59 10 10 16.667
60+ 10 10 16.667

Correlation matrix:

Av_Pages Pages/ Age-18- Age-25- Age-30- Age-40- Age-50-


Age-60+
/Week Week 24 29 39 49 59
Av_Pages/ 1 0.798 -0.133 0.174 0.105 -0.244 0.005 0.093
Pages/Wee 0.798 1 -0.012 -0.062 -0.015 -0.109 -0.003 0.200
Age-18-24 -0.133 -0.012 1 -0.200 -0.200 -0.200 -0.200 -0.200
Age-25-29 0.174 -0.062 -0.200 1 -0.200 -0.200 -0.200 -0.200
Age-30-39 0.105 -0.015 -0.200 -0.200 1 -0.200 -0.200 -0.200
Age-40-49 -0.244 -0.109 -0.200 -0.200 -0.200 1 -0.200 -0.200
Age-50-59 0.005 -0.003 -0.200 -0.200 -0.200 -0.200 1 -0.200
Age-60+ 0.093 0.200 -0.200 -0.200 -0.200 -0.200 -0.200 1
Renewed 0.442 0.426 0.000 0.091 0.183 -0.456 0.183 0.000

Regression of variable Renewed (Control category = 0):

Goodness of fit statistics (Variable Renewed):

Independ
Statistic Full
ent
Observatio 60 60
Sum of wei 60.000 60.000
DF 59 53
-2 Log(Like 80.761 46.943
R²(McFadd 0.000 0.419
R²(Cox and 0.000 0.431
R²(Nagelke 0.000 0.551
AIC 82.761 62.943
SBC 84.856 79.698
Iterations 0 6

Test of the null hypothesis H0: Pr(Renewed=1)=0,6:

Chi-
Statistic DF Pr > Chi²
square
-2 Log(Like 7 33.819 <0,0001
Score 7 23.816 0.001
Wald 7 12.623 0.082

Type II analysis (Variable Renewed):

Chi- Chi-
Source DF square Pr > Wald square Pr > LR
(Wald) (LR)
Av_Pages/ 1 0.826 0.363 0.862 0.353
Pages/Wee 1 6.274 0.012 8.424 0.004
Age 5 8.757 0.119 14.261 0.014

Hosmer-Lemeshow test (Variable Renewed):

Chi-
Statistic DF Pr > Chi²
square
Hosmer-Lem 7.743 8 0.459

Model parameters (Variable Renewed):


Odds
Wald Wald
ratio
Standard Wald Chi- Lower Upper Odds
Source Value Pr > Chi² Lower
error Square bound bound ratio
bound
Intercept -2.357 1.246 3.579 0.059 (80%)
-3.953 (80%)
-0.760 (80%)
Av_Pages/ 0.023 0.026 0.826 0.363 -0.010 0.057 1.024 0.990
Pages/Wee 0.089 0.036 6.274 0.012 0.044 0.135 1.093 1.045
Age-18-24 0.000 0.000
Age-25-29 0.688 1.206 0.325 0.569 -0.858 2.234 1.989 0.424
Age-30-39 0.963 1.276 0.570 0.450 -0.672 2.598 2.620 0.511
Age-40-49 -2.983 1.377 4.690 0.030 -4.748 -1.218 0.051 0.009
Age-50-59 1.086 1.168 0.864 0.352 -0.411 2.582 2.961 0.663
Age-60+ 0.309 1.264 0.060 0.807 -1.311 1.929 1.362 0.269

Equation of the model (Variable Renewed):

Pr(Renewed=1) = 1 / (1 + exp(-(-2,356668+0,023500*Av_Pages/Week+0,089277*Pages/Week+0,687808*Age-25-29+0,

Standardized coefficients (Variable Renewed):


Wald Wald
Standard Wald Chi- Lower Upper
Source Value Pr > Chi²
error Square bound bound
Av_Pages/ 0.386 0.425 0.826 0.363 (80%)
-0.158 (80%)
0.931
Pages/Wee 1.495 0.597 6.274 0.012 0.730 2.259
Age-18-24 0.000 0.000
Age-25-29 0.141 0.248 0.325 0.569 -0.176 0.459
Age-30-39 0.198 0.262 0.570 0.450 -0.138 0.534
Age-40-49 -0.613 0.283 4.690 0.030 -0.976 -0.250
Age-50-59 0.223 0.240 0.864 0.352 -0.084 0.530
Age-60+ 0.063 0.260 0.060 0.807 -0.269 0.396
Standardized coefficients

Renewed / Standardized coefficients


(80% conf. interval)
2

Pages/Week
1.5

0.5 Av_Pages/Week
Age-30-39 Age-50-59
Age-25-29
Age-60+
Age-18-24
0

-0.5
Age-40-49
Variable
-1

Predictions and residuals (Variable Renewed):


Observati Pred(Ren Significan Significan
Renewed Pr(0) Pr(1)
on ewed) t change t
Obs1 0 1 0.450 0.550 No No
Obs2 1 1 0.310 0.690 No
Obs3 1 1 0.000 1.000 Yes
Obs4 1 1 0.167 0.833 Yes
Obs5 1 0 0.814 0.186 No No
Obs6 0 0 0.610 0.390 No
Obs7 0 1 0.243 0.757 Yes Yes
Obs8 1 1 0.218 0.782 Yes
Obs9 1 0 0.822 0.178 No No
Obs10 0 1 0.365 0.635 No No
Obs11 1 1 0.116 0.884 Yes
Obs12 1 0 0.600 0.400 No No
Obs13 1 1 0.000 1.000 Yes
Obs14 1 1 0.146 0.854 Yes
Obs15 0 1 0.446 0.554 No No
Obs16 1 1 0.277 0.723 No
Obs17 1 1 0.213 0.787 Yes
Obs18 0 0 0.642 0.358 No
Obs19 1 1 0.005 0.995 Yes
Obs20 0 0 0.553 0.447 No
Obs21 1 1 0.145 0.855 Yes
Obs22 1 1 0.045 0.955 Yes
Obs23 1 0 0.615 0.385 No No
Obs24 1 1 0.000 1.000 Yes
Obs25 0 0 0.662 0.338 No
Obs26 1 1 0.027 0.973 Yes
Obs27 0 1 0.300 0.700 No No
Obs28 1 1 0.006 0.994 Yes
Obs29 1 1 0.090 0.910 Yes
Obs30 1 1 0.110 0.890 Yes
Obs31 0 0 0.958 0.042 No
Obs32 0 0 0.948 0.052 No
Obs33 0 0 0.978 0.022 No
Obs34 0 0 0.906 0.094 No
Obs35 0 0 0.870 0.130 No
Obs36 1 0 0.652 0.348 No No
Obs37 0 0 0.947 0.053 No
Obs38 0 0 0.907 0.093 No
Obs39 0 0 0.866 0.134 No
Obs40 0 0 0.966 0.034 No
Obs41 1 1 0.006 0.994 Yes
Obs42 1 1 0.114 0.886 Yes
Obs43 1 1 0.003 0.997 Yes
Obs44 1 1 0.182 0.818 Yes
Obs45 0 1 0.410 0.590 No No
Obs46 1 1 0.244 0.756 No
Obs47 1 1 0.495 0.505 No
Obs48 0 1 0.279 0.721 No No
Obs49 1 1 0.000 1.000 Yes
Obs50 1 1 0.267 0.733 No
Obs51 1 1 0.000 1.000 Yes
Obs52 1 1 0.000 1.000 Yes
Obs53 1 0 0.803 0.197 No No
Obs54 0 0 0.842 0.158 No
Obs55 0 0 0.608 0.392 No
Obs56 1 1 0.022 0.978 Yes
Obs57 1 1 0.319 0.681 No
Obs58 0 0 0.717 0.283 No
Obs59 1 1 0.000 1.000 Yes
Obs60 0 0 0.690 0.310 No

Probabilities
1

0.9

0.8

0.7
Pr(1)

0.6

0.5

0.4

0.3

0.2

0.1

Observations
0

Classification table for the training sample (Variable Renewed):

from \ to 0 1 Total % correct


Specificity
0 17 7 24 70.83%
Sensitivity
1 6 30 36 83.33%
% correct
Total 23 37 60 78.33%

Confusion plot
From
From

To

Significance analysis / Classification table for the training sample (Variable Renewed):

% %
from \ to 0 1 Uncertain Total % correct
uncertain incorrect
Specificity
0 0 1 23 24 0.00% 95.83% 4.17%
Sensitivity
1 0 24 12 36 66.67% 33.33% 0.00%
% correct
Total 0 25 35 60 40.00% 58.33% 0.00%

Goodness of Classification Index (GCI):

Statistic Value
% correct 40.00%
% uncertai 58.33%
% incorrect 1.67%
GCI 67.50%

ROC Curve (Variable Renewed):

ROC Curve (AUC=0,899)


1

0.9

0.8
Sensitivity

0.7

0.6

0.5

0.4

0.3

0.2

0.1

0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
1 - Specificity
0.3

0.2

0.1

0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
1 - Specificity

Area under the curve: 0.899


me: 22/09/2021 at 14:52:20 / Microsoft Excel 16.014326
E:$E / 60 rows and 1 column
/ 60 rows and 2 columns
60 rows and 1 column

Renewed
0.442
0.426
0.000
0.091
0.183
-0.456
0.183
0.000
1

Odds
ratio
Upper
bound
(80%)
1.058
1.144

9.334
13.439
0.296
13.221
6.885

Week+0,687808*Age-25-29+0,963235*Age-30-39-2,982738*Age-40-49+1,085529*Age-50-59+0,308923*Age-60+)))

You might also like