Professional Documents
Culture Documents
Lift Chart (Training Dataset) : A) Answer
Lift Chart (Training Dataset) : A) Answer
5
0 20 0 0 0 0
0 22 0.125 0 0 0
0 24 0.25 0 0 0
0 26 0.375 0 0 0
0 28 0.5 0 1 1
0 28.6 0.5375 0 1 1
0 27 0.4375 0 1 0
1 27.4 0.4625 1 1 0
1 28 0.5 1 1 1
1 28.4 0.525 1 1 1
1 29 0.5625 1 1 1
1 30 0.625 1 1 1
1 32 0.75 1 1 1
1 34 0.875 1 1 1
1 36 1 1 1 1
c) Answer
d) Answer
When Pcut value = 0.5, these two methods show the same Sensitivity & Specificity. However wh
Therefore d) rule would be better.
Pcut = 0.6 B) Answers
0 Pcut value = 0.4
0 Confusion Matrix Predicted Class Accuracy 0.80
0 Actual class 1 0 Sensitivity 1.00
0 1 8 0 Specificity 0.57
0 0 3 4
0
0 Pcut value = 0.5
0 Confusion Matrix Predicted Class Accuracy 0.80
0 Actual class 1 0 Sensitivity 0.88
0 1 7 1 Specificity 0.71
0 0 2 5
1
1 Pcut value = 0.6
1 Confusion Matrix Predicted Class Accuracy 0.73
1 Actual class 1 0 Sensitivity 0.50
1 4 4 Specificity 1.00
0 0 7
ty & Specificity. However when we chose other Pcut value, D's pi shows better resaults.
XLMiner : Multiple Linear Regression
Output Navigator
Inputs Train. Score - Summary Valid. Score - Summary Test Score - Summary Database Score
Elapsed Time Train. Score - Detailed Rep. Valid. Score - Detailed Rep. Test Score - Detailed Rep. New Score - Detailed Rep.
ANOVA Training Lift Charts Validation Lift Charts Test Lift Charts Subset selection
Inputs
Data
Training data used for building the model ['2007007723_이승훈_Data Mining_HW#3.xlsx']'data'!
$A$34:$D$48
# Records in the training data 15
Variables
# Input Variables 1
Input variables Pi
Output variable Y
Constant term present Yes
Total sum of
RMS Error Average Error
squared errors
Elapsed Time
Database Score
Subset selection
Residual df 13
R-squared 0.5217402941
Std. Dev. estimate 0.37060273
Residual SS 1.78550291
$A$33:$D$48
XLMiner : Multiple Linear Regression - Lift chart for training data
Back to Navigator
3 4 5 6 7 8 9 10
Deciles
Min. Max.
1 1
1 1
1 1
1 1
1 1
0 0
1 1
0 0
1 1
0 1
Serial no. in training data in training data edicted values
1 1.1224310107 1 1
2 1.0423125458 1 2
3 0.947277245 1 3
4 0.8234248295 1 4
5 0.7358479475 1 5
6 0.6881910802 0 5
7 0.6581382797 1 6
8 0.52441865 0 6
9 0.52441865 1 7
10 0.3606462198 1 8
11 0.3129893525 0 8
12 0.2254124705 0 8
13 0.101560055 0 8
14 0.0065247542 0 8
15 -0.073593711 0 8
using average Deciles / Global mean
0.5333333333 1 1.875
1.0666666667 2 1.875
1.6 3 1.875
2.1333333333 4 1.875
2.6666666667 5 1.875
3.2 6 0
3.7333333333 7 1.875
4.2666666667 8 0
4.8 9 1.875
5.3333333333 10 0.3125
5.8666666667
6.4
6.9333333333
7.4666666667
8
XLMiner : Multiple Linear Regression
DataSource
WorkBook Path D:\2013 2학기 수업\Data Mining\HW3
WorkBook Name 2007007723_이승훈_Data Mining_HW#3.xlsx
Training Range [data]!$A$34:$D$48
#Training Rows 15
#Variables in Data set 4
#Selected Variables 2
Data Dictionary
Variables in Data Set Y Xun X normal Pi
Variable Type* Continuous Continuous Continuous Continuous
Variable Data Type Number Number Number Number
Mining Schema
Selected Variables Pi Y
Variable Type Input Output
Inputs Normalised No
Constant term present Yes
Model
*This is an indication of how XLMiner stores this variable for later retrieval; it does not necessarily reflect what type of variable was originall
what type of variable was originally input.
XLMiner : Multiple Linear Regression
Output Navigator
Inputs Train. Score - Summary Valid. Score - Summary Test Score - Summary Database Score
Elapsed Time Train. Score - Detailed Rep. Valid. Score - Detailed Rep. Test Score - Detailed Rep. New Score - Detailed Rep.
ANOVA Training Lift Charts Validation Lift Charts Test Lift Charts Subset selection
Inputs
Data
Training data used for building the model ['2007007723_이승훈_Data Mining_HW#3.xlsx']'data'!
$A$2:$D$16
# Records in the training data 15
Variables
# Input Variables 1
Output variable Y
Constant term present Yes
Total sum of
RMS Error Average Error
squared errors
Elapsed Time
Database Score
Subset selection
Residual df 13
R-squared 0.4674149604
Std. Dev. estimate 0.39108503
Residual SS 1.98831749
$A$1:$D$16
XLMiner : Multiple Linear Regression - Lift chart for training data
Back to Navigator
3 4 5 6 7 8 9 10
Deciles
Min. Max.
1 1
1 1
1 1
1 1
1 1
0 0
1 1
0 0
1 1
0 1
Serial no. in training data in training data edicted values
1 1.20918791 1 1
2 1.0396591725 1 2
3 0.870130435 1 3
4 0.7006016975 1 4
5 0.6158373288 1 5
6 0.5819315812 0 5
7 0.5649787075 1 6
8 0.53107296 0 6
9 0.53107296 1 7
10 0.4802143388 1 8
11 0.4463085912 0 8
12 0.3615442225 0 8
13 0.192015485 0 8
14 0.0224867475 0 8
15 -0.14704199 0 8
using average Deciles / Global mean
0.5333333333 1 1.875
1.0666666667 2 1.875
1.6 3 1.875
2.1333333333 4 1.875
2.6666666667 5 1.875
3.2 6 0
3.7333333333 7 1.875
4.2666666667 8 0
4.8 9 1.875
5.3333333333 10 0.3125
5.8666666667
6.4
6.9333333333
7.4666666667
8
XLMiner : Multiple Linear Regression
DataSource
WorkBook Path D:\2013 2학기 수업\Data Mining\HW3
WorkBook Name 2007007723_이승훈_Data Mining_HW#3.xlsx
Training Range [data]!$A$2:$D$16
#Training Rows 15
#Variables in Data set 4
#Selected Variables 2
Data Dictionary
Variables in Data Set Y Xun a) answer X normal
Variable Type* Continuous Continuous Categorical Continuous
Variable Data Type Number Number String Number
Mining Schema
Selected Variables X normal Y
Variable Type Input Output
Inputs Normalised No
Constant term present Yes
Model
*This is an indication of how XLMiner stores this variable for later retrieval; it does not necessarily reflect what type of variable was originall
what type of variable was originally input.