[Figure: Species proportions (setosa, versicolor, virginica) for All Rows]
RSquare   N     Number of Splits
0.000     150   0
All Rows
Count   G^2
150     329.58369

Level        Prob
setosa       0.3333
versicolor   0.3333
virginica    0.3333
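The G^2 value in the All Rows node can be checked by hand. The sketch below assumes G^2 is the usual likelihood-ratio (deviance) statistic, G^2 = -2 * sum(n_k * ln(p_k)), and that the three equal probabilities correspond to 50 rows per species; the function name g_squared is introduced here for illustration only.

```python
import math

def g_squared(counts):
    """Deviance of a node: -2 * sum(n_k * ln(n_k / n)), skipping empty classes."""
    n = sum(counts)
    return -2.0 * sum(c * math.log(c / n) for c in counts if c > 0)

# 150 rows split evenly across setosa, versicolor, virginica
# (assumed from Prob = 0.3333 for each level).
root_counts = [50, 50, 50]
root_g2 = g_squared(root_counts)
print(round(root_g2, 5))
```

Under these assumptions the result agrees with the 329.58369 reported for All Rows, which is simply 2 * 150 * ln(3).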
Candidates
Term           Candidate G^2    LogWorth
Sepal length   115.8732799      32.09622458
Sepal width     58.8743943      14.76067659
Petal length   190.9542505 *    57.33863312
Petal width    190.9542505      56.63837050
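The candidate G^2 for Petal length can be reproduced as the drop in G^2 from the parent node to its two children. This is a sketch under two assumptions: that the best Petal length split puts all 50 setosa rows on one side and the 50 versicolor plus 50 virginica rows on the other, and that the candidate statistic is the parent G^2 minus the summed child G^2. The LogWorth column is not reproduced here, since it depends on how JMP computes and adjusts the p-value.

```python
import math

def g_squared(counts):
    """Deviance of a node: -2 * sum(n_k * ln(n_k / n)), skipping empty classes."""
    n = sum(counts)
    return -2.0 * sum(c * math.log(c / n) for c in counts if c > 0)

# Class counts in the order (setosa, versicolor, virginica).
parent = [50, 50, 50]
left = [50, 0, 0]     # assumed: all setosa fall on the low Petal length side
right = [0, 50, 50]   # assumed: the other two species fall on the high side

# Reduction in G^2 achieved by the split.
candidate_g2 = g_squared(parent) - (g_squared(left) + g_squared(right))
print(round(candidate_g2, 7))
```

The pure setosa child contributes zero, so the reduction is 2 * 150 * ln(3) - 2 * 100 * ln(2), which matches the 190.9542505 listed for Petal length.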
[Figure: plot over Petal length (x-axis 1 to 7, y-axis 0 to 200)]
[Figure: scatter plot of Petal width vs. Petal length, colored by Species (setosa, versicolor, virginica), with the Setosa cluster labeled]
2016 Philip J. Ramsey, Ph.D. 12
Case Study – Fisher Iris Data
[Figure: scatter plot of Petal width vs. Petal length for All Rows, colored by Species, with the Setosa, Versicolor, and Virginica clusters labeled]
[Figure: the same scatter plot with Split 1, Split 2, and Split 3 marked]
The Leaf Report shows the counts and probabilities for each of the
four terminal nodes. Notice that the node Petal length < 3.0 was not
split further, since it already contains 100% setosa.
[Figure: Species proportions (setosa, versicolor, virginica) by Leaf Number, leaves 1 to 4]
The model predictions are nothing more than the estimated
proportions or probabilities for each of the classes in each of the
terminal nodes.
As an example, below is the prediction formula for versicolor.
[Figure: prediction formula for versicolor]
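As a rough illustration of what such a formula computes, the sketch below routes a row to one of four leaves and returns the class proportions in that leaf. Only the first rule (Petal length < 3.0 gives 100% setosa) comes from the text; the other two thresholds, the leaf counts for leaves 2 to 4, and all function names are hypothetical and are not the values from the JMP report.

```python
# Hypothetical leaf counts, for illustration only. Leaf 1 follows the text
# (pure setosa); the counts in leaves 2-4 are made up.
LEAF_COUNTS = {
    1: {"setosa": 50, "versicolor": 0, "virginica": 0},
    2: {"setosa": 0, "versicolor": 47, "virginica": 1},
    3: {"setosa": 0, "versicolor": 2, "virginica": 4},
    4: {"setosa": 0, "versicolor": 1, "virginica": 45},
}

def leaf_number(petal_length, petal_width):
    """Route a row to a leaf with nested if/else rules."""
    if petal_length < 3.0:      # rule taken from the text
        return 1
    if petal_width >= 1.8:      # hypothetical threshold
        return 4
    if petal_length < 5.0:      # hypothetical threshold
        return 2
    return 3

def class_probabilities(petal_length, petal_width):
    """Predicted probabilities are the class proportions in the leaf."""
    counts = LEAF_COUNTS[leaf_number(petal_length, petal_width)]
    total = sum(counts.values())
    return {species: n / total for species, n in counts.items()}

def prob_versicolor(petal_length, petal_width):
    return class_probabilities(petal_length, petal_width)["versicolor"]

print(prob_versicolor(1.4, 0.2))   # row lands in the pure setosa leaf
print(prob_versicolor(4.5, 1.5))   # row lands in hypothetical leaf 2
```

Every row that reaches the same leaf receives the same predicted probabilities, which is why a partition model's prediction formula reduces to a set of nested conditions ending in constants.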
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version
published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. Subject to disclaimers.
A portion of the Press Band data set. There are 540 records and 39 variables.