Professional Documents
Culture Documents
Hypo Eval Notes
Hypo Eval Notes
[Read Ch. 5]
[Recommended exercises: 5.2, 5.3, 5.4]
Sample error, true error
Condence intervals for observed hypothesis
error
Estimators
Binomial distribution, Normal distribution,
Central Limit Theorem
Paired t tests
Comparing learning methods
74 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Two Denitions of Error
75 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Problems Estimating Error
76 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Example
77 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Estimators
Experiment:
1. choose sample S of size n according to
distribution D
2. measure errorS (h)
errorS (h) is a random variable (i.e., result of an
experiment)
errorS (h) is an unbiased estimator for errorD (h)
Given observed errorS (h) what can we conclude
about errorD (h)?
78 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Condence Intervals
If
S contains n examples, drawn independently of
h and each other
n 30
Then
With approximately 95% probability, errorD(h)
lies in interval vuu
uut errorS (h)(1 ? errorS (h))
errorS (h) 1:96 n
79 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Condence Intervals
If
S contains n examples, drawn independently of
h and each other
n 30
Then
With approximately N% probability, errorD(h)
lies in interval v
uuu error (h)(1 ? error (h))
errorS (h) zN ut S
n
S
where
N %: 50% 68% 80% 90% 95% 98% 99%
zN : 0.67 1.00 1.28 1.64 1.96 2.33 2.58
80 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
errorS (h) is a Random Variable
0.06
0.04
0.02
0
0 5 10 15 20 25 30 35 40
n!
P (r) = r!(n ? r)! errorD (h)r (1 ? errorD (h))n?r
81 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Binomial Probability Distribution
0.06
0.04
0.02
0
0 5 10 15 20 25 30 35 40
82 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Normal Distribution Approximates Bino-
mial
83 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Normal Probability Distribution
p(x) = p 1 2 e? ( x? )
-3 -2 -1 0 1 2 3
1 2
2
2
The probability that X will fall into the interval
(a; b) is given by Zb
a p(x)dx
Expected, or mean value of X , E [X ], is
E [X ] =
Variance of X is
V ar(X ) = 2
Standard deviation of X , X , is
X =
84 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Normal Probability Distribution
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
-3 -2 -1 0 1 2 3
85 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Condence Intervals, More Correctly
If
S contains n examples, drawn independently of
h and each other
n 30
Then
With approximately 95% probability, errorS (h)
lies in interval vuu
uut errorD (h)(1 ? errorD (h))
errorD (h) 1:96 n
equivalently, errorD (h) lies in interval
vuu
uut errorD (h)(1 ? errorD (h))
errorS (h) 1:96 n
which is approximately
vuu
uut errorS (h)(1 ? errorS (h))
errorS (h) 1:96 n
86 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Central Limit Theorem
87 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Calculating Condence Intervals
88 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Dierence Between Hypotheses
89 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Paired t test to compare hA,hB
90 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Comparing learning algorithms LA and LB
91 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Comparing learning algorithms LA and LB
92 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Comparing learning algorithms LA and LB
instead of
ESD [errorD (LA(S )) ? errorD (LB (S ))]
but even this approximation is better than no
comparison
93 lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997