Professional Documents
Culture Documents
j 1 j
N=100, e=20
1- α
Normal Approximation to the Binomial
1- α
t Test
• Multiple training/validation sets
• xti = 1 if instance t misclassified on fold i
i
N t
• Error rate of fold i: x
pi t 1
N
• With m and s2 average and var of pi , we accept p0 or less error if
K m p0
~ t K 1
S
is less than tα,K-1
Comparing Classifiers:
H0: μ0 = μ1 vs. H1: μ0 ≠ μ1
• Single training/validation set: McNemar’s Test
~ X 12
e01 e10
Accept if < X2α,1
K-Fold CV Paired t Test
• Use K-fold cv to get K training/validation folds
• pi1, pi2: Errors of classifiers 1 and 2 on fold i
• pi = pi1 – pi2 : Paired difference on fold i
• The null hypothesis is whether pi has mean 0
H0 : 0 vs. H0 : 0
i 1 pi
K K
i 1 i
p m 2
m s2
K K 1
K m 0 K m
~ t K 1 Accept if in t / 2 ,K 1 , t / 2 ,K 1
s s
5×2 cv Paired t Test
• Use 5×2 cv to get 2 folds of 5 tra/val replications
(Dietterich, 1998)
• pi(j) : difference btw errors of 1 and 2 on fold j=1, 2 of
replication i=1,...,5
pi pi pi / 2 s pi pi pi pi
1 2 2 1 2 2 2
i
p11
~ t5
i 1 i / 5
5 2
s
Two-sided test: Accept H0: μ0 = μ1 if in (-tα/2,5,tα/2,5)
One-sided test: Accept H0: μ0 ≤ μ1 if < tα,5
5×2 cv Paired F Test
p j 2
5 2
i 1 j 1 i
~ F10,5
2 s
5 2
i 1 i
ˆ 2 K
L m j m 2
j 1 L 1
m m 2
~ X L21 SSb K m j m 2
j 2 /K
j
K
X m j 2 L S 2
X mj
2
ˆ
2 ij 2 j ij
S i 1
LK 1
j
K 1 j 1 L j i
SSw X ij m j 2
j i
S 2j SSw
K 1 2 ~ X 2
2
~
K 1 X 2
L K 1
SSb / 2 SSw / 2 SSb / L 1
/ ~ FL1,LK 1
L 1 LK 1 SSw / LK 1
H0 : 1 2 L if F ,L1,LK 1
ANOVA table