05/10/16
Lab 3
Music Genre Classification
High Dimensional dataset (>3000 features)
Lab 6
One pager
Problem statement
Source of data
Any reference material
Outline
Measures for evaluation
Experimental design
Estimating the generalized performance
Hypothesis testing
Interval estimation
Confidence intervals
True class    Predicted positive     Predicted negative     Total
positive      True positive (tp)     False negative (fn)    p
negative      False positive (fp)    True negative (tn)     n
Total         p'                     n'                     N
Performance Measures
Error: (fp + fn) / N
Accuracy: (tp + tn) / N
tp-rate: tp / p
fp-rate: fp / n
Precision: tp / p'
Recall: tp / p
Sensitivity: tp / p
Specificity: tn / n
F Measure: 2 x precision x recall / (precision + recall)
Design and Analysis of
Experiments
CSL465/603 - Machine Learning
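The measures on the Performance Measures slide all follow from the four confusion-matrix counts. The sketch below is one way to compute them; the function name, the dictionary keys, and the example counts are hypothetical, and no guard against zero denominators is included.

```python
def classification_measures(tp, fn, fp, tn):
    """Compute the listed measures from the four confusion-matrix counts."""
    p = tp + fn          # number of truly positive instances
    n = fp + tn          # number of truly negative instances
    p_pred = tp + fp     # number of instances predicted positive (p')
    total = p + n        # N, the total number of instances
    precision = tp / p_pred
    recall = tp / p
    return {
        "error": (fp + fn) / total,
        "accuracy": (tp + tn) / total,
        "tp_rate": tp / p,
        "fp_rate": fp / n,
        "precision": precision,
        "recall": recall,
        "sensitivity": tp / p,
        "specificity": tn / n,
        "f_measure": 2 * precision * recall / (precision + recall),
    }


# Hypothetical counts, chosen only for illustration.
measures = classification_measures(tp=40, fn=10, fp=5, tn=45)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

Recall, sensitivity, and tp-rate are the same ratio under three names, which is why the sketch computes them from the same expression.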
Example (1)
0.5  0.9  0.9  0.7  0.6  0.5  0.4  0.3  0.2  0.1
Example (2)
0.5  0.9  0.9  0.7  0.6  0.2  0.6  0.3  0.2  0.1
Stratification
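Sample mean: $m = \frac{1}{N}\sum_{t=1}^{N} x^t$, where $m \sim \mathcal{N}(\mu, \sigma^2/N)$
Define statistic with a unit normal distribution
$\sqrt{N}\,(m - \mu)/\sigma \sim \mathcal{Z}$, where $\mathcal{Z} = \mathcal{N}(0, 1)$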
Therefore,
$P\{-1.96 < \sqrt{N}\,(m - \mu)/\sigma < 1.96\} = 0.95$
Two-sided confidence interval
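$P\{-1.96 < \sqrt{N}\,(m - \mu)/\sigma < 1.96\} = 0.95$
$P\{-1.96\,\sigma/\sqrt{N} < m - \mu < 1.96\,\sigma/\sqrt{N}\} = 0.95$
$P\{m - 1.96\,\sigma/\sqrt{N} < \mu < m + 1.96\,\sigma/\sqrt{N}\} = 0.95$
$P\{m - z_{\alpha/2}\,\sigma/\sqrt{N} < \mu < m + z_{\alpha/2}\,\sigma/\sqrt{N}\} = 1 - \alpha$
$z_{\alpha/2}$    $1 - \alpha$
2.58      0.99
2.33      0.98
1.96      0.95
1.64      0.90
1.28      0.80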
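A two-sided interval with known standard deviation can be computed with the standard library alone. This is a minimal sketch; the function name `z_interval` and the example values for the mean, standard deviation, and sample size are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_interval(m, sigma, n, alpha=0.05):
    """Two-sided (1 - alpha) interval for the mean when sigma is known."""
    z = NormalDist().inv_cdf(1 - alpha / 2)   # z_{alpha/2}
    half_width = z * sigma / sqrt(n)
    return m - half_width, m + half_width


# Critical values z_{alpha/2} for a few confidence levels.
for level in (0.99, 0.98, 0.95, 0.90, 0.80):
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)
    print(f"1 - alpha = {level:.2f}  z = {z:.2f}")

# Hypothetical sample summary, for illustration only.
low, high = z_interval(m=3.0, sigma=0.15, n=10)
print(f"95% interval: ({low:.3f}, {high:.3f})")
```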
Two-Sided Vs One-Sided Confidence Interval
$P\{\sqrt{N}\,(m - \mu)/\sigma < 1.64\} = 0.95$
$P\{m - z_{\alpha}\,\sigma/\sqrt{N} < \mu\} = 1 - \alpha$
$z_{\alpha}$      $1 - \alpha$
2.33      0.99
1.64      0.95
1.28      0.90
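The one-sided case puts all of alpha in one tail, so it uses the critical value z_alpha rather than z_{alpha/2}. A small sketch, assuming a known standard deviation; the function name `lower_bound` and the example numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def lower_bound(m, sigma, n, alpha=0.05):
    """One-sided (1 - alpha) lower confidence bound for the mean, sigma known."""
    z = NormalDist().inv_cdf(1 - alpha)   # z_alpha: all of alpha in one tail
    return m - z * sigma / sqrt(n)


# Hypothetical sample summary, for illustration only.
m, sigma, n = 3.0, 0.15, 10
one_sided = lower_bound(m, sigma, n)
two_sided_low = m - NormalDist().inv_cdf(0.975) * sigma / sqrt(n)
print(f"one-sided lower bound:      {one_sided:.3f}")
print(f"two-sided lower end point:  {two_sided_low:.3f}")
```

At the same alpha, the one-sided bound sits closer to the sample mean than the lower end of the two-sided interval, because z_alpha is smaller than z_{alpha/2}.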
Student's t-distribution
Similar to the normal distribution, but with larger spread (heavier tails)
It includes the additional uncertainty that comes with using the sample variance
As the degrees of freedom go to infinity, it becomes a normal distribution
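Sample variance: $S^2 = \frac{1}{N-1}\sum_{t=1}^{N} (x^t - m)^2$
$\sqrt{N}\,(m - \mu)/S \sim t_{N-1}$
$P\{m - t_{\alpha/2,N-1}\,S/\sqrt{N} < \mu < m + t_{\alpha/2,N-1}\,S/\sqrt{N}\} = 1 - \alpha$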
$N = 10$ observations:
3.0  3.1  3.2  2.8  2.9  3.1  3.2  2.8  2.9  3.0
$m = 3$
$S^2 = 0.022$
$S = 0.149$
$\alpha = 0.05$, $N - 1 = 9$
$t_{0.025,9} = 2.262$
$P\{3 - 0.107 < \mu < 3 + 0.107\} = 0.95$
$P\{2.893 < \mu < 3.107\} = 0.95$
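The interval for the ten observations can be checked with a short script. This is a sketch using only the standard library, which has no t-distribution quantile function, so the critical value is passed in by hand. The value 2.262 is the standard table entry for t_{0.025,9}; the function name `t_interval` is hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def t_interval(xs, t_crit):
    """Two-sided interval m +/- t_crit * S / sqrt(N), with S the sample std."""
    n = len(xs)
    m = mean(xs)
    s = stdev(xs)                      # divides by N - 1
    half_width = t_crit * s / sqrt(n)
    return m - half_width, m + half_width


xs = [3.0, 3.1, 3.2, 2.8, 2.9, 3.1, 3.2, 2.8, 2.9, 3.0]
T_CRIT_0025_9 = 2.262                  # t_{0.025, 9} from a standard t table
low, high = t_interval(xs, T_CRIT_0025_9)
print(f"m = {mean(xs):.3f}, S = {stdev(xs):.3f}")
print(f"95% interval: ({low:.3f}, {high:.3f})")
```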
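Sample $\{x^t\}_{t=1}^{N}$, with $x^t \sim \mathcal{N}(\mu, \sigma^2)$
$m = \frac{1}{N}\sum_{t=1}^{N} x^t$
Want to test if $\mu$ is not equal to some constant $\mu_0$
Null hypothesis - $H_0: \mu = \mu_0$
Alternative hypothesis - $H_1: \mu \neq \mu_0$
Reject $H_0$ if $m$ is too far from $\mu_0$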
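Sample $\{x^t\}_{t=1}^{N}$, with $x^t \sim \mathcal{N}(\mu, \sigma^2)$
$m = \frac{1}{N}\sum_{t=1}^{N} x^t$
Null hypothesis - $H_0: \mu \le \mu_0$
Alternative hypothesis - $H_1: \mu > \mu_0$
Reject $H_0$ if $\sqrt{N}\,(m - \mu_0)/\sigma > z_{\alpha}$ (variance known)
Reject $H_0$ if $\sqrt{N}\,(m - \mu_0)/S > t_{\alpha,N-1}$ (variance estimated by $S^2$)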
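$N = 10$ observations:
3.0  3.1  3.2  2.8  2.9  3.1  3.2  2.8  2.9  3.0
$m = 3$, $S^2 = 0.022$, $S = 0.149$
$\mu_0 = 2.9$
$H_1: \mu > 2.9$, $H_0: \mu \le 2.9$
$\alpha = 0.05$, $N - 1 = 9$
$t_{0.05,9} = 1.833$
$\sqrt{N}\,(m - \mu_0)/S = 2.121 > 1.833$
Therefore reject $H_0$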
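The one-sided test on the ten observations can be reproduced as follows. This is a sketch with the critical value t_{0.05,9} = 1.833 supplied by hand, since the standard library has no t quantile function; the function name `one_sided_t_test` is hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def one_sided_t_test(xs, mu0, t_crit):
    """Test H0: mu <= mu0 against H1: mu > mu0; returns (statistic, reject)."""
    n = len(xs)
    t_stat = sqrt(n) * (mean(xs) - mu0) / stdev(xs)
    return t_stat, t_stat > t_crit


xs = [3.0, 3.1, 3.2, 2.8, 2.9, 3.1, 3.2, 2.8, 2.9, 3.0]
T_CRIT_005_9 = 1.833                   # t_{0.05, 9} from a standard t table
t_stat, reject = one_sided_t_test(xs, mu0=2.9, t_crit=T_CRIT_005_9)
print(f"t = {t_stat:.3f}, reject H0: {reject}")
```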
Binomial Distribution
Coin Toss experiment
Probability of a head - $p$
The probability of observing $j$ heads in $N$ coin tosses is
$P\{X = j\} = \binom{N}{j}\, p^j (1 - p)^{N - j}$
Mean - $Np$
Standard Deviation - $\sqrt{Np(1 - p)}$
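The binomial probabilities, mean, and standard deviation can be checked numerically. A minimal sketch; the function name `binomial_pmf` and the choice of 10 fair coin tosses are assumptions made for illustration.

```python
from math import comb, sqrt


def binomial_pmf(j, n, p):
    """Probability of exactly j heads in n independent tosses with P(head) = p."""
    return comb(n, j) * p**j * (1 - p) ** (n - j)


n, p = 10, 0.5                                   # illustrative values
pmf = [binomial_pmf(j, n, p) for j in range(n + 1)]
mean_from_pmf = sum(j * q for j, q in enumerate(pmf))
var_from_pmf = sum((j - mean_from_pmf) ** 2 * q for j, q in enumerate(pmf))

print(f"P(X = 5) = {pmf[5]:.4f}")
print(f"mean = {mean_from_pmf:.3f}  (N p = {n * p:.3f})")
print(f"std  = {sqrt(var_from_pmf):.3f}  (sqrt(N p (1 - p)) = {sqrt(n * p * (1 - p)):.3f})")
```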
Classification of an instance
Classifier misclassifies an instance
Probability of misclassification - $p$
$e$ misclassified instances in a sample of $N$
Probability of a misclassified instance in $S$ - ($\hat{p} = e/N$)
Estimating $p$
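Binomial Test
Test whether the error probability $p$ is less than or equal to some value $p_0$.
Null hypothesis - $H_0: p \le p_0$
Alternative hypothesis - $H_1: p > p_0$
Reject $H_0$ with significance $\alpha$ if
$P\{X \ge e\} = \sum_{j=e}^{N} \binom{N}{j}\, p_0^j (1 - p_0)^{N - j} < \alpha$
Where $e$ is the observed number of misclassified instances out of $N$ and the probability is evaluated at $p = p_0$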
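The exact binomial test sums the upper tail of the binomial distribution at the hypothesized error rate. The sketch below assumes the rejection rule "tail probability below the significance level" and uses the figures from the deck's Example (1): 40 instances, 12 errors, hypothesized rate 0.2. The function name `binomial_tail` is hypothetical.

```python
from math import comb


def binomial_tail(e, n, p0):
    """P{X >= e} for X ~ Binomial(n, p0): chance of e or more errors under p0."""
    return sum(comb(n, j) * p0**j * (1 - p0) ** (n - j) for j in range(e, n + 1))


alpha = 0.05
tail = binomial_tail(e=12, n=40, p0=0.2)
reject = tail < alpha
print(f"P(X >= 12) = {tail:.4f}, reject H0: {reject}")
```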
Normal approximation: $\frac{e/N - p_0}{\sqrt{p_0 (1 - p_0)/N}}$ is approximately $\mathcal{N}(0, 1)$
Works well when $N$ is not too small and $p$ is not very close to 0 or 1
Example (1)
Let $N = 40$, $e = 12$, $\hat{p} = e/N = 0.3$
Set $p_0 = 0.2$, $\alpha = 0.05$
Alternative hypothesis: $H_1: p > p_0$
Null hypothesis: $H_0: p \le p_0$
Compute
$\frac{e/N - p_0}{\sqrt{p_0 (1 - p_0)/N}} = 1.58$, $z_{0.05} = 1.64$
$1.58 < 1.64$
Therefore fail to reject $H_0$
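The arithmetic in this example can be reproduced in a few lines. This is a sketch of the normal approximation to the binomial test; the function name `approx_normal_test` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def approx_normal_test(e, n, p0, alpha=0.05):
    """Normal approximation for H0: p <= p0 against H1: p > p0."""
    z = (e / n - p0) / sqrt(p0 * (1 - p0) / n)
    z_crit = NormalDist().inv_cdf(1 - alpha)      # one-sided critical value
    return z, z_crit, z > z_crit


z, z_crit, reject = approx_normal_test(e=12, n=40, p0=0.2)
print(f"z = {z:.2f}, z_crit = {z_crit:.2f}, reject H0: {reject}")
```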
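Example (2)
What is the 95% confidence interval around the error $p$, given $p_0 = 0.3$?
95% confidence interval: $\alpha = 0.05$
$P\{p_0 - z_{\alpha/2}\sqrt{p_0 (1 - p_0)/N} < p < p_0 + z_{\alpha/2}\sqrt{p_0 (1 - p_0)/N}\} = 1 - \alpha$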
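A numeric version of this interval needs a sample size, which the slide does not state. The sketch below assumes 40 instances, borrowed from Example (1), and uses the normal approximation to the binomial; the function name `error_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def error_interval(p_hat, n, alpha=0.05):
    """Approximate two-sided (1 - alpha) interval around an observed error rate."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


# n = 40 is an assumption; the slide gives only the error rate 0.3.
low, high = error_interval(p_hat=0.3, n=40)
print(f"95% interval: ({low:.3f}, {high:.3f})")
```

The interval narrows in proportion to one over the square root of the sample size, so quadrupling the number of validation instances halves its width.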
t-Test
So far we have looked at a single validation set.
Suppose we do a K-fold cross-validation, which gives K error percentages $p_i$, $i = 1, \ldots, K$
$m = \frac{1}{K}\sum_{i=1}^{K} p_i$, $S^2 = \frac{1}{K-1}\sum_{i=1}^{K} (p_i - m)^2$
Hence
$\sqrt{K}\,(m - p_0)/S \sim t_{K-1}$
Reject the null hypothesis with significance $\alpha$ if this value is greater than $t_{\alpha,K-1}$
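The K-fold t-test can be sketched as below. The per-fold error rates are made-up numbers for illustration, the function name `kfold_t_statistic` is hypothetical, and the critical value t_{0.05,9} = 1.833 is taken from a standard t table for K = 10 folds.

```python
from math import sqrt
from statistics import mean, stdev


def kfold_t_statistic(errors, p0):
    """sqrt(K) * (m - p0) / S for the K per-fold error rates."""
    k = len(errors)
    return sqrt(k) * (mean(errors) - p0) / stdev(errors)


# Hypothetical error rates from a 10-fold cross-validation run.
fold_errors = [0.22, 0.25, 0.19, 0.24, 0.21, 0.26, 0.23, 0.20, 0.24, 0.22]
T_CRIT_005_9 = 1.833                   # t_{0.05, 9} from a standard t table

t_stat = kfold_t_statistic(fold_errors, p0=0.2)
reject = t_stat > T_CRIT_005_9         # H0: p <= p0 against H1: p > p0
print(f"t = {t_stat:.2f}, reject H0: {reject}")
```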
Summary
Measures for evaluation
Experimental design
Estimating the generalized performance
Hypothesis testing
Interval estimation
Confidence intervals