Andrea Montanari
Stanford University
EE178/EE278A
Why???
Example
A first shot
[Figure: histogram of consumpt.; x-axis: consumpt., y-axis: Frequency]
Log-scale
[Figure: histogram of log(consumpt.); x-axis: log(consumpt.), y-axis: Frequency]
Log-scale
log values $= \{X_1, X_2, X_3, \dots, X_n\}$
$\mathrm{Height}(\mathrm{bin}_k) = \#\{i : X_i \in \mathrm{bin}_k\}$
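The counting rule for the bar heights can be sketched in a few lines of Python. This is only a minimal illustration, assuming equal-width, half-open bins; the function name `bin_counts` and the sample values are invented for the example and do not come from the slides.

```python
import math

def bin_counts(values, lo, width, n_bins):
    """Return Height(bin_k) = #{i : X_i in bin_k} for the equal-width bins
    bin_k = [lo + k*width, lo + (k+1)*width), k = 0, ..., n_bins - 1."""
    counts = [0] * n_bins
    for x in values:
        k = math.floor((x - lo) / width)
        if 0 <= k < n_bins:          # values outside the covered range are ignored
            counts[k] += 1
    return counts

# Hypothetical log-values, made up for illustration only.
log_values = [3.6, 4.1, 4.2, 4.4, 4.9, 5.0, 5.3]
counts = bin_counts(log_values, lo=3.5, width=0.5, n_bins=5)
print(counts)
```

Each entry of `counts` is the raw frequency for one bin, which is what the "Frequency" axis of the histograms reports.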
Bin size?
[Figure: two histograms of log(consumpt.) with different bin sizes; x-axis: log(consumpt.), y-axis: Frequency]
Something annoying...
[Figure: histogram of log(consumpt.); x-axis: log(consumpt.), y-axis: Frequency]
$\mathrm{Height}(\mathrm{bin}_k) = \dfrac{\#\{i : X_i \in \mathrm{bin}_k\}}{n \cdot \mathrm{length}(\mathrm{bin}_k)}$
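One way to see what the normalization by $n \cdot \mathrm{length}(\mathrm{bin}_k)$ buys is to check that the total area of the bars equals 1 whatever the bin size. The sketch below assumes equal-width bins and uses a synthetic uniform sample as a stand-in for the data; `density_heights` is a hypothetical helper name.

```python
import math
import random

def density_heights(values, lo, width, n_bins):
    """Height(bin_k) = #{i : X_i in bin_k} / (n * length(bin_k))."""
    n = len(values)
    counts = [0] * n_bins
    for x in values:
        k = math.floor((x - lo) / width)
        if 0 <= k < n_bins:
            counts[k] += 1
    return [c / (n * width) for c in counts]

rng = random.Random(0)
# Synthetic stand-in for the log-consumption data (an assumption, not the real data).
sample = [rng.uniform(3.2, 6.2) for _ in range(1000)]

areas = {}
for n_bins in (7, 35):                      # coarse and fine binning of [3.0, 6.5)
    width = 3.5 / n_bins
    heights = density_heights(sample, lo=3.0, width=width, n_bins=n_bins)
    areas[n_bins] = sum(h * width for h in heights)
    print(n_bins, areas[n_bins])
```

With raw counts the vertical scale changes with the bin size; with normalized heights the bars always enclose unit area, so histograms with different bins become comparable.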
[Figure: normalized histogram of log(consumpt.); x-axis: log(consumpt.), y-axis: Density]
The rationale
Data sample
$X_1, X_2, \dots, X_n \sim_{\text{i.i.d.}} f(x)$
$N_k \equiv \#\{i : X_i \in \mathrm{bin}_k\}, \qquad \mathrm{bin}_k = [a_k, b_k)$
[Figure: normalized histogram of log(consumpt.); x-axis: log(consumpt.), y-axis: Density]
Therefore
$\mathbb{E} N_k = n \int_{a_k}^{b_k} f(x)\,dx \approx n\, f(a_k)\, |b_k - a_k|$
Hence, with $H_k = \mathrm{Height}(\mathrm{bin}_k)$,
$\mathbb{E} H_k = \dfrac{\mathbb{E} N_k}{n\,|b_k - a_k|} = \dfrac{1}{|b_k - a_k|} \int_{a_k}^{b_k} f(x)\,dx \approx f(a_k)$
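The approximation $\mathbb{E} H_k \approx f(a_k)$ can be checked numerically for a density whose integral is known in closed form. The sketch below assumes a Gaussian $f$ with hypothetical parameters (mean 4.5, standard deviation 0.4), a single bin, and a made-up sample size; none of these values come from the slides.

```python
from statistics import NormalDist

# Assumed density for illustration: Gaussian with hypothetical parameters.
f = NormalDist(mu=4.5, sigma=0.4)

a_k, b_k = 4.0, 4.1            # one bin [a_k, b_k)
n = 1000                       # hypothetical sample size

exact_prob = f.cdf(b_k) - f.cdf(a_k)          # integral of f over the bin
expected_count = n * exact_prob               # E N_k
approx_count = n * f.pdf(a_k) * (b_k - a_k)   # n f(a_k) |b_k - a_k|

expected_height = exact_prob / (b_k - a_k)    # E H_k
print(expected_count, approx_count)
print(expected_height, f.pdf(a_k))
```

Because this $f$ is increasing on the chosen bin, $\mathbb{E} H_k$ falls between $f(a_k)$ and $f(b_k)$; the gap to $f(a_k)$ shrinks as the bin gets narrower.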
$H_k = \dfrac{1}{n\,|b_k - a_k|} \sum_{i=1}^{n} Z_i$
$Z_i = \begin{cases} 1 & \text{if } X_i \in [a_k, b_k), \\ 0 & \text{otherwise.} \end{cases}$
By the LLN, $H_k \to \mathbb{E} H_k$ as $n \to \infty$.
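A small simulation can illustrate the law-of-large-numbers argument: as $n$ grows, the height of a fixed bin settles near its expectation. This is a sketch under assumptions, with a Gaussian density of hypothetical parameters, an arbitrary bin, and a fixed random seed.

```python
import random
from statistics import NormalDist

f = NormalDist(mu=4.5, sigma=0.4)     # assumed density (hypothetical parameters)
a_k, b_k = 4.4, 4.6                   # one fixed bin [a_k, b_k)
target = (f.cdf(b_k) - f.cdf(a_k)) / (b_k - a_k)   # E H_k

rng = random.Random(1)

def bin_height(n):
    """H_k = (1 / (n |b_k - a_k|)) * sum_i Z_i for a fresh sample of size n."""
    z_sum = sum(1 for _ in range(n) if a_k <= rng.gauss(4.5, 0.4) < b_k)
    return z_sum / (n * (b_k - a_k))

errors = {n: abs(bin_height(n) - target) for n in (100, 10_000, 200_000)}
for n, err in errors.items():
    print(n, err)
```

The error for the largest sample is typically much smaller than for the smallest one, although any single run is random.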
[Figures: density histograms of log(consumpt.) for several values of $\Delta$; x-axis: log(consumpt.), y-axis: Density]
Variance:
[Figures: density histograms of log(consumpt.) for several values of $\Delta$; x-axis: log(consumpt.), y-axis: Density]
A tradeoff
Small $\Delta$: Very noisy.
Large $\Delta$: Misses details.
Bias-Variance tradeoff
Example:
Is $f(x)$ Gaussian?
[Figures: density histograms of log(consumpt.) under the title "Is $f(x)$ Gaussian?", and one histogram titled "A Gaussian sample"; x-axis: log(consumpt.), y-axis: Density]
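One informal way to approach the question is to compare the histogram heights with a Gaussian density that has the sample mean and standard deviation. The slides do not say how the comparison was made, so the sketch below is an assumption throughout: synthetic data, a moment-matched Gaussian, and an arbitrary bin width.

```python
import math
import random
from statistics import NormalDist

rng = random.Random(4)
# Synthetic sample (an assumption): drawn from a Gaussian, so the match should be good.
sample = [rng.gauss(4.5, 0.4) for _ in range(5000)]

fit = NormalDist.from_samples(sample)   # Gaussian with the sample mean and std

lo, width, n_bins = 2.5, 0.2, 20        # bins cover [2.5, 6.5)
counts = [0] * n_bins
for x in sample:
    k = math.floor((x - lo) / width)
    if 0 <= k < n_bins:
        counts[k] += 1
heights = [c / (len(sample) * width) for c in counts]

# Gap between each bar and the fitted density at the bin centre.
gaps = [abs(h - fit.pdf(lo + (k + 0.5) * width)) for k, h in enumerate(heights)]
print(max(gaps))
```

For real data, large gaps relative to the noise level of the bars would suggest that $f$ is not Gaussian; how large is "large" depends on the bin width and sample size.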
Theory
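$H_k = \mathrm{Height}(\mathrm{bin}_k)$
$\mathrm{bin}_k = [a_k, b_k) = [a - \Delta, a + \Delta)$
$H_k = \dfrac{1}{2 n \Delta} \sum_{i=1}^{n} Z_i$
$Z_i = \begin{cases} 1 & \text{if } X_i \in [a - \Delta, a + \Delta), \\ 0 & \text{otherwise.} \end{cases}$
$\mathbb{E} H_k = \dfrac{1}{2\Delta} \int_{a-\Delta}^{a+\Delta} f(x)\,dx$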
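Error
$\mathrm{MSE} = \mathbb{E}\big\{[H_k - f(a)]^2\big\}$
$= \mathbb{E}\big\{[H_k - \mathbb{E}(H_k) + \mathbb{E}(H_k) - f(a)]^2\big\}$
$= \mathbb{E}\big\{[H_k - \mathbb{E}(H_k)]^2\big\} + 2\,\mathbb{E}\big\{H_k - \mathbb{E}(H_k)\big\}\,[\mathbb{E}(H_k) - f(a)] + [\mathbb{E}(H_k) - f(a)]^2$
$= \underbrace{\mathrm{Var}(H_k)}_{\text{variance}} + \big(\underbrace{\mathbb{E}(H_k) - f(a)}_{\text{bias}}\big)^2$
The cross term vanishes because $\mathbb{E}\{H_k - \mathbb{E}(H_k)\} = 0$.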
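The decomposition of the MSE into variance plus squared bias can be checked empirically by drawing many independent samples and recording the bar height each time. The sketch below assumes a Gaussian density with hypothetical parameters and arbitrary values of $a$, $\Delta$ and $n$; expectations are replaced by averages over the repetitions.

```python
import random
from statistics import NormalDist, fmean, pvariance

f = NormalDist(mu=4.5, sigma=0.4)     # assumed density (hypothetical parameters)
a, delta, n = 4.5, 0.3, 200           # bin [a - delta, a + delta), sample size n
rng = random.Random(2)

def height():
    """One realization of H_k = (1 / (2 n delta)) * sum_i Z_i."""
    hits = sum(1 for _ in range(n) if a - delta <= rng.gauss(4.5, 0.4) < a + delta)
    return hits / (2 * n * delta)

heights = [height() for _ in range(2000)]
truth = f.pdf(a)

mse = fmean([(h - truth) ** 2 for h in heights])
variance = pvariance(heights)         # average of (H_k - mean)^2
bias = fmean(heights) - truth
print(mse, variance + bias ** 2)
```

With the averages taken over the same repetitions, the two printed numbers agree up to rounding, mirroring the algebra above. The bias comes out negative here because the bin is centred on the mode, where $f''(a) < 0$.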
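Variance
$H_k = \dfrac{1}{2 n \Delta} \sum_{i=1}^{n} Z_i$
$\mathrm{Var}(H_k) = \dfrac{1}{4 n \Delta^2}\, \mathrm{Var}(Z_i)$
$\mathrm{Var}(Z_i) = \mathbb{E}(Z_i) - \mathbb{E}(Z_i)^2$
$\mathbb{E}(Z_i) = \mathbb{P}\big(X_i \in [a - \Delta, a + \Delta)\big) = \int_{a-\Delta}^{a+\Delta} f(x)\,dx$
$\mathrm{Var}(H_k) = \dfrac{1}{4 n \Delta^2} \left[\int_{a-\Delta}^{a+\Delta} f(x)\,dx\right] \left[1 - \int_{a-\Delta}^{a+\Delta} f(x)\,dx\right]$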
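The variance formula can be compared with a simulation. The sketch below again assumes a Gaussian density with hypothetical parameters, evaluates $\mathbb{E}(Z_i)$ through the Gaussian CDF, and estimates $\mathrm{Var}(H_k)$ from repeated samples; the values of $a$, $\Delta$, $n$ and the number of repetitions are arbitrary.

```python
import random
from statistics import NormalDist, variance

f = NormalDist(mu=4.5, sigma=0.4)     # assumed density (hypothetical parameters)
a, delta, n = 4.5, 0.1, 100

p = f.cdf(a + delta) - f.cdf(a - delta)             # E(Z_i)
var_formula = p * (1 - p) / (4 * n * delta ** 2)    # Var(H_k) from the formula

rng = random.Random(3)

def height():
    """One realization of H_k for a fresh sample of size n."""
    hits = sum(1 for _ in range(n) if a - delta <= rng.gauss(4.5, 0.4) < a + delta)
    return hits / (2 * n * delta)

heights = [height() for _ in range(3000)]
var_simulated = variance(heights)
print(var_formula, var_simulated)
```

The two numbers should be close but not identical, since the simulated value is itself a noisy estimate.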
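$\mathbb{E} H_k = \dfrac{1}{2\Delta} \int_{a-\Delta}^{a+\Delta} f(x)\,dx$
$= \dfrac{1}{2\Delta} \int_{a-\Delta}^{a+\Delta} \Big[ f(a) + f'(a)(x - a) + \tfrac{1}{2} f''(a)(x - a)^2 + \dots \Big]\,dx$
$\approx f(a) + \tfrac{1}{6} f''(a)\,\Delta^2$
$\mathrm{Var}(H_k) = \dfrac{1}{4 n \Delta^2} \left[\int_{a-\Delta}^{a+\Delta} f(x)\,dx\right] \left[1 - \int_{a-\Delta}^{a+\Delta} f(x)\,dx\right]$
$\approx \dfrac{1}{4 n \Delta^2}\, f(a)\, 2\Delta = \dfrac{1}{2 n \Delta}\, f(a).$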
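Both small-$\Delta$ approximations can be compared with the exact expressions when $f$ is Gaussian, since the integral is then a difference of CDF values and $f''$ has a closed form. The density parameters, the point $a$ and the sample size $n$ below are hypothetical choices for illustration.

```python
from statistics import NormalDist

f = NormalDist(mu=4.5, sigma=0.4)     # assumed density (hypothetical parameters)
a, n = 4.2, 1000

def second_derivative(x):
    # For a Gaussian density, f''(x) = f(x) * ((x - mu)^2 - sigma^2) / sigma^4.
    return f.pdf(x) * ((x - f.mean) ** 2 - f.stdev ** 2) / f.stdev ** 4

rows = []
for delta in (0.2, 0.1, 0.05):
    p = f.cdf(a + delta) - f.cdf(a - delta)          # integral of f over the bin
    exact_mean = p / (2 * delta)                     # E H_k
    approx_mean = f.pdf(a) + second_derivative(a) * delta ** 2 / 6
    exact_var = p * (1 - p) / (4 * n * delta ** 2)   # Var(H_k)
    approx_var = f.pdf(a) / (2 * n * delta)
    rows.append((delta, exact_mean, approx_mean, exact_var, approx_var))
    print(delta, exact_mean, approx_mean, exact_var, approx_var)
```

The mean approximation tightens quickly as $\Delta$ shrinks. The variance approximation drops the factor $1 - \int f$, so it slightly overestimates the exact value at this point.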
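Hence
bias $= \mathbb{E} H_k - f(a) \approx \tfrac{1}{6} f''(a)\,\Delta^2 = \mathrm{const} \cdot \Delta^2$
Independent of $n$. Increases with $\Delta$.
variance $= \mathrm{Var}(H_k) \approx \dfrac{f(a)}{2 n \Delta} = \dfrac{\mathrm{const}}{n \Delta}$
Decreases with $n$. Decreases with $\Delta$.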
Summing up
[Figure: MSE as a function of Delta]
$\mathrm{MSE} = c_1 \Delta^4 + \dfrac{c_2}{n \Delta}$
Optimizing over $\Delta$:
$\Delta_{\mathrm{opt}} \propto n^{-1/5}$
$\mathrm{MSE}_{\mathrm{opt}} \propto n^{-4/5}$
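The scaling laws follow from setting the derivative of $c_1 \Delta^4 + c_2/(n\Delta)$ to zero, which gives $\Delta_{\mathrm{opt}} = \big(c_2 / (4 c_1 n)\big)^{1/5}$ and $\mathrm{MSE}_{\mathrm{opt}} = 5 c_1 \Delta_{\mathrm{opt}}^4$. The sketch below checks this closed form against a brute-force grid search; the constants are set to 1 purely for illustration (by the bias and variance formulas they would be $c_1 = (f''(a)/6)^2$ and $c_2 = f(a)/2$), and the function names are hypothetical.

```python
def mse(delta, n, c1=1.0, c2=1.0):
    """MSE = c1 * delta^4 + c2 / (n * delta); c1 = c2 = 1 are placeholder constants."""
    return c1 * delta ** 4 + c2 / (n * delta)

def delta_opt(n, c1=1.0, c2=1.0):
    """Minimizer of the MSE: solve 4 c1 delta^3 = c2 / (n delta^2)."""
    return (c2 / (4 * c1 * n)) ** 0.2

n = 1000
grid = [i / 10000 for i in range(1, 10001)]        # delta from 0.0001 to 1.0
best_on_grid = min(grid, key=lambda d: mse(d, n))
print(best_on_grid, delta_opt(n))

# Multiplying n by 32 = 2^5 should halve delta_opt and divide MSE_opt by 16.
ratio_delta = delta_opt(32 * n) / delta_opt(n)
ratio_mse = mse(delta_opt(32 * n), 32 * n) / mse(delta_opt(n), n)
print(ratio_delta, ratio_mse)
```

The last two ratios make the exponents concrete: a 32-fold increase in data buys a bin half as wide and an MSE 16 times smaller.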
In $d$ dimensions
$\Delta_{\mathrm{opt}} \propto n^{-1/(4+d)}$
$\mathrm{MSE}_{\mathrm{opt}} \propto n^{-4/(4+d)}$
E.g. for $d = 12$: $\mathrm{MSE}_{\mathrm{opt}} \propto n^{-0.25}$
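To get a feel for how quickly the rate degrades with dimension, one can tabulate the exponent $4/(4+d)$ and the factor by which $n$ must grow to halve the optimal MSE. This is a small arithmetic sketch based only on the scaling law above; the function names are hypothetical.

```python
def mse_exponent(d):
    """Exponent r in MSE_opt proportional to n^(-r), with r = 4 / (4 + d)."""
    return 4 / (4 + d)

def sample_factor_to_halve_mse(d):
    """Factor by which n must grow so that n^(-4/(4+d)) is cut in half."""
    return 2 ** ((4 + d) / 4)

for d in (1, 2, 4, 12):
    print(d, mse_exponent(d), sample_factor_to_halve_mse(d))
```

In one dimension a bit more than doubling the data halves the MSE, while for $d = 12$ the same improvement needs 16 times as much data.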