Professional Documents
Culture Documents
Robert Snapp
snapp@cs.uvm.edu
Department of Computer Science
University of Vermont
CS 295 (UVM)
Fall 2013
1 / 21
x3
w3
xn
wn
w0
(
T
y D sgn.w x C w0 / D
CS 295 (UVM)
1; if wT x C w0 > 0;
1; if wT x C w0 0:
Fall 2013
2 / 21
CS 295 (UVM)
Fall 2013
3 / 21
sgn.wT xi C w0 / D `i :
If this is the case, how can we find a w and w0 that satisfies the above?
CS 295 (UVM)
Fall 2013
4 / 21
CS 295 (UVM)
Fall 2013
5 / 21
CS 295 (UVM)
Fall 2013
6 / 21
0 1
b
xD
1
x
1
B x1 C
B C
D B : C 2 RnC1 I
@ :: A
and let,
b
wD
w0
w
w0
B w1 C
B C
D B : C 2 RnC1 :
@ :: A
xn
wn
n
o
b m D .b
X
xi ; `i / D .1; xTi /T ; `i i D 1; 2; : : : ; m
CS 295 (UVM)
Fall 2013
7 / 21
Consequently, the expression used by the LTU, simplifies to just an inner product, as
wT x C w0 D b
wT b
x D kb
wkkb
xk cos ;
where represents the angle between b
w and b
x, measured in their common plane.
Note that the cosine of determines the sign of the inner product.
CS 295 (UVM)
Fall 2013
8 / 21
float b
w[n C 1] = <.n C 1/ random floats>;
bool errorDetected = true;
float = <positive_number>;
while(errorDetected) f
errorDetected = false;
for(int i D 1; i m; i ++) f
if (sgn.b
wT b
xi / `i ) f
errorDetected = true;
b
w += `ib
xi ;
ggg
return b
w;
CS 295 (UVM)
Fall 2013
9 / 21
Normalized Coordinates
For i D 1; 2; : : : ; m, let
b
x0i D `ib
xi :
float b
w[n C 1] = <.n C 1/ random floats>;
bool errorDetected = true;
float = <positive_number>;
while(errorDetected) f
errorDetected = false;
for(int i D 1; i m; i ++) f
if (b
wT b
x0i < 0) f
errorDetected = true;
b
w += b
x0i ;
ggg
return b
w;
CS 295 (UVM)
Fall 2013
10 / 21
o
n
def
bm
b0 D
b
xi .b
xi ; `i / 2 X
x0i D `ib
X
m
let b
w? 2 RnC1 denote a homogeneous weight vector that satisfies the given
dichotomy, i.e., b
w?Tb
x0i > 0, for i D 1; 2; : : : ; m.
Let b
w.k / 2 RnC1 denote the value of the perceptrons homogeneous weight vector
after the k -th update.
CS 295 (UVM)
Fall 2013
11 / 21
2
kw(k)w(0)
k
Bk
Ak2
k
b 0 , and parameter , we will show that there exists constants
For a given dichotomy X
m
A and B such that
2
w.k / b
w.0/
Bk :
Ak 2
b
Thus the network must converge after no more than kmax D B =A updates.
CS 295 (UVM)
Fall 2013
12 / 21
4AC 0;
with equality if and only if there exists a value of t 2 R for which .t / D 0. (Letting
c D t implies that ca D b.) More generally,
4AC B 2 ) 4kak2 kbk2 4jaT bj2
Fall 2013
13 / 21
o
n
def
bm
b0 D
b
xi .b
xi ; `i / 2 X
x0i D `ib
X
m
let b
w? 2 RnC1 denote a homogeneous weight vector that satisfies the given
dichotomy, i.e., b
w?Tb
x0i > 0, for i D 1; 2; : : : ; m.
Let b
w.k / 2 RnC1 denote the value of the perceptrons homogeneous weight vector
after the k -th update.
CS 295 (UVM)
Fall 2013
14 / 21
b
w.k / D b
w.0/ C b
x0 .1/ C b
x0 .2/ C C b
x0 .k / :
Subtracting b
w.0/ from both sides, and taking the inner-product with respect to the
hypothetical solution b
w? yields,
b
w?T .b
w.k /
CS 295 (UVM)
b
w.0// D b
w?T b
x0 .1/ C b
x0 .2/ C C b
x0 .k / :
Fall 2013
15 / 21
b
w?T .b
w.k /
b
w.0// D b
w?T b
x0 .1/ C b
x0 .2/ C C b
x0 .k / :
Now define,
a D min b
w?Tb
x0 > 0:
b
x0 2b
X 0m
Thus,
b
w?T .b
w.k /
b
w.0/ ak > 0:
kb
w? k2 kb
w.k /
b
w.0/k2 jb
w?T .b
w.k /
b
w.0//j2 .ak /2 :
Thus,
kb
w.k /
CS 295 (UVM)
b
w.0/k2
a
kb
w? k
2
k 2:
Fall 2013
16 / 21
b
w.1/ D b
w.0/
C b
x0 .1/
b
w.2/ D b
w.1/
C b
x0 .2/
::
:
b
w.k / D b
w.k 1/ C b
x0 .k /
Now subtract b
w.0/ from both sides,
CS 295 (UVM)
b
w.1/
b
w.0/ D b
x0 .1/
b
w.2/
b
w.0/ D b
w.1/
::
:
b
w.k /
b
w.0/ D b
w.k
b
w.0/ C b
x0 .2/
1/
b
w.0/ C b
x0 .k /
Fall 2013
17 / 21
b
w.1/
b
w.2/
b
w.k /
2
0
2
b
w.0/
D 2
b
x .1/
2
2
0
2
T 0
b
w.0/
D
b
w.1/ b
w.0/
C 2 b
w.1/ b
w.0/ b
x .2/ C 2
b
x .2/
::
:
2
2
0
2
T 0
b
w.0/
D
b
w.k 1/ b
w.0/
C 2 b
w.k 1/ b
w.0/ b
x .k / C 2
b
x .k /
CS 295 (UVM)
Fall 2013
18 / 21
b
w.1/
b
w.2/
b
w.k /
2
0
2
b
w.0/
D 2
b
x .1/
2
2
0
2
b
w.0/
b
w.1/ b
w.0/
2b
w.0/Tb
x0 .2/ C 2
b
x .2/
::
:
2
2
0
2
b
w.0/
b
w.k 1/ b
w.0/
2b
w.0/Tb
x0 .k / C 2
b
x .k /
b
w.k /
2
2
0
2
0
2
b
w.0/
2
b
x0 .1/
C
b
x .2/
C C
b
x .k /
2b
w.0/T b
x0 .2/ C C b
x0 .k /
CS 295 (UVM)
Fall 2013
19 / 21
2
x0
;
M D max
b
b
x0 2b
X 0m
and
D 2 min b
w.0/Tb
x0 :
b
x0 2b
X 0m
(Note that < 0, unless b
w.0/ solves the dichotomy.)
Whence,
b
w.k /
2
2
0
2
0
2
b
w.0/
2
b
x0 .1/
C
b
x .2/
C C
b
x .k /
2b
w.0/T b
x0 .2/ C C b
x0 .k /
becomes
b
w.k /
CS 295 (UVM)
2
b
w.0/
.2 M
/k :
Fall 2013
20 / 21
Ak 2
b
w.k /
with
AD
a
kb
w? k
2
;
Thus,
kmax D
CS 295 (UVM)
2
b
w.0/
Bk :
and,
B D .M
/:
M ? 2
kb
w k :
a 2
Fall 2013
21 / 21