Professional Documents
Culture Documents
and we see that evaluation of the gradient at two points gives information about F.
If n linearly independent directions p0 , p1 , p2 pn−1 and the corresponding qk ’s
are known, then F is uniquely determined. Indeed, letting P and Q be the n × n
matrices with columns pk and qk respectively, we have
F = QP−1 (8)
Hk+1 qi = pi 0 i k (9)
which preserves symmetry. The vector zk and the constant ak define a matrix of
(at most) rank one, by which the approximation to the inverse is updated. We select
them so that (9) is satisfied. Setting i equal to k in (9) and substituting (10) we obtain
pk − Hk qk pk − Hk qk T
Hk+1 = Hk + 2
ak zkT qk
pk − Hk qk pk − Hk qk T
Hk+1 = Hk + (13)
qkT pk − Hk qk
pi − Hi qi pi − Hi qi T
Hi+1 = Hi + (14)
qiT pi − Hi qi
Then
where
pk − Hk qk
yk =
qkTpk − Hk qk
By the induction hypothesis, (16) becomes
290 Chapter 10 Quasi-Newton Methods
Hk+1 qi = pi + yk pTk qi − qkT pi
dk = −Hk gk
and then minimize fxk + dk with respect to 0. This determines xk+1 =
xk + k dk , pk = k dk , and gk+1 . Then Hk+1 can be calculated according to (13).
There are some difficulties with this simple rank one procedure. First, the
updating formula (13) preserves positive definiteness only if qkT pk − Hk qk > 0,
which cannot be guaranteed (see Exercise 6). Also, even if qkT pk −Hk qk is positive,
it may be small, which can lead to numerical difficulties. Thus, although an excellent
simple example of how information gathered during the descent process can in
principle be used to update an approximation to the inverse Hessian, the rank one
method possesses some limitations.
pk pTk Hk qk qkT Hk
Hk+1 = Hk + − T (17)
pTk qk qk Hk qk
Update k and return to Step 1.