You are on page 1of 14

<— —<

How Good Is a Broyden]Fletcher]


Goldfarb]Shanno-like Update
Hessian Formula to Locate Transition
Structures? Specific Reformulation of
Broyden]Fletcher]Goldfarb]Shanno
for Optimizing Saddle Points

JOSEP MARIA ANGLADA,1 JOSEP MARIA BOFILL2


1
C.I.D]C.S.I.C., Jordi Girona Salgado 18-26, E-08034 Barcelona, Catalunya, Spain
2
´
Departament de Quımica `
Organica, Universitat de Barcelona, Martı´ i Franques
` 1-11,
E-08028 Barcelona, Catalunya, Spain

Received 28 March 1997; accepted 8 September 1997

ABSTRACT: Based on a study of the Broyden]Fletcher]Goldfarb]Shanno


ŽBFGS. update Hessian formula to locate minima on a hypersurface potential
energy, we present an updated Hessian formula to locate and optimize saddle
points of any order that in some sense preserves the initial structure of the BFGS
update formula. The performance and efficiency of this new formula is compared
with the previous updated Hessian formulae such as the Powell and MSP ones.
We conclude that the proposed update is quite competitive but no more efficient
than the normal updates normally used in any optimization of saddle points.
Q 1998 John Wiley & Sons, Inc. J Comput Chem 19: 349]362, 1998

Keywords: Broyden]Fletcher]Goldfarb]Shanno formula; update Hessian


formula; transition structures; optimization; saddle points

Correspondence to: Dr. J. M. Bofill; e-mail: jmbofill@canigo.


qo.ub.es
Contractrgrant sponsor: Spanish DGICYT; contractrgrant
number: PB95-0278-C02-01

Journal of Computational Chemistry, Vol. 19, No. 3, 349]362 (1998)


Q 1998 John Wiley & Sons, Inc. CCC 0192-8651 / 98 / 030349-14
ANGLADA AND BOFILL

quadratic and nonquadratic functions. However,


Introduction for optimizing saddle points of nonquadratic func-
tions, the Huang family of formulae present a very
low convergence, in general. Greenstadt 10 showed

T he study of potential energy hypersurfaces


and specifically the location of their station-
ary points are crucial in modern theoretical chem-
that the update formulae can be derived from a
minimization of weighted errors where the weights
are positive definite. Based on these conclusions,
istry. The most important stationary points are Bofill and Comajuan11 observed that when many
minima Ži.e., equilibrium structures. and first-order update Hessian formulae members of the Huang
saddle points Ži.e., transition structures or TS.. family are applied to locate and optimize a saddle
In general, the algorithms for locating this class point, this leads to negative weights. However,
of stationary points are based in the quasi-New- this is not the sole drawback of the Huang family
ton]Raphson ŽQNR. type technique.1 Basically, the because they generally modify the approximate
methodology consists of defining at each step of transition vector too much that is associated with
the optimization process a quadratic model for the the desired TS from iteration to iteration, produc-
expansion of the energy function around some ing a deterioration of the optimization process or a
lack of convergence.11
point of the hypersurface. The quadratic model is
To optimize a saddle point structure using a
built by computing only the energy and the gradi-
QNR algorithm, the most widely used update for-
ent at this point. The Hessian matrix or its inverse
mulae are the Powell one12 and its generalization,
is approximated by updating it at each iteration
the so-called Murtagh]Sargent]Powell ŽMSP. fam-
rather than evaluating it exactly. This represents a
ily.11, 13, 14 The use of the Powell formula for this
big savings of computing time, because in general
type of optimization was first suggested by Si-
the second derivatives of the energy are much
mons et al.15
more expensive to compute than the gradient. Due
When applied to a quadratic function, the Pow-
to its general efficiency in achieving convergence ell and MSP update formulae11 ] 14 both present a
and the relative low cost of the computation, the limitation with respect to the BFGS update: they
QNR methodology is a powerful tool to locate and do not preserve the so-called conjugancy
optimize stationary points. property 1, 16 except when the conjugate search di-
The most important point in any QNR algo- rections are mutually orthogonal. It is important
rithm is to find a good formula for updating the that the update formula possesses the conjugancy
Hessian matrix or its inverse at each iteration. This property because it ensures that the QNR algo-
updating formula technique was introduced by rithm converges for a quadratic function in at most
Davidon2 and modified by Fletcher and Powell 3 n q 1 searches, where n is the number of vari-
and is often called the Davidon]Fletcher]Powell ables.1, 16 This result can be applied when the func-
ŽDFP. method. Variants of this updating formula tion to be optimized is not quadratic, because near
were developed by Broyden,4 Fletcher,5 Goldfarb,6 a stationary point the function presents quadratic
and Shanno7; this most popular BFGS formula is behavior. Consequently, the update formulae that
the standard Hessian updating formula used in preserve the conjugancy property without any lim-
the QNR algorithms to minimize molecular ge- itation, like the BFGS, in general are more efficient
ometries. in geometry optimizations of local minima than
The mathematical nature of these update for- others like the Powell or MSP formulae, which are
mulae can be analyzed from different points of more restricted with this property. In locating and
view. In a short but beautiful article, Huang 8 de- optimizing TS, one has the following dilemma:
rived and unified a family of update formulae, BFGS is the best update for optimizing minima but
even for a nonsymmetric Hessian. His derivation surprisingly it is not a good update for optimizing
was based on the preconditioned conjugate direc- saddle points. The following questions arise. What
tions algorithm9 applied to quadratic functions. is the reason for the low efficiency in the optimiza-
This family of update Hessians Žor its inverse. is tion of saddle points? Is it possible to reformulate
called the Huang family. In principle, any member the BFGS-like update formula to optimize saddle
of the Huang family can be applied to locate points in an efficient way? This article analyzes
maximum and saddle points of any order for these questions.

350 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

HUANG FORMULA’S TO UPDATE


Mathematical Background APPROXIMATE HESSIAN MATRICES
As pointed out in the Introduction section,
QNR METHOD Huang 8, 16 obtained a general formula for updat-
The QNR method1, 16 consists of first quadrati- ing the Hessian matrices by comparison with the
cally expanding around x k a function f Žx. that preconditioned conjugate direction method.9 The
depends on n real variables and is at least twice formula is
continuously differentiable:
C1 D g D g T q C 2 D g D x T B
B9 s B q r
f Žx q D x
k k.
C1 D g TD x q C2 D x T BD x
f qk ŽDx k . K 1 BD x D g T q K 2 BD x D x T B
1 y , Ž2.
T T K 1 D g TD x q K 2 D x T BD x
s f Žx k . q Žg k . D x k q Ž D x k . G k D x k , Ž1.
2
where D g s g kq 1 y g k and r , C1 , C2 , K 1 , and K 2
where D x s xk kq 1
y x , g is the gradient vector,
k k
are parameters. Normally, r is taken as equal to 1.
G k is the Hessian matrix, and q k Ž D x k . is the Selecting the other parameters conveniently one
quadratic approximation of f Žx k . for iteration k. gets the well-known update formulae such as the
The Hessian matrix G k of eq. Ž1. is substituted by Murtagh]Sargent ŽMS., 22 dual Greenstadt,10
a symmetric nonsingular matrix B k , which is up- DFP,2, 3 or BFGS.4 ] 7 Taking r s 1, C1 / 0, K 2 / 0,
dated from iteration to iteration. Sometimes this and C2 s K 1 s 0 we get the well-known BFGS
update is made directly on the inverse of the B k update formula for the B matrix, hereafter labeled
matrix, H k . In each iteration generally the correc- by BXBFGS .
tion D x k optimizes q k Ž D x k ., but sometimes D x k is
chosen such that it optimizes q k Ž D x k . in a selected D g D gT BD x D x T B
neighborhood of x k . BXBFGS s B q y . Ž3.
In principle, if the B k or H k matrices have the D g TD x D x T BD x
desired eigenvalue spectra, that is, the number of
positive and negative eigenvalues are correct, the Equation Ž2. can be seen also as an approximation
QNR algorithm should converge to a stationary of the Taylor series expansion of the Hessian G at
point such that its Hessian matrix G* possesses the point x k , where the second and the third term
the desired eigenvalue spectra at this point. of eq. Ž2. summarize the error due to the trunca-
To locate the optimize saddle points, the use of tion. In other words, eq. Ž2. may be written in a
a selected neighborhood during the optimization compact form as
converts the QNR method to a very efficient
one.13, 17 ] 19 This modification has been called a B9 s B q E, Ž4.
quadratic approximation,18b the trust region image
method18a ŽQArTRIM., or the restricted QNR13 and it can be seen as an approximation to the
ŽRQNR.. We note that other methods exist to com- Taylor series expansion of G Žsee ref. 23.,
pute the optimum step D x k , like the so-called
rational function optimization,20 but they will not G9 s G q FD x q ??? , Ž5.
be discussed here.
We are interested in locating and optimizing TS. where G9 is the Hessian matrix at x kq 1 and F is the
In this case, if one updates the approximated Hes- tensor of the third derivatives at x k . To update the
sian rather than its inverse, it is necessary to invert inverse Hessian matrix one uses an equation that
this matrix by diagonalization so the size and the is equivalent to eq. Ž4., that is, H9 s H q Q, where
sign of the eigenvalues can be controlled.21 Due to the Q matrix plays the role of the E matrix.16
this feature, we present only the update of the Finally, we note that the BFGS formula can be
Hessian matrix, B k , rather than its inverse H k . In reformulated with r s y1 to optimize saddle
order to simplify the notation, hereafter the su- points starting with a positive definite B matrix.
perindices will be dropped; that is, we take B for However, this has little practical value because
B k , B9 for B kq 1, and D x for D x k . this is only true in the strictly quadratic case.8

JOURNAL OF COMPUTATIONAL CHEMISTRY 351


ANGLADA AND BOFILL

VARIATIONAL BASIS OF UPDATE measure of the difference between B9 and G9. The
HESSIAN MATRICES value of this trace at the stationary point, N W ŽE*.,
Greenstadt 10 was first to apply a variational is11
principle to obtain update formulae, but it was 2
jT Wj Ž D x T j.
applied to the H matrix rather than the B matrix. N W Ž E*. s 2y . Ž 11.
Here we briefly summarize this variational princi- D x T MD x D x T MD xjT Wj
ple, but we apply it only to the B matrix.11 The QN
condition is defined as1, 16 In many situations the W matrix and, conse-
quently, the M matrix depend on a parameter,11
D g s Ž B q E . D x s B9D x. Ž6.
say f . The optimum value of this parameter that
This condition is an approximation of the first- minimizes N W ŽE. can be obtained by derivation of
order Taylor series expansion of g k with respect eq. Ž9. at the stationary point E*, p*, L* with re-
to the step vector D x conveniently rearranged. spect to f and imposing the stationary condition;
Greenstadt 10 evaluated the E matrix by minimiz- that is,
ing the following weighted Euclidean norm,
dN W Ž E*. dF Ž E Ž f . *, p Ž f . *, L Ž f . *, W Ž f ..
N W Ž E . s Tr Ž WEWET . , Ž7. s
df df
subject to the QN condition, eq. Ž6., rearranged as ­ ET ­ pT
s =E F q =pF
j s ED x s D g y BD x, Ž8. ­f ­f
and the symmetry condition E y E T s 0. In eq. Ž7. ­ LT ­F
Tr indicates the trace and W is a symmetric and q =LF q s 0. Ž 12.
­f ­f
positive definite matrix. The problem is formu-
lated using the following Lagrangian function, Using the fact that F is stationary with respect to
1 E, p, and L at the point E*, p*, and L*, one gets
F Ž E, p, L, W. s Tr Ž WEWET . after some rearrangements
2
2
q Tr w Ž ED x y j . p T x q Tr w L Ž E y E T .x , Ž9. jT WM9Wj Ž D x T MD x .
2
where the p vector and L matrix are the La- q D x T MD xjT Wj y Ž D x T j . D x T M9D x s 0, Ž 13.
grangian multipliers. With these considerations
the formula to update the B9 matrix is where M9 is the derivative of the M matrix with
respect to the f parameter. In the deduction of
B9 s B q E* s B q juT q ujT y Ž jTD x . uuT expression Ž13. the derivative of WM s I was used.
0 1 Note that the term in brackets is positive or equal
s B q w j ux w j ux T , Ž 10.
1 y Ž jTD x . to zero due to the Cauchy]Schwarz inequality.
Given a D x T MD x and jT Wj different from zero, if
where the E* matrix is the solution of eq. Ž9., w D x T MD xjT Wj y Ž D x T j. 2 x / 0 and the M9 matrix
u s Ž D x T MD x.y1 MD x, and M s Wy1. In Ap- is positive definite, the stationary condition is sat-
pendix A we give a derivation of the most general isfied by the condition jT WM9Wj s D x T M9D x s
inverse to eq. Ž10.. It is important to emphasize 0. On the other hand, if the term in brackets is
that this variational principle does not say any- zero, then the stationary condition should be satis-
thing about the form of the W matrix, except for its fied by the single condition jT WM9Wj s 0. How-
symmetric and positive definite character. How- ever, as will be seen below, these results have
ever, given suitable forms to the W matrix one can limited practical value.
derive many updated Hessian formulae as the
Powell,12 MSP,11, 13, 14 and others.10, 11
Equation Ž10. indicates that the best general DIRECT VARIATIONAL DEDUCTION OF BFGS
FORMULA TO UPDATE B MATRIX
correction to the B matrix from a variational point
of view is a rank two matrix. Applied to a quadratic From the author’s knowledge, all variational
function, because the vector j is conjugate to the deductions of the BFGS formula to update the B
previous directions, the B9 matrix will preserve the matrix were obtained indirectly. First the correc-
conjugancy condition if the u vector is also conju- tion to the H matrix is obtained using a formula
gate to the previous directions.1, 16 Equation Ž7. is a like Ž10. for inverse updating and after that one

352 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

evaluates its inverse.1, 24, 25 Here we present a di- Substituting eqs. Ž18. into eq. Ž16. and the result
rect variational deduction based on expression Ž10.. into eq. Ž10., we obtain eq. Ž3. after some algebraic
Because the E matrix represents a variation of the manipulations. From this deduction we can very
B matrix, the best W matrix should be an approxi- easily analyze the behavior of the BFGS formula.
mation to the next average integral,24 Because the M matrix should be positive definite,
then from eq. Ž16. it is necessary that the B and B9
1 y1 Ž
H0 G x k q aD x . da matrices both be positive definite too. This means
Ws . Ž 14. that the optimum situation for BFGS updating of
1
H0 da the Hessian matrix is when the QNR algorithm is
applied to minimize a function. In other condi-
tions, such as in the optimization of a TS, the BFGS
A reasonable approximation to this average inte-
update formula presents a degradation because in
gral consists of using the H and H9 matrices rather
this case the M matrix is not positive definite. This
than the full variation of the true inverse Hessian
means that from the present variational theory one
matrix, Gy1 , and replacing the integral by a dis-
cannot use eq. Ž3. directly to optimize a TS. How-
crete summation
ever, other important conclusions can be drawn
aH9 q bH from this deduction. First, BFGS is based on an M
Ws a G 0, b G 0, a q b s 1. or W matrix that is very close to the correct aver-
aqb
Ž 15. age of the variation on the Gy1 matrix. Second, the
parameters a and b are functions of the quotient of
In this way the M matrix should be an approxima- two Rayleigh quotients, namely D x T BD xrDx TD x
tion of the average integral of the true Hessian and D x T B9D xrD x TD x s D x TD grD x TD x, which
matrix G; that is, take into account the behavior of B and B9. These
two points explain the good performance generally
M s aB9 q bB a G 0, b G 0, a q b s 1. Ž 16. observed with the BFGS update formula when it
Note that the M matrix is positive definite if B and is applied to the optimization of a minimum. Fi-
B9 are positive definite. In order to evaluate the nally, if in eq. Ž16. one forces a s 1, then M s B9,
parameters a and b, we impose the following and substituting it in eq. Ž10. we get the DFP
condition: formula.10, 11 On the other hand, forcing b s 1,
then M s B, gives the dual Greenstadt
D x T MD x s aD x TD g q bD x T BD x formula.10, 11 Using these results we conclude that
s Ž D x TD g D x T BD x .
1r2 the BFGS formula is an average of the DFP and
dual Greenstadt update formulae.
a G 0, b G 0, a q b s 1, Ž 17.
where eq. Ž6. has been used. Now from eq. Ž17.
and the fact that a q b s 1, we get TS-BFGS UPDATE HESSIAN FORMULA

1r2 From the above analysis of the BFGS formula,


D x T BD x we can consider the possibility of adequately mod-

as
ž D x TD g / 1r2
ifying this formula to apply it to saddle point
optimizations. The first serious and maybe the
D x T BD x single problem is that the M matrix is nonpositive
1q
ž D x TD g / defined, as pointed out before. In BFGS the u
vector is
1r2
Ž D x T BD x .
s 1r2 1r2
, Ž 18a.
Ž D x TD g . q Ž D x T BD x . 1
us 1r2
Ž aD g q bBD x . , Ž 19.
1 Ž D g TD x D x T BD x .
bs 1r2
D x T BD x
1q
ž D x TD g / which is clearly a vector that is a linear combina-
tion of the D g and BD x vectors. Now, if we define
1r2 the M matrix in the following way,
Ž D x TD g .
s 1r2 1r2
. Ž 18b .
Ž D x TD g . q Ž D x T BD x . M s Ž 1 y f . <B < q f D g D g T , Ž 20.

JOURNAL OF COMPUTATIONAL CHEMISTRY 353


ANGLADA AND BOFILL

where the <B < is the B matrix, but forcing all eigen- parameter f as the square cosine of the angle
values to be positive, that is, between the D x and j vectors. In this way the f
n
parameter falls in the correct domain.
<B < s Ý < li <vi viT , Ž 21.
is1 PRACTICAL IMPLEMENTATION OF TS-BFGS
UPDATE HESSIAN FORMULA
where  l i , n
are the eigenpairs of the B matrix.
vi 4is1
Equation Ž20. can be seen as a rank one correction The update Hessian formula just derived is ade-
to the Ž1 y f .<B < matrix. Note that the M matrix quate for any algorithm of the type QArTRIM or
defined in this way is positive definite if 1 G f G 0. RQNR briefly described above with the eigenvec-
The corresponding W matrix is tor following method.20 This is because one needs

1
Ws TABLE I.
Ž1 y f .
Starting and Final Geometrical Parameters
f for First Transition Structure of Reaction
= <H < y <H < D g D g T <H < , CH3 CH 2 F + OH yª CH 2 CH 2 + H 2 O + F y.
1 y Ž 1 y D g T <H < D g . f
Ž 22. H8
-
H7 F6

0
0
C5
where <H < s Ž<B <.y1 . Now the u vector takes the
following form

1 C4
us

-
2 H3 0 H10
½ D x B < D x q Ž D x TD g . y D x T <B < D x f
T<
5 H9
O2
= w Ž 1 y f . <B < D x q f Ž D x TD g . D g x , Ž 23.
H1
which is a linear combination of BD x with B forced
to be positive definite and the D g vectors as in the Parameter Starting Final
BFGS formula. Substituting eq. Ž23. into eq. Ž10.
we get a new update formula, labeled BXTS-BFGS ; H1O 2 0.944 0.943
O 2H3 1.500 1.559
that is,
H3 C4 1.254 1.222
BXTS-BFGS C 4C 5 1.472 1.503
C5F6 1.600 1.393
T
Ž D g y BD x .w Ž 1 y f . <B < D x q f Ž D x TD g . D g x C5H7 1.112 1.126
sBq 2 C5H8 1.112 1.126
 D x T <B < D x q w Ž D x TD g . y D x T <B < D x x f 4 C4 H9 1.110 1.111
C 4 H10 1.110 1.111
w Ž 1 y f . <B < D x q f Ž D x TD g . D g x Ž D g y BD x . T
q H 3 O 2 H1 101.8 101.8
2
 D x T <B < D x q w Ž D x TD g . y D x T <B < D x x f 4 C4 H3 O 2 177.3 178.8
C5 C4 H3 108.2 108.4
D x T Ž D g y BD x . F6 C5 C4 111.2 114.0
y 2
2 H7 C5 C4 114.0 109.9
 D x T <B < D x q w Ž D x TD g . y D x T <B < D x x f 4
H8C5C4 114.0 109.9
= w Ž 1 y f . <B < D x q f Ž D x TD g . D g x H 9 C 4C 5 112.2 111.2
H10 C 4C 5 112.2 111.2
T
= w Ž 1 y f . <B < D x q f Ž D x TD g . D g x . Ž 24. H1O 2 H 3 C 4 0.0 5.0
O 2 H 3 C 4C 5 4.8 162.0
To obtain the best f parameter one should H 3 C 4C 5 F 6 180.0 180.0
substitute eqs. Ž20. and Ž22. and the derivative of H7 C5 C4 F6 115.0 120.6
the M matrix with respect to f into eq. Ž13.. H 8 C5 C4 F6 y115.0 y120.6
However, this gives a very complicated equation H 9 C 4C 5 H 3 118.2 119.3
on f to be solved and, in addition, sometimes the H10 C 4C 5 H 3 y118.2 y119.2
value of the resulting f does not fall in the do- Bond lengths in angstroms and bond angles and dihedrals
main 1 G f G 0. Instead of this we selected the in degrees.

354 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

the full diagonalization of the current B matrix. Compute u ¤ f uŽuTD x. and after that
Rather than evaluating BXTS-BFGS by using eq. Ž24.,
we employ the following algorithm: n
u ¤ u q Ž1 y f . Ý < li <vi ŽviTD x. . Ž 25.
Reset the vector u s 0. is1

Compute D g and store it in the u vector. Com-


pute the scalar product uTD x. Compute the scalar product uTD x and evaluate
Do the matrix]vector product BD x and store it u ¤ uŽuTD x.y1 .
in the j vector. Build j ¤ u y j. Finally, with the vectors j and u and the scalar
Compute the scalar products jTD x, D x TD x, and product jTD x, compute the BXTS-BFGS matrix
jT j. Evaluate f s ŽjTD x. 2rŽ D x TD xjT j.. using eq. Ž10..

TABLE II.
Behaviors of TS-BFGS, MSP, and Powell Update Formulae along Optimization Process of First
Transition Structure for Reaction CH3 CH 2 F + OH yª CH 2 CH 2 + H 2 O + F y.

TS-BFGS MSP Powell


Iteration 5g 5 a
f b 5g 5 a
cos v
2 c
a d 5g 5 a

1 29.76 0.86 29.76 0.965 0.00 e 29.76


2 27.34 0.90 27.34 0.963 0.00 e 27.34
3 22.64 0.90 22.64 0.936 0.00 e 22.64
4 13.49 0.59 13.48 0.599 0.00 e 13.48
5 3.13 0.66 3.17 0.570 0.00 e 3.12
6 0.71 0.02 0.55 0.073 0.00 e 2.06
7 0.22 0.07 0.30 0.003 1.00 0.64
8 0.12 0.65 0.14 0.007 0.99 0.40
9 0.05 0.04 0.09 0.314 0.00 e 0.34
10 0.01 0.00 0.03 0.401 0.00 e 0.31
11 5 ? 10 y3 0.01 0.02 0.037 0.95 0.26
12 5 ? 10 y3 0.03 4 ? 10 y 3 0.021 0.98 0.19
13 6 ? 10 y3 0.13 3 ? 10 y 3 0.000 1.00 0.11
14 7 ? 10 y3 0.24 3 ? 10 y 3 0.000 1.00 0.02 f
15 8 ? 10 y3 0.45 4 ? 10 y 3 0.000 1.00
16 0.01 0.45 4 ? 10 y3 0.000 1.00
17 0.01 0.54 5 ? 10 y3 0.005 0.99
18 0.02 0.27 6 ? 10 y3 0.008 0.99
19 0.02 0.04 7 ? 10 y3 0.019 0.99
20 0.02 0.11 8 ? 10 y3 0.038 0.97
21 0.03 0.10 0.01 0.046 0.95
22 0.04 0.01 0.01 0.038 0.96
23 0.02 0.00 0.02 0.046 0.96
24 0.02 0.00 0.02 0.051 0.00 e
25 0.01 0.00 0.02 0.080 0.00 e
26 0.01 0.00 0.03 0.017 0.98
27 2 ? 10 y3 0.00 0.04 0.011 0.99
28 6 ? 10 y 4 — 0.02 0.001 1.00
29 0.02 0.000 1.00
30 0.01 0.002 1.00
31 0.02 — —
a
The RMS gradient, 5g 5 s ŽgTgrn.1r 2 (kcal / mol).
b
The parameter f is defined as f = ( D xTD j) 2 / ( D xTD x D jTD j). See text for more details.
c
The cos 2v is defined in eq. (27), where v is the angle between the vectors j and D x. See text for more details.
d
The parameter a is computed according to the equation a = f 2 = (1 y cos 2 v ).
e
In this iteration the strictly Murtagh]Sargent 2 2 update formula is used. See text for more details.
f
Stationary point with two negative eigenvalues.

JOURNAL OF COMPUTATIONAL CHEMISTRY 355


ANGLADA AND BOFILL

as implemented in the GAMESS package


Discussion and Examples program27 for the ab initio calculations. The AM1
calculations were using a modified version of the
First we comment on the correct implementa- MOPAC program.28 In the AM1 optimizations, the
tion of the MSP update formula described in ref. convergence criteria were taken from: the maxi-
13 by considering the results that follow eq. Ž12.. mum component of D x and its root mean square
Briefly, the MSP update is a hybrid combination of ŽRMS., 5 D x 5 s Ž D x TD xrn.1r2 ; the maximum com-
the MS update 22 and the Powell update12 ; that is, ponent of g and its RMS, 5g 5 s Žg T grn.1r2 ; and
BXMSPŽ a. s Ž1 y a .BXMS q aBXP , where BXMP and BXP
are the MS and Powell updates, respectively, and
a is an arbitrary parameter defined in the domain
1 G a G 0. The M matrix that charactreizes the TABLE III.
MSP update formula, BXMSPŽ a. , is11 Starting and Final Geometrical Parameters of
Second Transition Structure for Reaction
CH3 CH 2 F + OH yª CH 2 CH 2 + H 2 O + F y.
D x TD x
M s f I q Ž1 y f . jjT 1 G f G 0, Ž 26. O2 H1
Ž D x T j.

0
H3
where f s a. Now the problem is to see which is
2
H9
H10
the best f parameter, that is, the f parameter that

0
C4
minimizes N W ŽE*.. Evaluating the inverse matrix
W and the derivative with respect to f , M9, corre-
sponding to the M matrix defined in eq. Ž26., and C5

-
substituting the results into eq. Ž13. we get H8 H7

2
Ž D x T j.
f s f cos 2 v s 0 1 G f G 0, Ž 27. F6
D x TD x jT j
Parameter Starting Final
where v is the angle formed between the vectors j
and D x. Equation Ž27. tell us that when cos 2 v / 0, H1O 2 0.962 0.963
O 2H3 0.966 0.965
the optimum f is f s 0, which corresponds to the
H3 C4 2.433 2.534
MS update.11 On the other hand, if cos 2 v s 0, any C 4C 5 1.334 1.336
f between 1 and 0 is, in principle, optimum. C5F6 2.500 2.420
However, the last situation very often occurs near C5H7 1.100 1.100
the convergence, but in this case the MS update is C5H8 1.100 1.104
not the optimum.1 Consequently, one should take C4 H9 1.097 1.096
any f such that 1 G f ) 0. From a practical point C 4 H10 1.096 1.096
of view, when cos 2 v F « , « being a small num- H 3 O 2 H1 102.5 102.4
ber, the parameter f is selected as f s C4 H3 O 2 111.5 106.2
Ž1 y cos 2 v .1r2 .11, 13 A suitable choice for « is 0.05. C5 C4 H3 81.3 74.5
Obviously, if along the optimization process F6 C5 C4 137.7 129.9
H7 C5 C4 123.5 122.4
cos 2 v F « , the MSP update coincides with the
H8C5C4 123.7 124.0
Powell update because in this case f f 1, and, H 9 C 4C 5 122.8 123.0
consequently, a f 1. H10 C 4C 5 122.7 122.4
Now we present the behavior and the perfor- H1O 2 H 3 C 4 0.0 y5.0
mance of the BFGS, TS-BFGS, MSP, and Powell O 2 H 3 C 4C 5 179.3 156.5
update formulae in the location and optimization H 3 C 4C 5 F 6 178.7 144.9
of TSs of several reactions. The calculations were H7 C5 C4 F6 87.6 99.9
performed at the AM126 and ab initio Hartree]Fock H 8 C5 C4 F6 y85.8 y74.2
levels. The optimizations were carried out in inter- H 9 C 4C 5 H 3 92.6 116.2
nal coordinates. The algorithms used to optimize H10 C 4C 5 H 3 y91.4 y68.5
the TSs were the RQNR described in ref. 13 for the Bond lengths in angstroms and bond angles and dihedrals
AM1 calculations and the QArTRIM algorithm18 in degrees.

356 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

the absolute value of the energy differences be- reactions.30 In Table I we present the geometrical
tween two consecutive iterations < f Žx kq 1 . y f Žx k .<; parameters of the initial and optimized molecular
with the threshold values 1.8 ? 10y3 A,
˚ 1.2 ? 10y3 A, ˚ geometries calculated for the first TS. In Table II
y1 ˚ y2 ˚
1.5 ? 10 kcalrmol A, 7.5 ? 10 kcalrmol A, and we compare the behavior of the optimization pro-
5.0 ? 10y4 kcalrmol, respectively, which are the cess using different update formulae. We observe
units used in the MOPAC program. In the case of that using the TS-BFGS update Hessian formula
the ab initio optimizations, the standard conver- the algorithm converges within 28 iterations.
gence criteria of the GAMESS program were used. Using the Powell update formula, the algorithm
In the GAMESS program a value of 7 ? 10y3 for the converges within 14 iterations. However, the sta-
« parameter was taken. tionary point reached possesses two imaginary fre-
quencies, 331.3i cmy1 and 39.5i cmy1 . On the
other hand, if we use the MSP update formula the
ELIMINATION REACTION CH 3 CH 2 F + OH yª
algorithm needs 31 iterations to converge to the
CH 2 = CH 2 + H 2 O + F y
correct TS. The correct optimized geometrical pa-
This reaction was studied by Cummins and rameters are those shown in Table I. At the final
Gready 29 with the AM1 method.26 The wave func- stationary point, the imaginary frequency
tion employed was the restricted Hartree]Fock. 305.3i cmy1 is associated with the true transition
The overall reaction occurs through two transition vector. Finally we note that using the standard
structures as found for other elimination BFGS formula, eq. Ž3., the algorithm diverges.

TABLE IV.
Behavior of TS-BFGS, MSP, and Powell Update Formulae along Optimization Process of Second
Transition Structure for Reaction CH3 CH 2 F + OH yª CH 2 CH 2 + H 2 O + F y.

TS-BFGS MSP Powell


Iteration 5g 5 a
f b 5g 5 a
cos v
2 c
a d 5g 5 a

1 0.34 0.25 0.34 0.043 0.96 0.34


2 0.10 0.11 0.10 0.017 0.98 0.10
3 0.05 0.36 0.05 0.014 0.99 0.05
4 0.08 0.42 0.08 0.015 0.99 0.08
5 0.12 0.00 0.12 0.192 0.00 e 0.12
6 0.15 0.52 0.16 0.512 0.00 e 0.16
7 0.17 0.20 0.19 0.001 1.00 0.20
8 0.14 0.01 0.09 0.121 0.00 e 0.09
9 0.04 0.24 0.03 0.025 0.97 0.03
10 0.03 0.27 0.02 0.002 1.00 0.02
11 0.02 0.22 0.01 0.001 1.00 0.02
12 0.02 0.23 0.01 0.000 1.00 0.02
13 0.01 0.21 0.01 0.000 1.00 0.01
14 0.01 0.40 0.01 0.000 1.00 0.01
15 0.01 0.68 0.01 0.000 1.00 0.02
16 0.01 0.50 0.01 0.000 1.00 0.02
17 0.01 0.19 0.02 0.000 1.00 0.03
18 4 ? 10 y3 0.04 0.06 0.002 1.00 0.05
19 4 ? 10 y3 0.01 0.04 — — 0.02
20 5 ? 10 y3 0.00 0.02
21 6 ? 10 y3 0.01
22 0.01 0.00
23 0.02 0.01
24 0.04 0.44
25 0.04 —
a
The RMS gradient, 5g 5 s ŽgTgrn.1r 2 (kcal / mol).
b
The parameter f is defined as f s Ž D xTD j. 2 rŽ D xTD x D jTD j.. See text for more details.
c
The cos 2 v is defined in eq. (27), where v is the angle between the vectors j and D x. See text for more details.
d
The parameter a is computed according to the equation a s f 2 s Ž1 y cos 2 v ..
e
In this iteration the strictly Murtagh]Sargent 2 2 update formula is used. See text for more details.

JOURNAL OF COMPUTATIONAL CHEMISTRY 357


ANGLADA AND BOFILL

The results of the second TS are presented in DECOMPOSITION REACTION CH 3 OOH ª


Tables III and IV. In Table III we show the geomet- CH 3 + OOH
rical parameters. The value of the imaginary fre- This reaction was studied using the unrestricted
quency is 240.9i cmy1 . In Table IV we show the Hartree]Fock version of the semiempirical AM1
behavior of the algorithm using the different up- Hamiltonian. In Table V we show the geometrical
dates of the Hessian matrix. The TS-BFGS update parameters and in Table VI the behaviors of the
needs 25 iterations to converge, while the Powell different update Hessian formulae. Using the TS-
and MSP ones need 20 and 19 iterations, respec- BFGS update, the algorithm needs 23 iterations to
tively. For the MSP update, at iterations 5, 6, and 8 converge and the f parameter is close to zero in
the a parameter is zero, which means that in this almost all iterations. Regarding the a parameter,
case MSP is just the MS update, while in the rest of we observe that the MSP update is a genuine MS
the optimization process MSP is just the Powell update in eight iterations. In this example the
update. We emphasize that using the three dif- Powell update is the more efficient. Again, using
ferent update formulae the optimization process the three different update formulae the optimi-
converges to the same molecular geometry with zation process converges to the same molecular
insignificant differences. Finally, the process does geometry with insignificant differences. Finally,
not converge employing the standard BFGS up- using the standard BFGS equation the algorithm
date formula. converges to a minimum corresponding to the
products of the reaction.

TABLE V.
Starting and Final Geometrical Parameters of
DISSOCIATION REACTION OF H 2 CO ª
Transition Structure for Reaction
H 2 + CO
CH3 OOH ª CH3 + OOH. This reaction was studied by Cerjan and Miller 31
H1
at the Hartee]Fock ab initio level with the STO-2G
-

basis set. We report for this reaction results ob-


O2
tained at the same level of theory using different
update Hessian formulae. In Table VII we present
O3 the geometrical parameters for the initial and final
geometry. Table VIII shows the behavior of the
different update formulae. The Hessian matrix at
the initial geometry is positive definite and the
H60 potential transition vector corresponds to the sec-
ond eigenvector if the eigenpairs are in increasing
C4
H5 value order. Employing the TS-BFGS update for-
-

H7
mula, in the first four iterations the Hessian matrix
Parameter Starting Final does not have the desired eigenvalue spectra. For
the MSP update, in the first four iterations and at
H1O 2 1.010 1.009 iteration 11 the Hessian matrix is positive definite.
O2O3 1.177 1.178 Also, in the Powell update at the first four itera-
O3C4 2.800 2.683 tions, the Hessian matrix does not present the
C4 H5 1.086 1.087 correct structure. Using the three different update
C4 H6 1.086 1.086
formulae the optimization process converges to the
C4 H7 1.088 1.086
H1O 2 O 3 112.6 112.5
same molecular geometry with insignificant differ-
O2O3C4 119.5 117.3 ences. Finally, and employing the standard BFGS
O3C4 H5 70.0 68.8 formula, the process does not converge.
O3C4 H6 92.3 75.9
O 3 C4 H7 107.8 125.0
H1O 2 O 3 C 4 88.2 90.4 METHOXY RADICAL ISOMERIZATION,
O2O3C4 H5 174.5 177.5 CH 3 O ª CH 2 OH
H5O3C4 H6 y120.9 y127.5
This reaction was studied by Culot et al.18b at
H 5 O 3 C4 H7 116.8 116.0
the Hartree]Fock ab initio level. Here we report
Bond lengths in angstroms and bond angles in degrees. the results for the same reaction within the C s

358 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

TABLE VI.
Behavior of TS-BFGS, MSP, and Powell Update Formulae along Optimization Process of Transition
Structure for Reaction CH3 OOH ª CH3 + OOH.

TS-BFGS MSP Powell


Iteration 5g 5 a f b 5g 5 a cos v2 c
a d 5g 5 a

1 0.62 0.14 0.62 0.136 0.00 e 0.62


2 0.20 0.17 0.20 0.197 0.00 e 0.20
3 0.14 0.18 0.14 0.239 0.00 e 0.14
4 0.11 0.03 0.09 0.013 0.97 0.08
5 0.15 0.00 0.15 0.004 0.99 0.13
6 0.05 0.00 0.21 0.009 0.98 0.14
7 0.06 0.00 0.38 0.522 0.00 e 0.19
8 0.04 0.00 0.16 0.167 0.00 e 0.43
9 0.08 0.00 0.12 0.090 0.00 e 0.20
10 0.07 0.01 0.09 0.009 0.98 0.08
11 0.17 0.60 0.07 0.006 0.99 0.07
12 0.11 0.07 0.07 0.019 0.96 0.06
13 0.10 0.07 0.07 0.051 0.00 e 0.06
14 0.09 0.21 0.06 0.000 1.00 0.04
15 0.09 0.46 0.06 0.057 0.00 e 0.05
16 0.08 0.03 0.06 0.030 0.94 0.04
17 0.07 0.00 0.06 0.008 0.98 0.04
18 0.07 0.00 0.05 0.000 1.00 0.02
19 0.07 0.01 0.04 0.000 1.00
20 0.07 0.02 0.04 0.000 1.00
21 0.07 0.11 0.03 0.000 1.00
22 0.07 0.03 0.02 0.001 1.00
23 0.07 — 0.02 0.002 1.00
24 0.05 — —
a
The RMS gradient, 5g 5 s ŽgTgrn.1r 2 (kcal / mol).
b
The parameter f is defined as f s Ž D xTD j. 2 rŽ D xTD x D jTD j.. See text for more details.
c
The cos 2 v is defined in eq. (27), where v is the angle between the vectors j and D x. See text for more details.
d
The parameter a is computed according to the equation a s f 2 s Ž1 y cos 2 v ..
e
In this iteration the strictly Murtagh]Sargent 2 2 update formula is used. See text for more details.

symmetry using the unrestricted Hartree]Fock sense that the u vector, which is present in any
wave function with the STO-3G basis set. In Table updating Hessian formula, is in this case a func-
IX we show the geometry and in Table X the tion of both the BD x and D g vectors, with the B
behavior of the three updated Hessians. The initial matrix forced to be positive definite as in the
Hessian matrix is positive definite, but just after standard BFGS formula. This update presents some
the first correction the Hessian achieves the correct stability and efficiency and is quite competitive
eigenvalue spectra in all three cases. The three with respect to the normal update formulae used
update formulae converge to the same molecular to locate TSs, such as the Powell and the MSP.
geometry. Using the standard BFGS formula, eq. Finally, we make the following consideration: be-
Ž3., the algorithm diverges. cause the BFGS update formula is the best formula
to update Hessian matrices for a minimization
algorithm, the TS-BFGS can be seen as its analogue
Concluding Remarks for optimizing saddle points, and the TS-BFGS
presents the same performance as the Powell or
We presented a BFGS-like updated Hessian for- MSP update formulae, it is likely that the Powell
mula to locate TSs. This TS-BFGS modification of and MSP formulae are the best update that can be
the standard BFGS formula is understood in the formulated to optimize saddle points.

JOURNAL OF COMPUTATIONAL CHEMISTRY 359


ANGLADA AND BOFILL

TABLE VII. TABLE IX.


Starting and Final Geometrical Parameters of Starting and Final Geometrical Parameters
Transition Structure for Reaction H 2 CO ª H 2 + CO. of Transition Structure for Reaction
CH3 O ª CH 2 OH within C S Symmetry.
H3
H4 O2

C1
H3

O2 C1
0

-
H4
Parameter Starting Final H5

C1O 2 1.220 1.199 Parameter Starting Final


C1H 3 1.110 1.122
C1H 4 1.110 1.537 C1O 2 1.423 1.423
O 2 C1H 3 123.4 156.7 C1H 3 1.484 1.326
O 2 C1H 4 122.4 107.9 C1H 4 1.087 1.088
O 2 C1H 3 H 4a 0.0 0.0 O 2 C1H 3 42.7 51.8
O 2 C1H 4 117.5 117.5
Bond lengths in angstroms and bond angles in degrees. H 3 O 2 C1H 4 y105.4 y106.4
a
Out of plane angle coordinate, defined from bond O 2 C1 to
plane C1H 3 H 4 . Bond lengths in angstroms and bond angles and dihedrals
in degrees.

TABLE VIII.
Behavior of TS-BFGS, MSP, and Powell Update Formulae along Optimization Process of Transition
Structure for Reaction H 2 CO ª H 2 + CO.

TS-BFGS MSP Powell


Iteration 5g 5 a
f b 5g 5 a
cos v
2 c
a d 5g 5 a

1 2.1 ? 10 y 2 e 4.2 ? 10 y 3 2.1 ? 10 y 2 e 4.2 ? 10 y 3 9.9 ? 10 y 1 2.1 ? 10 y 2 e


2 4.0 ? 10 y 2 e 6.4 ? 10 y 3 4.0 ? 10 y 2 e 6.0 ? 10 y 3 9.9 ? 10 y 1 4.0 ? 10 y 2 e
3 6.5 ? 10 y 2 e 9.2 ? 10 y 2 6.5 ? 10 y 2 e 8.6 ? 10 y 2 0.0 f 6.5 ? 10 y 2 e
4 8.5 ? 10 y 2 e 4.5 ? 10 y 1 8.4 ? 10 y 2 e 6.6 ? 10 y 1 0.0 f 8.5 ? 10 y 2 e
5 9.5 ? 10 y 2 8.4 ? 10 y 1 9.8 ? 10 y 2 7.3 ? 10 y 1 0.0 f 9.5 ? 10 y 2
6 1.3 ? 10 y1 6.0 ? 10 y 1 1.3 ? 10 y 1 6.4 ? 10 y 1 0.0 f 1.2 ? 10 y 1
7 1.0 ? 10 y1 7.3 ? 10 y 1 1.0 ? 10 y 1 7.2 ? 10 y 1 0.0 f 1.0 ? 10 y 1
8 8.6 ? 10 y 2 2.8 ? 10 y 2 8.5 ? 10 y 2 3.0 ? 10 y 1 0.0 f 8.8 ? 10 y 2
9 4.4 ? 10 y 2 2.1 ? 10 y 1 4.2 ? 10 y 2 4.3 ? 10 y 1 0.0 f 4.7 ? 10 y 2
10 5.2 ? 10 y 2 4.2 ? 10 y 1 3.7 ? 10 y 2 4.3 ? 10 y 1 0.0 f 5.1 ? 10 y 2
11 1.9 ? 10 y 2 1.0 ? 10 y 2 2.2 ? 10 y 2 e 6.3 ? 10 y 2 0.0 f 3.1 ? 10 y 2
12 1.0 ? 10 y 2 8.4 ? 10 y 2 2.3 ? 10 y 2 5.5 ? 10 y 1 0.0 f 1.2 ? 10 y 2
13 6.1 ? 10 y3 9.6 ? 10 y 2 2.1 ? 10 y3 0.0 1.0 6.5 ? 10 y3
14 2.2 ? 10 y 3 4.3 ? 10 y 2 9.9 ? 10 y 4 1.4 ? 10 y 1 0.0 f 4.6 ? 10 y 3
15 2.4 ? 10 y 4 1.7 ? 10 y 1 1.0 ? 10 y 4 1.2 ? 10 y 3 1.0 1.0 ? 10 y 3
16 1.5 ? 10 y 4 3.8 ? 10 y 1 8.0 ? 10 y 6 — — 6.5 ? 10 y 4
17 1.0 ? 10 y 5 — 1.2 ? 10 y 4
18 7.9 ? 10 y 6
The RMS gradient, 5g 5 s ŽgTgrn.1r 2 (Hartree / A ˚).
a
b
The parameter f is defined as f s Ž D xTD j. 2 rŽ D xTD x D jTD j.. See text for more details.
c
The cos 2 v is defined in eq. (27), where v is the angle between the vectors j and D x. See text for more details.
d
The parameter a is computed according to the equation a s f 2 s Ž1 y cos 2 v ..
e
Iterations where the Hessian matrix does not have the correct spectra.
f
In this iteration the strictly Murtagh]Sargent 2 2 update formula is used. See text for more details.

360 VOL. 19, NO. 3


BFGS REFORMULATION FOR OPTIMIZING SADDLE POINTS

TABLE X.
Behavior of TS-BFGS, MSP, and Powell Update Formulae along Optimization Process of Transition
Structure for Reaction CH3 O ª CH 2 OH.

TS-BFGS MSP Powell


Iteration 5g 5 a f b 5g 5 a cos v
2 c
a d 5g 5 a

1 3.6 ? 10 y 2 3.0 ? 10 y 3 3.6 ? 10 y 2 3.0 ? 10 y 3 9.9 ? 10 y 1 3.6 ? 10 y 2


2 4.2 ? 10 y 2 1.1 ? 10 y 1 3.6 ? 10 y 2 1.9 ? 10 y 1 0.0 e 3.6 ? 10 y 2
3 1.0 ? 10 y1 3.7 ? 10 y 3 4.2 ? 10 y 2 5.6 ? 10 y 1 0.0 e 4.9 ? 10 y 2
4 7.8 ? 10 y 2 6.6 ? 10 y 3 1.5 ? 10 y 2 1.8 ? 10 y 1 0.0 e 3.7 ? 10 y 2
5 3.0 ? 10 y 2 4.1 ? 10 y 2 1.9 ? 10 y 3 1.7 ? 10 y 2 0.0 e 2.3 ? 10 y 2
6 4.2 ? 10 y 2 1.6 ? 10 y 2 1.5 ? 10 y 2 8.0 ? 10 y 1 0.0 e 4.1 ? 10 y 3
7 8.9 ? 10 y3 3.9 ? 10 y 1 6.6 ? 10 y 3 6.1 ? 10 y 1 0.0 e 5.5 ? 10 y 4
8 1.2 ? 10 y 2 1.3 ? 10 y 1 1.1 ? 10 y 3 3.6 ? 10 y 2 0.0 e 1.4 ? 10 y 4
9 1.0 ? 10 y3 9.0 ? 10 y 2 4.3 ? 10 y 5 5.3 ? 10 y 1 0.0 e 8.3 ? 10 y 5
10 4.6 ? 10 y 4 1.6 ? 10 y 2 1.5 ? 10 y 5 — — 5.9 ? 10 y 6
11 6.7 ? 10 y 5 2.3 ? 10 y 1
12 9.1 ? 10 y 6 —
a
The RMS gradient, 5g 5 s ŽgTgrn.1r 2 (Hartree / A ˚).
b
The parameter f is defined as f s Ž D xTD j. 2 rŽ D xTD x D jTD j.. See text for more details.
c
The cos 2 v is defined in eq. (27), where v is the angle between the vectors j and D x. See text for more details.
d
The parameter a is computed according to equation a s f 2 s Ž1 y cos 2 v ..
e
In this iteration the strictly Murtagh]Sargent 2 2 update formula is used. See text for more details.

then the value of its elements are


Appendix A: Inverse of Matrix B9 Given
by Eq. ( 10)
uT Hu
D 11 s y 2
, Ž A.4a.
We are looking for the matrix H9 such that Ž D g T Hu. q uT HuD g T n
B9H9 s I. We assume that BH s I. Using the Sher-
man]Morrison formula1 on eq. Ž10. we get D g T Hu
D 12 s D 21 s 2
, Ž A.4b.
Ž D g T Hu. q uT HuD g T n
H9 s H y H w j u x Cy1 w j u x H.
T
Ž A.1.

In eq. ŽA.1. the C matrix is D gTn


D 22 s 2
. Ž A.4c.
y1
Ž D g T Hu. q uT HuD g T n
0 1
q w j ux H w j ux
T
Cs
1 y Ž jTD x .
Now substituting eqs. ŽA.4. into eq. ŽA.3. and the
y1 result into eq. ŽA.1., we get the expression for H9
0 1 yjT n yuT n
s q in a condensed form,
1 y Ž jTD x . yuT n uT Hu
yDg T n uT HD g H9 s H y D 11 nnT q D 12 Ž nuT H q HunT .
s , Ž A.2.
D g T Hu uT Hu
y D 22 HuuT H
where the following relations and definitions were yD11 D 12
used: Hj s HD g y D x s yn, jT n s nTD g q jTD x, s H q w n Hux w n Hux T
D 21 yD 22
and uT n s 1 y uT HD g. If Cy1 s D, where the
symmetric D matrix is s H q Q. Ž A.5.
D 11 D 12
Ds , Ž A.3. Equation ŽA.5. tells us that the general correction
D 21 D 22
to the H matrix is again a rank two matrix.

JOURNAL OF COMPUTATIONAL CHEMISTRY 361


ANGLADA AND BOFILL

16. L. E. Scales, Introduction to Non-Linear Optimization, Macmil-


lan, Basingstoke, U.K., 1985.
Acknowledgments
17. J. M. Anglada and J. M. Bofill, Int. J. Quantum Chem., 62,
153 Ž1997..
We are indebted to Professor S. Olivella for his
18. Ža. T. Helgaker, Chem. Phys. Lett., 182, 503 Ž1991.; Žb. P.
valuable suggestions. This research was supported
Culot, G. Dive, V. H. Nguyen, and J. M. Ghuysen, Theor.
by the Spanish DGICYT ŽGrant PB95-0278-C02-01.. Chim. Acta, 82, 189 Ž1992..
19. F. Jensen, J. Chem. Phys., 102, 6706 Ž1995..
20. A. Banderjee, N. Adams, J. Simons, and R. Shepard, J. Phys.
References Chem., 89, 52 Ž1985..
21. H. B. Schlegel, In Modern Electronic Structure Theory, D. R.
1. R. Fletcher, Practical Methods of Optimization, Wiley, New Yarkony, Ed., World Scientific Publishing, Singapore, 1995.
York, 1987.
22. B. A. Murtagh and R. W. H. Sargent, Comput. J., 13, 185
2. W. C. Davidon, Variable Metric Method for Minimization,
Ž1972..
AEC Research and Development Report, Argonne National
Laboratory Report, ANL-5990, 1959. 23. H. B. Schlegel, Theor. Chim. Acta, 83, 15 Ž1992..
3. R. Fletcher and M. J. D. Powell, Comput. J., 6, 149 Ž1963.. 24. M. Al-Baali and R. Fletcher, J. Oper. Res. Soc., 36, 405
4. C. G. Broyden, J. Inst. Math. Comput., 6, 222 Ž1970.. Ž1985..
5. R. Fletcher, Comput. J., 13, 317 Ž1970.. 25. J. E. Dennis and R. E. Schnabel, SIAM Rev., 21, 443 Ž1979..
6. D. Goldfarb, Math. Comput., 24, 23 Ž1970.. 26. M. J. S. Dewar, E. G. Zoebisch, E. F. Healy, and J. J. P.
7. D. F. Shanno, Math. Comput., 24, 647 Ž1970.. Stewart, J. Am. Chem. Soc., 107, 3902 Ž1985..
8. H. Y. Huang, J. Opt. Theor. Applic., 5, 405 Ž1970.. 27. M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert,
9. M. R. Hestenes, Conjugate Directions Methods in Optimiza- M. S. Gordon, J. H. Jensen, S. Koseki, N. Matsunaga,
tion, Springer]Verlag, New York, 1980. K. A. Nguyen, S. J. Su, T. L. Windus, M. Dupuis, and
10. J. Greenstadt, Math. Comput., 24, 1 Ž1970.. J. A. Montgomery, J. Comput. Chem., 14, 1347 Ž1993..
11. J. M. Bofill and M. Comajuan, J. Comput. Chem., 16, 1326 28. MOPAC Program, local version.
Ž1995.. 29. P. L. Cummins and J. E. Gready, J. Comput. Chem., 10, 939
12. M. J. D. Powell, Math. Prog., 1, 26 Ž1971.. Ž1989..
13. J. M. Bofill, J. Comput. Chem., 15, 1 Ž1994.. 30. N. S. Isaacs, Physical Organic Chemistry, Longman Scientific,
14. J. M. Bofill, Chem. Phys. Lett., 260, 359 Ž1996.. New York, 1987.
15. J. Simons, P. Jørgensen, H. Taylor, and J. Ozment, J. Phys. 31. C. J. Cerjan and W. H. Miller, J. Chem. Phys., 75, 2800
Chem., 87, 2745 Ž1983.. Ž1981..

362 VOL. 19, NO. 3

You might also like