Professional Documents
Culture Documents
Total Hessian Optimization Basu Hazra JOpt 2000
Total Hessian Optimization Basu Hazra JOpt 2000
P'dge 95-104
III le(ISI .f'luares oplimiZa/iOlI oj 1"11.1' de.film II,,' cOIl/rillll/ill/! oj second order
aberralioll cleriwlI;ves ill the Hessian malr;x tII"f! excluded ill cOlI/ptlri.l·ol/ with lilt! sall/e
jmlll 'he .firsl order a/tt'I'I'ClliOIl de,.i\'(lii1·('.~. Hl'eC',,' jor .101111' CII/"S('/~\' /"(!f1wrks 011 Ille
relative IIItigillitlldes (~,. the two contributions. 1I11./('l1Iwl basis just!fving this exclllsioll
has bem rel'0rle(1 ill Ihe fileratl/l~'. III .\"I,,·le (!t" Ihe /l1"II('/icul.l"lfccess ill oplimizalioll lI'ilh
Irwlculed Hrssialllllairix. it i~' gelleraflyjidlllt<lllllllillliwlioll wilillowl H('ss;(/IIII/"'rix
is likdy /0 I,mdm'c! bt'ller r('sulls. Willi lIlt' "h"'JOIIIC'II(1/ ri.\·C' ill COIl1fIUI(ltiolllll sl'eC'l1 ill
tll(' recent past. the primm:" dijficulty in lIIuJcr/akillg .1·.I'.rlemalic im'es1i8"licms on Ihis
I'mhlelll is gradual/y pelering 01/1. n,is IWI}(!/" P"<'.I"c'III.\· .1·Ollle l"f!suflS (!f" 0/11' illl'e.wig"-
liollS OIl least squares oplimiWlioll of lell.l· desigll /Ising tofU! Hessian mat/"i.\~
l.INTRODUCTION
Computer aided lens design involves tackling of a constrained nonlinear
multivariate opimization problem [n. An iterative procedure is usually adopted
to solve this problem by seeking the solution vector that yields an optimum value
of the objective function (more commonly described as merit or defect function
in optics) in the neighbourhood of an initial estimate. Most commonly adopted
procedures in practical nonlinear optimization are essentially variants of New-
ton's method that can attain an asymptotic rate of convergence, ifthe merit func-
tion can be approximated by a quadratic function over the search domain [2].
However. successful implementation of Newton's methods calls for ready avail-
ability of tirst and second deri vati ves of the merit function. In practical lens de-
sign problems. the merit function is usually formulated as a weighted sum of
squares of aberrations and pseudoabcrrations. This way of formulating the merit
function leads to an expression for the Hessian matrix as a sun of two matrices.
one of which involves only the first derivatives of the same. The difficulty in
obtaining fast and accurate values for the second part ofthe Hessian matrix has so
far prompted exclusion of this part in comparison with first part of the Hessian
matrix that could he determined from the values of the first derivatives of aberra-
tions alone [3-8]. An' incidental advantage of this approach is the positiVI;! sem-
idcfiniteness of the truncated Hessian matrix - one of .the key reasons for
pnu.:tical success of the -widely used damped least squares (DLS) algorithm in
lens design optimization. Exclusion of the seconu derivative terms in the Hessian
matrix is primarily motivated from computational considerations. and no other
justification for the same has yet been established. Indeed, for counteracting this
96
J. Ba.lu alld LN. Hazra
total exclusion of second derivative terms, several suggestions are made to incor-
porate the effects ofhomog~neous second derivatives in calculation of the damp-
ing factor in DLS programs, and the latter seem to provide better results [9-12].
The question, therefore, comes to mind : can we obtain better results if the total
Hessian, incorporating both first and second derivatives, is utilized in optimiza-
tion procedure? In view of the ready availability of fast computers, this is no
longer an infeasible proposition. By 'better' results in optimization procedure,
we imply either a faster rate of convergence, or overcoming stagnation, or a final
solution yielding a lower value for the defect function.
With the above backdrop in view, this paper presents some results of our
investigations on least squares optimization of lens design using total Hessian
matrix in the normal equations. The next section presents a brief restatement of
mathematical formulation of the problem. Section 3 presents numerical results of
optimization run with total Hessian, followed by our concluding remarks in the
last section.
2. Mathematical Formulation:
For the sake of convenience in representation. the merit function '" of eq.
(I) can be written as
M
\fI=D2 (2)
i=1
Wher!; (XI' x2••••••• , x N), i=I,2,3, ....... , M are the aberrations and pseudoaberra-
lions, and each of them is a function of the N variable Xi' j=1,2,3, ...... , N of the
lens system. Typically, the variables, also called the degrees of freedom of de-
sign, are the surface curvatures, aspheric coefficients, thicknesses, separations
between lens elements, optical materials for the lens elements etc.
fl XI
f2 Xl
f= X=
(3)
fM xN
J=
(5)
(7)
Furthermore. defining
(9)
we have,
G = 2JTJ + 2S (10)
Let us consider a change vector p given by
PI
P2
P= .
(11 )
PN
Using Taylor's series expansion of the gradient of 'I' • at the point x+p in the
hyperspace of the variabes,we have
g(x+p) =g+Gp (12)
If the point x + pis to be a mi ni mum of the function'll, then g (x + p) must
be zero. and we have
=
Gp -g. (13)
Substi tuting from Eqs. (6) and ( 10). we obtain
=
(P J + S) p -JTf. (14)
On nonquadratic functions, x.+p wi1l not in general, be tne minimum if pis
given by Eq. (14). and the process has to be performed iteratively. Thus at the kth
ih..~rt.Hion. the nonnal equations are
kth iteration and the initial estimate for the (k+ 1) th iteration is
Xk+1 = Xk+Pk (16)
3. NUMERICAL RESULTS
Evaluation of total Hessian in each iteration involves detennination of its
two parts, nameJy JTJ and S. Whereas the evaluation of rJ calls for the compu-
tation of (MN) first order aberration derivatives, the evaluation of S neccessitates
the computation of (MN) homogeneous and {(MN(N-I)I2) mixed second order
derivatives. Numerical techniques for reliable computation of these aberration
derivatives have been pre);ented earlier [13-14].
Experiments for total Hessian optimization were conducted on the follow-
ing systems:
i) An f/4.3 cemented doublet of focal length 6.4 inch and semi field an-
glctfl.
ii) An fl-t. air ~paced triplet of focal length 100 mm and semi field angle
15 n.
(inilial system adapted from U,S. Patent No. 1.987,878).
iii) An fl I. 12 Pellval lens of focal length 50mm and semi field angle S().
(initial system adapted from.U.S.Patent No.2. 158.202).
i\') An 171.5 Dmihle Gauss system of focal length I inch and semi field
angll! 2~1I.
(initial system adapted from U.S. Patent No. 2.379, 392).
For the cementeddoubfet the degrees of freedom are the three curvatures
=
and the two thicknesses, i.e .. N 5. Tables lA and IB give the JTJ and S matrices
Table IA.
The 5x5 JTJ matrix. for the cemented doublet.
Gauss system where we used twelve curvatures, seven thicknesses and five air
separations as degrees of freedom, i.e., N 24. =
Tablem
Thble II
Twenty four elements of the first and the last rows of the matrices JTJ and S for
the double Gauss system.
r (PJl 1• (S) I. <.PJ )l4. (S) l4r
Figure 1-4 present results of optimization run for the four lens systems men-
tioned above. They demonstrate significantly high rate of convergence
achieved in least squares optimization when the total Hessian is utilised.
900
800
700
..
III
C 600
0
u 500
--
c
:J
'0:::
Q)
::e
400
300
200
100
0
0 5 10 15
No. of iterations
fig. I Results of optimization run using lolal Hessian method with line search on a cemented doublet.
102
.I Da,wl and L. ,IV Ha;m
600
500
c:
0400
I
..
.2 300
' I;
~ 200
100
0
0 2 3 .\ 5 6 7 8
No. of iterations
Fig. 2. Resul!s of optimizalion run lIsing lolal Hessian method with line search on a Tronnier Iriplet.
1900
1880
1860
1840
1820
.. 1800
·c 1780
II
:E 1760
1740
1720
1700
0 2 6 8 10 12 14 16
No. of Iterations
Fig. 3. iteslilts bf 0l'Wnir.ailon nlll ll~ihg tolal Hessian method with lihe scardl oh a Schade Petzval
sY!item.
103
TOTAL IIESSIAN I~ LEAST SQUARES OPTIMIZATION OF LENS DESIGN
3500
3000
roo 2000
1500
1 1000
500
0
0 5 10 15
No.of lleratlon.
Fig. 4. Results of optimi/alion run using total lics.~ian Int'thod with line sear~ h on OJ Double
Gauss ~ ys tem .
4. CONCLUDING REMARKS
It is obvious that the total Hessian method for lens design optimization.
pre~ented above. takes proper account of the relative nonlinearity of the vari-
able~. and this may be one of the reasons for fast convergence at the initial stages
of the optimization run. Another advantage is the nonrequirement of any damp-
ing factor.
Nevertheless. after first few iteration stages. the convergence is slow which
can hardly justify the large amount of additional computation required in the
evaluation of total Hessian at this stage. A hybrid method incorporating Ihe total
Hessian approach along with usual DLS or Gauss-Newton approach seems 10 be
more appropriate for overall optimiz.ation.
REFERENCES
I. R. R. Shannon. The An and Sden~e of Opti~ a l Design. Camoridge l.;niversity Prc s ~. Cam·
bridge ( 1997).
L.E. Scales. IntrodUl:tion to Nonlinerar Optimil.ation. Manniilal!. l.lIndon 11981 ).
J. A . Girard. Rev. Opt 37. 225-241. W7-424! 1958l.
4. C. G. Wynne. Proc. Phys. SO<.' . London 73. 777-7F.7 119591.
5. D.P. Feder. Appl. Opt. 2. 1209-1226 (196.h
6. T H.Jamieson . Optimir.ation Techniques in lens Design, Adam Hilger. London I 1971 )
7. M.1 . Kidgcr, Opt. Eng. jl(8J. 1731-1739 l199J).
104
J. Bam and L. N. Haz",