Professional Documents
Culture Documents
Optimization Note PDF
Optimization Note PDF
Reading: Sundaram
Ch 3, Ch 4.1-4.5, (4.6 optional)
Ch 5.1-5.5 (5.2.3, 5.6 optional), Ch. 6.1-6.4 (6.1.2, 6.1.3 optional)
Weierstrass’ Theorem
Let S ⊆ RN be a compact set and f : RN → R is a continuous function. There is a
maximum and minimum of f in S. That is there exists m1 , m2 ∈ S such that
f (m2 ) ≤ f (x) ≤ f (m1 ) for all x ∈ S. These are also called global maximum and
global minimum.
Proof: First we shall show that f (S) is a compact set. Then we shall show that
compact sets in R has maximum and minimum.
Step 1: Take any sequence {yn }∞ n=1 in f (S). yn ∈ f (S) implies there is xn ∈ S such
that yn = f (xn ). Take the sequence {xn }∞ ∞
n=1 in S. Since S is compact, {xn }n=1 has a
∞
subsequence {zn }n=1 which converges to a point z in S. Since f is continuous
{f (zn )}∞ ∞ ∞
n=1 converges to f (z). Of course {f (zn )}n=1 is a subsequence of {yn }n=1 and
f (z) ∈ f (S) (because z ∈ S). Thus every sequence in f (S) has convergent
subsequence implying f (S) is compact.
Step 2: f (S) is compact in R implies f (S) is closed and bounded. f (S) is bounded
implies f (S) has a supremum (say M1 ) and an infimum (say M2 ). Since M1 (similarly
M2 ) satisfies -test, it is a limit point of f (S). Since f (S) is closed, M1 must be in
f (S). Thus there exists m1 ∈ S such that f (m1 ) = M1 ≥ f (x) for all x ∈ S.
Optimization: Existence
Examples
1. f : R → R. f (x) = x. There is no maximum. Here f is continuous but domain is
not compact because it is not bounded.
2. f : (0, 1) → R. f (x) = x. There is no maximum. f is continuous but once again
domain is not compact because it is not closed.
3. f : [0, 1] → R. f (x) = x for all x 6= 1 and f (1) = 0. There is no maximum. Here
domain is compact but f is not continuous.
Weierstrass’ Theorem gives ‘sufficient condition’ for existence of maximum and
minimum. It is not necessary to have these properties for existence of optimum.
4. Take f : [0, 1] → R. f (x) = x for all x 6= 1/2 and f ( 1/2 ) = 0. This has maximum
and minimum.
Finding Interior Optima
Interior Optima
Let f : S → R, where S ⊆ RN . We want to maximize (or minimize) f over S. We
shall focus on maximum. Note that minimum of f is the same as maximum of −f .
Instead of finding global maximum, we start from finding interior local maxima and
use them to find global maximum.
x ? is called an interior local maximum if
(i) x ? is an interior point of S
(ii) There exists > 0 such that f (x ? ) ≥ f (x) for all x ∈ B (x ? ) ⊆ S.
Note: At this moment we are excluding points that are not interior point of S.
Note: It is strict local maximum if the above inequality is strict.
First order necessary condition:
x ? is interior local maximum (or minimum) and f is differentiable at x ? ⇒ Df (x ? ) = 0
Proof: Take N = 1. Take two sequence {yn }∞ ∞
n=1 and {zn }k=1 such that
? ? ?
lim yn = lim zn = x ; yn < x and zn > x for all n.
f (yn ) − f (x ? ) f (zn ) − f (x ? )
≥0≥
yn − x ? zn − x ?
which is both PSD and NSD. However x ? is neither local min nor local max. For
> 0, f (, 0) > 0 and f (−, 0) < 0.
Example: Caution 2: Local maximum does not imply that it is global maximum.
Global maximum and minimum may not exist.
S = R, f (x) = 2x 3 − 3x 2 . Critical points are x ? = 0, 1. Second order conditions
imply 0 is a strict local maximum and 1 is strict local minimum.
Our algorithm will say that global maximum is obtained at 0 and global minimum is
obtained at 1.
However there is no global maximum or minimum. f (x) strictly increases for x > 1
and x < 0. Of course a local maximum is obtained at 0 and a local minimum is
obtained at 1.
Optimization with Equality Constraints
Suppose that the objective function is f : RN → R and there are K (< N) constraint
functions, g 1 : RN → R, g 2 : RN → R, . . . , g K : RN → R. We want to maximize
(or minimize) f (x) over all x ∈ RN such that g 1 (x) = 0, g 2 (x) = 0, . . . , g K (x) = 0.
Once again, we start from local maxima x ? that satisfy K constraints. That is
(i) g 1 (x ? ) = 0, g 2 (x ? ) = 0, . . . , g K (x ? ) = 0
(ii) There exists an open set U such that f (x ? ) ≥ f (x) for all x ∈ U and
g 1 (x) = 0, g 2 (x) = 0, . . . , g K (x) = 0.
Note: K < N makes the domain non trivial.
First order necessary condition (Theorem of Lagrange):
Suppose that f , g 1 , . . . , g K are C 1 functions.
x ? is local maximum (or minimum) of f ⇒ There exists λ?1 , λ?2 , . . . , λ?K ∈ RK such
that
K
X
Df (x ? ) + λ?j Dg j (x ? ) = 0
j=1
However this hold only when rank(Dg (x ? )) = K . This condition is called constraint
qualification. 1
∂g ∂g 1 ∂g 1
...
∂x1 ∂x2 ∂xN
Dg = . . . . . . . . . . . . . . . . . . . . . . .
∂g K ∂g K ∂g K
∂x1 ∂x2
... ∂xN
Second Order Condition
3x12 + λ1 = 0 (1)
3x22 − λ1 = 0 (2)
x1 − x2 = 0 (3)
Only solution is x1 = x2 = λ = 0. Even second order conditions can not rule out
x1 = x2 = λ = 0 as local maximum and minimum. However (0, 0) is neither local
maximum nor local minimum. For instance, take f (, ) = 23 > 0 = f (0, 0), when
> 0.
This also suggests that there is no global maximum and global minimum, which is
indeed true. f (, ) = 23 goes to ∞ as → ∞
Pathway and Caution
−2λ1 x1 = 0 (4)
−1 + 3λ1 x22 = 0 (5)
x12 − x23 = 0 (6)
x12 + 2 + λ1 = 0 (7)
−3x2 − λ1 = 0 (8)
x1 − x2 = 0 (9)
∂f ∂g ? ?
(x ? , x ? ) + λ? (x , x ) = 0 (10)
∂x1 1 2 ∂x1 1 2
∂f ∂g ? ?
(x ? , x ? ) + λ? (x , x ) = 0 (11)
∂x2 1 2 ∂x2 1 2
∂f
∂x2
(x1? ,x2? )
Equation (11) gives λ? = − ∂g . Remains to show Equation (10) holds for λ? .
∂x2
(x1? ,x2? )
∂f ∂g ∂f ∂f
(x ? , x2? )
∂x1 1
+ λ? ∂x (x1? , x2? ) = ∂x (x1? , x2? ) + (x ? , x2? )h0 (x1? )
∂x2 1
(by A)
1 1
=0 (by B)
Implicit Function Theorem