Professional Documents
Culture Documents
Dario Pighin
Departamento de Matemáticas, Universidad Autónoma de Madrid
28049 Madrid, Spain
Chair of Computational Mathematics, Fundación Deusto
University of Deusto, 48007, Bilbao, Basque Country, Spain
Introduction
In this manuscript, the long time behaviour of semilinear optimal control prob-
lems as the time-horizon tends to infinity is analyzed. Our results are global,
meaning that we do not require smallness of the initial datum for the governing
state equation.
In [33], A. Porretta and E. Zuazua studied turnpike property for control problems
governed by a semilinear heat equation, with dissipative nonlinearity. In particular,
[33, Theorem 1] yields the existence of a solution to the optimality system fulfilling
the turnpike property, under smallness conditions on the initial datum and the
target. Our first goal is to
(1) prove that in fact the (exponential) turnpike property is satisfied by the
optimal control and state;
(2) remove the smallness assumption on the initial datum.
We keep the smallness assumption on the target. This leads to the smallness and
uniqueness of the steady optima (see [33, subsection 3.2]), whence existence and
uniqueness of the turnpike follows. We also treat the case of large targets, under
the added assumption that control acts everywhere. In this case, we prove a weak
turnpike result, which stipulates that the averaged infimum of the time-evolution
functional converges towards the steady one. We also provide an L2 bound of the
time derivative of optimal states, uniformly in the time horizon.
Generally speaking, in turnpike theory a time-evolution optimal control problem
is considered together with its steady version. The “Turnpike Property” is verified
y
steady optimum
quasi-optimum
y0 optimum
t
O τ T −τ T
if the time-evolution optima remain close to the steady optima up to some thin
initial and final boundary layers.
An extensive literature is available on the topic. A pioneer on the topic has been
John von Neumann [43]. In econometrics the topic has been widely investigated
by several scholars including P. Samuelson and L.W. McKenzie [13, 38, 27, 28, 29,
9, 21]. Long time behaviour of optimal control problems have been studied by P.
Kokotovic and collaborators [44, 2], by R.T. Rockafellar [37] and by A. Rapaport
and P. Cartigny [35, 36]. A.J. Zaslavski wrote a book [47] on the topic. A turnpike-
like asymptotic simplification have been obtained in the context of optimal design
of the diffusivity matrix for the heat equation [1]. In the papers [12, 18, 17, 39], the
concept of (measure) turnpike is related to the dissipativity of the control problem.
Recent papers on long time behaviour of Mean Field games [7, 8, 31] motivated
new research on the topic. A special attention have been paid in providing an
exponential estimate, as in the work [32] by A. Porretta and E. Zuazua, where
linear quadratic control problems were considered. These results have later been
extended in [41, 33, 46, 40, 20, 19] to control problems governed by a nonlinear state
equation and applied to optimal control of the Lotka-Volterra system [23]. Recently
turnpike property have been studied around nonsteady trajectories [40, 15]. The
turnpike property is intimately related to asymptotic behaviour of the Hamilton-
Jacobi equation [24].
Note that for a general optimal control problem, even in absence of a turnpike
result, we can construct quasi-optimal turnpike strategies (see [22, Remark 7]) as
in fig. 1:
(1) in a short time interval [0, τ ] drive the state from the initial configuration
y0 to the turnpike y;
(2) in a long time arc [τ, T − τ ], remain on y;
(3) in a short final arc [T − τ, T ], use to control to match the required terminal
condition at time t = T .
In general, the corresponding control and state are not optimal, being not smooth.
However, they are easy to construct.
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 3
The proof of turnpike results is harder than the above construction. In fact, to
prove turnpike results, one has to ensure that there is not another time-evolving
strategy which is significantly better than the above one. In case the turnpike
property is verified, the above strategy is quasi-optimal.
Under the same assumptions required for the problem (2)-(1), for any given control
us ∈ L2 (Ω), there exists a unique state ys ∈ H2 (Ω) ∩ H10 (Ω) solution to (5) (see
e.g. [5]).
By adapting the techniques of [10], we have the existence of a global minimizer u
for (4). The corresponding optimal state is denoted by y. If the target is sufficiently
small in the uniform norm, the optimal control is unique (see [33, subsection 3.2]).
Furthermore any optimal control u = −qχω , where the pair (y, q) satisfies the
steady optimality system
−∆y + f (y) = −qχω in Ω
y = 0 on ∂Ω
(6)
−∆q + f ′ (y)q = β(y − z)χω0 in Ω
q=0 on ∂Ω.
The analysis in [33, section 3] leads to the following local result.
Theorem 0.1 (Porretta-Zuazua). Consider the control problem (5)-(4). There
exists δ > 0 such that if the initial datum and the target fulfill the smallness
condition
ky0 kL∞ (Ω) ≤ δ and kzkL∞ (ω0 ) ≤ δ,
there exists a solution y T , q T to the Optimality System
T
yt − ∆y T + f (y T ) = −q T χω in (0, T ) × Ω
T
y = 0
on (0, T ) × ∂Ω
y (0, x) = y0 (x)
T
in Ω
−qtT − ∆q T + f ′ (y T )q T = β(y T − z)χω0 in (0, T ) × Ω
q T = 0
on (0, T ) × ∂Ω
T
q (T, x) = 0 in Ω
satisfying for any t ∈ [0, T ]
kq T (t) − qkL∞ (Ω) + ky T (t) − ykL∞ (Ω) ≤ K [exp (−µt) + exp (−µ(T − t))] ,
where K and µ are T -independent.
We observe that the turnpike property is satisfied by one solution to the optimal-
ity system. Since our problem may be not convex, we cannot directly assert that
such solution of the optimality system is the unique minimizer (optimal control)
for (5)-(4).
Large initial data and small targets. We start by keeping the running target small,
but allowing the initial datum for (2) to be large.
Theorem 0.2. Consider the control problem (2)-(1). Let uT be a minimizer of
(1). There exists ρ > 0 such that for every initial datum y0 ∈ L∞ (Ω) and target z
verifying
kzkL∞ (ω0 ) ≤ ρ, (7)
we have
kuT (t)−ukL∞ (Ω) +ky T (t)−ykL∞ (Ω) ≤ K [exp (−µt) + exp (−µ(T − t))] , ∀t ∈ [0, T ],
(8)
the constants K and µ > 0 being independent of the time horizon T .
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 5
y
steady optimum
optimum
y0
y δ
t
O ts T
Suppose in addition y0 ∈ L∞ (Ω) ∩ H10 (Ω). Let uT be an optimal control for (2)-(1)
and let y T be the corresponding state, solution to (2), with control uT and initial
datum y0 . Then, the L2 norm of the time derivative of the optimal state is bounded
uniformly in T
ytT L2 ((0,T )×Ω) ≤ K, (10)
the constant K being T -independent.
The proof of Theorem 0.3, available in section 2, is based on the following rep-
resentation formula for the time-evolving functional (Lemma 2.2):
Z T
JT (u) = Js − ∆y(t, ·) + f (y(t, ·)) dt
0
Z Z
1 T 2
+ |yt (t, x)| dxdt
2 0 Ω
Z h i
1 2 2
+ k∇y(T, x)k + 2F (y(T, x)) − k∇y0 (x)k − 2F (y0 (x)) dx, (11)
2 Ω
where for a.e. t ∈ (0, T ), Js − ∆y(t, ·) + f (y(t, ·)) denotes the evaluation of the
steady functional Js at control us (·) := −∆y(t, ·) + f (y(t, ·)) and y is the state
associated to control u solving
yt − ∆y + f (y) = u
in (0, T ) × Ω
y=0 on (0, T ) × ∂Ω (12)
y(0, x) = y0 (x) in Ω.
Note that the above formula is valid for initial data y0 ∈ L∞ (Ω)∩H10 (Ω). However,
by the regularizing effect of (12) and the properties of the control problem, one can
reduce to the case of smooth initial data.
By means of (11), the functional JT can be seen as the sum of three terms:
RT
(1) 0 Js − ∆y(t, ·) + f (y(t, ·)) dt, which stands for the “steady” cost at a.e.
time t ∈ (0, T ) integrated over (0, T );
RT R 2
(2) 21 0 Ω |yt (t, x)| dxdt, which penalizes the time derivative of the func-
tional;
R h 2 2
i
(3) 21 Ω k∇y(T, x)k + 2F (y(T, x)) − k∇y0 (x)k − 2F (y0 (x)) dx, which de-
pends on the terminal values of the state.
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 7
Choose now an optimal control uT for (2)-(1) and plug it in (11). By Lemma 1.1,
the
R term
h i
1 2 2
2 Ω k∇y(T, x)k + 2F (y(T, x)) − k∇y 0 (x)k − 2F (y 0 (x)) dx can be estimated uni-
RT R 2
formly in the time horizon. At the optimal control, the term 0 Ω |yt (t, x)| dxdt
has to be small and the “steady” cost
RT
0
Js − ∆y(t, ·) + f (y(t, ·)) dt is the dominant addendum. This is the basic idea
of our approach to prove turnpike results for large targets.
The rest of the manuscript is organized as follows. In section 1 we prove The-
orem 0.2. In section 2, we prove Theorem 0.3. In section 3 we perform some
numerical simulations. The appendix is mainly devoted to the proof of the uniform
bound of the optima (Lemma 1.1) and a PDE result needed for Lemma 2.2.
the functional (1) admits a unique global minimizer uT . Furthermore, for every
ε > 0 there exists δε > 0 such that, if
kzkL∞ (ω0 ) ≤ δε and ky0 kL∞ (Ω) ≤ δε , (15)
where E and E b denote respectively the differential and algebraic Riccati operators
(see [33, equation (22)]) and ∆ : H10 (Ω) −→ H−1 (Ω) is the Dirichlet laplacian.
8 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
Lemma 1.4 (Global attactor property). Consider the control problem (2)-(1). Let
y0 ∈ L∞ (Ω) and z ∈ L∞ (ω0 ). Let uT be an optimal control for (2)-(1) and let y T
be the corresponding optimal state. For any ε > 0. there exist ρε = ρε (Ω, β, ε) and
τε = τε (Ω, β, y0 , ε), such that if kzkL∞(ω0 ) ≤ ρε and T ≥ τε ,
ky T (ts )kL∞ (Ω) ≤ δε , with the upper bound ts ≤ τε (23)
10 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
y
steady optimum
optimum
y0
y δ
t
O ts T
and
kuT (t)−ukL∞ (Ω) +ky T (t)−ykL∞ (Ω) ≤ ε [exp (−µ(t − ts )) + exp (−µ(T − (t − ts )))] , ∀t ∈ [ts , T ].
(24)
The constant µ is given by (17) and δε is given by (15).
1 β βµleb (ω0 )
kuk2L2 (ω) ≤ Js (u) ≤ Js (0) = kzk2L2 (ω0 ) ≤ kzk2L∞ (ω0 ) .
2 2 2
Now, any optimal control is of the form u = −qχω , where the pair (y, q) satisfies
the optimality system (6). Since n = 1, 2, 3, by elliptic regularity (see, e.g. [14,
Theorem 4 subsection 6.3.2]) and Sobolev embeddings (see e.g. [14, Theorem 6
subsection 5.6.3]), q ∈ C0 (Ω) and kqkL∞ (Ω) ≤ KkzkL∞(ω0 ) , where K = K (Ω).
This yields u ∈ C0 (ω) and
≥ (T − ts ) inf
2
Js − K 2 , (30)
L (Ω)
whence
ts K1 δε2 − inf
2
J s ≤ K2 . (32)
L (Ω)
Now, by (25), there exists ρε = ρε (Ω, β, ε) ≤ δε such that, if the target kzkL∞(ω0 ) ≤
K1 δε2
ρε , then inf L2 (Ω) Js ≤ 2 . This, together with (32), yields
K1 δε2
ts ≤ K2 ,
2
whence
K2
ts ≤ .
δε2
12 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
Set
K2
τε := + 1.
δε2
This finishes this step.
Step 3 Conclusion
By Step 2, for any T ≥ τε , there exists ts ≤ τε such that
ky T (ts )kL∞ (Ω) ≤ δε , (33)
T
where δε is given by (16). Now, by Bellman’s Principle of Optimality, u ↾(ts ,T ) is
optimal for (2)-(1), with initial datum y T (ts ) and target z. We took ρε ≤ δε , Then,
we also have
kzkL∞ (ω0 ) ≤ ρε ≤ δε . (34)
Then, we can apply Lemma 1.2, getting (24). This completes the proof.
1.2. Proof of Theorem 0.2. We now prove Theorem 0.2.
Proof of Theorem 0.2. By Lemma 1.4, there exists ρε (Ω, β, ε) > 0 such that if
kzkL∞ (ω0 ) ≤ ρε , (35)
any optimal control satisfies the turnpike estimate
kuT (t)−ukL∞ (Ω) +ky T (t)−ykL∞ (Ω) ≤ ε [exp (−µ(t − ts )) + exp (−µ(T − (t − ts )))] , ∀t ∈ [ts , T ].
(36)
Set
K0 := exp(µτ )K 1 + ky0 kL∞ (Ω) + δ ,
with µ > 0 the exponential rate defined in (17) and K is given by (13). Note that
K0 = K0 (Ω, β, y0 ) and, in particular, it is independent of the time horizon. By the
above definition, for every T > 0 and for each t ∈ [0, τε ] ∩ [0, T ]
kuT (t) − ukL∞ (Ω) + ky T (t) − ykL∞ (Ω) ≤ K0 exp(−µτ ) ≤ K0 exp(−µt). (37)
On the other hand, for t ≥ ts , (36) holds. Then, (8) follows.
Lemma 2.2. Consider R ythe functional introduced in (1)-(2) and its steady version
(5)-(4). Set F (y) := 0 f (ξ)dξ. Assume ω = Ω. Suppose the initial datum y0 ∈
L∞ (Ω) ∩ H10 (Ω). Then, for any control u ∈ L2 ((0, T ) × ω), we can rewrite the
functional as
Z T
JT (u) = Js − ∆y(t, ·) + f (y(t, ·)) dt
0
Z Z
1 T 2
+ |yt (t, x)| dxdt
2 0 Ω
Z h i
1
+ k∇y(T, x)k2 + 2F (y(T, x)) − k∇y0 (x)k2 − 2F (y0 (x)) dx, (39)
2 Ω
where, for a.e. t ∈ (0, T ), Js − ∆y(t, ·) + f (y(t, ·)) denotes the evaluation of
the steady functional Js at control us (·) := −∆y(t, ·) + f (y(t, ·)) and y is the state
associated to control u solution to
yt − ∆y + f (y) = u
in (0, T ) × Ω
y=0 on (0, T ) × ∂Ω (40)
y(0, x) = y0 (x) in Ω.
RT R 2
In (39), the term 0 Ω |yt (t, x)| dxdt emerges. This means that the time deriv-
ative of optimal states has to be small, whence the time-evolving optimal strategies
for (1)-(2) are in fact close to the steady ones.
The proof of Lemma 2.2 is based on the following PDE result, which basically
asserts that the squared right hand side of the equation
(
yt − ∆y + f (y) = h in (0, T ) × Ω
y=0 on (0, T ) × ∂Ω
can be written as
2 2 2
khkL2 ((0,T )×Ω) = kyt kL2 ((0,T )×Ω) + k−∆y + f (y)kL2 ((0,T )×Ω) + remainder, (41)
where the remainder depends on the value of the solution at times t = 0 and t = T .
Lemma 2.3. Let Ω be a bounded open set of Rn , n ∈ {1, 2, 3}, withRC∞ boundary.
y
Let f ∈ C3 (R; R) be nondecreasing, with f (0) = 0. Set F (y) := 0 f (ξ)dξ. Let
∞ 1 ∞
y0 ∈ L (Ω) ∩ H0 (Ω) be an initial datum and let h ∈ L ((0, T ) × Ω) be a source
term. Let y be the solution to
yt − ∆y + f (y) = h
in (0, T ) × Ω
y=0 on (0, T ) × ∂Ω (42)
y(0, x) = y0 (x) in Ω.
14 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
By the above equality and the definition of Js (5)-(4), formula (39) follows.
The last Lemma needed to prove Theorem 0.3 is the following one.
Lemma 2.4. Consider the time-evolution control problem (2)-(1) and its steady
version (5)-(4). Assume ω = Ω. Arbitrarily fix y0 ∈ L∞ (Ω) ∩ H10 (Ω) an initial
datum and z ∈ L∞ (ω0 ) a target. Let uT be an optimal control for (2)-(1) and let
y T be the corresponding state, solution to (2), with control uT and initial datum y0 .
Then,
(1) there exists a T -independent constant K such that
(2) the L2 norm of the time derivative of the optimal state is bounded uniformly
in T
ytT L2 ((0,T )×Ω) ≤ K, (48)
with K independent of T > 0.
Proof of Lemma 2.4. Step 1 Proof of
Z T Z Z h i
T
1 2 1 2
inf J T = J T u ≥ T inf Js + ytT (t, x) dxdt− k∇y0 (x)k + 2F (y0 (x)) dx.
L2 ((0,T )×ω) L2 (Ω) 2 0 Ω 2 Ω
whence
Z T Z Z h i
T
1 2 1 2
inf J T = J T u ≥ T inf Js + ytT (t, x) dxdt− k∇y0 (x)k + 2F (y0 (x)) dx.
L2 ((0,T )×ω) L2 (Ω) 2 0 Ω 2 Ω
(51)
Step 2 Conclusion
On the one hand, by Lemma 2.1, we have
inf JT − T inf Js ≤ K, (52)
L2 ((0,T )×ω) 2 L (Ω)
the constant K being independent of T > 0. On the other hand, by (51), we get
inf JT − T inf Js ≥ −K. (53)
L2 ((0,T )×ω) 2L (Ω)
whence Z Z
T
1 2
ytT (t, x) dxdt ≤ K,
2 0 Ω
as required.
We are now ready to prove Theorem 0.3.
Proof of Theorem 0.3. Estimate (10) follows directly from Lemma 2.4 (2.).
It remains to prove the convergence of the averages. By the regularizing effect of
the state equation (2) and Lemma 1.1, we can reduce to the case of initial datum
y0 ∈ L∞ (Ω) ∩ H10 (Ω). By Lemma 2.4, we have
Then,
1 1
inf JT − inf Js = inf JT − T inf Js
T L2 ((0,T )×ω) 2 L (Ω) T L2 ((0,T )×ω) 2
L (Ω)
K
≤ −→ 0,
T T →+∞
as required.
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 17
3. Numerical simulations
This section is devoted to a numerical illustration of Theorem 0.2. Our goal is
to check that turnpike property is fulfilled for small target, regardless of the size of
the initial datum.
We deal with the optimal control problem
Z Z 1 Z Z
1 T 2 2 β T 1
min J T (u) = |u| dxdt + |y − z|2 dxdt,
u∈L2 ((0,T )×(0, 21 )) 2 0 0 2 0 0
where:
3
yt − yxx + y = uχ(0, 12 )
(t, x) ∈ (0, T ) × (0, 1)
y(t, 0) = y(t, 1) = 0 t ∈ (0, T )
y(0, x) = y0 (x) x ∈ (0, 1).
We choose as initial datum y0 ≡ 10 and as target z ≡ 1.
We solve the above semilinear heat equation by using the semi-implicit method:
Yi+1 − Yi − ∆Y 3
i+1 + Yi = Ui χ(0, 12 ) i = 0, . . . , Nt − 1
∆t
Y = y ,
0 0
where Yi and Ui denote resp. a time discretization of the state and the control.
The optimal control is determined by a Gradient Descent method, with constant
stepsize. The optimal state is depicted in fig. 4.
10
0
0 0.5 1 1.5 2
t
Figure 4. graph of the function t −→ ky T (t)kL∞ (Ω) (in blue) and
kykL∞ (Ω) (in red), where y T denotes an optimal state, whereas y
stands for an optimal steady state.
18 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
Conjecture 4.1. Consider the control problem (2)-(1). Take any initial datum
y0 ∈ L∞ (Ω) and any target z ∈ L∞ (ω0 ). Let uT be a minimizer of (1). There
exists an optimal pair (u, y) for (5)-(4) such that
kuT (t)−ukL∞ (Ω) +ky T (t)−ykL∞ (Ω) ≤ K [exp (−µt) + exp (−µ(T − t))] , ∀t ∈ [0, T ],
(55)
the constants K and µ > 0 being independent of the time horizon T .
In [30] special large targets z are constructed, such that the optimal control for
the steady problem (5)-(4) is not unique. For those targets, a question arises: if
the turnpike property is satisfied, which minimizer for (5)-(4) attracts the optimal
solutions to (2)-(1)?
Note that, in the context of internal control, the counterexample to uniqueness in
[30] is valid in case of local control ω ( Ω.
Generally speaking a further investigation is required for the linearized optimality
system determined in [33, subsection 3.1]. We introduce the problem. As in (3),
consider the optimality system for (2)-(1)
ytT − ∆y T + f (y T ) = −q T χω in (0, T ) × Ω
T
y = 0
on (0, T ) × ∂Ω
y T (0, x) = y (x) in Ω
0
−qtT − ∆q T + f ′ (y T )q T = β(y T − z)χω0
in (0, T ) × Ω
qT = 0 on (0, T ) × ∂Ω
T
q (T, x) = 0 in Ω.
(56)
Pick any optimal pair (u, y) for (5)-(4). By the first order optimality conditions,
the steady optimal control reads as u = −qχω , with
−∆y + f (y) = −qχω in Ω
y = 0 on ∂Ω
(57)
−∆q + f ′ (y)q = β(y − z)χω0 in Ω
q=0 on ∂Ω.
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 19
Z #
t
+K exp(−µ(t − s − ε))kh(s)kL∞ ds
(⌊ Tt ⌋−1)T
t "Z # 21 "Z # 21
⌊X
T
⌋
iT iT
≤ K exp(−µ(t − ε − iT )) exp(−2µ(iT − s))ds kh(s)k2L∞ ds
(i−1)T (i−1)T
i=1
"Z # 21 "Z # 12
t t
+K exp(−2µ(t − s − ε))ds kh(s)k2L∞ ds
( ⌊ Tt ⌋−1)T ( ⌊ Tt ⌋−1)T
⌊ Tt ⌋
X
≤ K exp(−µ(t − ε − iT ))khkL2 ((i−1)T ,iT ;L∞ (Ω))
i=1
i
+khkL2 (⌊ t ⌋,t;L∞ (Ω))
T
"+∞
X
≤ K exp(−µ(t − ε − iT )) max khkL2 (((i−1)T ,iT );L∞ (Ω))
i=1,...,N
i=1
i
+khkL2 (⌊ t ⌋,t;L∞ (Ω))
T
h i
≤ K khkL2 ((i−1)T ,iT ;L∞ (Ω)) + khkL2 (N T ,T ;L∞ (Ω)) . (77)
as desired.
Remark A.3. Lemma A.1 can be applied to a bounded solution y to (2). Indeed,
set
f (y(t, x)) y(t, x) 6= 0
cy (t, x) := y(t, x)
′
f (0) y(t, x) = 0.
Since f is increasing and f (0) = 0, we have cy ≥ 0. Hence, we are in position to
apply Lemma A.1, with potential cy .
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 23
yT
y0
t
O t1 t2 T
and
ku − ûkL∞ ((0,T )×ω) ≤ Kky0 − ŷ(0)kL∞ (Ω) , (80)
where the constant K depends only on Ω, f , ω and R.
The proof of the above lemma is classical (see, e.g. [16, 3]).
In order to prove Lemma 1.1, we introduce an optimal control problem, with
specified terminal states. Let t1 < t2 . Let ŷ be a target trajectory, bounded
solution to (2) in (t1 , t2 ), i.e.
ŷt − ∆ŷ + f (ŷ) = ûχω
in (t1 , t2 ) × Ω
ŷ = 0 on (t1 , t2 ) × ∂Ω (81)
ŷ(t1 , x) = ŷ0 (x) in Ω,
with initial datum ŷ0 ∈ L∞ (Ω) and control û ∈ L∞ ((t1 , t2 ) × ω).
For any control u ∈ L2 ((t1 , t2 ) × ω), the corresponding state y is the solution to:
yt − ∆y + f (y) = uχω
in (t1 , t2 ) × Ω
y=0 on (t1 , t2 ) × ∂Ω (82)
y(t1 , x) = ŷ(t1 , x) in Ω.
We introduce the set of admissible controls
Uad := u ∈ L2 ((t1 , t2 ) × ω) | y(t2 , ·) = ŷ(t2 , ·) .
By definition, û ∈ Uad . Hence, Uad 6= ∅. We consider the optimal control problem
Z Z Z Z
1 t2 2 β t2
min Jt1 ,t2 (u) = |u| dxdt + |y − z|2 dxdt, (83)
u∈Uad 2 t1 ω 2 t1 ω0
with running target z ∈ L∞ (ω0 ). By the direct methods in the calculus of variations,
the functional Jt1 ,t2 admits a global minimizer in the set of admissible controls Uad .
We now bound the minimal value of the functional (83), showing that the mag-
nitude of the control û in the time interval [t1 , t2 − TR ] can be neglected when
estimating the cost of controllability. Namely, what matters is the norm of û in the
final time interval [t2 − TR , t2 ].
Lemma B.2. Consider the optimal control problem (82)-(83), with t2 − t1 ≥ 2TR .
Then,
h
min Jt1 ,t2 ≤ K kŷ(t1 , ·)k2L∞ (Ω) + (t2 − t1 )kzk2L∞ (ω0 )
Uad
i
+kûk2L∞ ((t2 −TR ,t2 )×ω) + kŷ(t2 − TR , ·)k2L∞ (Ω) ,
(84)
the constant K being independent of the time horizon t2 − t1 ≥ 2TR .
Proof of Lemma B.2. Step 1 A quasi-optimal control
To get the desired bound, we introduce a quasi-optimal control u for (82)-(83),
linking ŷ(t1 , ·) and y T (t2 , ·). The control strategy is the following
(1) employ null control for time t ∈ [0, t2 − TR ];
(2) match the final condition by control w, for t ∈ [t2 − TR , t2 ].
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 25
Let us denote by y 0 the solution to the semilinear problem with null control
0
yt − ∆y 0 + f y 0 = 0 in (t1 , t2 ) × Ω
y0 = 0 on (t1 , t2 ) × ∂Ω (85)
0
y (t1 , x) = ŷ(t1 , x) in Ω.
By Lemma B.1, there exists w ∈ L∞ ((t2 − TR , t2 ) × ω), steering (82) from y 0 (t1 , ·)
to ŷ (t2 , ·) in the time interval (t2 − TR , t2 ), with estimate
kw − ûkL∞ ((t2 −TR ,t2 )×ω) ≤ K y 0 (t2 − TR ) − ŷ (t2 − TR ) L∞ (Ω)
, (86)
Then, set
(
0 in (0, t2 − TR )
u := (87)
w in (t2 − TR , t2 ) .
By (86), we can bound the norm of the control,
h i
kukL∞ ((t1 ,t2 )×ω) ≤ K y 0 (t2 − TR ) − ŷ (t2 − TR ) L∞ (Ω)
+ kûkL∞ ((t2 −TR ,t2 )×ω) .
(88)
Step 2 Conclusion
Consider the control u introduced in (87) and let y be the solution to (82), with
initial datum y0 and control u. Then, we have
min Jt1 ,t2 ≤ Jt1 ,t2 (u)
Uad
Z t2 Z Z t2 Z
1 β
= |u|2 dxdt + |y − z|2 dxdt
2 t1 ω 2 t1 ω0
Z t2 Z Z t2 Z
1 β
= |w|2 dxdt + |y − z|2 dxdt
2 t2 −TR ω 2 t1 ω0
Z t2 Z Z t2 Z Z t2 Z
1 2 2
≤ |w| dxdt + β |y| dxdt + β |z|2 dxdt
2 t2 −TR ω t1 ω0 t1 ω0
Z t2 Z Z t2 Z
1
≤ |w|2 dxdt + β |y|2 dxdt + K(t2 − t1 )kzk2L∞ (ω0 )
2 t2 −TR ω t1 ω0
h
≤ K kwk2L∞ ((t2 −TR ,t2 )×ω) + (t2 − t1 )kzk2L∞ (ω0 )
Z t2 −TR #
0 2 2
+β y (t, ·) L2 (Ω) dt + kykL2 ((t2 −TR ,t2 )×Ω)
t1
h 2 2
≤ K y 0 (t2 − TR , ·) − ŷ (t2 − TR , ·) L∞ (Ω)
+ kûkL∞ ((t2 −TR ,t2 )×ω)
(89)
i
+(t2 − t1 )kzk2L∞ (ω0 ) + kŷ(t1 , ·)k2L∞ (Ω)
h
≤ K kŷ(t1 , ·)k2L∞ (Ω) + (t2 − t1 )kzk2L∞ (ω0 ) (90)
i
+kûk2L∞ ((t2 −TR ,t2 )×ω) + kŷ(t2 − TR , ·)k2L∞ (Ω) ,
(91)
where in (89) and in (90) we have employed the dissipativity of (85). This concludes
the proof.
26 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
B.2. A mean value result for integrals. In the following Lemma we estimate
the value of a function at some point, with the value of its integral.
Lemma B.3. Let h ∈ L1 (c, d) ∩ C0 (c, d), with −∞ < c < d < +∞. Assume h ≥ 0
a.e. in (c, d). Then,
(1) there exists tc ∈ c, c + d−c
3 , such that
Z d
3
h(tc ) ≤ hdt;
d−c c
(2) there exists td ∈ d − d−c3 , d , such that
Z d
3
h(td ) ≤ hdt.
d−c c
3
Rd
Proof of Lemma B.3. By contradiction, for any t ∈ c, c + d−c 3 , h(t) > d−c c
hds.
Then, we have
Z d Z c+ d−c Z c+ d−c " Z d # Z d
3 3 3
hdt ≥ hdt > hds dt = hds,
c c c d−c c c
B.3. Proof of Lemma 1.1. We are now in position to prove Lemma 1.1.
Set
IT := i ∈ {1, . . . , NT } the estimate (92) is not verified . (93)
On the other hand, for every i ∈ IT , we seek to prove the existence of a constant
Kθ = Kθ (Ω, f, R, θ), possibly larger than θ, such that
Z i3TR h i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) . (94)
(i−1)3TR
We start by considering the union of time intervals, where (92) is not verified
[
WT := [(i − 1)3TR , i3TR ].
i∈IT
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 27
The above set is made of a finite union of disjoint closed intervals, namely there
exists a natural M and {(aj , bj )}j=1,...,M , such that
bj < aj+1 , j = 1, . . . , M − 1
and [ [
WT = [(i − 1)3TR , i3TR ] = [aj , bj ].
i∈IT j=1,...,M
ŷ := yT
t1 := 0
t2 := td .
where Kθ = Kθ (Ω, f, R, θ) and γ = γ(Ω, f, R). In our case the target trajectory for
(82)-(83) is the state y T associated to an optimal control uT for (2)-(1). Then, by
definition of (82)-(83),
Jt1 ,t2 uT ≤ Jt1 ,t2 (u), ∀ u ∈ Uad .
Hence, by (98),
Jt1 ,t2 uT ≤ min Jt1 ,t2
Uad
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) + γtd kzk2L∞ (ω0 ) . (99)
where in the last inequality we have used (96), which yields b1 > td − 3TR . By the
above inequality, Lemma A.1, (97) and (99),
θ(td − 3TR ) h i
ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
6TR
Z b1 h i Z b1 h i
1 2 2
+ T T
kq (t)kL∞ (Ω) + ky (t)kL∞ (Ω) dt ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
2 0 0
h i
≤ K Jt1 ,t2 uT + ky0 k2L∞ (Ω) + kq T (td )k2L∞ (Ω)
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) + γtd kzk2L∞ (ω0 ) ,
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 29
whence
Z b1 h i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
0
θ(td − 3TR )
+2 γtd − kzk2L∞ (ω0 )
6TR
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
θ
+2td γ − kzk2L∞ (ω0 ) .
6TR
and
Z bj +3TR h i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ θ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) . (101)
bj
By parabolic regularity in the optimality system (3) in the interval [ta,c , ta,d ], we
have
2 2
n
y T L∞ ((ta,c ,t )×Ω) + q T L∞ ((ta,c ,t )×Ω) ≤ K kq T (ta,d )k2L∞ (Ω) + ky T (ta,c )k2L∞ (Ω)
a,d a,d
+kzk2L∞(ω0 )
Z aj )
h i
T 2 T 2
+ kq (t)kL∞ (Ω) + ky (t)kL∞ (Ω) dt
aj −3TR
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) . (103)
0 < tb,c < bj + TR and bj + 2TR < tb,d < bj + 3TR , (104)
such that
Z bj +3TR h i
1
kq T (tb,c )k2L∞ (Ω) + ky T (tb,c )k2L∞ (Ω) ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
TR bj
θ h i
≤ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
TR
and
Z bj +3TR h i
1
kq T (tb,d )k2L∞ (Ω) + ky T (tb,d )k2L∞ (Ω) ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
TR bj
θ h i
≤ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) .
TR
By parabolic regularity in the optimality system (3) in the interval [tb,c , tb,d ], we
have
2 2
n
y T L∞ ((t ,t )×Ω) + q T L∞ ((t ,t )×Ω) ≤ K kq T (tb,d )k2L∞ (Ω) + ky T (tb,c )k2L∞ (Ω)
b,c b,d b,c b,d
Z bj +3TR
+kzk2L∞ (ω0 ) + kq T (t)k2L∞ (Ω) dt
bj
Z )
bj +3TR
+ ky T (t)k2L∞ (Ω) dt
bj
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) . (105)
ŷ := y T
t1 := ta,c
t2 := tb,d .
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 31
where Kθ = Kθ (Ω, f, R, θ) and γ = γ(Ω, f, R). In our case the target trajectory for
(82)-(83) is the state y T associated to an optimal control uT for (2)-(1). Then, by
definition of (82)-(83),
Jt1 ,t2 uT ≤ Jt1 ,t2 (u), ∀ u ∈ Uad .
Hence, by (106),
Jt1 ,t2 uT ≤ min Jt1 ,t2
Uad
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) + γ(tb,d − ta,c )kzk2L∞ (ω0 ) .
θ(bj − aj ) h i
= ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
3TR
θ(tb,d − ta,c − 6TR ) h i
> ky0 k2L∞ (Ω) + kzk2L∞ (ω
(107)
0)
,
3TR
where in the last inequality we have used (102) and (104) to get
bj − aj > tb,d − ta,c − 6TR . By the above inequality, Lemma A.1 and (107),
θ(tb,d − ta,c − 6TR ) h i
ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
6TR
Z Z bj h
1 bj h T 2 2
i i
+ T
kq (t)kL∞ (Ω) + ky (t)kL∞ (Ω) dt ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
2 aj aj
h i
≤ K Jt1 ,t2 uT + ky0 k2L∞ (Ω)
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
+γ(tb,d − ta,c )kzk2L∞ (ω0 ) ,
whence
Z bj h i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
aj
(tb,d − ta,c − 6TR )
+2 γ(tb,d − ta,c ) − θ kzk2L∞ (ω0 )
6TR
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
θ
+2(tb,d − ta,c ) γ − kzk2L∞ (ω0 ) .
6TR
32 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
We apply Lemma B.3 in [aj − 3TR , aj ]. To this end, set c := aj − 3TR , d := aj and
h(t) := kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) . By Lemma B.3, there exist tc ,
aj − 3TR < tc < aj − 2TR (109)
such that
Z aj h i
1
kq T (tc )k2L∞ (Ω) + ky T (tc )k2L∞ (Ω) ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
TR aj −3TR
θ h i
≤ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) . (110)
TR
We introduce the control
(
∗
uT in (0, tc )
u :=
0 in (tc , T )
Let y be the solution to (2), with initial datum y0 and control u and y ∗ be the
solution to (2), with initial datum y0 and control u∗ . By definition of minimizer,
we have
JT u T ≤ JT (u∗ )
Z Z Z Z
1 T ∗ 2 β T
≤ |u | dxdt + |y ∗ − z|2 dxdt
2 0 ω 2 0 ω0
Z Z Z Z
1 tc 2 β tc 2
= uT dxdt + y T − z dxdt
2 0 ω 2 0 ω0
Z Z
β T
+ |y ∗ − z|2 dxdt,
2 tc ω0
whence,
Z Z Z Z Z Z
1 T 2 β T
2 β T
uT dxdt + y T − z dxdt ≤ |y ∗ − z|2 dxdt
2 tc ω 2 tc ω0 2 tc ω0
h i
≤ K ky(tc )k2L∞ (Ω) + (T − tc )kzk2L∞ (ω0 )
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
+γ(T − tc )kzk2L∞ (ω0 ) ,
THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL 33
θ(bj − aj ) h i
= ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
3TR
θ(T − tc − 6TR ) h i
> ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) .
3TR
By the above inequality and Lemma A.1 and (111),
θ(T − tc − 6TR ) h i
ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
6TR
Z Th i Z Th i
1
+ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt
2 aj aj
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
+γ(T − tc )kzk2L∞ (ω0 ) ,
whence
Z Th i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
aj
(T − tc − 6TR )
+2 γ(T − tc ) − θ kzk2L∞ (ω0 )
6TR
h i
≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 )
θ
+2(T − tc ) γ − kzk2L∞ (ω0 ) .
6TR
If θ is large enough, we have γ − 6TθR < 0. Hence, choosing θ large enough, we
obtain the estimate
Z Th i h i
kq T (t)k2L∞ (Ω) + ky T (t)k2L∞ (Ω) dt ≤ Kθ ky0 k2L∞ (Ω) + kzk2L∞ (ω0 ) .
aj
Step 2 Conclusion
The proof is concluded, with an application of Lemma A.2 to the state and the
adjoint equation in (3).
34 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL
Proof of lemma 2.1. Let u ∈ L∞ (Ω) be an optimal control for (5)-(4) and let y be
the corresponding solution to (5) with control u. Following step 1 of the proof of
Lemma 1.4, we obtain u ∈ C0 (ω) and
kukL∞ (ω) ≤ KkzkL∞ (ω0 ) . (112)
Step 1 Proof of
JT (u) − T inf
2
Js ≤ K,
L (Ω)
with K independent of T
Let ŷ be the solution to
ŷt − ∆ŷ + f (ŷ) = uχω
in (0, T ) × Ω
ŷ = 0 on (0, T ) × ∂Ω (113)
ŷ(0, x) = y0 (x) in Ω.
Set η := ŷ − y solution to
ηt − ∆η + f (ŷ) − f (y) = 0
in (0, T ) × Ω
η=0 on (0, T ) × ∂Ω (114)
η(0, x) = y0 (x) − y(x) in Ω.
By multiplying (114) by η, since f is increasing, for any t ∈ [0, T ] we have
kŷ(t, ·) − ykL2 (Ω) ≤ exp(−λ1 t) ky0 − ykL2 (Ω) , (115)
= T inf
2
Js + JT (u) − T inf
2
Js
L (Ω) L (Ω)
≤ T inf
2
Js + K.
L (Ω)
References
[1] G. Allaire, A. Münch, and F. Periago, Long time behavior of a two-phase optimal design
for the heat equation, SIAM Journal on Control and Optimization, 48 (2010), pp. 5333–5356.
[2] B. D. Anderson and P. V. Kokotovic, Optimal control problems over large time intervals,
Automatica, 23 (1987), pp. 355–363.
[3] S. Aniţa and D. Tataru, Null controllability for the dissipative semilinear heat equation,
Applied Mathematics & Optimization, 46 (2002), pp. 97–105.
[4] V. Barbu, Nonlinear differential equations of monotone types in Banach spaces, Springer
Science & Business Media, 2010.
[5] L. Boccardo and G. Croce, Elliptic Partial Differential Equations: Existence and Regu-
larity of Distributional Solutions, De Gruyter Studies in Mathematics, De Gruyter, 2013.
[6] H. Brezis, Functional Analysis, Sobolev Spaces and Partial Differential Equations, Univer-
sitext, Springer New York, 2010.
[7] P. Cardaliaguet, J.-M. Lasry, P.-L. Lions, and A. Porretta, Long time average of mean
field games., Networks & Heterogeneous Media, 7 (2012).
[8] P. Cardaliaguet, J.-M. Lasry, P.-L. Lions, and A. Porretta, Long time average of mean
field games with a nonlocal coupling, SIAM Journal on Control and Optimization, 51 (2013),
pp. 3558–3591.
[9] D. Carlson, A. Haurie, and A. Leizarowitz, Infinite Horizon Optimal Control: Deter-
ministic and Stochastic Systems, Springer Berlin Heidelberg, 2012.
[10] E. Casas and M. Mateos, Optimal Control of Partial Differential Equations, Springer In-
ternational Publishing, Cham, 2017, pp. 3–59.
[11] E. Casas and F. Tröltzsch, Second order analysis for optimal control problems: improving
results expected from abstract theory, SIAM Journal on Optimization, 22 (2012), pp. 261–279.
[12] T. Damm, L. Grüne, M. Stieler, and K. Worthmann, An exponential turnpike theorem
for dissipative discrete time optimal control problems, SIAM Journal on Control and Opti-
mization, 52 (2014), pp. 1935–1957.
[13] R. Dorfman, P. Samuelson, and R. Solow, Linear Programming and Economic Analysis,
Dover Books on Advanced Mathematics, Dover Publications, 1958.
[14] L. C. Evans, Partial differential equations, vol. 19 of Graduate Studies in Mathematics,
American Mathematical Society, Providence, RI, second ed., 2010.
[15] T. Faulwasser, K. Flaßkamp, S. Ober-Blöbaum, and K. Worthmann, Towards velocity
turnpikes in optimal control of mechanical systems, arXiv preprint arXiv:1907.01786, (2019).
[16] E. Fernández-Cara and E. Zuazua, Null and approximate controllability for weakly blowing
up semilinear heat equations, Annales de l’Institut Henri Poincare (C) Non Linear Analysis,
17 (2000), pp. 583 – 616.
[17] L. Grüne and R. Guglielmi, Turnpike properties and strict dissipativity for discrete time
linear quadratic optimal control problems, SIAM Journal on Control and Optimization, 56
(2018), pp. 1282–1302.
[18] L. Grüne and M. A. Müller, On the relation between strict dissipativity and turnpike
properties, Systems & Control Letters, 90 (2016), pp. 45–53.
[19] L. Grüne, M. Schaller, and A. Schiela, Sensitivity analysis of optimal control for a
class of parabolic pdes motivated by model predictive control, SIAM Journal on Control and
Optimization, 57 (2019), pp. 2753–2774.
[20] L. Grne, M. Schaller, and A. Schiela, Exponential sensitivity and turnpike analysis
for linear quadratic optimal control of general evolution equations, Journal of Differential
Equations, (2019).
[21] A. Haurie, Optimal control on an infinite time horizon: the turnpike approach, Journal of
Mathematical Economics, 3 (1976), pp. 81–102.
[22] V. Hernández-Santamarı́a, M. Lazar, and E. Zuazua, Greedy optimal control for ellip-
tic problems and its application to turnpike problems, Numerische Mathematik, 141 (2019),
pp. 455–493.
[23] A. Ibañez, Optimal control of the lotka–volterra system: turnpike property and numerical
simulations, Journal of biological dynamics, 11 (2017), pp. 25–41.
[24] H. Kouhkouh, E. Zuazua, P. Carpentier, and F. Santambrogio, Dynamic programming
interpretation of turnpike and hamilton-jacobi-bellman equation, (2018). Available online:
http://bit.ly/2R7soRx.
36 THE TURNPIKE PROPERTY IN SEMILINEAR CONTROL