Professional Documents
Culture Documents
x = Ax + Bu
( x(t0 ) = x0 )
J ( x0 , u) = xtTf Pxt f +
tf
t0
( x Qx + u Ru ) dt
T
(1)
( x1 (0) = x10 )
x2 = a21 x1 + a22 x2 + b2 u2
( x2 (0) = x20 )
(2)
(q x
ij
2
j
(3)
The constant real numbers qii>0 and qij0 (i, j=1, 2; ij).
Besides the constraint of continuity, each player should
construct his/her control strategy by linear state feedback
method as described by (4)
ui (t ) = K i x(t ) = ( ki1 ki 2 ) ( x1 (t ) x2 (t ) ) (i = 1, 2)
T
(4)
(5)
J i ( t , x(t ), u1 , u2 ) =
(q x
ij
2
j
+ qii xi2 ) d
( t [0, +])
(6)
(u1 U1 )
J 2 ( x0 , u , u ) J 2 ( x0 , u , u2 ) +
(u2 U 2 )
*
1
*
2
*
1
(7)
H i xi = 2 ( a ji + (u j xi )b j ) i x j + 2qii xi = 0 ,
(u1 U1 )
xi = qii1 ( a ji + (u j xi )b j ) i x j
(u2 U 2 )
(8)
Pi :
2
2
J i ( x0 , u1 , u2 ) = 0 ( qij x j + qii xi ) dt
(9-1, 2)
(10-1, 2)
xi
+ x ji ( a ji + b j (u j xi ) ) qii1 ( a ji + (u j xi )b j ) i x j (12-1, 2)
+ qij x 2j = 0
(13)
(14-1, 2)
i.e.
xi + qii1 ( a ji + (u j xi )b j ) i x j
(i, j = 1, 2; i j )
= ( aii xi + aij x j + bi ui )
+ qii1 ( a ji + (u j xi )b j ) i ( a jj x j + a ji xi + b j u j )
= li xi + qii1 ( a ji + (u j xi )b j ) i x j
2
i
(15-1, 2)
(16)
(17-1)
(17-2)
(a
11
the
(11-1, 2)
1 2 2
22 12 2
then
b1k11 + q111a211a21 + l1 ) x1
1 T
+ ( a22 b2 k22 + q22
a12 2 a12 + l2 ) x2 = 0
(18-1)
(18-2)
( t 0; s (0) = x (0) + q
a jii x j (0) )
(19-1)
si (t ) = e Li t si (0)
(19-2)
(20-1)
a22 b2 k22 + q a a + l2 = 0
(20-2)
1
11 21 1
1
1 11 21 1
1
22 12 2
1
2 22 12 2
1
22 12 2 12
theorem
V (t ) = x + x
2
1 2
(22)
{( x , x ) : s
si i
= xi + qii1a ji i x j i
(23)
'x
x1
x2
{( x , x ) : x , > 0}
{( x , x ) : x , x , > 0}
{( x , x ) : x , > 0}
{( x , x ) : x , > 0}
1
(32)
x(t ) si i (t T )
(33)
i =1
(34)
i =1,2
ln ( s (0) )
i
i
where Ti =
0,
( min l ) ,
i =1,2
si (0) 0
else
(21)
1
ii
From (17), (19) and (20), we need to find l1>0, l2>0, 1>0,
2>0, k11 and k22. For this problem, we have the following
2
2 1
1 2 2 2
s2 a12 2 x1 + x1 2 a12 s2 q22
a12 2 x1 < 0
2 2 2
s1 a211 x2 + x21a21 s1 q111a21
1 x21 < 0
( x (t )
1
x1
, x2 (t ) x 2 , t T
x x
(24)
(25)
(26)
s1 = x1 + q111a211 x2 1
(27)
= min
i qii1 i a ji 2
(28)
xi (t ) = qii1a jii x j (t ) + si (t )
(30)
By (14) we have
(29)
Then we have
'x x
x x
(31)
= qii1a ji i x j (t ) + e Li t si (0)
(t 0)
(35)
J i (T i , x0 , u1 , u2 ) =
T i
(q x
ij
+ qii xi2 ) dt
2
j
2
= qij x 2j + qii ( si qii1a jii x j ) dt
T i
(36)
part 1
T i
2
i
s (t )dt
part 2
we have
Wi = 2 i x j x j = 2 ( q ji qii1 i2 a 2ji ) i x 2j + 2 i a ji si x j
(i = 1, 2)
(37)
1 x22 (T ) + J1 (T , x0 , u1 = k11 x1 , u2 = k 22 x2 ) +
Also
T i
) ( 2l )
i
(39)
part 2
T i
(q x
ij
2
j
+ qii xi2 ) dt
) ( 2l )
i
(i = 1, 2)
) ( 2l ) (i = 1, 2)
i
nd
In order to obtain
(i = 1, 2)
(41)
)}
(42)
(43)
i x 2j (T ) J i (T , x0 , u1 = k11 x1 , u2 = k22 x2 )
i x (T ) +
2
j
4. NUMERICAL EXAMPLE
(40)
Define
J i ( x0 , u1 , u2 )
(46)
(45)
( x1 (T ) = x1 (T ) )
( x2 (T ) = x2 (T ) )
(i = 1, 2)
(44)
(47)
(48)
100
80
60
40
20
0.2
0.4
0.6
0.8
1
1.2
time(sec)
1.4
1
1.2
time(sec)
1.4
1.6
1.8
20
0
-20
-40
-60
-80
0.2
0.4
0.6
0.8
1.6
1.8
-200
-400
-600
0.2
0.4
0.6
0.8
1
1.2
time(sec)
1.4
1.6
1.8
0.2
0.4
0.6
0.8
1
1.2
time(sec)
1.4
1.6
1.8
1500
1000
500
0
-500
0.06
0.04
T (sec)
0.05
0.03
0.02
0.01
0.2
0.4
0.6
0.8
1.2
1.4
1.6
1.8
x 20
50
-50
-100
-100
-50
0
x 10
50
100
6. CONCLUSIONS
For 2nd order two-player nonzero-sum games where each
player has a control-free cost functional quadratic in system
states, we proposed a new equilibrium concept: asymptotic Nash equilibrium, to accommodate the constraint that each
players control strategy should be continuous linear state
feedback. Based on each players singular control problem,
the asymptotic -Nash equilibrium, which is attained by
partial state feedback and can also guarantee stability of the
decentralized closed-loop system, was obtained from a group
of algebraic equations. A numerical example illustrated the
various aspects of the theories in this paper.
REFERENCES
Amato, F. and Pironti, A. (1994) A note on singular zero-sum linear quadratic
differential games. In: Proceedings of the 33rd Conference on Decision and
Control, 1533-1535. Lake Buena Vista, FL.
Butman, S. (1968) A method for optimizing control-free costs in systems with
linear controllers. IEEE Trans. Automat. Contr., 13(5): 554-556.
Ho, Y. (1972) Linear stochastic singular control problems. J. Optim. Theory Appl.,
9(1): 24-31.
Kamneva, L. V. (2003) The sufficient conditions of stability for the value function
of a differential game in terms of singular points. J. Appl. Maths Mechs.,
67(3): 329-343.
Kelley, H. J. (1965) A transformation approach to singular subarcs in optimal
trajectory and control problems. SIAM J. Control, Ser. A, 2(2): 234-240.
Melikyan, A. A. (2001) Singular characteristics of first order PDEs in optimal
control and differential games. J. Mathematical Sciences, 103(6): 745-755.
Olsder, G. J. (2001) On open- and closed-loop bang-bang control in nonzero-sum
differential games. SIAM J. Control Optim., 40(4): 1087-1106.
Sarma, I. G. and Prasad, U. R. (1972) Switching surfaces in N-person differential
games. J. Optim. Theory Appl., 10(3): 160-177.
Speyer, J. L. and Jacobson, D. H. (1971) Necessary and sufficient conditions for
optimality for singular control problems; a transformation approach. J. Math.
Anal. Appl., 33(1): 163-187.
Starr, A. W. and Ho, Y. C. (1969 a) Nonzero-sum differential games. J. Optim.
Theory Appl., 3(3): 184-206.
Starr, A. W. and Ho, Y. C. (1969 b) Further properties of nonzero-sum differential
games. J. Optim. Theory Appl., 3(4): 207-219.
Glizer, V. Y. (2000) Asymptotic solution of zero-sum linear-quadratic differential
game with cheap control for minimizer. Nonlinear Differ. Equ. Appl., 7(2):
231-258.
Fudenberg, D. and Levine, D. (1986) Limit games and limit equilibria. J.
Economic Theory, 38(2): 261-279.
Zhukovskiy, V. I. and Salukvadze, M. E. (1994) The Vector-valued Maximin, 270280. Academic Press, Boston.
Xu, H. and Mizukami, K. (1997) Infinite-horizon differential games of singularly
perturbed systems: a unified approach. Automatica, 33(2): 273-276.
Jimenez, M. and Poznyak, A. (2006) -equilibrium in LQ differential games with
bounded uncertain disturbances: robustness of standard strategies and new
strategies with adaptation. International Journal of Control, 79(7): 786-797.