Professional Documents
Culture Documents
Abstract: This paper proposes two kinds of control strategy for swinging-up and keeping a twc-
link acrobot a t the upright position. One is based on the energy control, and the other is based
on the linear feedback control so that the poles of the closed-loop system are assigned t o make
the pendant pmition unstable and the upright position he stable. Two different linear feedback
control laws are used in the course of the swing-up and balancing of the acrohot a t the upright
position. The proposed strategies are evaluated by the simulation.
Keywords: acrobot, swing-up and balancing contorl, linear matrix inequality, energy control,
pole assignment,optimal regulator
where
3073
positive definite and symmetric matrix P and st.%tefeed- where Q is a positive semi-definite matrix, R is a pos-
back law F should be determined by satisfing itive definite matrix. The optimal control input ua(t)
that minimizes the criterion function J is given by
(A,, - B,F)P + P ( A , - B,F)T > 0 , (6)
(AB- B@)P + P ( A p - B P F )<~ 0. (7)
= - F ~ ~ ~ (=~R -)I B, T F
P ~ ,~
But because of B,FP term and BpFP term a r e bilinear where Pp that is symmetric and positive definite the
for variable F and P, Eq.(G),(7)is not necessarily come solution of the next Riccati equation,
down to the convex plan problem generally. Uling
AgPs + PpAp + Q - PpBpR-lBFPp =0
w := FP, (8)
4.2 Swing-up and balancing control
Eq.(6) and Eq.(7) are transformed into
based on energy
A,P + PA: - B,W - WTBZ > 0, (9) In this section, the control is designed by utilizing the
ADP i PA: - BpW - W T B z <. 0, (10) energy, but the balancing control is same as the one
3P = PT > 0,3 w, used in the pole assignment, which is given by 4.1, so
(11)
the only swing-up control law is shown in this section.
and the problem can be changed to LMI problem re-
garding P and W. Thus, useing Q.(9),(10),(11),a sin- Swing-up control based on energy: The total en-
gle feedback gain F is gave as ergy excepted the dissipation energy is calcurated as
F = WP-'. 1
(12) E ( t ) = -{(.TI +mil: +m,z:,e: +2m2Z1l2
2 . .
The computed feedback gain F is given by e I e z ~ s ( e-l e,) + ( J +~ m2r:)8:)
F = (-160,211,-6.27,-3.45] x (13) +
+{mlllcosO~ mz(llcos0, + 12cos&)}g. (17)
As the feedback gain becomes excessively larga, the re- The difference between the total energy E ( t ) and po-
alization of the swing-up and balancing based on tbis tential energy of balancing at the upright position E, is
control strategy by using single feedback law IS impos- given by
sible actually. E,(t) := E ( t ) - E , . (18)
Passivity of a two-link acrobot yileds
4. Swing-up and balancing control
by separate control laws
4.1 Swing-up balancing
and (control Now the next equation is considered
based on pole assignment d
-Ez(t) = -/&(t), (20)
Swing-up control based o n pole assignment: dt
The closed-loop system of Eq.(3) with input u,(t) = where p > 0 . As E,(t) --t 0 is satisfied Eq.(20), the
-F,z,(t) becomes swing-up control is succeeded, Thus, u(t) is expressed
as
d
,zo(t) = (A, - B,F,)zc.(t). (14)
3074
zo = [-T + 0.1,O,O,OIT. The feedback gain Fp is
computed as Fp = 12.918,-0.845, -0.100, -0.0771 for
Q = diog['20,1W,2,10) and R = [lo']. The poles are
assigned for swing-up control 0.283 f 4.6783, -4.460 *
0,5003, since poles of open-loop system at pen-
dent position are -8.201 x lo-' f 4.6783, -4.460 f
8.789j.The feedback gain F, is chosen as F, = I I
12.918, -0.845, -0.100,-0.077]. *
As the results, we swing-up the acrobot to the u p
right position by about 9.5[sec] and balanced by about
10.5[scc] as shown in Fig.3 because the state of O1 =
, = O[rad] corresponding to the upright posi-
O [ T Q ~ O2
tion is accomplished.
ms ugb9 in m y 0 and tdamng mnld w M cats asS"snt
1 . ...
I ' I'I
1I
*
lmed 1st
Fig. 3 The top figure are showing 0, and 8 2 by the pole References
assignment approach. The bottom figure is showing the
input. [l] Mark W Spong, The Swing Up Control Problem
For The Acrobot, IEEE, Control System Magazine,
5.2 Simulation for swing-up and balanc- 1995, Vo1.15, No.1, pp49-55
ing control based on energy
[Z] Hiromitsu Hikita,Shoichi Koyama,Ryoichi Miura,
The cotrol input was designed by u ( t ) = 0,if the angle The Redundancy of Feedback Gain Matrix and the
of the second joint became more than &%[rad].At the Derivation of Low Feedback Gain Matrix in Pole
initial state, the control input was designed hy -12[Nm] Assignment, SICE, Vol 1.11, No.5, pp.556560, 1975
for the moment. Accordingly, the control input used
this time is [3] Shigeo Hirose, Katsuhisa Furuta, Tsutomu Mita,
study of super Mecheno-system: Center of Excel-
-12 (t = O ) lence Project in TITech, The Robotics Society of
u(t) = { --pE.o
&+e
0 (t
(t > 0,Ba
> 0,@2
< 'f[rdJ)
> *?[rad])
. (22) Japan, Vo1.18, No.8, pp.1061-1065, 2000
[4] Hiroyuki Kajiwara, Kazuhiro Kosuge, Katsuhisa Fu-
T ruta, Position Control of a Double Inverted Pendu-
The initid state is chosen as Zo = [ - ~ , 0 , 0 , 0 ] . p of
lum on an Inclined Rail, SICE, Vo1.15, N0.7,pp.873-
Eq.(22) is chosen p = 11. The balancing control at
879,1979
upright position is the same as one given hy the pole
assignment approach. As the results, we found that the [5] Katsuhisa Furuta, Toshiaki Ochiai, Nobuto Ono,
swinging-up and balancing at the upright position are Attitude Control of Triple Inverted Pedulum by
succeeded because the state of O1 = O[radl, 02 = O[rad] Torque Control, Japan Society for Simulation Tech-
corresponding to the upright position is accomplished nology, V01.2, Nol.1, pp.35-42, 1983
in about 4[sec] as shown in Fig.4.
[SI Darrell Nalawaki, Sangwan Joo, Fumio Miyazaki,
6. Conclusions Multi-model Based Analysis of a Gymnastic Move-
ment on the Horizontal Bar, The Robotics Society
We have shown that two kinds of strategy of the swing- of Japan, Vo1.18, No.1, pp.59-65, 2000
up control of the acrobot by pole assignment and energy
3075