Professional Documents
Culture Documents
"This research is partially supported by the National S ci ence Foundation under grants MSS-91 00618
and IRI-9216428
Abstract. In this paper we study the swing up control problem for the Acrobot using partial
feedback linearization. We give conditions under which the response of either degree of freedom
may be globally decoupled from the response of the other and linearized. This result can be used
as a starting point to design swing up control algorithms. Analysis of the resulting zero dynamics
shows interesting and rich behavior. Simulation results are presented showing the swing up motion
resulting from the partial feedback linearization design.
Key Words_ control theory, robotics, intelligent machines, partial feedback linearization, underac-
tuated systems, mechatronics .
1. INTRODUCTION where
833
The difference between the system (1)-(2) and the or the system ~2
standard model of a two- link planar robot (Spong
and Vidyasagar, 1989) is, of course, the absence (6)
of an input torque to the first equation (1). Previ- (7)
ous studies have investigated the problem of bal-
ancing the Acrobot in its inverted position and Thus (under conditions that we will state below)
controlling its motion along its unstable equilib- the systems ~I and ~2 are both feedback equiv-
rium manifold (Bortoff, 1992 ; Bortoff and Spong, alents of the Acrobot dynamics. Either of these
1992; Murray and Hauser , 1990). The problem systems, ~I or ~2 , may be used to generate a
studied in this paper is to swing the Acrobot from swing up control strategy as we will show below,
is stable manifold to its unstable manifold and after first giving the details of the derivations of
balance it. Initial results on the swing up control ~I and ~2.
problem were reported in (Spong, 1994b). In this
paper we extend the results from (Spong, 1994b)
by showing that the input-output or partial feed- 2.1. DERIVATION OF THE SYSTEM ~I
back linearization approach can be used to lin- LINEARIZATION OF ql
earize either degree of freedom. In other words ,
Consider the first equation (1)
we may achieve a double integrator system
(8)
qi = Vi for i = 1 or 2 (3)
and assume that the term
This result is quite simple yet surprising in its im-
plications. We shall see that a careful analysis
and interpretation of the resulting zero dynam-
ics is important to understand both the behavior is non zero for all values of q2. Note that this im-
of the system under partial feedback linearization poses some restrictions on the inertia parameters
control and the limitations of the approach. of the robot, namely that h > m 2i c2 (i l - ic2) .
This condition is termed Strong Inertial Coupling
in (Spong, 1994a). Under this assumption we can
solve for ih from (8) as
2. PARTIAL FEEDBACK LINEARIZATION
(9)
It has been shown (Bortoff, 1992; Murray and
Hauser, 1990) that the Acrobot dynamics are not and substitute the resulting expression (9) into (2)
feedback linearizable with static state feedback to obtain
and nonlinear coordinate transformation . This
is typical of a large class of under actuated me- (10)
chanical systems. However , as we will show , we
may achieve a linear response from ei ther degree where the terms dl , hI, ~I are given by
of freedom by suitable nonlinear feedback. In this
section, we derive and analyze two distinct nonlin-
dl d2I - d 22 d ll /d I2
ear controllers to achieve two distinct closed loop hI h2 - d 22 hI/ d I2
systems , which we call ~I and ~2 , and which rep- ~I <P2 - d 22 <PI/d I2
resent the linearization of the response of link 1
and link 2, respectively. We will use these two sys-
tems to generate two distinct approaches for the A feedback linearizing controller can now be de-
swing up control problem. fin ed for equation (10) according to
834
wh ere kp and kd are p ositive gains. With state above that the surface z =
0 in state space de-
va riabl es fin es an invariant manifold fo r the system. Since
A is Hurwitz for positive values of kp and kd this
ZI = ql - qf Z2 = ql - qf (15 ) inva ri ant manifold is globally attractive. The dy-
1)1 = q2 172 = q2 namics o n this manifold are given by
t he closed loop syst e m may be written as
I] = !L'(O , 17) (23)
(16 )
Since we are interested in the swing up control
(17) problem, we consider the case qf
= 7r/2. Substi-
171 (18) tuting qf =
7r/2 , qf = = iif
0 into the equation
( 19) and using the original description of the sys-
172 (19) t em (1) yields the following expression for the ze ro
dynamics of the system with respect to the output
y =ql:
It is inte rest ing to note that the same result can be
obtain ed by simply choosing an o utput equation (m2e~2 + m 2£1£c2 COS(q2) + 12)<12
(20)
- m 2£ I£c2 sin(q2)q~ - m 2£c2gsin(q2) =0 (24)
fo r the o riginal system (1 )-(2), differentiating the T he syste m (24) , conside red as a dynamical sys-
o utput y until the input appears, and then choos- te m o n the cylinde r , h as two equilibrium points
ing the control input to Iin earize the resulting PI = (O , ojT , which is a saddle, a nd P2 = (7r,O)T ,
equation. The system therefore has relative de- whi ch is a center. A typi ca l phase portrait of the
g ree 2 with respect to the o utput ql' The manner system (24) is shown in Figure (2). The dynamic
in which we have arrived at t he system ~I has parameters used to generate this phase portrait
t.he advantage that the comput at ion and analysis co in cid e with those used in the simul ations of the
of the resulting zero dyn am ics is simple. next section.
835
next section by simulation result.s.
:; ; .. ; .. ::::: .;1'"
. .
I: ":" •• ':: I ::::::
.....
11:::: I:::::: I:::::: 11::::: :1: : :: : :1::
A=
[ 10 .19
-10.35
-1.57
6.12
0
0
0
0 1
(26)
q q
2
-2
+--..-1---c::----~-----::6c---~ t.
13=
r -L 1
2.37
(27)
-6
Q=
r 1000
-500
-500
1000
0
0
0
0
(28)
0 0 1000 -500
Fig. 3. Partial Feedback Linearizat.ion Response with
gains kp = 9, kd = 3
0 0 -500 1000 1
and R =
0.5 , yielding the state feedback controller
back linearizati on co nt.roll er fo r larger gain valu es I\· = [-1650.0 , -460.2. -71 6. 1, -278.2] (29)
kp = 9 and kd =
6. In this case link 2 rotates
360 0 in t.he steady state. Csing t.he large gains,
we can then switch to a "balan cing" controller to The' above control law (29) is switched on wh en-
capture and balance th e second link about the ver- ever the acrobot reaches t.h e near vertical config-
tical whenever q2 passes close to zero (mod(2iT)). uratio n . The actual switching time is determin ed
We illustrat e this below using a Linear , Quadratic by trial and error. Figure (5) shows a plot of a
Regulator to balance the Acrobot about the ver- successful swing up and balan ce using the par-
tical. t ial feed back lin earization followed by the linear .
quadratic regulator.
836
al., 1992) was used to solving the swing up control
1 .'
2
probl em. The result in (Spong , 1994b) contains an
analysis of the resulting zero dynamics for I:2 sim-
ilar to that contained here for I: 1. For reasons of
space we will not include the analysis of the zero
dynamics in this paper . Instead we will discuss
the original energy pumping interpretation of our
algorithm which was the original motivation for
its derivation.
A feedback linearizing controller can be defined The total energy of the system is given by
for equation (31) according to
(32)
and the derivative of V along trajectories of the
Substituting the control (32) into (31) yields the system is given by
system I:2
(39)
(33)
Therefore , the change in total energy over a time
(34) interval [T - 1, T] is
837
Then we see from (40) that
V(T) - V(T - 1) =£ ( I/(t)1 ·Iqlldt 20(42) all 1.t. 1 _FYV _~ ... i _ _ t.on? --_ _ _ _ __
JT-l
i.e., the change in energy during the time interval
[T - 1, T] is nonnegative. Our strategy for swing-
ing link 2 rapidly in the direction of motion of ql
is designed to produce a net force during the time
[T - 1, T] of each swing with the "correct sign"
as above. Although the above simplified analysis
only approximately describes the true Acrobot , we
will see below that the total energy is indeed in-
-~r-----~---------=---------
creased with each swing as we might expect from
the above considerations. Fig. 7. Total Energy During the Swingup Motion
It turns out that a better response can be obtained robot. Ph.D. Thesis , Dept. of Electrical and
by smoothing out the reference command for joint Computer Engineering. University of Illinois at
to by using Urbana- Champaign .
Bortoff, S.A. and M.W. Spong (1992). Observer-
q~ = Cl'sat(qd ( 43) based pseudo-linearization using splines: The
rolling acrobot example. ASME Winter Annual
where sat() is the saturation function. This has
Meeting. Anaheim , CA.
the effect of straightening out the Acrobot at the
Elmquist , H. (1975). S.lmnon- User's Guide. Dept.
top of each swing, which actually increases the
of Automatic Control. Lund Inst. of Tech.
amplitude of link 1 with each swing and facili-
lsidori , A. (1989). N onlznear Control Systems. sec-
tates the capturing of the Acrobot at the vertical
ond ed .. Springer- Verlag. Berlin.
position. Oth er choices for qq
are possible. The
Kokot.ovic , P.V. , M. Krstic and I. Kanellakopou-
essential feature is that the reference function be
los (1992). Backstepping to passivity: Recur-
a so-called "first and third quadrant" function of
sive design of adaptive systems. IEEE Conf. on
ql. See (Spong, 1994b) for additional details and
Decision and Control. Tucson , AZ. pp. 3276-
an analysis of the resulting zero dynamics.
3280.
Figure (6) shows a swing up motion using the ref- Murray, R.M. and J. Hauser (1990). A case study
erence for q2 given by (43) . Again the LQR con- in approximate linearization: The acrobot ex-
troller is switched on at the top of the swing. Fig- ample. Proc. American Control Conference.
ure (7) shows a plot of the total energy during the Spong , M.W. (1994a). The control of underac-
swingup motion. tuated mechanical systems. First International
Symposium on Mechatronics. Mexico City.
Spong , M.W. (1994b). Swing up control of the ac-
robot. IEEE Int. Conf. on Robotics and Au-
tomation . San Diego , CA.
Spong, M.W. and M. Vidyasagar (1989). Robot
Dynamics and Control. John Wil ey & Sons,
lnc .. f' ew York , NY.
3. REFER[\C[S
838