Professional Documents
Culture Documents
1 Introduction
Bipedal robots form a subclass of legged robots. Their design is naturally inspired
from the functional mobility of the human body. On the practical side, the study of
mechanical legged locomotion has been motivated by its potential use as means of
locomotion in rough terrains, but in particular, the interest arises from diverse so-
ciological and commercial interests, ranging from the desire to replace humans in
hazardous occupations (de-mining, nuclear power plant inspection, military inter-
ventions, etc.), to the restoration of motion in the disabled [1].
For practical implementation, a good mechanical design and a good modeling,
play a very important role in achieving good performance. However, in real world
applications, bipedal robots are subject to many sources of uncertainty during walk-
ing; these could include a push from a human, an unexpected gust of wind, geomet-
ric perturbations of the terrain heights, or parametric uncertainties of non-modeled
friction forces [2]. For these reasons, the design of feedback control systems, capa-
ble of attenuating the effect of these uncertainties is critical to achieve the desired
walking gait.
For simplicity, the complete model of the biped robot considered in this work
consists of two parts: the differential equations describing the dynamics of the robot
during the single support phase (one foot swinging on the air, the other staying as
a pivot on the ground), and an impulse model of the contact event (the impact be-
tween the swing leg and the ground, which is modeled as a contact between two
rigid bodies as in the work by [1]), thus considering the double support phase is
1
2 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
considered instantaneous. Therefore, the complete model of the biped robot consid-
ered in this work is simplified as a hybrid system consisting of free-motion phases
separated by impacts. The study of hybrid dynamical systems has recently attracted
a significant research interest, basically, due to the wide variety of applications and
the complexity that arises from the analysis of this type of systems (see, e.g. [3], [4],
and references quoted therein). Particularly, the disturbance attenuation problem for
hybrid dynamical systems has been addressed by [5], [6], where impulsive control
inputs were admitted to counteract/compensate disturbances/uncertainties at time
instants of instantaneous changes of the underlying state. However, the drawbacks
of these approaches are 1) the complexity of finding a solution to the equations that
allows the synthesis of the control law, and 2) the physical implementation of impul-
sive control inputs is impossible in many practical situations, e.g., while controlling
walking biped robots.
In this regard, many authors have dealt with the problem of robustly controlling
the walking gait of biped robots: [7] presents an iterative approach to find Control
Lyapunov Functions, such that a mechanical system subject to impacts and fric-
tion remains stable. In [8], [9], PD control laws and a torque-optimization method
are combined to increase the robustness against joint-torque disturbances. Other ro-
bust control techniques, such as sliding modes control, have been designed for this
kind of systems (see e.g., the works by [10], [11]). While providing both finite-
time convergence to a desired reference trajectory and disturbance rejection, these
approaches also entail the well-known problem of chattering in the actuators. This
further motivates the study of robust control techniques such as the one presented
in this work, which attenuate the effect of disturbances while avoiding undesirable
and harmful effects on both the actuators, and the joints.
This chapter further develops the results presented in [12], studying the applica-
bility of the H∞ control technique (extended in [13] towards mechanical systems
operating under unilateral constraints) to the 32 degrees-of-freedom biped robot
ROMEO, of Aldebaran Robotics [14]. Results dealing with the orbital stabilization
of a simpler 2D, fully-actuated biped can be found in [15]. In contrast to previous
studies, this investigation contributes to the study of robustness of bipedal locomo-
tion while assuming an imperfect knowledge of the restitution rule at the collision
time instants in addition to external disturbance forces applied during the single
support phases.
2 Notation
The notation used throughout this work is rather standard. The argument t + is used
to denote the right-hand side value x(t + ) of a trajectory x(t) at an impact time instant
t whereas x(t − ) stands for the left-hand side value of the same; by default, x(t)
is reserved for x(t − ), thus implying an underlying trajectory to be continuous on
the left. Vectors are represented by bold, lowercase letters, whereas matrices are
represented by bold, uppercase letters.
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 3
3 Background materials
In this section, the H∞ control problem under unilateral constraints is stated, and
sufficient conditions for the existence of a solution are presented. Later on, the ap-
plicability of these results on the biped robot of interest will be studied.
Given a scalar unilateral constraint F(x1 ,t) ≥ 0, consider a nonlinear system, evolv-
ing within the above constraint, which is governed by continuous dynamics of the
form
ẋ1 = x2
(1)
ẋ2 = Φ(x1 , x2 ,t) +Ψ 1 (x1 , x2 ,t)w +Ψ 2 (x1 , x2 ,t)u
x1 (ti+ ) = x1 (ti− )
x2 (ti+ ) = µ0 (x1 (ti ), x2 (ti− ),ti ) + ω(x1 (ti ), x2 (ti− ),ti )wid
(3)
zdi = x2 (ti+ ) (4)
it could readily be extended to the over-actuated case with a correct choice of the
control inputs [16]. The treatment in the underactuated case is also possible using
the virtual constraint approach [17, 1] whenever it is applicable (e.g., for the unde-
graduation degree one similar to that of [18, 19]).
Admitting the above time-varying representation is particularly invoked to deal
with tracking problems where the plant description is given in terms of the state
deviation from the reference trajectory to track [20]. Therefore, if interpreted in
terms of mechanical systems, equation (1) describes the continuous dynamics in
terms of the position and velocity tracking errors (x1 and x2 , respectively), before
the underlying system hits the reset surface F(x1 ,t) = 0, depending on the position
error x1 only, whilst the restitution law, given by equation (3), is a physical law for
the instantaneous change of the velocity error when the resetting surface is hit.
Consider a causal feedback controller
u = κ(x,t) (5)
with the function κ(x,t) of class C1 such that κ(0,t) = 0. Such a controller is said to
be a locally (globally) admissible controller iff the undisturbed (w, wid = 0) closed-
loop system (1)–(10) is uniformly (globally) asymptotically stable.
The H∞ -control problem of interest consists in finding an admissible global con-
troller (if any) such that the L2 -gain of the disturbed system (1)–(10) is less than a
certain attenuation level γ > 0, that is the inequality
Z T NT
2
kzk2 dt + ∑ kzdi k ≤
t0 i=1
"Z
NT
# (6)
T N
2
γ 2 2
kwk dt + ∑ kwid k + ∑ β j (x(t −
j ),t j )
t0 i=1 j=0
locally holds for some positive definite functions β j (x,t), j = 0, . . . , NT , for all seg-
ments [t0 , T ] and a natural NT such that tNT ≤ T < tNT +1 , and for all piecewise
continuous disturbances w(t) and discrete ones wid , i = 1, 2, . . . . In turn, a locally
admissible controller (5) is said to be a local solution of the H∞ -control problem
if there exists a neighborhood U ∈ R2n of the origin, validating inequality (6) for
some positive definite functions β j (x,t), j = 0, . . . , NT , for all segments [t0 , T ] and
a natural NT such that tNT ≤ T < tNT +1 , for all piecewise continuous disturbances
w(t) and discrete ones wid , i = 1, 2, . . . , for which the state trajectory of the closed-
loop system starting from an initial point (x(t0 ) = x0 ) ∈ U remains in U for all
t ∈ [t0 , T ].
It is worth noticing that the above L2 -gain definition is consistent with the notion
of dissipativity, introduced by [21], and [22], and with iISS notion [23]. It represents
a natural extension to hybrid systems (see, e.g. the works by [6], [24], [25], and
[26]). In mechanical terms, for the disturbed case, even if the output z is not driven
to zero, the L2 -gain of the system is still locally less than the specified value γ, so
the output will be bounded around zero and in consequence the state trajectories
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 5
of the plant will evolve around the trajectory to track. To facilitate the exposition
the underlying system, chosen for treatment, has been pre-specified with the post-
impact velocity value x2 (t) in the discrete output (10) to be controlled. The general
case of a certain function κ d (x2 (t)) of the post-impact velocity value in the discrete
output (10) can be treated in a similar manner because the L2 -gain inequality (6) is
flexible in the choice of positive definite functions βk (x,t), k = 0, . . . , NT .
For later use, the continuous dynamics (1) are rewritten in the form
the nonlinear system is completely described beyond and at the contact with the
unilateral constraint F(x,t). In order to simplify the synthesis to be developed
and to provide reasonable expressions for the controller design, the assumptions
h1 > k12 = 0, k12 > k12 = I, which are standard in the literature (see, e.g., [27]) are
made. Relaxing these assumptions is indeed possible, but it would substantially
complicate the formulas to be worked out.
Below we list the hypotheses under which a solution to the problem in question is
derived. Given γ > 0, in a domain x ∈ B2n 2n 2n
δ ,t ∈ R, where Bδ ∈ R is a ball of radius
δ > 0, centered around the origin,
√
2
H1) The norm of the matrix function ω is upper bounded by 2 γ, i.e.,
√
2
kω(x,t)k ≤ γ. (11)
2
6 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
H2) there exist a smooth, positive definite, decrescent function V (x,t) and a positive
definite function R(x) such that the Hamilton–Jacobi–Isaacs inequality
∂V ∂V
+ (f(x,t) + g1 (x,t)α1 + g2 (x,t)α2 ) +
∂t ∂x
h1 > h1 + α2 > α2 − γ 2 α1 > α1 ≤ −R(x) (12)
holds with
∂V > ∂V >
1 > 1 >
α1 = g (x,t) , α 2 = − g (x,t)
2γ 2 1 ∂x 2 2 ∂x
H3) Hypotheses H1 is satisfied with the function V (x,t) which decreases along the
direction µ in the sense that the inequality
u = α2 (x,t), (14)
locally possesses a L2 -gain less than γ. Once Hypothesis H3) is satisfied as well, the
function V (x,t) constitutes a Lyapunov function of the disturbance-free closed-loop
system (1)-(10), (14) the uniform asymptotic stability of which is thus additionally
guaranteed.
Since the proof follows the same line of reasoning as that in the book by [27]
for the impact-free case here we provide only a sketch. Similar to the proof of [27,
Theorem 7.1], let us consider the function V (x,t) whose time derivative, computed
along the disturbed closed-loop system (1)-(10) between collision time instants t ∈
(tk ,tk+1 ), k = 0, 1, . . . , is estimated as follows [27, p.138]:
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 7
dV
≤ −kzk2 + γ 2 kwk2 − R(x). (15)
dt
Then integrating (15) from tk to tk+1 , k = 0, 1, . . . , yields
Z tk+1
[γ 2 kwk2 − kzk2 ]dt ≥
tk
Z tk+1 Z tk+1 (16)
dV (x(t),t)
R(x(t))dt + dt > 0.
tk tk dt
holds true with LVi > 0 being a local Lipschitz constant of V , in the ball of radius
kx(ti+ )k, centered around x(ti− ). Relations (17) and (18), coupled together, result in
Z T NT
(γ 2 kwk2 − kzk2 )dt ≥ − ∑ [2(LVi )kx(ti− )k −V (x(t0 ),t0 ) (19)
t0 i=1
with i = 1, . . . , N.
To complete the proof it remains to establish the asymptotic stability of the undis-
turbed version of the closed-loop system (1)-(10),(14). Indeed, the negative definite-
ness (15) of the time derivative of the Lyapunov function V (x,t) between the colli-
sion time instants, coupled to Hypothesis H3), ensures that Lemma 1 is applicable
to the undisturbed version of the closed-loop system (1)-(10), (14). By applying
Lemma 1, the required asymptotic stability is thus validated. t u
To present a local solution to the problem in question the underlying system is lin-
earized to
within impact-free time intervals (ti−1 ,ti ) where t0 is the initial time instant and
∂ f
ti , i = 1, 2, . . . are the collision time instants, whereas A(t) = , B1 (t) =
∂ x x=0
∂ h
g1 (0,t), B2 (t) = g2 (0,t), C(t) = , D12 (t) = k12 (0,t).
∂ x x=0
By the time-varying strict bounded real lemma [29, p.46], the following condition
is necessary and sufficient for the linear H∞ control problem (23)-(24) to possess a
solution: given γ > 0,
C) there exists a positive constant ε0 such that the differential Riccati equation
has a uniformly bounded symmetric positive definite solution Pε (t) for each ε ∈
(0, ε0 );
As shown below, this condition, if coupled to Hypothesis H1 and a certain mono-
tonicity condition, is also sufficient for a local solution to the nonlinear H∞ control
problem to exist under unilateral constraints.
Theorem 2. Let condition C be satisfied with some γ > 0. Then Hypothesis H2 hold
locally around the equilibrium (x = 0) of the nonlinear system (1)-(10) with
ε
V (x,t) = x> Pε (t)x, R(x) = kxk2 (26)
2
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 9
locally possesses a L2 -gain less than γ provided that Hypothesis H1 holds as well.
If in addition, Hypothesis H3 is satisfied with the quadratic function V (x,t), given in
(26), then the disturbance-free closed-loop system (1)-(10), (27) is uniformly asymp-
totically stable.
Proof. Due to [29, Theorem 24], Hypothesis H2 locally holds with (26). Then by
applying Theorem 1, the validity of Theorem 2 is concluded.
For the periodic tracking of period T with periodic impact instants ti+1 = ti + T, i =
1, 2, . . ., Theorem 2 admits a time-periodic synthesis (27) which is based on an ap-
propriate periodic solution Pε (t) of the periodic differential Riccati equation (25).
+
It should be noted that Pε (ti+1 ) = Pε (ti+ ), due to the periodicity, and inequality (13)
of H3 is then specified to the boundary condition
The bipedal robot considered in this section is walking on a rigid and horizontal
surface. It consists of the 32-DOF robot Romeo, of Aldebaran Robotics, depicted in
Fig. 1. Similar to the planar biped form the previous section, the walking gait takes
place in the sagittal plane and is composed of single support phases and impacts.
10 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
The configuration of the biped robot in single support can be described only by
the vector q = (q0 , q1 , . . . , q32 )> . We use the modified Denavit-Hartenberg notation
[30] to define the frame position for each joint (see Fig. 2). To define the geomet-
ric structure of the biped we assume that the link 0 (stance foot) is the base of the
bipedal robot while the link 12 (swing foot) is the terminal link. Considering the
torso, head and arms, one obtains a tree structure. To take into account explicitly the
contact with the ground, we have to add six more variables to describe the position
and orientation of the frame 0 with respect to a fixed Galilean frame Rg . Thus, we
can define the position, velocity, and acceleration vectors, X = (X0 > , α 0 > , q> )> ,
V = (0 V> 0 > > > 0 > 0 > > >
0 , ω 0 , q̇ ) , and V̇ = ( V̇0 , ω̇ 0 , q̈ ) . X0 and α 0 are the position and
orientation variables of frame R0 , while 0 V0 and 0 ω 0 are the linear and angular ve-
locities of R0 , with respect to the Galilean frame. Therefore, the complete dynamic
model of the biped can be written as follows:
0 ≤ F(X) ⊥ R2 ≥ 0 (30)
Fig. 2 Frames placement for the main limbs; the remaining 6 frames not appearing belong to the
hands (2 frames) and the neck and head (4 frames), thus completing the 32 degrees of freedom.
The zero frame R0 is attached to the left foot.
12 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
where τ = (τ 1 , . . . , τ 32 )> is the 32 × 1 vector of joint torques. The term H(q, q̇) is
the 32 × 1 vector of the centrifugal, Coriolis and gravity forces.
The above mentioned assumptions are verified during the numerical study using
model (29). If at least one of these conditions are not satisfied, the conditions to
construct (32) are not met and thus it is not valid. Thus, the reference trajectories
are designed taking these conditions as restrictions.
The impact is assumed to be inelastic with complete surface of the foot sole touching
the ground. This means that the velocity of the swing foot impacting the ground is
zero after impact. The double support phase is instantaneous and it can be modeled
through passive impact equations. An impact occurs at a time t = TI when the swing
leg touches the ground.
Since the impact is assumed to be passive, absolutely inelastic, and that the legs
do not slip [33], the following equations:
J1 V− = 0 (33)
J2 V+ = 0 (34)
hold. This means that the feet perfectly stick on the ground after the shock, thus
avoiding multiple impacts. Given these conditions, the ground reactions can be
viewed as impulsive forces. The algebraic equations, allowing one to compute the
jumps of the velocities, can be obtained through integration of the dynamic equa-
tions of the motion, taking into account the ground reactions during an infinitesimal
time interval from TI− to TI+ around an instantaneous impact.
The impact is assumed to be with complete surface of the foot sole touching
the ground. This means that the velocity of the swing foot impacting the ground
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 13
is zero after impact. After an impact, the right foot (previous stance foot) takes off
the ground, so the vertical component of the velocity of the taking-off foot must be
directed upwards right after an impact and the impulsive ground reaction in this foot
equals zeros. Thus, the impact dynamic model can be represented as follows [33]:
where V− is the velocity of the robot before the impact, and V+ is the velocity after
the impact; φe (X) represents a restitution law that determines the relations between
the velocities before, and after the impacts; X is the configuration of the robot at
the impact. The additive term wdei is introduced to account for inadequacies in this
restitution law. Thus, equation (35) renders the complementarity system (29)-(34)
complete.
Considering that the support foot velocity is zero before the impact, and the swing
foot velocity is zero after the impact, and combining (33)-(35), it is possible to
obtain the following expression:
Also, considering the reference frame x0 , y0 , z0 shown in Fig. 2, and since we are
assuming flat foot contact with the ground, the time-invariant unilateral constraint
F0 (q) ≥ 0 is determined by the height of the swing foot’s sole.
Remark 1. It is important to remark at this point, that if all the assumptions men-
tioned above are met, the so-called complementarity system [34] (29), (35), subject
to the unilateral constraint (30), is simplified to the form equations (32), (36), which
define a hybrid system that can controlled using the methodology developed in this
work. For the latter, the unilateral constraint can be defined as F(q), which repre-
sents the height of swing foot, as a function of the generalized coordinates of the
implicit-contact model (32).
τ = D(q̈r − u) + H (38)
that imposes on the undisturbed biped motion desired stability properties around qr
while also locally attenuating the effect of the disturbances. Thus, the controller to
be constructed consists of the feedback linearizing terms of (38) subject to u = 0,
which are responsible for the trajectory compensation, and a disturbance attenuator
u, internally stabilizing the closed-loop system around the desired trajectory. In what
follows, we confine our research to the trajectory tracking control problem where
the output to be controlled is given by
0 I
z = ρ p (qr − q) + 0 u, zd = qr (tk+ ) − q(tk+ ) (39)
ρv (q̇r − q̇) 0
The transitions occur in the error dynamics according to the following scenarios:
T1) The reference trajectory reaches its predefined impact time instant t = t k , k =
1, 2, . . . when it hits the unilateral constraint whereas the plant remains beyond
this constraint, that is, F0 (qr (t k )) = 0, F0 (x1 (t k ) + qr (t k )) 6= 0;
T2) The plant hits the unilateral constraint at t = t j , j = 1, 2, . . . while the reference
trajectory is beyond this constraint, that is, F0 (qr (t j )) 6= 0, F0 (x1 (t j ) + qr (t j )) =
0;
T3) Both the reference trajectory and the plant hits the unilateral constraint at the
same time instant t = t l , l = 1, 2, . . . (what can deliberately be enforced by
modifying the pre-specified reference trajectory on-line), that is, F0 (qr (t l )) = 0,
F0 (x1 (t l ) + qr (t l )) = 0.
Fig. 3 The three different scenarios for the transitions in the error dynamics.
These scenarios are illustrated in Figure 3. Transition errors are then represented
as follows.
Scenario T1:
x1 (t k+ )= x1 (t k− )
x2 (t k+ ) = µ 1 (x(t k− ),t k ) + wdk , (44)
x1 (t j+ )= x1 (t j− )
x2 (t j+ ) = µ 2 (x(t j− ),t j ) + wdj , (45)
16 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
x1 (t l+ )= x1 (t l− )
l+
x2 (t ) = µ (x(t l− ),t l ) + wdl , l = 1, 2, . . .
3 (46)
To put the previous equations into the form (3), it suffices to set
Clearly, the functions µ 0 (x,t), ω(x,t), F(x,t), thus specified, meet the assumptions,
imposed on the generic system (1)-(10) to be twice continuously differentiable in
the state domain for all t, and to be piece wise continuous, and uniformly bounded
in t, for all state variables x in some neighborhood around the origin.
To respect Condition C of Theorem 2 for the error system (1)-(10), the controlled
output (39) is specified with ρ p = 3500 and ρv = 500, and then, following the stan-
dard H∞ design procedure (see, e.g., [29, Section 6.2.1]), the disturbance attenua-
tion level and the perturbation parameter are set to γ = 200 and ε = 0.01 to ensure an
appropriate solvability of the perturbed differential Riccati equation (25), subject to
the boundary condition (28). Next, hypothesis H1 of Theorem 2 is then straightfor-
wardly verified with γ, thus specified, and with ω, being an identity matrix. Finally,
to comply with hypothesis H3 of Theorem 2 to be verified at the impact time in-
stants, the previously defined reference trajectory, is adapted on-line in such a man-
ner that the state error dynamics possess no jumps. Thus, inequality (13) becomes
redundant for the adapted trajectory because only trivial transitions with µ(x,t) = 0
are feasible.
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 17
-0.2
-0.4
-1
x+
21 {
-1.2
}x−
21
-1.4
0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85
Time [sec]
Fig. 4 Reference velocity adaptation for the first joint, with an impact at t l = 0.5 s.
For hybrid systems with state-triggered jumps, the jump times of the plant and the
reference trajectory are in general not coinciding. During the time interval caused by
this jump-time mismatch, the tracking error is large, even in the undisturbed case.
Since this behavior also occurs for arbitrarily small initial errors, the error dynamic
displays unstable behavior in the sense of Lyapunov. This behavior is known in
the literature as ”peaking”. It is expected to occur in all hybrid systems with state-
triggered jumps when considering tracking or observer design problems [35], and
imposes a difficulty in guaranteeing that the norm of the tracking error converges to
zero. In order to achieve synchronization (Scenario T3 from the previous section)
in our biped application, the reference trajectory is adapted, as illustrated in Fig.4
for the first joint q1 . Provided that the impact is detectable (e.g., by using a force
or touch sensor) it happens that either the reference trajectory hits the constraint
before the plant does, or the plant hits the constraint before the reference trajectory
does. In the former scenario, the reference trajectory is continuously extrapolated
until the plant collision occurs whereas in the latter scenario, the reference trajec-
tory is restarted on-line once the plant collision is detected. Either way, both the
plant trajectory and the adapted reference trajectory exhibit impacts at the same
time instants. Before the collision, the nominal reference trajectory and the adapted
one, are forced to be equivalent by the adaptation proposed. The position and ve-
locity tracking errors are measured, and once the impact of the plant is detected,
the adapted trajectory is updated on-line in such a manner that the new post-impact
+
error, x21 in Fig.4, coincides with the error measured before the impact (x21 (t l− )
in Fig.4), thereby rendering the evolution of the error to exhibit no jump, so as to
18 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
ensure a smooth control action. Following the idea of [36], a new polynomial in
terms of the tracking error is defined for the adapted trajectory, that starts from this
imposed condition, and will join the nominal reference trajectory at the middle of
the step with the same velocity, and will continue to be the same until the end of the
step. While the reference trajectory is recalculated after the impact, the perturbed
differential Riccati equation (25) is also updated, and its corresponding solution is
recomputed on-line.
To illustrate the performance issues of the developed stable bipedal gait synthe-
sis numerical simulations were performed for the model of a laboratory prototype,
whose parameters were drawn from the Aldebaran’s ROMEO documentation. The
contact constraints (no-take off, no rotation, and no sliding during the single sup-
port phase) are verified on-line to confirm the validity of (32), (36). The well-known
constraint (complementarity)-based approach [37] is utilized to simulate the biped
contact with the ground, using a Matlab emulator. The latter approach belongs to
the family of time-stepping approaches and it is often invoked for biped dynamics
simulations (see, e.g., the works by [38, 39]).
First, the undisturbed-case results are presented 1 , where the plant initial condi-
tions are deviated a 5% from the reference gait’s initial conditions. The control law
calculation time averaged at 1.5 ms. As predicted by the theory, Figure 5 depicts the
Lyapunov function V (x,t), decreasing smoothly and asymptotically towards zero,
illustrating that no peaking phenomena is present, thanks to the trajectory adapta-
tion.
As a next step, a persistent disturbance of 10 sin(t) Nm was applied to the hip,
while the velocities after the impact are deviated 5 % from their nominal values
(given by (36)), thus considering disturbances on both the single support and impact
phases. Six joints among the 32 were selected to clearly illustrate the effect of this
disturbance (both ankles, knees, and hip joints). This is depicted in Fig. 8, where
the error is small and bounded, and the robot maintains an stable walking gait. The
torques for these joints are shown in Fig. 9, where they stay between the buondaries
of ±150 Nm. Despite the disturbances, good performance of the closed-loop error
dynamics, driven by the proposed nonlinear H∞ state feedback, is still achieved.
Finally, the performance of the H∞ controller was compared against the perfor-
mance of a PD controller. In order to do a fair comparison, the same pre-feedback
(38) was used, and just the disturbance attenuator (27) was replaced by the PD con-
troller u = −Kp x1 − Kv x2 , with the constant matrices [Kp , Kv ] = B> 2 Pε where Pε
is the solution of the algebraic version of the Riccati equation (25) with Ṗε = 0.
The comparison results with the time-varying disturbance force 10 sin(t) + 10 N,
applied to the hip are shown in Fig.10, where it can be seen that after 6.13 s, the
4.5
3.5
3
V (x, t)
2.5
1.5
0.5
0
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8
Time [sec]
Fig. 5 Lyapunov function for the undisturbed system, with nonzero initial conditions.
0 Right foot
Left foot
ZMPx
-0.1
-0.2
-0.3
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6
Time [sec]
-0.05
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6
Time [sec]
Fig. 6 Zero moment point locations for both feet during the disturbance free walking gait. The
dashed lines represent the feet geometrical limits and the ZMP always rests inside of them, thus
illustrating no rotation of the support foot at each step.
20 Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
−0.02
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8
0.02 P4
−0.02
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8
Fig. 7 Feet heights for 6 steps for Romeo, representing a stable motion. P1-P4 represent the heights
of the corners of each foot.
5 Conclusions
In this chapter, sufficient conditions for a local solution of the H∞ state feedback
tracking problem to exist are presented in terms of the appropriate solvability of an
independent inequality on discrete disturbance factor that occurs in the restitution
rule, and two coupled inequalities, involving a Hamilton-Jacobi-Isaacs inequality.
The former inequality ensures that the closed-loop impulse dynamics (when the
state trajectory hits the unilateral constraint) are ISS whereas the latter inequali-
ties, arising in the continuous-time state feedback, should impose the desired iISS
property on the continuous closed-loop dynamics between impacts.
The effectiveness of this synthesis procedure, based on solving a disturbed dif-
ferential Riccati equation, corresponding to the linearized system, is supported by
a numerical study, made for the state feedback design of a stable walking gait of a
32-DOF fully actuated biped robot. The desired disturbance attenuation is satisfac-
Left ankle position error Left knee position error Left hip position error
0.06 0.05 0.1
0.04
0 0.05
0.02
0
−0.05 0
−0.02
Degrees
Degrees
Degrees
−0.04
−0.1 −0.05
−0.06
Right hip position error Right knee position error Right ankle position error
0.05 0.1 0.15
0.1
0
0.05
0.05
−0.05 0
0
−0.05
Degrees
Degrees
Degrees
−0.1
−0.1
−0.05
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint
−0.15
−0.15
Fig. 8 Joints errors for left and right hips, knees, and ankles, under a persistent continuous distur-
21
22
50 50 50
0 0 0
Nm
Nm
Nm
−50 −50 −50
50 50 50
0 0 0
Nm
Nm
Nm
Fig. 9 Torques for left and right hips, knees, and ankles, under a persistent continuous disturbance
Oscar Montano, Yury Orlov, Yannick Aoustin and Christine Chevallereau
H∞ - Stabilization of a 3D Bipedal Locomotion Under a Unilateral Constraint 23
3.5
PD
H∞
3
kq − qd kL2 2.5
1.5
0.5
0
0 1 2 3 4 5 6 7
Time [sec]
Fig. 10 Cumulative error comparison of the nonlinear H∞ controller (dashed lines) vs. the linear
H∞ PD controller (solid lines).
torily achieved under external disturbances during the free-motion phase, and in the
presence of uncertainty in the transition phase.
Acknowledgements The authors acknowledge the Financial support of ANR Chaslim grant and
CONACYT grant no.165958.
References
12. O. Montano, O. Y., Y. Aoustin, C. Chevallereau, in Proceedings of the IEEE-RAS 16th Inter-
national Conference on Humanoid Robots (IEEE, 2016)
13. O. Montano, Y. Orlov, Y. Aoustin, International Journal of Control 89(12), 1 (2016)
14. A. Robotics. Project romeo. www.ald.softbankrobotics.com/fr/cool-robots/romeo (2016). Ac-
cessed: 2016-07-01
15. O. Montano, Y. Orlov, Y. Aoustin, Proc. 9th World Congress of the International Federation
of Automatic Control (IFAC 2014) (2014)
16. S. Miossec, Y. Aoustin, in Robot Motion and Control, 2004. RoMoCo’04. Proceedings of the
Fourth International Workshop on (IEEE, 2004), pp. 53–59
17. Y. Aoustin, A. Formalsky, Journal of Computer and Systems Sciences International 42(4), 645
(2003)
18. O. Montano, Y. Orlov, Y. Aoustin, C. Chevallereau, Proceedings of the 14th European Control
Conference pp. 800–805 (2015)
19. O. Montano, Y. Orlov, Y. Aoustin, C. Chevallereau, Autonomous Robots pp. 1–19 (2016).
DOI 10.1007/s10514-015-9543-z
20. B. Brogliato, S. Niculescu, P. Orhant, Automatic Control, IEEE Transactions on 42(2), 200
(1997)
21. J. Willems, Archive for rational mechanics and analysis 45(5), 321 (1972)
22. D. Hill, P. Moylan, Automatic Control, IEEE Transactions on 25(5), 931 (1980)
23. J.P. Hespanha, D. Liberzon, A.R. Teel, Automatica 44(11), 2735 (2008)
24. S. Yuliar, M. James, J. Helton, Mathematics of Control, Signals and Systems 11(4), 335 (1998)
25. W. Lin, C. Byrnes, Automatic Control, IEEE Transactions on 41(4), 494 (1996)
26. J. Baras, M. James, in Decision and Control, 1993., Proceedings of the 32nd IEEE Conference
on (IEEE, 1993), pp. 51–55
27. Y. Orlov, Discontinuous systems–Lyapunov analysis and robust synthesis under uncertainty
conditions (Springer, 2009)
28. W. Haddad, V. Chellaboina, S. Nersesov, Impulsive and hybrid dynamical systems: stability,
dissipativity, and control (Princeton University Press, 2006)
29. Y. Orlov, L. Aguilar, Advanced H∞ Control–Towards Nonsmooth Theory and Applications
(Birkhauser:Boston, 2014)
30. W. Khalil, J. Kleinfinger, in Robotics and Automation. Proceedings. 1986 IEEE International
Conference on, vol. 3 (IEEE, 1986), vol. 3, pp. 1174–1179
31. J. Luh, M. Walker, R. Paul, Automatic Control, IEEE Transactions on 25(3), 468 (1980)
32. M. Walker, D. Orin, Journal of Dynamic Systems, Measurement, and Control 104(3), 205
(1982)
33. D. Tlalolini, Y. Aoustin, C. Chevallereau, Multibody System Dynamics 23(1), 33 (2010)
34. W. Heemels, B. Brogliato, European Journal of Control 9(2), 322 (2003)
35. J. Biemond, N. van de Wouw, W. Heemels, H. Nijmeijer, Automatic Control, IEEE Transac-
tions on 58(4), 876 (2013)
36. A. Grishin, A. Formalsky, A. Lensky, S. Zhitomirsky, The International Journal of Robotics
Research 13(2), 137 (1994)
37. C. Rengifo, Y. Aoustin, F. Plestan, C. Chevallereau, et al., Proceedings of Multibody Dynamics
(2011)
38. P. Van Zutven, D. Kostic, H. Nijmeijer, Simulation, Modeling, and Programming for Au-
tonomous Robots p. 521 (2010)
39. K. Yunt, C. Glocker, in 9th IEEE International Workshop on Advanced Motion Control (IEEE,
2005), pp. 665–671