Professional Documents
Culture Documents
4, AUGUST 2021
Abstract—This article presents a novel representation-free and dynamic yet robust locomotion [9]. As the capabilities of
model predictive control (RF-MPC) framework for controlling var- quadrupedal robot rapidly grow, related researches have geared
ious dynamic motions of a quadrupedal robot in three-dimensional toward motions beyond locomotion on flat terrains. For exam-
(3-D) space. Our formulation directly represents the rotational dy-
namics using the rotation matrix, which liberates us from the issues ple, ANYmal demonstrated the stair climbing capability [10];
associated with the use of Euler angles and quaternion as the orien- MIT Cheetah 2 overcame obstacles by planning jumping tra-
tation representations. With a variation-based linearization scheme jectories online [11]; MIT Cheetah 3 achieved leaping onto
and a carefully constructed cost function, the MPC control law is high platforms [12]; MIT Mini Cheetah could execute 360◦
transcribed to the standard quadratic program form. The MPC backflips [13]. In general, the ability of quadrupedal robots is
controller can operate at real-time rates of 250 Hz on a quadruped
robot. Experimental results including periodic quadrupedal gaits being developed toward applications that involve more dynamic
and a controlled backflip validate that our control strategy maneuvers in increasingly complex scenarios.
could stabilize dynamic motions that involve singularity in 3-D Controlling a robot to achieve similar mobility that rivals
maneuvers. their animal counterparts encompasses many challenges. For
Index Terms—Dynamics, legged robots, model predictive control instance, the control design should be capable of embracing the
(MPC), underactuated robots. inherent dynamics when robots are underactuated. In addition,
the controller must take into consideration the constraints im-
I. INTRODUCTION
posed by hardware capacity and the environment. Available so-
HE quadrupedal animals possess extraordinary compe-
T tence of navigating harsh terrains by executing agile yet
well-coordinated movements. For example, mountain goats
lutions for dynamic locomotion include heuristic controller [14],
inverse dynamics control [15], and hierarchical operational
space control [16]. Simulation result of backflip in Cassie is
demonstrate their extraordinary mobility on traversing steep presented in [17]. Recent hardware results on ANYmal [18]
cliffs [2]. Domesticated canine animals could be trained to demonstrated the potential of reinforcement learning in dynamic
execute a variety of acrobatic Parkour maneuvers [3]. These legged locomotion. Recent times have seen a surge in the use
remarkable abilities of quadrupedal animals motivated the de- of optimization-based approaches, especially model predictive
velopment of many quadrupedal robots. Minitaur [4] realized control (MPC) for legged robots. Successful applications of
various dynamic running gaits: ANYmal [5] and HyQ [6] MPC on humanoid [19], [20] and quadrupeds [21], [22] have
could navigate challenging terrains autonomously; MIT Chee- shown the efficacy of MPC in planning and controlling a wide
tah robots achieved galloping [7], high speed bounding [8], variety of dynamic motions.
Most of the MPC control frameworks in the quadrupedal
Manuscript received July 16, 2020; revised November 23, 2020; accepted robot community use Euler angles as the orientation representa-
December 14, 2020. Date of publication January 13, 2021; date of current version
August 5, 2021. This work was supported in part by the NAVER LABS Corp. tion [23]–[26] since many of the locomotion tasks do not involve
under Grant 087387, in part by the Air Force Office of Scientific Research large deviation from the nominal orientation. However, Euler
under Grant FA2386-17-1-4665, in part by the National Science Foundation angles representation has the singularity issue [27] (also known
under Grant 1752262, and in part by the Mechanical Engineering Department
of Korea Advanced Institute of Science, and Technology (KAIST). This article as Gimbal lock), which requires the motions to avoid singular
was recommended for publication by Associate Editor R. Vasudevan and Editor orientations of the Euler angle representation. This disadvantage
E. Yoshida upon evaluation of the reviewers’ comments. (Corresponding author: of Euler angles representation restricts the quadrupedal robot
Hae-Won Park.)
Yanran Ding and Chuanzheng Li are with the Department of Mechanical from executing motions such as climbing up near-vertical cliff
Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, as the mountain goat or the acrobatic Parkour as the trained
IL 61820 USA (e-mail: yding35@illinois.edu; cli67@illinois.edu). dogs. That is because these motions are highly likely to involve
Abhishek Pandala is with the Department of Mechanical Engineering, Vir-
ginia Polytechnic Institute and State University, VA 24061 USA (e-mail: passing through the vicinity of singularity. Quaternion [28] is a
agp19@vt.edu). singularity-free representation, but as mentioned in [29], quater-
Young-Ha Shin and Hae-Won Park are with the Department of Mechanical nions have two local charts that cover the special orthogonal
Engineering, Korea Advanced Institute of Science and Technology, Daejeon
34141, South Korea (e-mail: shsin000@kaist.ac.kr; haewonpark@kaist.ac.kr). group SO(3) twice. This ambiguity could cause unwinding
This article has supplementary material provided by the authors and color phenomenon [29], where the body may start arbitrarily close
versions of one or more figures available at https://doi.org/10.1109/TRO.2020. to the desired attitude and yet rotate through large angles be-
3046415.
Digital Object Identifier 10.1109/TRO.2020.3046415 fore reaching the desired orientation. Widely adopted by the
1552-3098 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1155
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1156 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1157
where the higher order variation terms are neglected. Using the
B. Variation-Based Linearization
properties of cross product in [49, Table 2.1], the following
Although the NMPC (1) could be solved to obtain the control equality:
input, the presence of local optimum resulting from the non-
linear dynamics complicates the solution process. Furthermore,
δR k ω op = δRk ω̂ op − ω̂ op δRk (10)
convoluted nonlinear optimization does not lend itself well to
embedded implementations. To meet the real-time constraint, we is used to derive the expression of Ṙk
strive to formulate the MPC as a QP that can be efficiently solved
Ṙk = Rop ω̂ op + Rop ω̂ op δRk + Rop
δω k . (11)
on embedded systems. A variation-based linearization scheme
for the rotation matrix is proposed to linearize the nonlinear The dynamics of angular velocity ω̇ k is linearized as
rotational dynamics. The error on the non-Euclidean SO(3)
manifold is approximated by the corresponding variation [46]
B
I ω̇ k = RTop τ op + δRTk τ op + RTop δτ k +
with respect to the operating point. Then the variation dynamics ˆ k B Iω op − ω̂ op B Iδω k
− ω̂ op B Iω op − δω (12)
is derived based on the system model (4) in the manner of [33].
Recent work [47] achieved underactuated two-leg balancing on in which δτ k is the variation of the net torque
MIT Mini Cheetah using variational-based linearization on the 4 4
SO(3) manifold.
Assuming that the predicted variables are close to the op- δτ k = ûi,op δpk + r̂ i,op · δui,k (13)
erating point, the variation of the rotation matrix on so(3) i=1 i=1
could be approximated by δR using the derivative of the error where uop is GRF applied at the current step and δu is the
function on SO(3) as in [48]. The variation δR ∈ so(3) is a local variation of GRF from uop . Note that the GRF applied at the
approximation of the displacement between two points on the next time step is uop + δu1 .
SO(3) manifold. The rotation matrix at the kth prediction step The linearized dynamics will be used as affine dynamics as
is approximated using the first-order Taylor expansion of matrix well as in the construction of objective function in the MPC
exponential map formulation.
Rk ≈ Rop exp(δRk ) ≈ Rop (I + δRk ) (5)
C. Vectorization
where we use the commutativity of small rotations based on the
After the dynamics of rotation matrix R and angular velocity
assumption of δR being small.
ω are linearized, the matrix variables in (11) and (12) are still
The nonlinear dynamics of the rotation matrix is given as
difficult to be formulated into the standard QP form. This section
Ṙ = RB ω̂ (6) proposes a vectorization technique which uses Kronecker prod-
uct [50] to transform matrix–matrix products into matrix–vector
where the first-order approximation of rotation matrix Rk is products.
presented in (5). To get a linear approximation for Ṙ, we define Let us define a vector ξ ∈ R3 such that the skew-symmetric
the variation of angular velocity δω k as matrix ξ̂ = δR ∈ so(3) is an element in the tangent space at
δω k = ω k − RTk Rop ω op (7) the operating point. Let N ∈ R9×3 be a constant matrix such
that vec(v̂) = N v ∀v ∈ R3 , where vec(·) is the vectorization
where the transport map ω op → RTk Rop ω op enables comparison function. Then vec(δR) = N · ξ. The second and third terms
between tangent vectors at different points. This procedure of (11) could be vectorized as
is required because the tangent vectors Ṙk ∈ TRk SO(3) and
Ṙop ∈ TRop SO(3) lie in different tangent spaces and cannot vec(Rop ω̂ op δRk ) = (I ⊗ Rop ω̂ op )N ξ k
be compared directly, where TRop SO(3) refers to the tangent
space of SO(3) at Rop . Hence, the angular velocity ω could be δω k ) = (I ⊗ Rop )vec(
vec(Rop δω k ) (14)
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1158 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
where ⊗ is the Kronecker tensor operator. To derive the expres- The discrete dynamics of ω is propagated using forward Euler
sion for vec(
δω k ), one plugs (5) into (7) scheme ω k+1 = ω k + dtω̇ k .
vec(
δω k ) = N (ω k − ω op + ω̂ op ξ k ). (15) D. Discrete-Time Affine Dynamics
The vectorized version of (11) is The SRB model introduced in Section II-A has nonlinear
dynamics in R and ω. Hence, a variation-based linearization
vec(Ṙk ) = C cξ + C ξξ ξ k + Cω
ξ ωk (16)
scheme is proposed in Section II-B to linearize the nonlinear
where the constants are defined as dynamics. Section II-C presents a vectorization method to re-
formulate the matrix variables into the vector form. Based on
C cξ = vec(Rop ω̂ op ) − (I ⊗ Rop )N ω op
the aforementioned procedures, the new set of state and control
C ξξ = (I ⊗ Rop ω̂ op )N − (I ⊗ Rop )N ω̂ op vectors are defined as
xk := [pTk ṗTk ξ Tk B
ω Tk ]T ∈ R12
ξ = (I ⊗ Rop )N .
Cω (17)
The discrete orientation dynamics in terms of ξ is derived δuk := [δuT1,k δuT2,k δuT3,k δuT4,k ]T ∈ R12 (25)
by propagating the rotation matrix using the forward Euler where the new control input δui ∈ R3 is the variation of GRF
integration scheme from the operating point ui,op for the ith leg.
Rk+1 = Rk + Ṙk dt (18) By assembling the corresponding terms from (21) and (23)
into matrix form, the discrete-time affine dynamics could be
where dt is the MPC sampling time. When the rotation matrix expressed in the state-space form
is approximated by the first-order expansion in (5), the above
expression could be simplified to the following form: xt+k+1|t = A|op · xt+k|t + B|op · δut+k|t + d|op (26)
δRk+1 = δRk + RTop Ṙk dt. (19) where A|op ∈ R n×n
, B|op ∈ R , and d|op ∈ R are matri-
n×m n
The angular velocity dynamics in (12) is also vectorized. The E. Cost Function
second term in (12) is δRTk τ op = (I ⊗ τ Top )N ξ k and δτ k is
defined in (13). The last two terms could be further derived The cost function in the NMPC formulation (1) includes both
terminal and stage costs. In this work, the cost is set as a quadratic
ˆ k B Iω op − ω̂ op B Iδω k
−δω function that penalizes deviation from the reference trajectories.
The stage cost is
= (B
Iω op − ω̂ op B I)δω
(xk , uk ) = ||xk − xd,k ||2Qx + ||uk − ud,k ||2Ru (27)
= (B
Iω op − ω̂ op I)(ω k − ω op − ω̂ op ξ k ).
B
(22)
where ||x||2Q is a shorthand notation of the matrix norm xT Qx
Assembling these expressions into (12) and rearranging the where Q is a positive definite matrix; xd,k and ud,k are the
corresponding terms gives the vectorization of B I ω̇ k desired state and control at the kth prediction step; Qx and Ru
B
I ω̇ k = C ω̇ + C δp ξ ω δu are the block diagonal positive definite gain matrices for state
ω̇ pk + C ω̇ ξ k + C ω̇ ω k + C ω̇ δuk (23)
and control, respectively. The first term in (27) consisting of the
where error functions of the state vector could be decomposed as
C cω̇ = − ω̂ op B Iω op + RTop τ op − (B
Iω op − ω̂ op B I)ω op ||xk − xk,d ||2Q
− RTop ûop pop = ||epk ||2Qp + ||eṗk ||2Qṗ + ||eRk ||2QR + ||eωk ||2Qω (28)
where Qp , Qṗ , QR , Qω are diagonal positive definite weighting
δp
T i
C ω̇ = Rop ûop matrices; epk , eṗk are the error terms for deviations from the
i corresponding reference trajectories pdk , ṗdk constructed from the
user input. The error function for the rotation matrix and angular
C ξω̇ = I ⊗ τ Top N − (B
Iω op − ω̂ op B I)ω̂ op velocity are given by [32] as
eRk = log(RTd,k · Rk )∨
ω̇ = Iω op − ω̂ op I
Cω B B
(29)
1 2 3 4
C δu T
ω̇ = Rop [r̂ op , r̂ op , r̂ op , r̂ op ]. (24) eωk = ω k − RTk Rd,k ω d,k (30)
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1159
where Rd,k and ω d,k are the desired rotation matrix and angular where the z-axis is aligned with the normal vector of the ground
velocity trajectories. The terminal cost function is similarly and x, y are two axes orthogonal to each other that lie in the
defined. tangent plane at the contact point; uz,min i,k and uz,max
i,k are the
In the stage cost expression (28), all the error terms are in minimum and maximum normal forces at the kth predicted step
linear form of the state and control vectors except the error of for leg i. If leg i was in swing phase, then the value for both
orientation eRk , which involves matrix logarithm map as shown lower and upper bounds are set to zero so that the swing leg
in (29). A linear approximation of the nonlinear error term on the controller takes over and guides the foot toward the next foothold
rotation matrix is used. Taking the hat map on (29) and applying position. It could be observed that (36) denotes an intersection of
the matrix exponential map give a finite set of closed halfspaces in R3 . Hence, Ui is a polyhedron.
Similarly, the feasible force set U is also polyhedral.
exp(êRk ) = RTd,k Rk ≈ RTd,k Rop exp(ξ̂ k ) (31)
where the same approximation is made here as in (5). Taking G. Quadratic Program Formulation
the matrix logarithm of (31) and then applying the vee map give
Given the convex quadratic cost function from Section II-E,
the approximate error term on Rk
the affine dynamics from Section II-D, and linear force con-
eRk = log(RTd,k · Rop )∨ + ξ k . (32) straints from Section II-F, the general NMPC problem (1) could
be be reformulated as a QP
The error function defined in (32) is in linear form of the state
N −1
variable ξ k since both RTd,k and Rop are known matrices. The
error function could be interpreted as the sum of 1) the geodesic min. γ N T (xt+N |t ) + γ k (xt+k|t , δut+k|t ) (37a)
k=0
between Rop and Rd,k and 2) the vector ξ k which lies in the
tangent space at Rop . The orientation error function Ψ on R s.t. xt+k+1|t = Axt+k|t + Bδut+k|t + d (37b)
could therefore be defined as
δut+k|t ∈ Uk , k = 0, . . . , N − 1 (37c)
Ψ(ξ k ) = eTRk QR eRk = ||eRk ||2QR (33)
xt|t = x(t) = xop (37d)
which is in quadratic form of ξ k . Given that the weighting
where the cost function is defined in (27); the decay rate factor
matrix QR is positive definite, the orientation error function
γ ∈ (0, 1] discounts cost further from the current moment. The
Ψ is positive definite.
definition of the affine dynamics could be found in (26) and
The terminal cost T is similarly defined as (28) with terminal
the force constraint is expressed in (36). Note that we lifted the
gains. The cost of control is constructed as
explicit constraints on the state vectors but instead relied on the
||uk − ud,k ||2Ru = ||uop + δuk − ud,k ||2Ru (34) cost function for the state regulation. The QP in (37) could be
rewritten in a more compact form. Following the formulation
where δuk is the kth predicted variation from the operating point
in [52], the new optimization variable z is constructed as
uop .
z = [δuT0 , xT1 , . . . , δuTN −1 , xTN ]T ∈ R24N (38)
F. Force Constraints
such that (37) could be transcribed into the standard QP
The force constraints are imposed to ensure that the solved form [53]
GRF are physically feasible. When the foot is in contact with the
1 T
ground, the normal force should be nonnegative and the tangent minimize z P z + cT z
forces should lie within the friction cone, which is prescribed as 2
subject to Aineq · z ≤ bineq
{ui |uni ≥ 0, ||uti ||2 ≤ μ|uni |} (35)
Aeq · z = beq (39)
where μ is the coefficient of friction; superscripts (·)t and (·)n
indicate tangential and normal force components, respectively. where P ∈ RN (n+m) is a symmetric positive definite matrix
|| · ||2 is the 2-norm and | · | takes the absolute value of a scalar. assembled from the gain matrices Qx , Ru ; the inequality con-
Since the friction cone constraint is a second-order cone straint Aineq · z ≤ bineq imposes the force constraints; the equal-
constraint, it is not admissible to the QP formulation with linear ity constraint Aeq · z = beq respects the linear dynamics.
constraints. Instead, the conservative friction pyramid [51] is
used as an approximation of the friction cone. In addition, the III. NUMERICAL RESULTS
normal force is bounded to ensure that the commanded torque
This section first defines a function that quantifies the close-
does not exceed the actuator limits. The feasible control set U
ness to singularity for Euler angles in the numerical sense. Then
in (1) is defined as
we present the simulation results of RF-MPC stabilizing various
x/y x/y
Ui : = {δui ||ui,op + δui,k | ≤ μ|uzi,op + δuzi,k |, periodic gaits and an aperiodic 3-D acrobatic maneuver. Fur-
thermore, RF-MPC is compared with the MPC that uses Euler
uz,min
i,k ≤ uzi,op + δuzi,k ≤ uz,max
i,k , angles as orientation representation (EA-MPC) in the acrobatic
maneuver. The resulting QPs from the MPC formulation in all
uz,min
i,k ≥ 0} (36) of the simulations are solved using MATLAB quadprog. Gain
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1160 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
TABLE II
COST FUNCTION WEIGHTS FOR THE SIMULATIONS AND EXPERIMENTS. THE VALUES IN PARENTHESIS REPRESENT WEIGHTS ON THE TERMINAL COSTS
Note: Tst , Tsw , and Tpred all have the unit of [s]; Nhor is the MPC prediction horizon; Tpred is the prediction time step; fMPC is the MPC control frequency with the unit of [Hz].
*
0.13 s is the front stance time, and 0.3 s is the hind stance time.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1161
Fig. 4. Simulation data of walking trot where the robot starts from static pose
and accelerates in the x-direction. (a) CoM velocity; the robot accelerates at
0.5 m/s2 and reaches the desired velocity of 0.5 m/s in the x-direction. (b) Fig. 5. Simulation data of bounding. (a) Phase portrait of body angle and an-
Orientation in terms of the log map of the rotation matrix. (c) Vertical ground gular velocity in the y-axis. (b) Velocity tracking performance in the x-direction.
reaction forces of four legs. (c) Vertical GRFs of FL and HL legs.
the CoM deviation at the end of the simulation is plotted for both phase. Similar to the walking trot, the robot is commanded to
RF-MPC and EA-MPC. As could be observed in Fig. 3(b), while start from the static pose and accelerates at 1.0 m/s2 to reach the
RF-MPC remains stable, EA-MPC is significantly affected by final velocity of 2.5 m/s.
singularity once |log(RTs Rd )∨ | < 0.3 rad, which corresponds Fig. 5(a) presents the phase portrait of angle and angular
to κ−1 (TΘ ) < 0.15. velocity along the y-axis. It shows that the motion converges
to a periodic orbit. The velocity tracking performance shown
B. Walking Trot in Fig. 5(b) demonstrates that the MPC controller is capable of
The data of a walking trot simulation is shown in Fig. 4. The stabilizing bounding velocity up to 2.5 m/s. The vertical GRF
robot starts from a stationary pose and accelerates in the x- profiles of legs FL and HL are shown in Fig. 5(c).
direction until it reaches the final velocity of 0.5 m/s. As could be
seen in Fig. 4(a), the velocity in the x-direction reaches 0.5 m/s D. Aperiodic Complex Dynamic Maneuver
and the velocity deviation for all directions is within ±0.1 m/s.
A complex acrobatic dynamic maneuver is presented in this
Fig. 4(b) shows that the orientation deviation in all directions is
section to demonstrate that RF-MPC is capable of controlling
bounded within ±0.02 rad. The vertical GRFs of all four legs are
aperiodic dynamic motions that involve orientations that corre-
shown in Fig. 4(c). Further details about the generation of the
spond to singularities in Euler angle representation. RF-MPC
reference trajectory for trotting could be found in Section III-F.
is benchmarked with an MPC controller with Euler angles for
The simulation is set up such that at each sampling time,
its orientation representation. In addition, initial condition is
the control input is applied to the original nonlinear model (4)
perturbed to investigate the robustness of RF-MPC.
simulated using MATLAB ode45 to integrate the dynamics. The
Fig. 6(a) shows the reference trajectory of the acrobatic
gait pattern in the walking trot simulation is executed using a
motion, which is a backflip with a twist. The reference CoM
time-based schedule.
trajectory is colored black and the reference poses are shown in
blue. The robot initially stands on a slope with the slope angle
C. Bounding of 45◦ , with the front of the robot facing upwards and body
To demonstrate the capability of RF-MPC to stabilize dy- parallel to the slope. The stance phase of this acrobatic jump
namic motions with large body attitude oscillation, the bounding consists of 0.1 s of all four feet in contact, followed by 0.1 s
simulation is presented. The bounding motion involves an aerial of only the hind feet pair in contact with the slope. After the
phase when all four feet lose contact with the ground. To stabilize stance phase, all four feet are airborne and the robot enters the
the bounding motion, the reference trajectory is designed based aerial phase for 0.3 s before landing. The landing direction of the
on the impulse-scaling principle [8] to preserve the periodicity robot is facing away from the slope. The feed-forward GRF and
of the gait. Details about the generation of reference trajectory the dynamically feasible reference trajectory are generated by
could be found in Section III-F3. Here, we kindly note that solving an off-line TO problem, where the slope is set to be 45◦ .
an elaborate reference trajectory is optional for RF-MPC to Further details about the generation of the reference trajectory
stabilize bounding. While a trivial reference such as that used is provided in Section III-F.
for trotting also works, the reference trajectory presented in As could be observed from Fig. 6(a), when the robot ap-
Section III-F3 enables bounding motions with longer aerial proaches the singularity, EA-MPC becomes unstable and exerted
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1162 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1163
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1164 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Fig. 8. Overview of the control system. The user sends commands to the on-board computer (blue), where the finite state machine schedules the gait and sends
desired trajectories to the MPC block to formulate the QP. The custom QP solver qpSWIFT solves for the uop and sends it to the FSM. The FSM combines the
stance and swing forces and send to the joint controller (green), which maps leg forces to joint torque and send to the BLDC motors. The state estimator (green)
receives sensor signals for the MPC formulation of the next cycle. The previous solution of the QP u− op is sent to the MPC as the control at the operating point.
Fig. 10. Illustration of the hardware platform. (a) CAD model of the mechan-
Fig. 9. Schematics of the finite state machine (FSM). The gait planner sends ical components of the leg module, which includes ABAD, HIP, and KNEE
to all legs the timing schedules; the normalized variable si is the percentile modules and linkages of the leg. (b) Picture of the assembled quadruped platform,
completion of the current state. Δj , j ∈ {st, sw} are the reset maps and Gi are which integrates a computer, sensors including an IMU and 12 encoders.
the guard sets.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1165
module and the picture of the assembled quadruped platform E. State Estimation
with all the parts integrated.
Kalman filter [63] has been applied for a range of applica-
tions in legged robots. Meanwhile, simple linear single-input
C. Computation Setup single-output complementary filters [64] have been proven to
The MPC framework is implemented using Simulink Real- work robustly in practice [65], [66]. The complementary filter
Time (SLRT). The encoder readings and lower-level kinematics performs low-pass filtering on a low-frequency estimation and
calculations are carried out at a base rate of 4 kHz, while the IMU high-pass filtering on a biased high-frequency estimation. For
signals are received and state estimation is performed at 1 kHz. instance, the CoM velocity ṗ is obtained by combining the
The user input from the joystick is updated at 23 Hz, and the QP CoM velocity estimate from leg kinematics data ṗenc and the
is solved between 160 and 250 Hz depending on the size of the accelerometer readings aacc from the on-board IMU
problem. The proposed QP (44) is solved by a custom QP solver
ṗk+1 = ṗk + ak · Δt
qpSWIFT [58] for all the experiments. Written in ANSI C, the
k − K p (ṗk − ṗk )
v
solver is library-free while it interfaces with SLRT through a ak = aacc enc
(51)
gateway s-function. An RF-MPC with prediction horizon of 6
entails solving a QP with 144 variables, 72 inequality, and 72 where ṗk is the estimated velocity from the previous iteration;
equality constraints. the subscript (·)k is the discrete time index; Δt is the IMU
sampling period; K vp is a positive diagonal gain matrix; aacc k is
the accelerometer reading; ṗenc
k is the average of all the velocities
D. Swing Leg Control
from contact feet to CoM based on kinematic calculations.
Since the leg mass is less than 10% of the total mass of Similarly, the CoM position p ∈ R3 is estimated by fusing the
the robot and the motor inertia is much smaller than body CoM position estimate from leg position kinematics penc and
inertia, the inertia effect of the legs could be neglected during the estimated CoM velocity ṗ
stance. Nevertheless, leg inertia is considered when designing
the swing leg controller. The swing leg is modeled as a 3-link F. Contact Detection
serial manipulator attached to a stationary base. The swing leg
controller consists of feed-forward and feedback terms, where Contact sensing plays a crucial role in legged locomotion.
the former is based on the workspace inverse dynamics However, conventional force estimation is fragile and noisy,
which is not suitable for dynamic locomotion applications.
τ ffsw = D(q)J −1 (afx − J̇ q̇) + h(q, q̇) (47) Proprioceptive sensing [67] is utilized in this work because of
the highly transparent actuation design. We use the generalized
where τ ffsw is the feed-forward torque; D(q) is the Inertia matrix
momenta based disturbance observer [68], which only requires
and h(q, q̇) includes the centrifugal, Corolis, and gravitational
proprioceptive measurements q, q̇ and the commanded torque
terms of the swing leg; q, q̇ are the joint angle and velocity
τ . In this work, only the knee joints are considered in contact
vectors; J is the leg Jacobian matrix and J̇ is its time derivative;
detection based on the assumption that the knee joint momentum
afx is workspace acceleration vector, which is defined as
is changed the most by the contact impact. The residual vector
afx = p̈fd + K fp f (pfd − pf ) + K fd f (ṗfd − ṗf ) (48) r k is defined as
k
where p̈fd is the desired foot workspace acceleration; pfd , ṗfd r k = K I · I q̇ k −
kn kn kn
(τ i + r i−1 )Δt , r 0 = 0 (52)
are the desired foot position and velocity; K fp f , K fd f are the i=1
position and velocity gain matrices. The full swing leg controller
consists of both feed-forward and feedback terms where r k ∈ R4 is the residual vector for the four legs; k is the
index for the current instance; K I is a diagonal gain matrix; I kn
τ sw = τ ffsw + K fp b (pfd − pf ) + K fd b (ṗfd − ṗf ) (49) is the diagonal inertia matrix for all the knee joints; q̇ kn
k is the
vector of knee joint velocity; τ kn k is the commanded knee torque;
where K fb fb
p and K d are the position and velocity gain matrices r 0 is the initial value of the residual. The summation accumulates
for the feedback term of the swing leg controller. all the previous residuals and the commanded torque. Contact is
The desired foot placement is a linear combination of a declared when the residual vector r k exceeds a threshold value
velocity-based feed-forward term and a capture-point [62] based rth .
feedback term.
T z0h h G. System Identification
f st
pstep = ph + ṗhd + (ṗ − ṗhd ) (50)
2 g 1) Friction Compensation: The gear ratio of the planetary
gearbox in each motor module is 23.36:1, which is higher than
where pfstep is the desired step location on the ground plane; ph is other quadruped robots with proprioceptive actuation scheme,
the projection of the hip joint on the ground plane and ṗh is the including MIT Cheetah 3 (7.67:1) [9], Mini Cheetah (6:1) [13],
corresponding velocity; ṗhd is the desired hip velocity projected and Minitaur (1:1) [4]. Due to the relatively higher gear ratio,
on the ground plane; Tst is the stance time; g is the gravitational the friction induced by the gearbox and bearing is modeled and
acceleration constant; z0h is the nominal hip height. compensated for more accurate force control. Following [13],
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1166 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1167
Fig. 14. Running trot experiment data. Zoomed-in views placed at the right
of the figure show the details of the signals. (a) Reference and measured CoM
vertical velocity in the z-direction. (b) Vertical GRF. (c) Knee torque.
Fig. 13. Walking trot experiment data. (a) Velocity tracking in the x-direction.
(b) Velocity tracking in the y-direction. (c) Orientation deviations along the x-
and y-axes, where the reference is 0. (d) Vertical GRF uz for front legs.
B. Walking Trot
To demonstrate that RF-MPC can stabilize basic locomotion
gaits, the walking trot experiment is presented. The robot could
move in any direction parallel to the ground while maintaining
the body orientation. Fig. 13(a) and (b) exhibits the velocity
tracking performance of the controller, and Fig. 13(c) shows that
the orientation deviation is kept small (within ±0.06 rd) during
the walking trot experiment; Fig. 13(d) presents the vertical GRF
during the walking trot. The velocities are measured from the
state estimation.
Fig. 15. Bounding experiment data. (a) Sequential snapshots of the robot in
C. Running Trot and Bounding a bounding experiment. (b) Orientation tracking in the y-direction log(R)∨y . (c)
To investigate the performance of RF-MPC for dynamic gaits, Angular velocity tracking in the y-direction B ωy . (d) Vertical GRF.
experiments of running trot and bounding gaits with full aerial
phases are conducted. Fig. 14 presents the running trot experi- The bounding gait leverages the full dynamics of the robot and
ment data, where Fig. 14(a) shows that the vertical CoM velocity involves extensive body pitch oscillation. A sequential snapshots
experiences 40 ms free fall during the aerial phase. Fig. 14(b) of the bounding experiment could be found in Fig. 15(a). The
shows that the robot could produce abruptly changing GRF as the robot starts from a static pose and the MPC stabilizes the
contact condition changes. During the trot running experiment, robot to follow the desired GRF and state trajectories. More
the vertical GRF could reach as high as 60 N while the knee details about reference trajectory generation could be found in
torque goes up to 6.3 Nm, as could be observed in Fig. 14(b) Section III-F. The reference and measured trajectories of ori-
and (c), respectively. entation and angular velocity in the y-direction are shown in
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1168 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Fig. 16. Quadruped robot Panther performing a controlled backflip that passes through singularity pose. (a) Image sequence of the robot executing the controlled
backflip, with its front legs launching from a 80-mm high platform. (b) Body orientation reconstructed from the experimental data. The rotation matrix is represented
by the body coordinate frame axes (x-blue, y-red, and z-green); the Euler angles are visualized by the three colored rings with arrow (φ-blue, θ-red, and ψ-green).
The robot passes through the upright pose (a3) while the hind legs are in contact with the ground. The Gimbal lock effect is shown in (b3) where axes of Euler
angles are aligned.
Fig. 15(b) and (c). Since the robot started from a static pose,
there is an initial offset. The vertical GRF profile is shown in
Fig. 15(d). The transition from swing to stance phase occurs
when a touchdown event is declared by the contact detection
algorithm described in Section IV-F.
D. Controlled Backflip
To demonstrate the capability of RF-MPC to control dynamic
maneuvers that involve singularity poses, a controlled backflip
experiment is presented. As shown in Fig. 1, the robot flips
backwards around the y-axis, passing through the pose where the
robot is upright, before it lands with the upside-down orientation.
Note that even though the reference trajectory is generated based
on a 2-D model of the robot as presented in Section III-F,
RF-MPC controls the 3-D robot in the controlled backflip ex-
periment without resorting to decomposition of sagittal plane
motion and out-of-plane motion.
The image sequence in Fig. 16(a) is plotted along with the
body orientation, which is reconstructed from the experiment
data as shown in Fig. 16(b). The rotation matrix is represented
by the body coordinate frame axes (x-blue, y-red, and z-green);
the three color-coded rings correspond to the Euler angles with
the roll–pitch–yaw sequence convention (roll-blue, pitch-red,
and yaw-green). Note that the robot is at the singular pose at
around 300 ms as shown in Fig. 16(a3). In the corresponding
body orientation plot, the axes of the rings for Euler angles
almost coincide. Therefore, the robot indeed passes through the
singularity pose when the RF-MPC is actively controlling the
hind legs to track the reference trajectory. To the best of the
authors’ knowledge, this is the first instance of hardware ex- Fig. 17. Experimental data from 10 controlled backflip trials. The solid lines
perimental implementation of MPC to control acrobatic motion are the average (avg.) of all the tests, and the shaded tube is the range within one
which involves singularity. standard deviation. The black dash-dot curves in (a) and (c) are from the case
where the initial condition of the backflip is perturbed (pert.). The deep-shaded
Fig. 17 presents the data from the controlled backflip exper- area is when four legs are in contact; the light-shaded area is when hind legs are
iment, where the robot goes through three phases. The deep- in contact, and the nonshaded area is when all legs are in the impedance control
shaded area corresponds to the phase when all four legs are phase. (a) CoM height. (b) Knee torque of legs FL and HL. (c) Comparison
between the pitch angle θ and rotation matrix log(R)∨y . (d) Function κ−1 (TΘ )
in contact; the light-shaded area indicates the phase when only indicates that the robot indeed encountered the singular pose in the controlled
the hind legs are in contact; the nonshaded area corresponds to backflip experiment.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1169
the landing phase. The RF-MPC controller is activated during using the rotation matrix, which presumes SO(3) structure,
the first two phases, and an impedance control is utilized in the linearizing around the operating point guarantees an accurate
third phase. Experiment data gathered from 10 backflip trials are dynamics model for predicted states that are close to the oper-
shown in Fig. 17, where the solid lines are the mean values of ating point. In comparison, when the orientation deviation from
all the tests, and the shaded tube is the value within one standard the reference trajectory is large, the dynamics linearized about
deviation. the reference no longer provide a realistic local approximation
As could be observed from Fig. 17(d), the robot passes of the original nonlinear dynamics. This phenomenon is shown
through the neighborhood of singularity as the number intro- in Fig. 7(e), where the prediction error of the rotation matrix
duced in Section III-A drops below the threshold 10−1 . Fig. 17(c) in scheme 1 became large, which leads to the failure of the
shows that the pitch angle θ is not monotonic throughout the controller. The decay rate γ is used in both schemes to discount
controlled backflip while log(R)∨y decreases monotonically. The the effect of states that are farther in the future, where the
dash-dot curves in Fig. 17(a) and (c) are from the experiment linearized model is less accurate.
trial where the initial state of the robot is perturbed. Specifically, A limitation of the proposed RF-MPC formulation is that the
the height of the stage on which the front legs are positioned predicted rotation matrices constructed from ξ are not elements
is increased from 80 to 130 mm. It could be observed that of the SO(3) manifold since the first-order approximation is
while in this case, the trajectory of the robot deviates more than unable to fully capture the SO(3) structure. Hence, the predic-
one standard deviation from the average; RF-MPC could still tion error is more pronounced for a longer prediction horizon.
stabilize the motion and land safely. Currently, prediction horizon remains an important design pa-
rameter with a tradeoff between the predictive ability of MPC
and the accuracy of the linearized dynamics model. Prediction
VI. DISCUSSION
horizon being too long leads to inaccurate rotational dynamics,
In this section, we briefly discuss some of the important while being too short leads to myopic behaviors. To mitigate this
aspects of this work. That includes the interpretation of some of issue, we envision a hierarchical framework with multiple MPCs
the experiment results, discussion of the findings, and limitations running at different rates. Specifically, an MPC with simpler
of this work. model and longer prediction horizon could be running at lower
In this work, the proposed RF-MPC uses the rotation matrix to update rate, while the RF-MPC with shorter prediction horizon
represent the orientation, which is capable of stabilizing dynamic could be running at a higher rate.
motion in 3-D that involves singularity in the Euler angle formu- The larger deviation toward the end of the bounding motion
lation. Specifically, Section V-D presents the controlled backflip may be caused by the simple state estimation and contact de-
experiment, where RF-MPC stabilized the robot to perform tection algorithms presented in Sections IV-E and IV-F, respec-
an acrobatic maneuver that passes through the singularity. The tively. The tracking error in Figs. 11, 13, and 15 could also be
simulation result in Section III-D suggests that the function value affected by these reasons since the velocities were measured
dropping below the threshold would result in the failure of the from state estimation instead of external sensors.
EA-MPC. Another problem of EA-MPC is shown in Fig. 17(c),
where the pitch angle θ decreased until − π2 and went back to 0 rd.
Notice how the monotonicity changed as the value of κ−1 (TΘ ) VII. CONCLUSION
went below the threshold of 10−1 s shown in Fig. 17(d). In In this article, we presented an RF-MPC framework that
contrast, log(R)∨y monotonically decreased to −π. Note that directly represents orientation using the rotation matrix instead
though the motion of the controlled backflip remains in the of using other orientation representations. Despite the local
sagittal plane, RF-MPC is stabilizing the 3-D motion without validity of linearized dynamics on the rotation matrix, this
decomposing the motion into in-plane and out-of-plane parts. approach introduced the possibility to stabilize 3-D complex
This experiment is our initial demonstration of the RF-MPC acrobatic maneuvers that involve singularities in the Euler an-
framework that could potentially open up the possibilities of gles formulation. By directly working on the rotation matrix,
controlling legged robots to perform the extremely agile motions this method avoided issues arising from the usage of other
as shown in [3]. representations such as unwinding phenomenon (quaternion) or
One of the findings is that a simulation case study shown singularity (Euler angles). The application of a variation-based
in Section III-E suggests that within the RF-MPC framework, linearization scheme and a vectorization routine linearized the
linearizing around the operating point (scheme 2) provides nonlinear dynamics and transformed the matrix variables into
more robust behavior compared with linearizing around the vector variables. The deliberate construction of the orientation
reference trajectory (scheme 1). The result is counterintuitive error function enabled us to formulate the MPC into the standard
because scheme 1 uses time-varying Jacobian matrices Ak , B k QP form.
parametrized by the reference trajectory within the prediction We reported both simulation and experiment results of the
horizon, while scheme 2 uses matrices parameterized only by the RF-MPC controller applied on the quadruped Panther robot.
operating point A|op , B|op throughout the prediction horizon. In the simulation case study presented in Section III-E, we
Namely, scheme 1 utilizes more information than scheme 2. Our found out that in the RF-MPC framework, linearizing around
conjecture for the reason why scheme 2 provides more robust- the operating point provides a more robust control strategy
ness is stated as follows. Since RF-MPC represents orientation compared with linearizing around the reference trajectory.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
1170 IEEE TRANSACTIONS ON ROBOTICS, VOL. 37, NO. 4, AUGUST 2021
Experiments including pose/balance control, walking/running [17] X. Xiong and A. Ames, “Sequential motion planning for bipedal som-
trot, and bounding were conducted on the robot. In addition, ersault via flywheel slip and momentum transmission with task space
control,” 2020, arXiv:2008.02432.
the controlled backflip experiment demonstrated that RF-MPC [18] J. Hwangbo et al., “Learning agile and dynamic motor skills for legged
controller can stabilize dynamic motions that involve the sin- robots,” Sci. Robot., vol. 4, no. 26, 2019, Art. no. eeau5872.
gularity. By utilizing a custom QP solver qpSWIFT, the MPC [19] A. Herdt, H. Diedam, P.-B. Wieber, D. Dimitrov, K. Mombaur, and
M. Diehl, “Online walking motion generation with automatic footstep
could reach control frequency as high as 250 Hz. placement,” Adv. Robot., vol. 24, no. 5-6, pp. 719–737, 2010.
This novel RF-MPC framework is likely to open up possi- [20] J. Koenemann et al., “Whole-body model-predictive control applied to the
bilities for quadruped robots and legged robots in general to HRP-2 humanoid,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2015,
pp. 3346–3351.
realize extremely dynamic 3-D motions. We also envision to [21] M. Neunert et al., “Whole-body nonlinear model predictive control
equip the robot with special end-effectors (e.g., climbing robot through contacts for quadrupeds,” IEEE Robot. Autom. Lett. vol. 3,
with claws [70] or magnetic grippers), enabling it to climb no. 3, pp. 1458–1465, Jul. 2018.
[22] R. Grandia, F. Farshidian, A. Dosovitskiy, R. Ranftl, and M. Hutter,
up vertical surfaces and walk on the ceiling. Moreover, with “Frequency-aware model predictive control,” IEEE Robot. Autom. Lett.,
the emergence of powerful and light-weight computing units, vol. 4, no. 2, pp. 1517–1524, Jan. 2019.
the variation-based formulation could potentially be applied to [23] R. Grandia, F. Farshidian, R. Ranftl, and M. Hutter, “Feedback MPC
for torque-controlled legged robots,” in Proc. IEEE/RSJ Int. Conf. Intell.
stabilizing acrobatic maneuvers in UAVs. Robots Syst., 2019, pp. 4730–4737.
[24] C. Mastalli, I. Havoutis, M. Focchi, D. G. Caldwell, and C. Semini,
ACKNOWLEDGMENT “Motion planning for quadrupedal locomotion: Coupled planning, terrain
mapping, and whole-body control,” IEEE Trans. Robot., vol. 36, no. 6,
The authors would like to thank Prof. Patrick Wensing for pp. 1635–1648, Dec. 2020.
[25] S. Fahmi, C. Mastalli, M. Focchi, and C. Semini, “Passive whole-body
the insightful discussion, Prof. João Ramos for his advice and control for quadruped robots: Experimental validation over challenging
support, and Jaejun Park for his help in hardware assembly. terrain,” IEEE Robot. Autom. Lett., vol. 4, no. 3, pp. 2553–2560, Jul. 2019.
[26] M. Focchi et al., Heuristic Planning for Rough Terrain Locomotion in
Presence of External Disturbances and Variable Perception Quality.Cham,
REFERENCES Switzerland: Springer International Publishing, 2020, pp. 165–209.
[1] Y. Ding, A. Pandala, and H.-W. Park, “Real-time model predictive control [27] M. D. Shuster, “A survey of attitude representations,” Navigation, vol. 8,
for versatile dynamic motions in quadrupedal robots,” in Proc. Int. Conf. no. 9, pp. 439–517, 1993.
Robot. Autom., 2019, pp. 8484–8490. [28] B. Siciliano and O. Khatib, Springer Handbook of Robotics. Berlin,
[2] L. Aronson, The Ibex-King of the Cliffs, (in Hebrew), Tel Aviv, Israel: Germany: Springer, 2016.
Masada Press, 1982, p. 127. [29] S. P. Bhat and D. S. Bernstein, “A topological obstruction to global asymp-
[3] Alex & Jumpy - the Parkour Dog. YouTube. [Online]. Available: https: totic stabilization of rotational motion and the unwinding phenomenon,”
//www.youtube.com/watch?v=39oGCTAJ9Vw&t=117s in Proc. IEEE Amer. Control Conf., vol. 5, 1998, pp. 2785–2789.
[4] A. De and D. E. Koditschek, “Vertical hopper compositions for preflexive [30] X. Yang, Y. Chen, L. Chang, A. A. Calderón, and N. O. Pérez-
and feedback-stabilized quadrupedal bounding, pacing, pronking, and Arancibia, “Bee: A 95-mg four-winged insect-scale flying robot driven
trotting,” Int. J. Robot. Res. vol. 37, no. 7, pp. 743–778, 2018. by twinned unimorph actuators,” IEEE Robot. Autom. Lett., vol. 4, no. 4,
[5] M. Hutter et al., “ANYmal—A highly mobile and dynamic quadrupedal pp. 4270–4277, Oct. 2019.
robot,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2016, pp. 38–44. [31] C. G. Mayhew, R. G. Sanfelice, and A. R. Teel, “On quaternion-based
[6] C. Semini, N. G. Tsagarakis, E. Guglielmino, M. Focchi, F. Cannella, and attitude control and the unwinding phenomenon,” in Proc. Amer. Control
D. G. Caldwell, “Design of HyQ-a hydraulically and electrically actuated Conf., 2011, pp. 299–304.
quadruped robot,” Proc. Inst. Mech. Engineers, Part I: J. Syst. Control [32] F. Bullo and A. D. Lewis, Geometric Control of Mechanical Systems:
Eng., vol. 225, no. 6, pp. 831–849, 2011. Modeling, Analysis, and Design for Simple Mechanical Control Systems,
[7] S. Seok, A. Wang, M. Y. Chuah, D. Otten, J. Lang, and S. Kim, “De- vol. 49. Cham, Switzerland: Springer Science & Business Media, 2004.
sign principles for highly efficient quadrupeds and implementation on [33] G. Wu and K. Sreenath, “Variation-based linearization of nonlinear sys-
the MIT cheetah robot,” in Proc. IEEE Int. Conf. Robot. Autom., 2013, tems evolving on SO( 3) and S2,” IEEE Access, vol. 3, pp. 1592–1604,
pp. 3307–3312. 2015.
[8] H.-W. Park, P. M. Wensing, and S. Kim, “High-speed bounding with [34] T. Lee, M. Leok, and N. H. McClamroch, “Stable manifolds of saddle
the MIT cheetah 2: Control design and experiments,” Int. J. Robot. Res., equilibria for pendulum dynamics on S2 and SO(3),” in Proc. Decis.
vol. 36, no. 2, pp. 167–192, 2017. Control Eur. Control Conf. 50th IEEE Conf. Decis. Control Eur. Control
[9] G. Bledt, M. J. Powell, B. Katz, J. Di Carlo, P. M. Wensing, and S. Kim, Conf., 2011, pp. 3915–3921.
“MIT cheetah 3: Design and control of a robust, dynamic quadruped robot,” [35] E. S. Meadows, M. A. Henson, J. W. Eaton, and J. B. Rawlings, “Receding
in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2018, pp. 2245–2252. horizon control and discontinuous state feedback stabilization,” Int. J.
[10] P. Fankhauser, M. Bjelonic, C. Dario Bellicoso, T. Miki, and M. Hutter, Control, vol. 62, no. 5, pp. 1217–1229, 1995.
“Robust rough-terrain locomotion with a quadrupedal robot,” in Proc. [36] H. G. Bock and K.-J. Plitt, “A multiple shooting algorithm for direct
IEEE Int. Conf. Robot. Autom., 2018, pp. 5761–5768. solution of optimal control problems,” in Proc. IFAC Vol., vol. 17, no.
[11] H.-W. Park, P. M. Wensing, and S. Kim, “Jumping over obstacles with mit 2, pp. 1603–1608, 1984.
cheetah 2,” Robot. Auton. Syst., 2020, Art. no. 103703. [37] O. von Stryk, Numerical Solution of Optimal Control Problems by Direct
[12] Q. Nguyen, M. J. Powell, B. Katz, J. D. Carlo, and S. Kim, “Optimized Collocation. Basel: Birkhäuser Basel, 1993, pp. 129–143.
jumping on the mit cheetah 3 robot,” in Proc. Int. Conf. Robot. Autom., [38] R. J. Full and D. E. Koditschek, “Templates and anchors: Neuromechanical
2019, pp. 7448–7454. hypotheses of legged locomotion on land,” J. Exp. Biol., vol. 202, no. 23,
[13] B. Katz, J. Di Carlo, and S. Kim, “Mini cheetah: A platform for pushing the pp. 3325–3332, 1999.
limits of dynamic quadruped control,” in Proc. Int. Conf. Robot. Autom., [39] S. Kajita, “Study of dynamic biped locomotion on rugged terrain-
2019, pp. 6295–6301. derivation and application of the linear inverted pendulum mode,” in Proc.
[14] M. H. Raibert, Legged Robots That Balance. Cambridge, MA, USA: MIT IEEE Int. Conf. Robot. Autom., Sacramento, CA, 1991, pp. 1405–1411.
Press, 1986. [40] J. Ramos and S. Kim, “Humanoid dynamic synchronization through
[15] L. Righetti, J. Buchli, M. Mistry, M. Kalakrishnan, and S. Schaal, “Optimal whole-body bilateral feedback teleoperation,” IEEE Trans. Robot.,
distribution of contact forces with inverse-dynamics control,” Int. J. Robot. vol. 34, no. 4, pp. 953–965, Aug. 2018.
Res., vol. 32, no. 3, pp. 280–298, 2013. [41] X. Xiong and A. D. Ames, “Orbit characterization, stabilization and
[16] M. Hutter, C. Gehring, M. Bloesch, C. D. Remy, and R. Siegwart, “Hybrid composition on 3 d underactuated bipedal walking via hybrid passive linear
operational space control for compliant legged systems,” Robotics, 2013, inverted pendulum model,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots
Art. no. 129. Syst., 2019, pp. 4644–4651.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.
DING et al.: REPRESENTATION-FREE MODEL PREDICTIVE CONTROL FOR DYNAMIC MOTIONS IN QUADRUPEDS 1171
[42] D. E. Orin, A. Goswami, and S. H. Lee, “Centroidal dynamics of a [69] M. R. Jardin and E. R. Mueller, “Optimized measurements of unmanned-
humanoid robot,” Auton. Robots, vol. 35, no. 2-3, pp. 161–176, 2013. air-vehicle mass moment of inertia with a bifilar pendulum,” J. Aircr.,
[43] H. Dai, A. Valenzuela, and R. Tedrake, “Whole-body motion planning vol. 46, no. 3, pp. 763–775, 2009.
with centroidal dynamics and full kinematics,” in Proc. IEEE-RAS Int. [70] J. Park, D. H. Kong, and H. Park, “Design of anti-skid foot with passive
Conf. Humanoid Robots, 2014, pp. 295–302. slip detection mechanism for conditional utilization of heterogeneous foot
[44] P. M. Wensing and D. E. Orin, “Generation of dynamic humanoid behav- pads,” IEEE Robot. Autom. Lett., vol. 4, no. 2, pp. 1170–1177, Apr. 2019.
iors through task-space control with conic optimization,” in Proc. IEEE
Int. Conf. Robot. Autom., 2013, pp. 3103–3109.
[45] C. Li, Y. Ding, and H.-W. Park, “Centroidal-momentum-based trajectory Yanran Ding (Member, IEEE) received the B.S. de-
generation for legged locomotion,”Mechatronics, vol. 68, 2020, Art. no. gree from Shanghai Jiao Tong University (SJTU),
102364. Shanghai, China, in 2015, and the M.S. degree
[46] J. E. Marsden and T. S. Ratiu, “Introduction to mechanics and symmetry,” from the University of Illinois at Urbana-Champaign
Phys. Today, vol. 48, no. 12, 1995, Art. no. 65. (UIUC), Champaign, IL, USA, in 2017, both in me-
[47] M. Chignoli and P. M. Wensing, “Variational-based optimal control of chanical engineering. He is currently working toward
underactuated balancing for dynamic quadrupeds,” IEEE Access, vol. 8, the Ph.D. degree in mechanical engineering at the
pp. 49 785–49 797, 2020. Dynamic Robotics Laboratory, UIUC.
[48] T. Lee, M. Leoky, and N. H. McClamroch, “Geometric tracking control His research interests include the design of agile
of a quadrotor UAV on SE(3),” in Proc. 49th IEEE Conf. Decis. Control, robotic systems and optimization-based control for
2010, pp. 5420–5425. legged robots to achieve dynamic motions.
[49] R. Featherstone, Rigid Body Dynamics Algorithms. Berlin, Germany: Mr. Ding is one of the best student paper finalists in the International Confer-
Springer, 2014. ence on Intelligent Robots and Systems (IROS) 2017.
[50] A. Graham, Kronecker Products and Matrix Calculus With Applications.
Mineola, NY, USA: Courier Dover Publications, 2018.
[51] J. C. Trinkle, J.-S. Pang, S. Sudarsky, and G. Lo, “On dynamic multi- Abhishek Pandala (Graduate Student Member,
rigid-body contact problems with coulomb friction,” ZAMM-J. Appl. IEEE) received the M.Sc. degree from the Univer-
Math. Mechanics/Zeitschrift Für Angewandte Mathematik Und Mechanik, sity of Illinois at Urbana-Champaign (UIUC), Cham-
vol. 77, no. 4, pp. 267–279, 1997. paign, IL, USA, in 2019, and the B.Tech. and M.Tech.
[52] Y. Wang and S. Boyd, “Fast model predictive control using online opti- (Dual Degree) degrees from the Indian Institute of
mization,” IEEE Trans. Control Syst. Technol., vol. 18, no. 2, pp. 267–278, Technology Madras (IIT-M), Chennai, India, in 2017,
Mar. 2010. both in mechanical engineering. He is currently work-
[53] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, U.K.: ing toward the Ph.D. degree in mechanical engineer-
Cambridge University Press, 2004. ing at the Virginia Polytechnic Institute and State
[54] J. J. Craig, Introduction to Robotics: Mechanics and Control, 4th ed. New University, Blacksburg, VA, USA.
York, NY, USA: Pearson Education, 2008. His research interests include optimization-based
[55] J. Di Carlo, P. M. Wensing, B. Katz, G. Bledt, and S. Kim, “Dynamic lo- control of dynamical systems with application to high degree of freedom robots.
comotion in the MIT cheetah 3 through convex model-predictive control,”
in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2018, pp. 1–9. Chuanzheng Li received the B.S. degree in mecha-
[56] MIT-Biomimetics-Robotics-Lab, “Cheetah-software.” Accessed: Jun. tronics from Zhejiang University, Hangzhou, China,
29, 2020. [Online]. Available: https://github.com/charlespwd/project-title in 2014, and the M.S. degree in mechanical engi-
2019. neering from the University of Illinois at Urbana-
[57] M. Kelly, “An introduction to trajectory optimization: How to do your Champaign, Champaign, IL, USA, in 2017. He is
own direct collocation,” SIAM Rev., vol. 59, no. 4, pp. 849–904, 2017. currently in the Ph.D. program at University of Illinois
[58] A. G. Pandala, Y. Ding, and H.-W. Park, “qpSWIFT: A real-time sparse at Urbana-Champaign supervised by Dr. Hae-Won
quadratic program solver for robotic applications,” IEEE Robot. Autom. Park, working primarily on the design of mechatronic
Lett., vol. 4, no. 4, pp. 3355–3362, Oct. 2019. systems and the real-time control of legged robots.
[59] Y. Ding and H.-W. Park, “Design and experimental implementation of
a quasi-direct-drive leg for optimized jumping,” in Proc. IEEE/RSJ Int.
Conf. Intell. Robots Syst., 2017, pp. 300–305.
[60] Y. Ding, C. Li, and H.-W. Park, “Single leg dynamic motion planning with Young-Ha Shin (Student Member, IEEE) received
mixed-integer convex optimization,” in Proc. IEEE/RSJ Int. Conf. Intell. the B.S. degree in mechanical engineering from the
Robots Syst., 2018, pp. 1–6. Korea Advanced Institute of Science and Technology
[61] Y. Ding, C. Li, and H.-W. Park, “Kinodynamic motion planning for (KAIST), Daejeon, South Korea, in 2019. He is cur-
multi-legged robot jumping via mixed-integer convex program,” in Proc. rently working toward the M.S. degree in mechanical
IEEE/RSJ Int. Conf. Intell. Robots Syst., 2020, pp. 3998–4005. engineering at the Korea Advanced Institute of Sci-
[62] J. Pratt, J. Carff, S. Drakunov, and A. Goswami, “Capture point: A step ence and Technology (KAIST).
toward humanoid push recovery,” in Proc. IEEE 6th IEEE-RAS Int. Conf. His research interests include actuator design and
Humanoid Robots, 2006, pp. 200–207. model predictive control for quadruped robots.
[63] R. E. Kalman, “A new approach to linear filtering and prediction prob-
lems,” Trans. ASME-J. Basic Eng., vol. 82, no. Series D, pp. 35–45, 1960.
[64] W. T. Higgins, “A comparison of complementary and Kalman filtering,”
IEEE Trans. Aerospace Electron. Syst., vol. AES-11, no. 3, pp. 321–325, Hae-Won Park (Member, IEEE) received the B.S.
May 1973. and M.S. degrees from Yonsei University, Seoul, Ko-
[65] P. Corke, “An inertial and visual sensing system for a small autonomous rea, in 2005 and 2007, respectively, and the Ph.D.
helicopter,” J. Robotic Syst., vol. 21, no. 2, pp. 43–51, 2004. degree from the University of Michigan, Ann Arbor,
[66] S. Saripalli, J. M. Roberts, P. Corke, G. Buskey, and G. Sukhatme, “A tale MI, USA, in 2012, all in mechanical engineering.
of two helicopters,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., vol. He is an Assistant Professor of Mechanical Engi-
1, 2003, pp. 805–810. neering with the Korea Advanced Institute of Science
[67] P. M. Wensing, A. Wang, S. Seok, D. Otten, J. Lang, and S. Kim, and Technology (KAIST), Daejeon, South Korea. His
“Proprioceptive actuator design in the MIT cheetah: Impact mitigation research interests lie at the intersection of control,
and high-bandwidth physical interaction for dynamic legged robots,” IEEE dynamics, and mechanical design of robotic systems,
Trans. Robot., vol. 33, no. 3, pp. 509–522, Jun. 2017. with special emphasis on legged locomotion robots.
[68] A. De Luca and R. Mattone, “Sensorless robot collision detection and Dr. Park is the recipient of the 2018 National Science Foundation (NSF)
hybrid force/motion control,” in Proc. IEEE Int. Conf. Robot. Autom., CAREER Award and NSF most prestigious awards in support of early-career
2005, pp. 999–1004. faculty.
Authorized licensed use limited to: Korea Advanced Inst of Science & Tech - KAIST. Downloaded on September 23,2021 at 00:48:29 UTC from IEEE Xplore. Restrictions apply.