Professional Documents
Culture Documents
Abstract A robust distributed adaptive leader-following networks [10] to estimate the agents nonlinearities showed
control for multi-degree-of-freedom (multi-DOF) robot to be beneficial to aid the network consensus, which provides
manipulator-type agents is proposed to guarantee finite-time exponential convergence and ultimate boundedness guaran-
convergence for leader-following tracking and parameter
estimation via agent-based estimation and control algorithms. tees of the synchronization error. The work in [11] requires
The dynamics of each manipulator agent system of n degrees each agent to know the leaders joint velocity. In contrast,
including the leader agent are assumed unknown. For a work in [4] demonstrates the finite-time consensus reaching
specific leader-following network Laplacian, the agents of double-integrator systems and multi-robot systems, in
position, velocity and some switched control information can particular, for a leader-following objective. Here the multi-
be fed back to the communication network. In contrast to the
current multi-agent literature for robotic manipulators, the robot systems are of single DOF in nature and each agent
proposed approach does not require a priori information of the requires its own and its neighbors mass/inertia parameters,
leaders joint velocity and acceleration to be available to all which simplifies the construction of the control law and
agents due to the use of agent-based robust adaptive control the stability analysis. Another consensus control algorithm
elements. Due to the multi-DOF character of each agent, [12] introduces a constant position demand or with the
matrix theoretical results related to M-matrix theory used for
multi-agent systems needs to be extended to the multi-degree requirement for enhanced synchronization error information
context in contrast to recent scalar double integrator results. of not only direct neighbors. This introduces a two-hop
A simulation example of two-degree of freedom manipulators neighbor information. This information is in addition to the
exemplifies the effectiveness of the approach. requirement for the leaders initial joint position by all agents
[12].
I. I NTRODUCTION In contrast to recent work, it is of interest of this paper to
Distributed control of multi-agent systems have sparked a propose a distributed, adaptive, finite-time leader-following
substantial interest due to its significantly broad applications consensus control algorithm for a robotic manipulator multi-
in many fields such as swarming, flocking, rendezvous and agent system, extending recent results [17] for scalar agents
formation in mobile robots, unmanned aerial vehicles (UAV) to the context of multi-degree-of-freedom agents. Leader in-
and multi-manipulators. Prominent work shows that consen- formation (e.g. velocity) for agents which are not connected
sus control of multi-agent systems involves not just single- to the leader is avoided. To allow for this extension, matrix
integrator and double-integrator dynamics type systems [1], theory [18] usually developed for scalar agent and leader
[2], [3], [4], [5], [6], [7], [8], [9] but also a group of inter- systems (e.g. [19]) has to be extended to the context of multi-
connected multiple degree of freedom (multi-DOF) systems degree-of-freedom agents. This facilitates the formulation of
[4], [10], [11], [12], [13]. The leader-following distributed a distributed controller for multi-degree-of-freedom systems
consensus multi-agent problem for multi-manipulators saves and the stability analysis, e.g. the construction of a Lyapunov
the computational effort and simplifies the control imple- function suited to this context. The control law provides
mentation [13]. The field of cooperative control of multiple finite-time convergence of the synchronization error and an
manipulators has introduced a distributed and cooperative adaptive parameter estimation error: This is based on an
control structure different from the centralised [14] or a pure extension of [17], where also strong inspiration is taken
master-slave structure [15]. from [4]. However, it is to note here that the leader provides
In this paper, leader-following distributed control of information only to particularly pinned agents, while agents
robotic manipulators of n degrees-of-freedom acting as obtain information only from their neighbors in terms of
agents is considered. In particular, this paper considers finite- position, velocity and a switched control component of the
time convergence for synchronization between leader and neighbor.
follower, but also for parameter adaptation. According to The next section introduces a generic communication
Wang and Xiao in [16], [5], finite-time consensus allows network concept, necessary to define neighboring agents and
better disturbance rejection, enhances robustness against un- the leader-agent communication.
certainties and increase control accuracy [16].
Recent work on cooperative control of multi-manipulator II. L EADER - AGENT C OMMUNICATION S TRUCTURE
systems has advanced from scalar [1], [2], [4] to multi- Consider a directed tree G = (V, E) with nonempty
degree-of-freedom agents [10], [11], [12]. The use of neural- finite set of N nodes V = {v1 , . . . , vi , . . . , vN } where node
i represents the i-th agent. The defined graph is strongly
*Muhammad Nasiruddin Mahyuddin is currently a PhD student in connected and fixed consisting of directed edges or arcs
the Department of Mechanical Engineering, University of Bristol, BS8
1TR, UK. He is also a staff on study-leave at the School of Elec- E V V with no repeated edges and no self loops
trical and Electronics Engineering, Universiti Sains Malaysia. Email: (vi , vi ) 6 E, i. The connectivity matrix is denoted as
memnm@bristol.ac.uk or nasiruddin@ieee.org A = [aij ] with aij > 0. ThePin-degree matrix is a diagonal
**Guido Herrmann is a Reader in Dynamics and Control with the
Department of Mechanical Engineering, University of Bristol, BS8 1TR, matrix D = [di ] with di = j6=i aij the weighted in-degree
UK. Email: g.herrmann@bristol.ac.uk node i (i.e. i-th row sum of A). The graph Laplacian matrix
***Frank L. Lewis is the Moncrief-ODonnell Professor of Electrical which is defined as L = D A, L = [lij ], i, j = 1, , N,
Engineering and Senior Fellow of the Automation and Robotics Re-
search Institute at The University of Texas at Arlington, USA. Email: has all row sums equal to zero. The connectivity matrix A
lewis@uta.edu and L are irreducible [1], [19]. The leader communication
where i is the agent manipulators parameters associated where L + B RN N describes the communication
with the Coriolis/centrifugal and gravity matrix to be esti- topology of the leader-following multi-agent network. The
mated by the novel parameter estimation algorithm presented pinning gains are B = diag(b1 , , bi , , bN ) RN N ,
in this note. In Rnn is the identity matrix, qh0 = 1N q0 RnN , h
The overall agent manipulator dynamics can be expressed {1, 2} (Noting that 1N = (1, 1, , 1N )T RN ).
as, Property 4: The communication topology L can be
+
q1 = q2 , q2 = M vg (8) framed so that the irreducible L is upper triangular.
1497
This property avoids loops but also introduces a specific B. Auxiliary Integrated Regressors
leader-agent structure. The filtered torque formulation is now considered for an
Suppose h = qh qh0 , (h {1, 2}) represents the auxiliary regressor used for the adaptation algorithm. Define
disagreement vector to be used only for analysis. Then, the a filtered regressor matrix Wi (t) and vector Ni (t) as,
synchronisation error vector Ek = [ek1 , eki , . . . , ekN ]T
R2N , k {A, B}, i is assumed to be bounded by W i (t) = kF Fi Wi (t) + kF Fi Tfi (q1i , q2i )fi (q1i , q2i ),
kh k kek k/((L + B) In ), k {A, B},
h {1, 2} Wi (0) = wIl , (25)
(15) N i (t) = kF Fi Ni (t) + kF Fi Tfi (q1i , q2i )fi , (26)
where () denotes the minimum singular value of a matrix
and e = 0 if and only if the nodes synchronise, i.e. Ni (0) = 0
1498
rim leads to finite time convergence of the closed-loop, i.e. where the auxiliary torque input ci is:
rim = 0 is governed by
1/
e Aim = i |eAi |1/ sign(eAi ), (32)
1 N
X
ri
ci = aij cj + ki
The sliding variable ri for each manipulator i is therefore (di + bi )
j=1 k
r i k
ri = Bi + eAi Rn (33) j 6= i
where Bi = [|eBi1 | sign(eBi1 ), |eBi2 | sign(eBi2 ), (42)
. . . , |eBin | sign(eBin )]T
Rn and =
and ri = Ei ri . The control gains i are chosen so that
diag(i1 , i2 , . . . , in ). The sliding variable in (33) the matrix diag(1 M11 , 2 M21 , , N MN 1
) satisfies the
can be expressed for the overall network, conditions (38) for matrix N . This implies for suitable choice
B ) + (IN )EA RnN of ki > 0 that the parameter estimation errors Wi (t) i
r = E(E (34)
N (t)i , (i = 1, , N ) and synchronisation errors EA and EB
B ) = [T , T , . . . , T ]T . Differentiating r
where E(E converge to 0 in finite time in an arbitrarily large compact
B1 Bi BN
yields, set of EA , EB and i determined by k, 1i and 2i . The
EB + (IN )EA
r = E (35) parameter estimates converge to its true values.
The overall combined control law can be written as
where + +
= vg c r (43)
= diag(E
E 1 , . . . , E
i , . . . , E
N ) R(N nN n) (36)
where = In with = diag(1 , i , . . . , N ) RN N
i defined as
with E and = [
T1 ,
T,...,
T ]T . The corresponding auxiliary
i N
i = diag(|eB1 |1 , |eB2 |1 , , |eBn |1 ) Rnn torque input which encapsulates the decentralized switching
E laws for all agents is:
(37)
1
B. Leader following control law c = [[(D + B) In ]] [(A In )c + KSIGN (
r)] (44)
A set of adaptive control laws are to be defined in where K = diag([k1 , ki , . . . , kN ]) and
this section, which will solve the leader following control T
problem within a finite time. To facilitate the analysis and r1 ri rN
SIGN (r) = , ,..., (45)
design a result known from the cooperative control literature, kr1 k k ri k k
rN k
e.g. [19], is extended to the context of multi-degree-of- h i
1
freedom systems. Note that I + [(D + B) In ] (A In ) =
Lemma 1: Let L RN N be an irreducible and upper 1
[(D + B) In ] [[(D + B) In ] + (A In )]. Invoking
triangular matrix and B RN N may have at least
the associative property of the Kronecker product, i.e.
one diagonal element. Moreover, there is a matrix N = A F + B F = (A + B) F and since L = D A,
diag(N1 , N2 , , NN ) RnN nN for which Ni Rnn , then (44) can be simplified as,
i = 1, , N , are positive definite and the following in-
equalities hold c = [[(L + B) In ]]1 [KSIGN (
r)] (46)
(Ni ) > kNi+1 k > 0, i = 1, , N 1, (38) Proof of Theorem 1: The following Lyapunov function is
proposed,
then there exists a matrix P
V = Vr + V (47)
P = P P In , P = diag(x1 /y1 , x2 /y2 , , xN /yN ), 1 T 1 T 1 1 1
= r P r + N W W N (48)
x = (L + B)1 1N , y = (L + B)T 1N , 2 2
kNi k is defined as
1N = [1, 1, , 1]T , 1N RN , max > maxi=1,...,N where N
(Ni )
(t) = N
N (t) W (t) (49)
P = diag(1, max , , (N 1)
max ), (39)
= W (t) KIef f , (50)
so that:
KIef f = (IN ekF F t w) RN ll , =
,
P ((L + B) In )N + (((L + B) In )N )T P > 0 (40) (t) = diag(W1 (t), W2 (t), . . . , WN (t)) and N (t) =
W
T T T T
[N1 , N2 , , NN ] . Note that,
The proof of this Lemma can be found in the appendix. The
diagonal matrix P from Lemma 1 will be used in the leader (t) ekF F t wIl
W (t)
ekF F t 1 . (51)
1
W
following control law in the following Theorem: w
Theorem 1: Consider the multi-manipulator system with To compute Vr in our analysis, (36) is used to denote the
dynamics defined by (1), adaptive parameter estimation al- following:
gorithms (29) and communication interconnections between r = E r (52)
manipulator agents and its corresponding virtual leader
defined through the given Laplacian matrix L. The adaptive Differentiating (13),
control law i , for each agent manipulator i is: EA = [(L + B) In ] (q2 q20 ) (53)
i = vgi i ri ,
i + i ci + i E (41) EB = [(L + B) In ] (q2 q20 ) (54)
1499
The derivative of the sliding mode term in (35) can be written where for 2 = diag((Il 21 ), (Il 2i ), . . . , (Il 2N )),
as 2 > 0 the matrix is:
[(L + B) In ] M vg + Q B
0 + E Q P [(L + B) In ]Mvg W 1
r = E
= 1 (60)
(55) 2 W
where Q 0 = 1N [M 1 (V0 (q10 , q20 )q20 G0 (q10 ) + 0 )]
0 This leads after some manipulation to
and = (IN ). Differentiating V yields,
V (E) 2 Vr + V
1 1 W p p
h i
V = rT P r + N TW 1 N (56) N Vr N V ,
(E) (61)
t
Computing the derivative of W 1 N
=
W
1 KIef f for suitable choice of gains ki and i so that > 0 exists.
provides Following arguments of [4] and [17], this guarantees that
h 1 all agent trajectories follow the persistently exciting demand
+
i
W N = (57)
t trajectory of the leader by Property 3 within finite time, so
that (Wi1 ) and (Wi ) remain finite and strictly larger than
where KkIef hf = kiF F KIef f and = converge to their true values.
1 KIef f kF F W W
1 = [1 , , N ] for Rn . 0. The estimates,
W
def def
VII. L EADER -F OLLOWING M ULTI -M ANIPULATOR
We may now define ri = Ei ri , EB = E 1 EB , E XAMPLE
i = + 2i i e kF F t
wi , = [1 , , N ] and A simulation example is presented to illustrate the perfor-
def mance of the proposed distributed adaptive leader-following
= [(L + B) In ]Q 0 + EB + W
1 KIef f ,
def control algorithm: we consider a simple network of two
= [1 , . . . , N ] . Note that EB is not singular due to
T
manipulators of 2 DOF (revolute planar) with a leader
the choice of , 1 < < 2. Exploiting the fact that P and whose communication topology is defined by a Laplacian
E are diagonal, adopting the control torque in (43) and
1 1
incorporating the auxiliary torque input c (44) and (50) it matrix, L = 0 0 . The diagonal pinning matrix is
follows: B =diag(0, 1). The manipulator leader is controlled by
1 T means of a feedback linearisation controller following sinu-
V = r (P [(L + B) In ][M ]
2 soidal/SR signals for both joints. Table I shows the masses
+[M]T [(L + B) In ]T P ) r of the manipulator agents links to be estimated.
T
r P [(L + B) In ]Mvg W
+ 1 N
TABLE I
M ANIPULATOR S YSTEM PARAMETERS ,mi = [m1i , m2i ]T
rT P [(L + B) In ]M
[(L + B) In ]1
N N
Manipulator i Link 1 mass (kg) Link 2 mass (kg)
riT Pi i + T W 1 1 i
X X
[KSIGN (
r)] + N i i i Leader 2.35 3
i=1 i=1 Agent 1 1 1.35
N N Agent 2 0.5 1
T W 1 Ni T W 1 N
i
X X
1i N i i 2i N i i (58)
ik
kN
i=1 i=1 8
Agent 2
20
Agent 2
We may now analyse the matrix 6 Agent 1 Agent 1
15
4 Leader Leader
= rT P [(L + B) In ]M
Joint 1 Position( )
[(L + B) In ]1 K
Joint 2 Position( )
Setpoint Setpoint
M (59) 2
10
1500
1.5
2 for Agent 2 and Q = P ((L + B) In )N , Q = [Q ij ], Q
ij Rnn .
1.35 kg
The choice of P and the diagonal dominance
P of P
(L + B)
ii ) >
i1
1
1 kg (and P P (L+B)) guarantees (Q
j=1 Q ji
, where
Qji = 0 for j < i. This implies that Q = Q + QT satisfies
1 for Agent 1, m
1501
IEEE CDC 2013, FLORENCE, ITALY
1 ,2
MUHAMMAD NASIRUDDIN MAHYUDDIN
GUIDO HERRMANN (SUPERVISOR)1
FRANK L. LEWIS3
1
Presentation layout
Motivation
Problem Formulation
Distributed Adaptive Control Algorithm
Analysis
Simulation Results
Conclusions
2
Acknowledgement
I would like to thank Universiti Sains Malaysia for their support in terms of
sponsorship.
23