Professional Documents
Culture Documents
Abstract— In this paper, we investigate the formation control of planar two-vehicle formation control law. In [6], linear matrix
multiple unmanned aerial vehicles (UAVs), specifically unmanned inequality was adopted to design a decentralized overlapping
aircraft, in an obstacle-laden environment. The main contribution feedback control law for the formation of UAVs. An inner–
of this paper is to integrate the formation control, trajectory
tracking, and obstacle/collision avoidance into one unified optimal outer loop formation control structure was employed and flight
control framework. A nonquadratic avoidance cost is innovatively tested in [7] using a group of YF-22 research UAVs. The outer
constructed via an inverse optimal control approach, which leads loop guidance law minimizes the position tracking error, using
to an analytical, distributed, and optimal formation control law. the nonlinear dynamic inversion method, and generates desired
The stability and optimality of the closed-loop system are proven. pitch and roll angles, whereas the inner-loop controller is
In addition, the proposed optimal control law is dependent
only on the information from the local neighbors, rather than designed to track those desired angles using the classical linear
all UAVs’ information. Simulation of multiple UAVs’ formation control method. A decentralized model predictive control
flying demonstrates the effectiveness of the integrated optimal scheme for a number of cooperative spacecrafts or UAVs
control design with desired behaviors including formation flying, with communication constraints was investigated in [8]. The
trajectory tracking, and obstacle/collision avoidance. formation control of a group of vertical takeoff and landing
Index Terms— Collision avoidance, formation control, multiple UAVs was studied in [9]. The methodology is based on the
unmanned aerial vehicles (UAVs), obstacle avoidance, optimal separation of the translational and rotational dynamics by
control. virtue of the unit-quaternion extraction method for the desired
direction of the vehicle’s thrust and attitude to achieve the
I. I NTRODUCTION position-tracking objective.
In more practical applications, obstacle avoidance must
F ORMATION flying of multiple unmanned aerial vehicles
(UAVs) has been identified as the key technology for
many cooperative missions. It involves designing a centralized
be taken into consideration if an obstacle is detected right
on the trajectory of a UAV. Many formation controls with
obstacle avoidance capability use path planning approaches
or distributed controller that drives all UAVs to a desired
[10], [13], [14], [16], [17], [19] or potential functions [11],
formation while maintaining precise relative positions and
[12], [15], [18], [20], [35]–[37]. In particular, [15], [35], and
velocities, or synchronizing attitude and angular velocity if
[37] employed the same potential function and they originated
rotational motion is considered. Potential applications of mul-
from [36]. The work in [35] focused on the multiagent
tiple UAVs’ formation flying include monitoring of forest fires
collision avoidance problem in which cooperation among
and oil fields, border patrol, surveillance, reconnaissance, and
agents was not considered, because it was assumed that each
battle damage assessment, to name a few. Early studies [1]
agent knew its desired position to track. The work in [36]
also reported significant potential fuel savings that could be
investigated an evasion problem between an evader and a
gained by a close formation flight.
pursuer, and did not address a cooperative control problem
UAVs’ formation control problems have attracted intensive
either. In [21], a hierarchical architecture for the cooperative
research in the recent decade. Many formation control strate-
control of multiple UAVs was proposed. It features an upper
gies, such as leader-following [2], behavioral [3], and virtual
level with global situation awareness and team mission plan-
leader/virtual structure approaches [4], have been well studied.
ning, a middle level with local knowledge, formation control,
A Lie group setting for the formation control problem was
and obstacle avoidance, and a low level that interfaces with
presented in [5] as a natural outcome of the analysis of a
onboard baseline controllers, sensors, communication systems,
Manuscript received July 15, 2011; revised February 2, 2012; accepted and weapon systems. The collision avoidance was consid-
September 1, 2012. Manuscript received in final form September 11, 2012. ered as a moving obstacle avoidance, which is implemented
Date of publication November 12, 2012; date of current version August 12, by a potential value function design. In [22], a real-time
2013. This work was supported in part by the U.S. National Science
Foundation CAREER Award ECCS-0846877. Recommended by Associate trajectory planning strategy with obstacle avoidance and a
Editor M. Mattei. formation control law were combined in a simulation platform
J. Wang is with Department of Electrical Engineering and Computer for cooperative control of a group of UAVs. A new dual-
Science, University of Central Florida, Orlando, FL 32816 USA (e-mail:
wangjianan@ieee.org). mode control strategy was proposed in [23] for the UAVs’
M. Xin is with the Department of Aerospace Engineering, Mississippi State formation flying in both obstacle-free and obstacle-laden envi-
University, Starkville, MS 39759 USA (e-mail: xin@ae.msstate.edu). ronments. A “safe mode” is defined as an operation in an
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. obstacle-free environment, and a “danger mode” is activated
Digital Object Identifier 10.1109/TCST.2012.2218815 when there is a chance of collision or obstacle in the path.
1063-6536 © 2012 IEEE
1732 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 21, NO. 5, SEPTEMBER 2013
Remark 4.2: One can always find proper weights to satisfy In addition, if X̂ 0 ∈ D0 , then the feedback control (32)
the condition (21). For instance, a small enough wc for minimizes J ( X̂ 0 , U(·)) in the sense that
the given w p and wv are applicable because ei ≥ 0. It
is obvious that R1 in the formation cost (20) is positive J ( X̂ 0 , φ( X̂(·))) = min J ( X̂ 0 , U(·)) (34)
semidefinite because both the diagonal elements in R1 are U(·)∈S( X̂0 )
positive semidefinite. The term −2w p wc L in the formation
cost (20) is used to guarantee that the analytical solution of where S( X̂ 0 ) denotes the set of asymptotically stabilizing
the Riccati equation for the optimal control law is a linear controllers for each initial condition X̂ 0 ∈ D. Finally, if
function of the Laplacian matrix L, and is thus completely D = Rn , = Rm , and
dependent on the information exchange topology, which will
be shown in the proof of Theorem 4.1. V ( X̂) → ∞ as X̂ → ∞. (35)
J2 has the form of
∞
The solution X̂(t) ≡ 0 of the closed-loop system is globally
J2 = h( X̂)dt (22)
0 asymptotically stable.
Proof: Refer to [26].
where h( X̂) contains the tracking penalty function as well
Before presenting the main result, we first define the penalty
as the obstacle/collision avoidance penalty function. It will
function g( X̂)
be constructed from an inverse optimal control approach
in Theorem 4.1. g( X̂) = gtr ( X̂) + ga ( X̂). (36)
J3 has the regular quadratic form of
∞ The first component is the tracking penalty function
J3 = U T R2 Udt (23)
0
n
where R2 = wc2 In
⊗ I3 is positive definite and wc is the gtr ( X̂) gtri ( X̂) (37)
weighting parameter. i=1
⎧ " #T
The following lemma is introduced to derive the main result ⎪ T
⎪
⎪ pi − pre f ν i − ν re f
of this paper and is used to prove both asymptotic stability and ⎪
⎪
⎨ 2
optimality of the proposed formation control algorithm. wt p I 3 wt p wt v I 3 pi − pre f
gtri ( X̂) ×
ν i − ν re f
Lemma 4.3 [26]: Consider the nonlinear controlled dynam- ⎪
⎪ wt p wt v I3 wt2v I3
⎪
⎪
ical system ⎪
⎩
if UAV i has access to the reference
0 if not
˙ = f ( X̂(t), U (t)), X̂(0) = X̂ , t ≥ 0
X̂(t) 0 (24) i = 1, . . . , n (38)
with f (0, 0) = 0 and a cost functional given by
∞ where wt p , wt v are tunable weights to control the tracking
J ( X̂ 0 , U(·)) = T ( X̂(t), U(t))dt (25) speed. pre f and ν re f are the position and velocity of the
0 reference, respectively.
where U(·) is an admissible control. Let D ⊂ Rn be an open Remark 4.3: The UAV that has access to the reference will
set and ⊆ Rm . Assume that there exists a continuously track the desired trajectory by means of minimizing the cost
differentiable function V : D → R and a control law φ : J2 that contains gtr ( X̂), which will be shown in Theorem 4.1.
D → such that Note that minimizing the cost function J1 guarantees that all
V (0) = 0 (26) the UAVs’ positions and velocities are coordinated to achieve
the desired formation and reach the consensus velocity ν cs
V ( X̂) > 0, X̂ ∈ D, X̂ = 0 (27) synchronously. But where the formation goes cannot be
φ(0) = 0 (28) specified as desired by merely minimizing J1 because the
V ( X̂) f ( X̂, φ( X̂)) < 0, X̂ ∈ D, X̂ = 0 (29) final UAVs’ velocity depends on each UAVs initial velocity
H ( X̂, φ( X̂)) = 0, X̂ ∈ D (30) and results from the negotiation among UAVs during the flight
via the optimal consensus formulation. Therefore, combining
H ( X̂, U) ≥ 0, X̂ ∈ D, U ∈ (31) the tracking penalty function gtr ( X̂) with the consensus
where H ( X̂, U) = T ( X̂, U) + V
( X̂) f ( X̂, U) is the Hamil- formation cost J1 , the formation is able to follow a desired
tonian function. The superscript
denotes partial differentiation trajectory pre f with the desired formation velocity ν re f , that
with respect to X̂. is, ν cs = ν re f . It is also worth noting that only one UAV
Then, with the feedback control having access to the reference is sufficient to guarantee the
entire formation to track the desired trajectory.
U(·) = φ( X̂(·)) (32) The second component in g( X̂) is the obstacle/collision
the solution X̂(t) ≡ 0 of the closed-loop system is locally avoidance penalty function
asymptotically stable and there exists a neighborhood of the
origin D0 ⊆ D such that
n n+q−1
ga ( X̂) = gi,k ( X̂) (39)
J ( X̂ 0 , φ( X̂(·))) = V ( X̂ 0 ), X̂ 0 ∈ D0 . (33) i=1 k=1
1736 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 21, NO. 5, SEPTEMBER 2013
⎧
⎪
⎪ 0, pi ∈ i,k
⎪
⎪" #
⎪
⎪ + r̄) − pi − ck
2 2
⎨ (R
2
" #2
gi,k ( X̂) pi −ck 2 −(r + r̄)2
⎪
⎪
⎪
⎪
x,y x,y
×(ν i − ν ak )T (ν i − ν ak ) pi ∈ i,k
⎪
⎪
⎩ not defined pi ∈ i,k
i = 1, . . . , n, k = 1, . . . , n + q − 1 (40)
x,y
where ck ∈ i ; ν i ∈ R2 is the projection of the UAV’s veloc-
ity vector ν i on the x̂ − ŷ plane; and ν ak ∈ R2 is the prescribed
obstacle/collision avoidance velocity. There are many ways to
avoid the obstacle or colliding UAVs from different directions.
Since UAVs move primarily along the down-range and cross-
range directions, we design the avoidance strategy in the (x̂, ŷ)
plane, which is shown in Fig. 4, where r̄ denotes the radius Fig. 4. Illustration of obstacle/collision avoidance.
of the colliding object (UAV or obstacle) and rsafe is defined
as a prespecified safe distance to the colliding object. rsafe
should be set large enough for safe avoidance. The coordinate of ck and OU in the coordinate frame x̂ − O − ŷ respectively.
system x̂ − O − ŷ is the horizontal plane of (x̂, ŷ, ĥ). The For the other layout, the rotation angle β can be calculated
coordinate system x̂
− OU − ŷ
is coplanar to x̂ − O − ŷ likewise with the sign difference. For instance, if ck is located
with the origin at the center of the UAV and the x̂
-axis along below the dashed horizontal line x̂, the rotation angle will be
the tangential direction from the UAV to the dot-and-dashed calculated as
safety circle centered at the colliding object with the radius β = ∠ck OU x̂
+ ∠ck OU x̂.
x,y
of r̄ + rsafe . The ν ak is chosen to be the projection of ν i on
Remark 4.4: The obstacle/collision avoidance penalty func-
the x̂ -axis. Apparently, the avoidance can be achieved if the
tion is designed to penalize the error between the current
UAV is driven to the direction of ν ak . x,y
velocity ν i and the avoidance velocity ν ak . The coefficient
ν ak can be calculated as follows. The rotation angle β
" 2 #2
between the two coordinate frames x̂ − O − ŷ and x̂
− OU − ŷ
(R + r̄)2 − pi − ck
can be determined from the geometry when the current UAV’s
" #2
position and the colliding object’s position are known. Then, pi − ck 2 − (r + r̄)2
the velocity vector ν i expressed in the x̂
− OU − ŷ
frame
x,y
x
,y
and denoted as ν i can be calculated by in g i,k ( X̂) is designed to guarantee the continuity of the
penalty function on the upper boundary R + r̄ of the reaction
x
,y
νix x,y region, which will be shown in the proof of Theorem 4.1.
νi = y
= C (β) · ν i (41)
νi In addition, the coefficient
" 2 #2
where (R + r̄)2 − pi − ck
cos β − sin β
C (β) = " #2
sin β cos β
pi − ck 2 − (r + r̄)2
is the rotation matrix.
The prescribed obstacle/collision avoidance velocity ν ak is of the avoidance penalty function (40) can be regarded as the
x,y x,y
obtained by x
weight on the tracking error (ν i −ν ak )T (ν i −ν ak ) between
−1 ν the UAV velocity and the avoidance velocity ν ak . The closer
ν ak = C (β) i . (42)
0 the UAV is to the colliding object, the smaller the denominator
of the coefficient is and thus the larger the weight is. Thus, the
To calculate the rotation angle β between the two coordinate
priority of obstacle/collision avoidance increases as the UAV
frames x̂ −O− ŷ and x̂
−OU − ŷ
, we can inspect the geometry
approaches the colliding object inside the reaction region. It
in Fig. 4. In the current layout of Fig. 4, the rotation angle
is also worth noting that the avoidance strategy as shown in
β = ∠ck OU x̂
− ∠ck OU x̂ Fig. 4 establishes a safety ball outside the colliding object
with a radius of rsafe + r̄ such that the UAV is driven to fly
where ∠ stands for the angle. ck denotes the center of the
along the direction tangential to this safety ball. This safety
colliding object; OU denotes the current position of the UAV.
ball serves as a buffer zone to ensure the avoidance if the UAV
It can be seen from the geometry that
does not exactly fulfill the avoidance velocity direction due to
rsafe + r̄
∠ck OU x̂
= arcsin the other requirements such as reference trajectory tracking
d and formation flying. rsafe can be prescribed large enough by
x̂ O − x̂U the designer in light of the size and maneuverability of the
∠ck OU x̂ = arccos
d aircraft.
where d denotes the distance from the UAV (OU ) to the center The main result of this paper is presented in the following
of the colliding object (ck ). x̂ O and x̂U are the x̂ components theorem.
WANG AND XIN: INTEGRATED OPTIMAL FORMATION CONTROL OF MULTIPLE UAVs 1737
Theorem 4.1: For a multi-UAV system (3) and (4) with the continuously differentiable with respect to X̂ in the open
$n+q−1
assumptions 1) and 2), one can always find proper weights domain ( k=1 i,k )c .
w p , wv , and wc such that the distributed feedback control law After showing that V ( X̂) is continuously differentiable, the
wp Hamiltonian function for this optimal control problem can be
U = φ (X) = − (L ⊗ I3 ) p − σ p written as
wc " " ## " # " # " #
wν 1
H X̂, U, V
T X̂ = T X̂, U + V
T X̂ f X̂, U
− (L ⊗ I3 )v − g (X) (43)
wc 2wc2 v T
" #
= X̂ R1 X̂ + h X̂ + U T R2 U
is an optimal control for the formation control problem (19) T " #
with the cost function term h( X̂) in J2 being + 2 X̂ P + g
T X̂ A X̂ + BU .
w p
T wv
T
h( X̂) = gv ( X̂)(L ⊗ I3 ) p̂ + g ( X̂)(L ⊗ I3 )ν̂ (49)
wc wc v
1
Setting (∂/∂U)H ( X̂, U, V
T ( X̂)) = 0 yields the optimal
2
−g
T
p ( X̂)ν̂ + 2 gv ( X̂) (44) control law
4wc
1
where gv
(X), gv
( X̂),and g
p ( X̂) in (43) and (44) denote the U ∗ = φ( X̂) = − R2−1 B T V
( X̂)
2
partial differentiation of g( X̂) with respect to the velocity error 1
= −R2−1 B T P X̂ − R2−1 B T g
( X̂). (50)
ν̂ and the position error p̂, respectively. In addition, the closed- 2
loop system is globally asymptotically stable. With (50) it follows that:
Proof: Pertaining to this optimal formation control problem, " " ## T
" #
we have the following equations corresponding to Lemma 4.3: V
T ( X̂) f X̂, φ X̂ = X̂ A T P + P A − 2P S P X̂
T
T
T ( X̂, U) = X̂ R1 X̂ + h( X̂) + U T R2 U (45) − X̂ P Sg
( X̂) + g
T ( X̂)
1
f ( X̂, U) = A X̂ + BU. (46) × (A− S P) X̂ − g
T ( X̂)Sg
( X̂)
2
A candidate Lyapunov function V ( X̂) is chosen to be (51)
T
V ( X̂) = X̂ P X̂ + g( X̂) (47) where S = B R2−1 B T . Using (50) and (51) into (49) yields
" # T
" #
where P is the solution of a Riccati equation, which will be H X̂, φ( X̂), V
T ( X̂) = X̂ A T P + P A + R1 − P S P X̂
shown afterward.
+g
T ( X̂)(A − S P) X̂ + h( X̂)
In order for the function V ( X̂) in (47) to be a valid
1
Lyapunov function, it must be continuously differentiable − g
T ( X̂)Sg
( X̂). (52)
with respect to X̂, or equivalently g( X̂) must be continu- 4
ously differentiable with respect to X̂ in the open domain In order to prove that the control law (50) is an optimal solu-
$n+q−1 tion for the formation control problem (19) using Lemma 4.3,
( k=1 i,k )c where “∪” and the superscript “c” denote
the union and complement of sets, respectively. From (38) the conditions (26)–(31) need to be verified.
and (40), it suffices to show that gi,k ( X̂) is continuously In order to satisfy the condition (30) in Lemma 4.3 or to
$n+q−1 let (52) be zero, we can let
differentiable in the open domain ( k=1 i,k )c . In fact,
this is true if gi,k ( X̂) and (dgi,k ( X̂)/d pi ) are continuous on A T P + P A + R1 − P S P = 0 (53)
the upper boundary R + r̄ of i,k . Since (40) implies that
and require that
lim pi −ck →(R+r̄)− gi,k ( X̂) = 0 = lim pi −ck →(R+r̄)+ gi,k ( X̂),
gi,k ( X̂) is continuous on the upper boundary R + r̄ of i,k by 1
g
T ( X̂)(A − S P) X̂ + h( X̂) − g
T ( X̂)Sg
( X̂) = 0. (54)
the definition of continuity. In addition 4
With (50), (53), and (54), it can be shown that
dgi,k ( X̂)
= H ( X̂, U, V
T ( X̂))
dp
⎧ i T T
⎪ 0, pi ∈ i,k = U T R2 U +h( X̂)+ X̂ R1 X̂ +(2 X̂ P +g
T ( X̂))(A X̂ + BU)
⎪
⎪
⎪
⎨ −4((R+r̄) i k
2 −(r+r̄)2 )((R+r̄)2 − p −c 2 ) T T
= U T R2 U +h( X̂)+ X̂ R1 X̂ +(2 X̂ P +g
T ( X̂))(A X̂ + BU)
" #3
pi −ck 2 −(r+r̄)2 T
⎪
⎪ − X̂ (A T P + P A + R1 − P S P) X̂
⎪ ×( pi − ck )(ν ix,y − ν ak )T (ν ix,y − ν ak ) pi ∈ i,k
⎪
⎩ = U T R2 U + h( X̂) + g
T ( X̂)(A X̂ + BU)
not defined pi ∈ i,k
T T
i = 1, . . . , n, k = 1, . . . , n + q − 1. (48) +2 X̂ P BU + X̂ P S P X̂
1
It is easy to see that lim pi −ck →(R+r̄)− (dgi,k ( X̂)/d pi ) = = U T R2 U + g
T ( X̂)Sg
( X̂) + g
T ( X̂)S P X̂
4
03×1 = lim pi −ck →(R+r̄)+ (dgi,k ( X̂)/d pi ). Hence, (dgi,k T T
+ X̂ P S P X̂ + (2 X̂ P + g
T ( X̂))BU
( X̂)/d pi ) is continuous on the upper boundary R + r̄ of 1 T T
i,k . Therefore, g( X̂) and the Lyapunov function V ( X̂) are = U T R2 U + (2 X̂ P + g
T ( X̂))S(2 X̂ P + g
T ( X̂))T
4
1738 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 21, NO. 5, SEPTEMBER 2013
T
+(2 X̂ P + g
T ( X̂))BU P in (57), the Lyapunov function becomes
1
= U T R2 U + V
T ( X̂)SV
( X̂) + U T B T V
( X̂) T
V ( X̂) = X̂ P X̂ + g( X̂) =
4 ⎧
= U T R2 U + φ( X̂)T R2 φ( X̂) − 2U T R2 φ( X̂) ⎪
⎪ w p wv p̄T (L 2 ⊗ I3 ) p̄ + wc wv v T (L ⊗ I3 )v
⎪
⎪
⎪
⎪ +2w p wc p̄T (L ⊗ I3 )v + gtr ( X̂) pi ∈ i,k
= [U − φ( X̂)]T R2 [U − φ( X̂)] ≥ 0. (55) ⎨
w p wv p̄T (L 2 ⊗ I3 ) p̄ + wc wv v T (L ⊗ I3 )v
⎪
⎪
Therefore, the condition (31) is satisfied. ⎪
⎪ +2w p wc p̄T (L ⊗ I3 )v + gtr ( X̂) + ga ( X̂) pi ∈ i,k
⎪
⎪
Next, substituting A, B, R1 , and R2 in (53) and assuming ⎩
not defined pi ∈ i,k
i = 1, . . . , n, k = 1, . . . , n + q − 1. (60)
P1 P2
P= ⊗ I3
P2 P3 Note that the property (16) is used in (60) so that the first
three terms in V ( X̂) contains only p̄ and v. V ( X̂) > 0 when
yields X̂ = 0 can be guaranteed by selecting a large enough wv for
the given w p such that the positive terms w p wv p̄T (L 2 ⊗ I3 ) p̄,
− 12 P22P1 − 12 P2 P3 w2p L 2 0n×n wc wv v T (L⊗I3 )v, and gtr ( X̂) are always greater than the sign-
wc wc
+ = 0.
P1 − 12 P3 P2 2P2 − 12 P32 0n×n wv2 L 2 − 2w p wc L indefinite terms and thus the condition (27) can be satisfied.
wc wc
(56) Next, the cost function term h( X̂) in J2 (see (22)) is
constructed from solving (54) and using (57)
Then, P can be solved in the analytical form w p
T wv
T
h( X̂) = g ( X̂)(L ⊗ I3 ) p̂ + g ( X̂)(L ⊗ I3 )ν̂
wc v wc v
w p wv L 2 w p wc L 1
2
P= ⊗ I3 . (57) −g
T
p ( X̂)ν̂ + g v ( X̂) (61)
w p wc L wc wv L 4wc 2
which turns out to be (44). Note that the cost function term
Note that the purpose of introducing the term −2w p wc L in
h( X̂) is obtained from the inverse optimal control strategy
R1 (see (20)) is to let it cancel 2P2 when solving P3 from
since it is not given a priori, but is constructed from the
(56) in order to make P3 a linear function of the Laplacian
optimality condition (54).
matrix L as mentioned in Remark 4.2.
Using (53) and (54), (51) becomes
Now, we will verify the condition (27). First note that the
matrix V
T ( X̂) f ( X̂, φ( X̂))
T
wt2p I3 wt p wt v I3 T 1
= − X̂ R1 X̂ + h( X̂) + ( X̂ P + g
T ( X̂))
wt p wt v I3 wt2v I3 2
1
×S(P X̂ + g ( X̂)) . (62)
in the tracking penalty function (38) is positive semidefinite. 2
To see this, we can inspect the eigenvalues of this matrix by It can be seen from (62) that the condition (29) can be met
T
when h( X̂) ≥ 0 since X̂ R1 X̂ is positive semidefinite and
wt2p I3 wt p wt v I3 T
det λI − ( X̂ P +(1/2)g
T ( X̂))S(P X̂ +(1/2)g
( X̂))is positive definite.
wt p wt v I3 wt2v I3
By selecting proper values of the weights w p , wv , and wc in
(λ − wt2p )I3 −wt p wt v I3 (61), one can always ensure h( X̂) ≥ 0. Specifically, it can be
= det . (58)
−wt p wt v I3 (λ − wt2v )I3 achieved by choosing a small enough wc for the given w p
and wv such that the positive term 1/(4wc2 )gv
( X̂)2 in (61)
Since (λ − wt2p )I3 and −wt p wt v I3 commute [28], that is, is always greater than the other sign-indefinite terms.
[(λ − wt2p )I3 ][−wt p wt v I3 ] = [−wt p wt v I3 ][(λ − wt2p )I3 ], we Substituting P and g
( X̂) into (50) leads to the optimal
have control law
wp wv 1
(λ − wt2p )I3 −wt p wt v I3 φ( X̂) = − (L ⊗ I3 ) p̂ − (L ⊗ I3 )ν̂ − 2 gv
( X̂). (63)
det wc wc 2wc
−wt p wt v I3 (λ − wt2v )I3
" # Substituting p̂ = p̄ − p̄cs and ν̂ = ν̄ − ν̄ cs into (63) and using
= det (λ − wt2p )(λ − wt2v )I3 − wt2p wt2v I3
" # (16), the optimal control law (63) becomes
= det λ2 I3 − (wt2p + wt2v )λI3 . (59) wp wv 1
φ (X) = − (L ⊗ I3 ) p̄ − (L ⊗ I3 )ν̄ − 2 gv
( X̂). (64)
wc wc 2wc
The eigenvalues are
Also note that
λ = 0, 0, 0, wt2p + wt2v , wt2p + wt2v , wt2p + wt2v ≥ 0 ∂gtr ( X̂) ∂ga ( X̂)
gv
( X̂) = gtr
( X̂) + ga
v ( X̂) = + . (65)
v
∂ ν̂ ∂ ν̂
which implies that the weighting matrix in gtr ( X̂) is positive Since pre f and ν re f are known to UAV i if it has access to the
semidefinite and gtr ( X̂) ≥ 0. Moreover, from the solution of
( X̂) = g
(X). Also, since ν is determined by
reference, gtr v trv ak
WANG AND XIN: INTEGRATED OPTIMAL FORMATION CONTROL OF MULTIPLE UAVs 1739
Fig. 7. Time histories of the positions xi and velocities ẋi . Fig. 8. Time histories of the positions yi and velocities ẏi .
Fig. 10. Time histories of the actual states and actual controls. Fig. 12. Top view of UAVs’ formation flying with obstacle/collision
avoidance.
where
σ p1 = [0; −100; 1] m; σ p2 = [100; 0; 1] m
σ p3 = [−100; 0; 1] m; σ p4 = [0; 100; 1] m.
The final velocity and altitude of the formation are desired
Fig. 11. 3-D trajectories of UAVs’ formation flying with two-obstacle
avoidance. to follow a straight-line reference trajectory with the initial
position of [ 0 −100 80 ]T m and the constant velocity of
[ 60 0 0 ]T m/s. Assume that only UAV 1 has access to the
and varies according to the altitude h i . In the simulated gust, reference. The associated weights in the control law are set to
the normal wind shear is given by w p = 0.08, wv = 2, wc = 2, wt p = 0.04, and wt v = 1. Two
simulation scenarios are considered as follows.
Vwi ,normal = 0.215Vm log10 (h i ) + 0.285Vm (67)
where Vm = 4.0 m/s is the mean wind speed at an altitude of A. Formation Control With Collision Avoidance and Without
80 m, which is the simulated altitude. The turbulence part of Obstacles on the Trajectories of UAVs
the wind gust Vwi ,tan has a Gaussian distribution with a zero In this scenario, an obstacle is assumed to appear at (−1000,
mean and a standard derivation of 0.09Vm . The constraints on 0, 80) m, which is not on the trajectory of any UAV. The
the control variables are Th i ≤ 125N, −1.5 ≤ n gi ≤ 2.0, and radius of the obstacle is assumed to be r1 = 15 m. The
−80° ≤ φbi ≤ 80°. safe distance rsafe to the colliding object is set to 30 m.
The undirected and connected communication topology of The simulation results of the four UAVs’ motion under the
the four UAVs is illustrated in Fig. 2 and can be described by proposed optimal formation control are shown in Figs. 5–10.
the Laplacian matrix L Fig. 5 demonstrates the 3-D trajectories and Fig. 6 shows the
⎡ ⎤ top view of the trajectory. As can be seen from Fig. 5, all the
1 −1 0 0
⎢ −1 2 −1 0 ⎥ UAVs are driven to a rhombic formation, and the formation
L=⎢ ⎣ 0 −1 2 −1 ⎦.
⎥ (68) follows the desired reference trajectory. It can also be seen
from Fig. 6 that the obstacle avoidance does not take effect
0 0 −1 1
since no obstacle enters the detection region of UAVs, whereas
The initial positions of the four UAVs are assumed to be the collision avoidance does take effect, as shown in the
at (0, −200, 70) m, (0, −60, 80) m, (0, 60, 90) m, and rectanglular box in Fig. 6, where UAV 2 and UAV 3 enter each
1742 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 21, NO. 5, SEPTEMBER 2013
Fig. 13. Time histories of the relative distances between UAVs and obstacles. Fig. 15. Time histories of the positions yi and velocities ẏi .
R EFERENCES
[1] D. F. Chichka and J. L. Speyer, “Solar-powered, formation-enhanced
aerial vehicle systems for sustained endurance,” in Proc. Amer. Control
Conf., Philadelphia, PA, Jun. 1998, pp. 684–688.
[2] P. K. C. Wang, “Navigation strategies for multiple autonomous mobile
robots moving in formation,” J. Robot. Syst., vol. 8, no. 2, pp. 177–195,
Apr. 1991.
[3] T. Balch and R. C. Arkin, “Behavior-based formation control for
Fig. 17. Time histories of the actual states and actual controls. multirobot teams,” IEEE Trans. Robot. Autom., vol. 14, no. 6, pp. 926–
939, Dec. 1998.
[4] F. L. Lewis and K. H. Tan, “High precision formation control of mobile
formation. When UAV 2 enters the detection region of UAV robots using virtual structures,” Auton. Robots, vol. 4, no. 4, pp. 387–
403, 1997.
3, the collision avoidance control prevents UAV 2 and UAV 3 [5] E. W. Justh and P. S. Krishnaprasad, “A simple control law for UAV
from moving closer such that the second collision avoidance formation flying,” ISR, Univ. Maryland, Baltimore, Tech. Rep. TR 2002-
can be observed. Soon after, obstacle 2 enters the detection 38, 2002.
[6] D. Stipanovic, G. Inalhan, R. Teo, and C. J. Tomlin, “Decentralized
region of UAV 1. The proposed control succeeds driving overlapping control of a formation of unmanned aerial vehicles,” Auto-
UAV 1 toward the positive ŷ direction to avoid obstacle 2. matica, vol. 40, no. 8, pp. 1285–1296, 2004.
Eventually, the UAVs achieve the desired formation and fly [7] Y. Gu, B. Seanor, G. Campa, M. R. Napolitano, L. Rowe, S. Gururajan,
and S. Wan, “Design and flight testing evaluation of formation control
along the reference trajectory. Fig. 13 gives the time histories laws,” IEEE Trans. Control Syst. Technol., vol. 14, no. 6, pp. 1105–1112,
of the relative distances between UAVs and obstacles. It can Nov. 2006.
be easily seen from the blue line of the bottom zoomed figure [8] J. Lavaei, A. Momeni, and A. G. Aghdam, “A model predictive
decentralized control scheme with reduced communication requirement
in Fig. 13 that the minimum relative distance between UAV for spacecraft formation,” IEEE Trans. Control Syst. Technol., vol. 16,
3 and obstacle 1 is 20 m, which is greater than the sum of no. 2, pp. 268–278, Mar. 2008.
obstacle 1’s radius (15 m) and the UAV’s radius (1.5 m). This [9] A. Abdessameud and A. Tayebi, “Formation control of VTOL-UAVs,”
in Proc. Joint 48th IEEE Conf. Decision Control, Shanghai, China, Dec.
implies the successful avoidance of obstacle 1. Also, from the 2009, pp. 3454–3459.
red line of the bottom zoomed figure in Fig. 13 one can see that [10] A. K. Das, R. Fierro, V. Kumar, J. P. Ostrowski, J. Spletzer, and C.
the minimum relative distance between UAV 1 and obstacle J. Taylor, “A vision-based formation control framework,” IEEE Trans.
Robot. Autom., vol. 18, no. 5, pp. 813–825, Oct. 2002.
2 is 15 m, which is greater than the sum of obstacle 2’s radius [11] H. G. Tanner and A. Kumar, “Formation stabilization of multiple agents
(10 m) and the UAV’s radius (1.5 m) as well. Thus, it implies using decentralized navigation functions,” in Robotics: Science and
the successful avoidance of obstacle 2. Systems I, S. Thrun, G. Sukhatme, S. Schaal, and O. Brock, Eds.
Cambridge, MA: MIT Press, 2005, pp. 49–56.
The time histories of the UAV’s positions and velocities [12] Y. Liang and H. H. Lee, “Decentralized formation control and obstacle
in Figs. 14–16 show that UAV 1 tracks the reference, and avoidance for multiple robots with nonholonomic constraints,” in Proc.
all UAVs fly into formation synchronously with the final IEEE Amer. Control Conf., Minneapolis, MN, Jun. 2006, pp. 5596–5601.
[13] J. Shao, G. Xie, and L. Wang, “Leader-following formation control of
desired altitude of 80 m and the final desired velocity of multiple mobile vehicles,” IET Control Theory Appl., vol. 1, no. 2, pp.
[ 60 0 0 ]T m/s, which is the same as the previous scenario. 545–552, Mar. 2007.
The three top figures in Fig. 17 illustrate the time histories of [14] C. D. L. Cruz and R. Carelli, “Dynamic model based formation control
and obstacle avoidance,” Robotica, vol. 26, no. 3, pp. 345–356, 2008.
the ground velocities, flight path angles, and heading angles, [15] S. Mastellone, D. M. Stipanovic, C. Graunke, K. Intlekofer, and M.
respectively, which are the actual states of UAVs and exhibit W. Spong, “Formation control and collision avoidance for multiagent
variations in the presence of obstacle and collision avoidance. nonholonomic systems: Theory and experiments,” Int. J. Robot. Res.,
vol. 27, no. 1, pp. 107–126, 2008.
The three bottom figures in Fig. 17 illustrate the actual control [16] M. Defoort, T. Floquet, A. Kokosy, and W. Perruquetti, “Sliding-mode
responses and they are all within the prescribed constraints. formation control for cooperative autonomous mobile robots,” IEEE
Trans. Ind. Electron., vol. 55, no. 11, pp. 3944–3953, Nov. 2008.
[17] A. K. Ray, P. Benavidez, L. Behera, and M. M. Jamshidi, “Decentralized
VI. C ONCLUSION motion coordination for a formation of rovers,” IEEE Syst. J., vol. 3,
In this paper, the multiple UAV’s formation control with no. 3, pp. 369–381, Sep. 2009.
[18] G. Antonelli, F. Arrichiello, and S. Chiaverini, “Experiments of for-
obstacle/collision avoidance was designed in a unified optimal mation control with multirobot systems using the null-space-based
control framework. A nonquadratic avoidance cost function behavioral control,” IEEE Trans. Control Syst. Technol., vol. 17, no. 5,
was constructed via a new inverse optimal control approach pp. 1173–1182, Sep. 2009.
[19] T. Dierks and S. Jagannathan, “Neural network output feedback control
such that the optimal control law can be obtained in an of robot formations,” IEEE Trans. Syst., Man, Cybern., B, Cybern.,
analytical form. The unified formulation can guarantee not vol. 40, no. 2, pp. 383–399, Apr. 2010.
1744 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 21, NO. 5, SEPTEMBER 2013
[20] J. Wang, X. B. Wu, and Z. L. Xu, “Potential-based obstacle avoidance in [36] G. Leitmann and J. Skowronski, “Avoidance control,” J. Optim. Theory
formation control,” J. Control Theory Appl., vol. 6, no. 3, pp. 311–316, Appl., vol. 23, pp. 581–591, Jul. 1977.
2008. [37] E. J. Rodriguez-Seda, J. J. Troy, C. A. Erignac, P. Murray, D. M.
[21] G. Vachtsevanos, L. Tang, and J. Reimann, “An intelligent approach to Stipanovic, and M. W. Spong, “Bilateral teleoperation of multiple mobile
coordinated control of multiple unmanned aerial vehicles,” in Proc. 60th agents: Coordinated motion and collision avoidance,” IEEE Trans.
Annu. Forum Amer. Helicopter Soc., Baltimore, MD, Jun. 2004, pp. 1–9. Control Syst. Technol., vol. 18, no. 4, pp. 984–992, Jul. 2010.
[22] H. L. Yuan, V. Gottesman, M. Falash, Z. H. Qu, E. Pollak, and J.
M. Chunyu, “Cooperative formation flying in autonomous unmanned
air systems with application to training,” in Advances in Cooperative
Control and Optimization (Lecture Notes in Control and Information
Sciences), vol. 369. New York: Springer-Verlag, 2007.
[23] X. Wang, V. Yadav, and S. N. Balakrishnan, “Cooperative UAV forma- Jianan Wang (S’09–M’12) received the B.S. degree
tion flying with obstacle/collision avoidance,” IEEE Trans. Control Syst. from Beijing Jiaotong University, Beijing, China,
Technol., vol. 15, no. 4, pp. 672–679, Jul. 2007. and the M.S. degree from the Beijing Institute of
[24] T. Keviczky, B. Francesco, K. Fregene, D. Godbole, and G. J. Balas, Technology, Beijing, in 2004 and 2007, respectively,
“Decentralized receding horizon control and coordination of autonomous both in control science and engineering, and the
vehicle formations,” IEEE Trans. Control Syst. Technol., vol. 16, no. 1, Ph.D. degree in aerospace engineering from Missis-
pp. 19–33, Jan. 2008. sippi State University, Starkville, in 2011.
[25] J. Shin and H. J. Kim, “Nonlinear model predictive formation flight,” He is currently a Post-Doctoral Research Fellow
IEEE Trans. Syst., Man, Cybern., A, Syst. Humans, vol. 39, no. 5, pp. with the Department of Electrical Engineering and
1116–1125, Sep. 2009. Computer Science, University of Central Florida,
[26] D. S. Bernstein, “Nonquadratic cost and nonlinear feedback control,” Orlando. His current research interests include co-
Int. J. Robust Nonlinear Control, vol. 3, no. 3, pp. 211–229, 1993. operative control of multiagent systems, unmanned aerial vehicle formation
[27] W. M. Haddad and V. Chellaboina, Nonlinear Dynamical Systems and control, trustworthy networked systems, and sensor networks.
Control: A Lyapunov-Based Approach. Princeton, NJ: Princeton Univ. Dr. Wang is a Senior Member of the American Institute of Aeronautics and
Press, 2000. Astronautics.
[28] W. Ren and R. W. Beard, Distributed Consensus in Multivehicle Coop-
erative Control. London, U.K.: Springer-Verlag, 2008.
[29] P. K. Menon and G. D. Sweriduk, “Optimal strategies for free-flight
air traffic conflict resolution,” J. Guidance, Control, Dynamics, vol. 22,
no. 2, pp. 357–368, Mar. 1999.
[30] P. K. Menon, “Short-range nonlinear feedback strategies for aircraft
pursuit-evasion,” J. Guidance, Control, Dynamics, vol. 12, no. 1, pp. Ming Xin (M’02) received the B.S. and M.S.
202–211, 1989. degrees from the Nanjing University of Aeronautics
[31] G. Lafferriere, A. Williams, J. Caughman, and J. J. Veerman, “Decen- and Astronautics, Nanjing, China, in 1993 and 1996,
tralized control of vehicle formations,” Syst. Control Lett., vol. 54, no. 9, respectively, both in automatic control, and the Ph.D.
pp. 899–910, 2005. degree in aerospace engineering from the Missouri
[32] J. N. Wang and M. Xin, “Multiagent consensus algorithm with obstacle University of Science and Technology (formerly
avoidance via optimal control approach,” Int. J. Control, vol. 83, no. 12, University of Missouri-Rolla), Rolla, in 2002.
pp. 2606–2621, Dec. 2010. He is currently an Associate Professor with the
[33] Y. J. Xu, “Nonlinear robust stochastic control for unmanned aerial Department of Aerospace Engineering, Mississippi
vehicles,” J. Guidance, Control, Dynamics, vol. 32, no. 4, pp. 1308– State University, Starkville. His current research
1319, Jul.–Aug. 2009. interests include nonlinear optimal control theory,
[34] “Advisory circular,” U.S. Department of Transportation and Federal guidance and control of aerospace vehicles, control of multi-agent systems,
Aviation Administration, Washington, DC, Tech. Rept. AC 120-28D, neural networks, and nonlinear estimation/filtering.
Jul. 1999. Dr. Xin was a recipient of the National Science Foundation CAREER Award
[35] D. M. Stipanovic, P. F. Hokayem, M. W. Spong, and D. D. Siljak, in 2009. He is an Associate Fellow of the American Institute of Aeronautics
“Cooperative avoidance control for multiagent systems,” J. Dynamic and Astronautics (AIAA) and a member of the AIAA Atmospheric Flight
Syst., Meas. Control, vol. 129, no. 5, pp. 699–707, 2007. Mechanics Technical Committee.