Professional Documents
Culture Documents
Abstract—A cooperative control approach for autonomous ve- based on the trade-off between different control objectives is
hicles is developed in order to perform different complex traffic the main advantages of MPC [3]. Therefore, it is an appropriate
maneuvers, e.g., double lane-switching or intersection situations. scheme for complex and constrained environments, and is
The problem is formulated as a distributed optimal control problem
for a system of multiple autonomous vehicles, and then solved using well suited to the complex traffic situations considered in this
a nonlinear Model Predictive Control (MPC) technique, where the paper.
distributed approach is used to make the problem computation- Recently, formulating a cooperative control design among
ally feasible in real-time. To provide safety, a collision avoidance multiple agents assigned to a specific task that can navigate au-
constraint is introduced, also in a distributed way. In the proposed tonomously without collision has received significant attention,
method, each vehicle computes its own control inputs using esti-
mated states of neighboring vehicles. In addition, a compatibility see e.g., [4]. Further, distributed control among multi-vehicle
constraint is defined that takes collision avoidance into account but systems can improve tractability and provide better scalability
also ensures that each vehicle does not deviate significantly from than centralized approaches, see [5] and the recent survey [6].
what is expected by neighboring vehicles. The method allows us to These methods have less communication and computation cost
construct a cost function for several different traffic scenarios. The compared to centralized methods, see e.g., [7], and therefore,
asymptotic convergence of the system to the desired destination
is proven, in the absence of uncertainty, and disturbances, for a they are appealing to be used for control of a system that contains
sufficiently small MPC control horizon. Simulation results show multiple autonomous vehicles.
that the distributed algorithm scales well with increasing number When more than one autonomous vehicle work in the same
of vehicles. area, the problem of vehicle collision has to be considered in
Index Terms—Autonomous vehicles, collision avoidance, control design in order to provide safety of passengers. Re-
cooperative vehicle systems, model predictive control (MPC). cently, many researchers have worked on improved safety in
transportation systems, see e.g., [8], [9] and our conclusion is
I. INTRODUCTION that it is important to consider the non-holonomic property of
vehicles in the control design. In addition, it is also impor-
UTONOMOUS vehicle technologies have the potential
A to fundamentally change the transportation system with a
possibility to reduce energy consumption, pollution, accidents
tant to predict future behavior of other vehicles to be able to
guarantee safety for the entire maneuver among autonomous
vehicles.
and congestion costs, see e.g., [1]. In addition, grouping vehicles In a previous paper [10], a first attempt was made to handle
leads to a reduction of space used by vehicles on the road, and this situation by a collision avoidance cost in an MPC criterion.
can significantly address the above traffic issues. Consequently, In comparison, in our now proposed method, a collision avoid-
the concept of cooperation and control algorithms that apply to ance constraint is introduced in a predictive way that takes the
multiple autonomous vehicle systems have become an important uncertain deviation between the actual predictive and estimated
topic for researchers in the automation and control field. trajectories of vehicles into consideration. This constraint to-
Among the existing motion planning methods, see [2], path gether with a compatibility constraint and the tuning parameters
planning approaches that are based on optimization, more ex- for the terminal region guarantee convergence and collision
plicitly Model Predictive Control (MPC), have been successful avoidance properties. The compatibility constraint motivated
in autonomous driving. Handling different limitations of both by [11]–[14] has an important role in guaranteeing both collision
the state and input control signals while planning maneuvers avoidance and the convergence of the system. The predictive
collision avoidance feature of our proposed method is one of
Manuscript received December 20, 2019; revised April 15, 2020 and August the contributions, and makes our method differ from some of
5, 2020; accepted September 14, 2020. Date of publication September 21, 2020;
date of current version May 25, 2021. The research was partly supported by the existing works that use MPC for multi-vehicle autonomous
the Linnaeus Center CADICS, funded by the Swedish Research Council, and driving, e.g., [10], [13].
partly by the Knut and Alice Wallenberg Foundation. (Corresponding author: There exist different traffic scenarios that if they can be per-
Fatemeh Mohseni.)
The authors are with the Department of Electrical Engineering, Linköping formed properly by autonomous vehicles, then different traffic
University, SE-58183 Linköping, Östergötland, Sweden (e-mail: fatemeh. issues, e.g., accidents and congestion costs can be reduced signif-
mohseni@liu.se; erik.frisk@liu.se; lars.nielsen@liu.se). icantly. One of the challenging scenarios in autonomous vehicle
Color versions of one or more of the figures in this article are available online
at https://ieeexplore.ieee.org. technology that is also considered in this work is intersection ma-
Digital Object Identifier 10.1109/TIV.2020.3025484 neuvers, see, e.g., [15]. In [16], by using Lagrangian formulation,
2379-8858 © 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
300 IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, VOL. 6, NO. 2, JUNE 2021
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
MOHSENI et al.: DISTRIBUTED COOPERATIVE MPC FOR AUTONOMOUS DRIVING IN DIFFERENT TRAFFIC SCENARIOS 301
To limit the scope of the work and focus on the controls; where t ∈ [tk , tk + hp ]. The first two terms in (3) are defined for
this work does not consider the uncertainty in sensing and performing the maneuver, and the last term is the terminal cost
assumes that reliable positional information from cameras, lidars which is needed to guarantee the convergence of the system. This
or radars are available. In addition, communication delays are terminal cost is a continuous differentiable function, satisfying
not taken into account and only forward paths are of interest. g(0) = 0 and g(P a (t)) > 0 for any P a (t) = 0, and is defined
Further, the mission is defined by a higher-level task planner. as g = i=1 gi = ni=1 pai (tk + hp ; tk )2 . Furthermore, (4)
n
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
302 IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, VOL. 6, NO. 2, JUNE 2021
− (γzi pai (τ ; zi (tk ))2 + γui uai (τ, zi (tk ))2 ) (12)
where γzi , γui and γti are weighting scalars given a priori. Since
γti , γzi and γui are positive values, (12) implies that Ωi is
invariant for the closed-loop nonlinear system żi = fi (zi , uki )
controlled by the local terminal feedback controller. In addition,
any trajectory of zik starting in Ωi stays in Ωi and converges to the
desired path. It is also assumed that for all zi ∈ Ωi , uki is the fea-
Fig. 2. An illustration of the optimal and estimated control signals.
sible stabilizing feedback, i.e., ui (τ ; zi (tk )) ∈ U . The condition
(12) means that the terminal cost γti pai (tk + hp ; tk )2 bounds
the system cost for any initial state starting in the terminal set.
This condition is based on the definition of the terminal
over [tk , tk + hp ]. With these estimates, the vehicle has all the
dynamic feedback controller and convergence property of that
information needed for solving its local optimal control problem.
following a similar approach as in [22].
The trajectories that each vehicle predicts for its neighbors are
Remark 1: Note that finding the associated positively in-
called the estimated trajectories. Further, before time t = tk
variant terminal-state region is not an easy task and is still a
each vehicle has transmitted its control signal computed at tk−1
challenging issue, even for control of one single nonholonomic
for the time interval [tk , tk−1 + hp ]. This control signal will be
vehicle, [23]. There exist some methods in the literature for
used as the first part of the estimated controller when the local
finding this region, e.g., solving an optimization problem to
DCMPC problems are solved at time t = tk . In addition, since
determine the borders of the region. However, following any
the control signal for the time interval [tk−1 + hp , tk + hp ] is not
of the existing methods does not result in a unique terminal
known, a stabilizing feedback controller is used for the second
state region for a given system. The size of the region depends
part of the estimated controller.
on the selection of the terminal controller parameters and the
Definition 2 (Estimated control): At every time instant τ ∈
non-linearity of the system to be controlled.
[tk , tk + hp ], the estimated control for each vehicle is defined
by the optimal predictive input and a terminal controller, i.e.,
D. Compatibility and Collision Avoidance Constraints
u∗i (τ ; tk−1 ), τ ∈ [tk , tk−1 + hp ]
ûi (τ ; tk ) = (8) In the centralized method, the controller calculates the trajec-
i (τ ; tk ), τ ∈ [tk−1 + hp , tk + hp ]
uK tory of all vehicles at the same time by considering the collision
In Fig. 2, the red curve is the optimal control signal com- avoidance constraint (5). But, in the distributed method, each
puted at time t = tk−1 . Then, when computing the optimal vehicle uses the estimated states of its neighbors and not all the
control input at time t = tk , each vehicle needs to know the vehicles, in the collision avoidance constraint. So, if any vehicle
trajectory of its neighboring vehicles also for the time interval moves away too far from its estimated trajectory, it may collide
[tk−1 + hp , tk + hp ] which is unknown. Therefore, the terminal with other vehicles. Therefore, inspired from [13] and [14], an
additional constraint called the compatibility constraint is added
controller uK i (τ ; tk ), a dynamic feedback controller here found
via standard full-state linearization, see for example [20], [21], to the DCMPC problem of each vehicle which ensures that each
is used to compute an estimate of the optimal trajectories, the vehicle does not deviate too far from the trajectory expected by
green curve in Fig 2. The dashed line indicates the estimated its neighbors.
control signal and contains a part of the red curve and the In other words, when the predictive trajectory is planned over
terminal controller. Finally, the resulting optimal controller for the prediction horizon τ ∈ [tk , tk + hp ], the collision avoidance
the whole interval [tk , tk + hp ] can be then computed which is must be guaranteed. Under ideal conditions, each vehicle i con-
the blue curve. According to the estimated control, the estimated siders the collision avoidance constraint (7) over [tk , tk + hp ].
predictive trajectory is given by However, the actual predictive trajectory pj (τ ; tk ) is unavailable
for vehicle i at tk , and the corresponding estimated predictive
zi∗ (τ ; tk−1 ), τ ∈ [tk , tk−1 + hp ] trajectory p̂j (τ ; tk ) is used instead. So, the influence of the
ẑi (τ ; tk ) = (9)
ziK (τ ; tk ), τ ∈ [tk−1 + hp , tk + hp ] uncertain deviation between the estimated and actual predictive
trajectories of vehicle i, ζi (τ ; tk ) = pi (τ ; tk ) − p̂i (τ ; tk ), on the
where ziK (τ ; tk ) is the estimated terminal trajectory. collision avoidance cannot be ignored. Therefore, a compati-
Definition 3 (Terminal set): For every vehicle i, and for bility constraint is proposed to handle the uncertain deviation.
ξi (tk ) ∈ (0, ∞), and τ = tk + hp , the terminal set is The distributed collision avoidance constraint is denoted by
Ωi (tk ) := {zi : zia (τ ; tk ) ≤ ξi (tk )}. (10) gcolij (pi , p̂j ) ≤ 0, where p̂j is the estimated path of the j th
neighbor. Constraints on the uncertain deviation ζi (τ ; tk ) is
Assumption 1: For every vehicle, the largest ξi (tk ) > 0 and imposed in each individual optimization problem.
the terminal feedback controller gains are chosen such that The compatibility constraint is important for guaranteeing the
pi (τ ; zi (tk )) − pj (τ ; zi (tk )) ≥ 2Di + Φi , ∀j ∈ Ni , (11) trajectory to converge to the desired target neighborhood and
avoiding collision, and is defined as
d
γti gi (pai (τ ; zi (tk ))) ≤ gcomi (ζi (τ ; tk )) = ζi (τ ; tk ) − ηi (tk ) ≤ 0 (13)
dt
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
MOHSENI et al.: DISTRIBUTED COOPERATIVE MPC FOR AUTONOMOUS DRIVING IN DIFFERENT TRAFFIC SCENARIOS 303
where ηi (tk ) = minj∈Ni {ηij (tk )} and ẑ˙j (t; zj (tk )) = f (ẑj (t; zj (tk )), ûj (t; zj (tk ))) (20)
minτ {p̂i (τ ; tk ) − p̂j (τ ; tk )} gcomi (ζi (τ ; tk ) ≤ 0) (21)
ηij (tk ) = − Di (14)
2
gcolij (pi (τ ; tk ), p̂j (τ ; tk )) ≤ 0 (22)
for τ ∈ [tk , tk + hp ] and j ∈ Ni . In (13), ηi (tk ) is introduced
to guarantee collision avoidance under uncertainty ζi . By using zi (tk + hp ; tk ) ∈ Ωi (tk ). (23)
the triangle inequality property for (13), we have
The constraint (19) is added because estimated states are
pi (τ ; tk ) − pj (τ ; tk ) needed in the compatibility constraint (21). In addition, (20) is
the kinematic model for the j th neighbor and can be removed if
= pi (τ ; tk ) − p̂j (τ ; tk ) − ζj (τ ; tk ) the cars communicate the estimated states. The main difference
≥ pi (τ ; tk ) − p̂j (τ ; tk ) − ηi (tk ). (15) between the centralized and distributed formulations defined in
Problems 1 and 2 is that the complexity inherent with the cen-
The constraint (7) can then be guaranteed by tralized approach is reduced by defining n smaller problems, one
pi (τ ; tk ) − p̂j (τ ; tk ) ≥ 2Di + Φi + ηi (tk ), ∀j ∈ Ni (16) for each vehicle, where only the connections between any given
vehicle and its neighbors are included, and these n problems
Thus the collision avoidance constraint is can be solved in parallel. The optimal solution for each DCMPC
problem is denoted by u∗i (τ ; tk ) and the optimal state trajectory
gcolij (pi (τ ; tk ), p̂j (τ ; tk )) =
for the ith vehicle is denoted by zi∗ (τ ; tk ), τ ∈ [tk , tk + hp ].
2Di + Φi + ηi (tk ) − pi (τ ; tk ) − p̂j (τ ; tk ) ≤ 0. (17) Assumption 2: For each vehicle i, it is assumed that the opti-
mal solution to Problem 2 exists and is numerically obtainable,
Therefore, if for all τ ∈ [tk , tk + hp ] the compatibility con-
and the position of each vehicle i is bounded.
straint (13) and the collision constraint (17) are satisfied, then
by implementing u∗i (τ ; tk ), the collision avoidance (7) will be
guaranteed over the time interval τ ∈ [tk , tk+1 ]. Compared with F. Convergence Results
the compatibility constraints proposed in [13] and [14], the Now, convergence is investigated and it is shown that the
constraint defined here also considers collision avoidance. closed-loop position of all vehicles converges to a neighborhood
of the desired destination P f . Note that here convergence does
E. DCMPC Control Design not imply optimality. First, it is proved that initial feasibility
To reformulate the centralized optimization problem into a of the implementation of the DCMPC algorithm provides sub-
distributed one including n different problems and to ensure sequent feasibility. Then, by considering the sum of all the
that each vehicle does not move away too far from the trajectory individual optimal cost functions, used as a Lyapunov function,
expected by its neighbors, a compatibility constraint is added to the convergence of the path of the vehicles to the destinations
each problem. Here, it is assumed that prediction horizon (hp ) will be proved similar to [11], [13], [24], [25].
and control horizon (hc ) are identical for all vehicles and are The set of initial states for which Problem 2 is feasible at
chosen such that hp covers the slowest system mode; and hc , initialization is denoted by Zσ . Feasibility of the optimization
to be shorter than the fastest system’s mode. In that way, the problem means that for every vehicle, there exists at least one
controller can react timely to disturbances. control input such that the objective function Ji is bounded
Let (10) define the terminal set at instant tk , (13) the com- and all the constraints are satisfied. In addition, subsequent
patibility constraint, and (17) the collision avoidance constraint. feasibility means that zi (:, tk ) ∈ Zσ implies zi (:, tk+1 ) ∈ Zσ .
The DCMPC problem can now be stated as follows. Lemma 1: Under Assumptions 1–2 for a system of n au-
Problem 2: For each vehicle i ∈ {1, . . . , n} at any MPC tonomous vehicles with model (1), by assuming that the predic-
update time tk , k ∈ N, and with given ẑj (t; tk ), zi (tk ), tion horizon has been tuned such that the Problem 2 is feasible
ẑi (t; tk ) for all τ ∈ [tk , tk + hp ], where j ∈ Ni , for given at initial time t0 , and assuming that there exist at least one
weighting scalars γzi , γui and γti find Ji∗ (tk , pai , p̂i , uai ) = feasible solution for the system, then by applying the input
minui (.) Ji (tk , pai , p̂i , uai ), where ui (τ ) = u∗i (τ ; tk ), τ ∈ [tk , tk+1 ), for a sufficiently small control
horizon hc , the optimization Problem 2 has a feasible solution
tk +hp
at any tk , k > 0.
Ji (tk , pai , p̂i , uai ) = γzi pai (τ ; tk )2 Proof: By considering that Def. 1, and Assumptions 1– 2 are
tk
satisfied, in a similar way as [11], the recursive feasibility is
+ γui uai (τ ; tk )2 dτ + γti gi (pai (tk + hp ; tk )) proven by induction.
(18) Let ui (.; tk ) and zi (.; tk ) denote the feasible control and state
subject to solution at time tk . The estimated control ûi (.; tk+1 ), defined
in (8), is a candidate control that can steer zi (tk+1 ; tk ) to the
żi (t; zi (tk )) = f (zi (t; zi (tk )), ui (t; zi (tk ))) terminal region Ωi in time tk+1 + hp . Under Assumption 2,
ui (τ ; zi (tk )) ∈ U , zi (τ ; zi (tk )) ∈ Z since there are no disturbances and full state measurement is
available, then for each vehicle zi (tk+1 ) = zi∗ (tk+1 ; tk ). There-
ẑ˙i (t; zi (tk )) = f (ẑi (t; zi (tk )), ûi (t; zi (tk ))) (19) fore, the estimated control ûi (.; tk+1 ) defined in (8) can be a
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
304 IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, VOL. 6, NO. 2, JUNE 2021
tk +hp
candidate control as the solution of the optimal control problem
at time tk+1 with initial condition zi (tk+1 ; tk+1 ) that can steer + γt g(P̂ (tk+1 + hp ; tk+1 )) −
a
γz P a∗ (τ ; tk )2
tk
zi (tk+1 ; tk ) to the terminal region in time tk+1 + hp .
In addition, from Assumption 1, the terminal region is invari- + γu ua∗ (τ ; tk )2 dτ − γt g(P a∗ (tk + hp ; tk )).
ant for the system model controlled with the terminal controller,
Denote, P a∗ (tk + hp ; tk ) = P̂ a (tk + hp ; tk+1 ). Then
and the control and terminal constraint remain feasible from
tk +hp tk +hp
the property of the terminal controller. Furthermore, the com-
patibility constraint is satisfied since zi (.; tk+1 ) = ẑi (.; tk+1 ) γz P̂ a (τ ; tk+1 )2 = γz P a∗ (τ ; tk )2 ;
tk+1 tk+1
for all i and ζi (τ ; tk+1 ) = p̂i (τ ; tk+1 ) − pi (τ ; tk+1 ) = 0, and
according to Assumption 1, (13), and (16), it holds that for by using Definition 2. Therefore,
[tk , tk+1 ], p∗i (τ ; tk ) − p∗j (τ ; tk ) ≥ 2Di + Φi . Consequently, V (tk+1 ) − V (tk ) ≤
according to Definition 1 and Assumption 1, it holds that tk +hp
γz P a∗ (τ ; tk )2 + γu ua∗ (τ ; tk )2 dτ +
2Di + Φi ≤ p̂i (τ ; tk+1 ) − p̂j (τ ; tk+1 ) tk+1
tk+1 +hp
p∗i (τ ; tk ) − p∗j (τ ; tk ), τ ∈ [tk+1 , tk + hp ],
= ∗
K∗
γz P̂ a (τ ; tk+1 )2 + γu ûa (τ ; tk+1 )2 dτ −
pKi (τ ; tk ) − pj (τ ; tk ), τ ∈ [tk + hp , tk+1 + hp ], tk +hp
tk+1
which guarantees the collision avoidance constraint for all t ∈ γz P a∗ (τ ; tk )2 + γu ua∗ (τ ; tk )2 dτ −
[tk+1 , tk+1 + hp ]. Therefore, the predictive input ui (τ ; tk+1 ) = tk
ûi (τ ; tk+1 ) is a feasible solution at time tk+1 . tk +hp
Theorem 1: Suppose that for a system of n autonomous γz P a∗ (τ ; tk )2 + γu ua∗ (τ ; tk )2 dτ +
tk+1
vehicle with dynamics (1), Assumptions 1–2 are satisfied and
the Problem 2 has a feasible solution for each vehicle at the γt g(P̂ a (tk+1 + hp ; tk+1 )) − γt g(P̂ a (tk + hp ; tk+1 )).
initial instant t0 . By solving the optimization Problem 2 at each
update time tk , k ≥ 0, by applying the input ui (τ ) = u∗i (τ ; tk ), Furthermore, by using (12),
τ ∈ [tk , tk+1 ) of each vehicle, the position tracking adaption V (tk+1 ) − V (tk ) ≤
vector of the system converges to zero. tk +hc
Proof: The proof follows from Lemma 1, Definition 1, − γz P a∗ (τ ; tk )2 + γu ua∗ (τ ; tk )2 dτ. (24)
Assumptions 1- 2, and using the sum of all n cost functions tk
as a Lyapunov criterion, i.e., V (tk ) = i∈n Ji (tk , pa∗
i , p̂i , ui ),
a∗
Since γz and γu are positive scalars, (24) becomes
∗
of the closed loop system Ż(t) = F (Z(t), U (t)). Following tk +hc
the arguments in [11], the function V (tk ) satisfies V (0) = 0 V (tk+1 ) − V (tk ) ≤ − γz P a∗ (τ ; tk )2 dτ (25)
and V (Z a (tk )) > 0 for Z a = 0. The difference between the tk
Lyapunov function for two consecutive update instants is for any hc ∈ (0, hp ]. Therefore,
t
V (tk+1 ) − V (tk ) V (t) − V (tk ) ≤ − γz P a∗ (τ )2 dτ (26)
tk
∗ ∗
= J (z(.; tk+1 ), u(.; tk+1 )) − J (z(.; tk ), u(.; tk ))
for any t ∈ [tk , tk+1 ]. By applying this recursively, it is implied
∗
Since u and û both are feasible controls and since u is the ∗ that for any times t1 , t2 , where t0 ≤ t1 < t2 ≤ ∞, along the
optimal control input, the cost of applying u∗ to the system is trajectory of the closed-loop system starting from any Z0 = Zσ ,
less than the total cost of applying û. So, the optimal cost function satisfies
t2
V (tk+1 ) − V (tk ) V (t2 ) − V (t1 ) ≤ − γz P a (τ )2 dτ. (27)
t1
tk+1 +hp
≤ γz P̂ a (τ ; tk+1 )2 + γu ûa (τ ; tk+1 )2 dτ By induction, (26) states that the optimal cost function is
tk+1 non-increasing for any hc ∈ (0, hp ]. Since by assumption J ∗
tk +hp is bounded at time t0 , then V (t0 ) < μ, and as a result V (t) < μ,
+ γt g(P̂ a (tk+1 + hp ; tk+1 )) − γz P a∗ (τ ; tk )2 ∀t ≥ t0 , where μ is positive. Following the argument in Theo-
tk rem 1 in [11], there exists a constant ρ > 0 such that P a (t0 ) ≤
+ γu ua∗ (τ ; tk )2 dτ − γt g(P a∗ (tk + hp ; tk )) ρ implies V (t0 ) < μ which results in V (t) < μ ⇒ P a (t) <
tk +hp where > 0. By using (27) and induction,
∞
= γz P̂ a (τ ; tk+1 )2 + γu ûa (τ ; tk+1 )2 dτ V (∞) − V (t0 ) ≤ − γz P a (τ )2 dτ (28)
tk+1
t0
tk+1 +hp
and since V (t0 ) < μ and γz > 0, by using Barbalat’s lemma it
+ γz P̂ a (τ ; tk+1 )2 + γu ûa (τ ; tk+1 )2 dτ
tk +hp is guaranteed that P a (t) → 0 as t → ∞.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
MOHSENI et al.: DISTRIBUTED COOPERATIVE MPC FOR AUTONOMOUS DRIVING IN DIFFERENT TRAFFIC SCENARIOS 305
Note that all the conditions that have been used to prove
the convergence are sufficient but not necessary conditions. It
would be useful to have an analytic expression for validating
convergence of the method, but since the state and control
vectors that have been used are implicitly defined in (18), it
is difficult to analyze the method in other ways. However, in the
next section, the method has been tested and the problem has
been solved in a numerical way to show how the DCMPC method
achieves the control objective. In addition, there are many other
factors, e.g., disturbances, communication delays that could be
considered for practical application of the proposed method.
These factors are considered as future work.
V. SIMULATION RESULTS
The objective of this section is to assess the properties of
the distributed algorithm and illustrate performance in three
different traffic scenarios, “Cooperative lane-merging maneu-
ver,” “Double lane-switching,” and “Cooperative intersection”.
In addition, the effect of the optimization horizon hp on the
performance of the system is investigated. Also, the effect of
different values of the control horizon hc on the robustness of
Fig. 3. The cooperative lane-merging maneuver on highway scenario. Fig. (b)
the method is investigated in a situation where a non-autonomous illustrates the trajectories, (c) shows the distances between representative sample
vehicle is included, and then the difference between the com- pairs of vehicles and (d) is the heading angle during the maneuver.
plexity and computational time of the centralized (CCMPC)
and the distributed (DCMPC) methods are illustrated. In basic
discrete time receding horizon control, the first control is applied consecutive vehicles respectively. Assume that vehicles with odd
and the remaining controls are discarded and then optimization index numbers are initially moving in the construction site lane
is redone at each sampling instant. The DCMPC optimization (group 1) and the ones with even index numbers are in the other
problem is formulated in continuous time and therefore, the lane. The location of the construction site is Pcons = (0, 180)T .
problem is discretized before given to a numerical solver. So, The initial state vector of each vehicle in group 1 is set as
the number of optimization variables increases linearly with the
z(2k1 −1) = zs,g1 + (k1 − 1)zd,g1 , k1 = 1, . . . , ng1
optimization horizon hp and to reduce controller computational
complexity a control horizon hc ∈ (0, hp ] has been introduced. where zs,g1 = (0, 0, π/2, 0)T , zd,g1 = (0, Ds , 0, 0)T and ng1 is
Controls in the time interval [tk , tk + hc ] where tk is the MPC the number of vehicles in group 1. The initial state vector for
update time are used before re-optimizing. Thus, a smaller hc vehicles that are moving in the other lane (group 2), is
means solving more optimization problems and thereby hc can
be used to tune computational effort. A smaller hc results in a z(2 k2 ) = zs,g2 + (k2 − 1)zd,g2 , k2 = 1, . . . , ng2
more agile controller being able to react faster to disturbances where zs,g2 = (Dr , 0, π/2, 0)T and zd,g2 = (0, Ds , 0, 0)T .
and changes in the environment. The number of optimization In order to merge the vehicles before the construction site,
variables grows linearly with the prediction horizon hp . Thus, hc the final desired position of vehicles is formulated as a function
and hp controls computational effort and disturbance rejection of time in order to let vehicles keep moving on the middle of
and this is shown with numerical results in this section. the lane, continuing regular driving, after finishing the merging
For simplicity, for all scenarios, it is assumed that all vehicles maneuver. The desired destination for each vehicle is
have the same dynamics (1). In order to solve the optimization
problem, the Casadi framework [26] in MATLAB has been f Pcons + (Dr , (i − 1)Df )T , if |xi (tk ) − Dr | ≥ Dr /2
Pi (t) =
used. Two of the scenarios, “Cooperative lane-merging maneu- Pcons + (Dr , (i − 1)Df )T + tvi (tk ), otherwise.
ver” and the “Cooperative intersection maneuver” are inspired
The results for this scenario are shown in Fig. 3. Sub-figure 3-
from GCDC [27].
(a) shows the three phases related to the scenario, starting,
merging and the end of the scenario. In sub-figures (b) and (d),
A. Cooperative Lane-Merging Maneuver the blue and red curves correspond to group 1 and 2 respectively.
In this highway scenario, multiple vehicles in different lanes Figures 3-(b) and 3-(c) illustrate the trajectories and distances
receive a message from a construction site ahead that the left between pairs of vehicles. Thus, during the maneuver the dis-
lane is blocked up the road. The vehicles communicate to merge tance between vehicles is more than the 6[m] predefined safe
into a single lane before the site, see Fig. 3-(a). Let Dr , Ds distance, which means that there is no collisions during the ma-
and Df be the horizontal distance between the two lanes in the neuver where the vehicles merge successfully. Sub-figure 3-(d)
road, the initial and the final desired distance between every two illustrates the heading angle of the vehicles during the maneuver.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
306 IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, VOL. 6, NO. 2, JUNE 2021
pf,g2 +(k2 −(ng1 +1))pfd,g2 , if|xk2 (tk )−Dr | ≥ Dr /2
pfk2 =
pf,g2 + (k2 − (ng1 + 1))pfd,g2 + tvk1 (tk ), otherwise
Fig. 4. The double lane-switching scenario. Fig. (b) illustrates the trajectories, where, k1 = 1, . . . , ng1 , pf,g1 = (Dr , 200), pfd,g1 = (0, Df ),
(c) shows the distances between sample representative pairs of vehicles and k2 = ng1 + 1, . . . , ng1 + ng2 , pf,g2 = (0, 200), pfd,g2 =
(d) is the heading angle during the maneuver.
(0, Df ) and ng2 = 3. The results are shown in Fig. 4 where
sub-figure 4-(a) illustrates the three phases related to the
scenario, starting, lane change and the end of the scenario.
B. Double Lane-Switching Scenario In sub-figures (b) and (d), the blue curves correspond to
group 1 and the red curves to group 2. Figures 4-(b) and 4-(c)
In this scenario, vehicles are moving in different lanes and illustrate the trajectories and distance between some sample
aim to switch lanes, see Fig. 4. This is a complex and difficult pairs of vehicles and show that the vehicles change their lanes
scenario since many paths are intersecting. In regular traffic, successfully. Note that all cars, both in front and at the back, can
when it comes to busy areas like highways where the incidents adjust their velocity depending on the vehicle model, distance of
may happen while changing from one lane to the other. In each car to the destination, and the objective function weights.
the US, more than 200,000 reported lane-change crashes occur As shown in the figure, there is no collision between vehicles.
annually in recent years; resulting in at least 60,000 injuries [28]. Sub-figure 4-(d) illustrates the heading angle of the vehicles.
Therefore, it is interesting to handle lane changes using au- Furthermore, Fig. 5 illustrates the compatibility constraint,
tonomous vehicles. Thus, it is fruitful to have a method for gcomi , for one vehicle from each group. Note that at some
multiple vehicles, e.g., platoons, that controls the vehicles in points gcomi = 0 and for these points (14) for at least one
a cooperative way and let them switch lanes in a safe way. neighbor of vehicle i is zero and as a result ηi (tk ) is equal to
Initial and desired final positions for each vehicle is de- zero. Therefore, the uncertain deviation between the estimated
fined and the DCMPC algorithm is applied in the double lane- and actual predictive trajectories of vehicle i is zero. These
switching maneuver. Assume that vehicles with index numbers points correspond to time instances at which the vehicles were
i = 1, . . . , 4 are initially moving on the left side of the road close to their neighbors and therefore, the deviation of pi from
(group 1) and i = 5, . . . , 7 are moving on the right side. The its estimated value must be small in order to avoid collision.
initial state vector of each vehicle in group 1 is set as
C. Intersection Maneuver Including Non-Autonomous Vehicle
zk1 = zs,g1 + (k1 − 1)zd,g1 , k1 = 1, . . . , ng1
In MPC, as the control horizon hc decreases, usually the
rejection of unknown disturbances improves. In order to in-
where zs,g1 = (0, 0, π/2, 0), zd,g1 = (0, Ds , 0, 0), and ng1 and
vestigate the sensitivity of the DCMPC method to changes
Ds are defined as before. The initial state vector for vehicles that
of hc , the cooperative intersection scenario is investigated in
are moving in the other lane (group 2), is defined as
this subsection by adding a non-autonomous vehicle to the
system.
zk2 = zs,g2 + (k2 − (ng1 + 1))zd,g2 This scenario demonstrates an intersection coordination ac-
tivity where one non-autonomous vehicle is approaching a busy
where k2 = ng1 + 1, . . . , ng1 + ng2 , zs,g2 = (Dr , 0, π/2, 0), road. It intends to do a left turn onto the road and approaching
zd,g2 = (0, Ds , 0, 0) and ng2 = 3. vehicles collaborate to allow it a comfortable and safe passage,
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
MOHSENI et al.: DISTRIBUTED COOPERATIVE MPC FOR AUTONOMOUS DRIVING IN DIFFERENT TRAFFIC SCENARIOS 307
Fig. 8. (a) The average of computational time of DCMPC with different pre-
diction horizon for each vehicle and for each iteration over the entire intersection
maneuver and for hc = 1 sec, and (b) the heading angle.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
308 IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, VOL. 6, NO. 2, JUNE 2021
REFERENCES
[1] A. Vahidi and A. Sciarretta, “Energy saving potentials of connected
and automated vehicles,” Transp. Res. Part C: Emerg. Technol., vol. 95,
pp. 822–843, 2018.
Fig. 9. Normalized Computational time versus number of vehicles. [2] X. Hu, L. Chen, B. Tang, D. Cao, and H. He, “Dynamic path planning
for autonomous driving on various roads with avoidance of static and
moving obstacles,” Mech. Syst. Signal Process., vol. 100, pp. 482–500,
2018.
[3] A. Veksler, T. A. Johansen, F. Borrelli, and B. Realfsen, “Dynamic posi-
simulations show that a balanced trade-off between optimal- tioning with model predictive control,” IEEE Trans. Control Syst. Technol.,
ity performance and computation time is provided by using vol. 24, no. 4, pp. 1340–1353, Jul. 2016.
distributed MPC. [4] R. M. Murray, “Recent research in cooperative control of multivehicle
systems,” J. Dynamic Syst. Meas. Control, vol. 129, no. 5, pp. 571–583,
For comparison, the DCMPC and CCMPC controller were 2007.
both applied to scenarios with 2 − 7 vehicles. All the simulations [5] C. Wang and C. J. Ong, “Distributed model predictive control of dynam-
have been done using Matlab R2018b on an Intel Core i7-3770, ically decoupled systems with coupled cost,” Autom., vol. 46, no. 12,
pp. 2053–2058, 2010.
3.40 GHz processor, and CentOS Linux 64-bit machine. The [6] D. Ding, Q.-L. Han, Z. Wang, and X. Ge, “A survey on model-
results showed that when the system contains more than 2 vehi- based distributed control and filtering for industrial cyber-physical
cles, the DCMPC provides a considerably lower computational systems,” IEEE Trans. Ind. Inf., vol. 15, no. 5, pp. 2483–2499,
May 2019.
time compared to the CCMPC. In Fig. 9, the average normalized [7] N. Motee and A. Jadbabaie, “Optimal control of spatially distributed
controller computation time of both controllers is illustrated. The systems,” IEEE Trans. Autom. Control, vol. 53, no. 7, pp. 1616–1629,
computational time for the centralized controller is divided with Aug. 2008.
[8] R. Zheng, K. Nakano, S. Yamabe, M. Aki, H. Nakamura, and Y. Suda,
the number of vehicles to be able to compare the centralized “Study on emergency-avoidance braking for the automatic platooning of
with the distributed controller. One time unit in the figure is trucks,” IEEE Trans. Intell. Transp. Syst., vol. 15, no. 4, pp. 1748–1757,
2.63 seconds and the corresponding computational time in the Aug. 2014.
[9] L. Xu, L. Wang, G. Yin, and H. Zhang, “Communication information
distributed case is 0.2 seconds. As a real-time indicator, the structures and contents for enhanced safety of highway vehicle platoons,”
control horizon, hc , for the MPC controller is 1 second. Compu- IEEE Trans. Veh. Technol., vol. 63, pp. 4206–4220, Nov. 2014.
tational times are presented in normalized form mainly for two [10] F. Mohseni, E. Frisk, J. Åslund, and L. Nielsen, “Distributed model
reasons: the implementation is not optimized for computational predictive control for highway maneuvers,” IFAC-PapersOnLine, vol. 50,
no. 1, pp. 8531–8536, 2017.
performance and to keep the presentation as implementation [11] H. Chen and F. Allgower, “A quasi-infinite horizon nonlinear model
independent as possible. The computational time for CCMPC in predictive control scheme with guaranteed stability,” Autom., vol. 34,
our problem is approximately a quadratic function of the number no. 10, pp. 1205–1217, 1998.
[12] E. Camponogara, D. Jia, B. H. Krogh, and S. Talukdar, “Distributed
of vehicles, while for the distributed is approximately a linear model predictive control,” IEEE Control Syst., vol. 22, no. 1, pp. 44–52,
function. Since the size of the distributed optimal problem for Feb. 2002.
each vehicle does not increase too much by increasing the num- [13] W. B. Dunbar and R. M. Murray, “Distributed receding horizon control
for multi-vehicle formation stabilization,” Automatica, vol. 42, no. 4,
ber of vehicles, the computational time for DCMPC is a linear pp. 549–558, Apr. 2006.
function of the number of vehicles and clearly, the DCMPC algo- [14] B. Ding, L. Xie, and W. Cai, “Distributed model predictive control for
rithm is computationally advantageous for the larger number of constrained linear systems,” Int. J. Robust Nonlinear Control, vol. 20,
no. 11, pp. 1285–1298, 2010.
vehicles. [15] S. A. Fayazi and A. Vahidi, “Mixed-integer linear programming for optimal
scheduling of autonomous vehicle intersection crossing,” IEEE Trans.
Intell. Veh., vol. 3, no. 3, pp. 287–299, Sep. 2018.
VI. CONCLUSION [16] Y. Bichiou and H. A. Rakha, “Real-time optimal intersection control
system for automated/cooperative vehicles,” Int. J. Transp. Sci. Technol.,
In this paper, a cooperative control approach for autonomous vol. 8, no. 1, pp. 1–12, 2019.
vehicles was proposed to address traffic scenarios involving [17] K. Dresner and P. Stone, “A multiagent approach to autonomous intersec-
several autonomous vehicles. The problem was formulated as tion management,” J. Artif. Intell. Res., vol. 31, pp. 591–656, 2008.
[18] C. Samson, “Control of chained systems application to path following
a distributed optimal control problem and the proposed formu- and time-varying point-stabilization of mobile robots,” IEEEA. Isidori,
lation and constraints provide a collision-free trajectory for each Nonlinear Control Syst., 3rd ed. Springer-Verlag, (1995). Trans. Autom.
vehicle. This formulation has a generality property in the sense Control, vol. 40, no. 1, pp. 64–77, Jan. 1995.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.
MOHSENI et al.: DISTRIBUTED COOPERATIVE MPC FOR AUTONOMOUS DRIVING IN DIFFERENT TRAFFIC SCENARIOS 309
[19] T. Fraichard and A. Scheuer, “From Reeds and Shepp’s to continuous- Fatemeh Mohseni was born in Tehran, Iran, in 1984.
curvature paths,” IEEE Trans. Robot., vol. 20, no. 6, pp. 1025–1035, She received the M.Sc. degree in electrical engineer-
Dec. 2004. ing from the Amirkabir University of Technology
[20] A. De Luca, G. Oriolo, and C. Samson, Feedback Control a Nonholonomic (formerly Tehran Polytechnic), Tehran, Iran, in 2012.
Car-Like Robot. Berlin, Heidelberg: Springer Berlin Heidelberg, 1998, Since 2015, she has been working toward the Ph.D.
pp. 171–253. degree with the Department of Electrical Engineer-
[21] A. Isidori, Nonlinear Control Syst.. Springer Science & Business Media, ing, Linköping University, Linköping, Sweden.
2013.
[22] D. Mayne, J. Rawlings, C. Rao, and P. Scokaert, “Constrained model
predictive control: Stability and optimality,” Automatica, vol. 36, no. 6,
pp. 789–814, 2000.
[23] D. Gu and H. Hu, “Receding horizon tracking control of wheeled mobile Erik Frisk was born in Stockholm, Sweden, in 1971.
robots,” IEEE Trans. Control Syst. Technol., vol. 14, no. 4, pp. 743–749, He received the Ph.D. degree in 2001. He is currently
Jul. 2006. a Professor with the Department of Electrical Engi-
[24] H. Genceli and M. Nikolaou, “Robust stability analysis of constrained l1- neering, Linköping University, Linköping, Sweden.
norm model predictive control,” AIChE J., vol. 39, no. 12, pp. 1954–1965, His main research interest has been model-based fault
1993. diagnostics and prognostics. Another main area of
[25] H. Michalska and D. Q. Mayne, “Robust receding horizon control of research is optimization techniques for autonomous
constrained nonlinear systems,” IEEE Trans. Autom. Control, vol. 38, vehicles in complex traffic scenarios.
no. 11, pp. 1623–1633, Nov. 1993.
[26] J. A. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl, “Casadi:
A software framework for nonlinear optimization and optimal control,” Lars Nielsen was born in Sweden, in 1955. He re-
Math. Program. Comput., vol. 11, no. 1, pp. 1–36, 2019. ceived the Ph.D. degree in automatic control from
[27] J. Ploeg et al., “Cooperative automated maneuvering at the 2016 grand Lund University, Lund, Sweden, in 1985. Since 1992,
cooperative driving challenge,” IEEE Trans. Intell. Transp. Syst., vol. 19, he has been the Sten Gustafsson Professor of Vehicu-
no. 4, pp. 1213–1226, Apr. 2018. lar Systems with Linköping University, Linköping,
[28] G. Fitch, S. Lee, S. Klauer, J. Hankey, J. Sudweeks, and T. Dingus, Sweden. His research interests include automotive
“Analysis of lane-change crashes and near-crashes,” US Dept. Transp., control, diagnosis, and autonomy.
Nat. Highway Traffic Saf. Admin., 2009.
Authorized licensed use limited to: Universidad de Concepcion. Downloaded on September 12,2021 at 19:47:51 UTC from IEEE Xplore. Restrictions apply.