(W) Xiao2015

IET Control Theory & Applications
Research Article
ISSN 1751-8644
Fast gradient-based distributed Received on 22nd May 2014
Revised on 8th December 2014
optimisation approach for model predictive Accepted on 17th December 2014
doi: 10.1049/iet-cta.2014.0549
control and application in four-tank www.ietdl.org
benchmark
Xiaojun Zhou1 , Chaojie Li2 ,Tingwen Huang3 , Mingqing Xiao4
1
School of Information Science and Engineering, Central South University, Changsha 410083, People’s Republic of China
2
School of Electrical and Computer Engineering, RMIT University, Melbourne VIC 3000, Australia
3
Texas A & M University at Qatar, Doha 5825, Qatar
4
Department of Mathematics, Southern Illinois University, Carbondale, Illinois 62901, USA
E-mail: cjlee.cqu@163.com
Abstract: By taking both control and state vectors as decision variables, the subproblems of model predictive control
scheme can be considered as a class of separable convex optimisation problems with coupling linear constraints. A
Lagrangian dual method is introduced to deal with the optimisation problem, in which, the primal problem is solved by a
parallel coordinate descent method, and a fast dual ascend method is adopted to solve the dual problem iteratively. The
proposed approach is applied to the well-known hierarchical and distributed model predictive control four-tank benchmark.
Experimental results have testified the effectiveness of the proposed approach and shown that the benchmark problem
can be well stabilised.
1 Introduction Distributed optimisation is a kind of decomposition method for

solving distributed MPC problems, in which, a large optimisation
Model predictive control (MPC) is a kind of control method, in problem is decomposed into a number of smaller and more tractable
which, the current control vector is obtained by solving a finite ones. Decomposition methods can be divided into two main cate-
horizon open-loop optimal control problem, namely, a constrained gories: primal and dual decompositions. In primal decomposition,
optimisation problem. At each sampling period, using current state the optimisation problem is solved using the original formula-
of the plant as initial state, the optimisation problem yields an tion and variables, while the constraints are handled with methods
optimal control sequence and the first control in this sequence is like interior-point, penalty function, feasible direction, coordinate
applied to the plant [1]. The big advantage of this type of con- descent and Newton methods [9–12]. In dual decomposition, the
trol is its ability to achieve optimal control performance and to Lagrangian dual is introduced, and the main idea is to solve the
cope with hard constraints on both control and state vectors. In primal and dual problems alternatively [13–15]. A limitation of
recent years, the emergence of large-scale systems, such as power classical gradient-based methods based on Lagrangian duality is
systems, process control systems, multi-agent systems and trans- the low convergence rate. Fortunately, in [16, 17], it is shown that
portation systems, has challenged the applicability of traditional an accelerated gradient algorithm can be constructed using only the
MPC because of the high computational burden and low control first-order information. Furthermore, when the objective function is
performance [2–6]. separable (a function is called separable if it is a sum of functions
There exist three MPC strategies for designing such large-scale of its individual variables), it is able to construct parallel methods
systems, namely, distributed MPC, centralised MPC and decen- for solving the primal problem [18].
tralised MPC, of which, the distributed MPC is arguably the most The goal of this paper is to solve a benchmark MPC problem
promising one because it beats the decentralised MPC one in using a fast distributed optimisation approach. The contribution
control performance and outperforms the centralised MPC one in of the paper is three-fold: (i) the optimisation formulation which
computational burden [5]. From a centralised control point of view, enables parallel implementation is established for the hierarchi-
all subsystems are optimised with respect to an objective function cal and distributed (HD)-MPC benchmark, and to the best of our
in a single optimisation problem, making plantwide control difficult knowledge, this is the first time, since in traditional optimisa-
to coordinate and maintain. The key feature of a decentralised con- tion formulation for the benchmark, the objective function is not
trol framework is that there is no communication among different separable and thus hinders the parallel implementation; (ii) a fast
local controllers. It is well-known that strong interactions among distributed optimisation approach is proposed for the subproblems
different subsystems may impede achieving stability and desired in the MPC scheme, which combines a fast dual method with a par-
performance when using decentralised control. In distributed MPC, allel coordinate method for primal problem; (iii) the convergence
some level of communication may be established among differ- properties are proven for both the parallel coordinate method and
ent controllers to achieve better closed-loop control performance. the fast dual ascend method (FDAM). It should be noted that the
Distributed MPC can be non-cooperative or cooperative, depend- parallel coordinate descent algorithm in [11] cannot be appropri-
ing on whether the same global objective function is optimised in ately applied to the benchmark problem since it only deals with
each local controller. A controller is considered to have a non- simple box constraints. The accelerated dual gradient-projection
cooperative attitude if it only seeks to minimise its own objective algorithm in [13] has the similar computational complexity to this
function. On the other hand, if it aims to minimise not only its study for dual problem, but the parallel mechanism for primal prob-
own local cost but the system-wide (or global) cost, it is called a lem is lost. Although we have adopted the similar parallel algorithm
cooperative controller. A recent review of distributed MPC can be used in [15], the proposed FDAM is much simpler than that dual
found in [7, 8] and the references therein. gradient method, which, moreover, is an inexact dual method, and
IET Control Theory Appl., 2015, Vol. 9, Iss. 10, pp. 1579–1586
© The Institution of Engineering and Technology 2015 1579
cannot reflect the true physical balance relationship between the Table 1 Parameters of the benchmark plant
state and the control input in state equation.
Parameter Value Description
This paper is organised as follows. In Section 2, we give a brief
description of the HD-MPC benchmark problem. In Section 3, we h1max 1.36 m maximum level of tank 1
show that the subproblems of the MPC scheme can be reformulated h2max 1.36 m maximum level of tank 2
as a class of separable convex optimisation problems with cou- h3max 1.30 m maximum level of tank 3
pling linear constraints. Furthermore, a Lagrangian dual method h4max 1.30 m maximum level of tank 4
hmin 0.20 m minimum level of all cases
is introduced to deal with the constrained optimisation problem, qamax 3.26 m3 /h maximum flow of qa
in which, the primal problem is solved by a parallel coordinate qbmax 4 m3 /h maximum flow of qb
descent method, and a FDAM is adopted to solve the dual prob- qmin 0 m3 /h minimum flow of qa , qb
lem iteratively. Some experimental results are given to show the a1 1.31 × 10−4 m2 discharge constant of tank 1
effectiveness of the proposed approach in Section 4, followed by a2 1.51 × 10−4 m2 discharge constant of tank 2
the conclusion in Section 5. a3 9.27 × 10−5 m2 discharge constant of tank 3
a4 8.82 × 10−5 m2 discharge constant of tank 4
S 0.06 m2 cross section of all tanks
γa 0.3 ratio of the three-way valve
2 Problem description γb 0.4 ratio of the three-way valve
h10 0.65 m linearisation level of rank 1
The HD-MPC four-tank benchmark [19, 20] is widely used to h20 0.66 m linearisation level of rank 2
test, evaluate and compare different approaches in control system h30 0.65 m linearisation level of rank 3
design, because of its interesting properties: (i) there exists strong h40 0.66 m linearisation level of rank 4
coupling between subsystems and the degree of coupling can be
qa0 1.63 m3 /h linearisation flow of qa
conveniently manipulated; (ii) there exhibit non-linear dynamics in
the plant; (iii) the states can be measured; (iv) the plant is subject qb0 2.00 m3 /h linearisation flow of qb
to hard states and inputs constraints; and (v) the plant can be safely
operated.
As illustrated in Fig. 1, the four-tank plant has been divided into where hi , ai , i = 1, . . . , 4, denote the water level and the discharge
two subsystems which are coupled through the inputs. The tanks constant of tank i, respectively; qj , γj , j ∈ {a, b} represent the flow
at the top (tanks 3 and 4) discharge into the corresponding tanks and the ratio of the three-way valve of pump j, respectively; S is the
at the bottom (tanks 1 and 2). The three-way valves are emulated cross section of each tank, and g is the gravitational acceleration
by a proper calculation of the set-points of the flow control loops (g = 9.8 m2 /s in this study).
according to the considered ratio of the three-way valve. Thus, the By linearising the model at an operating point given by the equi-
inlet flows of the three-way valves qa and qb can be considered as librium levels and flows as shown in Table 1, and defining the
the manipulated variables of the real plant. deviation variables
The system is governed by the following differential equations
xi = hi − h0i , i = 1, . . . , 4
dh1 a1 a3 γa
=− 2gh1 + 2gh3 + qa uj = qj − qj0 , j ∈ {a, b} (2)
dt S S S
dh2 a2 a4 γb then we can obtain the continuous-time linear model as follows
=− 2gh2 + 2gh4 + qb
dt S S S
(1) ẋ = Ac x + Bc u
dh3 a3 1 − γb (3)
=− 2gh3 + qb y = Cc x
dt S S
dh4 a4 1 − γa where x = (x1 , x2 , x3 , x4 ), u = (ua , ub ), y = (x1 , x2 ) and
=− 2gh4 + qa
dt S S ⎡ 1 ⎤
1
− 0 0
⎢ τ1 τ3 ⎥
⎢ 1 1 ⎥
⎢ 0 − 0 ⎥
⎢ τ4 ⎥
Ac = ⎢
⎢
τ2
1
⎥
⎥ (4)
⎢ 0 0 − 0 ⎥
⎢ τ ⎥
⎣ 3
1⎦
0 0 0 −
τ4
⎡ γ ⎤
a
0
⎢ S γb ⎥
⎢ ⎥

⎢ 0 ⎥
⎢ S ⎥ 1 0 0 0
Bc = ⎢ ⎥
1 − γb ⎥ , C c = (5)
⎢ 0 0 1 0 0
⎢ S ⎥
⎣1 − γ ⎦
a
0
S
where
S 2h0i
τi = , i = 1, . . . , 4
ai g
is the time constant of tank i.
Using the ZOH (zero-order hold) method with a sampling period
of 5 s, the discrete-time model is obtained in the following

x(k + 1) = Ax(k) + Bu(k)
(6)
Fig. 1 Schematic diagram of the four-tank process y(k) = Cx(k)
1580 © The Institution of Engineering and Technology 2015
where Remark 1: It should be noted that by the traditional elimination
method, the corresponding objective function in the optimisation
⎡ ⎤
0.9705 0 0.0207 0 problem cannot be separable. A function is separable means that
⎢ 0 0.9663 0 0.0195⎥ it can be written as a sum of functions of its individual variables.
A =⎣
0 0 0.9790 0 ⎦ Owing to the inseparability, distributed optimisation approaches
0 0 0 0.9802 cannot be applied to this formulation.
⎡ ⎤
24.6291 0.5213

⎢ −0.1967 32.7684⎥ 1 0 0 0
B=⎣ ⎦ , C= 3 Distributed optimisation approach
0 49.4735 0 1 0 0
−19.8011 0
3.1 Dual decomposition
By using an MPC scheme, an online finite horizon open-loop For simplicity and compactness, we use xk−1 to represent x(k +
optimal control problem can be described as follows i|k) and uk−1 to represent u(k + i|k), the constrained finite horizon
linear quadratic optimisation problem (7) can be rewritten as

N −1
1
min J (k, u) = x(k + i + 1|k)2Q N

2 1 1 T
i=0 min J (x, u) = xkT Qxk + uk−1 Ruk−1
2 2
1 k=1
+ u(k + i|k)2R
2 s.t. xk = Axk−1 + Buk−1
s.t. x(k + i + 1|k) = Ax(k + i|k) + Bu(k + i|k) u ≤ uk ≤ u
x(k|k) = x(k) x ≤ xk ≤ x
T
x = [hmin , hmin , hmin , hmin ] ≤ x(k + i|k) x0 = x0 = x(k|k) (12)
T
≤ [h1max , h2max , h3max , h4max ] = x
Let zk = txkT , uk−1
T ]T , then (12) can be expressed as
u = [qmin , qmin ]T ≤ u(k + i|k) ≤ [qamax , qbmax ]T
= u, i = 0, 1, . . . , N − 1 (7)
N
1 T
min f (z) = z Hzk
z∈ 2 k
where u = [u(k|k)T , u(k + 1|k)T , . . . , u(k + N − 1|k)T ]T are the k=1
decision variables; Q 0, R 0 are strictly positive definite sym- s.t. Sz = b (13)
metric weighting matrices; and N is the length of prediction
horizon. At time k, u(k) = u(k|k) is implemented and the opti-
misation problem (7) is repeated at time k + 1. The goal of the where z = [z1T , z2T , . . . , zNT ]T ,
=1 × 2 · · · × N = {z|[xT ,
Q 0 T
MPC scheme is to achieve stabilisation of the discrete system at uT ]T ≤ zk ≤ [xT , uT ]T }, H = , b = (Ax0 )T 0 · · · 0
the origin by solving the optimal control problem [1]. 0 R
Considering that and
⎡ I −B 0 0 ··· 0 0 0 0 ⎤
x(k + i + 1|k) = Ax(k + i|k) + Bu(k + i|k)
⎢−A 0 I −B ··· 0 0 0 0 ⎥
x(k|k) = x(k) (8) S=⎢
⎣ .. .. .. .. .. .. .. .. .. ⎥
⎦
. . . . . . . . .
0 0 0 0 · · · −A 0 I −B
we have
⎡ ⎤ ⎡ ⎤ Partitioning the matrix S conformably as
x(k + 1|k) u(k|k)
⎢ x(k + 2|k) ⎥ ⎢ u(k + 1|k) ⎥
⎣ ··· ⎦ = S̄ ⎣ ··· ⎦ + T̄ x(k|k) (9) S = [S1 , . . . , SN ]
x(k + N |k) u(k + N − 1|k)

so that Sz = N k=1 Sk zk , the Lagrangian associated with the
where optimisation problem (13) can be expressed as
⎡ ⎤ ⎡ ⎤
B 0 ··· 0 A N

⎢ AB B · · · 0⎥ ⎢ A2 ⎥ 1
⎢ ⎥ ⎢ ⎥ L(z, λ) = zkT Hzk + λT Sk zk − (1/N )λT b (14)
S̄ = ⎢ .. .. .. .⎥, T̄ = ⎢ . ⎥ (10) 2
⎣ . . . .. ⎦ ⎣ .. ⎦ k=1
AN −1 B AN −2 B ··· B AN
where λ are the Lagrange dual (or multiplier) variables.
We use a primal-dual scheme to solve the constrained optimi-
By eliminating state variables x(k + i + 1|k), i = 0, . . . , N − 1, we sation problem (13). For a fixed λ, the primal (also called inner)
can get the traditional optimisation formulation for MPC problem subproblem can be defined by
as follows
1 T z(λ) = arg min L(z, λ) (15)

min J (k, u) = u H̄ u + ḡ T u z∈
u∈U 2
s.t. b̄l ≤ S̄u + T̄ x(k|k) ≤ b̄u (11) and the dual (also called outer) problem is
here H̄ = S̄ T Q̄S̄ + R̄, ḡ = (S̄ T Q̄T̄ )x(k|k), R̄ = diag{R, · · · , R}, max g(λ) (16)
Q̄ = diag{Q, · · · , Q}, U indicates the constraints imposed on flow
level, b̄l and b̄u indicate the constraints imposed on water level. where g(λ) = minz∈ L(z, λ) is the dual function.
Since the constraints are affine, it is easy to find that strong
duality holds, that is Parallel coordinate descent method (PCDM)
∀ k = 1, . . . , N , compute in parallel:
max g(λ) = g ∗ = f ∗ Step 0. Take zk0 ∈ k
Step i. (i > 0)

N repeat
1 T
= min f (z) = z Hzk Sz = b (17)
z∈ 2 k
k=1 Lk
z̄ki = arg min ∇k LT (z i , λ)(zk − zki ) + zk − zki 22
zk ∈k 2
which indicates that the constrained optimisation problem (13) can (22)
be solved based on a primal-dual algorithm framework [18] as 1 i N −1 i
follows zki+1 = z̄ + zk (23)
N k N
z i+1 = arg min L(z, λi ), (18a) until zki+1 − zki ≤

z∈
λi+1 = arg max g(λ, z i+1 ) (18b)

Remark 2: The inner problem can be solved in parallel since for a
3.2 Parallel algorithm for primal problem fixed λ, minimising ( 12 zkT
Hzk
+ λT Sk
zk
− (1/N )λT b) is indepen-
dent of minimising ( 12 zkT
Hzk
+ λT Sk
zk
− (1/N )λT b), ∀ k

, k
∈
As shown in (14) and (15), when the dual variables λ are fixed, {1, 2, . . . , N }, and they can be computed simultaneously.
then the objective function is separable when minimising the
inner problem (15) and thus solving the primal problem can be Introducing the following measure to calculate the distance
done in parallel. Distributed processors can deal with the N opti- between current iterative and the optimal solution
misation subproblems simultaneously, and they share the same
dual variable. The coordinator is to broadcast the dual variables
N
information to each processor and gather the results obtained by di2 = z i − z ∗ 21 = Lk zki − zk∗ 22 (24)
these distributed processors iteratively. In the following, a linearly k=1
convergent parallel coordinate descent method is introduced.
Let us partition the identity matrix conformably as where z ∗ is the optimal solution of (15) and zk∗ = (E k )T z ∗ , then
we have the following lemma.
I = [EkT , . . . , ENT ]T
Lemma 1: If L(z, λ) is coordinate-wise Lipschitz continuous with
constant Lk , using the algorithm PCDM for iterative update, with

such that z = N k=1 Ek zk , and define the partial gradient ∇k L(z, λ)
the distance defined in (24), then
of ∇L(z, λ) with respect to z as ∇k L(z, λ) = EkT ∇L(z, λ). It is
2
obvious to find that the gradient of L(z, λ) is coordinate-wise di+1 + 2L(z i+1 , λ) ≤ di2 + 2L(z i , λ)
Lipschitz continuous 2
+ ∇LT (z i , λ)(z ∗ − z i ) (25)
N
∇k L(z + Ek hk , λ) − ∇k L(z, λ) ≤ Lk hk
Proof:
where Lk = σmax (H ) is the largest singular value of H , and it is

N 2

easy to deduce that
2
di+1 = Lk zki+1 − zk∗
2
k=1
2
Lk
N
1 i N −1 i
L(z + Ek hk , λ) ≤ L(z, λ) + ∇k LT (z, λ)hk + hk 22 (19) = Lk z̄
N k + z k − z ∗
k
2 N 2
k=1
Introduce the following norm 1

z̄ i − z i 2
= di2 + Lk
N2 k k 2
k=1

N
z21 = Lk zk 21 2 i T i
(20) + z̄k − zki zk − zk∗
k=1 N
N

1 2
z̄ i − z i 2
and it is not difficult to check that L(z, λ) is strongly convex with = di2 + Lk − k k 2
N2 N
respect to z under the above norm, that is k=1

2 i T i
+ z̄k − zki z̄k − zk∗
L(z 1 , λ) ≥ L(z 2 , λ) + ∇LT (z 2 , λ)(z 1 − z 2 ) N
N
ρ
+ z 1 − z 2 21 ∀z 1 , z 2 ∈ 1 2
2
(21) ≤ di2 + Lk − z̄ki − zki 2
N
k=1

where ρ = {[σmin (H )]/[σmax (H )]} ≤ 1, here, σmin (H ) is the small- 2 i
i T i ∗

+ z̄ − zk z̄k − zk
est singular value of H . N k
Considering that for any given λ, the optimal solution of inner
problem (15) can be computed separably in parallel via the follow- The optimality condition for (22) implies that
ing coordinate update, which is a parallel version of the coordinate
descent method in [21] and further developed in [15]. [∇k LT (z i , λ) + Lk (z̄ki − zki )]T (zk − z̄ki ) ≥ 0, ∀zk ∈ k
Let zk∗ = zk to invoke the above inequality, we have namely
ρ 2
Lk (z̄ki − zki )T (z̄ki − zk∗ ) ≤ ∇k LT (z i , λ)(zk∗ − z̄ki ) L(z i , λ) − L(z ∗ , λ) + d ≥ ρdi2
2 i
Invoking (21) again with z 1 = z ∗ , z 2 = z i , we have
To continue, we can obtain
ρ 2

L(z ∗ , λ) ≥ L(z i , λ) + ∇LT (z i , λ)(z ∗ − z i ) + d

N
1 i 2 i
2
di+1 ≤ di2 + Lk − z̄ − zki 22
N k namely
k=1
ρ 2
2 T i ∗ i ∇LT (z i , λ)(z i − z ∗ ) ≥ L(z i , λ) − L(z ∗ , λ) + d ≥ ρdi2
+ ∇k L (z , λ)(zk − z̄k ) 2 i
N
N
or
2 Lk i
= di2 − z̄k − zki 22 + ∇k LT (z i , λ)(z̄ki − zki ) ρ 2
N 2 −∇LT (z i , λ)(z i − z ∗ ) ≤ −L(z i , λ) + L(z ∗ , λ) − d
k=1 2 i
2
N
≤ −ρdi2
+ ∇k LT (z i , λ)(zk∗ − zki )
N
k=1 Utilising Lemma 1, we can obtain
1 2
Due to the fact that d + L(z i+1 , λ) − L(z ∗ , λ)
2 i+1
1 i 1
z i+1 = z i + (z̄ − z i ) ≤ di2 + L(z i , λ) − L(z ∗ , λ)
N 2
1 i 1
N
= [z + Ek (z̄ki − zki )] − ∇LT (z i , λ)(z i − z ∗ )
N N
k=1
Defining β = [2ρ/(1 + ρ)] ∈ [0, 1], we have
and invoking (19) with z = z i , hk = z̄ki − zki , by the convexity of 1 2
L(z, λ) with respect to z, we have d + L(z i+1 , λ) − L(z ∗ , λ)
2 i+1
1
≤ di2 + L(z i , λ) − L(z ∗ , λ)
1
N
2
L(z i+1 , λ) ≤ L(z i + Ek (z̄ki − zki ), λ)
N 1 ρ
k=1 − β[L(z i , λ) − L(z ∗ , λ) + di2 ] + (1 − β)ρdi2
N 2
N

1 β 1 2
= L(z i , λ) + ∇k LT (z i , λ)(z̄ki − zki ) = 1− di + L(z , λ) − L(z ∗ , λ)
i
N N 2
k=1

Lk i i 2 Applying the above inequality iteratively, we can obtain the result
+ z̄ − zk 2
2 k consequently.
N
1 3.3 Fast gradient method for dual problem

= L(z i , λ) + ∇k LT (z i , λ)(z̄ki − zki )
N
k=1
Now, we need to calculate the gradient of the dual function
Lk i
+ z̄k − zki 22
2 g(λ) = min L(z, λ) = f (z) + λT h(z)
z∈
Utilising the above inequality, we can get the one of our main
where f (z) = Nk=1 zk Hzk , h(z) = Sz − b.
T
results consequently. Denote z(λ) = arg minz∈ L(z, λ), then we have
Theorem 1: Under the conditions of Lemma 1 and the condition ∇g(λ) = ∇z T (λ)∇f (z(λ)) + h(z(λ))
expressed in (21), the algorithm PCDM has the following linear
convergence + ∇z T (λ)∇hT (z(λ))λ

= ∇z T (λ) ∇f (z(λ)) + ∇hT (z(λ))λ) + h(z(λ))

β i
L(z i , λ) − L(z ∗ , λ) ≤ 1 − = h(z(λ))
N

1 2 owing to the following optimality conditions for z(λ):
× d0 + L(z 0 , λ) − L(z ∗ , λ) (26)
2
∇f (z(λ)) + ∇hT (z(λ)) = 0
2ρ
where β = 1+ρ ∈ [0, 1]. In the following, a FDAM is proposed, which resembles the
method in [22], in which, it proposed a fast gradient method which
can be used to accelerate the convergence of any method based on
Proof: Invoking (21) with z 1 = z i , z 2 = z ∗ , we have traditional gradient approach. However, the fast gradient method
in [22] is used to accelerate the convergence of computing pri-
ρ 2
L(z i , λ) ≥ L(z ∗ , λ) + d mal problem; while the proposed FDAM in this study is used to
2 i accelerate the convergence of computing dual problem.
On the other hand
Fast dual ascend method (FDAM)
Step 0. Take some η > 1, λ1 , and set λ̄1 = λ1 , α1 > 0, t1 = L(z i+1 , λi+1 ) − L(z i , λ̄i )
1 = f (z i+1 ) + (λi+1 )T (Sz i+1 − b)
Step i. (i ≥ 1),
repeat − f (z i ) − (λ̄i )T (Sz i − b)
z i = arg min{L(z, λ̄i )} (PCDM) (27) ≥ (z i+1 − z i )T ∇f (z i ) + (λi+1 )T (Sz i+1 − b)
z∈
while(1) − (λ̄i )T (Sz i − b)
≥ −(z i+1 − z i )T (S T λ̄i ) + (λi+1 )T (Sz i+1 − b)
λi+1 = λ̄i + αi (Sz i − b) (28)
− (λ̄i )T (Sz i − b)
zi+1 = arg min L(z, λi+1 ) (PCDM) (29)
z∈
= −(λ̄i − λi+1 )T (Sz i+1 − b)
if = −(λ̄i − λi+1 )T [(Sz i − b) − S(z i − z i+1 )]
1
(λi+1 − λ̄i )T S(z i − z i+1 ) ≤ λi+1 − λ̄i 22 1 i+1
2αi = λ − λ̄i 22 + (λ̄i − λi+1 )T S(z i − z i+1 )
αi
break
1
end if ≥ λi+1 − λ̄i 22
2αi
ai = ηai (30)
end while Combing the above two inequalities, we can get the result conse-
quently.

1+ 1 + 4ti2
ti+1 = (31) Lemma 3: The sequence {z i , λi } generated by FDAM satisfies
2
ti − 1 i+1 2αi ti2 vi − 2αi+1 ti+1
2
vi+1 ≥ ui+1 22 − ui 22 (35)
λ̄i+1 = λi+1 + (λ − λi ) (32)
ti+1
where vi = L(z ∗ , λ∗ ) − L(z i+1 , λi+1 ), ui = ti λi+1 − (ti − 1)λi − λ∗ .
until λi+1 − λi ≤
Proof: Invoking Lemma 2 with i = i + 1, and λ = λi+1 , we obtain
2αi+1 (vi − vi+1 ) ≥ λ̄i+1 − λi+2 22

Remark 3: In (27) and (29) of FDAM, the PCDM is used to solve + 2(λi+1 − λ̄i+1 )T (λ̄i+1 − λi+2 )
the primal problem. The condition (λi+1 − λ̄i )T S(z i − z i+1 ) ≤
2αi λ
1 i+1 − λ̄i 2 is used to make sure that the dual function is
2 Invoking Lemma 2 again with i = i + 1, and λ = λ∗ , we obtain
ascending.
−2αi+1 vi+1 ≥ λ̄i+1 − λi+2 22
Lemma 2: Let sequence {z i , λi } be generated by FDAM, and if + 2(λ∗ − λ̄i+1 )T (λ̄i+1 − λi+2 )
Multiplying the first inequality by ti2 and then adding it to the

1 2 − t 2 ), and using t 2 − t 2 = t
(λi+1 − λ̄i )T S(z i − z i+1 ) ≤ λi+1 − λ̄i 22 (33) second inequality by (ti+1 i i+1 i i+1 , we
2αi have
then for any dual feasible pair (z, λ), we have 2αi+1 (ti2 vi − ti+1
2
vi+1 ) ≥ ti+1 (λ̄i+1 − λi+2 )22
+ 2ti+1 (λ̄i+1 − λi+2 )
1
L(z i+1 , λi+1 ) − L(z, λ) ≥ λ̄i − λi+1 22 [(ti+1 − 1)λi+1 − ti+1 λ̄i+1 + λ∗ ]
2αi
1 Applying the relation
+ (λ − λ̄i )T (λ̄i − λi+1 ) (34)
αi
a − b22 + 2(a − b)T (b − c) = a − c22 − b − c22
Proof: On the one hand
with a = ti+1 λi+2 , b = ti+1 λ̄i+1 , c = (ti+1 − 1)λi+1 + λ∗ and by
the fact that {ai } is non-increasing, we have
L(z i , λ̄i ) − L(z, λ) = f (z i ) + (λ̄i )T (Sz i − b)
2αi ti2 vi − 2αi+1 ti+1
2
vi+1
− f (z) − λT (Sz − b)
≥ ti+1 λi+2 − (ti+1 − 1)λi+1 − λ∗ 22
≥ (z i − z)T ∇f (z) + (λ̄i )T (Sz i − b)
− ti+1 λ̄i+1 − (ti+1 − 1)λi+1 − λ∗ 22
− λT (Sz − b)
≥ −(z i − z)T (S T λ) + (λ̄i )T (Sz i − b) Since λ̄i+1 = λi+1 + [(ti − 1)/(ti+1 )](λi+1 − λi ), we have
− λT (Sz − b) ti+1 λ̄i+1 − (ti+1 − 1)λi+1 − λ∗ = ti λi+1 − (ti − 1)λi − λ∗

= −(λ − λ̄i )T (Sz i − b)
that is to say
1
= (λ − λ̄i )T (λ̄i − λi+1 )
αi 2αi ti2 vi − 2αi+1 ti+1
2
vi+1 ≥ ui+1 22 − ui 22
Lemma 4 [22]: Let {ai , bi } be positive sequences of reals satisfying (34)
≤ 2(λ̄1 − λ∗ )T (λ̄1 − λ2 )
ai − ai+1 ≥ bi+1 − bi , ∀i≥1 − λ̄1 − λ2 22 + λ2 − λ∗ 22
then, ai ≤ a1 + b1 , ∀ i ≥ 1. = λ̄1 − λ∗ 22 − λ2 − λ∗ 22 + λ2 − λ∗ 22

= λ1 − λ∗ 22
Lemma 5 [22]: The positive sequence {ti } generated by namely

1+ 1 + 4ti2 λ1 − λ∗ 22 2λ1 − λ∗ 22
ti+1 = L(z ∗ , λ∗ ) − L(z i+1 , λi+1 ) ≤ =
2 2αi ti2 αi (i + 1)2
with t1 = 1 satisfies ti ≥ (i + 1)/2, ∀ i ≥ 1.
Theorem 2: Let sequence {z i , λi } be generated by FDAM, then for Remark 4: From Theorem 2. it can be seen that the convergence
any i ≥ 1, we have rate for the proposed FDAM is O(1/i2 ) (i is the iteration counter).
Compared with traditional dual ascend method (DAM) (whose con-
2λ1 − λ∗ 22 vergence rate is O(1/i)) without accelerated strategy, the proposed
L(z ∗ , λ∗ ) − L(z i+1 , λi+1 ) ≤ FDAM has better convergence properties. Anyway, we have to
αi (i + 1)2
admit that at each iteration, the time consumption in FDAM is a
Proof: Invoking Lemma 4 with ai = 2αi ti2 vi , bi = ui 22 along with little more than that in DAM because of the additional computation.
Lemma 3, we have However, this factor plays much less important role in computing
speed than the time consumed in longer iterations.
2αi ti2 vi ≤ a1 + b1
= 2α1 [L(z ∗ , λ∗ ) − L(z 2 , λ2 )] + λ2 − λ∗ 22 4 Numerical results
In the experiment, it is assumed that the initial state x(0) =
0.5 [0.5, 0.5, 0.5, 0.5] and both Q and R are fixed as identity matri-
ces, then at time k = 1, 2, . . . , the proposed FDAM is used
to solve a sequence of constrained optimisation problems. To
0 make sure that (33) is always satisfied, we set α constant at
DAM [{σmin (H )}/{2σmax (S)}].
FDAM By simple calculation, it can be found that lower and upper
IDFG bound for state and control are as follows
0.5
−L(z, λ)
1 1
u= [−1.63; −2], u = [1.63; 2]
1
3600 3600
x = [−0.45; −0.46; −0.45; −0.46]
x = [0.71; 0.7; 0.65; 0.64]
1.5
For k = 1, we will first show that the proposed FDAM has faster
convergence rate than the corresponding DAM without accelerated
2
0 2000 4000 6000 8000 10000 strategy. Fig. 2 illustrates the iterative curves of the −L(z, λ) using
Iterations DAM, FDAM and IDFG (inexact dual fast gradient method) in [15]
(the feasibility tolerance in = 1 × 10−5 ), respectively, and it is
Fig. 2 Iterative curves of the −L(z, λ) using DAM, FDAM and IDFG shown that much fewer number of iterations are needed to obtained
x 10 4
0.6 2
0.5 1
0.4 x 0
1
x2
0.3 1
x3
Controls
States
x 2
0.2 4
u1
0.1 3 u
2
0 4
0.1 5
0.2 6
50 100 150 200 250 300 350 400 450 500 0 100 200 300 400 500
Time Time
Fig. 3 Evolution of states and control inputs with the length of prediction horizon N = 10 using FDAM
the global solution when using FDAM, while the performance of 3 Keviczky, T., Borrelli, F., Balas, G.J.: ‘Decentralized receding horizon control
the proposed FDAM is similar to IDFG but with much simper for large scale dynamically decoupled systems’, Automatica, 2006, 42, (12),
pp. 2105–2115
implementation since the updating of current dual variables needs 4 Baskar, L.D., De, S.B., Hellendoorn, H.: ‘Traffic management for automated
to use accumulated historical information in IDFG [15]. highway systems using model-based predictive control’, IEEE Trans. Intell.
The proposed FDAM is applied to solve the subproblems of Transp. Syst., 2012, 13, (2), pp. 838–847
MPC scheme sequentially, and only the first control input is used to 5 Li, H.P., Shi, Y.: ‘Distributed model predictive control of constrained nonlinear
systems with communication delays’, Syst. Control Lett., 2013, 62, (10), pp.
generate the next state. Repeatedly, the generated control inputs and 819–826
the states are illustrated in Fig. 3, which shows that the benchmark 6 Song, Y., Fang, X.S.: ‘Distributed model predictive control for polytopic uncer-
plant can be well stabilised using the proposed approach. tain systems with randomly occurring actuator saturation and packet loss’, IET
Control Theory Appl., 2014, 8, (5), pp. 297–310
7 Christofides, P.D., Scattolini, R., Muñoz de la Peña, D., Liu, J.F.: ‘Distributed
model predictive control: a tutorial review and future research directions’,
5 Conclusion Comput. Chem. Eng., 2013, 51, pp. 21–41
8 Negenborn, R., Maestre, J.: ‘Distributed model predictive control: an overview
and roadmap of future research opportunities’, IEEE Control Syst. Mag., 2014,
We have studied the HD-MPC four-tank benchmark using an 34, (4), pp. 87–97
MPC scheme, in which, a fast distributed optimisation approach 9 Camponogara, E., Scherer, H.F.: ‘Distributed optimization for model predictive
is proposed to solve the constrained optimisation subproblems. By control of linear dynamic networks with control-input and output constraints’,
introducing Lagrangian dual, the primal problem is solved by a IEEE Trans. Autom. Sci. Eng., 2001, 8, (1), pp. 233–242
10 Dunbar, W.B.: ‘Distributed receding horizon control of dynamically cou-
parallel coordinate descent method, and a FDAM is adopted to pled nonlinear systems’, IEEE Trans. Autom. Control, 2007, 52, (7),
solve the dual problem iteratively. The convergence rate of the pp. 1249–1263
proposed FDAM is given and its superiority to classical gradient- 11 Necoara, I., Clipici, D.: ‘Efficient parallel coordinate descent algorithm for con-
based method is illustrated. The experimental results have shown vex optimization problems with separable constraints: application to distributed
MPC’, J. Process Control, 2013, 23, (3), pp. 243–253
that the proposed approach can stabilise the benchmark plant. 12 Patrinos, P., Sopasakis, P., Sarimveis, H.: ‘A global piecewise smooth Newton
method for fast large-scale model predictive control’, Automatica, 2011, 47,
(9), pp. 2016–2022
13 Giselsson, P., Doan, M.D., Keviczky, T., Schutter, B.D., Rantzer, A.: ‘Acceler-
6 Acknowledgments ated gradient methods and dual decomposition in distributed model predictive
control’, Automatica, 2013, 49, (3), pp. 829–833
This publication was made possible by NPRP Grant # NPRP 4- 14 Patrinos, P., Bemporad, A.: ‘An accelerated dual gradient-projection algorithm
1162-1-181 from the Qatar National Research Fund (a member for embedded linear model predictive control’, IEEE Trans. Autom. Control,
2014, 59, (1), pp. 18–33
of Qatar Foundation), and it was also supported by the National 15 Necoara, I and Nedelcu, V.: ‘Rate analysis of inexact dual first-order methods
Science Found for Distinguished Young Scholars of China (Grant application to dual decomposition’, IEEE Trans. Autom. Control, 2014, 59, (5),
No. 61025015) and the Foundation for Innovative Research Groups pp. 1232–1243
of the National Natural Science Foundation of China (Grant No. 16 Nesterov, Y.: ‘A method of solving a convex programming problem with
convergence rate O(1/k 2 )’, Sov. Math. Dokl., 1983, 27, (2), pp. 372–376
61321003). The statements made herein are solely the responsibil- 17 Nesterov, Y.: ‘Smooth minimization of non-smooth functions’, Math. Program.,
ity of the authors. The authors thank the anonymous reviewers for 2005, 103, (1), pp. 127–152
their valuable comments and suggestions that helped improve the 18 Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: ‘Distributed optimiza-
quality of this paper. tion and statistical learning via the alternating direction method of multipliers’,
Found. Trends® Mach. Learn., 2011, 3, (1), pp. 1–122
19 Johansson, K.H.: ‘The quadruple-tank process: A multivariable laboratory pro-
cess with an adjustable zero’, IEEE Trans. Control Syst. Technol., 2000, 8, (3),
7 References pp. 456–465
20 Alvarado, I., Limon, D., Muñoz de la Peña, D. et al.: ‘A comparative analysis
1 Mayne, D.Q., Rawlings, J.B., Rao, C.V., Scokaert, P.O.M.: ‘Constrained of distributed MPC techniques applied to the HD-MPC four-tank benchmark’,
model predictive control: stability and optimality’, Automatica, 2000, 36, (6), J. Process Control, 2011, 21, (5), pp. 800–815
pp. 789–814 21 Nesterov, Y.: ‘Efficiency of coordinate descent methods on huge-scale opti-
2 Venkat, A.N., Hiskens, I.A., Rawlings, J.B., Wright, S.J.: ‘Distributed MPC mization problems’, SIAM J. Optim., 2012, 22, (2), pp. 341–362
strategies with application to power system automatic generation control’, IEEE 22 Beck, A., Teboulle, M.: ‘A fast iterative shrinkage-thresholding algorithm for
Trans. Control Syst. Technol., 2008, 16, (6), pp. 1192–1206 linear inverse problems’, SIAM J. Imaging Sci., 2009, 2, (1), pp. 183–202
Copyright of IET Control Theory & Applications is the property of Institution of Engineering
& Technology and its content may not be copied or emailed to multiple sites or posted to a
listserv without the copyright holder's express written permission. However, users may print,
download, or email articles for individual use.

(W) Xiao2015

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

(W) Xiao2015

Uploaded by

Copyright:

Available Formats

IET Control Theory & Applications

1 Introduction Distributed optimisation is a kind of decomposition method for

1 T z(λ) = arg min L(z, λ) (15)

z i+1 = arg min L(z, λi ), (18a) until zki+1 − zki ≤

λi+1 = arg max g(λ, z i+1 ) (18b)

− (1/N )λT b), ∀ k

where Lk = σmax (H ) is the largest singular value of H , and it is

Introduce the following norm 1

1 3.3 Fast gradient method for dual problem

2αi+1 (vi − vi+1 ) ≥ λ̄i+1 − λi+2 22

Multiplying the ﬁrst inequality by ti2 and then adding it to the

− λT (Sz − b) ti+1 λ̄i+1 − (ti+1 − 1)λi+1 − λ∗ = ti λi+1 − (ti − 1)λi − λ∗

then, ai ≤ a1 + b1 , ∀ i ≥ 1. = λ̄1 − λ∗ 22 − λ2 − λ∗ 22 + λ2 − λ∗ 22

Lemma 5 [22]: The positive sequence {ti } generated by namely

with t1 = 1 satisﬁes ti ≥ (i + 1)/2, ∀ i ≥ 1.

You might also like

(W) Xiao2015

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

(W) Xiao2015

Uploaded by

Copyright:

Available Formats

IET Control Theory & Applications

1 Introduction Distributed optimisation is a kind of decomposition method for

1 T z(λ) = arg min L(z, λ) (15)

z i+1 = arg min L(z, λi ), (18a) until zki+1 − zki ≤ 

λi+1 = arg max g(λ, z i+1 ) (18b)

− (1/N )λT b), ∀ k

where Lk = σmax (H ) is the largest singular value of H , and it is

Introduce the following norm 1  

1 3.3 Fast gradient method for dual problem

2αi+1 (vi − vi+1 ) ≥ λ̄i+1 − λi+2 22

Multiplying the ﬁrst inequality by ti2 and then adding it to the

− λT (Sz − b) ti+1 λ̄i+1 − (ti+1 − 1)λi+1 − λ∗ = ti λi+1 − (ti − 1)λi − λ∗

then, ai ≤ a1 + b1 , ∀ i ≥ 1. = λ̄1 − λ∗ 22 − λ2 − λ∗ 22 + λ2 − λ∗ 22

Lemma 5 [22]: The positive sequence {ti } generated by namely

with t1 = 1 satisﬁes ti ≥ (i + 1)/2, ∀ i ≥ 1.

You might also like

z i+1 = arg min L(z, λi ), (18a) until zki+1 − zki ≤

Introduce the following norm 1