Abstract—In this paper, a novel competitive swarm optimizer (CSO) for large scale optimization is proposed. The algorithm is fundamentally inspired by particle swarm optimization but is conceptually very different. In the proposed CSO, neither the personal best position of each particle nor the global best position (or neighborhood best positions) is involved in updating the particles. Instead, a pairwise competition mechanism is introduced, where the particle that loses the competition updates its position by learning from the winner. To understand the search behavior of the proposed CSO, a theoretical proof of convergence is provided, together with an empirical analysis of its exploration and exploitation abilities, showing that the proposed CSO achieves a good balance between exploration and exploitation. Despite its algorithmic simplicity, our empirical results demonstrate that the proposed CSO exhibits better overall performance than five state-of-the-art metaheuristic algorithms on a set of widely used large scale optimization problems and is able to effectively solve problems of dimensionality up to 5000.

Index Terms—Competition, competitive swarm optimizer, convergence analysis, large scale optimization, learning, particle swarm optimization.

Manuscript received July 21, 2013; revised December 31, 2013 and April 4, 2014; accepted April 5, 2014. This work was supported in part by the Honda Research Institute Europe. This paper was recommended by Associate Editor Q. Zhao. The authors are with the Department of Computing, University of Surrey, Guildford GU2 7XH, U.K. (e-mail: r.cheng@surrey.ac.uk; yaochu.jin@surrey.ac.uk). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TCYB.2014.2322602

I. INTRODUCTION

PARTICLE swarm optimization (PSO) is a powerful and widely used swarm intelligence paradigm [1] introduced by Kennedy and Eberhart in 1995 [2] for solving optimization problems. The algorithm is based on a simple mechanism that mimics the swarm behaviors of social animals such as bird flocking. Due to its simplicity in implementation, PSO has witnessed a rapid increase in popularity over the past decades.

PSO contains a swarm of particles, each of which has a position and a velocity in an n-dimensional search space, representing a candidate solution of the optimization problem to be solved. To locate the global optimum, the velocity and position of each particle are updated iteratively using the following equations:

Vi(t + 1) = ωVi(t) + c1 R1(t)(pbesti(t) − Xi(t)) + c2 R2(t)(gbest(t) − Xi(t))    (1)

Xi(t + 1) = Xi(t) + Vi(t + 1)    (2)

where t is the iteration (generation) number; Vi(t) and Xi(t) represent the velocity and position of the i-th particle, respectively; ω is termed the inertia weight [3]; c1 and c2 are the acceleration coefficients [4]; R1(t) and R2(t) are two vectors randomly generated within [0, 1]^n; pbesti(t) and gbest(t) are the best solution of the i-th particle found so far, often known as the personal best, and the best solution found by all particles so far, known as the global best, respectively. Kennedy [5] has referred to c1 R1(t)(pbesti(t) − Xi(t)) and c2 R2(t)(gbest(t) − Xi(t)) as the cognitive component and the social component, respectively.

Due to its conceptual simplicity and high search efficiency, PSO has attracted much research interest over the past decades and has been successfully applied to a number of applications, such as water distribution network design [6], parameter optimization in suspension systems [7], resource allocation [8], task assignment [9], DNA sequence compression [10], and many others [11]. However, it has been found that PSO performs poorly when the optimization problem has a large number of local optima or is high-dimensional [12]. These weaknesses can usually be attributed to the premature convergence that often occurs in PSO [13].

As a typical population-based optimization technique, convergence speed and global search ability are two critical criteria for the performance of PSO algorithms. In order to alleviate premature convergence by achieving a good balance between fast convergence and global search ability, a number of PSO variants have been suggested, which can be largely classified into the following four categories [13].

1) Adaptation of the control parameters. ω, c1, and c2 are the three control parameters in the canonical PSO, as shown in (1). ω, termed the inertia weight, was first proposed by Shi and Eberhart [14] to balance between global search and local refinement. The inertia weight ω was further modified by linearly decreasing it from 0.9 to 0.4 over the search procedure [15]. Another important modification of ω is the introduction of fuzzy inference [16]. Methods for adapting c1 and c2 (called the acceleration coefficients) have also been suggested. Time-varying acceleration coefficients were first introduced by Ratnaweera et al. [17]. Similarly, a time-varying PSO based on a novel operator was introduced in [18]. Most recently, a multiple parameter control mechanism has been introduced to adaptively change all three parameters in [19].
2168-2267 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
2) Modifications in topological structures. The motivation for introducing topological structures in PSO is to enhance the swarm diversity using neighborhood control [20], [21]. Several topological structures have been proposed [22], including the ring topology and the von Neumann topology. In [23], a fully informed PSO (FIPS) was developed, where the update of each particle is based on the positions of several neighbors. Another important example is the comprehensive learning PSO (CLPSO) introduced in [24], where particles update each dimension by learning from different local best positions. Recently, a distance-based locally informed particle swarm optimizer was proposed specifically to tackle multimodal problems in [25].

3) Hybridization with other search techniques. Since different search techniques have different strengths, it is natural to hybridize PSO with other search methods. One straightforward idea is to combine PSO with different evolutionary algorithms, such as genetic algorithms [26], differential evolution [27], and ant colony optimization [28], [29]. Another important idea is to integrate PSO with local search techniques [30], [31]. In addition, various search operators based on sociological or biological concepts have also been proposed, such as the niching PSO [32], the cultural-based PSO [33], and the aging theory inspired PSO (ALC-PSO) [13]. Other hybrid PSO variants include PSO with Gaussian mutation [34], PSO with chaos [35], orthogonal learning PSO (OLPSO) [36], PSO with a moderate-random-search strategy [37], and a very recently proposed PSO with a periodic mutation strategy and neural networks [38].

4) Multiswarm PSO. One early work on multiswarm PSO was reported in [39], where the sub-swarms cooperate to solve large scale optimization problems. In [30], a dynamic multiswarm PSO (DMS-PSO) algorithm is proposed to dynamically change the neighborhood structure for a higher degree of swarm diversity, even with a very small swarm size. More multiswarm PSO variants can be found in [40] and [41].

Since most PSO variants introduce new mechanisms or additional operators, the enhancement of search performance often comes at the cost of increased computational complexity. Furthermore, due to the strong influence of the global best position gbest on the convergence speed [39], premature convergence remains a major issue in most existing PSO variants. To take a closer look at the influence of gbest on premature convergence, we rewrite (1) as follows:

Vi(t + 1) = ωVi(t) + θ1(p1 − Xi(t))    (3)

where

θ1 = c1 R1(t) + c2 R2(t),
p1 = [c1 R1(t) / (c1 R1(t) + c2 R2(t))] pbesti(t) + [c2 R2(t) / (c1 R1(t) + c2 R2(t))] gbest(t).    (4)

From (3), it can be noticed that the difference between p1 and Xi serves as the main source of diversity. More precisely, the diversity of p1 itself is generated by the difference between pbesti and gbest; refer to (4). However, in practice, due to the global influence of gbest, pbesti is very likely to have a value similar to or even the same as gbest, thus considerably reducing the swarm diversity. In other words, as a source of diversity, p1 largely determines how well PSO is able to balance exploration and exploitation in the search procedure. Having noticed this, Mendes et al. [23] proposed a modified PSO, where p1 is the best particle in the neighborhood of a particle rather than a combination of gbest and pbesti. Out of similar considerations, Liang et al. [24] also introduced a PSO variant without gbest, where the update strategy aims to learn from pbesti only.

In order to address premature convergence, a step further might be to completely get rid of gbest and pbesti. An attempt was made along this line in a multiswarm framework [42], where neither gbest nor pbesti has been used. In this multiswarm framework, the update of particles is driven by a pairwise competition mechanism between particles from the two swarms. After each competition, the loser is updated according to the information from the winner swarm, while the winner is updated using a mutation strategy. In the experiments, the framework showed promising performance on relatively high-dimensional problems.

Following the idea in [42], in this paper, we explore the use of the competition mechanism between particles within one single swarm. In addition, the particle that loses a competition learns from the winner particle instead of from gbest or pbesti. Since the main driving force behind this idea is the pairwise competition mechanism between different particles, and neither gbest nor pbesti is involved in the search process, we term the proposed algorithm competitive swarm optimizer (CSO) to avoid ambiguity. CSO distinguishes itself from the canonical PSO mainly in the following aspects.

1) In the canonical PSO, the dynamic system is driven mostly by the global best position gbest and the individual best position pbesti, whilst in CSO, there is no gbest or pbesti. Instead, the dynamic system is driven by a random competition mechanism, where any particle could be a potential leader.

2) In PSO, the historical best positions are recorded, whilst in CSO, there is no memory used to store the historical positions. By contrast, the particles that lose a competition learn from the winners in the current swarm only.

The rest of this paper is organized as follows. Section II presents the details of CSO, followed by an empirical analysis of search behaviors and a theoretical convergence proof in Section III. Section IV first presents some statistical results that compare CSO with a few state-of-the-art algorithms on the CEC'08 benchmark functions of dimensionality up to 1000 [43]. Empirical investigations on the influence of the parameter settings are also conducted. The search ability of CSO has been further challenged with test functions of 2000 and 5000 dimensions, which, to the best of our knowledge, are the highest dimensions that have ever been reported in the evolutionary optimization literature. Finally,
CHENG AND JIN: COMPETITIVE SWARM OPTIMIZER FOR LARGE SCALE OPTIMIZATION 3
Fig. 3. Illustration of a situation where a search is performed in a multimodal space using a PSO variant without gbest. In this case, particle l will fly over the global optimum region due to the attraction of pbestw.

Fig. 4. Illustration of a situation where a search is performed in a multimodal space using a PSO variant without gbest(t). In this case, the pbestw(t) of particle w(t) is located in a local optimum region, serving as a local gbest. As a result, both particles l(t) and w(t) will be attracted by pbestw(t).

Fig. 5. Illustration of a situation where a search is performed in a multimodal space using a PSO variant with neither gbest(t) nor pbest(t) (CSO). In this case, particle l(t) is only attracted by particle w(t), thereby flying over the local optimum region.

which will pull particle w(t + 1) back into the local optimum region again. In this situation, pbestw(t) has played the role of a local gbest, although no gbest(t) is adopted. In comparison, both the situations shown in Figs. 2 and 4 can be avoided by CSO, because both gbest(t) and pbest(t) are removed; refer to Fig. 5.

2) Exploitation: Exploitation is required in the later search stage to refine the solutions found at the exploration stage. To analyze the exploitation behavior of CSO, we randomly pick two particles w and l from the swarm, for which the following relationship holds:

f(Xw(t)) ≤ f(Xl(t)).    (10)

According to the definitions of gbest and pbest in the canonical PSO, the following relationships can be obtained:

f(gbest(t)) ≤ f(pbestw(t)) ≤ f(Xw(t)),
f(gbest(t)) ≤ f(pbestl(t)) ≤ f(Xl(t)).    (11)

When t becomes very large in the late search stage, the following relationship between pbestw(t), pbestl(t), and gbest(t) holds:

pbestw(t) ≈ gbest(t),
pbestl(t) ≈ gbest(t).    (12)

Let

F1(t) = |f(Xl(t)) − f(gbest(t))|
      = |f(Xl(t)) − f((gbest(t) + gbest(t)) / 2)|
      ≈ |f(Xl(t)) − f((gbest(t) + pbestl(t)) / 2)|    (13)
      = |f(Xl(t)) − f(p̄1)|,

F2(t) = |f(Xl(t)) − f(Xw(t))|
      = |f(Xl(t)) − f(p̄2)|,

where p̄1 is the expected value of p1 in (4) and p̄2 is the expected value of p2 in (9) with ϕ = 0. Then the following relationship can be obtained from (10), (11), and (12):

F2(t) ≤ F1(t).    (14)

The relationship in (14) indicates that CSO, in comparison with the canonical PSO, has a better ability to exploit the small gaps between two positions whose fitness values are very similar.

B. Theoretical Convergence Proof

Similar to most theoretical convergence analyses of PSO [49]–[51], a deterministic implementation of CSO is considered to theoretically analyze its convergence property. It should also be pointed out that the proof does not guarantee convergence to the global optimum.

For any particle i in P(t), there are two possible behaviors after it participates in a competition:
1) Xi(t + 1) = Xi(t), if Xi(t) is a winner;
2) Xi(t + 1) is updated using (6) and (7), if Xi(t) is a loser.
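These two behaviors can be sketched as one CSO generation. Since the update rules (6) and (7) of Section II are not reproduced in this excerpt, the loser update below is reconstructed from their deterministic form in (15); the swarm size, the test function, and all names are illustrative assumptions, not the paper's reference implementation.

```python
import numpy as np

def cso_iteration(X, V, f, phi=0.1, rng=None):
    """One CSO generation: the swarm is shuffled into random pairs; each
    pairwise loser learns from its winner and from the swarm mean (loser
    update reconstructed from the deterministic form in (15)); winners
    enter the next generation unchanged."""
    rng = rng or np.random.default_rng()
    m, n = X.shape
    Xbar = X.mean(axis=0)                  # mean position of the swarm
    idx = rng.permutation(m)
    for a, b in zip(idx[::2], idx[1::2]):  # m/2 random pairwise competitions
        w, l = (a, b) if f(X[a]) <= f(X[b]) else (b, a)
        R1, R2, R3 = rng.random((3, n))
        V[l] = R1 * V[l] + R2 * (X[w] - X[l]) + phi * R3 * (Xbar - X[l])
        X[l] = X[l] + V[l]
    return X, V

# Illustration on the sphere function (settings assumed for the example).
rng = np.random.default_rng(42)
X = rng.uniform(-5.0, 5.0, (100, 10))
V = np.zeros((100, 10))
sphere = lambda x: float(np.sum(x * x))
best_before = min(sphere(x) for x in X)
for _ in range(300):
    X, V = cso_iteration(X, V, sphere, rng=rng)
best_after = min(sphere(x) for x in X)
# The best particle always wins its pairwise competition, so the best
# fitness in the swarm is non-increasing over generations.
```

Note that only half of the swarm is updated in each generation, which is part of what keeps the algorithm simple and cheap.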
In case Xi(t) is a winner, the particle will not be updated. Therefore, we only need to consider the case when Xi(t) is a loser and is then updated. Without loss of generality, we can rewrite (6) and (7) by considering a one-dimensional deterministic case:

Vi(t + 1) = (1/2) Vi(t) + (1/2)(Xw(t) − Xi(t)) + (1/2) ϕ (X̄(t) − Xi(t)),
Xi(t + 1) = Xi(t) + Vi(t + 1),    (15)

where 1/2 is the expected value of R1, R2, and R3, and Xw(t) is the position of the winner in the competition with the i-th particle.

Theorem 1: For any given ϕ ≥ 0, the dynamic system described by (15) will converge to an equilibrium.

Proof: Let

θ = (1 + ϕ) / 2,
p = (1 / (1 + ϕ)) Xw(t) + (ϕ / (1 + ϕ)) X̄(t),    (16)

then (15) can be simplified to

Vi(t + 1) = (1/2) Vi(t) + θ (p − Xi(t)),
Xi(t + 1) = Xi(t) + Vi(t + 1).    (17)

The search dynamics described in (17) can be seen as a dynamical system, and the convergence analysis of the system can be conducted using the well-established theory of stability analysis in dynamical systems. To this end, we rewrite system (17) in the following form:

y(t + 1) = A y(t) + B p    (18)

where

y(t) = [Vi(t); Xi(t)],  A = [1/2, −θ; 1/2, 1 − θ],  B = [θ; θ],    (19)

A is called the state matrix in dynamical system theory, p is called the external input that drives the particle to a specific position, and B is called the input matrix that controls the external effect on the dynamics of the particle.

If there exists an equilibrium y* that satisfies y*(t + 1) = y*(t) for any t, it can be calculated from (18) and (19):

y* = [0; p],    (20)

which means that the particles will finally stabilize at the same position, provided that p is constant, i.e., an optimum (local or global) has been found, so that no more updates of p will happen.

Convergence means that the particles will eventually settle down at the equilibrium point y*. From dynamical system theory, we know that the convergence property depends on the eigenvalues of the state matrix A, which are the roots of

λ² − (3/2 − θ) λ + 1/2 = 0,    (21)

namely

λ1 = 3/4 − θ/2 + sqrt((3/2 − θ)² − 2) / 2,
λ2 = 3/4 − θ/2 − sqrt((3/2 − θ)² − 2) / 2.    (22)

The necessary and sufficient condition for convergence, i.e., for the equilibrium point to be a stable attractor, is that |λ1| < 1 and |λ2| < 1, leading to the result

θ > 0.    (23)

If θ is substituted by ϕ using (16), the condition for convergence in terms of ϕ is

ϕ > −1.    (24)

Therefore, ϕ ≥ 0 is a sufficient condition for the convergence of the system.

From Theorem 1, we can conclude that the algorithm will converge to an equilibrium regardless of the exact value of ϕ, as long as ϕ > −1. In this paper, only nonnegative values ϕ ≥ 0 will be adopted.

IV. EXPERIMENTAL STUDIES

In this section, we perform a set of experiments on the seven benchmark functions proposed in the CEC'08 special session on large scale optimization problems [43]. Among the seven functions, f1 (Shifted Sphere), f4 (Shifted Rastrigin), and f6 (Shifted Ackley) are separable functions, while the other four functions, f2 (Schwefel Problem), f3 (Shifted Rosenbrock), f5 (Shifted Griewank), and f7 (Fast Fractal), are nonseparable. Note that f5 becomes more separable as the dimension increases, because the product component of f5 becomes increasingly less significant [52] with an increasing dimension. Therefore, in the following experiments, if the dimension is equal to or higher than 500, f5 will be regarded as a separable function.

At first, experiments are conducted to empirically understand the influence of the two parameters in CSO, namely, the swarm size m and the social factor ϕ. Then, CSO is compared with a few recently proposed algorithms for large scale optimization on 100-D, 500-D, and 1000-D benchmark functions. Afterwards, regarding the scalability to search dimensions, CSO is further challenged on 2000-D and 5000-D functions. Finally, the influence of neighborhood control on CSO's swarm diversity and search performance is investigated.

The experiments are implemented on a PC with an Intel Core i5-2500 3.3 GHz CPU and the Microsoft Windows 7 Enterprise SP1 64-bit operating system, and CSO is written in C using Microsoft Visual Studio 2010 Enterprise.

All statistical results, unless otherwise specified, are averaged over 25 independent runs. For each independent run, the maximum number of FEs, as recommended in [43], is set to 5000 ∗ n, where n is the search dimension of the test functions. In the comparisons between different statistical results, two-tailed t-tests are conducted at a significance level of α = 0.05.
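The stability argument in Theorem 1 can also be checked numerically for a concrete setting. The sketch below is an illustration rather than part of the paper: it takes ϕ = 0 (so θ = 1/2 and p = Xw by (16)), verifies that both eigenvalues of the state matrix A in (19) lie inside the unit circle, and iterates (18) until the state settles at the equilibrium y* = [0; p] of (20).

```python
import numpy as np

theta, p = 0.5, 1.0                    # phi = 0 gives theta = (1 + 0)/2
A = np.array([[0.5, -theta],
              [0.5, 1.0 - theta]])     # state matrix of (19)
B = np.array([theta, theta])           # input matrix of (19)

# Eigenvalues of A are the roots of lambda^2 - (3/2 - theta)*lambda + 1/2 = 0,
# i.e., (21); for this theta both have modulus sqrt(1/2) < 1.
eigs = np.linalg.eigvals(A)
assert np.all(np.abs(eigs) < 1)        # the equilibrium is a stable attractor

# Iterate y(t+1) = A y(t) + B p, (18), from an arbitrary initial state.
y = np.array([0.8, -2.0])              # [velocity, position]
for _ in range(200):
    y = A @ y + B * p
# y approaches the equilibrium y* = [0, p] of (20): zero velocity, position p.
```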
TABLE I
STATISTICAL RESULTS (MEAN VALUES AND STANDARD DEVIATIONS) OF OPTIMIZATION ERRORS OBTAINED BY CSO ON TWO NONSEPARABLE FUNCTIONS f2, f3, AND TWO SEPARABLE FUNCTIONS f1, f6 OF 500 DIMENSIONS, WITH THE SWARM SIZE m VARYING FROM 200 TO 1000 AND ϕ FROM 0 TO 0.3
TABLE IV
STATISTICAL RESULTS (FIRST LINE) AND THE t VALUES (SECOND LINE) OF OPTIMIZATION ERRORS ON 100-D TEST FUNCTIONS

TABLE V
STATISTICAL RESULTS (FIRST LINE) AND THE t VALUES (SECOND LINE) OF OPTIMIZATION ERRORS ON 500-D TEST FUNCTIONS

TABLE VI
STATISTICAL RESULTS (FIRST LINE) AND THE t VALUES (SECOND LINE) OF OPTIMIZATION ERRORS ON 1000-D TEST FUNCTIONS
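The t values reported in these tables come from two-tailed t-tests at α = 0.05 on the per-run errors, as described in the experimental setup. A minimal sketch of such a comparison between two 25-run error samples follows; the data are synthetic and the function name is an assumption, purely for illustration.

```python
import numpy as np

def two_tailed_t(errors_a, errors_b):
    """Student's t statistic for two independent samples of equal size,
    as used to compare the 25-run error samples of two algorithms."""
    a, b = np.asarray(errors_a), np.asarray(errors_b)
    n = len(a)
    var_a = a.var(ddof=1)
    var_b = b.var(ddof=1)
    return (a.mean() - b.mean()) / np.sqrt((var_a + var_b) / n)

rng = np.random.default_rng(1)
runs_a = rng.normal(1.0, 0.1, 25)   # synthetic 25-run errors, algorithm A
runs_b = rng.normal(2.0, 0.1, 25)   # synthetic 25-run errors, algorithm B
t = two_tailed_t(runs_a, runs_b)
# |t| larger than roughly 2.01 (the 0.975 quantile of t with 48 degrees of
# freedom) indicates a significant difference at alpha = 0.05; a negative
# t here means algorithm A attains the smaller mean error.
```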
In addition, the convergence profiles of one typical separable function (f1) and one typical nonseparable function (f5) are plotted in Fig. 8. It can be seen that, although the convergence speed of the proposed CSO is not as fast as that of CCPSO2 or MLCC at the very beginning, it is able to converge more consistently and to continuously improve the solution quality.

C. Scalability to Higher Dimensionality

From the statistical results of the optimization errors summarized in Tables IV–VI, it can be noticed that CSO has shown very good scalability to the search dimension, i.e., its performance does not deteriorate seriously as the dimension increases.

To further examine the search ability of CSO on functions of even higher dimensions, e.g., 2000-D or even 5000-D, additional experiments have been performed on f1 to f6 of 2000 and 5000 dimensions. f7 is excluded from this experiment because its global optimum is dimension dependent, and thus it is not easy to evaluate scalability.

It must be stressed that the optimization of 2000- and 5000-dimensional problems is very challenging for CSO, since it has not adopted any particular strategies tailored for large scale optimization, e.g., the divide-and-conquer strategy. Furthermore, to the best of our knowledge, optimization
on the 2000-D and 5000-D test problems are much better than
those obtained by the compared algorithms. Meanwhile, CSO
shows similar scalability to CCPSO2 and MLCC on f3 and f6 .
Fig. 9. Statistical results of optimization errors obtained by CSO, CCPSO2, MLCC, sep-CMA-ES, EPUS-PSO, and DMS-PSO on 100-D, 500-D, 1000-D f1
to f6 , together with the statistical results of optimization errors obtained by CCPSO2, sep-CMA-ES on 2000-D f1 , f3 , and the statistical results of optimization
errors obtained by CSO on 2000-D, 5000-D f1 to f6 . Note that due to the logarithmic scale used in the plots, errors of 0 cannot be shown.
TABLE X
STATISTICAL RESULTS OF OPTIMIZATION ERRORS OBTAINED BY CSO-N AND CSO ON 1000-D FUNCTIONS

Fig. 10. Swarm diversity profiles during 500 000 fitness evaluations (FEs) of CSO with neighborhood control (denoted as CSO-n) and the original CSO on a 500-D separable function f1 and a 500-D nonseparable function f3, respectively.

TABLE IX
STATISTICAL RESULTS OF OPTIMIZATION ERRORS OBTAINED BY CSO-N AND CSO ON 500-D FUNCTIONS

In the first set of experiments, the same parameter settings as in the original CSO have been used for CSO-n; refer to Table III. The experimental results shown in Tables IX and X indicate that CSO-n is outperformed by the original CSO. As discussed above, neighborhood control is expected to generate a higher degree of swarm diversity, and the experimental results in Fig. 10 have empirically confirmed this expectation. Therefore, one possible reason for the inferior performance of CSO using neighborhood control is that, for these test functions, the diversity of the global CSO is already sufficient, and additional diversity may slow down the convergence.

In the global version of CSO, the main source of swarm diversity comes from the random pairwise competitions, where the swarm size can be an important factor in determining the amount of diversity. More specifically, a bigger swarm size is able to provide more combinations of random pairwise competitions, thus generating a higher degree of swarm diversity, and vice versa. Following this line of thought, the performance of CSO-n may be improved by reducing the swarm size. Therefore, a second set of experiments has been conducted using a different parameter setup, where the swarm size is set to m = 150 for 500-D functions and m = 200 for 1000-D functions, respectively.
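The notion of swarm diversity discussed above can be made concrete. The exact metric behind Fig. 10 is not reproduced in this excerpt, so the sketch below uses one common definition, the mean Euclidean distance of the particles to the swarm centroid, purely as an illustration; the function name and the data are assumptions.

```python
import numpy as np

def swarm_diversity(X):
    """Mean Euclidean distance of the particles to the swarm centroid.

    One common diversity measure for a swarm X of shape (m, n); a swarm
    spread over the search space scores high, a collapsed swarm near zero.
    """
    centroid = X.mean(axis=0)
    return float(np.linalg.norm(X - centroid, axis=1).mean())

rng = np.random.default_rng(0)
spread_out = rng.uniform(-100, 100, (200, 500))                   # diverse swarm
collapsed = np.ones((200, 500)) + 1e-6 * rng.standard_normal((200, 500))
assert swarm_diversity(spread_out) > swarm_diversity(collapsed)
```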
TABLE XI
STATISTICAL RESULTS OF OPTIMIZATION ERRORS OBTAINED BY CSO-N AND CSO ON 500-D FUNCTIONS

TABLE XII
STATISTICAL RESULTS OF OPTIMIZATION ERRORS OBTAINED BY CSO-N AND CSO ON 1000-D FUNCTIONS

As shown by the statistical results of the optimization errors in Tables XI and XII, after reducing the swarm size, CSO-n is able to outperform the global CSO on most test functions studied in this paper, except for f7. Interestingly, the performance of CSO on f7 is always better than that of CSO-n. As described in [43], f7 is a very special function which has a large amount of random noise and whose global optimum is unknown. One possible reason is that f7, as a noisy function, is very sensitive to the swarm diversity, and the original CSO, which maintains less swarm diversity, is able to achieve better performance on it.

To summarize, since the random pairwise competitions are able to generate a sufficient amount of diversity in the swarm, the original CSO can work properly without using neighborhood control, if a relatively large swarm size is used for large scale optimization problems. However, the use of neighborhood control, which is able to further enhance diversity, can enable us to use a smaller swarm size even for large scale problems, which is very attractive in practice.

V. CONCLUSION

In this paper, we have introduced a new swarm algorithm termed CSO. The algorithm is based on a pairwise competition mechanism and adopts a novel update strategy, where neither gbest nor pbest is used. A theoretical proof of convergence and an empirical analysis of the search dynamics are given to understand the search mechanisms of CSO. Despite its simplicity in algorithmic implementation, CSO has been shown to perform surprisingly well on large scale optimization problems, outperforming many state-of-the-art metaheuristics tailored for large scale optimization. Our comparative studies conducted on 100-D, 500-D, and 1000-D CEC'08 benchmark problems demonstrate that CSO performs consistently well on those test functions, especially in the high dimensional cases. The performance of CSO has been further demonstrated on 2000-D and 5000-D functions, and the experimental results show that CSO has reasonably good scalability on these extremely high-dimensional problems. In addition, we have empirically investigated the influence of neighborhood control on the swarm diversity and search performance of CSO, which suggests that neighborhood control can enhance diversity and therefore enable us to use a smaller swarm size for large scale optimization problems.

In the future, we will investigate the application of CSO to solving other challenging optimization problems, such as multiobjective problems [64] and many-objective problems [65]. Application of CSO to complex real-world problems is another important future work.

REFERENCES

[1] J. Kennedy, Swarm Intelligence. Seacaucus, NJ, USA: Springer, 2006.
[2] J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proc. IEEE Int. Conf. Neural Netw., vol. 4. Perth, WA, USA, 1995, pp. 1942–1948.
[3] Y. Shi and R. Eberhart, "A modified particle swarm optimizer," in Proc. IEEE Int. Conf. Evol. Comput., Anchorage, AK, USA, 1998, pp. 69–73.
[4] R. Eberhart and J. Kennedy, "A new optimizer using particle swarm theory," in Proc. Int. Symp. MHS, Nagoya, Japan, 1995, pp. 39–43.
[5] J. Kennedy, "The particle swarm: Social adaptation of knowledge," in Proc. IEEE Int. Conf. Evol. Comput., Indianapolis, IN, USA, 1997, pp. 303–308.
[6] I. Montalvo, J. Izquierdo, R. Pérez, and P. L. Iglesias, "A diversity-enriched variant of discrete PSO applied to the design of water distribution networks," Eng. Optim., vol. 40, no. 7, pp. 655–668, 2008.
[7] A. Alfi and M.-M. Fateh, "Parameter identification based on a modified PSO applied to suspension system," J. Softw. Eng. Appl., vol. 3, no. 3, pp. 221–229, 2010.
[8] Y.-J. Gong et al., "An efficient resource allocation scheme using particle swarm optimization," IEEE Trans. Evol. Comput., vol. 16, no. 6, pp. 801–816, Dec. 2012.
[9] S.-Y. Ho, H.-S. Lin, W.-H. Liauh, and S.-J. Ho, "OPSO: Orthogonal particle swarm optimization and its application to task assignment problems," IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 288–298, Mar. 2008.
[10] Z. Zhu, J. Zhou, Z. Ji, and Y.-H. Shi, "DNA sequence compression using adaptive particle swarm optimization-based memetic algorithm," IEEE Trans. Evol. Comput., vol. 15, no. 5, pp. 643–658, Oct. 2011.
[11] R. C. Eberhart and Y. Shi, "Particle swarm optimization: Developments, applications and resources," in Proc. IEEE Congr. Evol. Comput., vol. 1. Seoul, South Korea, 2001, pp. 81–86.
[12] Y. Yang and J. O. Pedersen, "A comparative study on feature selection in text categorization," in Proc. ICML, San Francisco, CA, USA, 1997, pp. 412–420.
[13] W.-N. Chen, J. Zhang, Y. Lin, and E. Chen, "Particle swarm optimization with an aging leader and challengers," IEEE Trans. Evol. Comput., vol. 17, no. 2, pp. 241–258, Apr. 2013.
[14] Y. Shi and R. C. Eberhart, "Parameter selection in particle swarm optimization," in Evolutionary Programming VII. Berlin, Germany: Springer, 1998, pp. 591–600.
[15] Y. Shi and R. C. Eberhart, "Empirical study of particle swarm optimization," in Proc. IEEE Congr. Evol. Comput., Washington, DC, USA, 1999, pp. 1945–1950.
[16] Y. Shi and R. C. Eberhart, "Fuzzy adaptive particle swarm optimization," in Proc. IEEE Congr. Evol. Comput., vol. 1. Seoul, South Korea, 2001, pp. 101–106.
[17] A. Ratnaweera, S. K. Halgamuge, and H. C. Watson, "Self-organizing hierarchical particle swarm optimizer with time-varying acceleration coefficients," IEEE Trans. Evol. Comput., vol. 8, no. 3, pp. 240–255, Jun. 2004.
[18] R. Cheng and M. Yao, "Particle swarm optimizer with time-varying parameters based on a novel operator," Appl. Math. Inf. Sci., vol. 5, no. 2, pp. 33–38, 2011.
[19] M. Hu, T. Wu, and J. D. Weir, "An adaptive particle swarm optimization with multiple adaptive methods," IEEE Trans. Evol. Comput., vol. 17, no. 5, pp. 705–720, Oct. 2013.
[20] P. N. Suganthan, "Particle swarm optimiser with neighbourhood operator," in Proc. IEEE Congr. Evol. Comput., vol. 3. Washington, DC, USA, 1999, pp. 1958–1962.
[21] J. Kennedy, “Small worlds and mega-minds: Effects of neighborhood topology on particle swarm performance,” in Proc. IEEE Congr. Evol. Comput., vol. 3. Washington, DC, USA, 1999, pp. 1931–1938.
[22] J. Kennedy and R. Mendes, “Population structure and particle swarm performance,” in Proc. IEEE Congr. Evol. Comput., vol. 2. Honolulu, HI, USA, 2002, pp. 1671–1676.
[23] R. Mendes, J. Kennedy, and J. Neves, “The fully informed particle swarm: Simpler, maybe better,” IEEE Trans. Evol. Comput., vol. 8, no. 3, pp. 204–210, Jun. 2004.
[24] J. J. Liang, A. Qin, P. N. Suganthan, and S. Baskar, “Comprehensive learning particle swarm optimizer for global optimization of multimodal functions,” IEEE Trans. Evol. Comput., vol. 10, no. 3, pp. 281–295, Jun. 2006.
[25] B. Qu, P. Suganthan, and S. Das, “A distance-based locally informed particle swarm model for multimodal optimization,” IEEE Trans. Evol. Comput., vol. 17, no. 3, pp. 387–402, Jun. 2013.
[26] J. Robinson, S. Sinton, and Y. Rahmat-Samii, “Particle swarm, genetic algorithm, and their hybrids: Optimization of a profiled corrugated horn antenna,” in Proc. IEEE Antennas Propag. Soc. Int. Symp., vol. 1. 2002, pp. 314–317.
[27] C.-F. Juang, “A hybrid of genetic algorithm and particle swarm optimization for recurrent network design,” IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 2, pp. 997–1006, Apr. 2004.
[28] N. Holden and A. A. Freitas, “A hybrid particle swarm/ant colony algorithm for the classification of hierarchical biological data,” in Proc. IEEE SIS, 2005, pp. 100–107.
[29] P. Shelokar, P. Siarry, V. K. Jayaraman, and B. D. Kulkarni, “Particle swarm and ant colony algorithms hybridized for improved continuous optimization,” Appl. Math. Comput., vol. 188, no. 1, pp. 129–142, 2007.
[30] J. Liang and P. Suganthan, “Dynamic multi-swarm particle swarm optimizer,” in Proc. IEEE SIS, 2005, pp. 124–129.
[31] Z.-H. Zhan, J. Zhang, Y. Li, and H.-H. Chung, “Adaptive particle swarm optimization,” IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 39, no. 6, pp. 1362–1381, Dec. 2009.
[32] R. Brits, A. P. Engelbrecht, and F. van den Bergh, “Locating multiple optima using particle swarm optimization,” Appl. Math. Comput., vol. 189, no. 2, pp. 1859–1883, 2007.
[33] M. Daneshyari and G. G. Yen, “Cultural-based multiobjective particle swarm optimization,” IEEE Trans. Cybern., vol. 41, no. 2, pp. 553–567, Apr. 2011.
[34] N. Higashi and H. Iba, “Particle swarm optimization with Gaussian mutation,” in Proc. IEEE SIS, 2003, pp. 72–79.
[35] B. Liu, L. Wang, Y.-H. Jin, F. Tang, and D.-X. Huang, “Improved particle swarm optimization combined with chaos,” Chaos, Solitons, Fractals, vol. 25, no. 5, pp. 1261–1271, 2005.
[36] Z.-H. Zhan, J. Zhang, Y. Li, and Y.-H. Shi, “Orthogonal learning particle swarm optimization,” IEEE Trans. Evol. Comput., vol. 15, no. 6, pp. 832–847, Dec. 2011.
[37] H. Gao and W. Xu, “A new particle swarm algorithm and its globally convergent modifications,” IEEE Trans. Cybern., vol. 41, no. 5, pp. 1334–1351, Oct. 2011.
[38] Y. V. Pehlivanoglu, “A new particle swarm optimization method enhanced with a periodic mutation strategy and neural networks,” IEEE Trans. Evol. Comput., vol. 17, no. 3, pp. 436–452, Jun. 2013.
[39] F. van den Bergh and A. P. Engelbrecht, “A cooperative approach to particle swarm optimization,” IEEE Trans. Evol. Comput., vol. 8, no. 3, pp. 225–239, Jun. 2004.
[40] S. Baskar and P. N. Suganthan, “A novel concurrent particle swarm optimization,” in Proc. IEEE Congr. Evol. Comput., vol. 1. 2004, pp. 792–796.
[41] G. G. Yen and M. Daneshyari, “Diversity-based information exchange among multiple swarms in particle swarm optimization,” Int. J. Comput. Intell. Appl., vol. 7, no. 1, pp. 57–75, 2008.
[42] R. Cheng, C. Sun, and Y. Jin, “A multi-swarm evolutionary framework based on a feedback mechanism,” in Proc. IEEE Congr. Evol. Comput., Cancun, Mexico, 2013, pp. 718–724.
[43] K. Tang et al., “Benchmark functions for the CEC’2008 special session and competition on large scale global optimization,” Nat. Inspired Comput. Appl. Lab., USTC, Hefei, China, Tech. Rep., 2007.
[44] Y. Jin and B. Sendhoff, “Fitness approximation in evolutionary computation—A survey,” in Proc. GECCO, 2002, pp. 1105–1112.
[45] D. Lim, Y. Jin, Y.-S. Ong, and B. Sendhoff, “Generalizing surrogate-assisted evolutionary computation,” IEEE Trans. Evol. Comput., vol. 14, no. 3, pp. 329–355, Jun. 2010.
[46] M. Dorigo and C. Blum, “Ant colony optimization theory: A survey,” Theor. Comput. Sci., vol. 344, no. 2, pp. 243–278, 2005.
[47] D. Karaboga and B. Akay, “A survey: Algorithms simulating bee swarm intelligence,” Artif. Intell. Rev., vol. 31, nos. 1–4, pp. 61–85, 2009.
[48] R. Storn and K. Price, “Differential evolution—A simple and efficient heuristic for global optimization over continuous spaces,” J. Global Optim., vol. 11, no. 4, pp. 341–359, 1997.
[49] M. Clerc and J. Kennedy, “The particle swarm-explosion, stability, and convergence in a multidimensional complex space,” IEEE Trans. Evol. Comput., vol. 6, no. 1, pp. 58–73, Feb. 2002.
[50] I. C. Trelea, “The particle swarm optimization algorithm: Convergence analysis and parameter selection,” Inf. Process. Lett., vol. 85, no. 6, pp. 317–325, 2003.
[51] J. L. Fernandez-Martinez and E. Garcia-Gonzalo, “Stochastic stability analysis of the linear continuous and discrete PSO models,” IEEE Trans. Evol. Comput., vol. 15, no. 3, pp. 405–423, Jun. 2011.
[52] X. Li and Y. Yao, “Cooperatively coevolving particle swarms for large scale optimization,” IEEE Trans. Evol. Comput., vol. 16, no. 2, pp. 1–15, Apr. 2011.
[53] Z. Yang, K. Tang, and X. Yao, “Multilevel cooperative coevolution for large scale optimization,” in Proc. IEEE Congr. Evol. Comput., Hong Kong, 2008, pp. 1663–1670.
[54] R. Ros and N. Hansen, “A simple modification in CMA-ES achieving linear time and space complexity,” in Parallel Problem Solving from Nature—PPSN X. Berlin, Germany: Springer, 2008, pp. 296–305.
[55] S.-T. Hsieh, T.-Y. Sun, C.-C. Liu, and S.-J. Tsai, “Solving large scale global optimization using improved particle swarm optimizer,” in Proc. IEEE Congr. Evol. Comput., Hong Kong, 2008, pp. 1777–1784.
[56] M. Potter and K. De Jong, “A cooperative coevolutionary approach to function optimization,” in Parallel Problem Solving from Nature—PPSN III. Berlin, Germany: Springer, 1994, pp. 249–257.
[57] Z. Yang, K. Tang, and X. Yao, “Large scale evolutionary optimization using cooperative coevolution,” Inf. Sci., vol. 178, no. 15, pp. 2985–2999, 2008.
[58] N. Hansen and A. Ostermeier, “Completely derandomized self-adaptation in evolution strategies,” Evol. Comput., vol. 9, no. 2, pp. 159–195, 2001.
[59] M. A. Montes de Oca, T. Stutzle, M. Birattari, and M. Dorigo, “Frankenstein’s PSO: A composite particle swarm optimization algorithm,” IEEE Trans. Evol. Comput., vol. 13, no. 5, pp. 1120–1132, Oct. 2009.
[60] X. Li, “Niching without niching parameters: Particle swarm optimization using a ring topology,” IEEE Trans. Evol. Comput., vol. 14, no. 1, pp. 150–169, Feb. 2010.
[61] X. Li and K. Deb, “Comparing lbest PSO niching algorithms using different position update rules,” in Proc. IEEE Congr. Evol. Comput., Barcelona, Spain, 2010, pp. 1–8.
[62] O. Olorunda and A. Engelbrecht, “Measuring exploration/exploitation in particle swarms using swarm diversity,” in Proc. IEEE Congr. Evol. Comput., Hong Kong, 2008, pp. 1128–1134.
[63] A. Ismail and A. Engelbrecht, “Measuring diversity in the cooperative particle swarm optimizer,” in Swarm Intelligence. Berlin, Germany: Springer, 2012, pp. 97–108.
[64] K. Deb, Multi-Objective Optimization Using Evolutionary Algorithms. Hoboken, NJ, USA: Wiley, 2001.
[65] H. Ishibuchi, N. Tsukamoto, and Y. Nojima, “Evolutionary many-objective optimization: A short review,” in Proc. IEEE Congr. Evol. Comput., Hong Kong, 2008, pp. 2419–2426.

Ran Cheng received the B.Eng. degree from Northeastern University, Shenyang, China, in 2010. He is currently pursuing the Ph.D. degree with the Department of Computing, University of Surrey, Guildford, U.K.
His current research interests include multi/many-objective optimization, estimation of distribution algorithms, and swarm intelligence.
Yaochu Jin (M’98–SM’02) received the B.Sc., M.Sc., and Ph.D. degrees from Zhejiang University, Hangzhou, China, in 1988, 1991, and 1996, respectively, and the Dr.-Ing. degree from Ruhr-University Bochum, Bochum, Germany, in 2001.
He is currently a Professor and a Chair of Computational Intelligence with the Department of Computing, University of Surrey, Guildford, U.K., where he is also the Head of the Nature Inspired Computing and Engineering Group. His current research interests include interdisciplinary areas that bridge the gap between computational intelligence, computational neuroscience, computational systems biology, and nature-inspired real-world driven problem-solving, ranging from big data driven evolutionary optimization to multiobjective machine learning and data analytics. His current research is funded by EC FP7, U.K. EPSRC, and industries including Santander, Aero Optimal, Bosch U.K., HR Wallingford, and Honda. He has delivered 16 invited keynote speeches at international conferences and has published 200 peer-reviewed journal and conference papers. He has been granted eight U.S., EU, and Japan patents.
Dr. Jin is an Associate Editor of the IEEE TRANSACTIONS ON CYBERNETICS, the IEEE TRANSACTIONS ON NANOBIOSCIENCE, the IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, Evolutionary Computation (MIT), BioSystems (Elsevier), and Soft Computing (Springer). He is currently an IEEE Distinguished Lecturer and Vice President for Technical Activities of the IEEE Computational Intelligence Society. He was the recipient of the Best Paper Award of the 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology. He is a fellow of the British Computer Society.