Professional Documents
Culture Documents
ar t ic l e i nf o a b s t r a c t
More than fifteen years after the beginning of the development of AutoGraphiX (AGX), a third version of
Keywords: the software is made available. Since the program was rewritten from scratch, it was the opportunity to
Variable neighbourhood search look forward and consider new avenues. From the user's point of view, the interface is completely
Graph theory changed, which allows the display of multiple information which was not possible in the previous
Extremal graphs versions. However, one of the main improvements is that it is designed to help researchers in the field of
AutoGraphiX complex networks. In these days when increasing research is applied to complex networks (such as
social networks), the use of quantities related to vertices, indicating the centrality (the importance of an
actor in the network measured as a topological indicator) naturally leads researchers toward the
mathematical study of these quantities. This new paradigm implies a complete change in the optimi-
zation algorithm that now natively handles multi objective optimization problems involving vertex-
related measures.
& 2015 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.cor.2015.12.009
0305-0548/& 2015 Elsevier Ltd. All rights reserved.
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
2 G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎
Let us illustrate with an example. The energy of a graph is [12] for more information on the system and its use. The following
defined as section recalls background information about the search for
X
n extremal graphs by the mean of the Variable Neighborhood Search
EðGÞ ¼ j λi ðGÞj ; (VNS) metaheuristic [14,17]. Then, the problem of comparing
i¼1 invariants is discussed before the extension of multi objective
where λi ðGÞ is the ith eigenvalue of the adjacency matrix of G. The optimization to vertex related values (and the difficulties it
problem represented in Fig. 1 consists in maximizing and mini- involves). Some algorithms are proposed, and then compared
mizing the energy of the graph and its complement. Calculus before the conclusion.
nodes are computed sequentially from the top to the bottom. Each
calculus node is connected to its argument by the top. Rectangular
nodes are computed directly from the graph and do not require 2. Variable neighborhood search for extremal graphs
argument. Ellipses represent operators that may either need
arguments or be arguments. Finally, diamonds, arrows or double Let G ¼ ðV; EÞ be a connected undirected graph without loops or
arrows correspond to final state calculus nodes that may either multiple edges. Let Gn be the set of all graphs on j V j ¼ n vertices,
link to the display of the graph (highlights or values on the nodes and let i : Gn -R be a graph invariant, a function whose value does
or edges) or to the optimizer (indicating a function to be mini- not rely on the labelling of the vertices. Some of the most common
mized, maximized or both sequentially). The white circle on the graph invariants are the number of edges, diameter, chromatic
top right rectangle (named ADJACENCY) indicates that this matrix number, independence number, etc. Graph invariants provide a
is computed from the complement of the graph under study. On powerful analytical tool for investigation of structures of graphs.
that example, the left sequence of calculus nodes reads as follows They, combined in convenient algebraic relations, contain global
(from the top to the bottom): ADJACENCY – EIGENVALUES – ABS – and general information about a graph.
SUM – MAXMIN. In the algebraic notation, this would mean A way to study some particular graph invariant from the
MaxMinðsumðabsðeigenvaluesðadjacencyÞÞÞÞ. The sequence on the mathematical point of view is to describe it or bound it by a
right is the same except that the adjacency matrix is computed for function of other invariants, which could be achieved through the
the complement of the graph. search for extremal graphs and their description.
The energy of the graph and its complement are both max- Without lose of generality, the problem of finding extremal
imized and minimized, which means that AGX-III will run simul- graphs could be described as the following combinatorial opti-
taneously or sequentially (depending on the number of threads) mization problem:
4 optimization problems, each of them involving a combination of
2 objectives. max iðGÞ ð1Þ
This paper focuses on the optimization module and the cap-
abilities it provides. The reader could refer to the AGX-III webpage subject to
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎ 3
▹
and Hansen [2,3,5], who proposed to study arbitrary functions of
3: Note i(G) the value associated to G. To be two invariants by applying an operator between them (for exam-
maximized ple the summation, the substraction, the product or the ratio). This
4: repeat approach implicitly assigns a coefficient 1 to each invariant, and is
5: Let G0 A tðGÞ \ Gnr so that iðG0 Þ 4 iðGÞ therefore sensitive to the scaling of the invariants [4]. A more
6: if G0 exists then, general approach was proposed by Mélot [16] with the system
7: G’G0 . GraPHedron in which all possible graphs of limited order are
8: until G0 doesn't exist.
to t
▹ G is a local optimum according considered and studied as well as the values for the invariants
under study. Based on the values of pairs of invariants for all
graphs, GraPHedron thus provides a better description of the
9: return G
relations between the considered invariants than the one pro-
posed by Aouchiche and Hansen.
It is natural to combine both approaches and replace the heavy
Algorithm 2. Perturbation in AGX.
task of the enumeration by multi objective optimization, which
1: Let G be an initial graph. leads to a description of the relations among invariants similar to
2: Let k be a positive integer. that of GraPHedron, but based upon graphs of larger order that are
3: function PERTURB(G, k) heuristically obtained. AutoGraphiX-III implements this approach
4: for step ¼ 1-k (and the capability to study vertex-related measures).
5: Randomly choose G0 A tðGÞ. In the case of multi objective optimization, the objective
6: G’G0 function is IðGÞ A Rq , a vector whose components are graph invar-
7: return G iants ij(G), j ¼ 1; …; q.
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
4 G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎
Let us now consider, without lost of generality, the multi 3.2. Example
objective optimization problem that consists in maximizing all the
components of I(G). This problem could be described as: The transmission tv of the vertex v is defined as the sum of
max IðGÞ ð5Þ geodesic (the shortest) distances from v to all other vertices of G.
The betweenness buv of the edge (u,v) is the number of shortest
subject to paths between pairs of vertices of G that involve the edge (u,v)
GA Gnr : ð6Þ [13]. From the betweenness can be computed the adjusted
betweenness centrality cv [11] closely related to Freeman's
Consider two graphs G1 ; G2 A Gnr and the vector I ¼ IðG1 Þ IðG2 Þ. betweenness centrality as follows:
X
If I A Rq þ and I a 0 (I has some non 0 components), then G1 cv ¼ buv ð10Þ
ðu;vÞ A E
dominates G2, and we write IðG1 Þ 4 IðG2 Þ.
If I has both strictly positive and strictly negative components, Studying the relation between tv and cv, in terms of invariants
or I ¼0 (all components of I are 0), then G1 and G2, are not could be achieved by finding relations between maxðt v Þ and
comparable, and we write IðG1 Þ IðG2 Þ. maxðcv Þ. Using AGX with multi objective optimization (with order
n¼ 10) provides a tentative pareto front as described on Fig. 2,
The set Gn Gnr of Pareto optimal solutions for the problem (5)– where 17 r maxðt v Þ r 45 and 49 rmaxðcv Þ r 81.
(6) is such that: On n¼ 10 vertices, the automated conjecture search engine of
AGX provides the relation:
1. 8 G2 A Gnr ⧹Gn (G1 A Gn so that IðG1 Þ 4 IðG2 Þ and 2 maxðt v Þ þ maxðcv Þ r 139;
2. 8 G1 ; G2 A Gn , IðG1 Þ IðG2 Þ.
and indicates it is respected as a equality for 9 graphs. When n ¼11,
3.1. Algorithmic issues the relation becomes:
2maxðt v Þ þ maxðcv Þ r170:
In the case of multi objective optimization, the problem is not
In both cases, the relation is satisfied as an equality by the path, for
to find an optimal solution, but a set of Pareto optimal solutions Gn .
which:
Various approaches may be considered for this problem. The first
one, which does not require any special implementation as long as nðn 1Þ
max t v ¼
the software may handle constraints, consists in optimizing an jnk2 lnm jnk lnm
objective function, while the others are expressed as constraints in max cv ¼ þ þ1 1 ;
which they are bounded by parameters. This approach was used 2 2 2 2
with earlier versions of AGX (for example [7]). which leads to Conjecture 1.
Using this technique, if the objective function to be optimized is
Conjecture 1. Let G ¼ ðV; EÞ be a connected simple graph. Let tv and
ip(G), with 1 rp rq, the problem (5)–(6) becomes:
cv respectively denote the transmission and adjusted betweenness
max ip ðGÞ ð7Þ centrality of the vertex v, then the inequality
2
subject to 3n
2 maxðt v Þ þ maxðcv Þ r n 1 ð11Þ
is Z as 8 s ¼ 1…q; s a p ð8Þ 2
holds. Furthermore, the bound is tight if and only if G is a spider (a
GA Gnr ; ð9Þ tree with only one vertex of degree larger than 2).
where as is a bound for the sth objective, used as a parameter.
Another approach to handle the multi objective optimization
issue is to adapt the VNS algorithm, as well as the local search. In 4. Multiobjective optimization for vertex related values
Algorithm 1, line 3 must be replaced by “Note i(G) the value
associated to G” and line 5 by “Let G0 A tðGÞ \ Gnr so that IðG0 Þ 4 IðGÞ ”. In some situations a global description of a graph (by the mean
Algorithm 3 should be replaced by Algorithm 4. of invariants) is not useful. For example, in the study of social
networks one could focus only on individual vertices, not the
Algorithm 4. VNS for multi objective optimization in AGX. whole graph. Instead of using graph invariants, one then would
1: Define kmax the maximum perturbation magnitude use vertex-related quantities, such as the degree d(v) (number of
2: Let G be an initial graph ▹Usually a random graph vertices adjacent to v) or the transmission tv. As it is the case for
3: Set Gn ¼ fGg the set of best known solutions invariants, vertex quantities are often better described in relation
4: Set k’1 with other quantities. The problem of multi objective optimization
5: repeat could be extended to values computed for each vertex.
6: Randomly chose G A Gn
4.1. Algorithmic issues
7: G0 ’PERTURBðG; kÞ
0
8: G0 ’LSðG0 Þ
In terms of optimization, the solution of such a problem is a set of
9: if G″ is not dominated by any graph G A Gn then vertices associated to their corresponding graph. When working on
10: Set Gn ’fG″ g [ Gn individual vertices, the objective function, i.e. the vector I(G) is no
11: Remove all dominated solutions from Gn more appropriate, but should be replaced by a set of vectors, I(v),
12: Set k’1 8 v A G. The approach that consists in solving the multi objective
13: else optimization problem by a parametric optimization of a single
14: Set k’ðk þ1Þmodðkmax Þ objective function cannot be applied anymore. If some vertices do
15: until the stopping criterion is reached not respect the constraint added for parameterization, the whole
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎ 5
Fig. 2. Representation of the Pareto front, maxðcv Þ on the x-axis and maxðt v Þ on the y-axis for n¼ 10.
graph is avoided, even if some other vertices of that same graph It seems impossible to extend or modify the definition of dom-
could be pareto optimal. The optimization would then be achieved ination of a graph by another and get an efficient local search. If the
over a subset of the realizable solutions, which is clearly suboptimal. values of objective function used in the optimization refer to vertices,
Therefore, it is mandatory to develop a proper strategy in order to then the individual vertices associated to the corresponding graph
solve this problem. An approach to handle this problem is to modify must be considered. Therefore, we cannot conclude that a graph
the definition of domination (used in Algorithm 1, line 5) and dominates another, but that it dominates a vertex of a graph. The
eventually adapt the local search algorithm. corresponding definition could be as follows:
Definition 1. Domination in the strong sense: We say that the Definition 3. A graph G2 dominates the pair ðG1 ; uÞ if (v A VðG2 Þ so
graph G2 dominates the graph G1 if (v A VðG2 Þ so that IðvÞ 4 IðuÞ that IðvÞ 4 IðuÞ.
8 u A V ðG1 Þ.
By using this definition, G2 will implicitly be associated to the
This definition is very restrictive. In the context of a local search, it vertex v, thus forming the pair ðG2 ; vÞ. So, if we use this definition
leads to poor results because it is very easy to find a graph G which instead of Definition 2, we solve the cycling problem and would
is not dominated by any G0 A tðGÞ. In that case the local search stops theoretically produce the appropriate results.
even if G is not a good solution. None of the proposed definitions is completely appropriate.
The two first would lead to inefficient algorithms from a theore-
Definition 2. Domination in the weak sense: We say that G2
tical point of view. Indeed, Algorithm 4 using the Definition 3 is
dominates G1 if ( v A VðG2 Þ such that ∄u A VðG1 Þ so that IðuÞ 4 IðvÞ.
technically adapted for the optimization in the case of vertex
According to this definition, graph G2 dominates graph G1 if there is a related values, but it involves the implementation of special data
vertex in G2 that is not dominated by any vertex of G1. However, it is structures to identify the vertex associated to each solution. Fur-
possible that a vertex v of VðG2 Þ is not dominated by any vertex of thermore, the use of such a structure allows only a single supposed
VðG1 Þ and there also exists a vertex u of VðG1 Þ that is not dominated Pareto optimal vertex after each local search, which implies a large
by any vertex of VðG2 Þ. In such case u and v are not comparable. number of local searches in order to identify a complete set of
Implementing this definition in Algorithm 1 would cause it to cycle. Pareto optimal solution.
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
6 G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎
11: 0
until G does not exist.
according to t
▹ G is a set of local optimum
As in the previous example, let us consider transmission and
adjusted centrality. These values are associated to vertices, so, the
12: return G search for extremal graphs should be replaced by a search for
Fig. 3. Representation of the Pareto front, cv on the x-axis and tv on the y-axis.
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎ 7
5. Performance comparison Fig. 4. Representation of the dominated surface with 3 solutions when both
objective functions are maximized.
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i
8 G. Caporossi / Computers & Operations Research ∎ (∎∎∎∎) ∎∎∎–∎∎∎
6. Conclusion the objective function and different values for the discriminating
invariant.
The main goal of this work was to extend the capabilities of
AGX to handle vertex related values. As stressed in Section 4.1, this
implies the use of a native multi objective optimization algorithm. Acknowledgments
Apart from its capability to handle problems involving vertex
related values, one of the main contributions of AGX-III is thus its This work was supported by NSERC (Canada) (Grant no. RGPIN-
optimization module. It turns out that the new proposed algo- 2015-04945). The author would like to thank an anonymous
rithm performs significantly better than the natural approach that referee for his useful comments.
was illustrated by Algorithm 4.
A deeper analysis of the results indicates that Algorithm 4 fails
to diversify the search and the solution often remains on the same References
portion of the Pareto front. This result is not surprising and a
similar effect was already noticed in AGX when optimizing [1] Aouchiche M, Bonnefoy J-M, Fidahoussen A, Caporossi G, Hansen P, Lacheré J,
invariants that could only take few values, such as diameter or et al. Variable neighborhood search for extremal graphs. 14. The AutoGraphiX
2 system. In: Liberti L, Maculan N, editors. Global Optimization. From Theory
maximum degree (this difficulty is known as plateau). The algo-
to Implementation. New-York: Springer Science; 2006.
rithm then fails to find any improved solution, because it does not [2] Aouchiche M, Caporossi G, Hansen P. Variable neighborhood search for
explore other solutions which are equivalent to the incumbent. In extremal graphs. 20. automated comparison of graph invariants. MATCH
the present context, this phenomena is the following: when a Commun Math Comput Chem 2007;58:365–84.
[3] Aouchiche M, Hansen P. Proximity and remoteness in graphs: Results and
graph G is found, Algorithm 4 simply ignores alternative graphs conjectures. Networks 2011;58:95–102.
G0 A tðGÞ that do not dominate G, even if some of them are not [4] Aouchiche M, Hansen P. The normalized revised szeged index. MATCH Com-
dominated by G either (they are equivalent to the incumbent). In mun Math Comput Chem 2012;67:369–81.
[5] Aouchiche M, Hansen P, Zheng M. Variable neighborhood search for extremal
Algorithm 6, those graphs are stored in the current set of Pareto graphs. 19. further conjectures and results about the randic index. MATCH
optimal solutions Gn , and their neighborhood is likely to be Commun Math Comput Chem 2007;58:83–102.
explored later. [6] Balaban AT. Distance connectivity index. Chem Phys Lett 1982;89:399–404.
[7] Caporossi G, Cvetkovic D, Gutman I, Hansen P. Variable neighborhood search
In the present paper, a VNS algorithm for attempting to find for extremal graphs. 2. Finding graphs with extremal energy. J Chem Inf
Pareto optimal solution for vertex related values objectives is Comput Sci 1999;39(6):984–96.
proposed. The problem has a special characteristic as many Pareto [8] Caporossi G, Hansen P. Variable neighborhood search for extremal graphs. 1.
The AutoGraphiX system. Discret Math 2000;212:29–44.
optimal solutions (vertices) may be associated to the same graph.
[9] Caporossi G, Hansen P. Variable neighborhood search for extremal graphs. v.
In this case, it was shown that keeping a variety of non dominated Three ways to automate finding conjectures. Discret Math 2004;276:81–94.
solutions during the local search is better than the natural [10] Caporossi G, Hansen P. A learning optimization algorithm in graph theory.
Versatile search for extremal graphs using a learning algorithm. Lecture Notes
approach which consists in performing a local search on a single
in Computer Science 2012;7219:16–30.
solution and eventually add it to the Pareto front because the [11] Caporossi G, Paiva M, Vukicević D, Segatto M. Centrality and betweenness:
classical approach cannot add more than a single new solution to Vertex and edge decomposition of the wiener index. MATCH Commun Math
the Pareto front after each local search. Comput Chem 2012;68:293–302.
[12] Caporossi, G. AutoGraphiX web page 〈https://www.gerad.ca/Gilles.Caporossi/
The new algorithm may lead to a promising approach to handle agx〉; last [accessed 25.10.2015].
the optimization problem related to plateau, as the drawbacks [13] Freeman LC. A set of measures of centrality based on betweenness. Sociometry
from Algorithm 4 are being rather well corrected by Algorithm 6. 1977;40:35–41.
[14] Hansen P, Mladenović N. Variable neighborhood search: Principles and
Therefore, the use of Algorithm 6 can be extended to the classical applications. Eur J Oper Res 2001;130:449–67.
optimization in AGX. However, the case of two non comparable [15] Majstorović S, Caporossi G. Bounds and relations involving adjusted centrality
solution is different from the case of two graphs with the same of the vertices of a tree. Graphs Comb. 2015;31:2319–34.
[16] Mélot H. Facet defining inequalities among graph invariants: the system
objective value, as in this last situation, one needs to avoid that GraPHedron. Discret Appl Math 2008;156:1875–91.
both graphs are isomorphic, otherwise the set Gn could yield a [17] Mladenović N, Hansen P. Variable neighborhood search. Comput Oper Res
huge number of isomorphic graphs that would slow down the 1997;24:1097–100.
[18] Vukicević D, Caporossi G. Network descriptors based on betweenness cen-
optimization. A way to reduce the number of these graphs in Gn is
trality and transmission and their extremal values. Discret Appl Math
to check the isomorphism, which would be computationally 2013;161:2678–86.
intensive. Instead of using an isomorphism test, it is possible to
use a discriminating invariant such as the Balaban index. A graph
would then be added to Gn only if it has the best known value for
Please cite this article as: Caporossi G. Variable Neighborhood Search for extremal vertices : The AutoGraphiX-III system. Computers
and Operations Research (2016), http://dx.doi.org/10.1016/j.cor.2015.12.009i