
MODUL PERKULIAHAN (Course Module)

TEKNIK OPTIMASI (Optimization Techniques)
ACO: Traveling Salesman Problem

Fakultas (Faculty): Teknik (Engineering)
Program Studi (Study Program): Teknik Elektro (Electrical Engineering)
Tatap Muka (Session): 06
Disusun Oleh (Prepared by): Zendi Iklima, ST., S.Kom., M.Sc.

Abstract
In this course, students learn about Swarm Intelligence (SI): its definition, its basic concepts, and how SI is used. Students are also introduced to Swarm Robotics (SR). In addition, students study the basic concepts, visualization, formulation, and application techniques of several algorithms, including Ant Colony Optimization (ACO), Bee Colony Optimization (BCO), Particle Swarm Optimization (PSO), and Bird Flocking (FoB).

Kompetensi (Competencies)
• Accuracy in describing the definition and basic concepts of Ant Colony Optimization.
• Accuracy in explaining the types of Ant Colony Optimization.
• Accuracy in giving examples of Ant Colony Optimization applications.
ACO DATA STRUCTURES

The main tasks to be considered in an ACO algorithm are the solution construction, the management of the pheromone trails, and additional techniques such as local search. In addition, the data structures and parameters need to be initialized, and some statistics about the run need to be maintained. Here we give a high-level view of the algorithm; in what follows we give some details on how to implement the different procedures of AS in an efficient way.

In the data initialization, (1) the instance has to be read; (2) the distance matrix has to be computed;
(3) the nearest-neighbor lists for all cities have to be computed; (4) the pheromone matrix and the
choice_info matrix have to be initialized; (5) the ants have to be initialized; (6) the algorithm’s
parameters must be initialized; and (7) some variables that keep track of statistical information, such
as the CPU time used, the number of iterations, or the best solution found so far, have to be initialized.
A possible organization of these tasks into several data initialization procedures is indicated in figure
3.16.
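As an illustration only, a minimal Python sketch of such a data-initialization routine is given below; the names init_data, nn_size, tau0, and choice_info follow the description above but are our own, not taken from figure 3.16.

import numpy as np

def init_data(coords, nn_size=20, tau0=1.0, alpha=1.0, beta=2.0):
    """Hypothetical data initialization for AS on the TSP (steps 2-4).
    coords: (n, 2) numpy array of city coordinates."""
    n = len(coords)
    # (2) Distance matrix (Euclidean distances between all city pairs)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    # (3) Nearest-neighbor lists: the nn_size closest cities of each city
    masked = dist.copy()
    np.fill_diagonal(masked, np.inf)            # a city is not its own neighbor
    nn_list = np.argsort(masked, axis=1)[:, :nn_size]
    # (4) Pheromone matrix and combined choice_info = tau^alpha * eta^beta
    tau = np.full((n, n), tau0)
    with np.errstate(divide="ignore"):
        eta = 1.0 / dist                        # heuristic information 1/d_ij
    np.fill_diagonal(eta, 0.0)
    choice_info = (tau ** alpha) * (eta ** beta)
    return dist, nn_list, tau, choice_info

Steps (5) to (7), initializing the ants, the algorithm's parameters, and the run statistics, are plain bookkeeping and are omitted from this sketch.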

Termination Condition

The program stops if at least one termination condition applies. Possible
termination conditions are: (1) the algorithm has found a solution within a predefined distance from
a lower bound on the optimal solution quality; (2) a maximum number of tour constructions or a
maximum number of algorithm iterations has been reached; (3) a maximum CPU time has been spent;
or (4) the algorithm shows stagnation behavior.
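A sketch of how these four conditions could be combined into one boolean test is shown below; the dictionary keys (lower_bound, max_tours, max_stagnation, and so on) are illustrative assumptions, not part of the original pseudo-code.

import time

def terminate(stats, params):
    """Illustrative termination test combining conditions (1)-(4)."""
    return (
        # (1) best solution within a predefined distance from a lower bound
        stats["best_length"] <= params["lower_bound"] * (1.0 + params["epsilon"])
        # (2) maximum number of tour constructions or iterations reached
        or stats["n_tours"] >= params["max_tours"]
        or stats["n_iterations"] >= params["max_iterations"]
        # (3) maximum CPU time spent
        or time.process_time() - stats["start_time"] >= params["max_cpu_time"]
        # (4) stagnation: no improvement for a given number of iterations
        or stats["iters_without_improvement"] >= params["max_stagnation"]
    )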

Solution Construction

The tour construction is managed by the procedure ConstructSolutions, shown in figure 3.17. The solution construction requires the following phases (a Python sketch of the whole procedure follows the numbered list).

1. First, the ants’ memory must be emptied. This is done in lines 1 to 5 of procedure ConstructSolutions
by marking all cities as unvisited, that is, by setting all the entries of the array ants.visited to false for
all the ants.

2. Second, each ant has to be assigned an initial city. One possibility is to assign each ant a random
initial city. This is accomplished in lines 6 to 11 of the procedure. The function random returns a
random number chosen according to a uniform distribution over the set {1, ..., n}.

3. Next, each ant constructs a complete tour. At each construction step (see the procedure in figure 3.17) the ants apply the AS action choice rule [equation (3.2)]. The procedure ASDecisionRule implements the action choice rule and takes as parameters the ant identifier and the current construction step index; this is discussed below in more detail.

4. Finally, in lines 18 to 21, the ants move back to the initial city and the tour length of each ant's tour is computed. Remember that, for the sake of simplicity, in the tour representation we repeat the identifier of the first city at position n + 1; this is done in line 19.
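The following is a minimal Python sketch of these four phases, assuming each ant is represented by a dictionary with a 'tour' list and a 'visited' flag array; as_decision_rule stands in for the decision rule discussed further below.

import random

def construct_solutions(ants, n, dist, as_decision_rule):
    """Sketch of the ConstructSolutions procedure (phases 1-4)."""
    # Phase 1: empty the ants' memory
    for ant in ants:
        ant["visited"] = [False] * n
    # Phase 2: assign each ant a random initial city
    for ant in ants:
        start = random.randrange(n)
        ant["tour"] = [start]
        ant["visited"][start] = True
    # Phase 3: each ant adds one city per construction step
    for step in range(1, n):
        for ant in ants:
            next_city = as_decision_rule(ant, step)
            ant["tour"].append(next_city)
            ant["visited"][next_city] = True
    # Phase 4: close the tour (repeat the first city) and compute its length
    for ant in ants:
        ant["tour"].append(ant["tour"][0])
        ant["length"] = sum(dist[ant["tour"][i]][ant["tour"][i + 1]]
                            for i in range(n))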

As stated above, the solution construction of all of the ants is synchronized in such a way that the ants
build solutions in parallel. The same behavior can be obtained, for all AS variants, by ants that
construct solutions sequentially, because the ants do not change the pheromone trails at construction
time (this is not the case for ACS, in which case the sequential and parallel implementations give
different results).

While phases (1), (2), and (4) are very straightforward to code, the implementation of the action choice
rule requires some care to avoid large computation times. In the action choice rule an ant located at
city $i$ probabilistically chooses to move to an unvisited city $j$ based on the pheromone trail $\tau_{ij}$ and the heuristic information $\eta_{ij}$ [see equation (3.2)].
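For reference, the AS action choice rule of equation (3.2) gives the probability that ant $k$, currently at city $i$, moves to a feasible (not yet visited) city $j$:

$$
p_{ij}^{k} \;=\; \frac{[\tau_{ij}]^{\alpha}\,[\eta_{ij}]^{\beta}}{\sum_{l \in \mathcal{N}_i^{k}} [\tau_{il}]^{\alpha}\,[\eta_{il}]^{\beta}}, \qquad j \in \mathcal{N}_i^{k},
$$

where $\mathcal{N}_i^{k}$ is the set of cities ant $k$ has not visited yet and, for the TSP, $\eta_{ij} = 1/d_{ij}$.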

Here we give pseudo-codes for the action choice rule with and without consideration of candidate
lists. The pseudo-code for the first variant ASDecisionRule is given in figure 3.18. The procedure works
as follows: first, the current city c of ant k is determined (line 1). The probabilistic choice of the next
city then works analogously to the roulette wheel selection procedure of evolutionary computation
(Goldberg, 1989): each value choice_info[c][j] of a city 𝑗 that ant 𝑘 has not visited yet determines a
slice on a circular roulette wheel, the size of the slice being proportional to the weight of the
associated choice (lines 2–10). Next, the wheel is spun and the city to which the marker points is
chosen as the next city for ant 𝑘 (lines 11–17). This is implemented by

1. summing the weights of the various choices in the variable sum_probabilities,

2. drawing a uniformly distributed random number $r$ from the interval [0, sum_probabilities],

3. going through the feasible choices until the partial sum is greater than or equal to $r$.

Finally, the ant is moved to the chosen city, which is marked as visited (lines 18 and 19).
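A minimal Python sketch of this roulette-wheel selection, assuming the ant data structure and the choice_info matrix introduced above, might read as follows.

import random

def as_decision_rule(ant, step, choice_info):
    """Sketch of ASDecisionRule: roulette-wheel choice of the next city."""
    c = ant["tour"][step - 1]                     # current city of ant k
    n = len(choice_info)
    # Slice sizes of the wheel: choice_info[c][j] for every unvisited city j
    weights = [0.0 if ant["visited"][j] else choice_info[c][j]
               for j in range(n)]
    sum_probabilities = sum(weights)
    # Spin the wheel: draw r in [0, sum_probabilities] and find its slice
    r = random.uniform(0.0, sum_probabilities)
    partial, chosen = 0.0, -1
    for j in range(n):
        if ant["visited"][j]:
            continue
        partial += weights[j]
        chosen = j
        if partial >= r:
            break
    return chosen

To use this function as the callback in the construct_solutions sketch above, the choice_info argument can be bound beforehand (for example with functools.partial). The sample class at the end of this module implements the same idea with numpy.random.choice instead of an explicit loop.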

These construction steps are repeated until the ants have completed a tour. Since each ant has to visit
exactly 𝑛 cities, all the ants complete the solution construction after the same number of construction
steps.

When exploiting candidate lists, the procedure ASDecisionRule needs to be adapted, resulting in the procedure NeighborListASDecisionRule, given in figure 3.19. A first change is that when choosing the next city, one needs to identify the appropriate city index from the candidate list of the current city $c$. This results in changes of lines 3 to 10 of figure 3.18: the maximum value of index $j$ is changed from $n$ to $nn$ in line 3, and the test performed in line 4 is applied to the $j$-th nearest neighbor given by nn_list[c][j]. A second change is necessary to deal with the situation in which all the cities in the candidate list have already been visited by ant $k$. In this case, the variable sum_probabilities keeps its initial value 0.0 and one city out of those not in the candidate list is chosen: the procedure ChooseBestNext (see pseudo-code in figure 3.20) is used to identify the city with maximum value of $[\tau_{ij}]^{\alpha}[\eta_{ij}]^{\beta}$ as the next one to move to.
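A sketch of the fallback procedure ChooseBestNext and of the candidate-list variant, following the description above (with nn_list and choice_info as assumed earlier, not the exact code of figures 3.19 and 3.20), might look like this.

import random

def choose_best_next(ant, step, choice_info):
    """Pick the unvisited city with the largest tau^alpha * eta^beta value."""
    c = ant["tour"][step - 1]
    best_value, next_city = -1.0, -1
    for j in range(len(choice_info)):
        if not ant["visited"][j] and choice_info[c][j] > best_value:
            best_value, next_city = choice_info[c][j], j
    return next_city

def neighbor_list_as_decision_rule(ant, step, choice_info, nn_list):
    """Roulette-wheel choice restricted to the candidate list of city c."""
    c = ant["tour"][step - 1]
    candidates = nn_list[c]                       # the nn nearest neighbors of c
    weights = [0.0 if ant["visited"][j] else choice_info[c][j]
               for j in candidates]
    sum_probabilities = sum(weights)
    if sum_probabilities == 0.0:
        # All candidate cities already visited: fall back to the best city
        return choose_best_next(ant, step, choice_info)
    r = random.uniform(0.0, sum_probabilities)
    partial, chosen = 0.0, -1
    for w, j in zip(weights, candidates):
        if w == 0.0:
            continue
        partial += w
        chosen = j
        if partial >= r:
            break
    return chosen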

Things to Remember

• The TSP was the first combinatorial optimization problem to be attacked by ACO. The first
ACO algorithm, called Ant System, achieved good performance on small TSP instances but
showed poor scaling behavior to large instances. The main role of AS was that of a ‘‘proof-of-
concept’’ that stimulated research on better-performing ACO algorithms as well as
applications to different types of problems.
• Nowadays, a large number of different ACO algorithms are available. All of these algorithms
include a strong exploitation of the best solutions found during the search, and the most
successful ones add explicit features to avoid premature stagnation of the search. The main
differences between the various AS extensions consist of the techniques used to control the
search process. Experimental results show that for the TSP, but also for other problems, these
variants achieve a much better performance than AS.
• When applying ACO algorithms to the TSP, the best performance is obtained when the ACO
algorithm uses a local optimizer to improve the solutions constructed by the ants. As we will
see in chapter 5, this is typical for the application of ACO to NP-hard optimization problems.
• When using local search, it is typically sufficient to apply a small constant number of ants to
achieve high performance, and experimental results suggest that in this case the role played
by the heuristic information becomes much less important.
• The implementation of ACO algorithms is often rather straightforward, as shown in this
chapter via the example of the implementation of AS for the TSP. Nevertheless, care should
be taken to make the code as efficient as possible.
• The implementation code for most of the ACO algorithms presented in this chapter is available
at www.aco-metaheuristic.org/aco-code/.

Sample Code (Try it yourself using Google Colab)

import random as rn
import numpy as np
from numpy.random import choice as np_choice


class AntColony(object):

    def __init__(self, distances, n_ants, n_best, n_iterations, decay,
                 alpha=1, beta=1):
        """
        Args:
            distances (2D numpy.array): Square matrix of distances.
                Diagonal is assumed to be np.inf.
            n_ants (int): Number of ants running per iteration.
            n_best (int): Number of best ants who deposit pheromone.
            n_iterations (int): Number of iterations.
            decay (float): Rate at which pheromone decays. The pheromone
                value is multiplied by decay, so 0.95 leads to slow decay,
                0.5 to much faster decay.
            alpha (int or float): Exponent on pheromone; a higher alpha
                gives pheromone more weight. Default = 1.
            beta (int or float): Exponent on distance; a higher beta gives
                distance more weight. Default = 1.
        Example:
            ant_colony = AntColony(german_distances, 100, 20, 2000, 0.95,
                                   alpha=1, beta=2)
        """
        self.distances = distances
        self.pheromone = np.ones(self.distances.shape) / len(distances)
        self.all_inds = range(len(distances))
        self.n_ants = n_ants
        self.n_best = n_best
        self.n_iterations = n_iterations
        self.decay = decay
        self.alpha = alpha
        self.beta = beta

    def run(self):
        shortest_path = None
        all_time_shortest_path = ("placeholder", np.inf)
        for i in range(self.n_iterations):
            all_paths = self.gen_all_paths()
            self.spread_pheronome(all_paths, self.n_best,
                                  shortest_path=shortest_path)
            shortest_path = min(all_paths, key=lambda x: x[1])
            print(shortest_path)
            if shortest_path[1] < all_time_shortest_path[1]:
                all_time_shortest_path = shortest_path
            # Evaporation: multiply the pheromone matrix by the decay factor
            self.pheromone = self.pheromone * self.decay
        return all_time_shortest_path

    def spread_pheronome(self, all_paths, n_best, shortest_path):
        # Only the n_best shortest tours deposit pheromone, inversely
        # proportional to the length of each arc they use
        sorted_paths = sorted(all_paths, key=lambda x: x[1])
        for path, dist in sorted_paths[:n_best]:
            for move in path:
                self.pheromone[move] += 1.0 / self.distances[move]

    def gen_path_dist(self, path):
        # Tour length: sum of the distances of all arcs in the path
        total_dist = 0
        for ele in path:
            total_dist += self.distances[ele]
        return total_dist

    def gen_all_paths(self):
        all_paths = []
        for i in range(self.n_ants):
            path = self.gen_path(0)
            all_paths.append((path, self.gen_path_dist(path)))
        return all_paths

    def gen_path(self, start):
        # Construct one tour starting (and ending) at city `start`
        path = []
        visited = set()
        visited.add(start)
        prev = start
        for i in range(len(self.distances) - 1):
            move = self.pick_move(self.pheromone[prev],
                                  self.distances[prev], visited)
            path.append((prev, move))
            prev = move
            visited.add(move)
        path.append((prev, start))  # going back to where we started
        return path

    def pick_move(self, pheromone, dist, visited):
        # Roulette-wheel selection over the unvisited cities
        pheromone = np.copy(pheromone)
        pheromone[list(visited)] = 0

        row = pheromone ** self.alpha * ((1.0 / dist) ** self.beta)

        norm_row = row / row.sum()
        move = np_choice(self.all_inds, 1, p=norm_row)[0]
        return move

To run the example:

distances = np.array([[np.inf, 2, 2, 5, 7],
                      [2, np.inf, 4, 8, 2],
                      [2, 4, np.inf, 1, 3],
                      [5, 8, 1, np.inf, 2],
                      [7, 2, 3, 2, np.inf]])

ant_colony = AntColony(distances, 1, 1, 100, 0.95, alpha=1, beta=1)
shortest_path = ant_colony.run()
print("shortest_path: {}".format(shortest_path))

