Ants find the shortest path: a mathematical proof

Swarm Intell
DOI 10.1007/s11721-013-0076-9
Ants find the shortest path: a mathematical proof
Jayadeva · Sameena Shah · Amit Bhaya · Ravi Kothari ·
Suresh Chandra
Received: 21 May 2011 / Accepted: 25 January 2013
© Springer Science+Business Media New York 2013
Abstract In the most basic application of Ant Colony Optimization (ACO), a set of artificial ants find the shortest path between a source and a destination. Ants deposit pheromone
on paths they take, preferring paths that have more pheromone on them. Since shorter paths
are traversed faster, more pheromone accumulates on them in a given time, attracting more
ants and leading to reinforcement of the pheromone trail on shorter paths. This is a positive
feedback process that can also cause trails to persist on longer paths, even when a shorter
path becomes available. To counteract this persistence on a longer path, ACO algorithms
employ remedial measures, such as using negative feedback in the form of uniform evaporation on all paths. Obtaining high performance in ACO algorithms typically requires fine
tuning several parameters that govern pheromone deposition and removal. This paper proposes a new ACO algorithm, called EigenAnt, for finding the shortest path between a source
and a destination, based on selective pheromone removal that occurs only on the path that is
actually chosen for each trip. We prove that the shortest path is the only stable equilibrium
for EigenAnt, which means that it is maintained for arbitrary initial pheromone concentraJayadeva () · S. Chandra
Department of Electrical Engineering, Indian Institute of Technology, Delhi, Hauz Khas,
New Delhi 110016, India
e-mail: [email protected]
S. Chandra
e-mail: [email protected]
S. Shah
Thomson Reuters Corp. R & D, 195, Broadway, New York 10007, USA
e-mail: [email protected]
A. Bhaya
Federal University of Rio de Janeiro, Department of Electrical Engineering, PEE/COPPE/UFRJ,
P.O. Box 68504, Rio de Janeiro, RJ 21945-970, Brazil
e-mail: [email protected]
R. Kothari
IBM India Research Lab., 4 Block C, Institutional Area, Vasant Kunj, New Delhi 110070, India
e-mail: [email protected]
Swarm Intell
tions on paths, and even when path lengths change with time. The EigenAnt algorithm uses
only two parameters and does not require them to be finely tuned. Simulations that illustrate
these properties are provided.
Keywords Ant Colony Optimization · Stability analysis · Pheromone · Collective
foraging · Stagnation · Self-organization · Selective removal of pheromone · Distributed
optimization · Optimization · Swarm intelligence
1 Introduction
Collective foraging by ants is one of the most remarkable instances of co-operative computation in the natural world. Ants deposit pheromone as they travel and can also sense
the concentration of pheromone that may already be present on a path. Amongst a set of
alternative paths that lead from the source to the destination, the path that happens to be
more heavily traveled on by ants, and therefore has a higher concentration of pheromone,
tends to be chosen by the ants that follow (Bonabeau et al. 1999; Deneubourg et al. 1990;
Dorigo 2007). The seemingly simple behavior of depositing and sensing pheromone can
lead to the emergence of a foraging trail on the shortest path between the source and the
destination.
The selection of the shortest path is remarkable considering that it is achieved without explicit communication about the merits of one path over another, since an ant that
has chosen one path has no idea of the length of any path. Moreover, the decision making is distributed, based only on local information and indirect communication, without any
centralized control. Self-organization in ants has motivated many distributed optimization
techniques. Ant algorithms are known to yield good results on many problems including the
traveling salesman problem (Abdelbar and Wunsch 2012; Dorigo and Gambardella 1997;
Dorigo et al. 1996; Meyer 2004; Stützle and Hoos 2000), vehicle routing (Reimann et al.
2002, 2003; Yildirim and Çatay 2012), scheduling (Merkle et al. 2002), and routing in
communications networks (Di Caro and Dorigo 1998a, 1998b; Ducatelle et al. 2010;
Ghazy et al. 2012; Mapisse et al. 2011) (see Dorigo and Stützle 2004 for a comprehensive
overview).
Throughout the paper, the term ant is used to mean artificial ant: an agent of ACO subject
to the assumptions that follow. Ants prefer to traverse paths that have a higher pheromone
concentration: in other words, the probability of choosing a path is, in some well defined
way, proportional to the pheromone concentration on it. Paths with more pheromone on
them are preferred by ants, leading to pheromone build-up on paths. Assuming constant
ant velocities, shorter paths are traversed in less time, causing pheromone to accumulate
more rapidly on them. Consequently, shorter paths are increasingly preferred, and eventually, almost all the ants choose the shortest path. We refer to this as a positive feedback that
reinforces shorter paths.
However, since ants have no knowledge of path lengths, positive feedback may also cause
stagnation in a situation where the shortest path changes over time. To be more explicit,
consider a situation in which a shorter path B is discovered a long time after the discovery of
the previously shortest path A. In such a case, a large amount of pheromone has already built
up on path A, and it may continue to be the preferred path (Shah et al. 2008). Quicker returns
on path B can revert some of the initial bias on the longer path A, but Shah et al. (2010)
showed that there is a theoretical limit to the initial bias on path A that can be overcome.
Above this limit, a longer path will continue to get chosen and reinforced, despite the fact
that a shorter one has been discovered subsequently. Thus, late appearance of a shorter path
Swarm Intell
will not always bring the system out of premature convergence to a longer one. We refer to
this stagnation as the lack of plasticity.
A commonly used strategy to aid in plasticity is to limit the maximum amount of
pheromone deposited on any path (Stützle and Hoos 2000). This improves the performance
of the foraging model, because the bias on any sub-optimal path does not increase above
a specified threshold, thus facilitating bias reversion. However, determining the optimal
threshold is difficult (Yuan et al. 2012) as its relation to the problem at hand, to the number
and quality of possible solutions, and to the exploration probability is not yet well understood. It is possible for several sub-optimal paths to end up with the maximum allowed
pheromone concentration, preventing convergence to the optimal path. It is therefore clear
that positive reinforcement alone may not be enough to counter the initial bias on paths, and
that there needs to be some form of negative reinforcement to aid plasticity. In this context,
negative reinforcement or negative feedback refers to a mechanism that reduces the amount
of pheromone on a path.
The most commonly adopted strategy for negative reinforcement is pheromone evaporation (Dorigo et al. 1996). It consists of decreasing the amount of pheromone on all
paths at a fixed rate. In the presence of evaporation, sustaining a pheromone trail requires continued reinforcement by way of more deposition. Evaporation is known to significantly improve the performance of ant algorithms (Dorigo and Stützle 2004). If a
shorter path is discovered at some instant, evaporation can help in overcoming the preexisting bias for an earlier established path. Once again, there is a limit to the initial
bias on sub-optimal paths that can be overcome in this manner. Evaporation does not
ensure or completely explain the selection of the optimal solution (Shah et al. 2010;
Van Vorhis Key and Baker 1982). Nevertheless, the introduction of evaporative pheromone
removal represents an important milestone in the literature, and is almost universally employed in applications (see, for example, Bandieramonte et al. 2010; Ding et al. 2003;
Dorigo and Gambardella 1997; Parpinelli et al. 2002; Zecchin et al. 2005, in addition to
those already cited above).
Negative feedback has been reported in biological studies, and arises in the form of
selective removal of pheromone in certain species (Robinson et al. 2008), deposition of
a lower amount of pheromone when a large quantity already exists (Jaffe 1980), or the
mechanical rubbing off of pheromone while walking (Dorigo and Gambardella 1997;
Jackson et al. 2006). Evaporation is also a kind of negative feedback. However, we propose
a negative feedback mechanism that reduces pheromone only on the path that is actually
chosen by each ant, and thus differs from evaporation which is a phenomenon affecting all
paths equally.
In this paper, we propose an ant colony optimization algorithm, called EigenAnt, for
finding the shortest path between a source and a destination. Analysis of EigenAnt, which
essentially deals with the dynamics of the pheromone concentrations, gives our main result.
We show that, irrespective of initial biases on paths, EigenAnt will always cause pheromone
to concentrate on the shortest path. If path lengths change, EigenAnt will cause pheromone
concentrations to change in response. In other words, EigenAnt is not only plastic enough
to shift to a subsequently discovered shorter path, but is also stable in the presence of perturbations. We show that power law variants of EigenAnt also possess the same property, but
require less pheromone to maintain a trail on the shortest path.
The remainder of this paper is organized as follows. In Sect. 2, we propose the EigenAnt
algorithm. In Sect. 3, we analyze its stability properties. Section 4 contains simulation results. Section 5 contains concluding remarks.
Swarm Intell
Fig. 1 The shortest path problem setup for the EigenAnt algorithm, showing the source and destination
nodes, the n paths connecting them, as well as the parameters and variables associated with each path. These,
at ant trip t for the ith path, are the path length Li , the pheromone concentration Cit , the probability of
choosing the ith path pit , the pheromone removal coefficient α and the deposition coefficient β. In addition,
a weight di = f (Li ) is associated with the ith path, where f is a monotonically decreasing function of Li
2 The EigenAnt algorithm
2.1 Motivation and main assumptions
We first consider the case of a graph with two nodes, a source and a destination node, which
are connected by n edges. Each of these edges is termed a path between the source and the
destination nodes, and has an associated length. In this section, we examine the application
of EigenAnt to finding the shortest path in such a graph; in other words, how EigenAnt can
select the shortest amongst n edges. In the sequel, we provide simulation results on larger
graphs, where the graph contains additional nodes, and where paths between the source and
the destination may consist of several edges that pass through intermediate nodes.
We denote the pheromone concentrations on all paths by the vector C. The ith component
of C, denoted by Ci , represents the amount of pheromone on the ith path. The number of
paths
n is denoted by n, and the total pheromone concentration is denoted by V , where V =
i=1 Ci . The length of the ith path is denoted by Li . For each relevant variable, whenever
necessary, we may use a superscript enumerating the trip at which the variable is being
considered: for example, Cit denotes the pheromone concentration on the ith path for the t th
ant trip.
Ants randomly choose a path: the probability of choosing the ith path on the t th trip is
Ct
denoted by pit , and is given by pit = V it . Observe that the probability pit also has the interpretation of normalized pheromone concentration and we will use this alternative terminology
whenever appropriate. In the EigenAnt algorithm, the pheromone on the chosen path, say
the ith, is updated after the t th trip as follows:
ΔCit = −αCit + βdi pit ,
(1)
where
:=
−
and di = f (Li ), f is a monotonically decreasing function of the
length Li . The n × n diagonal matrix which has the positive numbers di as its diagonal
entries is denoted as D. Figure 1 illustrates the notation used in the EigenAnt algorithm. In
summary:
ΔCit
Cit+1
Cit
1. Each path i = 1, 2, . . . , n is an edge, with length Li . The pheromone concentration on
the ith path at trip t is denoted by Cit .
Swarm Intell
2. Each ant chooses a path probabilistically: at trip t , the probability of choosing the ith
path is given by
pit = Cit /V t ,
where
Vt =
(2)
Cit .
(3)
i
3. Let the chosen path be the j th one. Each ant removes −αCjt pheromone and deposits
pheromone equal to βf (Lj )pjt , where f (·) is any monotonically decreasing function of
its argument.
The expected change in the pheromone concentration on the ith path at the t th epoch is
given by
(4)
ΔCit = pit −αCit + βdi pit ,
or, equivalently, using (2),
ΔCit
Cit
Cit
t
=
t −αCi + βdi
t ,
j Cj
j Cj
(5)
which can be written compactly as
1
1
ΔCt = t diag Ct −αCt + t βDCt ,
V
V
(6)
where ΔCt is a vector in Rn and diag(Ct ) is defined as the diagonal matrix having entries
C1t , . . . , Cnt on its principal diagonal.
A power law relating the probability of choosing a path and the pheromone concentration on it has also been proposed in the literature (Deneubourg et al. 1990; Sumpter and
Beekman 2003). Following this idea, a generalization of (2) is to assume that the probability
of choosing the ith path in the t th trip is given by pit =
(C t )k
i t k
j (Cj )
. Since a large difference
between the deposition and removal rates is not reasonable, the negative feedback on the
ith path is assumed to be of the form α(Cit )k . This leads to the following equation for the
expected change in pheromone concentration on the t th trip:
k
(C t )k
(C t )k
(7)
ΔCit = i t k −α Cit + βdi i t k ,
j (Cj )
j (Cj )
where k ≥ 1 is an integer. Clearly (5) is a special case of (7), for k = 1.
An important observation that emerges from the above discussion is that the initial distribution of pheromone concentrations should be such that Ci0 = 0, i = 1, . . . , n, because a
zero initial pheromone concentration, say Cj0 = 0, means that path j has zero probability of
being chosen (however, see Sect. 4.3).
The EigenAnt algorithm is summarized by the pseudocode given in Algorithm 1, in
which it is assumed that the exponent k in (7) is chosen as 1. The ith component Ji of
the path counter vector J maintains a count of the total number of ant trips on path i. We
assume that simulations are carried out for a total of Jmax trips. The pseudocode initializes
Ci0 = n1 , i = 1, . . . , n. In the sequel, we show that EigenAnt finds the shortest path regardless
of initial pheromone concentrations.
Remark The MATLAB code for EigenAnt is available from the authors on request.
Swarm Intell
Algorithm 1 EigenAnt pseudocode
1: Initialize trip counter vector J = (Ji ) = 0.
2: Initialize total ant trip counter t = 1. Choose Jmax ∈ N, the total number of ant trips.
3: Initialize path i with pheromone concentration: Cit = (1/n), i = 1, . . . , n.
4: Initialize probability of choosing the ith path: pit = (1/n), i = 1, . . . , n.
5: Initialize the weights di as 1/Li , where Li is the length of the ith path.
6: Choose the pheromone removal parameter, α and the pheromone deposition parameter, β.
7: while i Ji ≤ Jmax do
8:
Choose a path randomly in accordance with the distribution of probabilities pjt , j =
9:
10:
11:
1, . . . , n.
if the ith path is chosen then
Update the trip counter of the ith path: Ji ← Ji + 1.
Update pheromone only on path i: Cit+1 ← (1 − α)Cit + βdi pit .
13:
No changes in pheromone on paths j = i: Cjt+1 = Cjt .
end if
14:
Update path choice probabilities: pit+1 ← n
12:
Cit+1
t+1 , i = 1, . . . , n.
j =1 Cj
15:
Update total ant trip counter: t ← t + 1.
16: end while
17: Return normalized pheromone concentrations matrix (pit versus trip t) and final trip counter
vector J.
3 Analysis of the EigenAnt algorithm
This section presents the analysis of the EigenAnt algorithm, which involves studying the
equilibria of (5) and (7), as well as their stability properties.
3.1 Analysis of the EigenAnt pheromone update rule
We assume that, at steady state, the expected change in pheromone concentrations is zero:
ΔCi = 0, for each path i = 1, 2, . . . , n. This may compactly be written as
ΔC = 0.
(8)
We will refer to pheromone concentration vectors that satisfy (8) as equilibrium points of
the system (5) or (7).
In order to simplify the proof, we assume that the path lengths can be arranged in strictly
increasing order:1
L 1 < L2 < L3 < · · · < Ln ,
which, in turn, implies that the entries di of the diagonal matrix D satisfy
d 1 > d 2 > d 3 > · · · > dn .
(9)
The main theorem of this paper can now be stated as follows.
1 This implies that all paths are of different lengths. This assumption can easily be relaxed, at the cost of
slightly increased technicalities, without changing the main idea of the proof.
Swarm Intell
Theorem 1 The vectors βα di ei , i = 1, . . . , n, are all equilibrium points of the system (6),
where ei denotes the ith canonical vector in Rn . The equilibrium point ( βα d1 )e1 of (6) is
stable, and all the other equilibrium points βα di ei , i = 1 are unstable.
The proof is divided into three parts:
1. For equilibrium point analysis, it is shown that the power law rule given in (7) for k ≥ 1
can be reduced to the case k = 1, which is (5). Thus only (5), or equivalently, (6) needs
to be analyzed.
2. The equilibrium points for (6) are calculated and shown to be the vectors ( βα di )ei , i =
1, . . . , n.
3. We show that the equilibrium point ( βα d1 )e1 is stable. We do so by showing that perturbations from this equilibrium state cause the system to revert to it. Furthermore, all
other equilibrium points βα di ei , i = 1 are unstable, in the sense that there always exist
perturbations that cause divergence from them.
Proof [Part 1: Reduction of k ≥ 1 to k = 1] Let the concentrations at equilibrium be denoted
by Ci∗ , i = 1, . . . , n. Then, for either (5) or (7), the equilibrium points are found from (8).
Specifically, for (7), the equilibrium points are given by
k
(C ∗ )k
α Ci∗ = βdi i ∗ k .
j (Cj )
(10)
Notice that if (Ci∗ )k is replaced by C̄i∗ in (10), then it becomes exactly the equilibrium
point equation for (6) or (5). Thus, from this point onwards, the analysis is carried out for
the case k = 1.
[Part 2: Calculation of the equilibria of (6)] For this part of the proof, we write V = j Cj
as 1T C, in order to emphasize dependence on the vector C, where 1 ∈ Rn is the vector with
all n components equal to 1. Then, for k = 1, the equilibrium point equation (8) corresponding to (6) can be written in vector form as:
αC =
β
DC,
1T C
DC =
α T 1 C C.
β
which, in turn, can be written as
(11)
If we define the scalar λ := βα (1T C), then (11) becomes
DC = λC,
(12)
which looks like an eigenvalue-eigenvector equation, with λ being the eigenvalue and C the
eigenvector, although it should not be forgotten that λ is a function of C.
Observe that, since D is a diagonal matrix, its eigenvectors are the canonical vectors ei ,
corresponding to the eigenvalues di . This implies that all solutions of (12), and hence of (11)
are of the form C = μi ei . That is, equilibria are of the form Ci = μi , Cj = 0, ∀j = i, for
some index i ∈ {1, 2, . . . , n}. Indeed, observing that 1T μi ei = μi , and substituting into (11)
yields
α
(13)
Dμi ei = (μi )μi ei .
β
Swarm Intell
Dividing both sides by μi = 0 and simplifying, we see that
μi =
β
di .
α
(14)
This underlines the fact that (11) is not an eigenvalue-eigenvector equation,2 but a related
nonlinear equation, which has exactly n solutions βα di ei , i = 1, . . . , n.
From part 1 of the proof, at equilibrium, for any general value of k, (Ci∗ )k can be replaced
by Ci∗ . Thus, the one-to-one correspondence between the equilibria of (5) and (7) is as
follows:
1k
β
β
di ei of (5) corresponds to equilibrium
di ei of (7).
(15)
Equilibrium
α
α
This indicates that the equilibrium is maintained for k > 1 with less pheromone than is
required in the case k = 1.
This completes the second part of the proof, namely the calculation of the equilibrium
points. Note that the proof has also shown that all possible equilibrium points correspond to
bounded pheromone concentrations.
[Part 3: Stability of the equilibrium corresponding to the shortest path and instability of all
other equilibria]
In this final part of the proof, we will carry out a perturbation analysis around an equilibrium point μi ei . The analysis will show that there always exist perturbations around an
equilibrium point μi ei , i = 1, that will cause the system to move away from such an equilibrium point. On the other hand, perturbations around μ1 e1 will cause the system to move
in such a way as to return to it, nullifying the perturbation. The starting point of the analysis
is (6), which, after multiplying throughout by the scalar V 2 , can be rewritten in vector form
as follows.
2
(16)
V ΔC = diag C(−αV C + βDC).
Note that, since V 2 is always positive, the sign of the scalar product of the vector V 2 ΔC
with any other vector v is always the same as that of the scalar product of ΔC with v.
Thus, in the remainder of the proof, we will use (the right hand side of) Eq. (16) in the
perturbation analysis of the vector ΔC.
The perturbation analysis is carried out around the equilibrium point C = μi ei : C =
(0, 0, . . . , 0, μi , 0, . . . , 0)T . The pheromone concentration vector after perturbation is denoted
C + = μi ei + = (1 , 2 , . . . , i−1 , μi + i , i+1 , . . . , n )T .
(17)
Since the pheromone on any path must be non-negative, we have
j ≥ 0, ∀j = i;
i ≥ −μi ,
1T > −μi .
(18)
After perturbing the equilibrium as in (17), we have
V = 1T (C + ) =
β
di + 1T .
α
(19)
2 This is because the eigenvalue-eigenvector equation Ax = λx, for a fixed eigenvalue λ has infinitely many
solutions. In fact, if x is an eigenvector, then y = γ x, for all γ ∈ R is also a solution. However, in the present
case, there are exactly n solutions, corresponding to n unique values of γ .
Swarm Intell
Substituting (17) and (19) in (16) and dropping higher order (O( 3 )) terms, in order to
evaluate the component of V 2 ΔC in the direction of canonical vector ej leads to
j = i,
j2 β(dj − di ),
2
T
(20)
V ΔC ej =
2
T
−(μi + i ) (1 ), j = i.
We first analyze the equilibria μi ei , for the case i = 1. From (9), (dj − di ) > 0 for j < i,
and (dj − di ) < 0 for j > i. Thus, from (20), V 2 ΔCT ej > 0, for j < i (respectively,
V 2 ΔCT ej < 0, for j > i), so that perturbations in the j th coordinate direction, for j < i,
increase, whereas perturbations in directions corresponding to j > i tend to decrease. This
implies that, for all equilibria μi ei , i = 1, there exist directions (all ej , j < i) in which
perturbations cause movement away from the equilibrium: in other words, these equilibria
are unstable.
On the other hand, for the equilibrium μ1 e1 , (dj − d1 ) < 0 for all j , so that all perturbations in the directions ej , j = i decrease. For perturbations in the direction e1 , we observe
that if 1T > 0, then V 2 ΔC < 0; conversely, if 1T < 0, then V 2 ΔC > 0. This means
that the equilibrium μ1 e1 is stable. Since 1T is the perturbation in the total pheromone
on all paths, this also means that the system moves in a direction so that the total amount
of pheromone is conserved. Consequently, the discussion on stability of equilibrium points
leads us to conclude that pheromone remains concentrated on the shortest path, and that the
total pheromone remains conserved.
Remark 1 As the proof indicates, the scaled eigenvector ( βα d1 )e1 associated with the dominant eigenvalue (the largest diagonal element of βα D) is the only asymptotically stable equilibrium point of (6). This motivated the choice of the name EigenAnt for the algorithm. Our
proof parallels the stability proof for Oja’s rule (Hertz et al. 1991), for a system that finds the
first principal component of a given data set. The proof also makes it clear that the shortest
path always corresponds to the only locally asymptotically stable equilibrium. Thus, if the
shortest path changes at any time, convergence to the new shortest path should be expected
to occur, since path choice is probabilistic.
Remark 2 The proof also shows that the total pheromone concentration is bounded at steady
1
state, and is equal to μ1 = ( βα d1 ) k . At steady state, the trail is sustained by ants choosing
it, causing both deposition and removal to occur on the chosen path in accordance with the
proposed dynamics. The dependence on k indicates that power law variants of EigenAnt
require less pheromone to maintain a trail, which is a possible advantage for a power law
dependence of path selection probabilities on pheromone concentrations. Note that, in the
notation of the theorem, the normalized pheromone concentration vector can be expected,
in the presence of perturbations caused by the probabilistic path choices, to converge to the
vector e1 .
3.2 The difference between selective removal and evaporation
In this section, we briefly discuss the difference between evaporative removal of pheromone
of the kind employed in Ant System (AS) (Dorigo 2007; Dorigo et al. 1996), and selective
removal of pheromone (or selective deposition of negative pheromone). The latter is the
kind of negative feedback employed in EigenAnt. The difference between the two is that in
the case of evaporative removal, pheromone gets removed even when no pheromone might
be added, whereas in EigenAnt, removal takes place only on paths that are selected by ants;
Swarm Intell
ants also reinforce these trails by depositing pheromone on them. In mathematical terms, let
si be a binary switching variable defined as:
1 if path i is chosen,
si =
(21)
0 otherwise.
Then, the equation for pheromone update using a selective removal model such as EigenAnt
is of the form
Ci
ΔCi = si −αCi + βf (Li )
,
(22)
V
whereas the Ant System update equation, specialized to the case of a single source and
destination node, which assumes evaporative removal (Dorigo 2007; Dorigo et al. 1996) is
ΔCi = −ρCi + si Q/Li ,
for all i,
(23)
where Q is a suitably chosen constant, often set to Q = 1. Also, in the AS model, the path
choice probability for the ith path is given by
C μ (1/Li )ν
,
pi = i μ
ν
j Cj (1/Lj )
(24)
where μ ≥ 0 and ν ≥ 1 are exponents to be suitably chosen and, indeed, an adequate choice
is crucial for the good performance of AS, even though this is not always a simple task
(Yuan et al. 2012).
The main differences to be noted are as follows. The first is that the binary switching
variable si affects both removal and deposition in the case of a selective removal model
like EigenAnt, whereas it affects only deposition in the case of the AS model employing
evaporative removal. The second is that calculation of the path choice probability for the ith
path, pi , in the AS model (24) involves knowledge of all the path lengths Lj , j = 1, 2, . . . , n,
whereas the corresponding calculation for EigenAnt (2) does not.
We performed a set of simulations to compare AS with EigenAnt. We consider a simple
scenario in which there are three paths from the source to the destination, having lengths
L1 = 1, L2 = 2, and L3 = 4, respectively. On each trip, an ant probabilistically chooses one
of the three paths, based on the normalized pheromone concentration on each of them. The
initial pheromone concentration on path 1 is C10 = 1 while it is increased from 1 to 10 on the
other two paths in steps of 1. For each of the 100 initial pheromone concentration vectors
obtained in this way, 1000 ant trips were simulated and the resulting path counter for path
1 divided by 1000 in order to compute the approximate probability of convergence to path
1. In all simulations, f (Li ) = 1/Li . Figures 2 and 3 show these approximate probabilities
for EigenAnt and for AS, respectively. Simulations were performed using an arbitrary (i.e.,
untuned) choice of the parameters. This comparison is, however, merely illustrative of the
possibilities and, without further experiments, can only be interpreted to say that proper
tuning of parameters seems to be more critical for AS than it is for EigenAnt.
The next experiment consisted of fixing the initial concentration at C = (1, 10, 10)T and
plotting the approximate probability of convergence to path 1, despite the heavy initial bias.
In the case of EigenAnt, this is done for all values of α ∈ [0.1, 1] and β ∈ [0.1, 1], while
for AS, it is done for all values of ρ ∈ [0.1, 1] and Q ∈ [0.1, 1]. Plots of approximate convergence probabilities for EigenAnt and for AS are shown in Figs. 4 and 5, respectively.
AS uses four parameters: ρ, Q, μ, and ν. In order to display three dimensional plots of the
approximate probabilities, AS parameters μ and ν have been fixed at 0.5 and 2, respectively.
The plots show that EigenAnt is less sensitive to the choice of parameters.
Swarm Intell
Fig. 2 Variation in the approximate probability of convergence to the shortest path (path 1), where L1 = 1,
L2 = 2 and L3 = 4, with respect to increasing pheromone concentrations on longer paths. The initial
pheromone concentration on path 1 is C10 = 1 while it is increased from 1 to 10 on the other two paths
in steps of 1, using EigenAnt’s selective removal of pheromone, with α = 0.8, and β = 1. The approximate
probability of convergence to path 1 is always 1, irrespective of increasing initial pheromone concentrations
C20 and C30 on the longer paths
Fig. 3 Variation in the
approximate probability of
convergence to the shortest path
(path 1), where L1 = 1, L2 = 2
and L3 = 4, with respect to
increasing pheromone
concentrations on longer paths.
The initial pheromone
concentration on path 1 is C10 = 1
while it is increased from 1 to 10
on the other two paths in steps
of 1. Pheromone update is carried
out using (23) with μ = 1, ν = 1,
ρ = 0.2, Q = 1. The approximate
probability of convergence to
path 1 decreases with increasing
bias on the longer paths. Note
that the parameters μ, ν, ρ, Q
have not been optimized
4 Numerical experiments
In this section, we provide simulation results for EigenAnt under different conditions. In all
simulations, f (Li ) = 1/Li . First, we test whether EigenAnt finds the shortest path, when all
paths other than the shortest one have significantly larger initial pheromone concentrations.
All the simulations were performed on a laptop with 3 GB RAM and an Intel Core 2 Duo
CPU with a 2.66 GHz system clock.
Swarm Intell
Fig. 4 Approximate probability
of convergence to the shortest
path (path 1), using EigenAnt,
with respect to variation in
parameter values. The path
lengths are L1 = 1, L2 = 2, and
L3 = 4. The initial pheromone
concentration vector C is
(1, 10, 10)T for all simulations.
The approximate probability of
convergence to path 1 is
always 1, regardless of the
choices of α and β, showing
robustness to the choice of these
parameters
Fig. 5 Approximate probability
of convergence to the shortest
path (path 1) with respect to
variation in parameter values.
The path lengths are L1 = 1,
L2 = 2, and L3 = 4. The initial
pheromone concentration
vector C is (1, 10, 10)T for all
simulations. Pheromone update is
carried out using AS (23), for
which parameters μ and ν have
been fixed at 0.5 and 2,
respectively. The approximate
probability of convergence to
path 1 decreases rapidly for large
values of Q, but it is relatively
robust to changes in ρ, for Q
below 0.5
In the second set of experiments, we moved the destination during the experiment after
EigenAnt has converged to the shortest path. In other words, we permute the path lengths,
and a different path now becomes the shortest one. We test whether EigenAnt finds the
shortest path after this change in path lengths.
Thirdly, we examine the case when all paths are not known a priori, but are discovered
according to some exploration probability. We investigate how EigenAnt adapts in response
to these discoveries.
4.1 Convergence in the presence of a large initial bias on sub-optimal paths
In each experiment, there were ten possible paths between the source and the destination.
The length of each path is given by its index, for example, path 7 is 7 units long: L7 = 7.
The diagonal elements, di , of D were chosen as di = f (Li ) = L1i . The initial pheromone
distribution was chosen as Ci0 = i, intentionally biased towards long paths, with path 10
Swarm Intell
Fig. 6 The plots show (a) the
evolution of normalized
pheromone concentration and
(b) the number of ant trips on
each of a set of ten paths whose
lengths range from 1 to 10, with
path length Li = i. The initial
pheromone distribution was
chosen as Ci0 = i, intentionally
biased towards long paths, with
path 10 having 10 times more
pheromone than path 1. Despite
this, the pheromone rapidly
concentrates on the shortest path.
The parameter values used are
α = 0.5, and β = 1. The plots
correspond to a snapshot taken
after a total of 80 ant trips
initially having 10 times more pheromone than path 1. Large initial pheromone bias on suboptimal paths causes many ant algorithms to fail to converge to the optimal path (Chen et al.
2009). Figures 6 and 7 show the evolution of normalized pheromone concentration on a
set of ten paths. Despite the large initial bias, pheromone gets rapidly concentrated on the
shortest path.
4.2 Convergence results with changing path lengths
In this set of experiments, the initial path length vector was L = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
and was changed, at t = 80, to L = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1], so that path 1, which was
initially the shortest path, with L1 = 1, became the longest one at t = 80. At this time, the
erstwhile longest path, path 10, became the shortest one, with length L10 = 1. Figures 8
and 9 present simulation results for the case when f (Li ) = 1/Li , showing that the nor-
Swarm Intell
Fig. 7 The plots show (a) the
evolution of normalized
pheromone concentration and
(b) the number of ant trips on
each of the ten paths in the
example referred to in Fig. 6. The
plots correspond to a snapshot
taken after 500 ant trips
malized pheromone distribution corrects itself, demonstrating the plasticity of the proposed
EigenAnt dynamics.
4.2.1 Experimental results on larger graphs
Practical applications of ant colony optimization involve finding the shortest path on a general graph, where nodes other than the source and destination node are present, and paths
from the source to the destination are composed of multiple edges spanning intermediate
nodes. Application of EigenAnt to such examples is on the lines of the procedure adopted
by well known ACO algorithms, using ‘forward’ and ‘backward’ ants (i.e., ants going from
source to destination and those going from destination to source). At each node, the relative
pheromone concentration of outgoing edges, where the directionality is defined by the type
of ant (forward or backward), is used to determine the probability of choosing any one of
Swarm Intell
Fig. 8 The plots show (a) the
normalized pheromone
concentration and (b) the number
of ant trips made on each of a set
of ten paths of lengths ranging
from 1 to 10, with path length
Li = i, corresponding to a
snapshot taken after a total of 120
ant trips. The initial pheromone
distribution was chosen as a
vector in R10 , with each entry
being 1/10. The initial path
length vector was
L = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
and was changed, at t = 80, to
L = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1].
As a result, path 1, which was
initially the shortest path with
L1 = 1 became the longest one at
t = 80. The parameter values
used are α = 0.5, β = 1, and
k = 1. The path choice
probabilities are in the process of
converging to e1 when the
change in the length vector
occurs
them, while the deposition and removal of pheromone is as prescribed earlier. Figure 10
shows an example with 11 nodes, in which all paths between the source and the destination
have four segments. At node 4, for example, a forward ant chooses a path to one of nodes 5,
6, and 7, since outgoing edges from node 4 lead to one of these nodes.
Figure 11 shows a plot of the lengths of trips made by ants, against the total number of
ant trips. The initial pheromone concentration on each edge was chosen to be equal to its
length. Each path traversal is marked by a dot. In this example, paths between the source
and the destination range in length from 13 to 27. The figure shows that as ant trips progress,
the shortest path is chosen increasingly often. The shortest path, which has a length of 13,
appears as a dense black line in Fig. 11(a), while longer paths appear as isolated dots. The
histogram, Fig. 11(b), shows that most trips are made on the shortest path.
We also ran EigenAnt on a larger example, with 10 stages of 10 nodes each between
the source and the destination. Each node in a stage is connected to every node in the next
Swarm Intell
Fig. 9 The plots show (a) the
normalized pheromone
concentration and (b) the number
of ant trips made on each of a set
of ten paths of lengths ranging
from 1 to 10, with path length
Li = i, for the example referred
to in Fig. 8, corresponding to a
snapshot taken after 400 ant trips.
The path choice probabilities are
in the process of converging
to e1 , when the change in the
length vector occurs, and
subsequent convergence is to e10
Fig. 10 A three stage multi-hop
network with three nodes in each
stage. Node 1 is the source node
and 11 is the destination node.
The lengths of each edge are
indicated. The shortest path has a
length of 13, and is shown by the
heavier lines connecting nodes
1–4–7–9–11. Observe that the
optimal path from source to
destination requires the choice of
locally nonoptimal edges at
nodes 1 and 4
Swarm Intell
Fig. 11 (a) Evolution of lengths
of paths traversed by ants against
the total number of ant trips.
Each traversed path is marked by
a dot against its length. Initial
pheromone bias causes trails to
be formed on longer paths at first.
Over time, EigenAnt updates
cause ants to choose the shortest
paths more frequently.
Eventually, pheromone gets
concentrated only on the shortest
path. Parameter values are
α = 0.1, β = 2, and
f (Li ) = 1/Li . (b) Histogram of
traversed path lengths, in the case
of the network of Fig. 10
stage. The number of possible routes is 1010 . Edge lengths were assigned random values,
and the initial pheromone on each edge was chosen to be equal to its length. Figure 12 shows
the evolution of traversed path lengths against the number of ant trips. In this example, path
lengths range between 13 and 72. Each trip is marked by a dot against its length. The shortest
path with length 13 appears as a dense line, indicating that most trips are made on this path
as time progresses.
4.3 Convergence when new paths are discovered according to an exploration probability
Thus far, we have assumed that in EigenAnt all available paths are known or, equivalently,
that all paths have a non-zero initial pheromone concentration. We now examine the case
when all paths are not known in advance, or, equivalently, that some paths initially have
zero pheromone concentration. Suppose that an exploration probability q is specified. In
this case, EigenAnt can be thought of as a two stage process. The first stage consists of a
Swarm Intell
Fig. 12 Lengths of traversed
paths against the number of ant
trips. The total number of
possible routes between source
and destination is 1010 . Each
traversed path is marked by a dot
against its length. Initial
pheromone bias causes trails to
be formed on longer paths at first.
Over time, EigenAnt updates
cause ants to choose the shortest
path more frequently. The plot
shows only the first 4000 ant trips
Fig. 13 This plot shows how the
probability of choosing paths
varies when newer paths are
discovered. The simulation shows
that trails switch to newer paths
only if they are shorter. Please
refer to the text for further details
decision between exploration and exploitation of prior knowledge: in mathematical terms, a
pheromone-free path is chosen with probability q, while a path with non-zero concentration
is chosen with probability (1 − q). In the former case, the pheromone concentration of the
chosen path is set to a prespecified (small) positive value, and the current number of known
paths incremented by one, after which, in the second stage, the steps of Algorithm 1 are
followed. In the latter case, in which a known path is chosen, the first and second stages
collapse into one, which consists of merely applying Algorithm 1.
Figure 13 shows the results obtained for a simulation run in which the exploration probability was chosen as q = 0.1. Initially, only paths 1, 2, and 3 are known. The initial
pheromone concentrations on these paths are 0.8, 0.7, and 0.1, respectively. The lengths
of the 10 paths are 10, 9, 8, 6, 7, 5, 4, 2, 1, and 3. The shortest path, path 9, which has a
length of 1, is discovered after about 160 ant trips. The simulation shows that trails switch
to newly discovered paths that are shorter, but otherwise remain on the previously found
shorter paths.
Swarm Intell
This behavior may be understood in the light of the analysis presented in Sect. 3. As a
new path gets discovered, the D matrix gets updated, and a new diagonal element is added
to it. If the maximal eigenvector of the updated D matrix is larger than the previous one, the
previous eigenvector becomes unstable. This may be understood from (20). Of course, if the
maximal eigenvector is the same as the earlier one, indicating that the new path is not shorter
than the currently best known one, then the system remains in its previous equilibrium, and
the amount of pheromone on the new path does not grow.
5 Conclusion
In this paper we have proposed a new ant colony optimization algorithm called EigenAnt.
The principal novel feature of EigenAnt is the use of negative feedback in the form of “selective removal of pheromones”: in other words, removal only from the probabilistically
selected path. This is in contrast with evaporation, normally used in existing ACOs, which
affects all paths, regardless of whether they are chosen or not.
There are two problems that lead to deterioration in the performance of a conventional
ACO algorithm—stagnation and premature stabilization at a sub-optimal solution. For the
problem of selecting the shortest among a set of paths directly connecting the source and
the destination nodes, we provided a theoretical stability analysis of the proposed EigenAnt
algorithm, as well as simulation results that confirm that the EigenAnt algorithm prevents
premature stabilization at a sub-optimal solution, even if initial pheromone concentrations
on sub-optimal paths are much larger than on the shortest path. Furthermore, we have experimentally shown that EigenAnt does not suffer from stagnation when new paths are discovered, when path lengths change over time, or when paths that were previously optimal
ones suddenly become sub-optimal. Even in dynamically changing scenarios, pheromone
concentrations in EigenAnt quickly change to reflect the new situation. These properties
suggest that the proposed EigenAnt algorithm might be successful also when applied to
more challenging combinatorial optimization problems. We are currently working in this
direction.
Acknowledgements The authors would like to thank reviewers 2 and 3 for constructive criticism, as well
as reviewer 1, the Associate Editors and the Editor for detailed comments and suggestions. The work of the
first author (J) was partially supported by a grant from the DST (Indo–Brazil International Collaboration).
The work of the last author (AB) was partially supported by grants from FAPERJ (CNE) and CNPq (BPP,
Brazil–India International Collaboration).
References
Abdelbar, A. M., & Wunsch, D. C. (2012). Improving the performance of MAX-MIN ant system on the
TSP using stubborn ants. In Proceedings of the fourteenth international conference on genetic and
evolutionary computation conference companion, GECCO Companion’12 (pp. 1395–1396). New York:
ACM.
Bandieramonte, M., Di Stefano, A., & Morana, G. (2010). Grid jobs scheduling: the alienated ant algorithm
solution. Multiagent and Grid Systems, 6(3), 225–243.
Bonabeau, E., Dorigo, M., & Theraulaz, G. (1999). Swarm intelligence: from natural to artificial systems.
New York: Oxford University Press.
Chen, L., Sun, H. Y., & Wang, S. (2009). First order deceptive problem of ACO and its performance analysis.
Journal of Networks, 4(10), 993–1000.
Deneubourg, J. L., Aron, S., Goss, S., & Pasteels, J. M. (1990). The self-organizing exploratory pattern of the
Argentine ant. Journal of Insect Behaviour, 3, 159–168.
Swarm Intell
Di Caro, G., & Dorigo, M. (1998a). Ant colonies for adaptive routing in packet-switched communications
networks. In A. E. Eiben, T. Bäck, M. Schoenauer, & H. P. Schwefel (Eds.), Lecture notes in computer
science: Vol. 1498. Parallel problem solving from nature—PPSN V: 5th international conference (pp.
673–682). Berlin: Springer.
Di Caro, G., & Dorigo, M. (1998b). AntNet: distributed stigmergetic control for communications networks.
Journal of Artificial Intelligence Research, 9, 317–365.
Ding, Y. Y., He, Y., & Jiang, J. P. (2003). Multi-robot cooperation method based on the ant algorithm. In
Proceedings of the swarm intelligence symposium, 2003. SIS’03 (pp. 14–18). New York: IEEE Press.
Dorigo, M. (2007). Ant colony optimization. Scholarpedia, 2(3), 1461. http://www.scholarpedia.org/
article/Ant_Colony_Optimization.
Dorigo, M., & Gambardella, L. M. (1997). Ant colony system: a cooperative learning approach to the traveling salesman problem. IEEE Transactions on Evolutionary Computation, 1(1), 53–66.
Dorigo, M., Maniezzo, V., & Colorni, A. (1996). Ant system: optimization by a colony of cooperating agents.
IEEE Transactions Systems, Man, Cybernetics-Part B, 26(1), 29–41.
Dorigo, M., & Stützle, T. (2004). Ant colony optimization. Cambridge: MIT Press.
Ducatelle, F., Di Caro, G. A., & Gambardella, L. M. (2010). Principles and applications of swarm intelligence
for adaptive routing in telecommunications networks. Swarm Intelligence, 4(3), 173–198.
Ghazy, A. M., El-Licy, F., & Hefny, H. A. (2012). Threshold based AntNet algorithm for dynamic traffic
routing of road networks. Egyptian Informatics Journal, 13(2), 111–121.
Hertz, J., Krogh, A., & Palmer, R. G. (1991). Introduction to the theory of neural computation. Redwood
City: Addison-Wesley.
Jackson, D. E., Martin, S. J., Holcombe, M., & Ratnieks, F. L. W. (2006). Longevity and detection of persistent foraging trails in Pharaoh’s ants, Monomorium pharaonis (L.). Animal Behaviour, 71, 351–359.
Jaffe, K. (1980). Theoretical analysis of the communication system for chemical mass recruitment in ants.
Journal of Theoretical Biology, 84, 589–609.
Mapisse, J., Cardoso, P., & Monteiro, J. (2011). Ant colony optimization routing mechanisms with bandwidth
sensing. In EUROCON—international conference on computer as a tool (pp. 39–42). Lisbon: IEEE
Press.
Merkle, D., Middendorf, M., & Schmeck, H. (2002). Ant colony optimization for resource-constrained project
scheduling. IEEE Transactions on Evolutionary Computation, 6(4), 333–346.
Meyer, B. (2004). Convergence control in ACO. In Lecture notes in computer science: Vol. 3103. Genetic
and evolutionary computation (GECCO) (pp. 1–12), Seattle, WA. Berlin: Springer.
Parpinelli, R. S., Lopes, H. S., & Freitas, A. A. (2002). Data mining with an ant colony optimization algorithm. IEEE Transactions on Evolutionary Computation, 6(4), 321–332.
Reimann, M., Doerner, K., & Hartl, R. F. (2003). Analyzing a unified ant system for the VRP and some of its
variants. In Lecture notes in computer science: Vol. 2611. Proceedings of EvoWorkshops: applications
of evolutionary computing (pp. 300–310). Berlin: Springer.
Reimann, M., Stummer, M., & Doerner, K. (2002). A savings based ant system for the vehicle routing problem. In Proceedings of the genetic and evolutionary computation conference, GECCO’02 (pp. 1317–
1326). San Francisco: Morgan Kaufmann Publishers.
Robinson, E. J. H., Ratnieks, F. L. W., & Holcombe, M. (2008). An agent based model to investigate the roles
of attractive and repellent pheromones in ant decision making during foraging. Journal of Theoretical
Biology, 255, 250–258.
Shah, S., Kothari, R., & Jayadeva Chandra, S. (2008). Mathematical modeling and convergence analysis of
trail formation. In Proceedings of the 23rd national conference on artificial intelligence, advancement
of artificial intelligence (AAAI)’08 (Vol. 1, pp. 170–175). Chicago: AAAI Press.
Shah, S., Kothari, R., Jayadeva, & Chandra, S. (2010). Trail formation in ants: a generalized Polya urn process. Swarm Intelligence, 4(2), 145–171.
Stützle, T., & Hoos, H. H. (2000). MAX-MIN ant system. Future Generation Computer Systems, 16(8),
889–914.
Sumpter, D. J. T., & Beekman, M. (2003). From non-linearity to optimality: pheromone trail foraging by ants.
Animal Behaviour, 66, 273–280.
Van Vorhis Key, S. E., & Baker, T. C. (1982). Trail-following responses of the Argentine ant Iridomyrmex
humilis (Mayr), to a synthetic trail pheromone component and analogs. Journal of Chemical Ecology,
8(1), 3–14.
Yildirim, U. M., & Çatay, B. (2012). A time-based pheromone approach for the ant system. Optimization
Letters, 6(6), 1081–1099.
Yuan, Z., Montes de Oca, M. A., Birattari, M., & Stützle, T. (2012). Continuous optimization algorithms for
tuning real and integer parameters of swarm intelligence algorithms. Swarm Intelligence, 6(1), 49–75.
Zecchin, A. C., Simpson, A. R., Maier, H. R., & Nixon, J. B. (2005). Parametric study for an ant algorithm
applied to water distribution system optimization. IEEE Transactions on Evolutionary Computation,
9(2), 175–191.

Download Report

Ants find the shortest path: a mathematical proof

Paperzz.com

Your Paperzz