
UNIVERSITÉ LIBRE DE BRUXELLES, UNIVERSITÉ D'EUROPE
Faculté des Sciences appliquées
Academic year 2005-2006

ANT COLONY OPTIMIZATION AND LOCAL SEARCH
FOR THE PROBABILISTIC TRAVELING SALESMAN PROBLEM:
A CASE STUDY IN STOCHASTIC COMBINATORIAL OPTIMIZATION

Thesis supervisor: Prof. Marco Dorigo
Thesis co-supervisor: Prof. Luca Maria Gambardella

Thesis presented by Leonora Bianchi in candidature for the degree of Docteur en Sciences Appliquées

Ant Colony Optimization and Local Search
for the Probabilistic Traveling Salesman Problem:
A Case Study in Stochastic Combinatorial Optimization

Leonora Bianchi
Université Libre de Bruxelles

Summary
In this thesis we focus on Stochastic Combinatorial Optimization Problems (SCOPs),
a wide class of combinatorial optimization problems under uncertainty, where part of
the information about the problem data is unknown at the planning stage, but some
knowledge about its probability distribution is assumed.
Optimization problems under uncertainty are complex and difficult, and classical algorithmic approaches based on mathematical and dynamic programming are often able to solve only very small problem instances. For this reason, in recent years metaheuristic algorithms such as Ant Colony Optimization, Evolutionary Computation, Simulated Annealing, Tabu Search, and others are emerging as successful alternatives to classical approaches.
In this thesis, metaheuristics that have been applied so far to SCOPs are introduced
and the related literature is thoroughly reviewed. In particular, two properties of
metaheuristics emerge from the survey: they are a valid alternative to exact classical
methods for addressing real-sized SCOPs, and they are flexible, since they can be quite
easily adapted to solve different SCOP formulations, both static and dynamic. On
the basis of the current literature, we identify the following as the key open issues in
solving SCOPs via metaheuristics: (1) the design and integration of ad hoc, fast and
effective objective function approximations inside the optimization algorithm; (2) the
estimation of the objective function by sampling when no closed-form expression for the
objective function is available, and the study of methods to reduce the time complexity
and noise inherent to this type of estimation; (3) the characterization of the efficiency
of metaheuristic variants with respect to different levels of stochasticity in the problem
instances.
We investigate the above issues by focusing in particular on a SCOP belonging to
the class of vehicle routing problems: the Probabilistic Traveling Salesman Problem
(PTSP). For the PTSP, we consider the Ant Colony Optimization metaheuristic and
we design efficient local search algorithms that can enhance its performance. We obtain
state-of-the-art algorithms, but we show that they are effective only for instances above
a certain level of stochasticity, otherwise it is more convenient to solve the problem
as if it were deterministic. The algorithmic variants based on an estimation of the objective function by sampling obtain worse results, but behave qualitatively like the algorithms based on the exact objective function with respect to the level of stochasticity. Moreover, we show that the performance of algorithmic variants based on ad hoc approximations is strongly correlated with the absolute error of the approximation, and that ad hoc approximations can severely degrade the effectiveness of local search.
Finally, we briefly address another SCOP belonging to the class of vehicle routing
problems: the Vehicle Routing Problem with Stochastic Demands (VRPSD). For this
problem, we have implemented and tested several metaheuristics, and we have studied the impact of integrating different ad hoc approximations into them.


Acknowledgments
The research work of this thesis has been mainly done at IDSIA, the Dalle Molle
Institute for Artificial Intelligence in Lugano, Switzerland. I express my sincere thanks
to all the people who have been at IDSIA since I arrived there in 2000, for the friendly and special working environment they created.
An important part of this thesis is rooted in one month spent at IRIDIA, Université Libre de Bruxelles, Brussels, Belgium. I must thank all the people working there, and particularly Joshua Knowles, who, although not previously involved in my subject of research, was very open to a profitable exchange of ideas that resulted in my first journal paper. From IRIDIA I also thank the secretary, Muriel Decreton, for her help with the bureaucratic formalities that she carried out for me while I was in Switzerland.
I also thank all the people involved in the Metaheuristics Network, particularly
those with whom I worked for the research about the stochastic vehicle routing problem:
Mauro Birattari, Marco Chiarandini, Max Manfrin, Monaldo Mastrolilli, my husband
Fabrizio Oliverio, Luis Paquete, Olivia Rossi-Doria, and Tommaso Schiavinotto. From
each of them I learnt something very useful for my research. I especially thank Mauro
Birattari and Marco Chiarandini for their support with statistical analysis of results.
I acknowledge financial support from two sources: the Swiss National Science Foundation project titled "On-line fleet management", grant 16R10FM; and the Metaheuristics Network, a Research and Training Network funded by the Improving Human Potential Programme of the Commission of the European Communities, grant HPRN-CT-1999-00106.
I am particularly grateful to Luca Maria Gambardella for having supported and
guided my research since the beginning, and to Marco Dorigo for his careful supervision
of my work.


Contents

Summary
Acknowledgments
Contents
List of Algorithms
Original contributions and outline

I Metaheuristics for Stochastic Combinatorial Optimization

1 Introduction
  1.1 Motivation
  1.2 Modeling approaches to uncertainty

2 Formal descriptions of SCOPs
  2.1 General but tentative SCOP definition
  2.2 Static SCOPs
  2.3 Dynamic SCOPs
  2.4 Objective function computation
      2.4.1 Ad hoc approximations
      2.4.2 Sampling approximation

3 Metaheuristics for SCOPs
  3.1 Ant Colony Optimization
      3.1.1 Introduction to ACO
      3.1.2 ACO for SCOPs
            3.1.2.1 Exact objective and ad hoc approximation
            3.1.2.2 Sampling approximation
            3.1.2.3 Markov Decision Processes
  3.2 Evolutionary Computation
      3.2.1 Introduction to EC
      3.2.2 EC for SCOPs
            3.2.2.1 Exact objective and ad hoc approximation
            3.2.2.2 Sampling approximation
            3.2.2.3 Markov Decision Processes
  3.3 Simulated Annealing
      3.3.1 Introduction to SA
      3.3.2 SA for SCOPs
            3.3.2.1 Exact objective and ad hoc approximation
            3.3.2.2 Sampling approximation
  3.4 Tabu Search
      3.4.1 Introduction to TS
      3.4.2 TS for SCOPs
            3.4.2.1 Exact objective and ad hoc approximation
            3.4.2.2 Sampling approximation
  3.5 Stochastic Partitioning Methods
      3.5.1 Stochastic Partitioning Methods for SCOPs
            3.5.1.1 Exact objective and ad hoc approximation
            3.5.1.2 Sampling approximation
  3.6 Other algorithmic approaches to SCOPs
  3.7 Discussion and open issues
      3.7.1 Using the Sampling approximation
      3.7.2 Experimental comparisons among different metaheuristics
      3.7.3 Theoretical convergence properties

II ACO and local search for the PTSP

4 The Probabilistic Traveling Salesman Problem
  4.1 Statement of the problem
  4.2 Literature review
  4.3 Benchmark of PTSP instances
  4.4 Lower bound of the optimal solution value
  4.5 Simple constructive heuristics
      4.5.1 Experimental analysis

5 Ant Colony Optimization
  5.1 A straightforward implementation
      5.1.1 The pACS algorithm
      5.1.2 Experimental analysis
            5.1.2.1 Computational environment
            5.1.2.2 Tuning
            5.1.2.3 Results
  5.2 The use of objective function approximations in ACO
      5.2.1 Motivation
      5.2.2 The pACS-T algorithm based on the TSP approximation
            5.2.2.1 The TSP approximation
            5.2.2.2 The pACS-T algorithm
      5.2.3 The pACS-D algorithm based on the Depth approximation
            5.2.3.1 The Depth approximation
            5.2.3.2 The pACS-D algorithm
      5.2.4 The pACS-S algorithm based on the Sampling approximation
            5.2.4.1 The Sampling approximation
            5.2.4.2 The pACS-S algorithm
      5.2.5 Experimental analysis
            5.2.5.1 Tuning
            5.2.5.2 Results
  5.3 Overview of the results

6 Local search
  6.1 The issue of complexity and three options to deal with it
  6.2 The 2-p-opt and the 1-shift operators
  6.3 The infeasibility of using the exact full objective function
  6.4 Exact recursive local search
      6.4.1 A bit of history
      6.4.2 Derivation of local search costs for the heterogeneous PTSP
            6.4.2.1 The cost of 2-p-opt moves
            6.4.2.2 The cost of 1-shift moves
      6.4.3 Derivation of local search costs for the homogeneous PTSP
            6.4.3.1 The cost of 2-p-opt moves
            6.4.3.2 The cost of 1-shift moves
      6.4.4 Computational test
            6.4.4.1 Pseudocode of the 2-p-opt and 1-shift local search
            6.4.4.2 Experimental setup
            6.4.4.3 Comparison between correct and incorrect local search
  6.5 Approximated local search
      6.5.1 Ad hoc approximations for the 1-shift
            6.5.1.1 1-shift-T approximation
            6.5.1.2 1-shift-P approximation
      6.5.2 Sampling approximation for the 1-shift: 1-shift-S
      6.5.3 Pseudocode of the approximated 1-shift local search
  6.6 Overview of the results

7 Integration of ACO with local search
  7.1 Description of hybrid algorithms
      7.1.1 Hybridization of simple constructive heuristics
      7.1.2 Hybridization of ACO
  7.2 Experimental analysis
      7.2.1 Advantage of using a local search with ACO
      7.2.2 Exact versus approximated local search
  7.3 Overview of the results

8 Conclusions

A Detailed results of ACO for the PTSP

B Hybrid Metaheuristics for the VRPSD
  B.1 Introduction
  B.2 The VRPSD
      B.2.1 The OrOpt local search
            B.2.1.1 VRPSD approximation scheme
            B.2.1.2 TSP approximation scheme
      B.2.2 Benchmark
  B.3 The metaheuristics
  B.4 Experimental Setup
  B.5 First hybridization
  B.6 Second hybridization
  B.7 Conclusions

List of Algorithms

1  Ant Colony Optimization (ACO)
2  S-ACO
3  Evolutionary Computation (EC)
4  Simulated Annealing (SA)
5  Stochastic Simulated Annealing (SSA)
6  Tabu Search (TS)
7  Stochastic Branch and Bound (SBB)
8  pACS
9  pACS-T
10 pACS-D
11 pACS-S with variable number of samples
12 Swapping local search(, BSF)
13 2-p-opt(, BSF)
14 1-shift(, BSF)
15 1-shift-T(, BSF)
16 1-shift-P(, BSF)
17 1-shift-S(, BSF) with fixed number of samples N
18 hybrid pACS (pACS + 1-shift, pACS + 1-shift-T, pACS + 1-shift-P)
19 hybrid pACS-S (pACS-S + 1-shift-S)
20 Computation of the VRPSD objective function f0(Q)
21 OrOpt(p), with p ∈ {VRPSD, TSP}

Original contributions and outline
The contributions of this thesis lie at the intersection of two very active streams of research within the field of combinatorial optimization: Stochastic Combinatorial Optimization Problems (SCOPs) on one side, and metaheuristics on the other. The general logical structure of the thesis, schematized in Figure 1, is the following. Part I addresses the field of solving SCOPs via metaheuristics; Part II focuses on solving one particular SCOP, the Probabilistic Traveling Salesman Problem (PTSP), via one particular metaheuristic, Ant Colony Optimization (ACO); and Appendix B considers solving another particular SCOP, the Vehicle Routing Problem with Stochastic Demands (VRPSD), via several metaheuristics. For each of these conceptual blocks, we describe here the main original contributions, with pointers to the corresponding scientific publications where part of the ideas have appeared or will appear.
Clear definition of the scope of SCOPs. There is an increasing interest of the operations research community in addressing optimization problems that include uncertain, stochastic, and dynamic information in their mathematical formulation. The multitude of model formulations in the literature makes it difficult to compare different solution approaches and to identify promising directions of research. Chapter 1 proposes a classification of the modeling approaches to uncertainty according to dynamicity and type of description of uncertain data, and precisely defines the scope of SCOPs. Chapter 2 recalls the main formal definitions of both static and dynamic SCOPs from the literature, providing links to the corresponding algorithmic domains, with the aim of giving a clear view of the intersection between classical approaches and new ones based on metaheuristics.
Survey of metaheuristic applications to SCOPs. Chapter 3 aims at putting the several applications of metaheuristics to SCOPs under a unifying view, filling a gap in the literature: a number of surveys and books exist about solving SCOPs via classical techniques, but none about using metaheuristics, even though the research literature is already quite rich. This chapter also selects and discusses some open issues that emerge from the survey. In particular, the issue of using approximations of the objective function inside optimization algorithms will be further investigated in Part II. The content of Part I (Chapters 1 to 3) is the source of an article [37], which is in preparation for submission to an international journal.

Figure 1: General outline of this thesis (acronyms are explained in the text). [Diagram: Part I covers SCOPs together with metaheuristics in general; Part II covers the PTSP together with ACO; Appendix B covers the VRPSD together with several metaheuristics.]
A benchmark for the PTSP. The PTSP has many features that make it useful as a test problem for algorithms addressing SCOPs, and it is likely to be considered again in the literature. Thus, we think that the creation of a benchmark of instances for the PTSP will be a useful instrument for future research. Chapter 4, besides introducing the PTSP as a particular case study of the SCOPs class, describes a benchmark of 432 PTSP instances that we have set up for the experimental evaluation of algorithms for the PTSP. The benchmark has been carefully designed, in particular to allow the analysis of the behavior of optimization algorithms for different levels of stochasticity. In order to facilitate future comparisons with our results, we have also used a known bound from the literature to compute a lower bound on the optimal solution values for the instances of the PTSP benchmark.
ACO for the PTSP. Chapter 5 develops several ACO algorithms for the PTSP, and
focuses on two aspects. First, the dependence of ACO performance on PTSP
instance characteristics. Second, the impact of different approximations of the
objective function on the performance of ACO. Preliminary experiments about
the first ACO algorithm of this chapter, pACS, have been published in the proceedings of the 7th International Conference on Parallel Problem Solving from
Nature (PPSN-VII) [38] and in the proceedings of the 3rd International Workshop
on Ant Algorithms (ANTS 2002) [39].
Powerful local search algorithms for the PTSP. Chapter 6 explores, in the specific context of the PTSP, a range of possibilities among which to choose when designing a local search algorithm for a SCOP that has a computationally expensive objective function. The most promising alternatives are: using an approximated cost for the local search operators, and finding cost expressions that can be computed efficiently and exactly by exploiting some recursive mechanism. In Chapter 6 different cost approximations are proposed and analyzed, and, for the 2-p-opt and 1-shift local search operators, efficient and exact cost expressions are derived. The derivation and preliminary experimental results based on these efficient local search algorithms (described in Section 6.4) have been published in two articles in the European Journal of Operational Research [42, 35].
State-of-the-art algorithm combining ACO with local search. In Chapter 7 we obtain a state-of-the-art algorithm for solving the PTSP by combining one ACO algorithm proposed in Chapter 5 with the powerful 1-shift local search derived in Chapter 6. Chapter 7 also ranks, in terms of solution quality, the local search variants based on the different cost approximations proposed in Chapter 6. The contents and experimental results of Chapters 5 and 7 are being condensed in an article [40], in preparation for submission to an international journal.
Using objective function approximations in metaheuristics for the VRPSD. Appendix B analyzes the performance of several metaheuristics on the VRPSD, which, like all SCOPs, has a computationally demanding objective function. Two types of approximations of the objective function are used inside the metaheuristics. One approximation, based on the deterministic traveling salesman problem, proves to be particularly efficient when coupled with two metaheuristics (Iterated Local Search and Evolutionary Computation), and leads to state-of-the-art algorithms for the VRPSD. This work has been possible thanks to the collaboration of several people from different research labs, who participated in the Metaheuristics Network, a European project funded by the Marie Curie programme [143]. Preliminary results have been published in the proceedings of the 8th International Conference on Parallel Problem Solving from Nature (PPSN VIII) [32], and the contents of Appendix B have been accepted for publication by the Journal of Mathematical Modelling and Algorithms [33].
Summarizing, the original contributions of this thesis can be quantified in terms of the following scientific publications:

Articles in international journals: 3 articles published ([42], [35], [33]) and 2 articles in preparation ([37], [40]).

Articles in international conference proceedings: 3 articles published ([38], [39], [32]).

The complete list of the publications mentioned above follows.

[42] L. Bianchi, J. Knowles, and N. Bowler. Local search for the probabilistic traveling salesman problem: correction to the 2-p-opt and 1-shift algorithms. European Journal of Operational Research, 162(1):206–219, 2005.

[35] L. Bianchi and A. M. Campbell. Extension of the 2-p-opt and 1-shift algorithms to the heterogeneous probabilistic traveling salesman problem. European Journal of Operational Research. To appear.

[33] L. Bianchi, M. Birattari, M. Manfrin, M. Mastrolilli, L. Paquete, O. Rossi-Doria, and T. Schiavinotto. Hybrid metaheuristics for the vehicle routing problem with stochastic demands. Journal of Mathematical Modelling and Algorithms. To appear.

[37] L. Bianchi, M. Dorigo, L. M. Gambardella, and W. J. Gutjahr. Metaheuristics in stochastic combinatorial optimization: a survey. Technical Report IDSIA-08-06, IDSIA - Dalle Molle Institute for Artificial Intelligence, Manno, Switzerland, March 2006.

[40] L. Bianchi, L. M. Gambardella, and M. Dorigo. Ant colony optimization and local search for the probabilistic traveling salesman problem. Technical Report IDSIA-02-06, IDSIA - Dalle Molle Institute for Artificial Intelligence, Manno, Switzerland, 2006.

[38] L. Bianchi, L. M. Gambardella, and M. Dorigo. An ant colony optimization approach to the probabilistic traveling salesman problem. In J. J. Merelo Guervós, P. Adamidis, H.-G. Beyer, J.-L. Fernández-Villacañas, and H.-P. Schwefel, editors, Proceedings of the 7th International Conference on Parallel Problem Solving from Nature (PPSN VII), volume 2439 of Lecture Notes in Computer Science, pages 883–892. Springer, London, UK, 2002.

[39] L. Bianchi, L. M. Gambardella, and M. Dorigo. Solving the homogeneous probabilistic traveling salesman problem by the ACO metaheuristic. In M. Dorigo, G. Di Caro, and M. Sampels, editors, Proceedings of the 3rd International Workshop on Ant Algorithms (ANTS 2002), volume 2463 of Lecture Notes in Computer Science, pages 176–187. Springer, London, UK, 2002.

[32] L. Bianchi, M. Birattari, M. Chiarandini, M. Manfrin, M. Mastrolilli, L. Paquete, O. Rossi-Doria, and T. Schiavinotto. Metaheuristics for the vehicle routing problem with stochastic demands. In X. Yao, E. Burke, J. A. Lozano, J. Smith, J. J. Merelo Guervós, J. A. Bullinaria, J. Rowe, P. Tiňo, A. Kabán, and H.-P. Schwefel, editors, Proceedings of the 8th International Conference on Parallel Problem Solving from Nature (PPSN VIII), volume 3242 of Lecture Notes in Computer Science, pages 450–460. Springer, Berlin, Germany, 2004.


Part I

Metaheuristics for Stochastic Combinatorial Optimization

Chapter 1

Introduction
The focus of the first part of this thesis (from Chapter 1 to Chapter 3) is on Stochastic
Combinatorial Optimization Problems (SCOPs), a wide class of combinatorial optimization problems under uncertainty, where all or part of the information about the
problem data is unknown, but some knowledge about its probability distribution is
assumed. Our intention is to bring the several applications of metaheuristics to SCOPs under a unifying view, thus filling a gap in the literature: a number of surveys and books exist about solving SCOPs via classical techniques, but none about using metaheuristics, even though the research literature on the latter is already quite rich.
The first part of the thesis is organized as follows. Chapter 1 motivates, in Section 1.1, the interest in studying SCOPs via metaheuristics, and in Section 1.2 proposes a
classification of the modeling approaches to uncertainty according to dynamicity and
type of description of uncertain data, and precisely defines the scope of SCOPs. Chapter 2 recalls the main formal definitions of both static and dynamic SCOPs from the
literature, by providing links to the corresponding algorithmic domains, with the aim
of giving a clear view of the intersection between classical approaches and new ones
based on metaheuristics. Chapter 2 also introduces the issue of objective function
computation in SCOPs, which may involve different types of objective function approximations. Chapter 3 reviews the main applications to SCOPs of metaheuristics
for which a significant amount of interesting literature exists, namely Ant Colony Optimization (ACO), Evolutionary Computation (EC), Simulated Annealing (SA), Tabu
Search (TS), Stochastic Partitioning Methods (SPM), Progressive Hedging (PH), and
Rollout Algorithms (RO). Finally, it selects and discusses some open issues by taking
a transversal view on the reviewed metaheuristics.

1.1 Motivation

There is an increasing interest of the operations research community in addressing optimization problems that include uncertain, stochastic, and dynamic information in their mathematical formulation. Problem solving under uncertainty has a very high impact on real-world contexts, since optimization problems arising in practice are becoming
increasingly complex and dynamic, partly because the fast development of telecommunications makes not only the perception but also the changes of the world more rapid, stochastic, and difficult to forecast. Therefore, the study of optimization algorithms for SCOPs is an aspect of operations research of increasing importance.

Acronym   Full SCOP name
PTSP      Probabilistic Traveling Salesman Problem
TSPTW     Traveling Salesman Problem with Stochastic Time Windows
VRPSD     Vehicle Routing Problem with Stochastic Demands
VRPSDC    Vehicle Routing Problem with Stochastic Demands and Customers
SCP       Set Covering Problem
SSP       Shop Scheduling Problem
SDTCP     Stochastic Discrete Time-Cost Problem
SOPTC     Sequential Ordering Problem with Time Constraint

Table 1.1: Explanation of acronyms used to refer to some relevant SCOPs.
In recent years, metaheuristic algorithms such as Ant Colony Optimization (ACO),
Evolutionary Computation (EC), Simulated Annealing (SA), Tabu Search (TS), Stochastic Partitioning Methods (SPM), and others, have emerged as successful alternatives
to classical approaches based on mathematical and dynamic programming for solving
SCOPs. In fact, due to the high complexity and difficulty of optimization problems under uncertainty, classical approaches (which guarantee to find the optimal solution) are often feasible only for small instances, and they may require a large computational effort. In contrast, approaches based on metaheuristics are capable of finding good and sometimes optimal solutions to problem instances of realistic size, in a generally shorter computation time. Table 1.2 lists some papers in the literature providing evidence about the advantages of solving SCOPs via metaheuristics instead of using exact
classical methods, and Table 1.1 explains the acronyms used to refer to the SCOPs
involved in Table 1.2.

1.2 Modeling approaches to uncertainty

In defining the scope of SCOPs one should consider the many ways in which uncertainty may be formalized. Uncertainty is included in the formulation of optimization problems in order to come closer to real-world conditions, but models must also be kept simple enough to remain analytically or numerically tractable. The effort of reaching a good trade-off between usefulness of the model and tractability of the problem has produced a multitude of formalizations of uncertainty. This is even more evident for metaheuristics, because, due to their simplicity, they may be easily applied to complex formulations that would be considered intractable for many classical algorithmic approaches.

Reference(s) | SCOP | Metaheuristic(s) | Exact method | Advantage of the metaheuristic(s) over the exact method

Beraldi and Ruszczyński [16] (2005) | SCP | SPM (Beam Search) | Branch and Bound | Maximum deviation from the optimal solution is 5%, and in some cases Beam Search finds the optimal solution; the running time reduction with respect to Branch and Bound is roughly between 20% and 90%.

Bianchi et al. [32] [33] (2005) | VRPSD | ACO, SA, TS, ILS, EC | Integer L-shaped method by Gendreau et al. [86], which solves instances with up to 70 customers | ACO, SA, TS, ILS, EC address instances with up to 200 customers (distance from the optimum unknown).

Gutjahr [100] (2004) | TSPTW | ACO | Complete Enumeration, which solves instances with 10 customers in about 4 hours | ACO and SA solve instances with up to 20 customers in a few seconds (distance from the optimum unknown).

Branke and Guntsch [51] (2003), [52] (2004), Bowler et al. [50] (2003), Bianchi et al. [38] [39] (2002) | PTSP | ACO, SA | Branch and Cut by Laporte et al. [132], which solves instances with up to 50 customers | In [38, 39], ACO addresses instances with up to 200 customers; in [51, 52], ACO addresses instances with up to 400 customers; in [50], SA addresses instances with up to 120 customers (distance from the optimum unknown).

Finke et al. [77] (2002) | SSP | TS | Mixed Integer Linear Programming, which solves instances with up to 10 jobs and 5 machines | 29 out of 30 small instances solved to optimality; instances with up to 20 jobs and 10 machines have been addressed (distance from the optimum unknown).

Gutjahr et al. [104] (2000) | SDTCP | SPM (Stochastic Branch and Bound) | Complete Enumeration approaches fail to generate results within a reasonable amount of time even for medium-size problems | Stochastic Branch and Bound outperforms classic techniques both in solution quality and in runtime.

Costa and Silver [65] (1998) | SOPTC | TS | Branch and Bound, which solves instances with up to 14 customers, with a computation time from 0.1 to about 30000 seconds | TS is much faster (always 0.1 or 0.2 seconds for the small instances with up to 14 customers); instances with up to 500 customers have also been addressed (distance from the optimum unknown).

Gendreau et al. [88] (1996) | VRPSDC | TS | Integer L-shaped method by Gendreau et al. [86], which solves instances with up to 70 customers | TS is faster (from Tables I and II of [88] the time gain of TS with respect to the exact method may be computed).

Table 1.2: Evidence about some of the advantages of solving SCOPs via metaheuristics instead of using exact classical methods. Metaheuristics are capable of finding good and sometimes optimal solutions to problem instances of realistic size, in a generally shorter computation time.

When considering models of optimization problems under uncertainty, there are mainly two aspects to define: first, the way uncertain information is formalized, and second, the dynamicity of the model, that is, the time at which uncertain information is revealed with respect to the time at which decisions must be taken. The several modeling approaches differ in the way the first and/or the second aspect is defined. Here, we propose a classification of models according to these two aspects, uncertainty and dynamicity, as schematized in Figure 1.1. This thesis (Part I) will then focus only on the subset of models that corresponds to our definition of SCOPs.
Figure 1.1: Scheme for the conceptual classification of Combinatorial Optimization Problems (COPs) under uncertainty. [Diagram: dynamicity on the horizontal axis (static to dynamic) and uncertainty on the vertical axis (perfect knowledge, random variables, fuzzy values, interval values, total uncertainty), placing, from bottom to top, Deterministic COPs (DCOPs), Stochastic COPs (SCOPs), Fuzzy COPs, Robust COPs, and Pure Online Problems.] The first part of this thesis focuses on Stochastic COPs (SCOPs) and on solution methods based on metaheuristics.

Uncertain information may be formalized in several ways (vertical axis of Figure 1.1). The case of perfect knowledge about the problem data corresponds to the classical field of solving (Deterministic) Combinatorial Optimization Problems (DCOPs) (lower left corner of Figure 1.1). Here, all information is available at the decision stage, and it is used by optimization algorithms to find a possibly optimal solution. The concrete application of a solution found would lead exactly to the cost computed by the optimization algorithm; DCOPs are therefore also considered static problems, because from the point of view of the decision maker there is nothing else to be decided after the optimization has taken place.[1] A typical example of DCOP is the well-known Traveling Salesman Problem (TSP) [96], where, given a set of customers and the distance between each pair of customers, one must find the Hamiltonian tour (that is, a tour visiting each customer exactly once) of minimal length. Despite its simple formulation, the TSP is an NP-hard problem, like many DCOPs.

[1] Nevertheless, a solution algorithm may use a dynamic or stochastic mechanism also in these cases, as, for example, the dynamic programming algorithm applied to (deterministic, static) shortest path problems, or algorithms that involve some random choice, such as virtually all metaheuristics and local search procedures.
Let us now consider problem formulations involving uncertainty (upper levels of Figure 1.1). One possibility is to describe uncertain information by means of random variables of known probability distributions. This is what we assume in SCOPs (a more precise definition and examples of SCOPs will be given in Chapter 2). Under this assumption, the optimization problem is stochastic, and the objective function strongly depends on the probabilistic structure of the model. Typically, the objective function involves quantities such as an expected cost, the probability of violation of some constraints, variance measures, and so on. In SCOPs one can distinguish a time before the actual realization of the random variables, and a time after the random variables are revealed, when the associated random events have happened. Static SCOPs are characterized by the fact that decisions, or, equivalently, the identification of a possibly optimal solution, are taken before the actual realization of the random variables. This framework is applicable when a given solution may be applied with no modifications (or very small ones) once the actual realization of the random variables is known. The literature sometimes refers to this type of problem as a priori optimization. As an example of this class of problems, consider the probabilistic TSP (PTSP), which consists in finding a Hamiltonian tour visiting all customers (the a priori tour) of minimum expected cost, given that each customer has a known probability of requiring a visit. Once it is known which customers actually require a visit on a certain day, they are visited in the order of the a priori tour, simply skipping the customers not requiring a visit.
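The skipping rule and the resulting expected cost can be made concrete with a small sketch. The following Python fragment is an illustration only, with hypothetical names (realized_cost, dist, prob); it estimates the expected a priori tour cost by Monte Carlo sampling, whereas later chapters of this thesis work with exact closed-form expressions for this expectation:

```python
import random

def realized_cost(tour, present, dist):
    """Cost of the a priori tour on one day: customers not in 'present' are
    skipped, and the remaining ones are visited in the a priori order."""
    visited = [c for c in tour if c in present]
    if len(visited) < 2:
        return 0.0
    # closed tour: the last visited customer is linked back to the first
    return sum(dist[visited[i]][visited[(i + 1) % len(visited)]]
               for i in range(len(visited)))

def estimated_expected_cost(tour, prob, dist, n_samples=1000, seed=0):
    """Monte Carlo estimate of the expected a priori tour cost: each
    customer c requires a visit independently with probability prob[c]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        present = {c for c in tour if rng.random() < prob[c]}
        total += realized_cost(tour, present, dist)
    return total / n_samples
```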
Dynamic SCOPs arise when it is not possible or not convenient to design a solution that is usable as it is for any realization of the random variables. In this case, decisions that require an optimization effort must also be taken after the random events have happened. This may be done in stages, since it is often the case that the uncertain information is not revealed all at once, but in stages. As an example of a dynamic SCOP, consider for instance a TSP where new customers with known positions appear with a certain probability while the salesman has already started to visit the customers known a priori. In this case an a priori tour must be modified dynamically in order to include the new customers in the visiting tour.
Another way of formalizing uncertainty is to identify the uncertain information
with fuzzy quantities (vectors or numbers), and constraints with fuzzy sets. This approach has its roots in Bellman and Zadeh [15] and in Zimmermann [187], but currently
occupies a minor portion of the optimization literature.
An approach that has been receiving increasing attention in recent years is robust optimization, which assumes that uncertain information is known in the form of interval values. For example, one could consider the robust TSP, where the cost of the arc between a pair of customers is given by an interval value. These costs could represent travel times, being small if there is little or no traffic and high in case of traffic congestion. The robustness approach consists in finding solutions that hedge against the worst contingency that may arise, given that no knowledge about the probability distribution of the random data is available. One possible way of quantifying robustness is the minmax criterion, under which the robust decision is the one for which the highest cost across all possible future input data scenarios is as low as possible. Both static and dynamic versions of robust optimization problems may be formulated. For a good introduction to robust optimization, see for instance the book by Kouvelis and Yu [130].
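As a small illustration of the minmax criterion (a sketch with hypothetical names, not taken from [130]), the robust decision over a finite set of candidate solutions and a finite set of cost scenarios can be written as:

```python
def minmax_robust_choice(solutions, scenarios, cost):
    """Min-max robustness: among the candidate solutions, pick the one whose
    worst-case cost over all scenarios is lowest; no probability
    distribution over the scenarios is used."""
    return min(solutions,
               key=lambda x: max(cost(x, s) for s in scenarios))
```

For the robust TSP mentioned above, a scenario would fix one travel time inside each interval, and cost(x, s) would be the length of tour x under scenario s.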
On the highest level of Figure 1.1 we placed problems that we call Pure Online, where the input is modeled as a sequence of data supplied to the algorithm incrementally, without any assumption that can help to make a prediction about the new data. An algorithm for a Pure Online Problem produces the output incrementally, without knowing the complete input, and its performance is evaluated with respect to an abstract competitor that knows the complete (past and future) data and is able to solve the offline problem optimally. This way of evaluating algorithms is called competitive analysis in the literature [4, 49]. An example of a Pure Online problem is the Dynamic Traveling Repair Problem [117], where a set of servers move from point to point in a metric space; the speed of each server is constant, so the time it takes to travel from one point to another is proportional to the distance between the two points; time is continuous, and at any moment a request for service can arrive at any point in the space; each job also specifies a deadline; if a job is serviced, a server must reach the point where the request originated by its deadline; the goal is to service as many incoming requests as possible by their deadlines.
We should remark again that in this thesis we restrict our attention to SCOPs (the shaded box in Figure 1.1). SCOPs are combinatorial optimization problems, having by definition a discrete decision space. By this choice we neglect the vast field of continuous optimization under uncertainty, although the scheme we have just proposed for classifying problems under uncertainty applies equally to continuous problems.
SCOPs are relevant in many practical contexts, such as vehicle routing problems, where stochasticity is due to variable customer demands or variable travel times; routing on information networks, where stochasticity is due to the variability of traffic and the related speed of information packets; finance; scheduling; location problems; and many other contexts. All these problem domains may be, and usually are, also modeled as DCOPs. The advantage of using SCOPs over DCOPs is that the solutions produced may be more easily and better adapted to practical situations where uncertainty cannot be neglected, such as trash collection, cash collection from banks, location of emergency services, and so on. Of course, the use of SCOPs instead of DCOPs comes at a price: first, the objective function is typically much more computationally demanding in SCOPs than in DCOPs; second, for a practical application of SCOPs, probability distributions must be assessed from real data or subjectively, a task that is far from trivial. For a discussion of the issue of computational burden and complexity in certain SCOP formulations, see for instance Haneveld and van der Vlerk [109], and Dyer and Stougie [73]. The ways this issue is managed in metaheuristics applied to SCOPs will be described in detail in Chapter 3.

Chapter 2

Formal descriptions of SCOPs


The class of SCOPs is so important and has impact in so many domains that several
research areas are dedicated to its study: Stochastic Integer Programming, Markov
Decision Processes (which is part of Stochastic Dynamic Programming) and Simulation
Optimization being the main ones. Each research area corresponds to a particular
way of modeling, formulating and solving optimization problems under uncertainty,
and it is often treated separately in the optimization literature. The application of
metaheuristics to SCOPs is a quite recent and fast growing research area, and it is
thus natural that many of the papers borrow from the classical SCOP literature the
same problem formulations. In this chapter we first give, in Section 2.1, a definition of SCOPs which is very general but only tentative, since it does not specify well what quantities are to be minimized or maximized (in a minimization or maximization problem, respectively). The limits of this tentative SCOP definition justify the need to consider other formalizations, as we do in Sections 2.2 and 2.3, where we recall the main formal definitions of, respectively, static and dynamic SCOPs from the literature. We
also provide pointers to the research areas that originally proposed the various SCOP
definitions. In Section 2.4, we introduce the important issue of objective function
approximations in SCOPs, which will be the leitmotif of the rest of this thesis.

2.1 General but tentative SCOP definition

Let us now give a general definition of SCOP, as proposed by Kall and Wallace [127]. (Throughout the first part of the thesis we use the minimization form for optimization problems; the maximization form is equivalent and can be derived in a straightforward manner by substituting min with max.)

Definition 1 (SCOP - tentative)
Consider a probability space $(\Omega, \Sigma, P)$ [95], where $\Omega$ is the domain of random variables $\omega$ (typically a subset of $\mathbb{R}^k$), $\Sigma$ is a family of events, that is, subsets of $\Omega$, and $P$ is a probability distribution on $\Sigma$, with $P(\Omega) = 1$. Consider also a finite set $S$ of decision variables $x$; $S$ is typically a subset of $\mathbb{R}^n$. The random variable $\omega$ could also depend on the decision variable $x$, in which case it is denoted by $\omega_x$. Given a cost function $G$ and constraint functions $H_i$, $i = 1, 2, \ldots, m$, mapping $(x, \omega) \in (S, \Omega)$ to $\mathbb{R}$, find

$$\begin{cases} \min_{x \in S} G(x, \omega), \\ \text{subject to } H_i(x, \omega) \le 0, \quad i = 1, 2, \ldots, m. \end{cases} \tag{2.1}$$

Note, however, that according to the above definition a SCOP is not well defined, since the meaning of "min" as well as of the constraints is not clear at all [127]. In fact, how could one take a decision on $x$ before knowing the value of $\omega$, and how could one verify that $H_i(x, \omega) \le 0$ if $\omega$ is not yet known? Moreover, since $\omega$ is a random variable, $G(x, \omega)$ and $H_i(x, \omega)$ are also random variables. For these reasons, Definition 1 is not operational and must be refined. There are several possibilities for doing this, giving rise to different SCOP variants, both static and dynamic. These are also called deterministic equivalents of Definition 1. Let us first focus on static SCOPs, and later on dynamic SCOPs.

2.2 Static SCOPs

Definition 2 (Stochastic Integer Program - SIP)
Given a probability space $(\Omega, \Sigma, P)$, a finite set $S$ of feasible solutions $x$, a real-valued cost function $G(x, \omega)$ of the two variables $x \in S$ and $\omega \in \Omega$, and denoting by $E_P(G(x, \omega))$ the expected value of $G(x, \omega)$ over $\Omega$ according to $P$, find

$$\min_{x \in S} \left\{ g(x) := E_P(G(x, \omega)) \right\}. \tag{2.2}$$

The above definition is perhaps the simplest SCOP formulation, and it does not consider random constraints (observe, though, that deterministic constraints could be implicitly included in the definition of the domain $S$ of decision variables).
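When no closed-form expression for $E_P(G(x, \omega))$ is at hand, the expectation in Definition 2 can be estimated by sampling, a theme that recurs throughout this thesis. The following Python sketch uses illustrative names only; it assumes a scenario generator sample_omega() that draws $\omega$ according to $P$, and a finite set S small enough to be enumerated:

```python
def sample_average(G, x, sample_omega, n=1000):
    """Sample-average estimate of g(x) = E_P[G(x, omega)], obtained by
    averaging G over n independently drawn realizations of omega."""
    return sum(G(x, sample_omega()) for _ in range(n)) / n

def solve_sip_by_enumeration(S, G, sample_omega, n=1000):
    """Brute-force SIP 'solver': evaluate every x in the finite set S with
    the sample-average estimate and return the apparent minimizer."""
    return min(S, key=lambda x: sample_average(G, x, sample_omega, n))
```

In practice one would evaluate all candidate solutions on the same fixed set of sampled scenarios (common random numbers), so that the comparison between solutions is not blurred by independent sampling noise; the noise and time complexity of this kind of estimation are among the open issues discussed in Chapter 3.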
In some cases the cost function $G$ is deterministic, that is, $G$ only depends on $x$ and not on the random variable $\omega$, but the constraints do depend on $\omega$. In such a situation it might be impossible to enforce $H_i(x, \omega) \le 0$ for all $\omega \in \Omega$. Thus, one could relax the notion of constraint satisfaction by allowing constraint violation, and by imposing that constraints are satisfied at least with some given probabilities. This leads to the following

Definition 3 (Chance Constrained Integer Program - CCIP)
Given a probability space $(\Omega, \Sigma, P)$, a finite set $S$ of feasible solutions $x$, a real-valued cost function $G(x)$, a set of real-valued constraint functions $H_i(x, \omega)$, and a set of constraint violation probabilities $\alpha_i$, with $0 \le \alpha_i \le 1$ and $i = 1, 2, \ldots, m$, find

$$\begin{cases} \min_{x \in S} G(x), \\ \text{subject to } \operatorname{Prob}\{H_i(x, \omega) \le 0\} \ge 1 - \alpha_i, \quad i = 1, 2, \ldots, m. \end{cases} \tag{2.3}$$
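The chance constraints of Definition 3 can likewise be checked by sampling when the probabilities $\operatorname{Prob}\{H_i(x, \omega) \le 0\}$ have no closed form. A minimal Python sketch (hypothetical names, same sampling assumptions as above) counts violations over sampled scenarios:

```python
def satisfies_chance_constraints(x, constraints, alphas, sample_omega, n=1000):
    """Estimate Prob{H_i(x, omega) <= 0} for each constraint H_i and check
    that the empirical violation frequency does not exceed alpha_i."""
    violations = [0] * len(constraints)
    for _ in range(n):
        omega = sample_omega()
        for i, H in enumerate(constraints):
            if H(x, omega) > 0:
                violations[i] += 1
    return all(violations[i] / n <= alphas[i]
               for i in range(len(constraints)))
```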

Both the Stochastic and the Chance Constrained Program formulations were originally proposed in the context of Mathematical Programming applied to SCOPs, and this field is also called in the literature Stochastic Integer Programming (SIP), a subset of the broader field of Stochastic Programming [46]. The Stochastic Programming community has a very active website [168] where updated bibliographic references and papers are available. Recent surveys on SIP include [109] and [128] (the latter overviews SIP applications in the context of location routing problems). Let us now focus on some dynamic SCOP deterministic equivalents of Definition 1.

2.3 Dynamic SCOPs

Informally, a stochastic dynamic problem is a problem where decisions are taken at discrete times $t = 1, \ldots, T$, the horizon $T$ being finite or infinite. Decisions taken at time $t$ may influence the random events that happen in the environment after $t$. In dynamic SCOPs the concept of solution used in static SCOPs is no longer valid. For example, in the dynamic TSP described in Section 1.2, a tour among the set of customers known at the beginning of the day cannot be traveled as it is in practice, but must be modified as new observations (new customers) become known. What the decision maker can do before the observation-decision process starts is to decide which policy (or strategy) to adopt, that is, to specify a set of rules that say what action will be taken for each possible random future event. For example, in the dynamic TSP, a possible policy consists in re-optimizing the portion of the route among the not-yet-visited customers each time a new customer appears. Another policy, which is less computationally expensive but possibly leads to a more costly tour, is to re-optimize in stages, only after a certain number of new customers have appeared. Note that in solving dynamic SCOPs one has to make a double effort: first, decide which policy to adopt; second, given the policy, solve the optimization sub-problems that emerge dynamically. Both parts influence the final solution cost, but often the choice of the policy is due to factors outside the control of the decision maker. For instance, in the dynamic TSP one could be forced not to re-optimize every time a new customer arrives, because customers may want to know the vehicle arrival time in advance.
Among the dynamic formulations the most common ones are those belonging to the
class of Stochastic Programming with Recourse (Two-stage and Multiple-stage Integer
Stochastic Programs) and Markov Decision Processes.
Definition 4 (Two-stage Stochastic Integer Program - TSIP)
Given a probability space $(\Omega, \Sigma, P)$, a finite set $S_1$ of first-stage decisions $x_1$, a finite set $S_2$ of second-stage decisions $x_2$, and real-valued cost functions $f_1$ and $f_2$, find

$$\min_{x_1 \in S_1} \left\{ g_1(x_1) := f_1(x_1) + E_P(G(x_1, \omega)) \right\}, \tag{2.4}$$

where

$$G(x_1, \omega) := \min_{x_2 \in S_2(x_1, \omega)} f_2(x_1, x_2, \omega). \tag{2.5}$$

Given the above definition, solving a Two-stage Stochastic Integer Program consists in solving two problems: a DCOP for the second stage (equation (2.5)), and a Stochastic Integer Program (Definition 2) for the first stage (equation (2.4)). The meaning of the two-stage decision process is the following. The first-stage decision $x_1$ must be taken before knowing the actual value of the random variable $\omega$. After the value of $\omega$ is observed, it may be convenient or necessary to take some other decision (the second-stage decision $x_2$) in order to better adapt to the new situation discovered. The second-stage decision is also called recourse action, because in some practical situations it has the effect of repairing the consequences of an action ($x_1$) taken before knowing the value of the random variable. Informally, a Two-stage Stochastic Integer Program consists in finding the best decision now, under the hypothesis that I will also take the best decision when I know the value of the random quantities. A practical example of a Two-stage Integer Program is the Vehicle Routing Problem with Stochastic Demands (VRPSD), where a vehicle tour among a set of customers is decided prior to knowing the actual demand of each customer. The vehicle travels along the tour, and the driver discovers the actual demand of a customer only when arriving at that customer. When a customer demand is known and the customer has been serviced, the next best decision may be to go back to the depot for replenishment, or to proceed to the next planned customer. The choice between these options is part of the second-stage optimization problem. In this context, the tour planned a priori may be interpreted as the first-stage decision $x_1$, while the set of return trips to the depot may be interpreted as the second-stage decision $x_2$.
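For a finite scenario set, the two-stage objective of Definition 4 can be evaluated directly by enumerating, for each scenario, the best recourse action. The following Python sketch (illustrative names; it assumes small finite sets so that full enumeration is feasible, which is rarely the case in practice) mirrors equations (2.4) and (2.5):

```python
def g1(x1, f1, f2, scenarios, probs, S2):
    """Two-stage objective g1(x1) = f1(x1) + E[G(x1, omega)], where the
    recourse value G(x1, omega) minimizes f2 over the feasible second-stage
    decisions S2(x1, omega), here enumerated explicitly (equation (2.5))."""
    expected_recourse = sum(
        p * min(f2(x1, x2, omega) for x2 in S2(x1, omega))
        for omega, p in zip(scenarios, probs))
    return f1(x1) + expected_recourse

def solve_tsip(S1, f1, f2, scenarios, probs, S2):
    """Brute-force first-stage minimization of equation (2.4) over S1."""
    return min(S1, key=lambda x1: g1(x1, f1, f2, scenarios, probs, S2))
```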
The Two-stage Stochastic Integer Program may be easily extended to the general
Multi-stage case.
Definition 5 (Multi-stage Stochastic Integer Program - MSIP)
Consider $T$ decision stages $t = 1, 2, \ldots, T$, and correspondingly $T$ decision variables $x_t \in S_t$ (with $S_t$ finite sets depending on $(x_1, \ldots, x_{t-1}, \omega_1, \ldots, \omega_{t-1})$), and $T$ random variables $\omega_t$ belonging to probability spaces $(\Omega_t, \Sigma_t, P_t)$. The problem consists in finding

$$\min_{x_1 \in S_1} \left\{ g(x) := f_1(x_1) + E_{P_1}(G_1(x_1, \omega_1)) \right\}, \tag{2.6}$$

where, for $t = 1, 2, \ldots, T - 2$,

$$G_t(x_1, \ldots, x_t, \omega_1, \ldots, \omega_t) = \min_{x_{t+1} \in S_{t+1}} \left\{ f_{t+1}(x_1, \ldots, x_{t+1}, \omega_1, \ldots, \omega_{t+1}) + E_{P_{t+1}}(G_{t+1}(x_1, \ldots, x_{t+1}, \omega_1, \ldots, \omega_{t+1})) \right\}, \tag{2.7}$$

and

$$G_{T-1}(x_1, \ldots, x_{T-1}, \omega_1, \ldots, \omega_{T-1}) = \min_{x_T \in S_T} f_T(x_1, \ldots, x_T, \omega_1, \ldots, \omega_T). \tag{2.8}$$

Observe that, from the above definition, solving a Multi-stage Stochastic Integer Program consists in solving one DCOP for the last stage (equation (2.8)), and $T - 1$ Stochastic Integer Programs for the intermediate stages (equations (2.6) and (2.7)).

Let us now introduce the framework of Markov Decision Processes (MDP). The central concept in MDP is the state, which at each time step describes the knowledge about the problem (called here the system). In MDP the Markov property is assumed, that is, the future behavior of the system depends on past history only through the current state and the current decision taken. Obviously, this also depends on the way a state is defined. Often, the state corresponds to the physical description of the system, but this is not always the case. We now briefly provide a standard formulation of MDP. For a complete discussion, see for instance [151].
Definition 6 (Markov Decision Process - MDP)
Consider a 4-tuple $(X, A, P, C)$, where $X$ is a finite state space and $A$ is a finite set of possible actions or decisions. At state $x$, we denote the set of admissible actions by $A(x)$; for each $x \in X$, $A(x)$ is a finite set. $P$ is a state transition function that describes the stochastic behavior of the system over time: at time $t$, if the current state is $x \in X$ and action $a \in A(x)$ is chosen, then the next state is $y$ with probability $P(y|x, a)$. $P$ is thus a function from $X \times A(X) = \{(x, A(x)) \mid x \in X\}$ to a probability distribution over $X$. $C$ specifies the costs of actions depending on the state in which they are performed: taking action $a$ in state $x$ has a cost $C(x, a)$. $C$ is a function from $X \times A(X)$ to $\mathbb{R}$.

A policy $\pi$ is defined as a mapping from $X$ to $A(X)$ that associates to every $x$ a feasible action $a \in A(x)$, and $\Pi$ is the set of all possible policies. Informally, a policy tells which action needs to be taken in which state, and this is what the decision maker needs to know in order to take a decision at each discrete decision time, once the actual state of the system is known.
Let us now define the cost associated to a policy. Let Xt , t = 0, 1, 2, . . . , T be a
random variable that denotes the state at time t. For a given policy , and a
given initial state X0 = x0 , if the decision maker follows the policy over time, a
particular system path is given as a sequence of states and actions (X0 = x0 , a0 , X1 =
x1 , a1 , . . . , Xt = xt , at , Xt+1 = xt+1 , at+1 , . . . , aT 1 , XT = xT ), where at is the action
taken at time t and at = (xt ), with xt X. Over the system path, the system
accumulates the discounted costs defined, for T < , as
T
X

t C(xt , (xt )), with (0, 1].

(2.9)

t=0

For T = ∞, γ (which is called the discount factor) must be smaller than 1. Given a policy π, the accumulated discounted cost (Equation (2.9)) is a random quantity, due to the randomness of the system path. The expected discounted cost of a policy over all possible system paths may be computed as follows:

$$J(\pi) = \sum_{x_0, \ldots, x_T} P(x_0, x_1, \ldots, x_T) \left[ \sum_{t=0}^{T} \gamma^t \, C(x_t, \pi(x_t)) \right], \qquad (2.10)$$

with γ ∈ (0, 1], and where P is the probability of a particular path:

$$P(x_0, x_1, \ldots, x_T) = \psi(x_0) \prod_{t=0}^{T-1} P(x_{t+1} \,|\, x_t, \pi(x_t)), \qquad (2.11)$$


ψ(x_0) being the probability that state X_0 = x_0.


Given a 4-tuple (X,A,P,C) and the associated finite set of policies , an MDP
consists in finding
min J().
(2.12)

The model we have defined is known both as Markov Decision Process and as Stochastic Dynamic Programming. The first term comes from the fact that, for any fixed policy, the resulting model is a Markov chain. The second term comes from the family of solution methods based on Dynamic Programming [19], originally designed for solving optimal control problems. Dynamic Programming and MDP are also related to (and could be seen as part of) Reinforcement Learning [172] and Neuro-Dynamic Programming [22], which mainly focus on methods for approximating optimal solutions of MDP and for dealing with incomplete information about the state of the system. Stochastic Dynamic Programming is also related to Stochastic (Mathematical) Programming, and in particular to Multi-stage Integer Programs. For a clear exposition of the relation between these two domains, see for instance Kall and Wallace [127].
Finding the optimal policy can be a prohibitive task unless the state and/or action space is very small, which is usually not the case. For example, Value Iteration, a well-known exact method for solving MDP [151], has a computational complexity polynomial in |X|, |A|, and 1/(1 - γ), and one single iteration takes O(|X|²|A|) time. There are several approximation algorithms (that is, algorithms that do not guarantee to find the optimal solution) that try to find good solutions in a feasible computation time via techniques such as structural analysis, aggregation, sampling, feature extraction and so on. See, for example, [151, 172, 22]. Recently, some metaheuristics have also been applied to approximately solve MDP, and they are discussed in the remainder of the first part of this thesis.
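As a concrete illustration of the computations involved, the following Python sketch implements textbook Value Iteration for a cost-minimizing MDP with discount factor γ. The dictionary-based data layout and all function names are illustrative choices of ours, not taken from [151].

def value_iteration(X, A, P, C, gamma=0.95, eps=1e-6):
    # Textbook Value Iteration for a cost-minimizing MDP.
    # X: list of states; A[x]: admissible actions at state x;
    # P[x][a][y]: transition probability; C[x][a]: immediate cost.
    # One sweep over all states costs O(|X|^2 |A|) with dense transitions.
    V = {x: 0.0 for x in X}
    while True:
        delta = 0.0
        for x in X:  # in-place (Gauss-Seidel) sweep, which also converges
            best = min(C[x][a] + gamma * sum(p * V[y] for y, p in P[x][a].items())
                       for a in A[x])
            delta = max(delta, abs(best - V[x]))
            V[x] = best
        if delta < eps:
            break
    # extract a greedy stationary policy from the (near-)optimal values
    policy = {x: min(A[x], key=lambda a: C[x][a] +
                     gamma * sum(p * V[y] for y, p in P[x][a].items()))
              for x in X}
    return V, policy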

2.4 Objective function computation

As we have seen, all the above SCOP formulations involve the computation of one or
more expected values for evaluating the objective function. As a consequence, three
different situations may arise when computing SCOP objective functions:
1. closed-form expressions for the expected values are available, and the objective function is computed exactly based on these expressions;
2. as in case 1, closed-form expressions for the expected values are available, but the
computation of the objective function value is considered to be too time consuming to be done during optimization. Therefore, ad hoc and fast approximations
of the objective function are designed and used during optimization (possibly
alternating exact and approximated evaluations);
3. the problem is so complex in terms of decision variables and/or in terms of probabilistic dependencies, that no closed-form expression exists for the computation of the expected value of the objective function. Therefore, the objective function value is estimated by simulation with the so-called sampling approximation.

All three of the above situations have been addressed within the metaheuristics literature, as summarized in Table 2.1. Let us now give some introductory information on the use of ad hoc and sampling approximations in SCOPs.

SCOP formulation | Exact | Ad hoc approximation | Sampling approximation
SIP  | [74], [175] | [38], [39], [51], [52], [32], [33], [76] | [99], [100], [45], [181], [185], [186], [121], [84], [102], [103], [156], [79], [7], [6], [115], [5], [56], [105], [157], [138], [77], [67], [65], [101], [104], [150]
CCIP | [16] | | [9]
TSIP | [139] | | [88]
MSIP | | | [155], [136], [23]
MDP  | | | [60], [61], [134], [135]

Table 2.1: Papers on metaheuristic applications to SCOPs, classified according to SCOP formulation (rows) and type of objective function computation (columns).

2.4.1 Ad hoc approximations

The design of ad hoc approximations is strongly problem dependent, and no general rule exists for finding efficient approximations of the objective function. Examples of ad hoc approximations in the literature include: the use of the objective function of a DCOP similar in some respects to the SCOP considered; the use of truncated expressions for the expected values, obtained by neglecting terms that are estimated to be small; and the use of scenarios, instead of the true probabilistic model. Ad hoc approximations, while they accelerate the evaluation and comparison of solutions, also introduce a systematic error in the computation of objective values. Usually, the systematic error cannot be reduced unless a different, more precise ad hoc approximation is designed, and it can only be quantified by comparison with the exact objective value. Thus, metaheuristics typically alternate exact and approximated evaluations during the optimization process. More details about the way ad hoc approximations are used by metaheuristics are given in Chapter 3.

2.4.2 Sampling approximation

When a closed-form expression for the expected value(s) of the objective function is not available, one common choice is to estimate expectations by Monte Carlo-type simulations. For example, in the case of the Stochastic Integer Program (Definition 2), the objective function g(x) is typically approximated by the sample average

$$g_N(x) := \frac{1}{N} \sum_{j=1}^{N} G(x, \omega_j) \qquad (2.13)$$

where ω_1, ω_2, . . . , ω_N is a random sample of N independent, identically distributed (i.i.d.) realizations of the random vector ω. The sample average is also referred to as sample estimate, and the random realizations as random scenarios. In this thesis, we will use these terms interchangeably.
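As a minimal illustration, the sample average (2.13) amounts to the following few lines of Python; the names G (a problem-specific cost evaluator) and sample_omega (an i.i.d. scenario generator) are placeholders of ours, not part of any formulation above. The estimator is unbiased, and its standard error decreases as 1/√N.

def sample_average(x, G, sample_omega, N):
    # Monte Carlo estimate of g(x) = E[G(x, omega)], cf. equation (2.13):
    # draw N i.i.d. scenarios and average the cost of solution x over them.
    return sum(G(x, sample_omega()) for _ in range(N)) / N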
The main difference between SCOPs requiring simulation for estimating the objective function and DCOPs, or SCOPs with an exactly computable objective function, is that in the former case it is not possible to decide with certainty whether one solution is better than another. This can only be tested by statistical sampling, which yields a correct comparison result only with a certain probability. Thus, the way the sampling approximation is used in metaheuristics largely depends on the way solutions are compared and on the way the best solution among a set of solutions is selected (selection-of-the-best method).
A huge research area devoted to solving problems with a simulated objective function is Simulation Optimization. Following the definition given by Fu [81], Simulation Optimization means searching for the settings of controllable decision variables that yield the maximum or minimum expected performance of a stochastic system that is represented by a simulation model. A compact picture of the field is given by the reviews of the Winter Simulation Conference [8, 146]. The latest one, by Olafsson and Kim [146], emphasizes discrete problems and practical approaches, including some references to metaheuristics. Until a few years ago, the literature on Simulation Optimization was especially focussed on theoretical convergence results for mathematically elegant algorithms. Interestingly, as noted by Fu [80], the many new commercial software packages for simulation do not take advantage of the theoretical results of the literature. On the contrary, most of them rely on metaheuristics such as Genetic Algorithms and Neural Networks, which are more easily adaptable to complex real-world simulations, but whose integration into commercial packages often lacks rigor and is not provably convergent. Fu speculates that an interesting direction of research would be the development of algorithms that take advantage of the theoretical results of the literature, but are still flexible and applicable to real-world situations, so as to fill the gap between theory and practice. Indeed, recent developments in Simulation Optimization, especially those relying on metaheuristics, go in this direction.

Chapter 3

Metaheuristics for SCOPs


A metaheuristic is a set of concepts that can be used to define heuristic methods capable
of being applied to a wide range of different problems. A metaheuristic can be seen as a
general algorithmic framework which can be applied to different optimization problems
with relatively few modifications to make them adapted to a specific problem.
Metaheuristics that have been applied to SCOPs include: ACO, EC, SA, and TS
(the meanings of these acronyms are listed in Table 3.1). For a review and comparison among these and other metaheuristics in the context of DCOPs, see for instance
the paper by Blum and Roli [48], and the publications pointed to by [143]. Besides
these metaheuristics, there are some algorithms such as Stochastic Partitioning Methods, Progressive Hedging, and Rollout Algorithms (see Table 3.1) that could be called
metaheuristics, even if they are not commonly known as such, or that make use of
metaheuristics as part of the algorithmic procedure.
The following sections focus on the cited metaheuristics. Sections 3.1, 3.2, 3.3, 3.4, and 3.5, describing respectively ACO, EC, SA, TS and SPM, share the same format: first some background information on the principles of the metaheuristic is given, then the results and issues of applying the metaheuristic to SCOPs are reviewed. For each metaheuristic, the reviewed papers have been grouped in different parts, respectively focussing on:

• SCOPs with exactly computed objective or ad hoc approximations;
• SCOPs with sampling approximation;
• Markov Decision Processes.

MDPs have been treated separately because they require a particular modeling effort for each metaheuristic, which is quite different from the modeling of static SCOPs and of TSIPs and MSIPs. Section 3.6 briefly gives references to the SCOP literature involving metaheuristics that are still at an early stage in the SCOP domain (PH and RO). Finally, Section 3.7 takes a transversal view on the reviewed metaheuristics, and selects some open issues for discussion.


Acronym | Full name of the metaheuristic
ACO | Ant Colony Optimization
EC  | Evolutionary Computation = (EP+ES+GA) = (Evol. Programming + Evol. Strategies + Genetic Algorithms)
SA  | Simulated Annealing
TS  | Tabu Search
SPM | Stochastic Partitioning Methods = (BS+SBB+NP) = (Beam Search + Stoch. Branch and Bound + Nested Partitions)
PH  | Progressive Hedging
RO  | Rollout Algorithms

Table 3.1: Acronyms used for the metaheuristics described in this thesis.

3.1 Ant Colony Optimization

3.1.1 Introduction to ACO

The first algorithms based on the ant colony analogy appeared at the beginning of the nineties in a paper by Dorigo et al. [70], later published as [71]. ACO is now a widely studied metaheuristic for combinatorial optimization problems, as the recent book by Dorigo and Stützle [72] testifies.
The inspiring concept that links optimization with biological ants is based on the
observation of their foraging behavior: when walking on routes from the nest to a
source of food, ants seem to find not simply a random route, but a quite good one,
in terms of shortness, or, equivalently, in terms of time of travel; thus, their behavior
allows them to solve an optimization problem. This kind of success of biological ants is
entirely explained by their type of communication and by their way of deciding where
to go: While walking, ants deposit a chemical called pheromone on the ground, and
they tend to choose routes marked by strong pheromone concentrations. Given two
initially unexplored routes, a short and a long one, between the nest and the source of
food, ants choose at first randomly which one to walk. Ants that have chosen, at first
by chance, the shorter route are the first to reach the food and to start their return to
the nest. Therefore, pheromone starts to accumulate faster on the shorter route than
on the longer one. Subsequent ants tend to follow the shorter route because it has more
pheromone, thus reinforcing it more and more, and further attracting other ants on the
good route.
Combinatorial problems addressed by ACO are usually encoded by a construction graph G = (V, A), a completely connected graph whose nodes V are components of solutions, and whose arcs A are connections between components. Finding a solution means constructing a feasible walk in G. For example, in the TSP nodes correspond to customers, arcs correspond to streets connecting customers, and a feasible solution is a Hamiltonian cycle of the graph. The construction graph encoding is also used in current ACO applications to SCOPs and to dynamic optimization problems. Some examples are also described in [72].


The ACO algorithm is essentially the interplay of three procedures [68]: ConstructAntsSolutions, UpdatePheromone, and DaemonActions, as represented by Algorithm 1.
ConstructAntsSolutions is the process by which artificial ants construct walks on the construction graph incrementally and stochastically. For a given ant, the probability p_kl to go from a node k to a feasible successor node l is an increasing function of τ_kl and η_kl(u), where τ_kl is the pheromone on arc (k, l), and η_kl(u) is the heuristic value of arc (k, l), which should be a reasonable guess of how good arc (k, l) is (for example, in the TSP η_kl is the reciprocal of the distance between customer k and customer l). The heuristic value may depend on the partial walk u.
UpdatePheromone is the process by which pheromone is modified on arcs. Pheromone may be both increased and decreased. Pheromone is modified (decreased) by each ant on each arc as soon as the arc is added to a partial walk on the construction graph; this operation is called local update. Moreover, pheromone is further modified (increased) on selected good solutions, to more strongly bias the search in future iterations; this operation is called global update. Decreasing pheromone on selected arcs is important in order to avoid too rapid convergence of the algorithm to suboptimal solutions. Interestingly, pheromone decreases also in the biological environment, due to evaporation.
DaemonActions are centralized operations, such as comparing solution values among ants in order to find the best solution, or running a local search procedure.
Algorithm 1 Ant Colony Optimization (ACO)
while termination condition not met do
  ScheduleActivities
    ConstructAntsSolutions
    UpdatePheromone
    DaemonActions
  end ScheduleActivities
end while
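As an illustration of one step of ConstructAntsSolutions, the following Python sketch implements the widely used random-proportional rule; the weighting exponents alpha and beta are illustrative parameters, not prescribed by the generic ACO framework above.

import random

def next_node(k, feasible, tau, eta, alpha=1.0, beta=2.0):
    # From node k, pick a feasible successor l with probability increasing
    # in the pheromone tau[k][l] and the heuristic value eta[k][l].
    weights = [tau[k][l] ** alpha * eta[k][l] ** beta for l in feasible]
    return random.choices(feasible, weights=weights)[0]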
Several convergence proofs have been provided for the ACO metaheuristic. Convergence in value, which, informally, means that the algorithm will find the optimal solution at least once, has been proved by Gutjahr [97] and by Stützle and Dorigo [169]. Convergence in solution, which, informally, means that the algorithm will generate the optimal solution over and over, has been proved by Gutjahr [98]. For details and for a comprehensive discussion, see Dorigo and Stützle [72].

3.1.2 ACO for SCOPs

The investigation of ACO applications to SCOPs is at its early stages, the first works having appeared at conferences after the year 2000. Nevertheless, the ACO literature contains both theoretical and experimental works that cover both static and dynamic SCOPs.

3.1.2.1 Exact objective and ad hoc approximation

The first SCOPs that have been addressed by ACO are the probabilistic TSP (PTSP), in Bianchi et al. [38, 39] and Branke and Guntsch [51, 52], and the vehicle routing problem with stochastic demands (VRPSD), in Bianchi et al. [32, 33]. These problems are Stochastic Integer Programs with a closed-form expression for the objective function, that is, the objective function is computable a priori, independently of particular random realizations of the stochastic variables.
The PTSP and the VRPSD have in common the fact that their solution structure (and the corresponding construction graph) is very similar to that of their deterministic counterparts (the TSP and the capacitated VRP, respectively). The main difference with the respective deterministic counterpart is the much higher computational complexity of the objective function in the stochastic version of the problem. In the PTSP, the objective function is computable in O(n²) time, n being the number of customers, while in the TSP it only requires O(n) time. In the VRPSD, the objective requires O(nKQ) time, where n is the number of customers, K is the number of possible demand values of each customer, and Q is the vehicle capacity, while the capacitated VRP objective only requires O(n) time. The fact that the difference between the stochastic and deterministic versions of these problems mainly lies in the objective function makes them particularly appropriate for studying a first application of ACO to SCOPs. In fact, in this case it is possible to apply to the stochastic problem an ACO algorithm originally designed for the deterministic problem with almost no modifications.
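For instance, for the homogeneous PTSP, in which every customer independently requires a visit with the same probability p, the expected length of an a priori tour has a well-known closed form that can be evaluated in O(n²) time. A minimal sketch, assuming a distance matrix dist (an illustrative name):

def ptsp_expected_length(tour, dist, p):
    # Expected length of an a priori tour for the homogeneous PTSP:
    # arc (tour[i], tour[j]) is traversed iff both endpoints are present
    # and the r - 1 customers between them (along the tour) are absent.
    n = len(tour)
    expected = 0.0
    for i in range(n):
        for r in range(1, n):
            j = (i + r) % n
            expected += p * p * (1.0 - p) ** (r - 1) * dist[tour[i]][tour[j]]
    return expected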
In [38, 39], the authors experimentally investigate two versions of ACO on the PTSP: ACS and pACS. ACS, which was originally designed for the TSP by Gambardella and Dorigo [82] and by Dorigo and Gambardella [69], solves the PTSP using the objective function of the TSP (the length of a Hamiltonian cycle) as a rough but fast approximation of the PTSP objective function. The second version of ACO considered in [38, 39], pACS, is identical to ACS except for the fact that it uses the PTSP objective function (the expected length). This difference implies that pACS considers different solutions to be good ones than ACS does, and so pACS reinforces pheromone (during global updating) on different solutions, with the consequence that ants in pACS converge on different solutions. Note, however, that ACS and pACS use the same, TSP-specific, heuristic information (the reciprocal of the distance between two customers). Experimental results on PTSP instances with homogeneous customer probabilities have shown that pACS is better than ACS, except when the customer probabilities are close to 1, in which case ACS is more efficient than pACS. This means that the overhead of the time-consuming PTSP objective function is not justified in those cases where the approximate objective function, which can be computed much faster, is close enough to the exact one. The idea of employing faster approximations of the exact objective function has been further developed in [51, 52].
The authors propose an ad hoc approximation of the expected cost that neglects the least probable customer configurations. This approximation is shown experimentally to accelerate convergence without significantly worsening the solution quality. Another issue addressed by [51, 52] is the design of PTSP-specific heuristics to guide the ants' construction process. The authors experimentally analyze different heuristics, and show that one of them indeed improves the quality of the solutions constructed by ants, but at the cost of a higher computational time.
An important aspect in designing ACO for SCOPs (but also for classical DCOPs) is the application of a local search procedure to improve the solutions found by ants (the local search is part of the DaemonActions of Algorithm 1). In order to be competitive with state-of-the-art algorithms, it has been necessary for ACO algorithms to use a local search both in the PTSP [52] and in the VRPSD [32]. Unfortunately, designing an effective local search for a stochastic problem with a computationally expensive objective function may be quite a challenging task. The reason is that in local search it is very important to compute efficiently the change, or delta, of the objective function between two neighboring solutions. When the objective function is complex, as in most SCOPs, it is difficult to find a delta expression which is both exact and fast to compute. For the PTSP it has been possible to derive, for two local search operators, the 1-shift and the 2-p-opt, recursive, fast and exact expressions for the objective function delta [42, 35]. The 1-shift and 2-p-opt are very efficient, since they can explore the whole neighborhood of a solution in O(n²) time, the same time it would take for the same operators in the TSP. For the VRPSD, a local search operator is available, the OrOpt, with an efficient delta expression which is not exact but approximate, introduced by Yang et al. in [183] (we will call this the Yang approximation). In Bianchi et al. [32, 33], besides the Yang approximation, an approximation based on computing the length difference between two neighboring solutions has been considered. This last approximation is equivalent to treating a VRPSD solution (which is a Hamiltonian path) like a solution for the TSP, and it is faster but less accurate than the Yang approximation. In [32, 33], the impact of using the two above types of delta approximation has been tested on several metaheuristics, namely ACO, EC, SA, TS, and Iterated Local Search. In ACO, the use of the rough but efficient TSP approximation led to better results than the Yang approximation (even though ACO was not able to reach the quality of the best performing metaheuristics, which were Iterated Local Search and EC).
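The role of delta expressions can be illustrated on the simpler TSP objective: for a move that relocates a single customer, only the arcs that actually change contribute to the delta, which therefore costs O(1) instead of the O(n) of a full tour re-evaluation. The exact PTSP deltas of [42, 35] are recursive and considerably more involved; the sketch below covers only the TSP case, with illustrative argument names.

def tsp_delta_relocate(d, a, v, b, c, e):
    # Length change when customer v, currently between a and b, is moved
    # between the consecutive customers c and e (TSP objective, O(1) time).
    # d[i][j] is the distance matrix.
    removed = d[a][v] + d[v][b] + d[c][e]
    added = d[a][b] + d[c][v] + d[v][e]
    return added - removed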
3.1.2.2 Sampling approximation

When ACO is applied to this type of problem, the DaemonActions procedure (Algorithm 1) must implement ways of performing statistical tests for comparing the sample average values of the solutions generated by the ants, in order to select the best solution (or a set of best solutions). Sampling could also be used in ConstructAntsSolutions, in order to estimate the heuristic values η_kl(u), when the chosen heuristic depends on random variables.
The first sampling-based ACO, called S-ACO, has been proposed and analyzed by
Gutjahr in two papers [99, 100]. The first paper [99] theoretically analyzes S-ACO, by
proving convergence to the optimal solution with probability one. The second paper
[100] experimentally studies S-ACO on two stochastic routing problems, the PTSP, and
the TSP with time windows and stochastic service times (TSPTW). S-ACO has been
applied in a third paper by Rauner et al. [152] to a policy optimization problem in


healthcare management. Algorithm 2 summarizes the functioning of S-ACO, showing in particular how sampling is used; for details about the procedures ConstructAntsSolutions and UpdatePheromone, see [99, 100].

Algorithm 2 S-ACO
1: for iteration k = 1, 2, . . . do
2:   ConstructAntsSolutions [s ants construct their walks x_σ, σ = 1, 2, . . . , s on the graph G]
3:   from {x_1, . . . , x_s} select a walk x;
4:   if k = 1 then
5:     set x* = x [x* is the current approximation of the optimal solution]
6:   else
7:     based on N_k independent random scenarios ω_ν, compute a sample estimate g_k(x) = (1/N_k) Σ_{ν=1}^{N_k} G(x, ω_ν) of x;
8:     based on N_k independent random scenarios ω'_ν, compute a sample estimate g_k(x*) = (1/N_k) Σ_{ν=1}^{N_k} G(x*, ω'_ν) of x*;
9:     if g_k(x) < g_k(x*) then
10:      set x* = x;
11:    end if
12:  end if
13:  UpdatePheromone
14: end for
In every iteration, after the ants have constructed their solutions x_σ, only one ant solution x is selected for being further compared with the current best solution (step 3 of Algorithm 2). Interestingly, for the sake of convergence, it does not matter how the ant solution is selected [99]. A possible way to do it, which has been chosen in [100], is to evaluate each x_σ on a same random scenario ω drawn specifically for the given iteration, and to take x as the solution with the best value. In the case of the more complex problem treated in [152], selecting x based on a single random scenario turned out to be suboptimal; better results were obtained by choosing several (but not too many) scenarios. After x has been selected, it is evaluated again, together with the current best solution x*, in order to decide whether it is better than x*. This is done by estimating x by sampling over N_k scenarios ω_ν and x* over N_k scenarios ω'_ν. In the convergence proof of [99], it has been necessary to impose that ω_ν and ω'_ν are independent, but in practice [100], S-ACO also performs well if ω_ν = ω'_ν.
The number of sample scenarios N_k is a critical parameter of S-ACO: if it is too small, the estimates and comparisons of solutions will often be faulty, but if N_k is too big, the computational time required for one solution evaluation could become a problem. As shown in [99], for proving convergence it is sufficient that N_k increases linearly with the iteration number k. This result is interesting especially if compared with the faster-than-quadratic increase recognized as necessary for the corresponding SA approach in [102, 115]. In practical implementations of S-ACO, it may be more convenient to choose the sample size N_k adaptively, based on some statistical test. In [100], one version of S-ACO establishes the sample size by means of a parametric statistical


test: N_k is gradually increased until the difference between the sample estimates of the two solutions being compared is larger than 3 times their estimated standard deviation. This kind of sample schedule, also known as a variable-sample strategy, has been theoretically analyzed in the context of random search algorithms by Homem-de-Mello [116].
More recently, the ACO/F-Race algorithm has been proposed by Birattari et al. [45],
where at each iteration the selection of the new best solution (steps 3 to 12 of Algorithm
2) is done with a procedure called F-Race, which is more sophisticated than the simple
parametric test of S-ACO. As explained in [45], F-Race consists in a series of steps
at each of which a new scenario is sampled and is used for evaluating the solutions
that are still in the race (at the beginning, all solutions generated by ants in a given
iteration, together with the current best solution, are in the race). At each step, a
Friedman test is performed, and solutions that are statistically dominated by at least one other solution are discarded from the race. The solution that wins the race is stored as the new current best solution. Preliminary experiments on homogeneous instances of the PTSP have shown that ACO/F-Race improves over the parametric procedure adopted by S-ACO.
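The following Python sketch conveys the structure of such a race in a much-simplified form. It is not the exact procedure of [45]: in particular, the elimination rule here simply drops the worst-ranked candidate once a Friedman test signals significant differences, and sample_cost is an assumed one-scenario cost evaluator.

import numpy as np
from scipy import stats

def simple_race(candidates, sample_cost, max_scenarios=200, alpha=0.05):
    # All surviving candidates are evaluated on a common stream of random
    # scenarios; when the Friedman test detects significant differences,
    # the candidate with the worst mean rank is discarded.
    alive = list(range(len(candidates)))
    costs = []  # costs[t][j] = cost of candidates[alive[j]] on scenario t
    for _ in range(max_scenarios):
        costs.append([sample_cost(candidates[i]) for i in alive])
        if len(alive) >= 3 and len(costs) >= 3:
            _, pval = stats.friedmanchisquare(*np.array(costs).T)
            if pval < alpha:
                ranks = np.array([stats.rankdata(row) for row in costs])
                worst = int(ranks.mean(axis=0).argmax())
                alive.pop(worst)
                costs = [row[:worst] + row[worst + 1:] for row in costs]
        if len(alive) == 1:
            break
    means = np.array(costs).mean(axis=0)
    return candidates[alive[int(means.argmin())]]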
3.1.2.3 Markov Decision Processes

To our knowledge, there are only two papers that use ACO to solve MDPs: the first one by Chang et al. [60] and the second one by Chang [59]. Both papers design ACO algorithms to solve MDPs, and theoretically analyze their properties by providing convergence proofs.
Chang et al. [60] focus on MDPs with infinite horizon (that is, T = ∞). This simplifies the problem a little, because in this case the optimal policy is stationary, that is, it does not depend on time. The construction graph on which the ACO algorithms proposed in [60] are based is represented in Figure 3.1.

[Figure 3.1: Construction graph of the ACO algorithm proposed in [60] for solving Markov Decision Processes with stationary policies.]

The states in X are arranged in an arbitrary order O; each state x ∈ X corresponds to a vertex of the construction graph, and each arc of the construction graph corresponds to a pair (x, a) of a state x ∈ X and an admissible action a ∈ A(x). The direction of the arc goes from the

current state x to the next state with respect to the order O. A particular ant traverses
all the states in the construction graph following the order O from the first state of
O. When it moves from a state x to a state y, it traverses randomly the arc (x, a)
with a transition probability that depends on the pheromone on that arc, and on some
heuristic appropriately defined (see [60] for details). Once an ant finishes the tour,
it has generated a stationary policy, which, in the context of MDP, is a candidate
solution to the problem. In [60], two ACO versions are proposed, ANT-PI and ANT-TD. The first one is inspired by a well-known exact method for MDPs, policy iteration (PI) [151], and assumes that the state transition probabilities P of the MDP system are known. The second one considers the case of unknown P, and uses one of the well-known reinforcement learning algorithms, temporal difference learning (TD) [172], for evaluating the average value of a given policy. It is proven that both ANT-PI and ANT-TD probabilistically converge to the optimal solution. For practical implementations, due to the high computational complexity inherent to the problem, the authors suggest using parallel implementations of the proposed ACO algorithms.
In Chang [59], the focus is on MDP with completely unknown system dynamics,
that is, unknown state transition probability P and unknown costs C. A finite horizon
T is considered and, for simplicity, it is assumed that every action is admissible at
every state. An ACO algorithm called ANT-EE is proposed, which is based on a TD
algorithm for evaluating a policy, similarly to the ANT-TD algorithm of [60]. Theorems
are provided that show that ANT-EE has the same convergence properties as Q-learning
[177], one of the best known basic techniques in reinforcement learning.
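As background for the TD-based variants mentioned above, the following generic TD(0) sketch evaluates a fixed policy from simulated transitions, with discounted costs as in Equation (2.9). The environment interface env_step is an assumption of ours for illustration, not the interface of [60] or [59].

import random

def td0_policy_evaluation(states, policy, env_step, episodes=1000,
                          horizon=100, gamma=0.95, alpha=0.1):
    # TD(0) evaluation of a fixed policy on simulated system paths.
    # env_step(x, a) samples (cost, next_state) from the unknown dynamics.
    V = {x: 0.0 for x in states}
    for _ in range(episodes):
        x = random.choice(states)
        for _ in range(horizon):
            a = policy[x]
            cost, y = env_step(x, a)
            # move V(x) toward the one-step bootstrap target cost + gamma*V(y)
            V[x] += alpha * (cost + gamma * V[y] - V[x])
            x = y
    return V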

3.2 Evolutionary Computation

3.2.1 Introduction to EC

EC [31] is a collective term for all variants of optimization algorithms that are inspired by Darwinian evolution. In this context, a solution to a given optimization problem is called an individual, and a set of solutions is called a population. The basic structure of an EC algorithm is given by Algorithm 3. Every iteration of the algorithm corresponds to a generation, where certain operators are applied to some individuals of the current population to generate the individuals of the population of the next generation. Typical operators are recombination, which recombines two or more individuals to produce new individuals, and mutation, which modifies single individuals to obtain self-adaptation. At each generation, only some individuals are selected for being elaborated by the recombination and mutation operators, or for being simply copied into the next generation without any change, on the basis of their fitness measure (this can be the objective function value, or some other kind of quality measure). Individuals with higher fitness have a higher probability of being selected.
In the literature there are mainly three different categories of EC that have been developed independently from each other: Evolutionary Programming (EP), proposed by
Fogel et al. in 1966 [78], Evolutionary Strategies (ES) proposed by Rechenberg in 1973
[153], and Genetic Algorithms proposed by Holland in 1975 [114]. Presently, algorithms


that fall in the EP and ES categories mostly apply to continuous optimization problems, while GAs are more specific to discrete and combinatorial optimization problems. Recent overviews about EC include Hertz and Kobler [112], and Calegari et al. [57]. For the convergence properties of EC and GA, see for instance Rudolph [161], Vose [180], and Reeves and Rowe [154].
Algorithm 3 Evolutionary Computation (EC)
P = GenerateInitialPopulation()
while termination condition not met do
  P′ = Recombine(P)
  P″ = Mutate(P′)
  Evaluate(P″)
  P = Select(P″ ∪ P)
end while
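A minimal instance of Algorithm 3 on bitstrings is sketched below, with one-point crossover, bit-flip mutation, and truncation selection; all operator and parameter choices are illustrative, not prescribed by the framework. Called with fitness=sum, it maximizes the onemax bit-counting function discussed later in this chapter.

import random

def simple_ec(fitness, n_bits=30, pop_size=40, generations=100, p_mut=0.02):
    # Minimal generational EC loop for maximizing `fitness` over bitstrings.
    pop = [[random.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        # Recombine: one-point crossover of randomly paired parents
        offspring = []
        for _ in range(pop_size):
            p1, p2 = random.sample(pop, 2)
            cut = random.randrange(1, n_bits)
            offspring.append(p1[:cut] + p2[cut:])
        # Mutate: independent bit flips
        for ind in offspring:
            for i in range(n_bits):
                if random.random() < p_mut:
                    ind[i] = 1 - ind[i]
        # Evaluate and Select: keep the best of parents plus offspring
        pop = sorted(pop + offspring, key=fitness, reverse=True)[:pop_size]
    return pop[0]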

3.2.2 EC for SCOPs

There is a very large amount of literature on applying EC to optimization problems under uncertainty, such as problems with noisy fitness, with time-varying and dynamic fitness, and with approximated fitness. For a recent survey on how the EC literature addresses different types of uncertainty, see Jin and Branke [123]. Other reviews include a book by Arnold [10] and a paper by Beyer [30]. Although SCOPs are a particular case of optimization under uncertainty, the translation of the results from the EC literature on optimization under uncertainty to the SCOP domain is not easy. The main difficulty is that most papers focus on continuous optimization, and they often restrict their attention to optimization problems characterized by ad hoc test functions, such as the spherical objective function f(x) = xᵀx, x ∈ ℝᴺ. Also when discrete optimization problems are considered, experiments are often restricted to the onemax bit-counting function, which can be regarded as the counterpart of the spherical objective function in binary search spaces.
In the following, we outline the contributions of EC in the SCOP domain, and we
also highlight the main ideas and methods proposed for problems under uncertainty
that may be relevant for SCOPs, even if they have not been directly tested on specific
problems from this domain.
3.2.2.1 Exact objective and ad hoc approximation

The EC literature addressing this kind of problem may be roughly divided into two groups. In the first group ([74, 139]), EC algorithms use the exactly computable objective function as it is, even if computationally expensive, while in the second group ([32, 33], and references cited in [122]), EC also exploits computationally more efficient objective function (fitness) approximations. Let us briefly analyze these two groups of papers.


In [74] Easton and Mansour apply a distributed GA to three different labor scheduling problems, one of which is formulated as a stochastic goal programming problem.
Their algorithm operates in parallel on a network of three workstations. Separate
sub-populations evolve independently on each processor, but occasionally the fittest
solutions migrate over the network to join the other sub-populations. Also infeasible
solutions are accepted (with a fitness penalty) in order to encourage the exploration of
promising regions of the search space. The proposed GA is compared experimentally
to a SA and a TS metaheuristic previously developed by other authors (respectively in
[54, 55] and in [75]), and it is shown to outperform both of them.
In [139] Guo and Mak consider a vehicle routing problem with stochastic demand
and soft time windows, which is formulated as a Two-stage Stochastic Integer Program
(Definition 4). The authors propose an EC algorithm called Age-GA where, instead
of being replaced by their offspring after each iteration, individuals may grow up and
generate new offspring continuously before death, and the population comprises individuals from various age-groups. With the same amount of computational effort, it is
possible to use a larger population size in Age-GA than in a canonical GA. The paper
shows that, on a set of eighteen randomly generated instances, Age-GA outperforms a
canonical GA without the aging mechanism.
To the group of papers using efficient approximations of the fitness function in SCOPs belong [32, 33] by Bianchi et al. (also cited in Section 3.1), which compare a simple EC with other metaheuristics (ACO, SA, TS, and Iterated Local Search) on the VRPSD. Similarly to the other metaheuristics, EC is integrated with the OrOpt local search operator, where two approximations for the objective value difference between neighboring solutions have been tested, the Yang and the TSP approximation. The exact VRPSD objective function is used for accepting a new solution in the local search, and for the selection of a new population. EC, like ACO and Iterated Local Search, performs better with the TSP approximation. Interestingly, EC improves even more when, in [33], a more TSP-specific local search (3-opt) is used instead of OrOpt. EC, together with Iterated Local Search, is shown to be the best performing among the tested metaheuristics.
Here, it is useful to note that there is a thread in the EC literature that focuses on the use of computationally efficient approximations of the original fitness in continuous optimization problems. Some aspects of this issue that are developed in the context of continuous optimization may be relevant to SCOPs as well. Fitness approximations are also known as approximate models, meta-models or surrogates. A comprehensive survey on fitness approximation in EC has been written by Jin [122]. This growing research area is particularly oriented to continuous optimization problems with extremely time-consuming objective function computations, such as, for instance, structural design optimization [12], where one single fitness evaluation may take over ten hours on a high-performance computer. The issue of how the approximate model can be incorporated in the EC algorithm, which has been widely addressed by the EC literature on fitness approximation, is quite independent of the continuous or discrete nature of the optimization problem. Nevertheless, most of the ideas still haven't been applied to SCOPs. For a review and pointers to existing literature, see Section 4 of [122].

3.2.2.2 Sampling approximation

The EC literature about optimization with noisy fitness function is also relevant for
SCOPs with sampling estimated objective function. In fact, noise is mostly assumed
to be additive with zero-mean, which is the case when Monte Carlo sampling is used to
estimate the objective function. Section II of Jin and Branke [123] is a good overview
about the methodological approaches used in EC to deal with noisy fitness functions.
The authors identify three main strategies for dealing with noise: explicit averaging,
implicit averaging, and modifying the selection process.
Explicit averaging corresponds to the computation of sample averages of the fitness function, by performing repeated measures of the noisy fitness and computing their average. This is very similar to the Simulation Optimization technique we have illustrated in Section 2.4.2. Aizawa and Wah [3] propose either to increase the number of samples with the generation counter, or to increase the number of samples for individuals that have a high estimated variance. Stagge [167] proposes to adjust the number of samples according to the probability of an individual being among the best ones. Branke and Schmidt [53] also propose an adaptive sampling method that takes additional samples of two individuals participating in a tournament until the normalized fitness difference between the two individuals exceeds some threshold. The normalized fitness difference is obtained by dividing the difference of the observed fitnesses by the standard deviation of the difference: (g_N(x) - g_N(y))/σ_d. Cantú-Paz [58] uses an adaptive sampling method that consists in taking the smallest number of samples necessary to make a decision between competing individuals during the selection process. This approach is very similar to Branke and Schmidt's, but differs in that samples are taken one at a time from the individual with the highest observed variance, and standard statistical tests are used to select the winner of the tournament with a certain probability. Similarly to the approach of Cantú-Paz, Teller and Andre [174] propose a method that allocates varying numbers of samples to evaluate individuals. Individuals are initially evaluated with a small number of samples, and are further evaluated only if there is some chance that the outcome of the tournaments they participate in can change. A similar technique has been developed by Giacobini et al. [89].
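In the same spirit as the adaptive schemes just described, the following sketch keeps sampling two competing individuals until the normalized difference of their observed mean fitnesses exceeds a threshold; it is a simplified illustration, not the exact procedure of [53], and sample_x/sample_y are assumed one-evaluation interfaces.

import statistics

def compare_noisy(sample_x, sample_y, n0=5, n_max=200, threshold=2.0):
    # Sample both individuals until the normalized fitness difference
    # exceeds `threshold` (or the budget n_max runs out).
    xs = [sample_x() for _ in range(n0)]
    ys = [sample_y() for _ in range(n0)]
    while len(xs) < n_max:
        diff = statistics.mean(xs) - statistics.mean(ys)
        # estimated standard deviation of the difference of the sample means
        sd = (statistics.variance(xs) / len(xs) +
              statistics.variance(ys) / len(ys)) ** 0.5
        if abs(diff) >= threshold * max(sd, 1e-12):
            return 'x' if diff > 0 else 'y'  # larger mean fitness wins
        xs.append(sample_x())
        ys.append(sample_y())
    return None  # budget exhausted: statistically indistinguishable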
The second type of strategy for dealing with noise in EC is implicit averaging. Its aim is to reduce the influence of noise by using a large population size, instead of performing more fitness measures of a single individual. The intuition behind implicit averaging is the following ([123], p. 305): because promising areas of the search space are sampled repeatedly by the EC algorithm, and there are usually many similar solutions in the population, when the population is large the influence of noise in evaluating an individual is very likely to be compensated by that of a similar individual. This can be regarded as an implicit averaging effect. In the literature, conflicting conclusions have been reported on whether implicit or explicit averaging is more effective, given that a fixed number of fitness evaluations per generation is allowed. For a summary of the history of implicit averaging we direct the reader to [123] and to the references cited therein.
The third strategy used in EC to reduce the influence of noise is modifying the selection process of individuals. One example is to accept an offspring individual only if its fitness is better than that of its parents by at least a predefined threshold, as done by
Markon et al. [140]. Beielstein and Markon [14] study the relationship between threshold selection and hypothesis testing techniques. Another way of modifying the selection
process with respect to standard EC is to eliminate random choices during selection,
in order to exploit the uncertainty due to the noisy fitness as a sort of randomization
effect. This method has been proposed by Branke and Schmidt [53].
Convergence properties of EC and the dynamics of the fitness function when noise
is present have been analyzed in several papers, for example by Miller and Goldberg
[142] and by Beyer [30].
In all the above cited papers aiming at reducing the influence of noise via implicit
and explicit averaging, or via modifications of the selection process, the computational
experience is unfortunately limited to ad hoc continuous or discrete test functions. It
appears that an experimental validation of the various techniques in the SCOP domain
is still missing in the literature. In fact, the few papers applying EC to SCOPs that we
are going to outline below, either use very simple techniques for dealing with noise or
rely on methods that are unrelated to the main EC literature on noisy fitness functions.
Watson et al. [181] address a stochastic warehouse scheduling problem where the
objective function must be estimated by simulation. The authors consider a GA, a solution construction heuristic specific to that problem, two local search algorithms, and a random search algorithm. Two versions of the GA and of the local search algorithms are considered, where the (set of) starting solution(s) is randomly generated in one case, and provided by the constructive heuristic in the other. In order to keep the run time of the algorithms feasible, the simulator is used in a fast but inaccurate mode. Only final solutions are eventually evaluated with a more accurate, two orders of magnitude slower, simulator mode. The constructive heuristic exploits specific knowledge about
the internal states of the simulator in order to construct a solution. Instead, in the
GA and local search algorithms the simulator is used as a black box, that, provided a
solution, returns a real value indicating the solution quality. Experimental results show
that GA initialized with the domain-specific construction heuristic outperforms all the
other algorithms. Moreover, all algorithms perform worse when initialized by random
solutions. The results also highlight an interesting phenomenon related to the use of a
black box, fast but inaccurate simulator for the evaluation of solutions during the execution of the GA. As better and better solutions according to this simulator are found,
it is observed that the correlation with solution values given by the slow-accurate simulator (evaluated a posteriori) decreases. This implies that the final solution returned
by the GA as best solution may be quite bad with respect to the nearly-exact objective
value. It is reasonable to think that this is a general phenomenon that can happen in
any metaheuristic exploiting a non-exact or noisy objective function evaluation, particularly when estimating the objective function by sampling and with a fixed (low)
number of samples. One possibility to overcome this problem is to keep in memory a
set of promising solutions encountered during the execution of the algorithm, and to
evaluate them a posteriori with the accurate simulator, or to apply more sophisticated
adaptive sampling techniques. Yoshitomi [185] and Yoshitomi and Yamaguchi [186] use


GA for solving the stochastic job-shop scheduling problem. In both papers, the best solution is extracted from the set of solutions that have been most frequently present through the generations of the GA. In [186], Monte Carlo sampling is used to select the best final solution from the set of most frequent solutions. Other applications of
GA based on Monte Carlo sampling are the ones by Sudhir Ryan Daniel and Rajendran [171] applying GA to the inventory optimization problem in a serial supply chain,
and by Jellouli and Chatelet [121] using GA for addressing a supply-chain management
problem in a stochastic environment.

3.2.2.3 Markov Decision Processes

Chang et al. [61] propose an EC algorithm called Evolutionary Policy Iteration (EPI) for solving infinite horizon discounted reward MDPs. EPI is particularly suited for problems where the state space is small but the action space is very large. This situation makes the use of the well-known policy iteration (PI) algorithm [151] for solving MDPs impractical, since PI must perform a maximization over the entire action space. EPI eliminates the need of maximizing over the entire action space by directly manipulating policies via a method called policy switching, which generates an improved policy from a set of given policies. The computation time for running policy switching is on the order of the state space size. The algorithmic structure of EPI is that of a standard GA, with appropriate modifications and extensions required by the MDP setting. EPI iteratively generates a population (a set of policies) such that the performance of the best policy in a population monotonically improves with respect to a defined fitness function. In [61], it is proved that EPI converges with probability one to a population whose best policy is an optimal policy.
Lin et al. [134, 135] focus on finite horizon partially observed Markov decision processes (POMDPs), an extension of MDPs that covers situations such as: state observations are costly; sensors that measure the current state value are noise-corrupted; at least part of the current state value is inaccessible. In [134, 135], a GA is developed for finding a good approximation of the value function. In [135], the GA is combined with a mixed integer programming algorithm, and this combination is capable of determining the value function and of finding the optimal policy in less computation time than other exact methods.
Yokoama and Lewis III [184] address a stochastic dynamic production cycling problem whose original formulation is that of an MDP. In order to reduce the search space dimensionality, the problem is first reformulated as a two-level problem. The first level consists in a DCOP whose solution requires solving a series of MDP sub-problems (constituting the second level). The first level DCOP is addressed by a GA, which calls dynamic programming algorithms as sub-routines to solve the second level MDPs.


3.3 Simulated Annealing

3.3.1 Introduction to SA

The SA algorithm was introduced in the area of combinatorial optimization by Kirkpatrick et al. [129]. It relies on a model developed by Metropolis et al. [141] for simulating the physical annealing process, where particles of a solid arrange themselves into a thermal equilibrium. An introduction to SA can be found in van Laarhoven and Aarts [179] or Aarts and Korst [1].
The standard type of application concerns combinatorial optimization problems of the form

$$\min_{x \in S} g(x),$$
where S is a finite set of feasible solutions. The algorithm uses a pre-defined neighborhood structure on S. A control parameter which is called temperature in analogy
to the physical annealing process governs the search behavior. In each iteration, a
neighbor solution y to the current solution x is computed. If y has a better objective
function value than x, the solution y is accepted, that is, the current solution x is
replaced by y. If, on the other hand, y has a worse objective function value than x, the
solution y is only accepted with a certain probability depending on (i) the difference of
the objective function values in x and y, and (ii) the temperature parameter.
In pseudocode, the SA algorithm can be represented as follows (cf. [1], p. 16):
Algorithm 4 Simulated Annealing (SA)
Initialize state x and temperature parameter T_1;
for iteration k = 1, 2, . . . do
  select y randomly from S(x);
  if g(y) ≤ g(x) then
    set x = y;
  else if exp((g(x) - g(y))/T_k) ≥ uniform[0,1] then
    set x = y;
  end if
  update T_k to T_{k+1};
end for
Therein,
• x and y are feasible solutions from S;
• T_1, T_2, . . . is a (usually decreasing) sequence of values for the temperature parameter; the update of the values T_k is done according to a so-called cooling schedule;
• the sets S(x) form the pre-defined neighborhood structure: to each feasible solution x ∈ S, a set S(x) ⊆ S \ {x} of neighbor solutions is assigned;
• uniform[a, b] is a procedure selecting a uniformly distributed (pseudo-)random number from the interval [a, b].


Several results showing convergence of SA to the set of optimal solutions under suitable cooling schedules have been obtained by diverse authors, for example Geman and Geman [85], Gelfand and Mitter [83], or Hajek [107]. Essentially, convergence can be assured by a cooling schedule where T_k is decreased as c/log k, with a sufficiently large constant c. In practice, cooling is usually done faster for computation time reasons. (For more details, see [1].)
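A minimal Python rendering of Algorithm 4 with the logarithmic cooling schedule T_k = c/log(k + 1) reads as follows; the constant c and the bookkeeping of the best visited solution are illustrative choices of ours.

import math
import random

def simulated_annealing(x0, g, neighbors, iterations=100000, c=1.0):
    # Plain SA: accept improving neighbors always, worsening neighbors
    # with probability exp((g(x) - g(y)) / T), under logarithmic cooling.
    x = x0
    best = x0
    for k in range(1, iterations + 1):
        T = c / math.log(k + 1)
        y = random.choice(neighbors(x))
        if g(y) <= g(x) or random.random() <= math.exp((g(x) - g(y)) / T):
            x = y
        if g(x) < g(best):  # keep track of the best solution seen so far
            best = x
    return best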

3.3.2 SA for SCOPs

In the literature, several extensions of the SA algorithm above have been suggested for
treating Stochastic Integer Programs (Definition 2), both in the case of exact objective
and ad hoc approximation, and sampling approximation.
3.3.2.1 Exact objective and ad hoc approximation

One early application of SA in the context of SCOPs is due to Teodorovic and Pavkovic [175]. The authors address a VRPSD with multiple vehicles, and use SA in two stages: first for partitioning the customers among the different vehicles, and second for improving the single vehicle routes. In this preliminary work, computational results are reported only for one instance with 50 customers.
More recently, the already cited papers [32, 33] by Bianchi et al. (see Sections 3.1 and 3.2) have applied to the VRPSD a simple SA algorithm, together with other metaheuristics (ACO, EC, TS, and Iterated Local Search). Similarly to the other metaheuristics, two approximations for the objective value difference between neighboring solutions generated according to the OrOpt scheme have been tested, the Yang and the TSP approximation. Differently from what happens for ACO, SA performs better when using the more accurate but more computationally expensive Yang approximation. On average, SA does not perform significantly differently from ACO, and it is not able to reach the quality of the best performing metaheuristics, which are EC and Iterated Local Search.
3.3.2.2 Sampling approximation

Algorithm 5 shows a typical basic structure of an SA modification for the solution of Stochastic Integer Programs (Definition 2) with sampling-estimated objective function. The approaches from the literature outlined below follow this general scheme. Differences lie particularly in the way step 5 (estimation of the objective value), step 11 (choice of a new approximation of the optimal solution), and step 12 (temperature level) are implemented in Algorithm 5.
Algorithm 5 Stochastic Simulated Annealing (SSA)
1: Initialize state x, temperature parameter T_1 and sample size N_1;
2: Set x* = x [x* is the current approximation of the optimal solution];
3: for iteration k = 1, 2, . . . do
4:   select y randomly from S(x);
5:   compute sample average estimates g_k(x) and g_k(y) for the costs in x resp. y;
6:   if g_k(y) ≤ g_k(x) then
7:     set x = y;
8:   else if exp((g_k(x) - g_k(y))/T_k) ≥ uniform[0,1] then
9:     set x = y;
10:  end if
11:  compute a new current approximation x* of the optimal solution;
12:  update T_k to T_{k+1};
13:  update N_k to N_{k+1};
14: end for

Gelfand and Mitter [84] investigate the case where the observation of the objective function g(x) is disturbed by random noise W_k in iteration k of the SA process, such that instead of g(x), the estimate g_k(x) = g(x) + W_k is observed. They show that if W_k is normally distributed with mean zero and variance σ_k², if certain conditions on the values σ_k and on the acceptance probabilities are satisfied, and if a suitable cooling
schedule (ensuring convergence of ordinary SA) is used, then the convergence property
of ordinary SA remains valid.
Gutjahr and Pflug [102] follow a similar approach by showing that under suitable conditions on the peakedness (Birnbaum [47]) of the noise distribution, convergence of the current solutions to the set of optimal solutions can be guaranteed. To be more specific, let us call a symmetric distribution ν_1 more peaked around zero than a symmetric distribution ν_2 if, for all t > 0, the probability mass on the interval [-t, t] is larger or equal under ν_1 than under ν_2. Then, if the distribution of the noise W_k is more peaked around zero than a normal distribution N(0, σ_k²), where σ_k = O(k^(-λ)) with a constant λ > 1, the distribution of the solution in iteration k converges as k → ∞ to the uniform distribution on the set of global optimizers, provided that a suitable cooling schedule (ensuring convergence of ordinary SA) is used. Decreasing σ_k at the required rate can be achieved by increasing the sample size N_k more than quadratically in k, that is, by imposing that N_k = O(k^γ) with γ > 2. An application of the technique of [102] to a discrete time/cost tradeoff problem in activity planning has been reported in [103].
Other approaches have been presented by Roenko [156], who proposes to store
the feasible solutions produced during the execution of the algorithm and to compare
them with the solution generated in each current iteration, and by Fox and Heine [79],
who derive a convergence result based on the assumption that with probability one,
the objective function estimates gk (x) coincide after some finite time with the true
objective function values g(x), as it can be achieved by consistent estimators in the
case of only finitely many possible objective function values. The last assumption can
also be relaxed, if some more complicated condition can be verified, but Fox and Heine
argue that in each computer representation, objective function values are taken from
some finite domain (given by the machine number precision) anyway. The algorithm
indicated by Fox and Heine does not use independent sampling from scratch in each


iteration, as it is done in [84] and [102], but cumulates the sampling results, which is
of course advantageous from a computation time viewpoint.
Alrefaei and Andradottir [7] pursue a different idea by keeping the temperature parameter T_k constant during the process, instead of decreasing it toward zero (as usual in ordinary SA). To obtain convergence, two alternative techniques are suggested. The first, let us call it A1, consists in the following procedure: in each iteration k, for the current solution x chosen in this iteration, a counter V_k(x) is increased by one in order to register how often x has been visited since the start of the algorithm. The current number V_k(x) of visits is divided by the number D(x) of neighbors of x. The estimated optimal solution x* in iteration k is then defined as the solution x for which V_k(x)/D(x) is maximal, among all the solutions that have been encountered so far. The second technique, let us call it A2, estimates the objective function values of solutions x and y (step 5 of Algorithm 5) by cumulating previous estimates of x and y (if any), and then chooses as the new approximation x* of the optimal solution at iteration k the solution with the smallest estimated objective value, among all solutions evaluated so far. Both A1 and A2 compute sample averages with an increasing number of samples at each iteration k.
Alrefaei and Andradottir show that both alternatives guarantee, under mild conditions, convergence with probability 1 to the set of optimal solutions. Their article also reports on experimental comparisons showing a superiority of the introduced new algorithms over the previous approaches in [84], [102] and [79]; among the two new algorithms, A2 turns out to yield better results than A1. The experiments are restricted to a test instance with only 50 feasible solutions, so it is not clear whether the results can be generalized to larger search spaces; nevertheless, the empirical findings give some evidence that using the solution with the best objective function estimate so far as the proposed solution may be a very good choice. Interestingly, for the considered test instance, a random-search-like neighborhood structure including all elements of S (different from x) in the neighborhood S(x) of x produces, for all tested algorithms, better results than a more restricted neighborhood. This seems to indicate that in the stochastic case, the hill-climbing feature of SA gains importance only for larger solution spaces S.
A further important contribution of [7] is that the article discusses optimization
both in a transient and in a steady-state simulation context. It is shown that if g(x)
is given as the expectation of a functional $G(x, \omega)$ of a stochastic process in either a transient or a steady-state situation, then the theoretical result derived for the simple
static SCOP case (corresponding to our Definition 2) still remains valid.
One practical limitation of approaches such as the two just described by Alrefaei and Andradóttir [7] and the one by Roenko [156] is that they require the storage of information about all or most of the solutions encountered by the algorithm, and this is an infeasible task for problems of a combinatorial nature.
Alkhamis et al. [6] again use a decreasing cooling schedule for the temperature parameters $T_k$. They propose to decide on the acceptance or rejection of a neighbor solution $y$ by means of a statistical significance test: a confidence interval for the difference between the true objective function values at $x$ and $y$, respectively, is computed; depending on the position of the value zero relative to this confidence interval, the neighbor solution is judged as equal, better or worse than the current solution $x$. After that, the usual acceptance rule of SA is applied. The authors are able to show that under certain conditions on sample size and cooling schedule, the classical SA convergence property is still satisfied.
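The following sketch (not the authors' code; a fixed normal quantile z stands in for the appropriate t-quantile) illustrates the idea of judging a neighbor by the position of zero relative to a confidence interval:

    import math

    def compare_by_confidence_interval(samples_x, samples_y, z=1.96):
        """Classify neighbor y against current solution x via an approximate
        confidence interval for g(y) - g(x) (minimization assumed)."""
        nx, ny = len(samples_x), len(samples_y)
        mx, my = sum(samples_x) / nx, sum(samples_y) / ny
        vx = sum((s - mx) ** 2 for s in samples_x) / (nx - 1)
        vy = sum((s - my) ** 2 for s in samples_y) / (ny - 1)
        half_width = z * math.sqrt(vx / nx + vy / ny)
        diff = my - mx                      # estimate of g(y) - g(x)
        if diff + half_width < 0:
            return 'better'                 # whole interval below zero
        if diff - half_width > 0:
            return 'worse'                  # whole interval above zero
        return 'equal'                      # zero lies inside the interval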
Homem-de-Mello [115], [116] presents a comprehensive framework for describing and
analyzing variants of SA for SCOPs. The framework enables a thorough theoretical
analysis and opens a broader range of flexibility in the choice of sampling distributions.
Using ergodicity theory, Homem-de-Mello proves in [115] a rather general convergence
theorem for a variable-sample modification of SA. The theorem includes the result in
[102] as a special case, but does not make use of any normality assumptions related
to noise distributions anymore. In [116], this approach is further generalized beyond
the area of SA, although the described analytical techniques and algorithmic ideas
remain applicable in a SA context, as well as in the context of other metaheuristics
dealing with SCOPs with objective function estimated by sampling. In particular, the
author presents the interesting idea of adaptively modifying the sample size Nk during
the iterations of the algorithm, in such a way that Nk is usually only increased if the
result of a t-test indicates that higher accuracy of the objective function estimates is
required. To preserve the convergence property, the sample size is increased at some
specific points in time regardless of the t-test.
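A crude sketch of such an adaptive rule (hypothetical names; a fixed threshold replaces the exact t-distribution quantile, which a real implementation would obtain from a statistics library) is:

    import statistics

    def need_more_samples(samples_x, samples_y, quantile=2.0):
        """Return True if a t-like statistic cannot separate the two
        solutions, i.e., if higher accuracy of the estimates is required."""
        diff = statistics.mean(samples_x) - statistics.mean(samples_y)
        se = (statistics.variance(samples_x) / len(samples_x)
              + statistics.variance(samples_y) / len(samples_y)) ** 0.5
        return abs(diff) < quantile * se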
In Alkhamis and Ahmed [5], the acceptance rule based on confidence intervals
developed in [6] is modified by applying the constant-temperature schedule of [7] instead
of the classical decreasing temperature schedule. As the current estimated solution,
the authors take the solution with the maximum (normalized) number of visits so far.
Again, a convergence result is given.
There are also some purely experimental papers involving SA and SCOPs with sampling estimated objective function. The earliest is a paper by Bulgak and Sanders [56]
addressing a buffer allocation problem in the context of a complex manufacturing system. The objective function to be maximized (the efficiency of the system) is estimated
by means of a discrete event simulator. Similarly to [116], an adaptive sampling procedure is used, where the number of samples is gradually increased for testing whether
a candidate solution is statistically better than the current best solution.
Haddock and Mittenthal [105] investigate the feasibility of using an SA algorithm in conjunction with a simulation model to find the optimal parameter levels at which to operate a system. The authors modify the SA algorithm of Kirkpatrick et al. [129] by substituting an estimate of the expected value of the system response (the objective function) in all places requiring a deterministic objective function value.
Rosen et al. [157] propose a combined procedure, called RS team method, that improves the SA of Haddock and Mittenthal [105] by initially searching for good solutions
to be then employed as starting solutions by SA. The initial search for good starting
solutions is done by the use of first-order linear approximations of the model, adapting
the technique of response surface methodology to the case of a discrete decision space.
The RS team method is tested on a simulation model of a semi-conductor manufacturing process consisting of over 40 workstations, and it is experimentally compared with
the SA algorithm of Haddock and Mittenthal [105].

Bowler et al. [50] use a stochastic SA algorithm to experimentally analyze the asymptotic behavior of (sub)optimal homogeneous PTSP solutions, in the limit of pn (customer probability times number of customers) going to infinity. The PTSP objective function is estimated by sampling, and the sampling estimation error is used instead of the annealing temperature. The temperature decrease during the execution of the SA algorithm is mimicked by an increase in the accuracy of the objective function estimation, which, in turn, is obtained by increasing the number of samples.
Finally, two papers [100] and [150] focus on different metaheuristics, but involve
SA in experimental comparison. The paper by Gutjahr [100] that we also cited in
Section 3.1 focuses on S-ACO, and reports experimental comparisons between S-ACO
and the SSA algorithm of Gutjahr and Pflug [102]. Pichitlamken and Nelson [150], while focusing on a Stochastic Partitioning Method that will be described in Section 3.5, use the SA algorithm of Alrefaei and Andradóttir [7] as a term of comparison in the experimental analysis of their algorithm.
Although variants of SA for SCOPs have received a great deal of attention in the last decade, so that, for example, the question under which conditions convergence to the optimal solution is ensured can now be considered relatively well understood, there is a comparatively smaller body of comprehensive experimental results addressing interesting questions such as: which properties of the problem instance make which algorithmic variant well-suited? In particular, there still seems to be little empirical knowledge about the influence of the search space size on the performance of the single variants.

3.4 Tabu Search

3.4.1 Introduction to TS

The main ideas characterizing the TS metaheuristic were independently proposed in the eighties by Glover [90] and Hansen [110], and since then TS has been widely applied to combinatorial optimization problems. A comprehensive introduction to TS can be found in the book by Glover and Laguna [94], or in Hertz, Taillard and de Werra [113].
TS is essentially a sophisticated and improved type of local search, an algorithm that in its simplest form, also known as Hill Climbing, works as follows. Consider a starting current solution, evaluate its neighboring solutions (according to a given neighborhood structure), and set the best neighbor, or the first found neighbor that is better than the current solution, as the new current solution. Iterate this process as long as an improving solution is found in the neighborhood of the current solution. The local search stops when the current solution is better than all its neighbors, that is, when the current solution is a local optimum.
Such a simple and very general local search behaves quite poorly in practice, particularly because when a local optimum is found, the algorithm stops improving, and combinatorial problems often have local optima whose objective values are much worse than that of the global optimum. The strength of the TS metaheuristic with respect to simple local search is that, by employing three TS-specific concepts, it avoids getting prematurely stuck in a local optimum. These TS-specific concepts are: best improvement, tabu lists, and aspiration criteria.
Best improvement means that the current solution is always replaced by its best neighbor, even if the best neighbor is worse than the current solution. This is clearly a way not to get stuck in local optima. Using best improvement poses the problem of possible cycling among already visited solutions, because it is possible, for example, that the best neighbor of a solution is precisely the last visited current solution. In order to avoid cycling, choosing recently visited solutions is forbidden, by storing some attributes of these solutions in the so-called tabu lists. Whole solutions are not stored in a tabu list, because this would require too much memory for most combinatorial optimization problems. The choice of attributes is a delicate point. Typically, tabu lists store the moves that should be performed in order to go from one solution to another, or the differences between solutions. In this way the memory requirement of tabu lists remains feasible, but another problem arises: forbidding all solutions corresponding to a tabu attribute may also forbid solutions that have not yet been visited, possibly including very good or optimal solutions. TS employs aspiration criteria to solve this problem. An aspiration criterion is a condition that, if satisfied, allows setting as the new current solution a solution obtained by performing a tabu move. A typical example of an aspiration criterion is requiring that a solution be better than the best solution found since the beginning of the algorithm.
In pseudocode, the TS metaheuristic may be represented as in Algorithm 6.

Algorithm 6 Tabu Search (TS)
    Generate a starting current solution x
    Initialize the tabu lists
    for iteration k = 1, 2, . . . do
        Set $A(x, k) = \{y \in S(x) \setminus T(x, k)\} \cup \tilde{T}(x, k)$
        Set $x = \operatorname{argmin}_{y \in A(x, k)} g(y)$
        Update the tabu lists and the aspiration criteria
    end for

In Algorithm 6, $x$ and $y$ are feasible solutions of the combinatorial optimization problem, $A(x, k)$ is the set of solutions among which the new current solution is chosen at iteration $k$, $S(x)$ is the set of neighbors of $x$, $T(x, k)$ is the set of tabu moves at iteration $k$, and $\tilde{T}(x, k)$ is the set of tabu moves satisfying at least one aspiration criterion. In TS, typical stopping criteria are a maximum CPU time, a maximum number of consecutive iterations not producing an improving solution, or the emptiness of the set $A(x, k)$.
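For concreteness, one iteration of Algorithm 6 could be sketched in Python as follows (all arguments are hypothetical callables standing in for the problem-specific neighborhood, tabu test, aspiration test and objective function):

    def tabu_search_step(x, k, neighbors, is_tabu, aspires, g):
        """One iteration of Algorithm 6: build the admissible set A(x, k)
        from the non-tabu neighbors of x plus the tabu ones satisfying an
        aspiration criterion, then apply the best improvement rule."""
        admissible = [y for y in neighbors(x)
                      if not is_tabu(y, k) or aspires(y, k)]
        if not admissible:             # emptiness of A(x, k): stopping criterion
            return None
        return min(admissible, key=g)  # best neighbor, even if worse than x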
Theoretical properties of convergence of TS toward optimal solutions have been analyzed only quite recently by Hanafi [108] and by Glover and Hanafi [93]. Both papers derive convergence results for a version of TS where the choice of a given neighborhood and a decision criterion for selecting moves force some solutions to be revisited before exploring other new ones.

3.4.2 TS for SCOPs

In comparison to the wide literature on TS for DCOPs, there are still very few papers applying TS to SCOPs. These works are all experimental, and address static SCOPs both in the case of exact objective and ad hoc approximation, and in the case of sampling approximation.
3.4.2.1 Exact objective and ad hoc approximation

As we have already pointed out, one of the major difficulties when solving SCOPs is that the objective function, even when explicitly computable, is computationally expensive. In local search algorithms, TS included, it is crucial to be able to evaluate the neighborhood of a solution efficiently. Therefore, one of the main issues in applying TS to SCOPs is to find efficient approximations of the objective value difference between pairs of neighboring solutions.
Gendreau et al. [88] propose a TS algorithm for solving the vehicle routing problem with stochastic demands and customers. One of the major contributions of their paper is indeed the development of an easily computed approximation for the objective function, used for the evaluation of potential moves. The proposed TS was quite successful in experiments: for instances with up to about 50 customers, it was able to find optimal solutions in about 90% of the cases, with an average deviation of 0.38% from optimality.
Other papers applying TS to SCOPs are the already cited [32, 33] by Bianchi et al. (see Sections 3.1, 3.2, and 3.3), where a simple TS algorithm has been compared with other metaheuristics (ACO, EC, SA, and Iterated Local Search). As for the other metaheuristics, two approximations for the objective value difference between neighboring solutions generated according to the OrOpt scheme have been tested: the Yang and the TSP approximation. Even though the two approximations have different characteristics (the first is more accurate but more computationally expensive than the second), the quality of the results produced by the two versions of TS seemed to be quite insensitive to the type of approximation. In [33], TS obtained results better than ACO and SA, but worse than EC.
3.4.2.2 Sampling approximation

In the literature two types of contributions may be distinguished: papers that use simulation inside TS as a black box for the evaluation of the objective value of solutions, and papers that adapt the simulation procedure to the different components of TS, such as neighborhood exploration, setting of tabu moves, and verification of aspiration criteria, in order to speed up the computation.
To the first group belong the papers by Lutz et al. [138], Finke et al. [77], and Dengiz and Alabas [67]. These papers apply quite standard TS techniques, and are usually very time consuming, since the evaluation of solutions by simulation is a time consuming process often relying on external or commercial simulation packages. The advantage of using simulation is that in this way the real objective function is considered, in problems where a rigorous mathematical programming formulation would impose severe, unrealistic restrictions.
Among the second group of papers (adapting simulation to the different components of TS), we describe in some detail the works by Costa and Silver [65] and by
Aringhieri [9]. Costa and Silver [65] describe a TS algorithm for a problem in the
context of cause-effect analysis, where the true cause of an undesirable effect must be
recognized and eliminated. Given that the time to investigate a cause is a random
variable with known probability distribution, the goal is to establish a fixed sequence
of n causes so as to maximize the expected reward associated with discovering the true
cause within a specified time horizon. This problem is also called stochastic ordering
problem with time constraint (SOPTC).
The TS developed in this context, called NTS (Noisy TS), is based on sampling and statistical tests, and is suited for all optimization problems where the evaluation of the objective function is computationally expensive due to the presence of noise in the problem definition. In the following we only describe the characteristics of NTS that are directly related to the stochastic nature of the problem; what we do not describe is part of standard TS techniques for permutation problems. The objective value of a new current solution is computed by a sample average of the type of Equation (2.13). The N samples are generated according to the so-called descriptive sampling technique described in [126], in order to obtain a substantial variance reduction with respect to other sampling methods. Descriptive sampling has been adopted by Costa and Silver also because, in this way, the quality of the estimation of the exact objective value does not depend on the quality of the pseudo-random generator used. The estimation of the objective function takes O(Nn) time, and if N is large enough to guarantee a good estimation quality, this computation may be quite time consuming. For this reason, the evaluation of the (possibly many) neighbors of a solution is done with the following method, which relies on a smaller number of samples. A statistical test is used to decide whether a considered neighbor $y_c \in A(x, k)$ is better than the best neighbor $y_b \in A(x, k)$ examined so far in the current iteration $k$. The decision is made in two phases. First, a small number $N_c < N$ of samples is randomly generated for estimating the expected value $g_{N_c}(y_c)$ of $y_c$. The decision as to whether the true objective value of $y_c$ is higher than that of $y_b$ is made by hypothesis testing. Second, if the test has decided that $y_c$ is better than $y_b$, this is further ascertained by using all the $N$ samples. If it results that $g_N(y_c) > g_N(y_b)$, then $y_b$ is replaced by $y_c$. Since $N$ is finite, notwithstanding the use of this double-check procedure, there is a certain probability that $y_b$ is not truly the best feasible neighbor, and that the best solution so far is updated with a solution that is not truly the best one. In order to lessen the risk of missing a very good solution due to the bad quality of sampling, NTS keeps track of the $n_s$ best solutions encountered so far. At the end of the run all solutions in this list are re-evaluated with a number of samples larger than $N$, and the best solution according to this new estimation is the solution returned by NTS.
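The two-phase comparison of NTS can be sketched as follows (hypothetical names; sample(y, n) stands for descriptive sampling of n realizations, and first_phase_test for the chosen hypothesis test; recall that SOPTC is a maximization problem):

    def nts_compare(yc, yb, sample, first_phase_test, Nc=10, N=100):
        """Decide whether candidate neighbor yc should replace the best
        neighbor yb found so far in the current iteration."""
        # Phase 1: cheap hypothesis test with Nc << N samples.
        if not first_phase_test(sample(yc, Nc), sample(yb, Nc)):
            return yb
        # Phase 2: confirm the decision with the full N samples.
        g_yc = sum(sample(yc, N)) / N
        g_yb = sum(sample(yb, N)) / N
        return yc if g_yc > g_yb else yb   # maximization: keep the higher value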
In [65], the influence of several factors on the performance of NTS has been experimentally analyzed: the hypothesis testing technique (the t-test, the Wilcoxon test, and the Median test have been compared), and the numbers of samples ($N$, $N_c$, and the final re-evaluation sample size) to be used in the different phases of NTS. NTS has also been compared with a TS that is identical to NTS except for the fact that the objective function is computed exactly, on the basis of a closed-form expression available for SOPTC, and no hypothesis test is performed. This TS outperforms NTS both in computation time and in solution quality, but its solution quality is only slightly better than that of NTS. This result encourages the use of NTS for problems with very complex, or impossible to compute, objective functions. Note, however, that when a closed-form expression for the objective function is available, even if it is quite computationally expensive as in SOPTC, it may still be more efficient to use the classical TS algorithm instead of NTS.
An application where sampling is employed to save time with respect to using the
exact objective function is the one by Aringhieri [9], that applies TS to a Chance
Constrained Program (Definition 3). Constraints are supposed to be linear functions, that is, in Definition 3 we pose $H_i(x, \omega) = \sum_j a_{ij} x_j - b_i(\omega)$, with $j = 1, 2, \ldots, n$ and $x \in S \subseteq \mathbb{R}^n$. Note that in this problem, only the vector $b$ is assumed to be random. In the proposed TS, sampling is used to estimate the probability $p_i(x, k)$ that at iteration $k$ solution $x$ violates constraint $i$. Given a set of $N \cdot m$ random samples $b_{i,r}$, $i = 1, 2, \ldots, m$, $r = 1, 2, \ldots, N$, the probabilities are estimated as follows:

$$p_i(x, k) = \frac{1}{N} \sum_{r=1}^{N} \delta_{i,r}, \quad \text{where } \delta_{i,r} = \begin{cases} 1 & \text{if } \sum_j a_{ij} x_j - b_{i,r} > 0, \\ 0 & \text{otherwise.} \end{cases} \qquad (3.1)$$
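A minimal sketch of the estimator of Equation (3.1), with illustrative names, is:

    def violation_probability(x, a_i, b_samples):
        """Monte Carlo estimate of p_i(x, k): the fraction of sampled
        right-hand sides b_{i,r} for which solution x violates constraint i."""
        lhs = sum(aij * xj for aij, xj in zip(a_i, x))
        violated = sum(1 for b in b_samples if lhs - b > 0)
        return violated / len(b_samples)

    # Example: constraint 2*x1 + x2 <= b, with four sampled values of b.
    print(violation_probability([1, 1], [2, 1], [2.5, 3.5, 2.0, 4.0]))  # 0.5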

Probabilities $p_i(x, k)$ are used to define the concept of probably tabu moves, which in practice extends the set of tabu moves. A move is probably tabu at iteration $k$ if it leads to a solution $x$ for which $p_i(x, k) > \alpha_i$, $i = 1, 2, \ldots, m$ (compare this with Equation (2.3)). Given the set $P(x, k)$ of probably tabu neighbors of $x$, the new TS, called SIMTS-CCP (simulation TS for Chance Constrained Programs), can be obtained from Algorithm 6 by modifying the computation of $A(x, k)$ as

$$A(x, k) = \{y \in S(x) \setminus T(x, k) \setminus P(x, k) \cup \tilde{T}(x, k)\}. \qquad (3.2)$$

In [9] the SIMTS-CCP algorithm has been applied to two NP-hard optimization problems arising in the design of telecommunication networks. Preliminary computational
results show that solution quality is comparable to that obtained by a TS algorithm
that addresses the problem as deterministic, and the increase in computation time is
acceptable.

3.5 Stochastic Partitioning Methods

3.5.1 Stochastic Partitioning Methods for SCOPs
We have grouped under the name of SPM the Beam Search heuristic applied to SCOPs [16, 76], the Stochastic Branch and Bound [144, 145], and the combined procedure inspired by Nested Partitions [150]. These methods, explicitly designed for SCOPs, follow in different ways the same search strategy: the search space is recursively partitioned into sub-spaces, and the computational effort is concentrated on the sub-spaces that are estimated to be the most promising ones. SPM are not usually considered as belonging to the class of metaheuristics, but they could be, since inside the general search strategy several heuristics may be employed for the evaluation of search sub-spaces, for the improvement of solutions, and for the estimation and comparison of solutions. In the following, we introduce the different SPM methods in the context of the type of SCOP that each method mainly focuses on.
3.5.1.1 Exact objective and ad hoc approximation

The Beam Search (BS) heuristic is a search strategy closely related to Branch and Bound, where the search space is recursively partitioned into sub-spaces, for which upper and lower bounds for the objective function are computed, in order to guide the search toward the more promising partitions. Unlike Branch and Bound, BS reduces the width of the search, moving downward in the search tree only from a limited number of most promising nodes. The success of BS depends on the evaluation function that is used to select the nodes that will be further explored. Typically, different evaluation functions are used in BS. First, a simple but imprecise evaluation function is used to discard some nodes (this phase is called filtering); second, nodes that survive filtering are subject to a more precise and time consuming evaluation. Thus, the main principles behind BS (partitioning the search space and dosing the computational effort in specific partitions) are similar to those of SBB and NP. BS has been applied to SCOPs only recently. Beraldi and Ruszczyński [16] consider Chance Constrained problems like the ones we described in Section 2.2. They apply BS to a set covering problem with probabilistic constraints, and show experimentally that BS allows a considerable time saving with respect to an exact Branch and Bound algorithm, while the solution quality of BS ranges from optimal to 5% worse than optimal. Erel et al. [76] present a BS-based method for the stochastic assembly line balancing problem in U-lines. Computational experiments indicate that the average performance of the proposed method is better than the best-known heuristic in the literature for the traditional straight-line problem.
3.5.1.2 Sampling approximation

Stochastic Branch and Bound (SBB) was first proposed by Norkin, Ermoliev, and Ruszczyński [144] as a method for solving problems where the objective function must be estimated by sampling, as described in Section 2.4.2. This algorithm extends to SCOPs the main principle of classical Branch and Bound, that is, the computation of upper and lower bounds for the objective function over portions of the search space, in order to guide the search. The main difference with respect to classical Branch and Bound is that here, due to the stochastic and non-exact estimation of the objective function (and thus of the upper and lower bounds), sub-spaces cannot in general be cut during the search, and a sort of backtracking into previously evaluated sub-spaces may be necessary.

The SBB algorithm proposed in [144] is represented by the pseudocode of Algorithm 7, and works as follows. Given the search space $S$, the algorithm constructs increasingly finer partitions of $S$, denoted by $\mathcal{P} = \{S^1, S^2, \ldots\}$. The original problem of finding $g^*(S) := \min_{x \in S} g(x)$ (see Equation (2.2)) is divided into the sub-problems of finding $g^*(S^r) := \min_{x \in S^r} g(x)$, with $r = 1, 2, \ldots$, and $g^*(S) = \min_{S^r \in \mathcal{P}} g^*(S^r)$. Assume that there exist functions $L$ and $U$ from $\mathcal{P}$ to $\mathbb{R}$ such that, for each $S^r \in \mathcal{P}$, $L(S^r) \le g^*(S^r) \le U(S^r)$, with $U(S^r) = g(\bar{x})$ for some $\bar{x} \in S^r$, and such that if $S^r$ is a singleton set, then $L(S^r) = g^*(S^r) = U(S^r)$. Suppose that the lower and upper bounds $L(S^r)$ and $U(S^r)$ cannot be computed exactly, but that estimates $\eta^l(S^r)$ and $\xi^m(S^r)$ are used instead, respectively, assuming that almost surely $\lim_{l \to \infty} \eta^l(S^r) = L(S^r)$ and $\lim_{m \to \infty} \xi^m(S^r) = U(S^r)$.
Algorithm 7 Stochastic Branch and Bound (SBB)
 1: Set $\mathcal{P}_0 = \{S\}$, $\eta_0(S) = \eta^{l_0}(S)$, $\xi_0(S) = \xi^{m_0}(S)$;
 2: for iteration k = 0, 1, 2, . . . do
 3:   Select the lowest-bound subset $S_k^* \in \operatorname{argmin}_{S^r \in \mathcal{P}_k} \{\eta_k(S^r)\}$ and a current solution $x_k$ from a subset in $\operatorname{argmin}_{S^r \in \mathcal{P}_k} \{\xi_k(S^r)\}$;
 4:   if the lowest-bound subset $S_k^*$ is a singleton then
 5:     $\mathcal{P}_{k+1} = \mathcal{P}_k$;
 6:   else
 7:     Construct a partition of the lowest-bound subset, $\mathcal{P}_k'(S_k^*) = \{S_k^1, S_k^2, \ldots, S_k^{n_k}\}$;
 8:     Construct a new full partition $\mathcal{P}_{k+1} = \mathcal{P}_k \setminus \{S_k^*\} \cup \mathcal{P}_k'(S_k^*)$;
 9:   end if
10:   for all subsets $S^r \in \mathcal{P}_k$ do
11:     Update the estimates of the lower and upper bounds: $\eta_k(S^r) = \eta^{l_k}(S^r)$, $\xi_k(S^r) = \xi^{m_k}(S^r)$;
12:   end for
13: end for

Norkin et al. [144] proved the following convergence result: suppose the indices $l_k$ and $m_k$ are chosen in such a way that, whenever a subset $S^r$ is an element of $\mathcal{P}_k$ for infinitely many $k$, then $\lim_{k \to \infty} l_k = \infty$ and $\lim_{k \to \infty} m_k = \infty$. Then with probability one there exists an iteration $k_0$ such that for all $k \ge k_0$, the lowest-bound subsets $S_k^*$ are singletons and contain optimal solutions only. As suggested in [144], the estimation of a lower bound $L(S^r)$ for $g^*(S^r)$ may be done by exchanging the minimization and the expectation operators, since $g^*(S^r) = \min_{x \in S^r} g(x) = \min_{x \in S^r} E_P(G(x, \omega)) \ge E_P\left(\min_{x \in S^r} G(x, \omega)\right)$. Thus, one may choose $L(S^r) = E_P\left(\min_{x \in S^r} G(x, \omega)\right)$, and the estimation of the lower bound $L(S^r)$ may be computed by the sample average

$$\eta^N(S^r) = \frac{1}{N} \sum_{j=1}^{N} \min_{x \in S^r} G(x, \omega^j), \qquad (3.3)$$

where $\omega^1, \omega^2, \ldots, \omega^N$ is an independent, identically distributed (i.i.d.) random sample of $N$ realizations of the random vector $\omega$.
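A minimal sketch of the sample-average estimator of Equation (3.3) is given below; solve_min is a caller-supplied routine for the deterministic subproblem of each scenario (it may be exact or, as discussed next, heuristic), and all names are illustrative:

    def lower_bound_estimate(subset, G, scenarios, solve_min):
        """Estimate L(S^r) = E[min_{x in S^r} G(x, omega)] by averaging the
        optimal values of the deterministic subproblems over the scenarios."""
        values = [solve_min(subset, G, omega) for omega in scenarios]
        return sum(values) / len(values)

    # Tiny example with an explicitly enumerable subset.
    G = lambda x, omega: (x - omega) ** 2
    solve_exact = lambda S, G_, omega: min(G_(x, omega) for x in S)
    print(lower_bound_estimate([0, 1, 2], G, [0.4, 1.6, 2.2], solve_exact))  # approx. 0.12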


In general, the practical application of SBB implies one major difficulty: computing an estimation of the lower bound by Equation (3.3) requires solving a possibly NP-hard deterministic combinatorial optimization problem, $\min_{x \in S^r} G(x, \omega^j)$, for every sample scenario $\omega^j$, and this is infeasible in a reasonable amount of computation time unless very small problem instances are addressed.
Gutjahr et al. [101] use SBB to solve small instances of the single-machine-tardiness scheduling problem. They consider different sampling techniques for estimating lower bounds, and report computational experiments.
As a way to make SBB more efficient, Gutjahr et al. [104] propose to use heuristics
or metaheuristics to approximately solve the deterministic subproblems for the lower
bound estimation of Equation (3.3), as schematized by Figure 3.2. The authors focus on
the problem of Activity Crashing in Project Management, and show experimentally that
the replacement of an exact solution to deterministic subproblems by a heuristic one (in
this case a local search algorithm) is very advantageous. The authors also say that it is
possible to extend the convergence results of [144] to cases in which the deterministic
subproblems are approximately solved by a search heuristic with a random starting
point, keeping track of the best solution found so far. Another practical enhancement
of the SBB proposed in [104] is the use of Importance Sampling as a technique to reduce
the variance of the sample average estimates. Without the use of a variance-reduction
technique, the number of Monte Carlo samples (and thus of computation time) required
to obtain a sample average with the same variance would be much greater.
[Figure: a tree showing that the deterministic subproblems in SBB may be solved either exactly (complete enumeration, Branch and Bound, dynamic programming) or heuristically (local search, ACO, EC, SA, TS).]

Figure 3.2: Possible ways of solving the deterministic subproblems for the computation of the lower bound (Equation (3.3)) in SBB.

Pichitlamken and Nelson [150] propose a combined procedure extending the Nested Partitions (NP) method by Shi and Ólafsson [166] to SCOPs where the objective is estimated by sampling. NP is based on identifying a sequence of most promising subsets of the search space S, and concentrating the search for good solutions there. At each iteration, the most promising subset of S is partitioned into M subsets, and the entire surrounding region is aggregated into one single subset of S. Thus, at each iteration NP looks at a partition of M + 1 subsets of the search space S. From each of these M + 1 subsets, a random solution is chosen using some random sampling scheme, and the objective value of each solution is evaluated, in order to decide which is the most promising subset of the next iteration. With respect to NP, the combined procedure of Pichitlamken and Nelson applied to SCOPs includes a number of enhancements. First, in each of the current M + 1 subsets, more than one solution is randomly chosen for evaluating the most promising subset. Second, solutions here are evaluated by the sample average estimation of the objective value (see Equation (2.13)). Moreover, in order to select the best solution of each subset, and the best solution of all subsets, a statistical procedure called Sequential Selection with Memory (SMM) is used. SMM guarantees to select the best or a near-best alternative among a set of solutions with a user-specified probability. It also exploits memorized information (samples and sample averages) on previously encountered solutions. The spirit of SMM is similar to that of the F-Race procedure proposed in the context of ACO [45] (see Section 3.1), since it consists in a series of steps in which a set of competing solutions is evaluated and the worst ones are eliminated. For details about SMM, see also [148, 149]. The combined procedure based on NP also applies a Hill Climbing local search (HC) to the best solution of each iteration. In this way, the computational effort is concentrated on the most promising subset of the search space. Another specific characteristic of the combined procedure of Pichitlamken and Nelson is that at the end of the algorithm, the solution having the smallest sample average accumulated over all visits to that solution is returned as the final solution. Pichitlamken and Nelson call their combined procedure NP+SMM+HC, a name that underlines the main building blocks just described. In [150], they provide a proof that, with probability one, NP+SMM+HC finds one of the optimal solutions as the number of iterations goes to infinity. Moreover, numerical experiments applying the algorithm to an (s,S) Inventory Problem and to a Three-Stage Buffer Allocation problem show that NP+SMM+HC performs well in comparison to a pure random search and to an SA algorithm. While the convergence guarantee of NP+SMM+HC is due to the global guidance system provided by NP, the practical performance is enhanced by the use of the SMM selection-of-the-best method and the HC local search.

3.6 Other algorithmic approaches to SCOPs
In this section we briefly give some references to other approaches for solving SCOPs, such as Progressive Hedging and Rollout Algorithms, which we have not reviewed in the previous sections because they are still not very frequent in the literature. Nevertheless, we include this section because the referenced approaches may either be classified as metaheuristics, or may involve metaheuristics as part of their solution strategy.
Progressive Hedging (PH) is an algorithm proposed by Rockafellar and Wets [155] for solving multistage stochastic programs. It is based on considering a small set of representative scenarios that capture the uncertain future; for each of these scenarios, a deterministic optimization subproblem is solved; in this way one ends up with several solutions, none of which is in general feasible for the original problem. Therefore, a sort of averaging procedure among the solutions of the subproblems is performed, in order to obtain a blended solution that hedges against future uncertainty. Some extensions of PH involve the use of heuristics or metaheuristics to solve the deterministic subproblems. For example, Løkketangen and Woodruff [136] integrate TS into the PH framework and apply this combined procedure to mixed integer (0,1) multistage stochastic programming. Haugen et al. [111] propose an extension of PH that is explicitly presented as a metaheuristic: rather than using a heuristic algorithm to solve the deterministic subproblems, it uses an algorithm that is exact in its usual context, but serves as a heuristic inside the proposed PH metaheuristic.
Rollout algorithms (RO) are an emerging class of methods for solving combinatorial optimization problems that are capable of improving the performance of a given heuristic through sequential applications. Originally, the ideas underlying RO were proposed by Tesauro and Galperin in 1997 [176] for developing a simulation-based algorithm to play backgammon. In the same year, Bertsekas et al. [23] formalized RO for combinatorial optimization problems, by applying them to a machine maintenance and repair problem. RO are based on the Policy Iteration algorithm, which is part of the Dynamic Programming framework for solving MDPs [19] (see the paragraph about Markov Decision Processes in Section 2.3). Some authors (Bertsekas et al. in Section 2 of [23], and Bertsekas on page 528 of [20]) emphasize that RO also share some ideas with Tabu Search, particularly with the sequential fan candidate list strategy (Glover and Laguna [94]) and its extended variant, the fan and filter method (Glover [92]). Among the papers that apply RO to SCOPs, we cite the works of Secomandi on the vehicle routing problem with stochastic demands [162, 163] and on the TSP with stochastic travel times [164], and the paper by Bertsekas and Castañon [21] on stochastic scheduling.

3.7 Discussion and open issues

3.7.1 Using the Sampling approximation
We have seen that the selection-of-the-best method that a metaheuristic uses for performing sample averages and for comparing solutions can have a great impact on the effectiveness of the algorithm, but it is still hard to say which method is the most effective in relation to the metaheuristic where it is employed, and this is an interesting open issue.
Table 3.2 reports some successful selection-of-the-best methods described in the previous sections, in the context of the metaheuristic where they have been used. In some cases ([99, 102, 7, 6, 115]), the use of a particular method has been justified mainly by the need to derive rigorous convergence properties, and its application to other metaheuristics is not very meaningful. But in more experimentally oriented papers, a method which is particularly efficient in one metaheuristic could be advantageous also in others. This is the case, for instance, of F-Race [45], SMM [150], and the adaptive sampling procedures used in [100], [116], and [65]. In looking for efficient selection-of-the-best methods to be applied in metaheuristics, the literature on statistical procedures for ranking, selection, and multiple comparisons (see, for example, [80, 173] and the references cited therein) could be a good source of inspiration. Moreover, for speeding up the sample average computations, the application of variance-reduction techniques could be useful, such as, for example, those belonging to the field of Rare Event Simulation [160].
Given the above observations, a selection-of-the-best method working as a black box simulation that does not allow specifying how samples are chosen is not advisable. Another requirement that seems necessary is the possibility to increase the accuracy of the objective function estimates, particularly when the algorithm has identified good or near-optimal solutions. The intuitive reason is that SCOPs often have many local optima whose values may be quite near each other, and in order to discriminate between local optima one needs an estimation error that is small with respect to the difference between the exact values of the local optima. We have seen a practical confirmation of this in several experimental papers, for instance [181] and [65]. A more rigorous argument in favor of this requirement is that all metaheuristics with provable convergence properties need to use a number of samples increasing with the iteration counter.
It has been recognized that completely different discrete stochastic optimization algorithms may be needed for small and for large search spaces, respectively (cf. [80]). The degree of randomness, that is, the size of the noise compared to the undisturbed objective function values, is also an important factor. It cannot be expected that a metaheuristic variant working well for solution spaces with a small amount of noise will also perform optimally for solution spaces with a large amount of noise, and vice versa. It appears that a characterization of metaheuristic variants for SCOPs with respect to their appropriate domains of problem instance types has yet to be elaborated. In the second part of this thesis, we will address this particular aspect in the context of the PTSP, by considering a benchmark of PTSP instances with varying levels of stochasticity, and by analyzing the behavior of our metaheuristics for different types of instances.

3.7.2 Experimental comparisons among different metaheuristics
At the moment, most of the papers in the SCOP literature focus on one single metaheuristic, which is compared either to variants of the same metaheuristic, or to simple
heuristics such as random search, or to exact methods when these are available. Only
a very small number of papers perform comparisons among different metaheuristics,
as reported in Table 3.3. Thus, it is still impossible to give guidelines on which metaheuristic is better in which situation.
One important aspect that experimentation with different metaheuristics could reveal is whether the effectiveness of a metaheuristic is due to the particular adjustments
to speed up the computation (like approximating the objective function or using carefully designed sampling and statistical comparison strategies), or to the intrinsic search
trajectory of the metaheuristic. This issue is addressed in Appendix B in the context
of the Vehicle Routing Problem with Stochastic Demands (VRPSD).

Reference(s) | Way of evaluating a solution | Solutions compared | Metaheuristic(s) where the method is used

Gutjahr [99] | Sample average with number of samples increasing linearly with the iteration counter | Current solution with current estimation of the optimal solution | ACO

Gutjahr [100] | Sample average with number of samples decided adaptively on the basis of a statistical test | Current solution with current estimation of the optimal solution | ACO, SA

Birattari et al. [45] | Sample averages integrated with the F-Race statistical procedure | All solutions belonging to the (dynamic) set of solutions in the race | ACO

Gutjahr and Pflug [102] | Sample average with number of samples increasing more than quadratically with the iteration counter | Current solution with current estimation of the optimal solution | SA

Alrefaei and Andradóttir [7] | Sample average and normalized number of visits | All solutions visited so far | SA

Alkhamis et al. [6] | Sample average with number of samples increasing with iterations, comparison with a statistical significance test | Current solution with current estimation of the optimal solution | SA

Homem-de-Mello [115, 116] | Sample average with number of samples decided adaptively on the basis of a t-test | Current solution with current estimation of the optimal solution | SA

Costa and Silver [65] | Descriptive sampling, statistical test in two stages (using a higher number of samples only if the first stage of the test is positive) | Current solution with current estimation of the optimal solution, keeping in memory a given number of good solutions for a final, more accurate comparison | TS

Gutjahr et al. [101] | Sample average using the Importance Sampling variance-reduction technique, with number of samples increasing with the iteration counter | Lower and upper bounds of all subsets in which the search space has been partitioned | SBB (SPM)

Pichitlamken and Nelson [150] | Sample averages integrated with the SMM statistical procedure | Randomly selected solutions in each subset in which the search space has been partitioned, and random solutions selected from the neighborhood of the current solution during local search | NP+SMM+HC (SPM)

Table 3.2: Main selection-of-the-best methods used in the metaheuristics literature for SCOPs where the objective function must be estimated by sampling.

Reference | SCOP | Metaheuristics compared | Winner
Bianchi et al. [32, 33] | VRPSD | ACO, EC, SA, TS, ILS | EC, TS
Gutjahr [100] | TSPTW | ACO, SA | ACO
Pichitlamken and Nelson [150] | Inventory Problem and Three-stage Buffer Allocation Problem | SPM, SA | SPM
Easton and Mansour [74] | Stochastic Goal Programming | EC, SA, TS | EC

Table 3.3: Papers with comparisons among different metaheuristics.

3.7.3 Theoretical convergence properties
Papers analyzing theoretical convergence properties of metaheuristics applied to SCOPs are summarized in Table 3.4. Note that SCOPs with exactly computable objective function are missing from the table. In fact, when a metaheuristic always uses an exact expression for the objective value of a solution, its convergence behavior is equivalent, from a theoretical point of view, to that of applying the metaheuristic to a DCOP (pointers to theoretical analyses of metaheuristics for DCOPs have been provided in the previous sections while introducing each metaheuristic). On the contrary, when ad hoc approximations of the objective function are used, the fact that the error is systematic makes a theoretical analysis very difficult.
Theoretical convergence analyses do exist for static SCOPs with sampling-estimated objective function. The most studied metaheuristic from this point of view is certainly SA, followed by ACO and SPM. TS and EC still lack this kind of analysis. Interestingly, Homem-de-Mello [116] suggests that the results he derives for a simple variable-sample random search algorithm can be readily adapted to show the convergence of variable-sample versions of more sophisticated methods, in particular those methods for which the proof of convergence in the DCOP domain relies on the convergence of pure random search. The author explicitly indicates EC (GA) as one of these, by referring to the work of Rudolph [161].
Finally, metaheuristics with provable convergence properties (ACO and EC) have been designed to solve MDPs. Actually, at the moment MDPs have been addressed only theoretically by metaheuristics. Therefore, there is a need to validate their effectiveness also by experimental investigations.

Metaheuristic | SCOP category | Reference(s)
ACO | sampling estimated objective | Gutjahr [99]
SA | sampling estimated objective | Alrefaei and Andradóttir [7]
SA | sampling estimated objective | Alkhamis et al. [6]
SA | sampling estimated objective | Homem-de-Mello [115]
SA | sampling estimated objective | Alkhamis and Ahmed [5]
SBB (SPM) | sampling estimated objective | Norkin et al. [144]
SA | objective function subject to normally distributed noise | Gelfand and Mitter [84]
SA | objective function subject to noise reducing to zero after a certain number of iterations | Fox and Heine [79]
SA | objective function subject to noise distributed according to a sufficiently peaked distribution | Gutjahr and Pflug [102]
ACO | infinite horizon MDP | Chang et al. [60] and Chang [59]
EC | infinite horizon MDP | Chang et al. [61]
EC | finite horizon partially observed MDP | Lin et al. [135]

Table 3.4: Papers with theoretical convergence proofs.

Part II

Ant Colony Optimization and local search for the Probabilistic Traveling Salesman Problem

Chapter 4

The Probabilistic Traveling Salesman Problem
The first part of the thesis (up to Chapter 3) had a broad scope, being dedicated to the
review of the application of several metaheuristics to the wide class of SCOPs. This
second part of the thesis (from the present chapter to Chapter 7) focuses in particular
on one metaheuristic, ACO (Ant Colony Optimization), and on one SCOP, the Probabilistic Traveling Salesman Problem (PTSP). Concentrating on this case study gives us the occasion to explore in more detail many issues that have been introduced in the first part of the thesis, such as the use and effectiveness of objective function approximations inside ACO (in Chapter 5), the design of efficient local search algorithms
also based on move cost approximations (in Chapter 6), and the effect of combining a
metaheuristic with local search for solving the SCOP under study (in Chapter 7).
The remainder of this chapter is organized as follows. Section 4.1 formally defines
the PTSP, by introducing the notation used throughout the second part of the thesis,
and by defining and explaining the PTSP objective function. Section 4.2 reports on the
literature involving the PTSP. Some papers have been already cited in the first part of
the thesis, but they are again considered here for the sake of completeness. Section 4.3
describes the benchmark of PTSP instances that we have set up for our experimental
investigations of the following chapters. The benchmark has been carefully designed, in
particular to allow the analysis of the behavior of optimization algorithms for different
levels of stochasticity. Sections 4.4 and 4.5 describe, respectively, the computation of a lower bound on the optimal solution value of the PTSP, and a set of simple heuristics available in the literature for solving the PTSP. These elements will be used for producing repeatable comparisons with our metaheuristics.

4.1 Statement of the problem

For many delivery companies, only a subset of the customers require a pickup or delivery
each day. Information may not be available far enough in advance to create optimal schedules each day for those customers that do require a visit, or the cost to acquire sufficient computational power to find such solutions may be prohibitive. For these reasons, it is not unusual to design a tour containing all customers (called the a priori tour), and each day to follow the ordering of this a priori tour to visit only the customers requiring a visit that day (see Figure 4.1). The tour actually traveled each day, once the customers requiring a visit are revealed, is called the a posteriori tour. The problem of finding an a priori tour of minimum expected cost, given a set of customers each with a given probability of requiring a visit, defines the PTSP. In the remainder of the thesis, the terms a priori tour, tour, and solution will be used interchangeably.

Figure 4.1: An a priori tour (left), and a possible a posteriori tour (right) obtained from the a priori tour by visiting only the customers requiring a visit on a given random realization of customers, and by keeping the same order of visit as in the a priori tour. In this example (n = 10), a random realization where only customers 1, 2, 4, 5, 6, 9, 10 require a visit is shown.
More formally, let $N = \{i \mid i = 1, 2, \ldots, n\}$ be a set of $n$ customers. For each pair of customers $i, j \in N$, $d(i, j)$ represents the distance between $i$ and $j$. Here, we assume that the distances are symmetric, that is, $d(i, j) = d(j, i)$. In the remainder of the thesis, distances will also be referred to as costs. An a priori tour $\lambda = (\lambda(1), \lambda(2), \ldots, \lambda(n))$ is a permutation over $N$, that is, a tour visiting all customers exactly once. Given the independent probability $p_i$ that customer $i$ requires a visit, $q_i = 1 - p_i$ is the probability that $i$ does not require a visit. The general case where the customer probabilities $p_i$ may be different is referred to as the heterogeneous PTSP, while if the probabilities are all equal ($p_i = p$ for every customer $i$), the problem is called the homogeneous PTSP. We will use the following convention for any customer index $i$:

$$i := \begin{cases} i \bmod n & \text{if } i \bmod n \neq 0, \\ n & \text{otherwise,} \end{cases} \qquad (4.1)$$
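In code, this convention amounts to the following wrap-around rule (a small sketch, not part of the original formulation):

    def wrap(i, n):
        """Customer index convention of Equation (4.1): indices run from
        1 to n and wrap around the tour, so wrap(n + 1, n) == 1."""
        r = i % n
        return r if r != 0 else n

    n = 10
    print(wrap(11, n), wrap(10, n), wrap(7, n))  # 1 10 7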

where $i \bmod n$ is the remainder of the division of $i$ by $n$. The reason for defining the above convention is that we want to use as customer indices the numbers from 1 to $n$ (and not from 0 to $n-1$), and we need to make computations with the modulus operator.
The expected length of an a priori tour $\lambda$ can be computed in $O(n^2)$ time with the

following expression, derived by Jaillet [118]:

$$E[L(\lambda)] = \sum_{i=1}^{n} \sum_{r=1}^{n-1} d(\lambda(i), \lambda(i+r)) \, p_{\lambda(i)} \, p_{\lambda(i+r)} \prod_{i+1}^{i+r-1} q. \qquad (4.2)$$

We use the following notation for any $i, j \in \{1, 2, \ldots, n\}$:

$$\prod_{i}^{j} q := \begin{cases} \prod_{t=i}^{j} q_{\lambda(t)} & \text{if } 0 \le j - i < n - 1, \\ \prod_{t=i}^{n} q_{\lambda(t)} \prod_{u=1}^{j} q_{\lambda(u)} & \text{if } i - j > 1, \\ 1 & \text{otherwise.} \end{cases} \qquad (4.3)$$

The expression for the objective function (Equation (4.2)) has the following intuitive explanation: each term in the summation represents the distance between the $i$th customer and the $(i+r)$th customer, weighted by the probability that the two customers require a visit ($p_{\lambda(i)} p_{\lambda(i+r)}$) while the $r-1$ customers between them do not require a visit ($\prod_{i+1}^{i+r-1} q$).
In the homogeneous PTSP, where $p_i = p$ and $q_i = q$ for every customer $i$, the expression for the objective function still requires $O(n^2)$ computation time, but it is a bit simpler than Equation (4.2):

$$E[L(\lambda)] = \sum_{i=1}^{n} \sum_{r=1}^{n-1} p^2 q^{r-1} d(\lambda(i), \lambda(i+r)). \qquad (4.4)$$
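As an illustration, the following sketch (names are ours, not from [118]) evaluates Equation (4.2) in O(n²) time by maintaining a running product of the probabilities q of the customers between λ(i) and λ(i + r); in the homogeneous case it reduces to Equation (4.4):

    def expected_length(tour, d, p):
        """Expected length E[L(tour)] of an a priori PTSP tour, Equation (4.2).
        `tour` is a permutation of the customers, `d` a distance function,
        `p` maps each customer to its probability of requiring a visit."""
        n = len(tour)
        total = 0.0
        for i in range(n):
            q_between = 1.0   # product of q's of the customers between i and i+r
            for r in range(1, n):
                a, b = tour[i], tour[(i + r) % n]
                total += d(a, b) * p[a] * p[b] * q_between
                q_between *= 1.0 - p[b]   # customer b now lies in between
        return total

    # Homogeneous example: three customers on a line, p = 0.5 each.
    coords = {1: 0.0, 2: 1.0, 3: 2.0}
    d = lambda a, b: abs(coords[a] - coords[b])
    p = {1: 0.5, 2: 0.5, 3: 0.5}
    print(expected_length([1, 2, 3], d, p))  # 1.5, matching Equation (4.4)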

Note that the PTSP falls in the category of SIPs (Stochastic Integer Programs) as
introduced by Definition 2 in Chapter 2.

4.2 Literature review

The PTSP was introduced in 1985 by Jaillet in his PhD thesis [118], where he derives several theoretical properties of the problem and proposes different heuristics, as well as an integer nonlinear programming formulation and an exact branch-and-bound algorithm for the homogeneous PTSP (the exact algorithms remain at an introductory level, since they are not tested experimentally). Some theoretical properties of the PTSP derived in [118] were later published in Jaillet [119], and focus mainly on two issues: the derivation of closed-form expressions for the efficient computation of the objective function under several customer probability configurations, and the analysis of properties of optimal PTSP solutions, especially in relation to optimal solutions of the (deterministic) TSP. The analysis presented in [119] implies that under specific conditions (the set of customers on the Euclidean space is also the convex hull of this set of customers) the optimal PTSP and TSP solutions coincide. In general, though, bounds have been derived that show how the optimal TSP solution may be arbitrarily bad with respect to the optimal PTSP solution. This observation justified and triggered subsequent research on developing specific algorithms for the PTSP that take its probabilistic nature into account.

Bertsimas et al. [28] showed that the PTSP is an NP-hard problem. As such, the PTSP is very difficult to solve to optimality, and its literature is richer in papers on heuristics than on exact methods.
Rossi and Gavioli [159] adapt to the PTSP two well-known TSP tour construction heuristics, the Nearest Neighbor and the Clarke-Wright algorithms, by explicitly including the customer probabilities in the evaluation and selection of new portions of the tour to construct. Rossi and Gavioli experimentally evaluate the two heuristics on homogeneous PTSP instances with up to 100 uniformly distributed customers, and compare their performance to that of the classical deterministic Nearest Neighbor and Clarke-Wright heuristics that solve a TSP. Their conclusion is that, for the tested instances, it is important to use the techniques specifically designed for the PTSP only for instances with more than 50 customers and customer probabilities less than 0.6.
The PhD thesis of Bertsimas [24] and a related article by Bertsimas and Howell [27] extend the results of Jaillet, sharpening the bounds that link the PTSP with the TSP, and analyze the relation of the PTSP with another probabilistic combinatorial optimization problem, the probabilistic minimum spanning tree. In [27], the authors also investigate the worst-case and average performance of some TSP heuristics in a PTSP context. The analysis reveals that the Nearest Neighbor heuristic is on average very poor for the PTSP, while the Space Filling Curve heuristic is asymptotically very close to the re-optimization strategy, if nodes are uniformly distributed on the unit square and a Euclidean metric is used for customer distances. The Space Filling Curve heuristic is also analyzed experimentally in [27]. Two local search procedures, the 2-p-opt and the 1-shift, are applied to the solution produced by the Space Filling Curve, in order to achieve better solutions for the PTSP. For the two local search procedures, recursive equations are proposed that efficiently compute the change in the homogeneous PTSP expected cost between two neighboring solutions. We have verified that the recursive equations proposed in [27] are not correct, and we derive in this thesis correct expressions both for the homogeneous and for the heterogeneous case (see Section 6.4, which covers the contents of Bianchi et al. [42] and Bianchi and Campbell [36]).
The results and analyses of [24] are summarized and further extended in Bertsimas
et al. [28], where also the probabilistic vehicle routing problem and the probabilistic
traveling salesman facility location problem are investigated with the paradigm of a
priori optimization. In [28], only problems with homogeneous probabilities are considered.
In [26], Bertsimas et al. investigate experimentally the PTSP and the Probabilistic
Vehicle Routing Problem, by comparing the solution quality obtained by solving the
problem a priori and a posteriori (that is, by sampling the stochastic variables and by
approximately solving the deterministic subproblems obtained by sampling). For the
PTSP, the authors consider both homogeneous and heterogeneous instances. Experimental results on small instances with up to 50 customers uniformly distributed on
the unit square show that a priori solutions are on average only 1.22% worse than a
posteriori ones obtained via re-optimization of the sampled subproblems. These experiments confirm the theoretical results of Bertsimas and Howell [27] on the similarity
between a priori optimization and re-optimization when solving uniformly distributed


PTSP instances.
Exact algorithms for solving also heterogeneous PTSPs to optimality are addressed in two papers: Berman and Simchi-Levi [17], and Laporte et al. [132]. In [17], the considered problem is a PTSP on a network, that is, the underlying graph is not assumed to be complete. The key result is the derivation of a good lower bound for the optimal PTSP value, based on solving an appropriate transportation problem with 2n constraints. It is suggested that the lower bound can be used in a Branch and Bound algorithm to find the optimal solution. The authors of [17] refer to a working paper [18] for a first attempt at implementing such a Branch and Bound algorithm. Another result of [17] is the derivation of an algorithm that, given an a priori tour, finds in $O(n^3)$ time the optimal location of a hypothetical depot from which the traveling salesman departs before traveling the a priori tour.
The paper by Laporte et al. [132] proposes a Branch and Cut algorithm based on an integer two-stage stochastic programming formulation of the PTSP (for the definition of this type of problem, see Section 2.2). The authors report on computational experiments on both homogeneous and heterogeneous PTSP instances, and show that the proposed algorithm finds the optimal solution for instances with up to 50 customers. An interesting point that emerges from the experiments is that the lower the customer probabilities, the more difficult the PTSP instance. In other words, more random problems are more difficult to solve by the proposed Branch and Cut algorithm. Unfortunately, [132] does not report the optimal values found for the tested instances, so their results cannot be used to compare the performance of heuristic approaches.
Recent approaches to the PTSP mainly involve the application of metaheuristics. These include Simulated Annealing (SA) [50], Evolutionary Computation [158], and Ant Colony Optimization (ACO) [51, 52, 100, 45, 38, 39].
Bowler et al. [50] experimentally analyze, by means of a stochastic SA algorithm, the asymptotic behavior of (sub)optimal homogeneous PTSP solutions in the limit of pn (customers probability times number of customers) going to infinity. The PTSP objective function is estimated by sampling, and the sampling estimation error is used as a sort of temperature that is regulated during the annealing process.
The first preliminary results about solving the PTSP with ACO have been published by Bianchi et al. [38, 39], and they form the basis of the results described in this thesis. The main issue addressed in [38, 39] is the comparison of a TSP-specific version of ACO with a version that explicitly considers the PTSP objective function, similarly to what has been done by Rossi and Gavioli [159] for the Nearest Neighbor and Clarke-Wright heuristics.
Branke and Guntsch [51, 52] explore the idea of employing faster approximations of the exact PTSP objective function. The authors propose an ad hoc approximation of the expected cost that neglects the least probable customer configurations. This approximation is shown experimentally to accelerate convergence without significantly worsening the solution quality. Another issue addressed in [51, 52] is the design of PTSP-specific heuristics to guide the ants' construction process. The authors experimentally analyze different heuristics, and show that one of them indeed improves the quality of the solutions constructed by the ants, but at the cost of a higher computational time.



name       n
kroA100    100
eil101     101
ch150      150
d198       198
lin318     318
att532     532
rat783     783
dsj1000   1000

Table 4.1: TSPLIB instances providing customers coordinates, from which Euclidean distances have been computed. Each of these TSP instances has been combined with 54 different customer probability configurations characterized by average probability and variance as reported in Table 4.2.
Gutjahr [100] briefly addresses the PTSP as a pre-test for analyzing the performance of S-ACO, an ACO algorithm that is suited for stochastic problems where the objective function is not known exactly, but can only be estimated by sampling. After the pre-test on the PTSP, S-ACO is applied to a TSP with time windows and with stochastic travel times.
Recently (Birattari et al. [45]), the PTSP has again been used as a sort of test problem for ACO/F-Race, another ACO algorithm suited for stochastic problems where the objective function must be estimated by sampling.
More details on how S-ACO [100] and ACO/F-Race [45] work can be found in Section 3.1.

4.3 Benchmark of PTSP instances

A PTSP instance is characterized by two types of information: the distance between each pair of customers, and the probability of each customer requiring a visit. Since customer distances are what characterize instances of the TSP, we have used instances from the well known TSPLIB benchmark [178] for this type of information. For each TSP instance, we have then considered different customer probability configurations. Homogeneous PTSP instances have been generated by simply associating with a TSP instance a single probability value, common to all customers. For heterogeneous PTSP instances, different sets of customer probability values have been randomly generated according to a probability distribution.
We have considered 8 TSP instances, with n ranging from 100 to 1000, as reported in Table 4.1. For these instances we have computed Euclidean distances between customers, on the basis of the customer coordinates provided by the TSPLIB.
Customers probabilities have been generated as follows. For homogeneous PTSP instances the customer probability has been varied from 0.1 to 0.9 with an increment of 0.1. For heterogeneous PTSP instances, each customer probability p_i, i = 1, 2, . . . , n,


has been generated randomly, according to the beta probability distribution. We have chosen this probability distribution because it is defined on the finite interval [0, 1], and it can easily model different averages and variances of the customers probability values by choosing the appropriate parameters. The beta probability distribution β_{a,b}(p_i) is defined as follows:

    β_{a,b}(p_i) = [Γ(a + b) / (Γ(a) Γ(b))] · p_i^{a−1} (1 − p_i)^{b−1},    (4.5)

where p_i ∈ [0, 1], and a, b > 0 are parameters. The a and b parameters of the beta probability distribution determine the average customer probability p and the variance σ² around the average value:

    p = a / (a + b),    (4.6)

    σ² = ab / [(a + b)² (a + b + 1)].    (4.7)

In our benchmark we have generated different sets of customer probabilities by varying the average customer probability p and the variance σ². In order to have direct control on these parameters, rather than on a and b, we have used the inverse of Equations (4.6) and (4.7):

    a = p [p(1 − p) − σ²] / σ²,    (4.8)

    b = (1 − p) [p(1 − p) − σ²] / σ².    (4.9)

Note that the variance is limited by the relation σ² < p(1 − p) ≤ 0.25. Similarly to the homogeneous case, we have considered customer probability sets by varying p between 0.1 and 0.9, with an increment of 0.1. For every value of p, we have considered five different values of the variance, corresponding respectively to 1/6, 2/6, 3/6, 4/6, and 5/6 of the maximum variance, which is p(1 − p). More precisely, we have set σ² ∈ {p(1−p)/6, 2p(1−p)/6, 3p(1−p)/6, 4p(1−p)/6, 5p(1−p)/6}. For each probability, the different σ² values are also referred to as the 16%, 33%, 50%, 66%, and 83% of the maximum variance. In the plots presented in the remainder of this thesis, the variance values will be indicated by these percentages. The shapes of the corresponding beta probability distributions are shown in Figure 4.2.
Summarizing, for each of the 8 TSP instances, we have considered 54 customer probability configurations, which in total means 432 PTSP instances. Table 4.2 reports all the considered values of p and σ², including the ones corresponding to homogeneous instances (for which σ² = 0). The naming convention that we use throughout this thesis is illustrated in Table 4.3.
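As an illustration of how such heterogeneous probabilities can be generated, the following is a minimal C++ sketch (not taken from the thesis code; all names are ours) that draws n customer probabilities from a beta distribution with prescribed mean p and variance σ², using Equations (4.8)-(4.9) and the standard fact that X/(X+Y) ~ Beta(a, b) when X ~ Gamma(a, 1) and Y ~ Gamma(b, 1):

```cpp
#include <random>
#include <vector>

// Sketch: draw n heterogeneous customer probabilities from a beta(a, b)
// distribution with prescribed mean pbar and variance var. Requires
// 0 < var < pbar * (1 - pbar), as noted in the text.
std::vector<double> sample_customer_probs(int n, double pbar, double var,
                                          std::mt19937& rng) {
    // Equations (4.8)-(4.9): recover a and b from the mean and variance.
    double c = pbar * (1.0 - pbar) - var;
    double a = pbar * c / var;
    double b = (1.0 - pbar) * c / var;
    std::gamma_distribution<double> ga(a, 1.0), gb(b, 1.0);
    std::vector<double> p(n);
    for (int i = 0; i < n; ++i) {
        double x = ga(rng), y = gb(rng);
        p[i] = x / (x + y);  // X/(X+Y) ~ Beta(a, b)
    }
    return p;
}
```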

4.4 Lower bound of the optimal solution value

In evaluating the solution quality of algorithms for the PTSP, one faces the difficulty that the optimal solutions of problems are not known, in general. In fact, unlike the


[Figure 4.2 appears here: nine panels, one for each average probability p = 0.1, 0.2, . . . , 0.9, each plotting the density β_{a,b}(x) on [0, 1] for the five variance levels p(1−p)/6, 2p(1−p)/6, 3p(1−p)/6, 4p(1−p)/6, and 5p(1−p)/6.]
Figure 4.2: Shapes of the beta probability distributions used for generating customers probabilities in heterogeneous instances of our PTSP benchmark. The legend in the first panel specifies the type of line used for the different values of σ², and applies to all panels of the figure.

                           σ²
 p      0       16%      33%      50%      66%      83%
0.1     0      0.015    0.030    0.045    0.060    0.075
0.2     0      0.027    0.053    0.080    0.107    0.133
0.3     0      0.035    0.070    0.105    0.140    0.175
0.4     0      0.040    0.080    0.120    0.160    0.200
0.5     0      0.042    0.083    0.125    0.167    0.208
0.6     0      0.040    0.080    0.120    0.160    0.200
0.7     0      0.035    0.070    0.105    0.140    0.175
0.8     0      0.027    0.053    0.080    0.107    0.133
0.9     0      0.015    0.030    0.045    0.060    0.075

Table 4.2: Average customer probability p and variance σ² characterize the 54 customer probability configurations generated for our PTSP benchmark. Homogeneous PTSP instances correspond to the column with σ² = 0. Heterogeneous probability configurations have been obtained for each TSP instance of Table 4.1 by using the above values of p and σ² (≠ 0) for computing the a and b parameters (Equations (4.8)-(4.9)), and by generating sets of random customer probabilities according to the beta probability distribution (Equation (4.5)).

PTSP instance name    TSP instance name     n      p      σ²
kroA100.p10.v16       kroA100              100    0.1    p(1 − p)/6 = 0.015
kroA100.p20.v33       kroA100              100    0.2    2p(1 − p)/6 = 0.053
kroA100.p10           kroA100              100    0.1    0

Table 4.3: Three examples illustrating the naming convention we use for instances belonging to our PTSP benchmark, where n is the number of customers, p is the (average) customers probability, and σ² is the variance of the customers probability.


TSP, no benchmark of problems exists for which optimal solutions are known. When possible, a good alternative is to compare the solution quality of an algorithm with a known lower bound on the optimal solution, which yields an upper bound on the distance from the optimal solution. More precisely, suppose that, for a given instance of an optimization problem, the optimal solution value is E*, and a lower bound LB is available, such that, by definition,

    LB ≤ E*.    (4.10)

If the solution value obtained by an algorithm that we want to analyze is E, we can bound the relative error of the algorithm by the following inequality:

    (E − E*)/E* ≤ (E − LB)/LB,    (4.11)

which holds because E − E* ≤ E − LB and E* ≥ LB.
For the PTSP, we rely on the following lower bound on the optimal tour value E[L(λ*_PTSP)], proposed by Bertsimas and Howell [27]:

    z* ≤ E[L(λ*_PTSP)],    (4.12)

where z* is the optimal solution value of the (linear) transportation problem

    z* = min Σ_{i,j∈V, i≠j} x_{i,j} d(i, j),    (4.13)

    s.t.  Σ_{i∈V, i≠j} x_{i,j} = p_j (1 − Π_{k≠j} q_k)   for all j ∈ V,

          Σ_{j∈V, j≠i} x_{i,j} = p_i (1 − Π_{k≠i} q_k)   for all i ∈ V,

          x_{i,j} ≥ 0.
The values of the lower bound computed by solving this transportation problem for all
the instances of our PTSP benchmark are shown in Table 4.5.

4.5 Simple constructive heuristics

Given that the solution structure of the PTSP is equal to that of the TSP (a Hamiltonian tour), all constructive heuristics for the TSP can also be used to construct solutions for the PTSP. Of course, we do not expect such heuristics to find near-optimal solutions for the PTSP, mainly because they do not take into account the customers probabilities in their construction mechanism. Nevertheless, there are at least two reasons why considering simple TSP constructive heuristics is useful:

• in developing sophisticated algorithms based on metaheuristics and local search for solving the PTSP, the solution quality provided by simple TSP constructive heuristics is a sort of minimal requirement for the quality of the more sophisticated algorithms;


• when a local search algorithm is available for a problem, it might be the case that, even when applied to quite poor starting solutions such as those produced by simple TSP constructive heuristics evaluated with the PTSP objective function, the final solution found is very good.

In this thesis, we consider metaheuristics (ACO) and sophisticated local search algorithms for solving the PTSP; therefore, the above motivations are very relevant to us. The algorithms described and analyzed in Chapters 5, 6 and 7 will often be compared with selected TSP constructive heuristics, namely the Space Filling Curve, the Radial Sort, the Farthest Insertion, the Nearest Neighbor, and the Random Search heuristics.
The Space Filling Curve heuristic. The Space Filling Curve heuristic (SFC) for the TSP was introduced by Bartholdi and Platzman [13], and the main idea behind it is the following. Let C = {θ | 0 ≤ θ < 1} denote the unit circle, so that θ ∈ C represents a point on the circle at arc distance θ from a fixed reference point. Also let S = {(x, y) | 0 ≤ x ≤ 1, 0 ≤ y ≤ 1} denote the unit square. A continuous mapping ψ from C onto S is known as a spacefilling curve [2]. Suppose that ψ is such that lim_{θ→1} ψ(θ) = ψ(0), so that, as θ ranges from 0 to 1, ψ(θ) traces out a tour of all the points in S. Given n points in S to be visited (as in the TSP and PTSP on the unit square), the idea is to visit them in the same order as they appear along the spacefilling curve, or, more mathematically, to sort the customers according to their inverse images under ψ.
In order to yield a good heuristic for the TSP, the spacefilling curve must be chosen in such a way that its inverse function is fast to compute, and it must satisfy some geometrical properties. Details about SFC are described in [13], where a BASIC code of the algorithm is also reported. SFC constructs a solution in O(n log n) computation time.
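The specific curve of [13] has a fast inverse; as a self-contained C++ illustration of the sort-by-inverse-image idea only (not the curve actually used by SFC), the following sketch orders customers by their Morton (Z-order) key, a simple space-filling-curve index:

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

struct Point { double x, y; };

// Illustrative stand-in for the inverse image of a point under a
// spacefilling curve: interleave the bits of the (scaled) coordinates.
// Coordinates are assumed to lie in [0, 1).
static std::uint64_t morton_key(double x, double y) {
    std::uint64_t k = 0;
    std::uint32_t xi = (std::uint32_t)(x * (1u << 16));
    std::uint32_t yi = (std::uint32_t)(y * (1u << 16));
    for (int b = 15; b >= 0; --b)  // interleave bits of x and y
        k = (k << 2) | (((std::uint64_t)(xi >> b) & 1u) << 1)
                     |  ((std::uint64_t)(yi >> b) & 1u);
    return k;
}

// O(n log n): visit customers in order of their position along the curve.
std::vector<int> sfc_tour(const std::vector<Point>& c) {
    std::vector<int> tour(c.size());
    for (std::size_t i = 0; i < c.size(); ++i) tour[i] = (int)i;
    std::sort(tour.begin(), tour.end(), [&](int i, int j) {
        return morton_key(c[i].x, c[i].y) < morton_key(c[j].x, c[j].y);
    });
    return tour;
}
```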
SFC has been analyzed in [27] in the context of the PTSP, where it is shown that it is asymptotically very close to the re-optimization strategy if nodes are uniformly distributed on the unit square and a Euclidean metric is used for customer distances. In [27], SFC has also been used as the starting solution of two local search algorithms, which will be compared to our algorithms and described in Chapter 6.
The Radial Sort heuristic. Radial Sort (RAD) builds a tour by sorting customers by angle with respect to the center of mass of the customer spatial distribution. More precisely, the center of mass is the point whose coordinates (x_CM, y_CM) are computed by averaging over the customers coordinates:

    x_CM = (1/n) Σ_{i=1}^{n} x_i,    y_CM = (1/n) Σ_{i=1}^{n} y_i.    (4.14)

The angular position θ_i of customer i with respect to the center of mass is computed as follows:

    θ_i = arctan( (x_i − x_CM) / (y_i − y_CM) ).    (4.15)

The tour returned by Radial Sort visits customers in order of increasing angle θ_i.
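A minimal C++ sketch of Radial Sort (illustrative, not the thesis code; atan2 is used instead of a bare arctangent so that the angle is well defined on the whole circle):

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

struct Point { double x, y; };  // same Point type as in the SFC sketch

// Order customers by angle around their center of mass (Equations
// (4.14)-(4.15)); a customer coinciding with the center is ignored here.
std::vector<int> radial_sort(const std::vector<Point>& c) {
    double xcm = 0.0, ycm = 0.0;
    for (const Point& p : c) { xcm += p.x; ycm += p.y; }
    xcm /= c.size(); ycm /= c.size();
    std::vector<int> tour(c.size());
    for (std::size_t i = 0; i < c.size(); ++i) tour[i] = (int)i;
    std::sort(tour.begin(), tour.end(), [&](int i, int j) {
        return std::atan2(c[i].x - xcm, c[i].y - ycm)
             < std::atan2(c[j].x - xcm, c[j].y - ycm);
    });
    return tour;  // visit customers in order of increasing angle
}
```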


The Farthest Insertion heuristic. Farthest Insertion (FI) [125] builds a tour by starting with a subtour on a small number of nodes, and by extending the subtour by inserting the remaining nodes one after the other, each in the cheapest position, choosing each time the node which is farthest from the subtour.
In our implementation, the starting subtour is made of a single node (the one labeled 0), and the node selected for insertion at each iteration is the one farthest from the last inserted node, as in the sketch below.
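A minimal C++ sketch of this FI variant (illustrative, not the thesis code; d is the full distance matrix):

```cpp
#include <limits>
#include <vector>

// Start from node 0; repeatedly pick the node farthest from the last
// inserted one and insert it in the position of the current subtour that
// increases the subtour length least.
std::vector<int> farthest_insertion(const std::vector<std::vector<double>>& d) {
    int n = (int)d.size();
    std::vector<int> tour = {0};
    std::vector<bool> in(n, false);
    in[0] = true;
    int last = 0;
    for (int step = 1; step < n; ++step) {
        int next = -1;
        for (int v = 0; v < n; ++v)  // farthest from the last inserted node
            if (!in[v] && (next < 0 || d[last][v] > d[last][next])) next = v;
        std::size_t best = 0;
        double best_inc = std::numeric_limits<double>::max();
        for (std::size_t i = 0; i < tour.size(); ++i) {  // cheapest position
            int u = tour[i], w = tour[(i + 1) % tour.size()];
            double inc = d[u][next] + d[next][w] - d[u][w];
            if (inc < best_inc) { best_inc = inc; best = i + 1; }
        }
        tour.insert(tour.begin() + best, next);
        in[next] = true;
        last = next;
    }
    return tour;
}
```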
The Nearest Neighbor heuristic. Nearest Neighbor (NN) constructs a tour by choosing as next customer the nearest not yet visited customer. Our implementation of NN constructs n tours, each starting from a different customer, and returns the tour with the smallest expected length, evaluated with the PTSP objective function; a sketch is given below.
An adapted version of this heuristic which takes into account customers probabilities has been analyzed in [159]. In [27], it has been shown that NN is very poor on average for the PTSP when customers are uniformly distributed on the unit square and a Euclidean metric is used for customer distances.
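A minimal C++ sketch of this NN variant (illustrative, not the thesis code); expected_length() is our own inline O(n²) evaluation of the PTSP objective, in which each arc is weighted by the probability that it is actually traveled (both endpoints require a visit, the customers between them on the a priori tour do not):

```cpp
#include <vector>

// Exact PTSP expected tour length, O(n^2): sum over all ordered arcs
// (t[j], t[j+r+1]) of d weighted by p[u] * p[v] * prod of (1 - p) over
// the r skipped customers; indices wrap around modulo n.
double expected_length(const std::vector<std::vector<double>>& d,
                       const std::vector<double>& p,
                       const std::vector<int>& t) {
    int n = (int)t.size();
    double e = 0.0;
    for (int j = 0; j < n; ++j) {
        double skip = 1.0;
        for (int r = 0; r <= n - 2; ++r) {
            int u = t[j], v = t[(j + r + 1) % n];
            e += d[u][v] * p[u] * p[v] * skip;
            skip *= 1.0 - p[v];
        }
    }
    return e;
}

// Build one NN tour per starting customer; keep the one with the
// smallest expected length under the PTSP objective.
std::vector<int> nearest_neighbor(const std::vector<std::vector<double>>& d,
                                  const std::vector<double>& p) {
    int n = (int)d.size();
    std::vector<int> best;
    double best_cost = 0.0;
    for (int s = 0; s < n; ++s) {
        std::vector<int> tour = {s};
        std::vector<bool> used(n, false);
        used[s] = true;
        while ((int)tour.size() < n) {
            int cur = tour.back(), nxt = -1;
            for (int v = 0; v < n; ++v)
                if (!used[v] && (nxt < 0 || d[cur][v] < d[cur][nxt])) nxt = v;
            tour.push_back(nxt);
            used[nxt] = true;
        }
        double c = expected_length(d, p, tour);
        if (best.empty() || c < best_cost) { best = tour; best_cost = c; }
    }
    return best;
}
```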
The Random Search heuristic. Random Search (RS) is perhaps the simplest heuristic. It builds a solution by starting from a random customer and iteratively choosing as next customer one at random among the not yet visited ones. In comparing this heuristic with more sophisticated algorithms, a meaningful choice is to let RS run for the same computation time as the algorithms it is compared with.

4.5.1 Experimental analysis

We present here the results obtained on the PTSP benchmark by the simple TSP constructive heuristics described above. These results will also be used in the following sections of this thesis, as a term of comparison with our ACO metaheuristics. Experiments have been run on a machine with two Intel(R) Xeon(TM) CPU 1.70 GHz processors, running the GNU/Linux Debian 2.4.27 operating system. All algorithms have been coded in C++ under the same development framework. Each heuristic was allowed a runtime equal to n²/100 seconds (with n being the number of customers of a PTSP instance).
Table 4.4 reports average results over the whole PTSP benchmark. The solution quality achieved by a heuristic is expressed in terms of the percentage by which the best value found exceeds the value of the lower bound (second column of the table). More precisely, for each heuristic heur and for each PTSP instance inst, the percentage above the lower bound has been computed as

    % above LB(inst) = ( E[L(λ_heur,inst)] − LB(inst) ) / LB(inst),    (4.16)

where E[L(λ_heur,inst)] is the expected cost (that is, the PTSP objective value) of the best solution found by heuristic heur on the PTSP instance inst. The second column of Table 4.4 shows the average percentage above the lower bound over all PTSP instances,

heuristic    % above LB    time (seconds)
FI              90.2%            0.1
SFC            111.1%            0.1
NN             111.6%           35.5
RAD            388.8%            0.1
RS            1228.6%         2712.0
Table 4.4: Average results of the heuristics for the TSP applied to the PTSP benchmark.
while the third column reports the average time required by each heuristic to find its best solution. Note that comparisons with respect to the lower bound are useful for having an objective and repeatable way of evaluating an algorithm for the PTSP, but they do not indicate the amount by which an algorithm really deviates from optimal solutions. The reason is that we do not know how near the lower bound is to optimal solutions.
As expected, RS is the worst one, both in terms of solution quality and time. All the other heuristics were quite fast, with NN being the slowest. The reason why NN is so slow with respect to the other heuristics (except RS) is that, in our implementation of NN, we have generated n different solutions, considering each time a different starting customer, and all these n solutions have been evaluated with the PTSP objective function. On the contrary, the other simple heuristics (except RS) generate just one single solution.

 p      σ² = 0        16%          33%          50%          66%          83%

kroA100
0.1     1709.10      3046.08      3749.19      3602.53      5936.83      6476.77
0.2     3418.30      4100.97      6470.23      6026.88      5124.01      9420.66
0.3     5127.45      6301.18      6794.94      8693.83      8497.56     10210.88
0.4     6836.60      8429.70      8791.15     10315.35     10534.56     10244.26
0.5     8545.75      9754.43     10218.20     10102.49      9861.52     12483.14
0.6    10254.91     10284.78     11442.14     11176.49     13435.22     13438.37
0.7    11964.06     12326.80     13040.19     13087.34     14119.99     14678.74
0.8    13673.21     14419.79     14398.59     14431.74     16623.24     14786.36
0.9    15382.36     16050.75     15994.13     16382.99     15523.36     15323.54

eil101
0.1       58.11        73.71        93.00        93.04       133.67       160.42
0.2      116.23       139.24       171.47       175.11       190.24       198.62
0.3      174.35       197.10       220.63       217.30       250.27       282.37
0.4      232.46       245.70       301.76       286.42       329.05       290.38
0.5      290.58       309.33       338.21       352.59       331.27       336.15
0.6      348.69       377.13       345.66       401.28       366.17       438.00
0.7      406.81       416.54       417.53       439.06       434.64       413.83
0.8      464.93       477.86       474.31       489.49       505.13       512.06
0.9      523.04       522.45       528.18       535.83       534.22       537.92

ch150
0.1      556.14       880.64       975.37      1439.81      1710.72      1625.37
0.2     1112.28      1513.44      1612.35      2105.61      2173.89      2620.86
0.3     1668.42      1920.28      2263.51      2507.08      2591.19      3059.95
0.4     2224.56      2513.76      2809.98      2915.05      3321.94      3325.32
0.5     2780.70      3004.14      3136.32      3198.81      3665.11      3319.95
0.6     3336.84      3622.58      3659.43      3809.46      4007.59      4175.02
0.7     3892.98      4198.73      4247.70      4171.51      4509.64      4612.88
0.8     4449.12      4592.32      4706.49      4743.34      4955.88      4862.40
0.9     5005.26      5064.83      5209.75      5154.09      5249.80      5083.59

d198
0.1     1064.02      2141.26      2278.54      2449.09      2323.21      4857.64
0.2     2128.03      2388.82      2964.04      2755.41      3385.44      4391.14
0.3     3192.04      3871.18      3415.73      4457.03      7155.86      5013.44
0.4     4256.06      4896.63      6255.30      4607.99      7351.09      6176.07
0.5     5320.07      6151.70      5919.38      5475.46      5828.26      7830.80
0.6     6384.09      6508.36      7300.36      7928.06      7764.63      8273.28
0.7     7448.10      7408.01      8466.25      8786.42      9152.07      9065.00
0.8     8512.12      8948.07      8038.98      9310.76      8839.78      9693.90
0.9     9576.13      9535.53     10009.58     10276.10     10018.95     10081.61

lin318
0.1     2730.06      6041.33      7957.71      7874.08     11944.43     12287.68
0.2     5460.11      9132.71     10408.27     13731.64     13718.98     15270.89
0.3     8190.17     11208.81     14177.21     15683.86     17407.57     17945.20
0.4    10920.22     13919.39     16229.05     19063.22     19559.90     21746.95
0.5    13650.27     16332.09     17159.94     19623.45     22333.14     25434.79
0.6    16380.33     19972.60     21346.71     23115.39     23603.75     25870.44
0.7    19110.38     21908.55     23232.24     23834.18     24154.62     25054.59
0.8    21840.44     24802.40     24251.09     25839.17     26770.53     27827.87
0.9    24570.49     26030.73     26597.94     27371.21     27692.71     27449.76

att532
0.1     7122.53     11676.86     14685.17     17183.52     20708.56     21741.35
0.2    14245.05     18985.72     22303.80     25554.69     28117.06     30282.42
0.3    21367.58     26330.13     28601.60     30680.62     34881.71     37100.20
0.4    28490.10     32495.83     36310.92     38035.95     43216.89     42008.26
0.5    35612.63     41615.19     40915.18     44604.81     46501.26     50847.65
0.6    42735.15     45435.27     46825.00     49963.04     50437.37     54291.68
0.7    49857.68     52322.56     55238.08     56307.43     59166.41     58578.00
0.8    56980.21     58741.72     60574.31     64628.56     62711.97     63619.20
0.9    64102.73     65035.06     64913.32     66682.62     67241.73     69304.29

rat783
0.1      747.59      1185.97      1558.38      1511.61      1915.01      1872.50
0.2     1495.19      2034.14      2245.76      2601.12      2563.15      3198.61
0.3     2242.78      2656.69      2981.47      3391.91      3619.27      3875.69
0.4     2990.38      3337.11      3839.62      4046.29      4210.95      4187.43
0.5     3737.97      4044.51      4314.19      4860.15      5043.91      5203.94
0.6     4485.56      4797.07      5170.12      5266.92      5674.25      5398.97
0.7     5233.16      5459.45      5707.45      5847.90      5918.57      5996.61
0.8     5980.75      6180.58      6259.45      6500.02      6551.43      6566.91
0.9     6728.35      6895.74      6967.85      6996.76      7108.17      6991.72

dsj1000
0.1   1480976.00   2413777.00   2741702.00   3375467.00   4028898.00   4327180.00
0.2   2961951.00   3782654.00   4523072.00   5127926.00   5778787.00   6450438.00
0.3   4442927.00   5447787.00   5975096.00   6389477.00   7436660.00   7400875.00
0.4   5923903.00   6978520.00   7315055.00   8011461.00   8898664.00   8798043.00
0.5   7404878.00   8113792.00   8625677.00   9405494.00   9744898.00   9878380.00
0.6   8885854.00   9653119.00   9820027.00  10557770.00  10992180.00  11566720.00
0.7  10366830.00  10971300.00  11217690.00  11729110.00  12062110.00  12047830.00
0.8  11847810.00  12246860.00  12785590.00  12903120.00  13104130.00  13244230.00
0.9  13328780.00  13609660.00  13744190.00  13890010.00  13891700.00  14245880.00

Table 4.5: Lower bound of the optimal PTSP solutions for our PTSP benchmark. For each TSPLIB instance, rows correspond to the average customers probability p, and columns to the variance σ² of the customers probability (σ² = 0 for homogeneous instances; otherwise expressed as a percentage of the maximum variance, as in Section 4.3).

Chapter 5

Ant Colony Optimization


In our case study on metaheuristics for SCOPs, we have chosen to focus on ACO for at least two reasons. First of all, ACO has been shown to be successful on several difficult combinatorial optimization problems, and particularly on many routing problems, the class to which the PTSP belongs. Second, at the time the research for this thesis started, there were still no applications of ACO algorithms to SCOPs, and the investigation of the potential of ACO for solving problems from the SCOP domain was a completely open issue.
In this chapter we propose several ACO algorithms, beginning, in Section 5.1, with one which is a straightforward extension to the PTSP of an ACO algorithm originally developed for the TSP. We will show experimentally that this first ACO version, which exploits the exact PTSP objective function, already obtains quite good results, outperforming all the simple heuristics. We will also show that the performance of ACO strongly depends on the characteristics of the PTSP instance, and that if the stochasticity of an instance is below a certain level, it is better to treat it as a (deterministic) TSP. In Section 5.2 we investigate some issues related to the use of objective function approximations inside ACO. As we have seen in the first part of this thesis (Section 2.4), there are two types of possible approximations: ad hoc and sampling approximations. Here, we will consider both types of approximations, and analyze different aspects of each. For ad hoc approximations, we will see how different precision measures are correlated with the solution quality of an ACO algorithm that uses them. Moreover, we will take advantage of the fact that the PTSP objective function is known exactly to measure the effectiveness of using the sampling approximation inside ACO. A critical overview of the obtained results is given in Section 5.3.

5.1 A straightforward implementation

Because of the structural similarity between the PTSP and the TSP (the solution structure is the same, only the cost of a solution is different), as a first implementation of an ACO algorithm for the PTSP we consider an adaptation to the PTSP of the ACS algorithm by Dorigo and Gambardella [69], which was successfully applied to the TSP. We call this algorithm probabilistic ACS, or pACS.


In this section, we describe in detail how pACS and ACS work, and we show a preliminary experimental evaluation of pACS that also motivates the investigations of the subsequent sections.

5.1.1 The pACS algorithm

The main principles of the ACO metaheuristic have been described in Section 3.1. Here, we focus on one particular ACO algorithm, pACS, whose pseudocode skeleton is shown by Algorithm 8. The procedures Initialization, ConstructAntSolution, and GlobalPheromoneUpdate work as follows.

Algorithm 8 pACS
1:  Initialization
2:  for iteration k = 1, 2, . . . do
3:      Initialize best ant solution λ_BA
4:      for ant a = 1, 2, . . . , m do
5:          ConstructAntSolution [each ant constructs its solution λ_a]
6:          if E[L(λ_a)] < E[L(λ_BA)] then
7:              set λ_BA = λ_a
8:          end if
9:      end for
10:     if E[L(λ_BA)] < E[L(λ_BSF)] then
11:         set λ_BSF = λ_BA
12:     end if
13:     GlobalPheromoneUpdate
14: end for
Initialization consists of four operations: the positioning of the m ants on their starting customers, the initialization of the best-so-far solution λ_BSF, the computation and initialization of the heuristic information η, and the initialization of the pheromone τ. Note that η and τ are bidimensional matrices of information, where η_ij and τ_ij are the values of the heuristic information, respectively of the pheromone, on the arc (i, j) that goes from customer i to customer j. Initialization is done as follows in pACS: the starting customer of each ant is chosen randomly; the heuristic information is set so that η_ij = 1/d_ij, where d_ij is the distance between i and j; the pheromone values are all set equal to τ_0, which is computed according to

    τ_0 = 1 / (n · E[L(λ_FI)]),    (5.1)

where E[L(λ_FI)] is the expected length of the tour constructed by the Farthest Insertion heuristic (see Section 4.5). As noted in [69], any very rough approximation of the optimal solution value would suffice in the denominator of Equation (5.1); therefore, the expected cost of other heuristics could be used for the initialization of τ_0. Observe that, in this simple pACS version, η is a static information that is computed just once, during the initialization phase.


ConstructAntSolution is the procedure by which each ant probabilistically builds a tour by choosing the next customer to move to on the basis of two types of information: the pheromone τ and the heuristic information η. When an ant a is on city i, the next city is chosen as follows.

• With probability q_0 (a parameter), a city j that maximizes τ_ij · η_ij^β is chosen in the set J_a(i) of the cities not yet visited by ant a. Here, β is a parameter which determines the relative influence of the heuristic information.

• With probability 1 − q_0, a city j is chosen randomly with a probability given by

    p_a(i, j) = τ_ij · η_ij^β / Σ_{r∈J_a(i)} τ_ir · η_ir^β   if j ∈ J_a(i),   and 0 otherwise.    (5.2)

Hence, with probability q_0 the ant chooses the best city according to the pheromone trail and to the distance between cities, while with probability 1 − q_0 it explores the search space in a biased way.
The procedure ConstructAntSolution also takes care of the local pheromone update: each ant, after it has chosen the next city to move to, applies the following local update rule:

    τ_ij ← (1 − ρ) · τ_ij + ρ · τ_0,    (5.3)

where τ_0 is the initial pheromone parameter, and ρ, 0 < ρ ≤ 1, is another parameter. The effect of the local updating rule is to make an arc which has already been chosen by an ant less desirable, so that the exploration of different tours is favored during one iteration of the algorithm.
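A minimal C++ sketch of this state transition rule (the pseudorandom proportional rule); names and data layout are illustrative, not the thesis code:

```cpp
#include <cmath>
#include <random>
#include <vector>

// tau and eta are the pheromone and heuristic matrices; feasible is the
// set J_a(i) of cities not yet visited by the ant; beta and q0 are the
// usual parameters.
int choose_next_city(int i,
                     const std::vector<int>& feasible,
                     const std::vector<std::vector<double>>& tau,
                     const std::vector<std::vector<double>>& eta,
                     double beta, double q0, std::mt19937& rng) {
    std::uniform_real_distribution<double> u(0.0, 1.0);
    std::vector<double> w(feasible.size());
    for (std::size_t k = 0; k < feasible.size(); ++k)
        w[k] = tau[i][feasible[k]] * std::pow(eta[i][feasible[k]], beta);
    if (u(rng) < q0) {  // exploitation: argmax of tau * eta^beta
        std::size_t best = 0;
        for (std::size_t k = 1; k < w.size(); ++k)
            if (w[k] > w[best]) best = k;
        return feasible[best];
    }
    // biased exploration: draw j with probability proportional to w (Eq. 5.2)
    std::discrete_distribution<int> pick(w.begin(), w.end());
    return feasible[pick(rng)];
}
```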
After each ant has built its solution, the best ant solution λ_BA is updated and stored (steps 6 to 8 of Algorithm 8) for future comparison with the best-so-far solution λ_BSF (steps 10 to 12 of Algorithm 8). After λ_BSF has also been updated, the GlobalPheromoneUpdate procedure is applied to modify the pheromone on the arcs belonging to λ_BSF with the following global updating rule

    τ_ij ← (1 − α) · τ_ij + α · Δτ_ij,    (5.4)

where

    Δτ_ij = 1 / E[L(λ_BSF)],    (5.5)

with 0 < α ≤ 1 being the pheromone decay parameter, and E[L(λ_BSF)] the expected cost of the best-so-far solution.
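A minimal C++ sketch of the two update rules, with ρ the local and α the global evaporation factor (illustrative, not the thesis code):

```cpp
#include <vector>

// Local update, Equation (5.3): applied by an ant right after it has
// traversed arc (i, j) during solution construction.
void local_update(std::vector<std::vector<double>>& tau,
                  int i, int j, double rho, double tau0) {
    tau[i][j] = (1.0 - rho) * tau[i][j] + rho * tau0;
}

// Global update, Equations (5.4)-(5.5): applied once per iteration to the
// arcs of the best-so-far tour, whose expected cost is passed in.
void global_update(std::vector<std::vector<double>>& tau,
                   const std::vector<int>& best_tour,
                   double alpha, double expected_cost_best) {
    double dtau = 1.0 / expected_cost_best;  // Eq. (5.5)
    int n = (int)best_tour.size();
    for (int k = 0; k < n; ++k) {
        int i = best_tour[k], j = best_tour[(k + 1) % n];
        tau[i][j] = (1.0 - alpha) * tau[i][j] + alpha * dtau;  // Eq. (5.4)
    }
}
```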
As we have already pointed out, pACS is a straightforward adaptation of the ACS algorithm originally designed for the TSP [69]. Basically, if in the above description of pACS (in Equations (5.1) and (5.5), and in steps 6 to 10 of Algorithm 8) we substitute the expected length of a solution with its length, we obtain ACS. Thus, from the point of view of the code, pACS is very similar to ACS. Note, however, that from the complexity point of view there are more important differences, since the expected


length of a solution needs O(n²) time to be computed, while the length of a solution only requires O(n) time. It is useful here to consider in more detail the asymptotic time complexity of each iteration of pACS, as a function of the number of customers n and of the number of ants m:

    t(one iteration of pACS or ACS) = m · t(ConstructAntSolution)
                                    + m · t(computation of the objective value)
                                    + t(GlobalPheromoneUpdate).    (5.6)

The procedures ConstructAntSolution and GlobalPheromoneUpdate require respectively O(n²) and O(n) time, both in pACS and in ACS. Thus, we have

    t(one iteration of pACS) = 2m · O(n²) + O(n) = O(n²),    (5.7)

    t(one iteration of ACS) = m · O(n²) + (m + 1) · O(n) = O(n²).    (5.8)

The asymptotic time complexity is the same in the two algorithms (O(n²)), but the constants involved are different. In practice, if the same computation time is allowed, pACS will be able to perform far fewer iterations than ACS. These observations motivate the investigation, done in Section 5.2, of other variants of ACS for solving the PTSP, where fast approximations of the objective function are used instead of the exact one based on the expected length of a solution.

5.1.2 Experimental analysis

5.1.2.1 Computational environment

All the experiments reported in this Section have been run on a machine with two
processors Intel(R) Xeon(TM) CPU 1.70GHz, running the GNU/Linux Debian 2.4.27
operating system. All algorithms have been coded in C++ under the same development
framework.
5.1.2.2 Tuning

The tunable parameters of pACS, as described in the previous section, are:

• m, the number of ants;
• q_0, the probability that the next customer is chosen deterministically;
• β, the heuristic information exponent;
• ρ, the local evaporation factor;
• α, the global evaporation factor.
We have verified empirically that pACS is not very sensitive to the parameters m, ρ, and α, when these have values near the ones suggested by Dorigo and Stützle [72] for the ACS algorithm applied to the TSP. Thus, according to [72], we set m = 10 and ρ = α = 0.1. Tuning of the parameters q_0 and β was done considering, respectively, three and four different values, as reported in Table 5.1. This results in twelve different pACS configurations to be run on a set of tuning PTSP instances.

[Figure 5.1 appears here: the four beta densities β_{1,1} (p = 0.5, σ² = 0.08), β_{0.5,0.5} (p = 0.5, σ² = 0.13), β_{1.5,3} (p = 0.33, σ² = 0.04), and β_{3,1.5} (p = 0.67, σ² = 0.04), plotted against the random customer probability.]

Figure 5.1: Probability distributions used for randomly generating heterogeneous customer probabilities in the tuning PTSP instances.
q_0 : 0.95, 0.98, 1
β  : 1, 2, 3, 4

Table 5.1: Values of the q_0 and β parameters considered for the tuning of pACS.
The set of tuning PTSP instances has been constructed in the following way. Customers coordinates are taken from two instances of the TSPLIB [178], namely eil51.tsp and pcb442.tsp, with respectively 51 and 442 customers. Customers probabilities have been generated according to different probability configurations, both homogeneous and heterogeneous. Homogeneous probabilities p belong to the set {0.25, 0.5, 0.75}. Heterogeneous probabilities have been generated according to the beta probability distribution (Equation (4.5)), with four pairs of a and b parameters, corresponding to the four different probability distributions plotted in Figure 5.1. The customers probability configurations considered for each of the two TSP instances of the tuning set are summarized in Table 5.2. Note that these tuning instances do not belong to the PTSP benchmark described in Section 4.3.
The pACS algorithm was run for n²/100 CPU seconds on each tuning instance, and a summary of the results is reported in Figure 5.2. The best choice appears to be q_0 = 0.95 and β = 3. A summary of the parameter set used for the experimental evaluation of pACS is reported in Table 5.3.


Probability distribution       p       σ²
δ(p − 0.25)                   0.25     0
δ(p − 0.5)                    0.5      0
δ(p − 0.75)                   0.75     0
β_{0.5,0.5}(·)                0.5      0.13
β_{1,1}(·)                    0.5      0.08
β_{1.5,3}(·)                  0.33     0.04
β_{3,1.5}(·)                  0.67     0.04

Table 5.2: Average customers probability p and variance σ² characterize the customers probability configurations generated for the PTSP tuning set of pACS. The first three rows correspond to homogeneous PTSP instances, and the last four to heterogeneous PTSP instances. The beta probability distribution β_{a,b}(·) is defined by Equation (4.5) in Section 4.3.
m = 10    q_0 = 0.95    ρ = 0.1    α = 0.1

Table 5.3: Parameter set of pACS chosen after tuning.
5.1.2.3 Results

The first requirement for a metaheuristic such as pACS is that it performs better than other, simpler heuristics (otherwise there would be no point in developing a metaheuristic). This requirement is satisfied by pACS, as is clear from a comparison with the simple heuristics of the average results over the whole PTSP benchmark in Table 5.4. The simple heuristics find results that are on average more than 90% above the lower bound on the optimal solution (LB) (as computed in Section 4.4), while pACS is on average just 81.2% above LB. However, the time required by pACS to achieve its performance is much higher than for the other heuristics (except for RS (Random Search)). This is quite obvious, since pACS generates a very high number of solutions in its iterations, until the available run time is over, while the other simple heuristics (except for RS) generate at most n solutions and then stop.
The performance of pACS with respect to the lower bound of optimal PTSP solutions is graphically shown in Figure 5.3, where the relative difference between the best solution found by pACS and the lower bound is plotted as a function of three different parameters characterizing PTSP instances, namely the average customers probability, the customers probability variance, and the number of customers. We remark that comparisons with respect to the lower bound are useful for having an objective and repeatable way of evaluating an algorithm for the PTSP, but they do not indicate the amount by which an algorithm really deviates from optimal solutions. The reason is that we do not know how near the lower bound is to optimal solutions.

[Figure 5.2 appears here: average best solution value on the tuning instances, plotted against the value of the β parameter, with one curve for each q_0 ∈ {0.95, 0.98, 1}.]

Figure 5.2: Tuning results of pACS obtained by varying the q_0 and β parameters. The best choice appears to be q_0 = 0.95 and β = 3.

algorithm    % above LB    time (seconds)
pACS            81.2%          2453.7
FI              90.2%             0.1
SFC            111.6%             0.1
NN             111.1%            35.5
RAD            388.8%             0.1
RS            1228.6%          2712.0

Table 5.4: Aggregated results showing average values over all instances of the PTSP benchmark.

[Figure 5.3 appears here: three panels plotting (pACS − LB)/LB against the average customers probability, the variance of the customers probability, and the number of customers.]
Figure 5.3: Relative distance of the best result found by pACS from the lower bound of the optimal PTSP solution (LB). On the horizontal axis there is, respectively, the average customers probability (left), the variance of the customers probability (center), and the number of customers (right). Variance values correspond to percentages of the maximum variance of the beta probability distribution used to generate random customers probabilities, as explained in Section 4.3.
For example, Figure 5.3 shows that the average distance from the lower bound is bigger for smaller customers probabilities, but this does not necessarily mean that pACS finds poorer solutions when the customers probability is small; in fact, it could also be the case that, for this type of PTSP instances, the lower bound underestimates the optimal solutions more than for instances characterized by high customers probabilities.
There is a third type of comparison which is of great importance for evaluating the usefulness of a metaheuristic for the PTSP in general, and of pACS in particular. This is the comparison with the solution quality of optimal TSP solutions evaluated by the PTSP objective function. If the performance of an algorithm developed on purpose for the PTSP were not superior to that of optimal TSP solutions, there would be no point in developing such an algorithm. This type of comparison is particularly important here because the TSP is one of the most well known problems in the field of operations research, and very powerful, freely available algorithms exist for solving even very big TSP instances to optimality. In this thesis, we have used the Concorde software [63] for finding the optimal solutions of the TSP instances from which our PTSP benchmark has been generated (see Section 4.3). A boxplot of the ranks of our algorithms, including the optimal TSP solutions, is shown in Figure 5.4. The meaning of the vertical line joining two algorithms on the left of the plot is the following: according to the Friedman two-way analysis of variance by ranks [64], the difference between the two algorithms is not statistically significant at a confidence level of 95%. Thus, if two algorithms are not linked by the vertical line, their difference is statistically significant. The figure shows that pACS is indeed better than optimal TSP solutions.
It is now interesting to see how the relative gain of pACS with respect to optimal TSP solutions varies with different instance characteristics. This is shown in Figure 5.5. The figure reveals at least two important facts.


[Figure 5.4 appears here: boxplot of the ranks of pACS, TSP, FI, NN, SFC, RAD, and RS.]
Figure 5.4: Boxplot of the distribution of ranks of pACS, TSP (the optimal solution of the TSP corresponding to the PTSP), and the simple heuristics described in Section 4.5. The vertical line on the left of the plot which joins NN and SFC means that these two algorithms are not significantly different at a confidence level of 95%, according to the Friedman two-way analysis of variance by ranks [64]. The interpretation of a box is the following: the solid line inside the box is the median rank, the limits of the box are the lower and upper quartiles, the whiskers are the smallest and largest values (outliers excepted), and the circles are the outliers.

[Figure 5.5 appears here: three panels plotting the relative gain (pACS − TSP)/TSP against the average customers probability, the variance of the customers probability, and the number of customers.]

Figure 5.5: Relative gain of the best solution found by pACS over optimal TSP solutions, versus, respectively, the average customers probability (left), the variance of the customers probability (center), and the number of customers (right). Circles correspond to the average relative gain, and the length of the error bars equals its standard deviation.

[Figure 5.6 appears here: two panels plotting the critical probability against the number of customers n (left) and against the variance of the customers probability (right).]

Figure 5.6: Critical probability below which pACS finds solutions whose quality is
better than the quality of the optimal TSP solution. Circles correspond to the average
critical probability, and the length of error bars equals its standard deviation.
First, the performance of pACS is very different for different average customers probabilities. In particular, there is a critical probability (circa 0.5) below which it is really worth solving PTSP instances by pACS, while above the critical probability the problem is better treated as a TSP. Second, the variance of the customers probability also has a great influence on the performance of pACS, since the higher the variance, the bigger the gain of pACS with respect to the optimal TSP solution. Also the number-of-customers factor, shown in the right part of Figure 5.5, seems to influence the performance of pACS, which performs better for smaller numbers of customers.
It can be useful to have a closer look at the critical probability at which pACS ceases to find better solutions than the optimal TSP. In Figure 5.6, we have plotted the critical probability while varying the number of customers and the variance of the customers probability. It seems that the critical probability goes from a maximum of about 0.8 for instances with a small number of customers to a minimum of about 0.3 for big instances, independently of the variance of the customers probability. Detailed results obtained by pACS are shown in Table A.1 in Appendix A.

5.2 The use of objective function approximations in ACO

5.2.1 Motivation

As we have seen, the objective function of the PTSP is computationally expensive, since it needs to consider all the O(n²) arcs joining pairs of customers. For this reason, it is worth investigating the design of fast approximations of the objective function, to be used in optimization algorithms, and in ACO in particular, for speeding up, and thus possibly improving, the optimization process.
In this section we present and analyze three types of objective function approximations and three corresponding variations of the pACS algorithm. The first approximation consists in using the TSP objective function instead of the PTSP one, that is, in using the length instead of the expected length of an a priori tour. We call this type of approximation the TSP approximation. The second approximation is based on the observation that, for a given a priori solution, some edges have a very small probability of actually being traveled, and may therefore be neglected. We call this type of approximation the Depth approximation (this naming convention will become clear later in this chapter). These first two approximations belong to the class of ad hoc approximations described in Section 2.4.1. The third type of approximation corresponds to the Sampling approximation introduced in Section 2.4.2, and consists in computing the sample average (Equation (2.13)) of an a priori tour.
There are several questions of interest when studying the use of objective function approximations inside an ACO algorithm, such as: verifying whether a fast approximation allows the algorithm to find better solutions than using the exact objective; investigating the effectiveness of different ways of alternating the use of the approximated and the exact objective; investigating how the tuned parameters of the algorithm change when using objective function approximations; and asking which measure is more appropriate to guess the effectiveness of an approximation (error, absolute error, correlation with the exact objective, speed, or others). Some of these questions have already been addressed in the context of the PTSP, and it is thus uninteresting to address them again here. In particular, Branke and Guntsch [51, 52] propose the Depth approximation and investigate its effectiveness inside ACO. They show that this approximation accelerates convergence without significantly worsening the solution quality. Gutjahr [100] briefly addresses the PTSP as a pre-test for verifying the performance of S-ACO, an algorithm that uses the Sampling approximation with a variable number of samples for estimating the objective function. In [100], S-ACO has only been tested on very small PTSP instances, since the goal was then to apply it to a problem where there is no closed-form expression for the objective, and estimation by simulation is the only alternative. Here, the questions we want to address by considering the two ad hoc approximations (TSP and Depth approximations) and the Sampling approximation are the following:
1. by means of the TSP and Depth approximations, we want to see which precision measure of an ad hoc approximation is more appropriate to guess its effectiveness, or, in other terms, we want to see which precision measure is more strongly correlated with the solution quality of an ACO algorithm, when ACO uses these ad hoc approximations;

2. we want to verify the solution quality achieved by an ACO algorithm based on the Sampling approximation.

The usefulness of answering the first question is that, when facing a SCOP different from the PTSP, one may be in a situation where several ad hoc approximations can be used, and implementing and testing each of them inside ACO in order to find the best one could be a very time consuming task. Thus, criteria for selecting a small set of promising ad hoc approximations before their actual implementation inside ACO can save a lot of time.
The interest in the second question is specific to the type of approximation. In fact, the Sampling approximation is a very general type of approximation that can be used in any SCOP, especially when a closed-form expression for the objective function is missing. In these situations, though, it is difficult to evaluate the true performance of the algorithm, precisely because an exact, closed-form expression for the objective function is missing. Therefore, the PTSP gives a good occasion to evaluate the performance of a sampling-based algorithm. In a certain sense, our goal is to extend the investigation of [100], which was limited to very few small PTSP instances.
The remainder of this section is organized as follows. The TSP, Depth, and Sampling approximations are described, together with their integration into pACS, in Sections 5.2.2, 5.2.3, and 5.2.4, respectively. In Section 5.2.5 we answer the two questions described above by experimental investigation.

5.2.2 The pACS-T algorithm based on the TSP approximation

5.2.2.1 The TSP approximation

The TSP approximation consists in using the length of an a priori tour instead of its expected length. Thus, given an a priori tour λ, the TSP approximation is based on the computation of

    L(λ) = Σ_{i=1}^{n−1} d(λ(i), λ(i+1)) + d(λ(n), λ(1)).    (5.9)

This approximation can be computed in just O(n) time, which is much faster than the O(n²) computation time of the PTSP objective function. Nevertheless, it is not very accurate, since it does not consider the customers probabilities. The TSP approximation is particularly inaccurate for PTSP instances with low customers probabilities, but it is quite precise for instances with customers probabilities near 1.
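For concreteness, a one-function C++ sketch of Equation (5.9) (illustrative, not the thesis code):

```cpp
#include <vector>

// The TSP approximation (Equation (5.9)): plain tour length, computable
// in O(n) time, ignoring the customers' probabilities entirely.
double tsp_approx(const std::vector<std::vector<double>>& d,
                  const std::vector<int>& lambda) {
    double len = 0.0;
    int n = (int)lambda.size();
    for (int i = 0; i < n; ++i)
        len += d[lambda[i]][lambda[(i + 1) % n]];  // wraps back to lambda(1)
    return len;
}
```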
5.2.2.2 The pACS-T algorithm

The pACS-T algorithm (Algorithm 9) is essentially equivalent to the ACS algorithm for the TSP [69], since it exploits for the search of solutions exclusively the length


of a solution (the TSP objective function) instead of the expected length (the PTSP objective function). The differences with respect to pACS are in steps 6 to 13, and 15, of Algorithm 9. Best-ant and best-so-far solutions are updated on the basis of their length, and the global pheromone update is done with the same rule as in pACS (Equation (5.4)), but with a different quantity of reinforcement pheromone (here Δτ_ij = 1/L(λ_BSF), while in pACS Δτ_ij = 1/E[L(λ_BSF)]). In order to evaluate the final solution quality of pACS-T, the exact expected cost is computed just once at the end, for evaluating the best-so-far solution λ_BSF.
The Initialization procedure is the same as in pACS. In particular, the initial level of pheromone is computed based on the exact expected cost of the Farthest Insertion solution, according to Equation (5.1). Also the ConstructAntSolution procedure is the same as in pACS, since it exploits pheromone and heuristic information in the same way. Note, however, that the pheromone levels here will evolve differently than in pACS, due to the different amount of pheromone deposited, and due to the fact that λ_BSF solutions here are different from λ_BSF solutions in pACS.
Algorithm 9 pACS-T
1:  Initialization
2:  for iteration k = 1, 2, . . . do
3:      Initialize best ant solution λ_BA
4:      for ant a = 1, 2, . . . , m do
5:          ConstructAntSolution [each ant constructs its solution λ_a]
6:          if L(λ_a) < L(λ_BA) then
7:              set λ_BA = λ_a
8:          end if
9:      end for
10:     if L(λ_BA) < L(λ_BSF) then
11:         set λ_BSF = λ_BA
12:     end if
13:     GlobalPheromoneUpdate [using Δτ_ij = 1/L(λ_BSF)]
14: end for
15: Compute E[L(λ_BSF)]
15: Compute E[L(BSF )]

5.2.3 The pACS-D algorithm based on the Depth approximation

5.2.3.1 The Depth approximation

We say that an edge e_ij has depth δ ∈ {0, 1, . . . , n − 2} with respect to a given a priori tour λ if there are exactly δ cities on the tour between i and j. A high depth for an edge implies a small probability for the edge to actually be part of a realized tour, since this would mean that a large number of consecutive customers do not require a visit. Therefore, edges with higher depth should have less impact on the objective value of an a priori tour. Let us now be more precise.


Given an a priori tour λ and depth values δ = 0, 1, . . . , n − 2, the edges of the PTSP instance under consideration may be grouped in sets Λ^(δ), each set containing edges of the same depth. Note that the edge set Λ^(δ) contains a number of subtours (that is, tours which visit a subset of the customers) equal to gcd(n, δ + 1)¹. For instance, the set Λ^(0) contains exactly the edges of the a priori tour. In general, however, it may be the case that the subtours formed in such a way can never be realized under the a priori strategy. As an example of edge partition according to the depth, Figure 5.7 shows, from left to right, an a priori tour (which coincides with the set of edges Λ^(0)), Λ^(1) and Λ^(2) for a PTSP with 8 customers. Note that Λ^(1) contains two subtours which could actually be realized under the a priori strategy, but Λ^(2) contains one single tour that visits all customers, and therefore cannot be realized under the a priori strategy (since, if all customers are to be visited, then they are visited according to the a priori tour λ).
Given the partition of edges according to their depth δ = 0, 1, 2, . . . , n − 2, the exact PTSP objective function may be written as a sum of terms of increasing depth:

    E[L(λ)] = Σ_{δ=0}^{n−2} Σ_{j=1}^{n} d(λ(j), λ(j+δ+1)) · p_{λ(j)} p_{λ(j+δ+1)} Π_{k=j+1}^{j+δ} q_{λ(k)},    (5.10)
where = ((1), (2), ..., (n)) is the a priori tour. In the homogeneous PTSP, where
pi = p for all customers i V , Equation (5.10) may be written as
E[L()] =

n2
X

L(() )p2 (1 p)

(5.11)

=0

P
where L(() ) nj=1 d(j, (j + 1 + )). The L(() )s have the combinatorial interpretation of being the sum of the lengths of the collection of subtours in () . It is
now easy to see how the probability of an edge of depth to be used decreases with .
In the homogeneous PTSP (Equation (5.11)) this probability is p2 (1 p) , and in the
heterogeneous PTSP (Equation (5.10)) it also involves a product over + 2 probability
coefficients. Therefore, the probability of an edge of depth to be used (and therefore
the weight of such and edge in the objective value) decreases exponentially with . The
idea, in Depth approximation, is to stop the evaluation of the objective function after
a fixed depth has been reached. We define

j+r

n
X
X
Y

ED [L()] =
d((j), (j + r + 1)) p(j) p(j+r+1)
q ,
(5.12)
r=0

j=1

j+1

and for the homogeneous PTSP this simplifies to


ED [L()] =

L((r) )p2 (1 p)r .

r=0
1

The term gcd stays for greatest common divisor.

(5.13)

5.2. THE USE OF OBJECTIVE FUNCTION APPROXIMATIONS IN ACO

(0)

(1)

79

(2)

Figure 5.7: Edges grouped in sets according to their depth . The picture shows the
sets with () with = 0, 1, 2. Note that (0) corresponds to the set of edges forming
the a priori tour.

The time complexity of Depth approximation is O(n), therefore, the smallest the , the
bigger the time gain of Depth approximation with respect to the O(n2 ) exact objective
function. Nevertheless, the smallest the , the bigger is the difference, or error, between
approximated and exact computation.
5.2.3.2

The pACS-D algorithm

The algorithm pACS is modified in a straightforward manner in order to obtain pACSD, an algorithm that exploits the Depth approximation for the evaluation of the quality
of solutions. The pseudocode of pACS-D is shown in Algorithm 10. The Initialization
procedure performs the same tasks as in pACS, but also sets the depth of the approximation to be used for the computation of ED . Differences between pACS-D
and pACS are in steps 6 to 13 of Algorithm 10. Best-ant and best-so-far solutions
are updated on the base of their Depth approximation expected values, and global
pheromone update is done with the same rule as in pACS (Equation (5.4)), but with
a different quantity of reinforce pheromone (here, ij = ED [L(BSF )]1 , while in
pACS ij = E[L(BSF )]1 ). In order to evaluate the final solution quality of pACSD, the exact expected cost is computed just once at the end (step 15 of Algorithm 10)
for evaluating the best-so-far-solution BSF .

5.2.4
5.2.4.1

The pACS-S algorithm based on the Sampling approximation


The Sampling approximation

The Sampling approximation is based on the observation that the PTSP objective
function computes an expected value of a random quantity, and therefore it may be
estimated by a sample average of the type of Equation (2.13) introduced in Section
2.4.2. More precisely, given an a priori tour , the objective function may be written

80

CHAPTER 5. ANT COLONY OPTIMIZATION

Algorithm 10 pACS-D
1: Initialization [like in pACS, plus setting of the depth of approximation ]
2: for iteration k = 1, 2, . . . do
3:
Initialize best ant solution BA
4:
for ant a = 1, 2, . . . , m do
5:
ConstructAntSolution [each ant constructs its solution a ]
6:
if ED [L(a )] < ED [L(BA )] then
7:
set BA = a
8:
end if
9:
end for
10:
if ED [L(BA )] < ED [L(BSF )] then
11:
set BSF = BA
12:
end if
13:
GlobalPheromoneUpdate [using ij = (ED [L(BSF )])1 ]
14: end for
15: Compute E[L(BSF )]

as
E[L()] =

p()L(| ).

(5.14)

In the above expression, is a subset of the set of customers V , L(| ) is the length
of the tour , pruned in such a way as to only visit the customers in omega, skipping
the others, and p() is the probability for the subset of customers to require a visit:
p() =

Y
i

pi

qi .

(5.15)

iV \

The objective function, as expressed by Equation (5.14), computes the expected length
of the tour , over all possible random subsets of customers.
The idea, in Sampling approximation, is to estimate the exact expected cost (5.14)
through sampling in the following way. The length L() is a discrete random variable,
taking the value L(| ) with probability p(). Let i , i 1, 3, ..., N be subsets of
the original set V of n customers sampled independently with probability p(i ). The
Sampling approximation to E[L()] is the following
ESN [L()] =

N
1 X
L(|i ).
N

(5.16)

i=1

The time complexity of Sampling approximation is O(N n), therefore, the smaller the
sample size N , the bigger the time gain of Sampling approximation with respect to the
O(n2 ) exact objective function. Nevertheless, the smaller N , the bigger the difference,
or error, between approximated and exact computation.

5.2. THE USE OF OBJECTIVE FUNCTION APPROXIMATIONS IN ACO

81

Some indication of how the error decreases by increasing the sample size N is
given by a fundamental result of probability theory, the so called Strong Law of Large
Numbers [95], whose statement is as follows.
Theorem 1 (Strong Low of Large Numbers) Given an infinite sequence X1 , X2 ,
X3 , . . ., of independent and identically distributed random variables with common expected value and finite variance 2 , then the sample average
XN =

1
(X1 + X2 + X3 + . . . + XN )
N

(5.17)

satisfies

P

lim XN


= =1

(5.18)

that is, the sample average converges almost surely to .


This theorem guarantees that, for any , EN [L()] converges to E[L()] with probability 1 for N . The speed of convergence of the sample average to the exact
expected value may be estimated by evaluating the variance of the sample average.
Under the hypotheses of Theorem 1, the variance of the sample average is
1
Var(X1 + X2 + X3 + . . . + XN )
n2
1
= 2 (Var(X1 ) + Var(X2 ) + Var(X3 ) + . . . + Var(XN ))
(5.19)
N
2
1
,
= 2 N Var(X1 ) =
N
N

Thus, the standard deviation of the sample average is equal to / N (with being
the standard deviation of the random variables X1 , X2 , X3 , . . .). In the context of the
PTSP one could say that the error of the Sampling approximation roughly goes as
O(1/ N ).
Var(X N ) =

5.2.4.2

The pACS-S algorithm

In the first part of this thesis, in Section 3.1, we have considered a few papers that
focused on ACO algorithms for SCOPs where the objective function is estimated by
sampling. One of the main conclusions that these first papers achieve is that, when
Sampling approximation is used, the number of samples should be chosen dynamically
during the run of the algorithm, and in particular the number of samples should increase
during the run. Thus, we have also considered a variable-sample algorithm, pACS-S,
whose pseudocode is shown in Algorithm 11. The number of samples N is a variable
which is linear in the iteration counter k, and quadratic in the number of customers n,
similarly to what has been done by Gutjahr in [100]. More precisely, N is computed
according to the following rule
N = c + bb n2 kc,

(5.20)

82

CHAPTER 5. ANT COLONY OPTIMIZATION

where c and b are two parameters. In pACS-S, after each solution has been constructed,
it is evaluated by the Sampling approximation with a freshly generated set of samples
(step 8 of Algorithm 11). The same set of samples is used to re-evaluate the best-ant
solution BA . After the ants construction phase is over, at each iteration a new set
of samples is generated, in order to update the best-so-far solution BSF (step 13 to
17 of Algorithm 11). Finally, similarly to what is done in pACS-T and pACS-D, the
final best-so-far solution is evaluated once with the exact objective function (step 20 of
Algorithm 11).
Algorithm 11 pACS-S with variable number of samples
1: Initialization [like in pACS]
2: for iteration k = 1, 2, . . . do
3:
Initialize best ant solution BA
4:
Set N = NumberOfSamples(k) [apply Equation (5.20)]
5:
for ant a = 1, 2, . . . , m do
6:
ConstructAntSolution [each ant constructs its solution a ]
7:
GenerateSamples(N )
8:
Compute ESN [L(a )] and re-compute ESN [L(BA )] using the last generated
samples
9:
if ESN [L(a )] < ESN [L(BA )] then
10:
set BA = a
11:
end if
12:
end for
13:
GenerateSamples(N )
14:
Re-compute ESN [L(BA )] and ESN [L(BSF )] using the last generated samples
15:
if ESN [L(BA )] < ESN [L(BSF )] then
16:
set BSF = BA
17:
end if
18:
GlobalPheromoneUpdate [using ij = (ESN [L(BSF )])1 ]
19: end for
20: Compute E[L(BSF )]

5.2.5
5.2.5.1

Experimental analysis
Tuning

For pACS-T and pACS-D, the same parameters of pACS have been adopted (see Table
5.3 and Section 5.1.2.2), namely q0 = 0.95, = 3, m = 10, = = 0.1.
For pACS-S, we have considered two settings, one where the number of samples N
is kept fixed through all the iterations, and a second one where a variable number of
samples is used. In this case, N is increased at each iteration of the algorithm, according
to Equation (5.20), where c and b are two parameters to be tuned. Even if we were
interested in only evaluating the performance of pACS-S with a variable number of

5.2. THE USE OF OBJECTIVE FUNCTION APPROXIMATIONS IN ACO

83

samples, we decided to execute tuning also in the fixed number of samples setting,
in order to have an idea of the difference between the two. Table 5.5 summarizes
the parameter values tested for tuning pACS-S. A total of 432 parameter sets has
been considered. For each parameter set, pACS-S has been run on a set of 14 tuning
q0

N (fixed)
c (N variable)
b (N variable)

0.95 0.98 1
1 2 3 4
2 10 50 200
1 10 50
103 104 105

Table 5.5: Parameter values considered for tuning pACS-S.


instances (the same used for pACS, as described in Section 5.1.2.2). The parameter set
which corresponds to the lowest average best-found-solution-value by pACS-S is a fixed
number of samples setting, with and q0 equal to the best pACS parameters (that is,
= 3 and q0 = 0.95), and with N = 50. In the variable number of samples setting,
the best parameters are c = 1, b = 105 , and again = 3 and q0 = 0.95. This is the
parameter set that has been selected for executing pACS-S on the PTSP benchmark.
5.2.5.2

Results

As we anticipated in Section 5.2.1, the first issue that we address here experimentally is
which precision measure of an ad hoc approximation (TSP and Depth approximation)
is more strongly correlated with the solution quality of an ACO based on the ad hoc
approximation. We consider two precision measures: the linear correlation between
the approximation and the exact objective function, and the absolute error of the
approximation. In order to compute these two precision measures, we have slightly
modified the code of pACS-T and pACS-D, by computing also the exact PTSP objective
value each time that a new best-so-far solution is encountered. In practice, we added the
instruction Compute E[L(BSF )] in the if statement starting at line 10 of Algorithm 9
and Algorithm 10. After running pACS-T or pACS-D, the linear correlation r between
the ad hoc approximation Eproxy (with proxy equal to T or to D ) and the exact
objective E has been computed as follows:


P
E[L(BSF )] E
BSF Eproxy [L(BSF )] Eproxy
r = qP
(5.21)
2 P
2 ,
E
[L(
)]

E[L(
)]

E
E
proxy
proxy
BSF
BSF
BSF
BSF
where sums are done over all encountered best-so-far solutions, and Eproxy and E are,
respectively, the average approximated value and average exact objective value over
all the encountered best-so-far solutions. The other precision measure is the average
absolute error , which is computed as follows
1 X
=
|Eproxy [(BSF )] E[(BSF )]| ,
(5.22)
nBSF
BSF

CHAPTER 5. ANT COLONY OPTIMIZATION

4850

4950

best

Ed15

Ed15

Ed45 Ed60
Ed30
Ed75
0.99985

0.99990

0.99995

correlation

1.00000

4750

4750 4800 4850 4900 4950 5000

best

84

Ed30
Ed45
Ed60
Ed75
0

500

1000

1500

absolute error

Figure 5.8: Average solution quality versus, respectively, the correlation r between
exact and approximated objective function (left), and the absolute error  of the approximated objective function.
where nBSF is the number of best-so-far solutions encountered by the algorithm (pACST or pACS-D). In order to evaluate which measure between r and  is better correlated
with the solution quality of pACS-T and pACS-D, we have run once each algorithm
on all the instances of the PTSP benchmark. The run time of the algorithms has
been set equal to twice the time given to pACS, in order not to penalize pACS-T
and pACS-D due to the fact that each new best-so-far solution is evaluated twice
(once with the approximated objective, and once with the exact objective). For the
Depth approximation, we have considered depth values equal to 15, 30, 45, 60, and
75. For each depth value, a different run of pACS-D has been performed. In order to
see whether r or  is better correlated with the average solution quality of our ACO
algorithms, we have produced scatterplots of the average solution quality versus the
precision measure as shown in Figure 5.8. From the plots, it is clear that the absolute
error () is more strongly correlated with the solution quality than the linear correlation
(r). In particular, the TSP approximation (denoted by L in the Figure) has a very high
linear correlation with the exact objective, but the average solution quality of pACS-T
is very low. The justification of this fact is that the TSP approximation has the highest
absolute error, as shown in the right scatterplot of Figure 5.8. We think that this
result is reasonable, but not obvious. In fact, even if the TSP approximation is quite
imprecise, it is also very fast, and much faster than the Depth approximation. Thus,
pACS-T can perform many more iterations than pACS-D, and has more time to select
good solutions. It seems, though, that the precision of the objective function is more
important than the guidance due to the ACO intensification mechanism based on the
use of pheromone.
Let us now analyze our second issue of interest as described in Section 5.2.1, that

5.2. THE USE OF OBJECTIVE FUNCTION APPROXIMATIONS IN ACO

85

Ranks
1

pACS
pACS!S
FI
NN
SFC
RAD
RS

Figure 5.9: Ranking of algorithms on the PTSP benchmark. According to the Friedman
two-way analysis of variance by ranks [64], the algorithms have statistically significant
different ranks, at a confidence level of 95%.
algorithm
pACS
pACS-S
FI
NN
SFC
RAD
RS

% above LB
81.2%
83.7%
90.2%
111.6%
111.1%
388.8%
1228.6%

time (seconds)
2453.7
2367.6
0.1
35.5
0.1
0.1
2712.0

Table 5.6: Average results of pACS-S compared to other heuristics over all the PTSP
benchmark.
is, the quality achieved by pACS-S, which never uses the exact objective function, but
is completely based on the Sampling approximation. The ranking of pACS-S with
respect to pACS and to other simple heuristics is reported in Figure 5.9, while average
deviations from the lower bound are reported in Table 5.6. Interestingly, pACS-S
achieves a quite good performance, being better than all simple heuristics, and being
worse than pACS of roughly 2%. As we have learned from the experimental analysis
of pACS in Section 5.1.2.3, it is important to see how the solution quality varies for
different PTSP instance parameters, particularly the average customers probability and
the variance of the customers probability. The quality of pACS-S with respect to the
optimal TSP solution, by varying the PTSP instance parameters, is shown in Figure
5.10. For comparison purposes, also the results of pACS are reported in the Figure. We
can observe that pACS-S has a behavior which is very similar to that of pACS, only
being shifted above of roughly 2%. Detailed results of pACS-S on the PTSP instances
with average probability smaller than or equal to 0.5 are reported by Table A.2 in
Appendix A.

86

CHAPTER 5. ANT COLONY OPTIMIZATION

0.00

0.04

0.06

0.1 prob 0.5

0.08

pACS
pACSS

0.2

0.4

0.6

0.8

prob

0.02

(best TSP) TSP

0.04

pACS
pACSS

0.06

0.00

0.04

(best TSP) TSP

0.02

20

40

60

80

var

Figure 5.10: Relative gain of pACS-S and pACS over the optimal TSP solution, versus,
respectively, average customers probability (left), and variance of the customers probability (right). Points are averages computed over the the complete PTSP benchmark
(left) and over subset of PTSP instances with customers probability smaller than or
equal to 0.5 (right). The length of error bars corresponds to the standard deviation of
the gain.

5.3

Overview of the results

The following are the most important facts observed thanks to the experimental analyses of this section.
Our straightforward ACO implementation of pACS obtains on average results
that are roughly 10% better than the best of the simple heuristics (which is FI
(Farthest Insertion)). This is a confirmation that what we can consider a minimal
requirement for a metaheuristic is satisfied by ACO.
When evaluating the performance of an algorithm for the PTSP, it is very important to consider the factors related to the degree of stochasticity of a PTSP
instance, since the performance can vary a lot for different degrees of stochasticity. In our benchmark these factors are the average customers probability and
the variance of the customers probability.
pACS, as compared to the expected value of the optimal solution of the TSP,
can be qualified indeed very differently, depending on the value of the average
customers probability, and of the variance of it. In particular, there is a critical
probability, under which PTSP instances are really worth solving by pACS, while
above the critical probability the problem can be better treated like a TSP. The

5.3. OVERVIEW OF THE RESULTS

87

average critical probability is 0.5, but in general, the higher the number of customers, the lower the critical probability. Moreover, the higher the variance of
customers probability, the more important is the gain of pACS with respect to
the optimal TSP solution. Where pACS is most effective, it finds results that are
on average 8% better than the optimal TSP solution.
In considering two ad hoc approximations for the PTSP objective function, we
have seen that the absolute error of the approximation is strongly correlated with
the performance of the pACS algorithm based on the ad hoc approximation. The
same is not true for the linear correlation between the ad hoc approximation and
the exact objective: an ad hoc approximation (in this case, the TSP approximation) can be strongly correlated with the exact objective, but its use inside pACS
can be degrading. This fact has important consequences, since, when designing ad
hoc approximations for a SCOP, one could be tempted to choose one which from
preliminary experiments seems well correlated with the exact objective function.
But this choice could be, as we have seen, quite bad.
When solving a SCOP by means of a sampling-based ACO such as our implementation of pACS-S, it is very likely that its performance satisfies at least the minimal requirement of being better than simple heuristics solving a related DCOP.
This is, at least, what we observe for the PTSP.
In our problem, pACS-S found results constantly worse than pACS of roughly
2%, but the qualitative behavior with respect to factors determining the stochasticity of PTSP instances is the same as the pACS algorithm based on the exact
objective.

88

CHAPTER 5. ANT COLONY OPTIMIZATION

Chapter 6

Local search
This chapter explores the situations that may arise when designing a local search algorithm for a SCOP that, as we have seen in the first part of this thesis, usually has a
computationally expensive objective function. The different possibilities that one has
in dealing with the complexity of the objective function are analyzed in Section 6.1.
Section 6.2 introduces two local search operators, the 2-p-opt and the 1-shift, that are
the focus of the next sections. Sections 6.3, 6.4 and 6.5 develop different strategies
to deal with the complexity of the objective function and apply them to the 2-p-opt
and 1-shift local search operators. Finally, Section 6.6 summarizes the most important
contributions of this chapter.

6.1

The issue of complexity and three options to deal with


it

The efficiency of a local search algorithm depends on several factors, such as the starting
solution, the neighborhood structure, and the speed of exploration of the neighborhood.
In particular, the speed of exploration of the neighborhood depends on the time required
for computing the difference of the objective value between couples of neighboring
solutions (such difference is also referred to as move cost).
As we have pointed out in the first part of this thesis, the objective function of
SCOPs is computationally demanding, and the PTSP is no exception, since it requires
O(n2 ) computation time. This can be a serious issue when designing local search
algorithms for SCOPs, because it may imply the impossibility of a fast exploration of
any neighborhood.
In general there are three possibilities when designing a local search algorithm for
a SCOP:
1. Computing the move cost by means of the exact full computation of the difference
in the objective value of couples of solutions;
2. Computing the exact move cost more efficiently, by exploiting some particular
neighborhood and exploration strategy, that allows recursive computations;
89

90

CHAPTER 6. LOCAL SEARCH


3. Computing an approximated move cost efficiently, by means of fast approximated
expressions for the difference in the objective value of couples of solutions;

Option 1 is the easiest to realize, because it can be used for any neighborhood structure,
and it does not require any particular search strategy, given that the objective function
is explicitly available, like in the PTSP. The problem is of course that it can be very
inefficient, due to the computational complexity of the objective function. We will show
the impracticality of this approach for the PTSP in Section 6.3.
Option 2, when possible, is in principle the best one. The main problem with this
alternative is that it can be very difficult to find a neighborhood structure that allows
for a recursive exact computation of the move cost and, given the SCOP, there is no way
a priori to say if such a neighborhood even exists. Fortunately, for the PTSP we have
been able to find, for two local search operators (the 2-p-opt and the 1-shift) fast and
exact recursive move cost computation expressions. The two operators are described
in Section 6.2, and their recursive move cost evaluation is derived in Section 6.4.
Option 3 can be a valid alternative to Option 1, when the move cost approximation
is both fast and accurate enough to guide the search towards good solutions. Trying this
alternative may also be better than trying Option 2, since sometimes it can be easier
to find ad hoc approximations of the move cost, instead of finding a neighborhood and
a local search operator that allows fast recursive computation. However, finding good
approximations of a move cost may require some effort too. Moreover, it is possible
that a given approximation is applicable only to a particular neighborhood structure.
For the PTSP, we will consider three different move cost approximations in Section 6.5.

6.2

The 2-p-opt and the 1-shift operators

In the literature, there are two local search procedures created specifically for the PTSP
that evaluate the move cost in terms of expected value: the 2-p-opt and the 1-shift.
The 2-p-opt neighborhood of an a priori tour is the set of tours obtained by reversing
a section of the a priori tour (that is, a set of consecutive nodes) such as the example
in Figure 6.1. The 2-p-opt is the probabilistic version of the famous 2-opt procedure
created for the TSP [133]. The 2-p-opt and the 2-opt are identical in terms of local
search neighborhoods, but greatly differ in the cost computation. The change in the
TSP objective value (the tour length) can be easily computed in constant time, while
the same cannot be said for the PTSP objective value. The 1-shift neighborhood of an
a priori tour is the set of tours obtained by moving a node which is at position i to
position j of the tour, with the intervening nodes being shifted backwards one space
accordingly, as in Figure 6.2. The number of neighbors generated by 2-p-opt and 1-shift
moves applied to one a priori tour is O(n2 ).

6.2. THE 2-P-OPT AND THE 1-SHIFT OPERATORS

91

2POPT
1
n = 10

1
2

n = 10

3
9

3=i
9

4
8

4
8

5
7

5
7=j

Figure 6.1: A tour = (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n) (left) and tour i,j belonging to
its 2-p-opt neighborhood (right) obtained from by reversing the section (i, i + 1, ..., j),
with n = 10, i = 3, j = 7.

1SHIFT
1
n = 10

1
2

n = 10

3
9

3=i
9

4
8

4
8

5
7
6

5
7 = j = i + k,
k=4

Figure 6.2: A tour = (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n) (left) and a tour i,j belonging
to its 1-shift neighborhood (right) obtained from by moving node i to position j and
shifting backwards the nodes (i + 1, ..., j), with n = 10, i = 3, j = 7.

92

6.3

CHAPTER 6. LOCAL SEARCH

The infeasibility of using the exact full objective function

Given that both the 2-p-opt and the 1-shift neighborhoods of any a priori tour contain
O(n2 ) neighbors, and given that the PTSP objective value of a solution requires O(n2 )
computation time, the exploration of a 2-p-opt or a 1-shift neighborhood of any a priori
tour requires O(n4 ) computation time, when the exact full objective function is used
to evaluate the move cost. The same operation in the TSP would require only O(n2 )
time, since the objective value (length) difference between two neighboring solutions
can be computed in constant time. This is also true for the recursive expressions that
we derive in Section 6.4 for the 1-shift and 2-p-opt local search.
An O(n4 ) local search is in practice not feasible. As an example, consider the 100customers PTSP instance kroA100 with homogeneous customers probability equal to
0.5. If we generate one solution by means of the Space Filling Curve heuristic, and we
apply just once the 1-shift local search to this solution, the time required to end the
search on a 1.7 GhZ processor is 79.3 CPU seconds, while the time needed when the
O(n2 ) version of 1-shift (derived in Section 6.4) is only 0.19 CPU seconds. Thus, the
evaluation of the move cost by using the exact full objective is more than 400 times
slower. The time progression of the search for this example is shown in Figure 6.3.

6.4

Exact recursive local search

In the PTSP, it has been luckily possible to derive exact recursive and efficient move
cost expressions for the 2-p-opt and 1-shift operators. The recursion on one side allows
to store arrays of partial information for future computation and to save time, and
on the other side, it forces to explore the 2-p-opt and 1-shift neighborhoods in a fixed
lexicographic order as a condition to be able to store the necessary arrays of information.
This section is organized as follows. The first attempts to derive exact recursive
expressions for the cost of 2-p-opt and 1-shift moves date back in the Operations Research history. Past work on this issue is reviewed in Section 6.4.1. In Section 6.4.2, we
derive new and correct expressions for computing the cost of 2-p-opt and 1-shift local
search moves for the general case of heterogeneous customers probabilities. In Section
6.4.3 we specialize the derivation of Section 6.4.2 to the case of homogeneous customers
probabilities. Finally, in Section 6.4.4 we perform a computational test showing the
effects of using, in the homogeneous PTSP, the incorrect expressions published in the
past in [24] and in [27], and the improvement due to the use of the correct cost values.

6.4.1

A bit of history

For the homogeneous PTSP, Bertsimas proposed move evaluation expressions in [27]
that explore the neighborhood of a solution (that is, that verify whether an improving
2-p-opt or 1-shift move exists) in O(n2 ) time. The intent of Bertsimas equations is
to provide a recursive means to quickly compute the exact change in expected value

6.4. EXACT RECURSIVE LOCAL SEARCH

93

18000
17000

current best

19000

full evaluation
recursive evaluation

1e02

1e01

1e+00

1e+01

1e+02

time

Figure 6.3: Time progression of the 1-shift local search with two types of move cost
evaluation. The full exact evaluation is more than 400 times slower than the recursive
one. The 1-shift local search of this plot is applied to a solution obtained by the
Space Filling Curve heuristic for solving the kroA100 instance with 100 customers and
homogeneous customers probability equal to 0.5.
associated with either a 2-p-opt or 1-shift procedure. Evaluating the cost of a local move
by computing the cost of two neighboring solutions and then evaluating their difference
would require much more time (O(n4 )) than a recursive approach. Unfortunately, as
we have shown in [41], Bertsimas expressions do not correctly calculate the change in
expected tour length. The correction of equations for computing the cost of local search
moves that we propose in this thesis confirms that it is possible to explore both the
2-p-opt and 1-shift neighborhood of a solution in O(n2 ) time, and does, as expected,
create significant improvement in the already good results for the homogeneous PTSP.
For the heterogeneous PTSP, Bertsimas et al. [26], report computational results of
2-p-opt and 1-shift local search algorithms applied to some small PTSP instances. The
results in [26] are based on the work of Chervi [62], who proposed recursive expressions
for the cost of 2-p-opt and 1-shift moves for the heterogeneous PTSP. Chervis expressions explore the 2-p-opt and 1-shift neighborhoods in O(n3 ) time, suggesting that it
is not possible to retain the O(n2 ) complexity of the homogeneous PTSP. Moreover,
Chervis expressions reduce to the incorrect expressions for the homogeneous PTSP

94

CHAPTER 6. LOCAL SEARCH

published in [27], when all customer probabilities are equal, and therefore are also
not correct. After deriving correct expressions for the computation of the change in
expected tour length, we demonstrate that the neighborhood of a solution for this important problem may be explored in O(n2 ) time, thus retaining the same complexity
as the homogeneous case.

6.4.2
6.4.2.1

Derivation of local search costs for the heterogeneous PTSP


The cost of 2-p-opt moves

Consider, without loss of generality, a tour = (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n), and
denote by i,j a tour obtained by reversing a section (i, i + 1, ..., j) of , where i
{1, 2, . . . , n}, j {1, 2, . . . , n}, and i 6= j (see Figure 6.1). Note that if j < i, the
reversed section includes n. Let Ei,j denote the change in the expected tour length
E[L(i,j )]E[L()]. We will derive a set of recursive formulas for Ei,j that can be used
to efficiently evaluate a neighborhood of 2-p-opt moves. To describe this procedure, we
first introduce a few definitions. Let S, T N be subsets of nodes, with representing
any a priori tour, and (i) representing the customer in the ith position on this tour
such that = ((1), (2), . . . , (n)). The product defined by Equation (4.3) can be
easily generalized by replacing qt with q(t) and qu with q(u) .
P
Q
Definition 7 E[L()]
= (i)S,(j)T,i6=j d( (i), (j) )p(i) p(j) j1
i+1 q , that is,
T S

the contribution to the expected cost of due to the arcs from the nodes in S to the
nodes in T .
Note that E[L()]
= E[L()], when T = S = N .
T S
Definition 8 E[L()]
= E[L()]
+ E[L()]
T S
T S
ST
For the two a priori tours and i,j we introduce
Definition 9 E
= E[L(i,j )]
E[L()]
, that is, the contribution to
i,j T S
T S
T S
Ei,j due to the arcs from the nodes in S to the nodes in T and from the nodes in T
to the nodes in S.
Unlike the TSP, the expected cost of an a priori tour involves the arcs between all of
the nodes. The ordering of the nodes on the a priori tour simply affects the probability
of an arc being used, and this probability determines the contribution this arc makes
to the expected cost of the tour. The change in expected tour length, Ei,j , resulting
from a reversal of a section is thus based on the change in probability, or weight, placed
on certain arcs in the two tours and i,j . While computing Ei,j it is thus necessary
to evaluate the weight change of each arc. The change in weight on an arc is influenced
by how many of its endpoints are included in the reversed section. Because of this, it
is useful to consider the following partitions of the node set.
Definition 10 i nsidei,j = {i, . . . , j}, that is, the section of that is reversed to obtain
i,j

6.4. EXACT RECURSIVE LOCAL SEARCH

95

Definition 11 outsidei,j = N \ i nsidei,j .


Using the above definitions, Ei,j may be expressed as
Ei,j = E
+ E
+ E
.
i,j insidei,j insidei,j
i,j outsidei,j outsidei,j
i,j insidei,j outsidei,j
(6.1)
It is not difficult to verify that the contributions to Ei,j due to E
i,j insidei,j insidei,j
and to E
are zero. The contribution to Ei,j due to arcs between
i,j outsidei,j outsidei,j

inside and outside (which is now equal to Ei,j ) may be split into three components:
E
=E[L(i,j )]
+
i,j insidei,j outsidei,j
insidei,j outsidei,j
E[L(i,j )]

outsidei,j insidei,j
E[L()]
,
insidei,j outsidei,j

(6.2)

where the three terms on the right hand side of the last equation are, respectively,
the contribution to E[L(i,j )] due to the arcs going from i nsidei,j to outsidei,j , the
contribution to E[L(i,j )] due to the arcs going from outsidei,j to i nsidei,j , and the
contribution to E[L()] due to arcs joining the two customer sets in both directions.
For compactness, these three components will be referenced hereafter by the notation:
(1)
Ei,j = E[L(i,j )]

insidei,j outsidei,j

(2)
Ei,j = E[L(i,j )]

outsidei,j insidei,j

(3)
Ei,j = E[L()]

insidei,j outsidei,j

(6.3)
(6.4)
(6.5)

We may rewrite the expected tour length change Ei,j as follows


(1)

(2)

(3)

Ei,j = Ei,j + Ei,j Ei,j .

(6.6)

Unlike the TSP, there is an expected cost associated with using an arc in a forward
direction as well as a reverse direction, and these costs are usually not the same. The
expected costs are based on which customers would have to be skipped in order for
the arc to be needed in the particular direction. For example, the weight on arc (1, 2)
is based only on the probability of nodes 1 and 2 requiring a visit, whereas the weight
on arc (2, 1) is also based on the probability of nodes (3, 4, , n) not requiring a visit.
(The tour will travel directly from the 2 to 1 only if none of the rest of the customers
on the tour are realized.) For the homogeneous PTSP, the equations are much simpler
since the expected cost is based on the number of nodes that are skipped, not which
nodes are skipped. This difference dictates the new set of equations we present here.
(1)
(2)
(3)
We will now derive recursive expressions for Ei,j , Ei,j , Ei,j , respectively, in terms of
(1)

(2)

(3)

Ei+1,j1 , Ei+1,j1 and Ei+1,j1 . These recursions are initialized with the expressions

96

CHAPTER 6. LOCAL SEARCH

corresponding to entries (i, i) and (i, i + 1) for all i. We will derive these expressions
quite easily later. First, let us focus on the case where j = i + k and 2 k n 2. The
case k = n 1 may be neglected because it would lead to a tour that is reversed with
respect to , and, due to the symmetry of distances, this reversed tour would have the
same expected length as . Let us consider the tour i+1,j1 = (1, 2, ..., i 1, i, j 1, j
2, ..., i + 1, j, j + 1..., n) obtained by reversing section (i + 1, ..., j 1) of . We can make
three important observations. The first one is that the partitioning of customers with
respect to i,j is related to the partitioning with respect to i+1,j1 in the following
way:
i nsidei,j = i nsidei+1,j1 {i, j}

(6.7)

outsidei,j = outsidei+1,j1 \ {i, j}.

(6.8)

The second observation is that for any arc (l, r), with l i nsidei+1,j1 and r
outsidei,j , the weight on the arc in the expected total cost equation for i,j can be
obtained by multiplying the weight that the arc has in i+1,j1 by qi and dividing it
by qj . One way to see this is to compare the set of skipped customers in i,j with the
set of skipped customers in i+1,j1 . In i,j , the fact that arc (l, r) is used implies that
customers (l 1, l 2, . . . , i + 1, i, j + 1, . . . , r 1) are skipped, while in i+1,j1 using
arc (l, r) implies that customers (l 1, l 2, . . . , i + 1, j, j + 1, . . . , r 1) are skipped.
Therefore, the set of skipped customers in i+1,j1 is equal to the set of skipped customers in i,j except for customer j in i+1,j1 , which is replaced by customer i in i,j .
In terms of probabilities, our second observation can be expressed as
E[L(i,j )]

insidei+1,j1 outsidei,j

qi
.
E[L(i+1,j1 )]
insidei+1,j1 outsidei,j
qj

(6.9)

The third important observation is similar to the previous one, but it refers to arcs
going in the opposite direction. More precisely, for any arc (r, l), with r outsidei,j
and l i nsidei+1,j1 , the weight of the arc in i,j can be obtained by multiplying the
weight that the arc has in i+1,j1 by qj and by dividing it by qi . It is not difficult
to verify this using the same argument as in the previous observation. Similar to the
second observation, the third observation can be expressed as
E[L(i,j )]

outsidei,j insidei+1,j1

qj
E[L(i+1,j1 )]
.
outsidei,j insidei+1,j1
qi

(6.10)

Now, by Equations (6.3), (6.4) and (6.7) we can write


(1)
Ei,j = E[L(i,j )]

insidei+1,j1 outsidei,j

+ E[L(i,j )]

{i,j}outsidei,j

(2)
Ei,j = E[L(i,j )]

outsidei,j insidei+1,j1

+ E[L(i,j )]

outsidei,j {i,j}

(6.11)
.

(6.12)

By combining Equation (6.9) with Equation (6.11), we obtain


(1)

Ei,j =

qi
E[L(i+1,j1 )]
+ E[L(i,j )]
,
insidei+1,j1 outsidei,j
{i,j}outsidei,j
qj

(6.13)

6.4. EXACT RECURSIVE LOCAL SEARCH

97

which, by Equation (6.8), becomes


(1)

qi
E[L(i+1,j1 )]
insidei+1,j1 outsidei+1,j1
qj
qi
E[L(i+1,j1 )]
+ E[L(i,j )]
.
insidei+1,j1 {i,j}
{i,j}outsidei,j
qj

Ei,j =

(6.14)

We can rewrite this to obtain the following recursion


qi (1)
qi
Ei+1,j1 E[L(i+1,j1 )]
+ E[L(i,j )]
.
insidei+1,j1 {i,j}
{i,j}outsidei,j
qj
qj
(6.15)
(2)
In an analogous way, we can create a recursive expression for Ei,j . By first combining
Equation (6.9) with Equation (6.12), and then applying (6.8), we obtain
(1)

Ei,j =

qj (2)
qj
Ei+1,j1 E[L(i+1,j1 )]
+ E[L(i,j )]
.
{i,j}insidei+1,j1
outsidei,j {i,j}
qi
qi
(6.16)
(3)
Let us now focus on Ei,j . This term refers to the original tour . Therefore, in order
(2)

Ei,j =

(3)

to get a recursive expression in terms of Ei+1,j1 , we must isolate the contribution to


(3)

Ei,j due to arcs going from i nsidei+1,j1 to outsidei+1,j1 and vice versa. Thus, by
combining Equation (6.5) with both (6.7) and (6.8) we obtain
(3)
(3)
Ei,j = Ei+1,j1 E[L()]

{i,j}insidei+1,j1

+ E[L()]

{i,j}outsidei,j

(6.17)

Now we complete the derivation by showing that it is possible to express the residual
(s)
terms on the right hand side of Ei,j , s = 1, 2, 3 in Equations (6.15), (6.16), (6.17) in
terms of the four two-dimensional matrices Q, Q, A and B, defined as follows
Qi,j =

j
Y
i

q,

Qi,j =

i+n1
Y

q,

(6.18)

j+1

and
Ai,k =

Bi,k =

n1
X
r=k
n1
X

d(i, i + r)pi pi+r Qi+1,i+r1

(6.19)

d(i r, i)pir pi Qir+1,i1 .

(6.20)

r=k
(s)

where 1 k n 1, 1 i, j n. Expressing the Ei,j expressions in terms of these


defined matrices allows to minimize the number of calculations necessary in evaluating
a neighborhood of local search moves, since Q, Q, A and B must be recomputed only
when a new current tour is defined.

98

CHAPTER 6. LOCAL SEARCH


(s)

Let us now focus on the residual terms on the right hand side of Ei,j , s = 1, 2, 3
in Equations (6.15), (6.16), (6.17). Recalling that j = i + k, the second term on the
right hand side of Equation (6.15) is the following

k1
qi
qi X
E[L(i+1,j1 )]
=
d(i + t, i)pi+t pi Qi+1,i+t1 Qi,j1
insidei+1,j1 {i,j}
qj
qj

qi
qj

t=1
k1
X

d(i + t, i + k)pi+t pi+k Qi+1,i+t1 .

t=1

(6.21)
The right hand side of the above equation is in two pieces. In the first piece, the
factor Qi,j1 may be taken out from the sum and, by applying the definition of A from
Equation (6.19) to the remaining terms in the sum, we get qqji Qi,j1 (Ai,1 Ai,k ). Also
the second piece can be expressed in terms of the A matrix, but it requires a bit more
work. First, we substitute (qi /qj )Qi+1,i+t1 with Qi,i+t1 /qj . Then, we multiply and
divide it by the product qj qj+1 . . . qi+n1 , and we obtain the term Qj+1,i+t1 /Qj,i+n1 ,
whose denominator (which is equivalent to Qi,j1 ) may be taken out from the sum.
Finally by replacing i + t with j + n k + t, and by applying the definition of A to the
remaining terms in the sum, the second piece of the right hand side of Equation (6.21)
becomes Q 1 Aj,nk+1 , and the whole Equation (6.21) may be rewritten as
i,j1

1
qi
qi
Aj,nk+1 . (6.22)
E[L(i+1,j1 )]
= Qi,j1 (Ai,1 Ai,k )
insidei+1,j1 {i,j}
qj
qj
Qi,j1

The rightmost term of Equation (6.15) may be written as


=
E[L(i,j )]
{i,j}outsidei,j
+

nk1
X
r=1
nk1
X

d(i, i + k + r)pi pi+k+r Qi+k+1,i+k+r1

d(i + k, i + k + r)pi+k pi+k+r Qi+k+1,i+k+r1 Qi,i+k1 .

r=1

(6.23)
By applying the definition of A from Equation (6.19) to the right hand side of the last
equation we obtain
qi 1
E[L(i,j )]
=
Ai,k+1 + Qi,j1 (Aj,1 Aj,nk ).
(6.24)
{i,j}outsidei,j
qj Qi,j1
The second term on the right hand side of Equation (6.16) is the following

k1
qj
qj X
E[L(i+1,j1 )]
=
d(i, i + k t)pi pi+kt Qi+kt+1,i+k1
{i,j}insidei+1,j1
qi
qi
t=1

qj
qi

k1
X

d(i + k, i + k t)pi+k pi+kt Qi+kt+1,i+k1 Qi+1,j ,

t=1

(6.25)

6.4. EXACT RECURSIVE LOCAL SEARCH

99

which, by applying the definition of B from Equation (6.20), becomes

qj 1
qj
E[L(i+1,j1 )]
=
Bi,nk+1 Qi,j1 (Bj,1 Bj,k ). (6.26)
{i,j}insidei+1,j1
qi
qi Qi,j1

The rightmost term of Equation (6.16) may be written as

E[L(i,j )]

outsidei,j {i,j}

nk1
X
r=1
nk1
X

d(i r, i)pir pi Qir+1,i1 Qi+1,i+k


(6.27)
d(i r, i + k)pir pi+k Qir+1,i1 .

r=1

By applying the definition of B from Equation (6.20) to the right hand side of the last
equation we obtain

E[L(i,j )]

outsidei,j {i,j}

qj
1
Qi,j1 (Bi,1 Bi,nk ) +
Bj,k+1 .
qi
Qi,j1

(6.28)

The second term on the right hand side of Equation (6.17) is the following

=
E[L()]
{i,j}insidei+1,j1

k1
X
t=1
k1
X
t=1
k1
X
t=1
k1
X

d(i, i + t)pi pi+t Qi+1,i+t1

d(i + k, i + t)pi+k pi+t Qin+k+1,i+t1


d(i + k t, i)pi+kt pi Qi+kt+1,i+k1 Qi+k,i+n1
d(i + k t, i + k)pi+kt pi+k Qi+kt+1,i+k1 ,

t=1

(6.29)
which, by applying the definition of A (Equation (6.19)) and B (Equation (6.20)),
becomes
E[L()]
= (Ai,1 Ai,k + Aj,nk+1 + Bi,nk+1 + Bj,1 Bj,k ) .
{i,j}insidei+1,j1
(6.30)

100

CHAPTER 6. LOCAL SEARCH

The rightmost term of Equation (6.17) is the following

E[L()]
=
{i,j}outsidei,j
+

nk1
X
r=1
nk1
X
r=1
nk1
X
r=1
nk1
X

d(i r, i)pir pi Qir+1,i1


d(i r, i + k)pir pi+k Qir+1,i+k1
(6.31)
d(i, i + k + r)pi pi+k+r Qi+1,i+k+r1

d(i + k, i + k + r)pi+k pi+k+r Qi+k+1,i+k+r1 ,

r=1

which, by applying the definition of A (Equation (6.19)) and B (Equation (6.20)),


becomes
E[L()]
= Bi,1 Bi,nk + Bj,k+1 + Ai,k+1 + Aj,1 Aj,nk .
{i,j}outsidei,j

(6.32)

By substituting in Equations (6.15), (6.16) and (6.17) the appropriate terms from
equations (6.22), (6.24), (6.26), (6.28), (6.30) and (6.32), we obtain the following final
recursive equations for the 2-p-opt local search for j = i + k and k 2:
(1)

(2)

(3)

Ei,j = Ei,j + Ei,j Ei,j ,

(1)

qi (1)
1
E
+ qi
Ai,k+1 qi Qi,j (Ai,1 Ai,k )
qj i+1,j1
Qi,j
1
1 1

Aj,nk+1 + Qi,j (Aj,1 Aj,nk ),


qj Qi,j
qj
qj (2)
1 1
1
= Ei+1,j1
Bi,nk+1 + Qi,j (Bi,1 Bi,nk )
qi
qi Qi,j
qi
1
+ qj
Bj,k+1 qj Qi,j (Bj,1 Bj,k ),
Qi,j

(6.33)

Ei,j =

(2)

Ei,j

(3)

(6.34)

(6.35)

(3)

Ei,j = Ei+1,j1 Ai,1 + Ai,k + Ai,k+1 + Aj,1 Aj,nk Aj,nk+1


+Bi,1 Bi,nk Bi,nk+1 Bj,1 + Bj,k + Bj,k+1 .

(6.36)

For k = 1, we can express the three components of Ei,i+1 (6.3), (6.4) and (6.5) in
terms of A and B and obtain the following equations

6.4. EXACT RECURSIVE LOCAL SEARCH

(1)

Ei,i+1 =

qi+1

Ai,2 + qi (Ai+1,1 Ai+1,n1 )

(2)

Ei,i+1 = qi+1 (Bi,1 Bi,n1 ) +

1
Bi+1,2
qi

(3)

Ei,i+1 = Ai,2 + Ai+1,1 Ai+1,n1 + Bi,1 Bi,n1 + Bi+1,2 .

101

(6.37)
(6.38)
(6.39)

For j = i, Ei,i = 0 since i,i = . It is still necessary, though, to compute the


(s)
(s)
three components Ei,i , s = 1, 2, 3 separately, in order to initiate the recursion Ei1,i+1 ,
s = 1, 2, 3. By expressing (6.3), (6.4) and (6.5) in terms of A and B, we obtain

(1)

(6.40)

(2)

(6.41)

(3)

(6.42)

Ei,i = Ai,1
Ei,i = Bi,1
Ei,i = Ai,1 + Bi,1 .
(1)

(2)

(3)

Note that Ei,i = Ei,i + Ei,i Ei,i = 0, as expected. It is possible to verify that
when pi = p and qi = q = 1 p, we obtain the same recursive Ei,j expressions as for
the homogeneous PTSP in [42].
The time required for evaluating one single 2-p-opt move by means of the recursive
equations is O(1), given that the matrices A, B, Q and Q are known. For each tour,
there are O(n2 ) possible 2-p-opt moves, and the time required to compute the matrices
A, B, Q and Q is O(n2 ). Thus, the recursive calculations allow us to evaluate all
possible 2-p-opt moves from the current solution in O(n2 ) time. With a straightforward
implementation that does not utilize recursion, evaluating the 2-p-opt neighborhood
would require O(n4 ) time instead of O(n2 ). Since a local search procedure involves
many iterations, these savings can lead to much better solutions.
The expression for Ei,j derived by Chervi in [62] is of the form Ei,j = Ei+1,j1 +
. This greatly differs from our set of recursive equations. First, the term, as derived
in [62], is not computable in O(1) time but is O(n). Second, the expression derived in
[62] is incorrect since it reduces to the incorrect 2-p-opt expression for the homogeneous
PTSP published in [27] when all customer probabilities are equal.
6.4.2.2

The cost of 1-shift moves

Consider, without loss of generality, a tour = (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n), and
denote by i,j a tour obtained from by moving node i to the position of node j and
shifting backwards the nodes (i + 1, ..., j), where i {1, 2, . . . , n}, j {1, 2, . . . , n},
and i 6= j (see Figure 6.2). Note that the shifted section may include n. Let Ei,j
denote the change in the expected tour length E[L(i,j )] E[L()]. In the following,
the correct recursive formula for Ei,j is derived for the 1-shift neighborhood.
Let j = i + k. For k = 1, the tour i,i+1 obtained by 1-shift is the same as the

102

CHAPTER 6. LOCAL SEARCH

one obtained by 2-p-opt, and the expression for Ei,i+1 may be derived by applying
the equations derived for the 2-p-opt. By summing Equations (6.37), (6.38) and by
subtracting Equation (6.39) we find


Ei,i+1


=
1 Ai,2 + (qi+1 1)(Bi,1 Bi,n1 )
qi+1


1
1 Bi+1,2 .
+ (qi 1) (Ai+1,1 Ai+1,n1 ) +
qi
1

(6.43)

We will now focus on the more general case where 2 k n 2. Again, the case
where k = n 1 can be neglected because it does not produce any change to the tour
. We re-define the notions of inside, outside and the contributions to the change in
expected tour length adapting them for the 1-shift.
= E[L(i,j )]
E[L()]
. This is similar to DefDefinition 12 E
T S
T S
i,j ST
inition 9, the only difference being the meaning of the a priori tour i,j , that here is
obtained from by a 1-shift move.
Definition 13 i nsidei,j = {i + 1, . . . , j}, that is, the section of that is shifted to
obtain i,j
Definition 14 outsidei,j = N \ (i nsidei,j {i})
It is not difficult to verify that the weights on arcs between outside nodes and arcs
between inside nodes again do not change as a result of the shift. Therefore, the
only contribution to Ei,j is given by the change in weight placed on arcs between
i nsidei,j {i} nodes and outsidei,j nodes, and on arcs between node {i} and i nsidei,j
nodes, that is
.
Ei,j = E
+ E
i,j {i}insidei,j
i,j (insidei,j {i})outsidei,j

(6.44)

In the following, we derive a recursive expression for each of the two components of
Ei,j . Let

E
= E
+ ,
i,j (insidei,j {i})outsidei,j
i,j1 (insidei,j1 {i})outsidei,j1

(6.45)

and

E
= E
+ .
i,j {i}insidei,j
i,j1 {i}insidei,j1

(6.46)

Then, by Equation (6.44), we can write the following recursive expression


Ei,j = Ei,j1 + + .

(6.47)

6.4. EXACT RECURSIVE LOCAL SEARCH

103

Now we complete the derivation by showing that it is possible to express the residual
terms and of Equation (6.47) in terms of the already defined matrices A, B, and
Q, Q, defined as follows

Qi,j =

j
Y

q,

i+1

Qi,j =

i+n1
Y

q.

(6.48)

j+1

Expressing the Ei,j expressions in terms of these defined matrices allows to minimize
the number of calculations necessary in evaluating a neighborhood of local search moves.
We first re-define the matrices A (Equation (6.19)) and B (Equation (6.20)) in
terms of the matrix Q (Equation (6.48))

Ai,k =

Bi,k =

n1
X
r=k
n1
X

d(i, i + r)pi pi+r Qi,i+r1

(6.49)

d(i r, i)pir pi Qir,i1 .

(6.50)

r=k

Let us now focus on the residual term from Equation (6.45). The contribution to
Ei,j due to arcs between i nsidei,j {i} and outsidei,j for i,j is the following

=
E
i,j (insidei,j {i})outsidei,j

nk1
X

[d(i r, i)pir pi Qir,i1 (Qi,j 1)

r=1

+ d(i, i + k + r)pi pi+k+r Qi+k,i+k+r1 (1 Qi,j )]


+

k nk1
X
X
t=1

[d(i r, i + t)pir pi+t Qir,i1 Qi,i+t1 (1 qi )

r=1

+ d(i + k t + 1, i + k + r)pi+kt+1 pi+k+r Qi+kt+1,i+k Qi+k,i+k+r1 (qi 1)] , (6.51)


while the contribution to Ei,j1 due to arcs between i nsidei,j1 {i} and outsidei,j1
for i,j1 is


=
i,j1 (insidei,j1 {i})outsidei,j1

nk
X

[d(i r, i)pir pi Qir,i1 (Qi,j1 1)

r=1

+ d(i, i + k 1 + r)pi pi+k1+r Qi+k1,i+k+r2 (1 Qi,j1 )]


+

k1 nk
X
X

[d(i r, i + t)pir pi+t Qir,i1 Qi,i+t1 (1 qi )

t=1 r=1

+ d(i + k t, i + k + r 1)pi+kt pi+k+r1 Qi+kt,i+k1 Qi+k1,i+k+r2 (qi 1)] .


(6.52)

104

CHAPTER 6. LOCAL SEARCH


The difference between E
and E
i,j (insidei,j {i})outsidei,j
i,j1 (insidei,j1 {i})outsidei,j1
will only involve arcs which are connected to nodes i and j, that is,
= terms with i and j in E
i,j (insidei,j {i})outsidei,j

terms with i and j in E

i,j1 (insidei,j1 {i})outsidei,j1

(6.53)

So, by extracting the terms which contain the appropriate arcs from Equation (6.51)
and Equation (6.52) and by expressing them in terms of the matrices A (Equation
(6.49)) and B (Equation (6.50)) we obtain the following expression for


1
= (1 Qi,j )
Ai,k+1 (Bi,1 Bi,nk )
Qi,j


+ (qi 1) (Aj,1 Aj,nk ) qi1 Bj,k+1


(6.54)
1
Ai,k (Bi,1 Bi,nk+1 )
(1 Qi,j1 )
Qi,j1


(qi 1) (Bj,1 Bj,k ) qi1 Aj,nk+1 ,
which completes the recursive expression of Equation (6.45). Let us now focus on the
residual term from Equation (6.46). The contribution to Ei,j due to arcs between
{i} and i nside is the following
k
X

E
= Qi,j 1
[d(i, i + t)pi pi+t Qi,i+t1
i,j {i}insidei,j
t=1

d(i + k t + 1, i)pi+kt+1 pi Qi+kt+1,i+k ] ,


(6.55)
while the contribution to Ei,j1 due to arcs between {i} and i nsidei,j1 for i,j1 is
E



= Qi,j1 1

i,j1 {i}insidei,j1

k1
X

[d(i, i + t)pi pi+t Qi,i+t1

t=1

d(i + k t, i)pi+kt pi Qi+kt,i+k1 ] .


(6.56)
Now, by subtracting Equation (6.56) from Equation (6.55) and by applying the definition of A (Equation (6.49)), and B (Equation (6.50)), we obtain the following expression
for
"
#

1
Bi,nk )
= Qi,j 1 Ai,1 Ai,k+1
Qi,j1
"
#
(6.57)

1
+ 1 Qi,j1 Ai,1 Ai,k
Bi,nk+1 ,
Qi,j1

6.4. EXACT RECURSIVE LOCAL SEARCH

105

which completes the recursive expression of Equation (6.46).


By substituting the expression for (Equation (6.54)) and (Equation (6.57)) in
Equation (6.47), we obtain the following final recursive equations for the 1-shift local
search for j = i + k and k 2:
1
)(qj Ai,k Ai,k+1 )
Qi,j
1
1
+(
Qi,j )(Bi,nk Bi,nk+1 )
qj
Qi,j
1
1
+ (1 )Q0i,j Bi,1 + ( 1)Bj,k+1
qj
qi
1
+ (1 qj )Qi,j Ai,1 + (1 )Aj,nk+1
qi
+ (qi 1)(Aj,1 Aj,nk )

Ei,j = Ei,j1 + (Qi,j

(6.58)

+ (1 qi )(Bj,1 Bj,k ).
Similarly to the 2-p-opt, the time required for evaluating one single 1-shift move by
means of the recursive equations is O(1), given that the matrices A, B, Q and Q are
known. For each tour, there are O(n2 ) possible 1-shift moves, and the time required
to compute the matrices A, B, Q and Q is O(n2 ). Thus, the recursive calculations
allow us to evaluate all possible 1-shift moves from the current solution in O(n2 ) time.
With a straightforward implementation that does not utilize recursion, evaluating the
1-shift neighborhood would require O(n4 ) time instead of O(n2 ). Since a local search
procedure involves many iterations, these savings can lead to much better solutions.
The expression for Ei,j derived by Chervi in [62] is of the form Ei,j = Ei,j1 +
0 . Again, the 0 term, as derived in [62], is not computable in O(1) time but requires
O(n). This expression is also incorrect since it reduces to the incorrect 1-shift expression
for the homogeneous PTSP published in [27] when all customer probabilities are equal.

6.4.3

Derivation of local search costs for the homogeneous PTSP

In this section we derive the cost of 2-p-opt and 1-shift moves for the homogeneous
PTSP, in a similar way as we did for the heterogeneous case in the previous section.
The fact that we obtain cost expressions that are equivalent to those that one would
obtain from the heterogeneous PTSP cost expression by letting pi = p and qi = q for
i = 1, 2, . . . , n, is an additional check of their correctness.
6.4.3.1

The cost of 2-p-opt moves

Consider, without loss of generality, a tour = (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n), and
denote by i,j a tour obtained by reversing a section (i, i + 1, ..., j) of , where i
{1, 2, . . . , n}, j {1, 2, . . . , n}, and i 6= j (see Figure 6.1) (see Figure 6.1). Note that
if j < i, the reversed section includes n. Let Ei,j denote the change in the expected

106

CHAPTER 6. LOCAL SEARCH

length E[L(i,j )] E[L()]. In the following, the correct recursive formula for Ei,j is
derived for the 2-p-opt move set. This derivation consists in first developing a formula
for Ei,j and then finding a recursive form of this.
Let j = i+k (we are using the notation expressed by (4.1)). For k = 1 we believe the
expression given in [24] and in [27] for Ei,j (Equation (6.76)) is correct. Therefore,
we focus on the case k 2 (and k n 2). Let S, T N be subsets of nodes,
with representing any a priori tour, and (i) representing the customer in the ith
position on this tour such that = ((1), (2), . . . , (n)). The contribution to the
expected cost of due to the arcs from the nodes in S to the nodes in T , denoted
by E[L()]
and introduced by Definition 7 in section 6.4.2.1, assumes in case of
T S
homogeneous probabilities the following form:
X
d( (i), (j) )p2 q ji1 .
(6.59)
E[L()]
=
T S
(i)S,(j)T,i6=j

Definition 8 and Definition 9 introduced section 6.4.2.1 must be adapted accordingly.


Given that the set of nodes of tour is partitioned in the the two sets i nsidei,j and
outsidei,j (see Definition 10 and Definition 11), the change in expected tour length
Ei,j may be expressed as
.
+ E
+ E
Ei,j = E
i,j insidei,j outsidei,j
i,j outsidei,j outsidei,j
i,j insidei,j insidei,j
(6.60)
It is not difficult to verify that, as in the heterogeneous PTSP, the contributions to
are zero, therefore
and to E
Ei,j due to E
i,j outsidei,j outsidei,j
i,j insidei,j insidei,j
we can write
Ei,j = E
.
(6.61)
i,j insidei,j outsidei,j
Now, the contribution from arcs between i nsidei,j and outsidei,j to the expected tour
length of the undisturbed tour is
E[L()]
= p2
insidei,j outsidei,j

k+1
X

q t1

t=1

nk1
X

q r1 [d(i r, i + t 1) + d(j t + 1, j + r)].

r=1

(6.62)
In the disturbed tour i,j , the contribution of the same arcs is
E[L(i,j )]

insidei,j outsidei,j

= p2

k+1
X
t=1

q t1

nk1
X

q r1 [d(i + t 1, j + r) + d(i r, j t + 1)].

r=1

(6.63)
By subtracting (6.62) from (6.63), we obtain
Ei,j = p

k+1
X
t=1

t1

nk1
X

q r1 [d(i + t 1, j + r) + d(i r, j t + 1)

r=1

d(i r, i + t 1) d(j t + 1, j + r)].

(6.64)

6.4. EXACT RECURSIVE LOCAL SEARCH

107

Consider now the case when the section from i + 1 to j 1 of is reversed, leading to
the tour i+1,j1 . Then the difference in expected length is
k1
X

Ei+1,j1 = p2

q t1

t=1

n+1k
X

q r1 [d(i + t, j + r 1) + d(i + 1 r, j t)

r=1

(6.65)

d(i r + 1, i + t) d(j t, j + r 1)].


From the two above expressions (6.64) and (6.65) it is possible to extract a recursive
equation which relates Ei,j to Ei+1,j1
Ei,j = Ei+1,j1 + .

(6.66)

Observe that the difference between Ei,j and Ei+1,j1 will only involve arcs which
are connected to nodes i and j, that is,
 = terms with i and j in Ei,j
terms with i and j in Ei+1,j1 .

(6.67)

So, let us extract the terms which contain the appropriate arcs from (6.64) and (6.65).
The terms with i in Ei,j are (from Equation (6.64))
p2

nk1
X

q r1 [(1 q k )d(i, j + r) + (q k 1)d(i r, i)],

(6.68)

r=1

and the terms with j in Ei,j are (from Equation (6.64))


p2

nk1
X

q r1 [(q k 1)d(j, j + r) + (1 q k )d(i r, j)].

(6.69)

r=1

The terms with i in Ei+1,j1 are (from Equation (6.65))


p

k1
X

q t1 [(q nk 1)d(i + t, i) + (1 q nk )d(i, j t)],

(6.70)

t=1

and the terms with j in Ei+1,j1 are (from Equation (6.65))


p2

k1
X

q t1 [(1 q nk )d(i + t, j) + (q nk 1)d(j t, j)].

(6.71)

t=1

By subtracting (6.70) and (6.71) from (6.68) and (6.69) we obtain


2

 = p {(1 q )

nk1
X
r=1
k1
X

+(1 q nk )

r=1

q r1 [d(i, j + r) d(i r, i) + d(j, j + r) d(i r, j)]


(6.72)
q r1 [d(i + r, i) d(i, j r) d(i + r, j) + d(j r, j)]}.

108

CHAPTER 6. LOCAL SEARCH

Now, the two dimensional matrices of partial results A and B defined in the heterogeneous case by Equations (6.19) and (6.20), assume here the following form (as also
given by Bertsimas [27])
Ai,k =

n1
X

q r1 d(i, i + r) and Bi,k =

r=k

n1
X

q r1 d(i r, i), 1 k n 1, 1 i n.

r=k

(6.73)
By expressing Equation (6.72) in terms of A and B as defined above we obtain
 =p2 [(q k 1)Ai,k+1 + (q k 1)(Bi,1 Bi,nk )
+ (q k 1)(Aj,1 Aj,nk ) + (q k 1)Bj,k+1
+ (1 q nk )(Ai,1 Ai,k ) + (1 q kn )Bi,nk+1

(6.74)

+ (1 q kn )Aj,nk+1 + (1 q nk )(Bj,1 Bj,k )].


Finally, the above expression, together with Equation (6.66) leads to the following recursive equation for the change in expected tour length
Ei,j = Ei+1,j1 + p2 [(q k 1)Ai,k+1 + (q k 1)(Bi,1 Bi,nk )
+ (q k 1)(Aj,1 Aj,nk ) + (q k 1)Bj,k+1
+ (1 q nk )(Ai,1 Ai,k ) + (1 q kn )Bi,nk+1

(6.75)

+ (1 q kn )Aj,nk+1 + (1 q nk )(Bj,1 Bj,k )].


This expression differs from the one proposed by Bertsimas in [24] in the sign of the last
four terms inside the squared brackets, and from the expression proposed by BertsimasHowell in [27] also in the first term on the right side. Recall that Equation (6.75) is
valid for k 2. In the case k = 1 the expression for Ei,j , as published in [24] and
[27], is correct, and it is the following

Ei,i+1 = p3 [q 1 Ai,2 (Bi,1 Bi,n1 ) (Ai+1,1 Ai+1,n1 ) + q 1 Bi+1,2 ].

(6.76)

The computational complexity of evaluating the 2-p-opt neighborhood is O(n2 ), the


motivation being the same as for the heterogeneous PTSP case (see the end of section
6.4.2.1). In practice, the homogeneous case will be faster, since one does not need to
compute the matrices Q and Q.
6.4.3.2

The cost of 1-shift moves

Consider, without loss of generality, a tour = (1, 2, . . . , i, i + 1, . . . , j, j + 1, . . . , n),


and denote by i,j a tour obtained from by moving node i to position j and shifting
backwards one space the nodes i + 1, ..., j, where i {1, 2, . . . , n}, j {1, 2, . . . , n}, and
i 6= j (see Figure 6.2). Let Ei,j denote the change in the expected length E[L(i,j )]

6.4. EXACT RECURSIVE LOCAL SEARCH

109

E[L()]. In the following, the correct recursive formula for Ei,j is derived for the
1-shift move set. This derivation follows a line similar to the one for the 2-p-opt.
Let j = i + k (we are using the notation expressed by (4.1)). Note that, for k = 1,
the tour i,j obtained by 1-shift is the same as the one obtained by 2-p-opt, for which
we believe [24] and [27] gives the correct expression (Equation (6.43)). Therefore, we
focus on the case k 2 (and k n 2).
Based on the node partitions i nsidei,j and outsidei,j introduced for the 1-shift by
Definition 13 and 14, it is not difficult to verify that, like in the heterogeneous case,
the weights on arcs between outside nodes and arcs between inside nodes again do not
change as a result of the shift. Therefore, the only contribution to Ei,j is given by
the change in weight placed on arcs between i nsidei,j {i} nodes and outsidei,j nodes,
and on arcs between node {i} and i nsidei,j nodes, that is
Ei,j = E
+ E
.
i,j (insidei,j {i})outsidei,j
i,j {i}insidei,j

(6.77)

In the following, we derive a recursive expression for each of the two components of
Ei,j . Let

+ ,
= E
E
i,j1 (insidei,j1 {i})outsidei,j1
i,j (insidei,j {i})outsidei,j

(6.78)

and

+ .
= E
E
i,j1 {i}insidei,j1
i,j {i}insidei,j

(6.79)

Then, by Equation (6.77), we can write the following recursive expression


Ei,j = Ei,j1 + + .

(6.80)

Now we complete the derivation by showing that it is possible to express the residual terms and of Equation (6.80) in terms of the already defined matrices A, B
(Equation (6.73)).
Let us now focus on the residual term from Equation (6.78). The contribution
to 0 Ei,j due to arcs between i nsidei,j {i} and outsidei,j for is the following
E[L()]
= p2
(insidei,j {i})outsidei,j

k+1
X

q t1

t=1

nk1
X

q r1 [d(ir, i+t1)+d(jt+1, j+r)].

r=1

(6.81)
In the disturbed tour i,j , the contribution of the same arcs is
E[L(i,j )]
= p2
(insidei,j {i})outsidei,j
+p2

k
X
t=1

q t1

nk1
X
r=1

nk1
X

q r1 [q k d(i r, i) + d(i, j + r)]

r=1

q r1 [qd(j t + 1, j + r) + d(i r, i + t)].

(6.82)

110

CHAPTER 6. LOCAL SEARCH

By subtracting (6.81) from (6.82), we obtain


E
= p2
i,j (insidei,j {i})outsidei,j
+p2

k
X

q t1

nk1
X

t=1

nk1
X

q r1 [(q k 1)d(i r, i) + (1 q k )d(i, j + r)]

r=1

q r1 [(1 q)d(i r, i + t) + (q 1)d(j t + 1, j + r)].

r=1

(6.83)
Consider now the case where node i is shifted to position j 1, leading to a perturbed
tour i,j1 . Then the contribution to 0 Ei,j1 due to arcs between i nsidei,j1 {i}
and outsidei,j1 for i,j1 is
E

i,j1 (insidei,j1 {i})outsidei,j1

p
+p2

k1
X

q t1

t=1

nk
X
r=1
nk
X

q r1 [(q k1 1)d(i r, i) + (1 q k1 )d(i, j 1 + r)]

(6.84)

q r1 [(1 q)d(i r, i + t) + (q 1)d(j t, j 1 + r)].

r=1

Now, here,
E

like

in

the heterogeneous PTSP, the difference between



will only involve
and E
i,j1 (insidei,j1 {i})outsidei,j1
i,j (insidei,j {i})outsidei,j
arcs which are connected to nodes i and j, that is,
= terms with i and j in E
i,j (insidei,j {i})outsidei,j

.
terms with i and j in E
i,j1 (insidei,j1 {i})outsidei,j1

(6.85)

So, let us extract the terms which contain the appropriate arcs from (6.83) and (6.84).
The terms with i and j in E
are (from (6.83)):
i,j (insidei,j {i})outsidei,j

nk1
X

q r1 [(q k 1)d(i r, i) + (1 q k )d(i, j + r)

r=1

(6.86)

+(q 1)d(j, j + r) + (q k1 q k )d(i r, j)],


and the terms with i and j in E

+p2

k1
X
t=1
nk
X
r=1

i,j1 (insidei,j1 {i})outsidei,j1

are (from (6.84)):

q t1 [q nk1 (1 q)d(j, i + t) + (q 1)d(j t, j)]


(6.87)
q r1 [(q k1 1)d(i r, i) + (1 q k1 )d(i, j + r 1)].

6.4. EXACT RECURSIVE LOCAL SEARCH

111

Now, by subtracting (6.87) from (6.86) and by applying the definition of A and B (6.73)
we obtain the following expression
= p2 [(q k 1)(Bi,1 Bi,nk ) + (q k 1)Ai,k+1
+(1 q k1 )(Bi,1 Bi,nk+1 ) + (1 q k+1 )Ai,k
+(q 1)(Aj,1 Aj,nk ) + (q 1 1)Bj,k+1

(6.88)

+(1 q)(Bj,1 Bj,k ) + (1 q 1 )Aj,nk+1 ],


which completes the recursive expression (6.78). Let us now focus on the residual
term from Equation (6.79). The contribution from arcs between {i} and i nsidei,j to
the expected tour length of the undisturbed tour is
E[L()]
= p2
{i}insidei,j

k
X

[q t1 d(i, i + t) + q nk1 q t1 d(j t + 1, i)].

(6.89)

t=1

In the disturbed tour i,j , the contribution of the same arcs is


= p2
E[L(i,j )]
{i}insidei,j

k
X

[q t1 d(j + 1 t, i) + q nk1 q t1 d(i, i + t)].

(6.90)

t=1

By subtracting (6.89) from (6.90), we obtain


= p2 (q nk1 1)
E
i,j {i}insidei,j

k
X

q t1 [d(i, i + t) d(j + 1 t, i)].

(6.91)

t=1

Consider now the case where node i is shifted to position j 1, leading to a perturbed
tour i,j1 . Then the contribution to 0 Ei,j1 due to arcs between {i} and i nsidei,j1
is

E
= p2 (q nk 1)
i,j1 {i}insidei,j1

k1
X

q t1 [d(i, i + t) d(j t, i)].

(6.92)

t=1

Now, by subtracting (6.92) from (6.91) and by applying the definition of A and B (6.73)
we obtain the following expression
= p2 [(q nk1 1)(Ai,1 Ai,k+1 ) + (q k+1n 1)Bi,nk
+(1 q nk )(Ai,1 Ai,k ) + (1 q kn )Bi,nk+1 ],

(6.93)

which completes the recursive expression (6.79). By substituting the expression for
(Equation (6.88)) and (Equation (6.93)) in Equation (6.80), we obtain the following
recursive equation for the 1-shift local search for j = i + k and k 2:

112

CHAPTER 6. LOCAL SEARCH

Ei,i+k =Ei,i+k1 + p2 [(q k q nk1 )Ai,k+1


+(q k q k1 )Bi,1 + (q k+1n q k )Bi,nk
+(q nk1 q nk )Ai,1 + (q nk q 1k )Ai,k
+(q k1 q kn )Bi,nk+1

(6.94)

+(q 1)(Ai+k,1 Ai+k,nk ) + (q 1 1)Bi+k,k+1


+(1 q 1 )Ai+k,nk+1 + (1 q)(Bi+k,1 Bi+k,k )].
This expression differs from the one proposed by Bertsimas in [24] and [27] by the first
six terms on the right side inside the square brackets, and it can be further simplified
to
Ei,j = Ei,j1 + p2 [(q nk q k )(qAi,k Ai,k+1 )
+(q kn q k1 )(qBi,nk Bi,nk+1 )
+(1 q 1 )(q k Bi,1 Bj,k+1 )
+(q 1 1)(q nk Ai,1 Aj,nk+1 )

(6.95)

+(q 1)(Aj,1 Aj,nk )


+(1 q)(Bj,1 Bj,k )].
Recall that Equation (6.95) is valid for k 2, and that in the case k = 1 the expression
for Ei,j as published in [24] and [27] (Equation (6.76)) is correct.
Again, the computational complexity of evaluating the 1-shift neighborhood is
O(n2 ), the motivation being the same as for the heterogeneous PTSP case (see the
end of section 6.4.2.2). In practice, the homogeneous case will be faster, since one does
not need to compute the matrices Q and Q.

6.4.4

Computational test

In this section we report the results of a computational test showing the improvement
due to the use of the correct cost values with respect to using the incorrect expressions
for the costs 2-p-opt and 1-shift moves published in [24] and in [27] for the homogeneous
PTSP. In the following section 6.4.4.1 we explain the search strategies for the 2-p-opt
and 1-shift local search, that we implemented following [24] and [27]. In section 6.4.4.2,
we describe the experimental setup, and in section 6.4.4.3 we show and comment the
experimental results.
6.4.4.1

Pseudocode of the 2-p-opt and 1-shift local search

2-p-opt The 2-p-opt local search proceeds in two phases. The first phase consists of
only exploring the moves that swap two consecutive nodes (i) and (i+1) of the current
tour , as represented in pseudocode by Algorithm 12, named Swapping local search.
The costs Ei,i+1 are computed for every value of i (by means of Equation (6.76)),

6.4. EXACT RECURSIVE LOCAL SEARCH

113

and each time a negative Ei,i+1 is encountered, one immediately swaps the two nodes
involved. Note that since the computation of Ei,i+1 only involves two rows of A and
B, one can proceed to the next pair of nodes without recomputing each entire matrix.
The local search has, as second argument, a tour BSF which is the best-so-far tour
known by the algorithm calling the Swapping local search function. At the end of the
first phase of the local search, an a priori tour is reached for which every Ei,i+1 is
positive, and the matrices A and B are complete and correct for that tour.
Algorithm 12 Swapping local search(, BSF )
for (i = 1, 2, . . . , n) do
compute rows i and i + 1 of matrices A and B relative to the current solution
compute Ei,i+1 according to Equation (6.43)
if (Ei,i+1 < 0) then
:= the tour obtained from by switching node (i) with node (i + 1)
if E[L()] < E[L(BSF )] then
BSF :=
end if
end if
end for
if (the starting solution has changed) then
re-compute the full matrices A and B
end if
The second phase of the local search consists of computing Ei,j with j = i + k and
k 2 recursively by means of Equation (6.75). The 2-p-opt, represented in pseudocode
by Algorithm 13, is implemented as a first-improvement local search, that is, when a
neighbor better than the current solution is found, the current solution is immediately
updated with the neighbor solution, and the search is restarted. When there are no
improving solutions in the neighborhood, or when the time is over, the search stops.
1-shift The 1-shift algorithm follows the same lines as the 2-p-opt algorithm: all
phase one computations, including the accumulation of matrices A and B, proceed in
the same way (see Algorithm 12). The second phase of the local search consists of
computing Ei,j with j = i + k and k 2 recursively by means of Equation (6.95).
Differently from the 2-p-opt, the 1-shift, represented in pseudocode by Algorithm 14,
is implemented as a best-improvement local search, that is, the whole neighborhood
is explored and the current solution is updated with the best (improving) neighbor
solution. When there are not improving solutions in the neighborhood, or when the
time is over, the search stops.
6.4.4.2

Experimental setup

We considered instances with n = 100 and with customers coordinates uniformly and
independently distributed on the square [0, 1]2 . For generating random instances we

114

CHAPTER 6. LOCAL SEARCH

Algorithm 13 2-p-opt(, BSF )


(1) Swapping local search(, BSF )
k := 2
while (k n 2 and time is not over) do
i := 1
while (i n and time is not over) do
compute Ei,i+k according to Equation (6.75)
if (Ei,i+k < 0) then
:= tour obtained by inverting section (i), (i + 1), . . . , (i + k) of
if E[L()] < E[L(BSF )] then
BSF :=
end if
go to (1)
else
i := i + 1
end if
end while
k := k + 1
end while

Algorithm 14 1-shift(, BSF )


(1) Swapping local search(, BSF )
while (locally optimal tour not found and time is not over) do
for (i = 1, 2, . . . , n) do
for (k = 2, . . . , n 2) do
compute Ei,i+k according to Equation (6.95)
end for
end for
if (arg mini,k Ei,i+k < 0) then
:= tour obtained from by inserting (i) after (i + k)
if E[L()] < E[L(BSF )] then
BSF :=
end if
go to (1)
else
return locally optimal tour
end if
end while

6.4. EXACT RECURSIVE LOCAL SEARCH

115

used the Instance Generator Code of the 8th DIMACS Implementation Challenge at
http://www.research.att.com/dsj/chtsp/download.html. Distances were computed with the Euclidean metric. We considered only homogeneous PTSP instances,
and for each of the common demand probabilities p = 0.1, 0.2, ..., 0.9 we generated 10
problems. Experiments were run on a PowerPC G4 400MHz, the code has been written
in C++ and it is available at http://www.idsia.ch/leo .
6.4.4.3

Comparison between correct and incorrect local search

We tested the 2-p-opt local search algorithm with three delta objective evaluation
expressions: the correct one (Equation (6.75)), the incorrect one published in [24], and
the incorrect one published in [27]. We also tested two versions of the 1-shift local
search algorithm: one with the correct delta objective evaluation (Equation (6.94))
and one with the incorrect delta objective evaluation published in [24] (which is the
same as that published in [27]). In the following, we denote by the prefix C, the
algorithms involving the correct delta objective evaluations (e.g., C-1shift), and by the
prefix I the ones implementing the incorrect delta objective evaluations as published
in [27] (e.g., I-1shift). Since we obtained similar results with the incorrect expressions
of [24] and [27], in the following we only show results obtained with the delta objective
evaluation expression of [27].
For each C- and I-local search, we considered two types of starting solutions, generated with two different solution construction heuristics, namely the the Radial Sort
and the Space Filling Curve heuristic. These solution construction heuristics have been
extensively applied in previously published papers on the PTSP, and are described in
Section 4.5.
The absolute solution quality of the tested algorithms was evaluated with respect
to near-optimal solutions which were heuristically generated in the following way. Consider three solution construction heuristics, namely Radial Sort, Space Filling Curve
and Random Search. Consider as local search heuristics the application of both C1shift and C-2-p-opt one after the other. This results in two local search heuristics
(first C-1shift and after C-2-p-opt and vice versa). By combining the three solution
construction with the two local search heuristics one obtains six different heuristics.
Return the best solution found by these six heuristic combinations.
Table 6.1 shows the results obtained with the correct algorithms, and the percent
increase resulting from the use of Bertsimas expressions. CPU running times in seconds
are shown in parenthesis. In each line of the table the data show the average over 10
different instances. For each instance, each algorithm was run only once. Note, to evaluate the I-algorithms we checked the final solutions obtained using the full evaluation
Equation (4.2). For the C-algorithms we confirm that this check was unnecessary because the algorithms exactly calculated the improvements obtained by the local search.
In Figure 6.4 the relative difference between each algorithmic combination and the
best near-optimal solution found is shown in a graphical form. It is clear from both
Table 6.1 and Figure 6.4 that I-algorithms always give worse solution quality than to
C-algorithms, as expected. From Table 6.1 we see that for small customer probability

116

CHAPTER 6. LOCAL SEARCH

% above best solution found

45

SFC+C-2POPT
SFC+C-1SFT
40 RAD+C-2POPT
RAD+C-1SFT
SFC+I-2POPT
SFC+I-1SFT
35 RAD+I-2POPT
RAD+I-1SFT
30
25
20
15
10
5
0
0.1

0.2

0.3

0.4

0.5
probability

0.6

0.7

0.8

0.9

Figure 6.4: percent distance from the best solution found for C-(Correct) and I(Incorrect) local search heuristics, combined with the Radial Sort and the Space Filling
Curve solution construction heuristics. Black symbols refer to C-algorithms, and white
symbols refer to I-algorithms. Each point is an average over 10 different Euclidean instances with 100 customers and customer coordinates in the unit square. The customer
probability is on the horizontal axis.

6.5. APPROXIMATED LOCAL SEARCH

117

(p=0.1) the error in Bertsimas expressions results in a very small worsening effect,
smaller than 1%, for all heuristic combinations. The table shows that the higher the
probability, the worse are I-algorithms with respect to C-algorithms. This effect is
bigger for the algorithms involving the Radial Sort, where I-algorithms obtain results
more than 30% worse than C-algorithms.
The running times of all algorithmic combinations are under the second, and also
obtaining the best near-optimal result only took a time of the order of the second.
Running times of I-algorithms (not shown), are very similar to those of C-algorithms,
except for some cases, where the solution quality of a I-algorithm is very bad. In
those cases the I-local search would stop improving very early, because it starts cycling
between two solutions, never stopping until the maximum allowed run time is exceeded.
In order to understand why cycling may occur in the I-local search, consider for instance
the I-2-p-opt local search. Given a solution , suppose that the section (i), (i +
1), . . . , (i + k), with k 2 is reversed, leading to a new solution i,j , with i,j (i) =
(i+k), i,j (i+1) = (i+k1), . . . , i,j (i+k) = (i). This means that Ei,i+k () < 0.
Now, due to the incorrectness of the expression for evaluating Ei,i+k of [24, 27], it may
happen that also Ei,i+k (i,j ) < 0, and that the section i,j (i), i,j (i+1), . . . , i,j (i+k)
is reversed, going back to solution . The same situation may occur in the I-1shift local
search.
From Figure 6.4 we can see that C-1shift consistently gives the best results (among
single heuristic combinations), with both the Space Filling Curve and the Radial Sort.
The C-2-p-opt heuristic always gives worse results, but it is not clear if it works better
with Space Filling Curve or with Radial Sort. When using I-algorithms, it seems that
the starting solution is more important than the local search, since, as we see from
Figure 6.4, the Space Filling Curve gives better results than the Radial Sort, no matter
if it is combined with I-2-p-opt or with I-1shift. This is a side effect of the inaccuracy
resulting from the use of incorrect local searches.

6.5

Approximated local search

In this section we describe three types of approximations for the move cost computation
of PTSP local search operators. The first two types of approximations (section 6.5.1)
are called ad hoc, since they are based on the particular solution structure of the
PTSP, while the third type of approximation (section 6.5.2), which is based on Monte
Carlo sampling, can be applied, in principle, to any SCOP. We restrict the description of
move cost approximations to the 1-shift local search operator, since this is the one that
appears to be more promising (according to our experiments described in Section 6.4.4).
Nevertheless, our derivation might be quite easily extended to the 2-p-opt operator as
well.
In the reminder of this Section, we will consider, without loss of generality, a tour
= (1, 2, ..., i, i + 1, ..., j, j + 1, ..., n) and a tour i,j obtained from by moving node i
to the position of node j and shifting backwards the nodes (i + 1, ..., j) (see Figure 6.2),
like we did in Section 6.4.

CHAPTER 6. LOCAL SEARCH


118

p
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9

SFC+2POPT
C
+I%
2.988 (0.20)
0.1
4.127 (0.15)
0.4
5.089 (0.11)
0.8
5.757 (0.10)
0.7
6.441 (0.12)
1.3
7.065 (0.11)
1.8
7.436 (0.12)
2.3
7.819 (0.11)
3.1
8.340 (0.12)
4.1

SFC+1SFT
C
+I%
2.980 (0.29)
0.5
4.059 (0.33)
2.6
4.903 (0.33)
4.8
5.542 (0.28)
4.8
6.206 (0.31)
6.4
6.733 (0.31)
7.5
7.126 (0.32)
8.1
7.543 (0.27)
7.8
7.985 (0.30)
9.6

RAD+2POPT
C
+I%
2.990 (0.14)
0.2
4.117 (0.22)
1.7
5.109 (0.33)
5.2
5.777 (0.32)
6.4
6.468 (0.39) 10.5
7.057 (0.39) 11.2
7.496 (0.46) 16.8
7.804 (0.48) 17.5
8.335 (0.49) 24.7

RAD+1SFT
C
+I%
2.980 (0.21)
0.5
4.058 (0.44)
3.8
4.942 (0.49) 10.6
5.573 (0.44) 14.5
6.253 (0.52) 19.9
6.871 (0.50) 21.8
7.339 (0.57) 26.5
7.606 (0.55) 33.2
8.316 (0.58) 33.9

best
C
2.979 (1.28)
4.042 (1.06)
4.876 (1.11)
5.492 (1.25)
6.122 (1.34)
6.682 (1.28)
6.950 (1.51)
7.396 (1.62)
7.883 (0.47)

Table 6.1: Average solution values obtained with C-(Correct) local search algorithms, and percent increase resulting from
the use of I-(Incorrect) local search algorithms. The column named best shows the values of the best near-optimal solutions
found heuristically, as described in section 6.4.4.3. For each probability p the average result over 10 different instances is
reported (problems and experiments are the same as the ones used for obtaining Figure 6.4). CPU run times in seconds on a
PowerPC G4 400MHz are shown in parentheses.

6.5. APPROXIMATED LOCAL SEARCH

6.5.1

119

Ad hoc approximations for the 1-shift

Unlike the TSP, the expected cost of an a priori tour involves the arcs between all of the
nodes, as it appears from Equation (4.2). The ordering of the nodes on the a priori tour
simply affects the probability of an arc being used, and this probability determines the
contribution this arc makes to the expected cost of the tour. The change in expected
tour length, that we denote by Ei,j , resulting from shifting the node at position i to
position j (see Figure 6.2) is thus based on the change in probability, or weight, placed
on the arcs in the two tours (the undisturbed tour) and i,j (the tour obtained by
applying the 1-shift move to ). The exact value of Ei,j thus consists of the sum of
the weight change of all arcs.
The main idea behind the two ad hoc approximations that we are going to describe
(1-shift-T and 1-shift-P), is to neglect in the computation of Ei,j the arcs whose
weight change is very small, or, equivalently to select a small number of arcs whose
weight change is biggest. The two different ad hoc approximations correspond to two
different ways in approximating the value of the weight change of selected arcs, while
the selected arcs are the same in 1-shift-T and in 1-shift-P.
Let us first specify how we select arcs, and secondly how we approximate the weight
change (respectively in Section 6.5.1.1 for 1-shift-T and in Section 6.5.1.2 for 1-shiftP). Arcs whose weight changes more from tour to i,j are the ones satisfying the two
following conditions
1. arcs whose weight change is not zero;
2. arcs that link nodes which are nearest neighbor either in , or in i,j or in both
tours.
The first condition is obvious, since an arc whose weight is the same in and i,j
cannot contribute to Ei,j . The second condition can be understood by considering
that the weight of an arc corresponds to its probability of being actually traveled
in the a posteriori tour. Thus, the more far away are two nodes (with respect to
and/or i,j ), the smaller is the probability of being actually traveled, and thus its
weight in the expected cost expression. For example, arc (i, i + 4) in Figure 6.2 has a
probability of actually being traveled in the a posteriori tour corresponding to equal
to pi pi+4 qi+1 qi+2 qi+3 . This weight is a product over four numbers smaller than one,
which is much smaller than one. Thus, the more the number of q factors entering an
arch weight, the smaller its weight.
There are only six arcs that satisfy both the two above mentioned conditions, and
these are: (j, j + 1), (j, i), (i, j + 1), (i 1, i + 1), (i 1, i), and (i, i + 1), as shown
in Figure 6.5. Selecting only these six arcs from the set of all 2n(n 1) arcs for
computing an approximation of Ei,j constitutes a first level of approximation, which
is common both to the 1-shift-T and the 1-shift-P approximation. The next level of
approximation consists in approximating the difference between the weight of each of
these arcs in and i,j , and the way this is done distinguishes the 1-shift-T from the
1-shift-P approximations. Before focusing on approximating weight differences, it is

120

CHAPTER 6. LOCAL SEARCH


1
n = 10

i1 = 2
i=3

i+1 = 4

j+1 = 8
5
j=7
6

Figure 6.5: The set of arcs that are selected to approximate a 1-shift move cost Ei,j .
Arc
(j, j + 1)
(j, i)
(i, j + 1)
(i 1, i + 1)
(i 1, i)
(i, i + 1)

(1) Weight in E[L()]


pj pj+1
Qi1
pj pi j+1 q
Qj
pi pj+1 i+1 q
pi1 pi+1 qi
pi1 pi
pi pi+1

(2) Weight in E[L(i,j )]


pj pj+1 qi
p j pi
pi pj+1
pi1 pi+1
Qj
pi1 pi i+1 q
Qi1
pi pi+1 j+1 q

(2)(1) Weight in Ei,j


pj pj+1 (1 qi )
Qi1
pj pi (1 j+1 q )
Qj
pi pj+1 (1 i+1 q )
pi1 pi+1 (1 qi )
Qj
pi1 pi (1 i+1 q )
Qi1
pi pi+1 (1 j+1 q )

Table 6.2: Exact probabilistic weights of the arcs selected for the 1-shift-T and the
1-shift-P ad hoc approximations. Tour and i,j are shown in Figure 6.2, while for the
definition of the product notation, see Equation (4.3).
useful to consider the exact weights of the selected arcs in and i,j . These are shown
in Table 6.2. Note that some of the exact weight differences can require an O(n)
computation time, when the product of several q factors must be performed.
6.5.1.1

1-shift-T approximation

In this very rough approximation, the weight difference of each selected arc (the right
column of Table 6.2) is approximated either by 1 or by -1, depending on the fact
whether its probability of being traveled a posteriori increases or decreases from to
i,j . From an inspection of the exact weights of the six arcs that we selected, it is not
difficult to verify that the weight increases for (i 1, i + 1), (j, i), and (i, j + 1), while
it decreases for (j, j + 1), (i 1, i), and (i, i + 1). Thus, we define the approximated
move cost as

T Ei,j = d(i 1, i + 1) + d(j, i) + d(i, j + 1) d(j, j + 1) d(i 1, i) d(i, i + 1). (6.96)


Note that T Ei,j also corresponds to the length difference between and i,j , as if,
instead of dealing with a PTSP, we would be working with a TSP. This is the reason for
the subscript T in T Ei,j and for the name of this ad hoc approximation. Obviously,

6.5. APPROXIMATED LOCAL SEARCH

121

T Ei,j requires a constant-time computation.


6.5.1.2

1-shift-P approximation

Here, we approximate the weight difference of each selected arc (the right column of
Table 6.2) by posing any product involving more than 3 q factors equal to zero. Let
 Qj
j
Y
3
i q if j i 3 or j + n i 3,
q =
(6.97)
0
otherwise.
i

Then, the approximated move cost in 1-shift-P is the following

P Ei,j =
3

d(i 1, i + 1)pi1 pi pi+1 + d(j, i)pj pi (1

i1
Y

q ) + d(i, j + 1)pi pj+1 (1

j+1
3

d(j, j + 1)pj pj+1 pi d(i 1, i)pi1 pi (1

j
Y

q )

i+1
j
Y

q ) d(i, i + 1)pi pi+1 (1

i+1

i1
Y

q ).

j+1

(6.98)
Note that, since each product appearing in Equation (6.98) can be computed in constant
time, also the 1-shift-P approximation P Ei,j can be computed in constant time, for
any node i and j.

6.5.2

Sampling approximation for the 1-shift: 1-shift-S

This approximation of Ei,j is based on the Sampling approximation of the PTSP


objective function, as defined in Section 5.2.4.1. More precisely, given a set of N samples
of customers 1 , . . . , k , . . . , N , sampled independently according to probability p(k )
(see Equation (5.15)), the Sampling approximation for the 1-shift is defined as follows
SN Ei,j = ESN [L(i,j )] ESN [L()],

(6.99)

where
ESN [L(i,j )] =

N
N
1 X
1 X
L(i,j|k ) and ESN [L()] =
L(|k ).
N
N
k=1

(6.100)

k=1

Equation (6.99) can be rewritten as a sample average over length differences, each one
corresponding to a sample k of customers. For this purpose, let us introduce the
following definitions
Definition 15 Given a subset of customers V , and a node i V ,

i,
pred (i) =
the first-met node r going backward from i along ,

if i ,
otherwise.
(6.101)

122

CHAPTER 6. LOCAL SEARCH

Definition 16 Given a subset of customers V , and a node i V ,



i,
succ (i) =
the first-met node r going onward from i along ,

if i ,
otherwise.
(6.102)

Equipped with the above definitions, the length difference of the two a posteriori tours
induced by and i,j on a given subset of customers k is the following

0,
if i
/ k ,

(i
+
1))
(i

1),
succ
d(pred
k
k

(j),
i)
+
d(i,
succk (j + 1))
+d(pred
(6.103)
k Li,j =
k
otherwise,

(j
+
1))
(j),
succ
d(pred
k
k

d(predk (i 1), i) d(i, succk (i + 1)),


and the sampling approximation of 1-shift can be computed with the following
expression

SN Ei,j

N
1 X
k Li,j .
=
N

(6.104)

k=1

Since the time required to compute a predecessor or a successor node is O(n) in the
worst case, the time complexity for computing SN Ei,j is O(N n), which is much higher
than the constant time complexity of 1-shift, 1-shift-T and 1-shift-P. Note, however,
that there is the possibility to decrease at least by a constant factor the complexity
of SN Ei,j , if the neighborhood is explored lexicographically, and if the same set of
samples is kept fixed during the neighborhood exploration. Let us see in more detail
how this is done.
Suppose that the 1-shift neighborhood is explored lexicographically, such that for
each i = 1, 2, . . . , n and for each j = 1, 2, . . . , n, the value of SN Ei,j is computed
in this order by means of Equation (6.104). Suppose also that before starting the
double cycle over indexes i and j, N random samples of the subsets of customers are
generated independently, and this set is kept fixed until all SN Ei,j are evaluated. It
is not difficult to see that in such a situation it is possible to compute some of the
successor and predecessor nodes needed for the computation of k Li,j (see Equation
(6.103)) recursively. In fact, for any node i V and any subset of customers , one
can compute succ (i) and pred (i) in the following ways

succ (i)
(by Definition 16 in O(n)), if i 1
succ (i) =
(6.105)
succ (i 1) (recursively in O(1)),
otherwise.

pred (i 1) (recursively in O(1)), if i
/
pred (i) =
(6.106)
i
(in O(1)),
otherwise.

6.6. OVERVIEW OF THE RESULTS

123

Thus, the probability of needing an O(n) computation for k Li,j is equal to the
probability that customer i1 in Equation (6.105), and this can be estimated by p,
the average customers probability. In this way, the complexity of computing SN Ei,j
by Equation (6.104) is now O(N pn) (and not O(N n) as it would be without any
hypothesis on the neighborhood exploration). For PTSP instances with low customers
probabilities, the time complexity of 1-shift-S may be comparable to that of 1-shift,
1-shift-T and 1-shift-P.
The interest in considering 1-shift-S despite its possibly high time complexity lies in
its generality. In fact, the sampling approximation can be applied to any SCOP, even
when there is no analytical or closed-form expression for the objective function, and so
it is not possible to design any good ad hoc approximation.

6.5.3

Pseudocode of the approximated 1-shift local search

The three approximated move cost expressions T Ei,j , P Ei,j , and SN Ei,j in principle can be used inside a local search algorithm without any restriction on the order
in which the 1-shift neighbors are explored, and both in first-improvement or bestimprovement mode. However, since our goal is to compare the effectiveness of these
approximated move costs with the exact ones, we have developed 1-shift-T, 1-shift-P,
and 1-shift-S by keeping the same exploration strategy (best-improvement with lexicographic neighborhood exploration) as the 1-shift algorithm based on the exact recursive
move costs (described in Section 6.4.4.1). The pseudocode of the three approximated
1-shift local searches is very similar to the one of the exact recursive 1-shift, except for
two differences: first, the exact move cost Ei,j is substituted by, respectively, T Ei,j ,
P Ei,j , and SN Ei,j ; second, there is no distinction between a first phase (where only
single swap moves were checked) and a second phase. The pseudocode of 1-shift-T,
1-shift-P, and 1-shift-S are represented respectively in Algorithms 15, 16, and 17, and
Table 6.5.3 summarizes the asymptotic time complexity for all the 1-shift variants that
we have presented.
Local Search
1-shift
1-shift-T
1-shift-P
1-shift-S (N samples)

Single move
O(n), if j = i + 1; O(1), otherwise
O(1)
O(1)
O(N pn)

All neighborhood
O(n2 )
O(n2 )
O(n2 )
O(N pn3 )

Table 6.3: Time complexity of the neighborhood exploration for exact and approximated versions of 1-shift.

6.6

Overview of the results

There are several achievements obtained in this chapter, both from the methodological
point of view common to all SCOPs, and from the point of view of specific results valid

124

CHAPTER 6. LOCAL SEARCH

Algorithm 15 1-shift-T(, BSF )


while (locally optimal tour not found and time is not over) do
for (i = 1, 2, . . . , n) do
for (k = 1, . . . , n 2) do
compute T Ei,i+k using equation (6.96)
end for
end for
if (mini,k T Ei,i+k < 0) then
(i, k) := arg mini,k T Ei,i+k
:= tour obtained from by inserting (i) after (i + k)
if E[L()] < E[L(BSF )] then
BSF :=
end if
else
return locally optimal tour
end if
end while

Algorithm 16 1-shift-P(, BSF )


while (locally optimal tour not found and time is not over) do
for (i = 1, 2, . . . , n) do
for (k = 1, . . . , n 2) do
compute P Ei,i+k using equation (6.98)
end for
end for
if (mini,k P Ei,i+k < 0) then
(i, k) := arg mini,k P Ei,i+k
:= tour obtained from by inserting (i) after (i + k)
if E[L()] < E[L(BSF )] then
BSF :=
end if
else
return locally optimal tour
end if
end while

6.6. OVERVIEW OF THE RESULTS

125

Algorithm 17 1-shift-S(, BSF ) with fixed number of samples N


while (locally optimal tour not found and time is not over) do
GenerateSamples(N )
for (i = 1, 2, . . . , n) do
for (k = 1, . . . , n 2) do
compute SN Ei,i+k using equation (6.104)
end for
end for
if (mini,k SN Ei,i+k < 0) then
(i, k) := arg mini,k SN Ei,i+k
:= tour obtained from by inserting (i) after (i + k)
GenerateSamples(N )
Compute ESN [L()] and (re-)compute ESN [L(BSF )] using the last generated
samples
if ESN [L()] < ESN [L(BSF )] then
BSF :=
end if
else
return locally optimal tour
end if
end while
and useful only for the PTSP.
From the methodological point of view, we have presented, discussed, and exemplified in the case of the PTSP the three situations that one can face when
designing a local search for a SCOP. Namely: using the full exact evaluation of
the objective function for computing the move cost; finding exact and recursive
expressions for the move cost, which are faster to be computed; or considering
approximations of the move cost. In this last case, similarly to what happens for
the evaluation of one single solution, ad hoc and sampling approximations may
be considered.
After verifying the burden of one single run of local search when the full exact
evaluation of the objective function is used for computing the move cost, we can
say that, unless faster alternatives are found, it is not convenient to consider such
local search for integration in a metaheuristics. In fact, the metaheuristic itself
would be very much slowed, and it is very unlikely that the already quite good
results obtained without the local search would be improved in this situation.
For the specific case of the PTSP, it has been possible to derive exact recursive
and efficient move cost expressions for the 2-p-opt and 1-shift local search operators. This is a very good result, given that the recursive expressions allow the
exploration of the neighborhood in the same asymptotic time complexity as the

126

CHAPTER 6. LOCAL SEARCH


(deterministic) TSP. In general, when efficient expressions for the move cost are
obtained, it is very likely that, when combined with a metaheuristic, they will let
greatly improve its performance. We will verify this aspect in the next chapter,
dealing with hybrid ACO, that is, ACO integrated with local search.
We proposed two ad hoc and one Sampling approximation of the move cost for
the 1-shift local search operator. The main idea behind the two ad hoc approximations can be in principle exploited also in other SCOPs, since it is based on
estimating and neglecting the terms in the objective function that are smaller,
due to small probabilistic weights. The Sampling approximation that we propose is enhanced by some observations that allow to partially compute the move
cost in a recursive way, thus saving a lot of computation time. However, the
computational burden of the Sampling-based local search is bigger than ad hoc
approximations.

Chapter 7

Integration of Ant Colony


Optimization with local search
In this chapter, we focus on hybrid algorithms, that is, algorithms that solve an optimization problem by combining solution construction and local search techniques1 .
In particular, we consider hybrid algorithms obtained by integrating local search algorithms of Chapter 6 with the constructive ACO algorithms of Chapter 5 and with the
simple heuristics presented in Chapter 4.
In the previous chapters, we have proposed several solution construction algorithms
(5 simple constructive heuristics and a few ACO variants) and several variants of local search for the PTSP (2-p-opt and 1-shift with 4 different ways for computing the
move cost). In principle, any combination of these algorithms can be used to obtain
a new hybrid one that is hopefully better performing than the simple solution construction algorithm. However, it is impractical to test all the hybrid combinations
(potentially more than 30) and we have to make a selection guided by some specific
research question. The following are the main issues that we want to address by testing
experimentally some hybrid combinations of the previously proposed algorithms:
Issue 1: the advantage of using a local search with ACO. We have seen that
the design of a feasible local search algorithm for the PTSP has been more difficult than the design of an ACO algorithm. This situation can be quite often
encountered in SCOPs, where metaheuristics are usually easy to design, but the
same cannot be said for local search algorithms. Thus, we want to verify whether
the effort for the design of a good local search is balanced by a significant improvement in the solution quality, when the local search is used to hybridize an
easy-to-design metaheuristic (ACO in this context). Therefore, we want to quantify the improvement due to the use of local search with ACO, given a fixed
1

Note, however, that the term hybrid can also refer to other features of an algorithm. For instance,
in the work about the vehicle routing problem with stochastic demands (VRPSD) reported in Appendix
B, the term hybrid also refers to the fact of using inside the algorithm the objective function of other
problems. Thus, in that context, the use of ad hoc objective function approximations taken from other
problems is a type of hybridization.

127

128

CHAPTER 7. INTEGRATION OF ACO WITH LOCAL SEARCH


amount of time is available for solving each PTSP problem.

Issue 2: exact versus approximated local search. In designing a local search for
the PTSP (and for any SCOP in general), we have seen several possibilities for
computing the move cost (exact recursive computation, ad hoc approximations,
and the Sampling approximation). Each one of these alternatives needs more
or less stringent conditions on the structure of the SCOP, such as a particular
neighborhood structure, an explicitly known objective function, the possibility
of performing simulations. Therefore, when one faces the problem of designing a
local search for a particular SCOP, it may be useful to have an idea about which is
the most promising choice among the different alternatives. For this purpose, we
want to achieve a classification in terms of solution quality of the different local
search variants, because this can constitute a sort of guideline for the practitioner
who faces a SCOP different from the PTSP.
The remainder of this chapter is organized as follows. Section 7.1 presents the hybrid
algorithms selected for the investigation of the above issues. Experiments are described
in Section 7.2, and Section 7.3 summarizes the main results obtained.

7.1

Description of hybrid algorithms

In order to address the issues listed above, we have considered for hybridization only
the 1-shift local search, since it has been shown to be more promising than the 2-p-opt
(see the experimental results described in Section 6.4.4.3), and also because for 1-shift
we have developed the different move cost sampling schemes. Our main interest is
in analyzing the achievements of hybrid ACO, to which we have applied both exact
and approximated versions of the 1-shift local search, as we are going to describe in
Section 7.1.2. Nevertheless, we have also considered hybrid versions of the simple
constructive heuristics using the exact recursive 1-shift local search, as described in
Section 7.1.1. Similarly to the case of non-hybrid ACO, the performance of simple
heuristics is considered a sort of minimum quality level, that any metaheuristic should
be able to surpass.

7.1.1

Hybridization of simple constructive heuristics

We have considered hybridized versions of all the simple constructive heuristics introduced in Section 4.5, namely Space Filling Curve, Radial Sort, Farthest Insertion,
Nearest Neighbor, and the Random Search heuristic. For all these heuristics, except
Random Search, hybridization is done simply by applying just once the exact recursive
1-shift local search (Algorithm 14) to the single solution produced by the constructive
heuristic. In Random Search hybridization is done differently. There, the exact recursive 1-shift algorithm is applied after any random solution, until the fixed available
runtime is exhausted. Thus, in Random Search the local search is applied several times.
At the end, the best solution with respect to the PTSP objective function is returned.

7.2. EXPERIMENTAL ANALYSIS

7.1.2

129

Hybridization of ACO

In order to address Issue 1 previously described, we considered an hybridized version


of pACS with the exact recursive 1-shift local search (Algorithm 14). For addressing
Issue 2, we considered different hybridizations, namely pACS with 1-shift-P, pACS with
1-shift-T, and pACS-S with 1-shift-S (for a description of the move cost approximation
in 1-shift-P, -T, and -S variants, see Section 6.5).
The hybridization of pACS is done by applying the local search to each ant solution,
before the global pheromone update is performed, as shown in Algorithm 18.
Algorithm 18 hybrid pACS (pACS + 1-shift, pACS + 1-shift-T, pACS + 1-shift-P)
1: Initialization [like in pACS]
2: for iteration k = 1, 2, . . . do
3:
Initialize best ant solution BA
4:
for ant a = 1, 2, . . . , m do
5:
ConstructAntSolution [each ant constructs its solution a ]
6:
Apply either 1-shift(a , BSF ), 1-shift-T(a , BSF ), or 1-shift-P(a , BSF )
7:
if E[L(a )] < E[L(BA )] then
8:
set BA = a
9:
end if
10:
end for
11:
if E[L(BA )] < E[L(BSF )] then
12:
set BSF = BA
13:
end if
14:
GlobalPheromoneUpdate
15: end for

The hybridization of pACS-S has been done differently. In fact, in order to hybridize
pACS-S with 1-shift-S, we had to take into account the fact that the move cost of 1shift-S is much more computationally expensive than in 1-shift, 1-shift-T and 1-shift-P
(see Table 6.5.3). Preliminary experiments lead us to consider an hybridization where
local search is only applied to the best ant solution of each iteration, as shown in
Algorithm 19.

7.2

Experimental analysis

The experimental environment is the same as for the non hybrid ACO versions described
in Section 5.1.2.1. The parameters used for hybrid pACS and hybrid pACS-S are the
same as those used, respectively, in pACS and pACS-S, as described in Section 5.1.2.2
and Section 5.2.5.1.

130

CHAPTER 7. INTEGRATION OF ACO WITH LOCAL SEARCH

Algorithm 19 hybrid pACS-S (pACS-S + 1-shift-S)


1: Initialization [like in pACS]
2: for iteration k = 1, 2, . . . do
3:
Initialize best ant solution BA
4:
Set N = NumberOfSamples(k) [apply Equation (5.20)]
5:
for ant a = 1, 2, . . . , m do
6:
ConstructAntSolution [each ant constructs its solution a ]
7:
GenerateSamples(N )
8:
Compute ESN [L(a )] and re-compute ESN [L(BA )] using the last generated
samples
9:
if ESN [L(a )] < ESN [L(BA )] then
10:
set BA = a
11:
end if
12:
end for
13:
GenerateSamples(N )
14:
Re-compute ESN [L(BA )] and ESN [L(BSF )] using the last generated samples
15:
if ESN [L(BA )] < ESN [L(BSF )] then
16:
set BSF = BA
17:
end if
18:
1-shift-S(BA , BSF )
19:
GlobalPheromoneUpdate [using ij = (ESN [L(BSF )])1 ]
20: end for
21: Compute E[L(BSF )]

7.2.1

Advantage of using a local search with ACO

If we consider average results over the entire PTSP benchmark (Table 7.1), it appears
that pACS+1-shift goes nearer to the lower bound than pACS, but the amount of
improvement is just 0.1% (see first row of the Table). The advantage of pACS+1-shift
with respect to pACS is more relevant if we look at the average time for reaching the
best solution within the allowed computation time: pACS+1-shift requires on average
about 60% time less than pACS (again, see first row of Table 7.1). Table 7.1 also
shows that the simple heuristics, even when hybridized with the 1-shift local search,
do not reach the performance neither of pACS+1-shift, nor of pACS. The statistical
significance of the difference among algorithms has been tested by means of a Friedman
two-way analysis of variance by ranks [64]. The results of this type of test are reported
in Figure A.1 of Appendix A.
As we have learned from the experimental analysis of pACS in Section 5.1.2 however, the algorithmic performance may be very different for different average customers
probability. In particular, for pACS we arrived at the important conclusion that only
PTSP instances with average customers probability up to 0.5 are worth solving by
pACS, since for bigger customers probability the problem can be better treated like a
TSP. Is the situation the same for pACS+1-shift? A look at the left part of Figure

7.2. EXPERIMENTAL ANALYSIS

algorithm
pACS
FI
SFC
NN
RAD
RS

with 1-shift
% above LB time (seconds)
81.1%
1476.9
83.2%
86.9
93.1%
257.9
98.7%
293.1
231.2%
2074.7
667.4%
131.7

131
without 1-shift
% above LB time (seconds)
81.2%
2453.7
90.2%
0.1
111.6%
0.1
111.1%
35.5
388.8%
0.1
1228.6%
2712.0

Table 7.1: Aggregated results showing average values over all instances of the PTSP
benchmark.
7.1 tells us that the answer is positive. It is thus more fair to restrict the comparison between pACS+1-shift and other algorithms only to PTSP instances with average
customers probability smaller than or equal to 0.5. Comparative results restricted to
this subset of PTSP instances are reported in Table 7.2. From the Table and from a
Friedman two-way analysis of variance by ranks [64], it is clear that pACS+1-shift finds
statistically significant better results than all other algorithms (at a confidence level of
95%). In particular, the improvement with respect to pACS is 0.4%. Detailed results
are reported by Table A.3 in Appendix A.
We have also analyzed how the influence of using 1-shift in pACS depends on the
other factors characterizing PTSP instances (variance of the customers probability and
number of customers). The number-of-customers factor, which is shown in the right
plot of Figure 7.1 does not seem to be significant. The customers-probability-variance
factor is more interesting, and we have showed it in the central plot of Figure 7.1. From
the plot one can conclude that hybridization is more effective for instances with low
customers probability variance, while for instances with high probability variance, it
seems that pACS produces better results.

7.2.2

Exact versus approximated local search

The effect of hybridizing ACO with different approximated local search operators is
summarized in Table 7.3. The ranking of the four algorithms has been tested by the
Friedman two-way analysis of variance by ranks [64]. The statistical test tells us that
differences are significant at a confidence level of 95%, except for pACS+1-shift-P and
pACS+1-shift-T.
As expected, pACS+1-shift, which is based on the exact recursive local search, is
the best one. This is an important confirmation, since it means that the big effort done
in Section 6.4 for finding fast recursive expressions for the move cost has been worth.
Quite interestingly, the second classified hybrid ACO of Table 7.3 is the samplingbased algorithm pACS-S+1-shift-S. This fact is good news, as the sampling approximation can be applied to any SCOP, even when there is no analytical or closed-form
expression for the objective function, and so it is not possible to design any good ad
hoc approximation, or any fast recursive exact move cost expression. The performance

132

CHAPTER 7. INTEGRATION OF ACO WITH LOCAL SEARCH


X

best(pACS+1-shift)best(X)
best(X)

pACS
FI
NN
RAD
RS
SFC
FI+1-shift
NN+1-shift
RAD+1-shift
RS+1-shift
SFC+1-shift

(0.1 p 0.5)

-0.4%
-5.9%
-16.1%
-46.0%
-73.8%
-12.8%
-1.2%
-10.2%
-14.9%
-25.0%
-3.7%

Table 7.2: Average relative difference of the best solution found by pACS+1-shift with
respect to other algorithms (listed in column named X). Averages are restricted to
significant PTSP instances, that is, those with customers probability less then or equal
to 0.5.
of pACS-S+1-shift-S is not much worse than pACS+1-shift and pACS (taking into account Table 7.1), and it is superior to all the simple heuristics even when hybridized
with the exact 1-shift (again, see Table 7.1), which means that the sampling approximation is indeed quite effective.
Table 7.3 shows another interesting result: the bad performance of the two ad hoc
approximations, 1-shift-P and 1-shift-T. Their performance is bad also in comparison
to the simple pACS and to the FI heuristic hybridized with the exact 1-shift (see Table
7.1). This means that the quality of the move cost approximations P Ei,j and T Ei,j
is not sufficient, and the application of these approximated local searches to pACS is
only a time consuming task. Computation time would be much better exploited by the
solution construction mechanism of pACS.
The performance of the different hybrid ACO versions with respect to the value of
the optimal TSP solution is shown in Figure 7.2. Similarly to what emerged from Figure
7.1, there are some groups of PTSP instances for which pACS-S+1-shift-S performs
better than pACS+1-shift. This happens for probabilities higher than 0.4, and for
probability variance higher than 60%. Detailed results obtained by pACS-S+1-shift-S,
pACS+1-shift-T, and pACS+1-shift-P are reported, respectively, by Table A.4, A.5,
and A.6 in Appendix A. Appendix A also reports, in Figure A.1, the results of the
Friedman two-way analysis of variance by ranks done on all the hybrid and non-hybrid
versions of pACS, together with the simple heuristics and the optimal TSP solutions.

7.3

Overview of the results

Simple heuristics enhanced by local search do not reach the performance of the
simple, non-hybrid pACS. This is an additional confirmation that, even without

7.3. OVERVIEW OF THE RESULTS

133

0.05

0.2

0.4

pACS
pACS+1shift

0.6

0.8

0.05
0.00

(bestTSP)/TSP

prob

20

40

60

80

0.05

pACS
pACS+1shift

0.10

0.05

pACS
pACS+1shift

0.00

(bestTSP)/TSP

0.05

0.00

0.10

0.05

(bestTSP)/TSP

0.10

0.10

0.1 prob 0.5

0.10

0.1 prob 0.5

200

400

var

600

800

1000

Figure 7.1: Relative gain of pACS (with and without the 1-shift local search) over
the optimal TSP solution, versus, respectively, average customers probability (left),
variance of the customers probability (center), and number of customers. Points are
averages computed over the the complete PTSP benchmark (left) and over subset of
PTSP instances with customers probability smaller than or equal to 0.5 (center and
right). The length of error bars is equal to the standard deviation of the relative gain.

pACS+1-shift
pACS-S+1-shift-S
pACS+1-shift-P
pACS+1-shift-T

% above LB
81.1%
81.9%
87.3%
88.6%

time(seconds)
1476.9
1965.2
1995.1
1859.2

Table 7.3: Average results over the complete PTSP benchmark of different ACO versions, hybridized with exact and approximated 1-shift local search algorithms.
local search, the ACO metaheuristic obtains quite good results for the PTSP.
The hybridization of pACS with the exact recursive 1-shift local search leads
to an improvement of less than 1%. This is not impressive, but it should be
considered that the hybrid version converges faster to good solutions. In fact, the
time required for finding the best solution in the hybrid pACS is about 60% less
than in the non-hybrid version.
Interestingly, the exact 1-shift local search does not improve, but worsens the
solution quality of pACS, for PTSP instances characterized by very high variance
of the customers probability.
The sampling-based ACO, pACS-S and its hybrid version, is quite near to pACS
hybridized with the exact recursive local search (within 1% on average). This is
an interesting and good result, when we think that the Sampling approximation
is very general, and can be applied to any SCOP, particularly to those for which
an explicit expression for the objective function is not available. The result is also
not obvious, since the behavior of a local search with approximated move cost,

134

CHAPTER 7. INTEGRATION OF ACO WITH LOCAL SEARCH


0.1 prob 0.5

0.10

0.2

0.4

0.6
prob

0.8

0.00

0.05

(bestTSP)/TSP

0.05
0.00

(bestTSP)/TSP

pACS+1shift
pACS+1shiftP
pACSS+1shiftS
pACS+1shiftT

pACS+1shift
pACS+1shiftP
pACSS+1shiftS
pACS+1shiftT

20

40

60

80

var

0.10

0.05

0.10

0.05
0.00

0.05

(bestTSP)/TSP

pACS+1shift
pACS+1shiftP
pACSS+1shiftS
pACS+1shiftT

0.05

0.1 prob 0.5

200

400

600

800

1000

Figure 7.2: Relative gain of hybrid ACO algorithms over the optimal TSP solution,
versus, respectively, average customers probability (left), variance of the customers
probability (center), and number of customers (right). Points are averages computed
over the the complete PTSP benchmark (left) and over subset of PTSP instances with
customers probability smaller than or equal to 0.5 (center and right).
like 1-shift-S, could be in principle also quite bad.
The best performance of the sampling-based ACO is in correspondence of PTSP
instances with high variance of the customers probability, where pACS-S+1-shiftS outperforms both pACS and pACS+1-shift. This means that, for this class of
PTSP instances, the Sampling approximation could be chosen even for those
SCOPs, such as the PTSP, for which the exact objective function is available.
When ACO is hybridized with local search based on ad hoc approximations, its
performance can be bad, both with respect to non-hybrid ACO, and to simple heuristic hybridized with a good local search. Such bad performance of local
search based on ad hoc approximations reveals that ad hoc approximations should
be chosen very carefully, and that, whenever possible, it might be better to spend
some effort in searching for exact efficient recursive move cost expressions. When
this is impossible, it may still be better to put more effort in improving the performance of the non-hybrid metaheuristic, instead of considering the integration
of local search with ACO.

Chapter 8

Conclusions
In this thesis we have focused on SCOPs (Stochastic Combinatorial Optimization Problems), a wide class of combinatorial optimization problems under uncertainty, where part of the information about the problem data is unknown at the planning stage, but some knowledge about its probability distribution is assumed. Note, however, that the SCOP modeling approach is only one among many possible modeling approaches to optimization under uncertainty, such as the Pure Online, Robust, and Fuzzy approaches, which have been briefly described in the introduction but are otherwise not treated in this thesis.
The first part of the thesis has been devoted to the introduction and survey of the applications to SCOPs of metaheuristics for which there is a significant amount of literature. These include Ant Colony Optimization, Evolutionary Computation, Simulated Annealing, Tabu Search, and Stochastic Partitioning Methods. From the survey, two properties of metaheuristics emerge clearly: they are a valid alternative to exact classical methods for addressing real-sized SCOPs, and they are flexible, since they can be quite easily adapted to solve different SCOP formulations, both static and dynamic. In fact, there exist applications of metaheuristics to problems formalized as Stochastic, Chance Constraint, Two-stage and Multi-stage Integer Programs, and as Markov Decision Processes.
On the basis of the reviewed literature, we have identified three key issues in solving SCOPs via metaheuristics: (1) the design and integration of ad hoc, fast and effective objective function approximations inside the optimization algorithm; (2) the estimation of the objective function by sampling when no closed-form expression for the objective function is available, and the ways to deal with the time complexity and noise induced by this type of estimation; (3) the characterization of the efficiency of metaheuristic variants with respect to different levels of stochasticity in the problem instances. In particular, the following observations can be made by looking at the applications of metaheuristics to SCOPs.
(1) The design of ad hoc approximations is strongly problem dependent, and no
general rule exists for finding efficient approximations of the objective function. Examples of ad hoc approximations in the literature include: the use of the objective function
of a deterministic combinatorial optimization problem similar in some respects to the
SCOP considered; the use of truncated expressions for the expected values, obtained by neglecting terms that are estimated to be small; and the use of scenarios instead of the true probabilistic model. Ad hoc approximations, while they accelerate the evaluation and comparison of solutions, also introduce a systematic error in the computation of objective values. Usually, this systematic error cannot be reduced unless a different, more precise ad hoc approximation is designed, and it can only be assessed by comparison with the exact objective value. Thus, metaheuristics typically alternate exact and approximated evaluations during the optimization process.
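As a concrete illustration of this alternation, the following minimal C++ sketch (our own, with the solution type and the two evaluation callables as placeholders; it is not code from any implementation surveyed here) ranks a neighborhood with the cheap ad hoc approximation and accepts the apparent winner only after the exact objective confirms an improvement.

#include <limits>
#include <vector>

// Generic skeleton of the alternation between approximated and exact
// evaluations: the neighborhood is ranked with a cheap approximation,
// and the exact objective is used only to confirm the chosen candidate.
// Solution, ExactF and ApproxF are placeholders for the application's
// solution type and its two evaluation functions.
template <typename Solution, typename ExactF, typename ApproxF>
bool improveWithApproximation(Solution& incumbent,
                              const std::vector<Solution>& neighbors,
                              ExactF exact, ApproxF approx)
{
    double bestApprox = std::numeric_limits<double>::infinity();
    const Solution* best = nullptr;
    for (const Solution& candidate : neighbors) {
        const double a = approx(candidate);  // fast, but systematically biased
        if (a < bestApprox) { bestApprox = a; best = &candidate; }
    }
    // The exact evaluation filters the systematic error of the
    // approximation: accept only a confirmed improvement.
    if (best != nullptr && exact(*best) < exact(incumbent)) {
        incumbent = *best;
        return true;
    }
    return false;
}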
(2) When the estimation of the objective function by sampling is used, the decision whether one solution is better than another can only be taken on statistical grounds, and the comparison result is correct only with a certain probability. The way the sampling approximation is used in a metaheuristic largely depends on the way solutions are compared and on how the best solution among a set of solutions is selected (the selection-of-the-best method). The selection-of-the-best method that a metaheuristic uses for performing sample averages and for comparing solutions can have a great impact on the effectiveness of the algorithm, but it is still hard to say which method is the most effective in relation to the metaheuristic in which it is employed.
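To make the role of the selection-of-the-best method concrete, the following C++ fragment sketches one possible comparison scheme; it is our own illustration, not a method prescribed by the surveyed literature. The callable realizedCost is a placeholder for the application's stochastic cost, and re-seeding both generators identically evaluates the two solutions on the same sampled scenarios (the common random numbers idea), which reduces the variance of the comparison.

#include <cstdint>
#include <random>

// Minimal sketch of one selection-of-the-best scheme: solutions are
// compared through sample averages of the realized cost. CostF is any
// callable mapping (solution, rng) to one realized cost.
template <typename Solution, typename CostF>
bool sampledBetter(const Solution& x, const Solution& y,
                   CostF realizedCost, int samples, std::uint64_t seed)
{
    double meanX = 0.0, meanY = 0.0;
    for (int s = 0; s < samples; ++s) {
        std::mt19937_64 rngX(seed + s);  // same scenario stream for x ...
        std::mt19937_64 rngY(seed + s);  // ... and for y
        meanX += realizedCost(x, rngX) / samples;
        meanY += realizedCost(y, rngY) / samples;
    }
    // The answer is correct only with a certain probability, which
    // grows with the number of samples.
    return meanX < meanY;
}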
(3) It has been recognized that completely different algorithms may be needed for small and for large search spaces. The degree of randomness, that is, the size of the noise compared to the undisturbed objective function values, is also an important factor. It cannot be expected that a metaheuristic variant working well for solution spaces with a small amount of noise will also perform optimally for solution spaces with a large amount of noise, and vice versa. A characterization of metaheuristic variants for SCOPs with respect to their appropriate domains of problem instance types still remains to be elaborated.
The second part of the thesis describes a case study in which we have investigated the three key issues introduced above, focusing in particular on a SCOP belonging to the class of vehicle routing problems: the PTSP (Probabilistic Traveling Salesman Problem). This problem is NP-hard, and it can also be formalized as a Stochastic Integer Program. Since no commonly used and satisfactory benchmark existed for the PTSP, we have generated our own. It consists of 432 carefully designed PTSP instances, conceived in particular to allow the analysis of the behavior of optimization algorithms for different levels of stochasticity. In order to facilitate future comparisons with our results, we have also used a known bound from the literature to compute a lower bound on the optimal solution values for the instances of our PTSP benchmark. The PTSP, besides having a practical interest, has many features that make it useful as a test problem for algorithms addressing SCOPs: a simple formulation, the analogy with the well-known (deterministic) Traveling Salesman Problem, and, most importantly, a closed-form expression for the exact objective function. Thus, it is likely that it will be considered again in the literature, and for this reason we think that the creation of a benchmark of instances for the PTSP is a useful instrument for future research.
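To illustrate this last feature, the closed-form objective can be evaluated in O(n^2) time. The fragment below is our own C++ rendering of the classical expression, not the thesis implementation: the arc between tour positions i and j contributes its length weighted by the probability that both endpoints require a visit while every customer between them does not.

#include <vector>

// Expected length of an a priori PTSP tour, evaluated with the classical
// closed-form expression in O(n^2): the arc (tour[i], tour[j]) is traveled
// exactly when customers tour[i] and tour[j] are present and all customers
// between them on the tour are absent. d is the distance matrix and p[v]
// the probability that customer v requires a visit.
double ptspExpectedLength(const std::vector<int>& tour,
                          const std::vector<std::vector<double>>& d,
                          const std::vector<double>& p)
{
    const int n = static_cast<int>(tour.size());
    double expected = 0.0;
    for (int i = 0; i < n; ++i) {
        double skip = 1.0;  // probability that all intermediates are absent
        for (int r = 1; r < n; ++r) {
            const int j = (i + r) % n;
            expected += d[tour[i]][tour[j]] * p[tour[i]] * p[tour[j]] * skip;
            skip *= 1.0 - p[tour[j]];  // tour[j] becomes an intermediate
        }
    }
    return expected;
}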
For the PTSP, we have concentrated first on the design of the ACO (Ant Colony Optimization) metaheuristic, second on the design of efficient local search algorithms, and third on the integration of local search with ACO and with other heuristics (hybridization). In the following, we summarize the main results and highlight their importance and limitations.
Our simplest implementation of ACO, which we have called pACS, obtains, without the use of local search, results that can be considered good. In fact, pACS is better than the Farthest Insertion and the Space Filling Curve heuristics, both enhanced by local search, which had long been considered the best available algorithms for the PTSP.
We have evaluated how the performance of pACS depends on the level of stochasticity of an instance. In our benchmark, two factors can be considered as expressions of the level of stochasticity: the average and the variance of the customers probability of requiring a visit. When there is no stochasticity (that is, when the customers probability is 1 and the variance is 0), the PTSP reduces to the classical deterministic Traveling Salesman Problem (TSP); we have therefore compared the results of pACS with those obtained by an algorithm which solves the TSP to optimality. We have found that there is a critical probability below which PTSP instances are really worth solving by pACS, while above it the problem is better treated as a TSP. The average critical probability is 0.5, but in general, the higher the number of customers, the lower the critical probability. Moreover, the higher the variance of the customers probability, the larger the gain of pACS with respect to the optimal TSP solution.
We have also considered two versions of pACS that exploit two ad hoc approximations of the PTSP objective function. One of these approximations is the objective function of the corresponding TSP; the other is based on neglecting some small terms of the exact objective function. We have shown that the performance of pACS using an ad hoc approximation is strongly correlated with the absolute error of the approximation, and less related to the linear correlation between exact and approximated evaluations. This fact has important consequences: when designing ad hoc approximations for a SCOP, one could be tempted to choose one which, from preliminary experiments, seems well correlated with the exact objective function. But this choice could be, as we have seen, quite bad, and we would suggest considering instead the absolute error of the approximation. However, it is important to stress that the goal of our analysis of ad hoc approximations applied to ACO was not to enhance its performance, since this result has already been achieved by others [51, 52], showing that in some cases an ad hoc approximation can accelerate convergence without significantly worsening the solution quality.
The sampling approximation of the objective function has also been considered in a version of pACS, with the goal of assessing how much worse its performance is with respect to the original pACS algorithm. We found results consistently worse than those of pACS by roughly 2%, but the qualitative behavior with respect to the factors determining the stochasticity of PTSP instances is the same as for the pACS algorithm based on the exact objective.
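For concreteness, the scenario evaluation underlying such a sampling-based algorithm can be sketched as follows; this is an illustrative C++ fragment, not the actual pACS-S code. Each customer is drawn independently with its own probability, and the a priori tour restricted to the present customers is measured; the average over many such draws is an unbiased estimate of the exact objective.

#include <random>
#include <vector>

// One realized tour cost under the sampling approximation of the PTSP
// objective: every customer independently requires a visit with
// probability p[v]; absent customers are skipped and the remaining ones
// are visited in the a priori order.
double sampledTourLength(const std::vector<int>& tour,
                         const std::vector<std::vector<double>>& d,
                         const std::vector<double>& p,
                         std::mt19937_64& rng)
{
    std::uniform_real_distribution<double> u(0.0, 1.0);
    std::vector<int> present;
    for (int v : tour)
        if (u(rng) < p[v]) present.push_back(v);
    const int m = static_cast<int>(present.size());
    if (m < 2) return 0.0;  // at most one customer: nothing to travel
    double length = 0.0;
    for (int i = 0; i + 1 < m; ++i)
        length += d[present[i]][present[i + 1]];
    length += d[present[m - 1]][present[0]];  // close the tour
    return length;
}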
One of the nicest results of this thesis is the design of powerful local search algorithms for the PTSP that do not rely on any objective function approximation. In fact, it has been possible to derive exact, recursive, and efficient cost expressions for two local search operators, the 2-p-opt and the 1-shift. This is a very good result, given that the recursive expressions allow the exploration of the neighborhood of a solution in the same asymptotic time complexity as for the (deterministic) TSP, and that, without recursion, the same task would require two orders of magnitude more time. A limitation of this result is that it is useful only for the PTSP, since it strongly depends on the structure of its objective function. The experimental evaluation of local search has been restricted to the most promising operator, the 1-shift. The integration of pACS with the exact recursive 1-shift local search leads to an average improvement of less than 1%. This is not impressive, but it should be considered that the hybrid version converges faster to good solutions. In fact, the time required for finding the best solution in pACS with local search is about 60% less than without local search.
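For the reader unfamiliar with the operator, the 1-shift move itself is simple to state; the contribution of the thesis lies in evaluating its cost recursively, which the sketch below deliberately does not reproduce. A minimal C++ fragment of the move, assuming the a priori tour is stored as a vector of customer indices and that i and j are zero-based positions:

#include <algorithm>
#include <vector>

// 1-shift move: the customer at position i is removed from the tour and
// re-inserted at position j; the customers between the two positions are
// shifted by one place. This is only the move operator, not the recursive
// cost evaluation derived in Chapter 6.
void oneShift(std::vector<int>& tour, int i, int j)
{
    if (i == j) return;
    if (i < j)
        // element i travels right to position j, segment (i, j] shifts left
        std::rotate(tour.begin() + i, tour.begin() + i + 1, tour.begin() + j + 1);
    else
        // element i travels left to position j, segment [j, i) shifts right
        std::rotate(tour.begin() + j, tour.begin() + i, tour.begin() + i + 1);
}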
In considering the effect of ad hoc approximations on local search, we have observed that, when pACS is hybridized with local search based on two different ad hoc approximations, its performance can be quite bad, both with respect to non-hybrid ACO and to a simple heuristic hybridized with a good local search. Such poor performance of local search based on ad hoc approximations reveals that ad hoc approximations should be chosen very carefully, and that, whenever possible, it is better to spend some effort in searching for exact and efficient recursive move cost expressions. When this is impossible, it may still be better to put more effort into improving the performance of the non-hybrid metaheuristic, instead of considering the integration of local search with ACO.
We have also considered a hybrid version of pACS which only uses evaluations of the objective function and of the local search moves based on the sampling approximation of the objective function. This algorithm is quite good, since it achieves, on average, results within 1% of the best hybrid pACS algorithm using the exact objective function. Interestingly, for PTSP instances with a high variance of the customers probability, the sampling-based hybrid pACS is the best one. This means that, for this class of PTSP instances, the sampling approximation could be chosen on purpose, even if the exact objective function is available.

Appendix A

Detailed computational results of ACO algorithms for the PTSP
This appendix first presents, in Figure A.1, a complete ranking scheme of most of the algorithms analyzed in this thesis. It then reports six tables of detailed computational results for two ACO algorithms described in Chapter 5 (pACS and pACS-S) and four hybrid ACO algorithms described in Chapter 7 (pACS+1-shift, pACS-S+1-shift-S, pACS+1-shift-T, and pACS+1-shift-P). The hybrid ACO algorithms combine pACS and pACS-S with three versions of the 1-shift local search presented in Chapter 6.
Experiments have been run on a machine with two Intel(R) Xeon(TM) 1.70 GHz processors, running the GNU/Linux Debian operating system (kernel 2.4.27). All algorithms have been coded in C++ under the same development framework. Each algorithm has been run once on each PTSP instance for a computation time equal to n^2/100 CPU seconds, where n is the number of customers of the PTSP instance (for example, 100 seconds for the 100-customer instances and 10000 seconds for the 1000-customer ones).
The format of the tables is as follows. Each table consists of three groups of six columns. The first of the six columns contains the name of the PTSP instance, from which it is possible to extract information such as the corresponding TSP instance with the customers coordinates, the number of customers, and the average and variance of the customers probability. For detailed information on the instances of the PTSP benchmark, see Section 4.3. Since, as we have seen in Chapter 5, above the critical probability the PTSP is better treated by TSP-specific algorithms, our results are particularly interesting for instances with average probability smaller than the critical one. For this reason, we report only results for instances with average probability up to 0.5 (which corresponds to the average critical probability). The second of the six columns reports the PTSP objective function value of the best solution found. The third and fourth columns contain, respectively, the iteration k_b and the time t_b at which the best solution was found. Finally, the fifth and sixth columns contain, respectively, the total number of iterations performed, k_tot, and the total time, t_tot, allowed to the algorithm.
[Figure A.1 appears here. Ranking of algorithms on the PTSP benchmark, from best to worst. According to the Friedman two-way analysis of variance by ranks [64], the algorithms have statistically significantly different ranks, at a confidence level of 95%, when they are not linked by a vertical line on the left of the plot.

(a) Ranking on PTSP instances with low customers probability (0.1 ≤ prob ≤ 0.5): pACS+1-shift, FI+1-shift, pACS, pACS-S+1-shift-S, SFC+1-shift, TSP, RAD+1-shift, pACS+1-shift-P, pACS+1-shift-T, RS+1-shift, FI, NN+1-shift, SFC, NN, RAD, RS.

(b) Ranking on PTSP instances with high customers probability (0.6 ≤ prob ≤ 0.9): TSP, pACS-S+1-shift-S, pACS, pACS+1-shift, FI+1-shift, pACS+1-shift-P, pACS+1-shift-T, FI, NN+1-shift, SFC+1-shift, NN, RAD+1-shift, SFC, RS+1-shift, RAD, RS. Note that in this case no algorithm generates significantly better solutions than the optimal TSP solutions.]
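For reference, the test statistic behind this ranking is the standard Friedman statistic (textbook material, stated here for convenience rather than taken from the thesis): with k algorithms ranked on each of b benchmark instances, and R_j the sum of the ranks obtained by algorithm j,

\chi^2_F = \frac{12}{b\,k\,(k+1)} \sum_{j=1}^{k} R_j^2 - 3\,b\,(k+1),

which, under the null hypothesis that all algorithms perform equally, is approximately chi-square distributed with k - 1 degrees of freedom.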

[Table A.1 appears here: Detailed results of pACS for PTSP instances with 0.1 ≤ p ≤ 0.5. For each instance the table reports the objective value of the best solution found (best), the iteration k_b and time t_b at which it was found, and the total number of iterations k_tot and total time t_tot.]
[Table A.2 appears here: Detailed results of pACS-S for PTSP instances with 0.1 ≤ p ≤ 0.5, in the same format as Table A.1.]
[Table A.3 appears here: Detailed results of pACS+1-shift for PTSP instances with 0.1 ≤ p ≤ 0.5, in the same format as Table A.1.]
[Table A.4 appears here: Detailed results of pACS-S+1-shift-S for PTSP instances with 0.1 ≤ p ≤ 0.5, in the same format as Table A.1.]
[Table A.5 appears here: Detailed results of pACS+1-shift-T for PTSP instances with 0.1 ≤ p ≤ 0.5, in the same format as Table A.1.]

[Table A.6 — Detailed results of pACS+1-shift-P for PTSP instances with 0.1 ≤ p ≤ 0.5. Columns: instance name, best solution value (best), iteration and time at which the best solution was found (k_b, t_b), total iterations and total runtime (k_tot, t_tot).]

Appendix B

Hybrid Metaheuristics for the Vehicle Routing Problem with Stochastic Demands
This section analyzes the performance of metaheuristics on the vehicle routing problem with stochastic demands (VRPSD). The problem is known to have a computationally demanding objective function, and for this reason the optimization of large instances may be an infeasible task. Fast approximations of the objective function are therefore appealing, because they allow for an extended exploration of the search space. We explore the hybridization of metaheuristics by means of two objective functions which are approximate measures of the exact solution quality. Particularly helpful for some metaheuristics is the objective function derived from the TSP, a closely related problem. In the light of this observation, we analyze possible extensions of the metaheuristics which take the hybridized VRPSD-TSP solution approach even further, and we report on experimental results on different types of instances. We show that, for the instances tested, two hybridized versions of iterated local search and of an evolutionary algorithm attain better solutions than state-of-the-art algorithms.

B.1 Introduction

Vehicle routing problems (VRPs) concern the transport of items between depots and customers by means of a fleet of vehicles. Solving a VRP means finding the best set of routes servicing all customers and respecting operational constraints, such as vehicle capacity, time windows, and drivers' maximum working time. VRPs are a key issue in supply-chain and distribution systems today, and they are becoming increasingly complex. For this reason there is an ever increasing interest in routing models that are dynamic, stochastic, rich in constraints, and thus have more and more complex objective functions.
Models that focus particularly on the stochasticity of information are mainly known
in the literature as Stochastic VRPs (SVRPs), and the problem we are addressing in

this section belongs to this class. In SVRPs, elements of the problem such as the set of customers visited, the customers' demands, or the travel times are modeled as stochastic variables with known probability distributions, and the objective function is usually the expected cost of the planned routes. Note, however, that in SVRPs one needs to specify not only the concept of planned routes, but also the way planned routes are to be modified in response to the realization of the stochastic information.
A common feature of SVRPs is that they all have at least one deterministic counterpart, which is the VRP that one obtains by considering zero-variance probability distributions for the stochastic elements of the problem. SVRPs are thus NP-hard problems, like most VRPs. An important point that increases the difficulty of SVRPs is that they have an objective function (expected cost) which is much more computationally expensive than that of their deterministic counterparts. For this reason, a key issue in solving SVRPs by heuristics and metaheuristics is the use of fast and effective objective function approximations that may accelerate the search process. Due to the analogy between stochastic and deterministic VRPs, a reasonable choice for the objective function approximation of a given SVRP is the objective of the corresponding deterministic problem.
In this section, we investigate the use of objective function approximations derived from deterministic problems in the context of the VRPSD. This is an NP-hard problem which, despite its quite simple formulation, arises in practice in many real-world situations. One example is garbage collection, where it is indeed impossible to know a priori how much garbage has to be collected at each place. Another example where the demand is uncertain is the delivery of petrol to petrol stations. In fact, when a customer issues the order, it is still unknown how much the station will sell in the time between the order and the delivery.
In the VRPSD, a vehicle of finite capacity leaves from a depot with full load and has to serve a set of customers whose exact demand is only known on arrival at each customer's location. A planned route in this context is very simple: a tour starting from the depot and visiting all customers exactly once; this is also called an a priori tour, and it will be addressed as such in the remainder of the section. The a priori tour is a sort of skeleton that fixes the order in which customers will be served, but the actual route the vehicle travels would include return trips to the depot for replenishment when needed. The points at which return trips are performed are, in general, stochastic. The objective function to be minimized is the expected cost of the a priori tour.
Due to the nature of the a priori tour, a feasible solution for the VRPSD may
also be seen as a feasible solution for a traveling salesman problem (TSP) on the
set of customers (depot included). Moreover, if the vehicle has infinite capacity, the
consequent VRPSD is, in fact, a TSP. Due to these analogies, a natural approximation
of the VRPSD objective function is the length of the a priori tour.
In this section we consider basic implementations of five metaheuristics: simulated
annealing [129], tabu search [91], iterated local search [137], ant colony optimization
[72] and evolutionary algorithms [11]. Our main goal is to test the impact on metaheuristics of interleaving the exact VRPSD objective function with the a priori tour


length as an approximation of it. This mixture changes the search landscape during the search for good quality solutions, and can be seen as an innovative type of hybridization of a metaheuristic's search process that has not yet been explored in the literature. In particular, we investigate two types of hybridization: first, we consider a local search algorithm (OrOpt) for which a quite good approximation of the exact VRPSD objective function is available, and we compare metaheuristics using this underlying local search by applying both the VRPSD approximation and the TSP approximation. Second, we further exploit the TSP analogy by choosing the 3-opt local search operator, which is very good for the TSP, but for which there is no immediate VRPSD approximation.
The remainder of this section is organized as follows. In Section B.2 we give the formal description of the VRPSD, we describe in detail the objective function, the state of the art for the VRPSD, and the relevant aspects of generating a benchmark of instances for this problem, taking into account the existing literature. Section B.3 describes at a high level the metaheuristics and the other algorithms analyzed. Section B.4 reports details about the tested instances, the parameters used for the metaheuristics, and the computation times allowed. Sections B.5 and B.6 describe the computational experiments on the first type of hybridization (the use of the TSP objective function in OrOpt) and on the second type of hybridization (the use of the 3-opt local search with the TSP objective function), respectively. Section B.7 summarizes the main conclusions that can be drawn from the experimental results.

B.2 The Vehicle Routing Problem with Stochastic Demands

The VRPSD is defined on a complete graph G = (V, A, D), where V = {0, 1, ..., n} is a set of nodes (customers) with node 0 denoting the depot, A = {(i, j) : i, j ∈ V, i ≠ j} is the set of arcs joining the nodes, and D = {d_{ij} : i, j ∈ V, i ≠ j} are the travel costs (distances) between nodes. The cost matrix D is symmetric and satisfies the triangle inequality. One vehicle with capacity Q has to deliver goods to the customers according to their demands, minimizing the total expected distance traveled, given that the following assumptions are made. Customers' demands are stochastic variables ξ_i, i = 1, ..., n, independently distributed with known distributions. The actual demand of each customer is only known when the vehicle arrives at the customer's location. It is also assumed that ξ_i does not exceed the vehicle's capacity Q, and follows a discrete probability distribution p_{i,k} = Prob(ξ_i = k), k = 0, 1, 2, ..., K ≤ Q. A feasible solution to the VRPSD is a permutation of the customers s = (s(1), s(2), ..., s(n)) starting at the depot (that is, s(1) = 0), and it is called an a priori tour. The vehicle visits the customers in the order given by the a priori tour, and it has to choose, according to the actual customers' demands, whether to proceed to the next customer or to go to the depot for restocking. Sometimes the choice of restocking is the best one, even if the vehicle is not empty, or if its capacity is bigger than the expected demand of the next scheduled customer; this action is called preventive restocking. The goal of preventive restocking is to avoid the risk of having a vehicle without enough load to serve a customer and


thus having to perform a back-and-forth trip to the depot for completing the delivery
at the customer.
The expected distance traveled by the vehicle (that is, the objective function) is computed as follows. Let s = (0, 1, ..., n) be an a priori tour. After the service completion at customer j, suppose the vehicle has a remaining load q, and let f_j(q) denote the total expected cost from node j onward. With this notation, the expected cost of the a priori tour is f_0(Q). If L_j represents the set of all possible loads that a vehicle can have after service completion at customer j, then f_j(q) for q ∈ L_j satisfies

    f_j(q) = min{ f_j^p(q), f_j^r(q) },                                            (B.1)

where

    f_j^p(q) = d_{j,j+1} + Σ_{k: k ≤ q} f_{j+1}(q − k) p_{j+1,k}
                         + Σ_{k: k > q} [ 2 d_{j+1,0} + f_{j+1}(q + Q − k) ] p_{j+1,k},   (B.2)

    f_j^r(q) = d_{j,0} + d_{0,j+1} + Σ_{k=1}^{K} f_{j+1}(Q − k) p_{j+1,k},               (B.3)

with the boundary condition f_n(q) = d_{n,0} for all q ∈ L_n. In (B.2)–(B.3), f_j^p(q) is the expected cost corresponding to the choice of proceeding directly to the next customer, while f_j^r(q) is the expected cost in case preventive restocking is chosen. As shown by Yang et al. in [183], the optimal choice is of threshold type: given the a priori tour, for each customer j there is a load threshold h_j such that, if the residual load after serving j is greater than or equal to h_j, then it is better to proceed to the next planned customer; otherwise it is better to go back to the depot for preventive restocking. This property of the VRPSD is illustrated by Figure B.1. The computation of f_0(Q) runs in O(nKQ) time; the memory required is O(nQ) if one is interested in memorizing all intermediate values f_j(q), for j = 1, 2, ..., n and q = 0, 1, ..., Q, and O(Q) otherwise. Algorithm 20 is an implementation of the recursion (B.2)–(B.3) for the computation of f_0(Q) and of the thresholds.
According to the above definition, the VRPSD is a single-vehicle routing problem. Note that there would be no advantage in considering a multiple-vehicle problem by allowing multiple a priori tours, since, as proved by Yang et al. [183], the optimal solution is always a single tour.
The literature about VRPSD and SVRPs in general is quite rich. Formulations of
SVRPs include the Traveling Salesman Problem with Stochastic Customers (TSPSC),
the Traveling Salesman Problem with Stochastic Travel Times (TSPST), the Vehicle
Routing Problem with Stochastic Customers (VRPSC), the Vehicle Routing Problem
with Stochastic Customers and Demands (VRPSCD). For a survey on the early approaches to these problems, see [87] and [29]; for a more recent survey, especially on
mathematical programming approaches used for SVRPs, see [128]. In the following we
summarize the main contributions to solving the VRPSD and similar problems that are relevant to this section.


Figure B.1: The bold line is the cost function f_j(q), that is, the total expected cost from node j onward. The quantity q is the residual capacity of the vehicle just after having serviced customer j. The function f_j^p(q) would be the expected cost in case the vehicle proceeded directly to customer j + 1 without first going to the depot for replenishment. The function (constant in q) f_j^r(q) would be the expected cost in case the vehicle always went to the depot for restocking before going to customer j + 1. The figure shows that if the residual capacity q is under the threshold h_j, then it is more convenient to restock; otherwise it is more convenient to proceed to the next customer.

Algorithm 20 Computation of the VRPSD objective function f_0(Q)

for q = Q, Q−1, ..., 0 do
    f_n(q) = d_{n,0}
end for
for j = n−1, n−2, ..., 1 do
    compute f_j^r using f_{j+1}(·) (by means of Equation (B.3))
    for q = Q, Q−1, ..., 0 do
        compute f_j^p(q) (by means of Equation (B.2))
        compare f_j^r and f_j^p(q) to find the threshold h_j
        compute f_j(q) using f_{j+1}(·) (by means of Equation (B.1))
    end for
end for
compute f_0(Q)
return f_0(Q)
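To make the recursion concrete, the following is a minimal Python sketch of Algorithm 20 under the assumptions stated above (the tour is relabeled to (0, 1, ..., n), and demands are given as per-customer discrete distributions); function and variable names are our own illustrative choices, not the thesis implementation:

```python
def vrpsd_expected_cost(d, p, Q):
    """Expected cost f_0(Q) of the a priori tour (0, 1, ..., n) under the
    preventive-restocking policy, together with the thresholds h_j.

    d: (n+1) x (n+1) symmetric distance matrix; node 0 is the depot.
    p: p[j][k] = Prob(demand of customer j equals k), for j = 1, ..., n
       (p[0] is unused); all demands are assumed to be at most Q.
    """
    n = len(d) - 1
    f_next = [d[n][0]] * (Q + 1)          # boundary: f_n(q) = d_{n,0} for all q
    thresholds = {}
    for j in range(n - 1, 0, -1):
        # restocking cost f_j^r, Equation (B.3); constant in q
        # (the k = 0 term, worth p[j+1][0] * f_{j+1}(Q), is included here)
        f_r = d[j][0] + d[0][j + 1] + sum(
            pk * f_next[Q - k] for k, pk in enumerate(p[j + 1]))
        f_j = [0.0] * (Q + 1)
        h_j = Q + 1                        # Q + 1 would mean "always restock"
        for q in range(Q, -1, -1):
            # cost f_j^p(q) of proceeding directly, Equation (B.2)
            f_p = d[j][j + 1]
            for k, pk in enumerate(p[j + 1]):
                if k <= q:
                    f_p += pk * f_next[q - k]
                else:                      # route failure: back-and-forth trip
                    f_p += pk * (2 * d[j + 1][0] + f_next[q + Q - k])
            if f_p <= f_r:                 # threshold property of Yang et al.
                h_j = q
            f_j[q] = min(f_p, f_r)         # Equation (B.1)
        thresholds[j] = h_j
        f_next = f_j
    # first leg: leave the depot with a full load towards customer 1
    f0 = d[0][1] + sum(pk * f_next[Q - k] for k, pk in enumerate(p[1]))
    return f0, thresholds
```

The double loop over q and over the demand values k makes the O(nKQ) time bound of the text directly visible.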


Jaillet [118, 119] and Jaillet-Odoni [120] derive analytic expressions for the computation of the expected length of a solution for the Probabilistic Traveling Salesman Problem and variations of it.

Bertsimas [25] proposes the cyclic heuristic for the VRPSD, by adapting to a stochastic framework one of the heuristics presented by Haimovitch and Rinnooy Kan [106] in a deterministic context; later, Bertsimas et al. [26] improve this heuristic by applying dynamic programming to supplement the a priori tour with rules for selecting return trips to the depot, similar to the preventive restocking strategy; their computational experience suggests that the two versions of the cyclic heuristic provide good quality solutions when customers are randomly distributed on a square region.

Gendreau, Laporte and Seguin [86] present an exact stochastic integer programming method for the VRPSCD (the same method can be applied to the VRPSD as well); by means of the integer L-shaped method [131] they solve instances with up to 46 and 70 customers and 2 vehicles, for the VRPSCD and the VRPSD, respectively; in [88], they also develop a tabu search algorithm called TABUSTOCH for the same problem; this algorithm is to be employed when instances become too large to be solved exactly by the L-shaped method.

Teodorovic and Pavkovic [175] propose a Simulated Annealing algorithm for the multi-vehicle VRPSD, with the assumption that no more than one route failure is allowed during the service of each vehicle.
Gutjahr [100] applies S-ACO to the Traveling Salesman Problem with Time Windows, in the case of stochastic service times. S-ACO is a simulation-based Ant Colony Optimization algorithm that computes the expected cost of a solution (the objective function) by Monte Carlo sampling.
Secomandi [162, 163] applies Neuro-Dynamic Programming techniques to the VRPSD; he addresses the VRPSD with a re-optimization approach, where, each time the exact demand of a customer becomes known, the a priori tour planned over the not-yet-served customers is completely re-planned; this approach may find solutions with a lower expected value with respect to the preventive restocking strategy, but it is much more computationally expensive; moreover, the a priori tour may be completely different from the actual tour followed by the vehicle, and this situation is often seen as a disadvantage by companies.
Yang et al. [183] investigate the single- and multi-vehicle VRPSD; the latter is obtained by imposing that the expected distance traveled by each vehicle does not exceed a given value; the authors test two heuristic algorithms, route-first-cluster-next and cluster-first-route-next, which separately solve the problem of clustering the customers that must be served by different vehicles and the problem of finding the best route for each cluster; both algorithms seem to be efficient and robust for small size instances, as shown by comparisons with branch-and-bound solutions to instances with up to 15 customers. The authors also adapt to the stochastic case (both single- and multi-vehicle VRPSD) the OrOpt local search due to Or [147], by proposing a fast approximate computation of the change in objective function value of a solution modified by a local search move.
The OrOpt local search and objective function approximation are used here as

B.2. THE VRPSD

153
Sk

i+1

i+k

i+k+1

j+1

Figure B.2: How an a priori tour is modified after performing an OrOpt move, where
the set of consecutive customers Sk (here, k = 3) is moved forward in the tour.
building blocks of our metaheuristics, therefore we will describe them in detail in the
following subsection.

B.2.1 The OrOpt local search

A basic move of the OrOpt local search, as suggested by Yang et al. in [183], works as follows. Given a starting a priori tour, sets S_k of k consecutive customers, with k ∈ {1, 2, 3}, are moved from one position to another in the tour, as in Figure B.2. In the following we describe the two types of approximation schemes used for the computation of the move cost. The first one, which we call VRPSD, is the one proposed in [183], and the second one, which we call TSP, only computes the length change of the a priori tour.
B.2.1.1 VRPSD approximation scheme

The move cost is computed in two stages: i) compute the saving from extracting the set of customers from the tour; ii) compute the cost of inserting it back somewhere else in the tour. Let i and i+k+1 be the nodes immediately preceding, respectively following, S_k in the tour, and let j be the node immediately after which S_k is to be inserted, as shown in Figure B.2. Here, we assume that j is after i in the a priori tour. Let f_i(q) and f_{i+k+1}(q) be the expected cost-to-go from nodes i, respectively i+k+1, onward before the extraction of S_k. Apply one dynamic programming recursion step starting with cost vector f_{i+k+1}(·) at node i+k+1 back to node i, without considering the sequence S_k. Let f_i′(·) be the resulting cost vector at node i, that is, after extracting S_k from the tour. Then, define the approximate extraction saving as a simple average over q of f_i(q) − f_i′(q). The computation of the approximate insertion cost of S_k between nodes j and j+1 in the tour is done analogously, if we assume that the insertion point (node j) is after the extraction point (node i). Let f_j(q) be the cost-to-go at node j before inserting S_k, and f_j″(q) be the cost-to-go at node j after inserting S_k. The total approximate cost of an OrOpt move is computed by subtracting the approximate extraction saving from the approximate insertion cost, as follows:

    Δ_VRPSD = ( Σ_{q=0}^{Q} [ (f_j″(q) − f_j(q)) − (f_i(q) − f_i′(q)) ] ) / (Q + 1).   (B.4)


Note that the cost vectors are assumed to be already available from the computation of the expected cost of the starting tour; thus, they do not need to be computed when evaluating Equation (B.4). The only computations that must be done here are the evaluation of the cost vectors f_i′(·) and f_j″(·), requiring O(KQ) time, and the average of Equation (B.4), requiring O(Q) time. Therefore, with the proposed VRPSD approximation, the cost of an OrOpt move can be computed in O(KQ) time. Although it is possible that tours which are worsening with respect to the exact evaluation function are accepted because they are recognized as improving by the approximate evaluation, in practice this approximation scheme behaves quite well. For a deeper discussion of the issues related to this scheme we refer the reader to the original paper [183].
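As an illustration, here is a minimal Python sketch of the average in Equation (B.4), assuming the four cost-to-go vectors are already available as lists indexed by the residual load q (the argument names are our own):

```python
def oropt_move_cost_vrpsd(f_i, f_i_prime, f_j, f_j_second, Q):
    """Approximate OrOpt move cost of Equation (B.4).

    f_i, f_j: cost-to-go vectors at the extraction node i and insertion
    node j before the move; f_i_prime: at node i after extracting S_k;
    f_j_second: at node j after inserting S_k. Each has length Q + 1.
    A negative return value means the move is (approximately) improving.
    """
    insertion_cost = sum(f_j_second[q] - f_j[q] for q in range(Q + 1))
    extraction_saving = sum(f_i[q] - f_i_prime[q] for q in range(Q + 1))
    return (insertion_cost - extraction_saving) / (Q + 1)
```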
B.2.1.2 TSP approximation scheme

In the TSP approximation scheme, the cost of an OrOpt move coincides with the difference between the length of the tour after the move and before the move:

    Δ_TSP = d_{i,i+k+1} + d_{j,i+1} + d_{i+k,j+1} − d_{i,i+1} − d_{i+k,i+k+1} − d_{j,j+1},   (B.5)

where, as before, i and j are the extraction, respectively insertion, points of a string of k consecutive customers (see Figure B.2). Clearly, Δ_TSP is computable in constant time.
The OrOpt neighborhood examination follows the same scheme proposed in [183], illustrated by Algorithm 21. Briefly, all possible sequences of length k ∈ {1, 2, 3} are considered for insertion in a random position of the tour after the extraction point. Then, only the best move among those of length k is chosen. The best move is the one corresponding to the most negative move cost, which is computed by Equation (B.4) in the VRPSD approach and by Equation (B.5) in the TSP approach.

B.2.2 Benchmark

In the literature there is no commonly used benchmark for the VRPSD, therefore
we have generated our own testbed. We have tried to consider instances which are
interesting from different points of view, by controlling four factors in the generation
of instances: customer position, capacity over demand ratio, variance of the stochastic
demand, and number of customers.
Instances may be divided into two groups, uniform and clustered, according to the
position of customers. In uniform instances, the position of customers is chosen uniformly at random on a square of fixed size. In clustered instances, coordinates are
chosen randomly with normal distributions around a given number of centers. This results in clusters of nodes, a typical situation for companies serving customers positioned
in different cities.
The ratio between the total (average) demand of the customers and the vehicle's capacity is an important factor that influences the difficulty of a VRPSD instance [86]. The bigger the ratio, the more difficult the instance. Here, the vehicle capacity Q is chosen as follows:

    Q = ⌈ (total average demand) · r / n ⌉,   (B.6)


Algorithm 21 OrOpt(p), with p ∈ {VRPSD, TSP}

1: let k = 3 and let s be an initial route
2: compute the expected cost of route s by means of Algorithm 20
3: for all S_k, a set of successive nodes from s, do
4:     if p = VRPSD then
5:         compute Δ_VRPSD by means of Equation (B.4)
6:     else if p = TSP then
7:         compute Δ_TSP by means of Equation (B.5)
8:     end if
9: end for
10: if none of the sets S_k corresponds to a negative cost then
11:     go to Step 15
12: else
13:     select the set S_k that results in the most negative cost and perform the associated OrOpt move
14: end if
15: if k = 1 then
16:     stop
17: else
18:     decrease k by 1, and go to Step 3
19: end if
where the parameter r may be approximately interpreted as the average number of
served customers before restocking.
Each customer's demand is an integer stochastic variable uniformly distributed on an interval. The demand interval for each customer i is generated using two parameters: the average demand D_i and the spread S_i, so that the possible demand values for customer i are the 2S_i + 1 integers in the interval [D_i − S_i, D_i + S_i]. The spread is a measure of the variance of the demand of each customer.
The particular parameters used to generate the test instances for our experiments
are reported in Section B.4.
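To make the generation procedure concrete, here is a minimal Python sketch of how a uniform instance could be produced under the conventions of Section B.4 (square [0, 99]^2, depot at (1, 1), capacity from Equation (B.6)); all names and defaults are our own illustrative choices:

```python
import math
import random

def generate_uniform_instance(n, r=4, low_spread=True, size=99):
    """Uniform VRPSD instance with n customers on [0, size]^2."""
    # depot at (1, 1); customer positions uniform in the square
    coords = [(1.0, 1.0)] + [(random.uniform(0, size), random.uniform(0, size))
                             for _ in range(n)]
    # average demand D_i drawn with equal probability from [1, 49] or [50, 100]
    D = [random.randint(1, 49) if random.random() < 0.5
         else random.randint(50, 100) for _ in range(n)]
    # spread S_i: low spread in [1, 5], high spread in [10, 20]
    S = [random.randint(1, 5) if low_spread
         else random.randint(10, 20) for _ in range(n)]
    # vehicle capacity, Equation (B.6), with r customers served per trip
    Q = math.ceil(sum(D) * r / n)
    # demand of customer i is uniform on the 2*S_i + 1 integers of the interval
    intervals = [(D[i] - S[i], D[i] + S[i]) for i in range(n)]
    return coords, intervals, Q
```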

B.3 The metaheuristics

We aim at an unbiased comparison of the performance of five different well-known metaheuristics for this problem. In order to obtain a fair and meaningful analysis of the results, we have restricted the metaheuristic approaches to the use of the common OrOpt local search. In the following we briefly and schematically describe the main principles of each metaheuristic and give the details of the implementations for the VRPSD.



Simulated Annealing (SA)
    Determine initial candidate solution s
    Set initial temperature T according to annealing schedule
    While termination condition not satisfied:
        Probabilistically choose a neighbor s′ of s
        If s′ satisfies probabilistic acceptance criterion (depending on T):
            s := s′
        Update T according to annealing schedule

The initial temperature T_i is given by the average cost of a sample of 100 solutions of the initial tour, multiplied by a factor α; every β·n iterations the temperature is updated by T ← ρ·T (standard geometric cooling); after γ·n iterations without improvement, the temperature is increased by adding T_i to the current value; the solution considered for checking improvements is the best one since the last re-heating.
Tabu Search (TS)
    Determine initial candidate solution s
    While termination criterion is not satisfied:
        Choose the best neighbor s′ of s that is either non-tabu or
        satisfies the aspiration criterion, and set s := s′
        Update tabu attributes based on s′

The neighborhood of the current solution s is explored. The non-tabu neighbors are considered for the selection of the next current solution. If the value of a tabu neighbor is better than that of the best found solution, then this neighbor is also considered for selection (aspiration criterion). The best considered neighbor, i.e., the one with the lowest value, is selected. In order to avoid cycling around the same set of visited solutions, we found it convenient to use a variable neighborhood: for any current solution s, instead of considering the whole neighborhood, we choose a subset of it according to a probability distribution. We experimentally verified that this variable neighborhood is able to avoid cycles and to explore a larger part of the search space.
Iterated Local Search (ILS)
    Determine initial candidate solution s
    Perform local search on s
    While termination criterion is not satisfied:
        r := s
        Perform perturbation on s
        Perform local search on s
        Based on acceptance criterion, keep s or revert to s := r

The perturbation consists in a sampling of n neighbors according to the 2-opt exchange neighborhood [124]; each new solution is evaluated by the exact cost function (Algorithm 20), and if a solution is found whose cost is smaller than the cost of the best solution found so far plus ε, the sampling ends; otherwise, the best solution obtained during the sampling is returned; the acceptance criterion keeps s if it is the best solution found so far.
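As a sketch of this sampling step in Python (the helpers evaluate and random_two_opt are placeholders for the exact cost function of Algorithm 20 and for a random 2-opt move, respectively):

```python
def ils_perturbation(s, n, best_cost, eps, evaluate, random_two_opt):
    """Sample n random 2-opt neighbors of s; stop early as soon as one has
    cost below best_cost + eps, otherwise return the best sampled one."""
    best_s, best_c = None, float("inf")
    for _ in range(n):
        cand = random_two_opt(s)   # random 2-opt move applied to a copy of s
        c = evaluate(cand)         # exact VRPSD objective (Algorithm 20)
        if c < best_cost + eps:
            return cand, c         # early exit on a promising neighbor
        if c < best_c:
            best_s, best_c = cand, c
    return best_s, best_c
```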
Ant Colony Optimization (ACO)
    Initialise weights (pheromone trails)
    While termination criterion is not satisfied:
        Generate a population sp of solutions by a
        randomised constructive heuristic
        Perform local search on sp
        Adapt weights based on sp

Pheromone trails are initialised to τ_0; sp solutions are generated by a constructive heuristic and refined by local search; a global update rule is applied r times; sp solutions are then constructed by using the information stored in the pheromone matrix; after each construction step a local update rule is applied to the element τ_{i,j} corresponding to the chosen customer pair: τ_{i,j} = (1 − ψ)·τ_{i,j} + ψ·τ_0, with ψ ∈ [0, 1]; after local search, weights are again updated by the global update rule: τ_{i,j} = (1 − ρ)·τ_{i,j} + ρ·q/C^bs, with ρ ∈ [0, 1] and C^bs the cost of the best-so-far solution (note that heuristic information is only used in the initialisation phase).
Evolutionary Algorithm (EA)
    Determine initial population sp
    While termination criterion is not satisfied:
        Generate a set spr of solutions by recombination
        Generate a set spm of solutions from spr by mutation
        Perform local search on spm
        Select population sp from solutions in sp, spr, and spm

At each iteration two solutions are chosen among the best ones to generate a new solution spr through Edge Recombination [182] (a tour is generated using edges present in the two parent tours, whenever possible); the mutation swaps adjacent customers (without considering the depot) with probability p_m; finally, the solution improved by local search replaces the worst solution in the population.
The initial solution(s) for all metaheuristics are obtained by the Farthest Insertion constructive heuristic [125]; it builds a tour by choosing as next customer the not-yet-visited customer which is farthest from the current one. Here, we consider a randomised version of this heuristic (RFI), which picks the first customer at random and, after the tour has been completed, shifts the starting customer to the depot.

In order to have a reference algorithm for the comparison among metaheuristics (see Section B.4), we also implemented a simple random restart algorithm which uses the RFI heuristic plus local search and restarts every time a local optimum is found, until the termination condition is reached; the best solution found among all the restarts is picked as the final solution; we call this algorithm RR.

B.4 Experimental Setup

Here we report two computational experiments with two different goals: i) we analyse the performance of the metaheuristics to test the hypothesis of a relation between the TSP and the VRPSD; ii) we study the performance of enhanced versions of the best algorithms found in the first set of experiments. Moreover, we relate these results to parameters of the instances.
We consider instances with 50, 100 and 200 customers and with customers uniformly distributed or grouped in clusters. In uniform instances, the position of customers is chosen uniformly at random in the square [0, 99]^2. Clustered instances are generated with two clusters, with centres randomly chosen in the square [0, 99]^2. The variance of the normal distribution used to generate customers' coordinates around the centres is equal to (0.8/√n) · (max. coordinate). This corresponds to variance values of about 11, 8, and 6, for instances with 50, 100, and 200 customers, respectively. The position of the depot is fixed at (1, 1). The average number of customers served before restocking is kept fixed at 4, thus yielding ratios of total demand over vehicle capacity in the range from 12 to 50. Typical values for this ratio in the VRPSD literature are below 3; however, higher values are closer to the needs of real contexts [34]. In all instances, the average demands D_i at each customer i are taken with equal probability from [1, 49] or [50, 100]. With regard to demand spread, instead, instances may be divided into two groups: low spread instances, in which each customer i has S_i chosen at random in [1, 5], and high spread instances, in which each customer i has S_i chosen at random in [10, 20]. For each combination of size, distribution of customers and demand spread, 75 instances were generated, making a total of 900 instances.
The metaheuristic parameters were chosen in order to guarantee robust performance over all the different classes of instances; preliminary experiments suggested the following settings:
SA: α = 0.05, ρ = 0.98, β = 1, and γ = 20;

TS: p_nt = 0.8 and p_t = 0.3;

ILS: ε = n/10;

ACO: m = 5, τ_0 = 0.5, ρ = 0.3, ψ = 0.1, q = 10^7, and r = 100;

EA: sp_m = 10, p_m = 0.5.
Given the results reported in [43, 44], we decided to perform only one run for each metaheuristic on each instance.¹ The termination criterion for each algorithm was set to a time equal to 30, 120 or 470 seconds for instances of 50, 100 or 200 customers, respectively. Experiments were performed on a cluster of 8 PCs with AMD Athlon(tm) XP 2800+ CPUs running the GNU/Linux Debian 3.0 OS, and all algorithms were coded in C++ under the same development framework.

¹In [43] it is formally proved that if a total of N runs of a metaheuristic can be performed for estimating its expected performance, the best unbiased estimator, that is, the one with the least variance, is the one based on a single run on each of N randomly sampled (and therefore typically distinct) instances.
In order to compare results among different instances, we normalised the results with respect to the performance of RR. For a given instance, we denote by c_MH the cost of the final solution of a metaheuristic MH, by c_RFI the cost of the solution provided by the RFI heuristic, and by c_RR the cost of the final solution provided by RR; the normalised value is then defined as

    Normalized Value for MH = (c_MH − c_RR) / (c_RFI − c_RR).   (B.7)
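A small Python helper expressing Equation (B.7), with a worked example on invented costs:

```python
def normalized_value(c_mh, c_rr, c_rfi):
    # Equation (B.7): negative values mean MH beats random restart (RR);
    # the unit is the gap between the RFI starting heuristic and RR.
    return (c_mh - c_rr) / (c_rfi - c_rr)

# e.g. RFI cost 1000, RR cost 900, MH cost 850  ->  -0.5
assert normalized_value(850, 900, 1000) == -0.5
```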

Besides providing a measure of performance independent of the different instance hardness, this normalisation method gives an immediate evaluation of the minimal requirement for a metaheuristic; it is reasonable to require that a metaheuristic performs at least better than RR within the computation time under consideration.

B.5 First hybridization: using approximate move costs in local search

The main goal of this first experiment is to see whether approximating the exact but computationally demanding objective with the fast-to-compute length of the a priori tour is convenient or not. Our hypothesis is that the speedup due to the use of a fast approximation of the objective is an advantage especially during the local search phase, when many potential moves must be evaluated before one is chosen.

In order to test the advantage of a speedup across the metaheuristics, we apply to each of them the OrOpt local search described in Section B.2.1, and we test two versions of each metaheuristic according to the type of approximation scheme used in the local search: VRPSD-approximation or TSP-approximation. This setup allows us to use statistical techniques for a systematic experimental analysis. In particular, we consider a two-way factor analysis, where metaheuristic and type of objective function are factors and instances are considered as blocks [66].
The first plot of Figure B.3 is the boxplot of the results over all instances after the normalisation. Negative values indicate an improvement over random restart, and the larger the absolute value, the larger the improvement. It emerges that, on average, ILS, EA and TS are able to do better than RR, while SA and ACO perform worse. We check whether these results are also statistically significant. The assumptions for a parametric analysis are not met, hence we rely on non-parametric methods.
The central plot of Figure B.3 shows the interaction between the two factors, metaheuristic and approximation scheme. Interaction plots give an idea of how different combinations of metaheuristic and approximation scheme affect the average normalised result. The lines join the average values for each metaheuristic. If the lines are not parallel, there is an interaction effect, that is, metaheuristics perform differently with different approximation schemes. From the plot, we see that a certain interaction effect is present between the two factors; hence, it would be appropriate to report the

effects of one factor separately for each level of the other. In the third plot of Figure B.3 we report, however, both the VRPSD and the TSP approximation schemes together, but we distinguish the effects of this factor on each metaheuristic. The plot presents the simultaneous confidence intervals for the all-pairwise comparison of the algorithms. Interval widths are obtained by the Friedman two-way analysis of variance by ranks [64], after rejecting the hypothesis that all algorithms perform the same. The difference between two algorithms is statistically significant at a confidence level of 5% if their intervals do not overlap.
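For reference, this kind of test can be reproduced in Python with SciPy (the numbers below are invented placeholders, not data from the experiments):

```python
from scipy import stats

# rows: blocks (instances); one array per algorithm with its normalised results
sa  = [0.1, -0.3, 0.0, -0.2, 0.2]
ils = [-0.5, -0.6, -0.4, -0.7, -0.5]
ts  = [-0.2, -0.1, -0.3, -0.4, -0.2]
stat, p = stats.friedmanchisquare(sa, ils, ts)
print(f"Friedman chi-square = {stat:.3f}, p-value = {p:.4f}")
```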
From the interaction and the all-pairwise comparison plots of Figure B.3 we can conclude the following.

The improvements of EA, ILS, and TS over RR are statistically significant.

The presence of an interaction effect between metaheuristic and approximation scheme shows that EA, ILS and ACO perform better with the TSP-approximation, while the opposite holds for TS and SA. While EA, ILS and ACO use the local search as a black box, TS and SA employ their own strategy for examining the neighborhood in the local search. This result indicates that, to become competitive, these two methods require a good approximation of the objective function.

The metaheuristics which perform best are EA, ILS and TS. Furthermore, EA and ILS take significant advantage of the TSP-approximation scheme.

B.6 Second hybridization: further exploiting the TSP analogy

Given the results in the previous section, it is reasonable to investigate what happens if we exploit the hybridization based on the TSP objective function even more. For this purpose, we consider one of the best performing state-of-the-art TSP metaheuristics, and we observe that it is based on iterated local search with the 3-opt local search operator [170]. We therefore hybridize the best algorithms determined in the previous section (ILS, EA) with the 3-opt local search for the TSP. We do not hybridize TS, instead, since we observed that it exhibits better results with the VRPSD-approximation than with the TSP-approximation. The new algorithms that we consider are the following.
TSP-soa If the solutions found by solving the TSP were comparable in quality to those found by our metaheuristics, there would be no point in investigating algorithms specific to the VRPSD. We therefore consider a transformation of the problem into a TSP by focusing only on the coordinates of the customers (depot included). We then solve the TSP with a state-of-the-art TSP algorithm [170] and shift the TSP solution found so that it starts with the depot, thus yielding a VRPSD solution. This solution is finally evaluated by the VRPSD objective function (Algorithm 20). The algorithm for the TSP is an iterated local search algorithm that uses a 3-opt exchange neighborhood and a double bridge move as perturbation, i.e., it removes four randomly chosen edges from the current tour and adds four other edges, also chosen at random, that close the tour.
ILS-hybrid It is obtained by modifying the acceptance criterion in TSP-soa, that is, the best solution according to Algorithm 20 is chosen rather than the best TSP solution.
EA-hybrid It is derived from EA-tsp by replacing the local search with a 3-opt local search based on the TSP objective function. The recombination and mutation operators are kept unchanged, since these are deemed the components that determined the success of EA in Figure B.3. The selection operator also remains unchanged, and is based on the VRPSD objective function computed by Algorithm 20.
In order to have a comparison with previous methods from the VRPSD literature, we also test the effective CYCLIC heuristic [25], which we implemented as follows: solve a TSP over the n customers plus the depot by using the TSP-soa algorithm; consider the n a priori tours obtained by the cyclic permutations s_i = (0, i, i+1, ..., n, 1, ..., i−1), i = 1, 2, ..., n; evaluate the n a priori tours by the VRPSD objective function (Algorithm 20) and choose the best one.
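A minimal Python sketch of this CYCLIC evaluation step, assuming a TSP tour is already available as a list of node labels and that vrpsd_expected_cost implements the objective of Algorithm 20 (both names are illustrative):

```python
def cyclic_heuristic(tsp_tour, vrpsd_expected_cost):
    """Evaluate the n cyclic rotations of a TSP tour and keep the best.

    tsp_tour: list of node labels containing the depot 0 exactly once.
    vrpsd_expected_cost: maps an a priori tour starting at the depot
    to its expected VRPSD cost (Algorithm 20).
    """
    # rotate the tour so the customer sequence starts after the depot,
    # then evaluate every cyclic shift s_i of that sequence
    start = tsp_tour.index(0)
    customers = tsp_tour[start + 1:] + tsp_tour[:start]
    n = len(customers)
    best_tour, best_cost = None, float("inf")
    for i in range(n):
        tour = [0] + customers[i:] + customers[:i]
        cost = vrpsd_expected_cost(tour)
        if cost < best_cost:
            best_tour, best_cost = tour, cost
    return best_tour, best_cost
```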
For the comparison of these algorithms we designed an experimental analysis based on a single-factor layout with blocks [66]. Since the assumptions for parametric tests are again violated, we use the Friedman two-way analysis of variance by ranks.

In Figures B.4, B.5 and B.6 we present the simultaneous confidence intervals after rejecting the hypothesis that all algorithms perform the same. We distinguish results by presenting the instances grouped according to the three main features defined in Section B.4. Since the analysis is repeated on the same data, we adjust the family-wise level of confidence for each test by dividing it by a factor of three [165].
The main indication arising from the results is that a mere TSP algorithm is not sufficient to produce high quality solutions for the VRPSD instances under any of the circumstances tested. VRPSD problem-specific algorithms, which take into account the stochasticity of the problem, always yield better solutions. Nevertheless, the best performances are achieved by hybridization of the TSP and VRPSD approaches. EA-hybrid and ILS-hybrid in many cases perform statistically better than all the other algorithms and, when significant, the differences are also practically relevant. On the contrary, the difference between these two approaches is statistically significant in only one case. The comparison with the CYCLIC heuristic indicates that the algorithms we presented improve considerably on previously known approaches to the VRPSD.
Considering the characteristics of the instances, the position of customers seems not to have an impact on the rank of the algorithms. On the contrary, the different positions occupied by the algorithms in the ranks with respect to instance size indicate that the larger the instance, the higher the importance of speed over precision in the


evaluation of neighbors in the local search. This expected result is exemplified by the worsening of the average rank of EA-tsp as size increases.

Finally, the spread of demand also has a considerable influence. With small spread it is convenient to consider the costs of preventive restocking, and thus the use of Algorithm 20 is appropriate. With large spread, on the contrary, the higher uncertainty about the right point in the tour for preventive restocking makes the evaluation of a solution by means of Algorithm 20 not far from that provided by a TSP evaluation, with the negative impact of a higher computational cost.

B.7 Conclusions

The main contribution of this section is the study of the hybridization of objective function approximations in five well-known metaheuristics to solve the VRPSD. In particular, we introduced a TSP-approximation, which uses the length of the a priori tour as an approximation of the exact but computationally demanding VRPSD evaluation. We showed its superiority with respect to a known VRPSD-approximation. More precisely, first we considered a local search algorithm (OrOpt) for which a good approximation of the exact VRPSD objective function is available, and we compared metaheuristics using this local search by applying both the VRPSD-approximation and the TSP-approximation. Secondly, we exploited the TSP analogy further, by choosing the 3-opt local search operator, which is very good for the TSP, but for which there is no immediate VRPSD approximation.

With the first type of hybridization, we have shown that metaheuristics using the local search as a black box (EA, ILS and ACO) perform better when using the TSP-approximation, while metaheuristics that use their own strategy for examining the neighborhood in the local search perform better with the more precise but more computationally expensive VRPSD-approximation. With the second type of hybridization, based on the use of the 3-opt local search, we considerably improved the performance of the best metaheuristics for the VRPSD, and we were unable to discover significant differences between the two best algorithms, EA and ILS.

All the metaheuristics implemented found better solutions than the CYCLIC heuristic (which is known from the literature to perform well on different types of instances) and than solving the problem as a TSP. This latter point is important because it demonstrates that the stochasticity of the problem and the finite demand over capacity ratio are not negligible, and that the development of VRPSD-specific algorithms is needed. We proposed a TSP-VRPSD hybrid approach that clearly outperforms the current state-of-the-art.


Figure B.3: Aggregate results over all 900 instances. From top to bottom: boxplot of normalised results; interaction plot for the two factors, metaheuristic and objective function approximation scheme; error bar plot for the simultaneous confidence intervals in the all-pairwise comparison. The boxplot is restricted to values in [−10.5, 5]; a few outliers fell outside this interval. In the error bar plot, ranks range from 1 to 12, the number of algorithms considered in the experiment.



Figure B.4: All-pairwise comparisons by means of simultaneous confidence intervals on uniform and clustered instances.


Figure B.5: All-pairwise comparisons by means of simultaneous confidence intervals on instances with different demand spread.



Figure B.6: All-pairwise comparisons by means of simultaneous confidence intervals on instances with different numbers of customers.


