Extended Empirical Analysis: An analysis of communication policies for homogeneous multi-colony ACO algorithms through experimental design

by Max Manfrin, Thomas Stützle, Mauro Birattari, and Marco Dorigo
March 2008

Table of Contents

Abstract
Introduction
Algorithms
Preliminary experiments
Treatments with no LS
Treatments with 2-opt LS
Treatments with 3-opt LS
Scalability tests
Test on unseen instances

Abstract:

The increasing availability of parallel hardware encourages the design and the adoption of parallel algorithms. Maximizing the degree to which a parallel platform is capable of efficiently utilizing increased computing resources is the challenge to be faced. In this article, we present a study in which, by means of a design of experiments approach, we analyze the impact that communication policies have on the solution quality reached by a parallel homogeneous multi-colony ACO algorithm for the traveling salesman problem. We adopt a full factorial design, empirically testing the treatments on a distributed-memory parallel architecture, and analyzing the results with a multi-factor analysis of variance. We consider several factors that influence the performance of a multi-colony ACO algorithm: the number of colonies, migration frequency schedules, communication strategies on different interconnection topologies, and the usage of local search. Conclustions are drawn about the performance of the different policies. It is shown that the adopted policy has a large influence on the actual performance of the algorithm.

Keywords: ant colony optimization, parallel programming, traveling salesman problem, communication policies, message passing

Introduction:

We present a study of various communication policies for a parallel homogeneous multi-colony MAX-MIN Ant System (MMAS) by means of a design of experiment approach. The idea of the multi-colony approach is to have several colonies running in parallel that exchange information to intensify the search for solutions toward promising regions of the search space, preserving, at the same time, the tendency of each colony to explore the search space differently. We use factorial experiments based on a full factorial design, extensively testing the parallel variants on the TRAVELING SALESMAN PROBLEM (TSP) using message passing libraries and a distributed-memory parallel architecture; we analyze the impact of the factor studied on solution quality with a multi-factor analysisi of variance (ANOVA).

Algorithms:

We modify version 1.0 of ACOTSP, adding the quadrant nearest neighbors heuristic with 20 neighbors and the MPI code to implement the communication among different colonies.
For the policies we consider several factors: the number of colonies (2 levels: 4 and 8 colonies of 25 ants each), migration frequency schedules (2 levels: fixed and increasing), communication strategies on different interconnection topologies (4 levels: unidirectional ring [R] over the ring topology, hypercube [HC] over the hypercube topology, fully-connected [FC] and replace-worst [RW] over the fully-connected topology), and usage of local search or not (3 levels: no local search, 2-opt local search, and 3-opt local search).

All cited factors result in 2*2*4*3=48 different treatments.

The response variable is the percentage error from the known optimal cost.

To evaluate the quality of the parallel variants we compare them with the parallel independent runs [PIR] approach. Moreover, we also test the single colony MMAS algorithm (SEQ) running for k-times number of iterations as the parallel algorithms, where k=4 or 8 depending on the number of colonies.

The naming convention for the treatments is the following: a number, indicating how many identical colonies are used (4 or 8), is followed by the communication strategy label of the algorithm (PIR, R, RW, HC, FC, SEQ), and then by a letter indicating the adopted migration frequency schedule (f for the fixed one, i for the increasing one); finally a number indicating the adopted local search (0 for no local search, 2 for the 2-opt local search, and 3 for the 3-opt local search). Please note that the algorithms implementing PIR and SEQ strategies do not have in their names the migration frequency schema's letter. Moreover, the initial number for the algorithms implementing the SEQ strategis, identifies the multiplier factor for the number of iterations. For example the label 4RWf2 identifies a treatmnet with 4 identical colonies, a replace-worst strategy on a fully-connected topology, adopting fixed-frequency migration schedule and a 2-opt local search; while the label 8PIR3 identifies an algorithm with 8 identical colonies, a parallel independent runs strategy on an isolated topology and 3-opt local search. The label 4SEQ0 identifies a treatment with 1 colony that runs for 4 times the number of iterations of the parallel treatments and no local search.

Preliminary experiments:

To assess the impact of the local search component on the final solution quality in our implementationof MMAS, we compare SEQ0, SEQ2, and SEQ3 on four TSPLIB instances: lin318, rat783, nrw1379, and pr2392. In order to obtain enough statistical evidence, 30 runs are performed for each algorithm on each instance. To conduct a fair comparison, due to the different time complexities of the three algorithms, we use a stopping criterion based on maximum CPU time, instead of a maximum number of iterations. The output of all the experiments for each algorithm is available in the following table:

TSPLIB Instance	Algorithms
`lin318`	SEQ0 (132KB), SEQ2 (32KB), SEQ3 (14KB)
`rat783`	SEQ0 (257KB), SEQ2 (78KB), SEQ3 (40KB)
`nrw1379`	SEQ0 (450KB), SEQ2 (171KB), SEQ3 (88KB)
`pr2392`	SEQ0 (188KB), SEQ2 (256KB), SEQ3 (112KB)

[1] "Instance lin318"

Pairwise comparisons using Wilcoxon rank sum test

SEQ0 SEQ2
SEQ2 3.6e-12 -
SEQ3 3.6e-12 0.33

P value adjustment method: holm

[1] "Instance rat783"

Pairwise comparisons using Wilcoxon rank sum test

SEQ0 SEQ2
SEQ2 3.0e-11 -
SEQ3 2.8e-11 2.8e-11

P value adjustment method: holm

[1] "Instance nrw1379"

Pairwise comparisons using Wilcoxon rank sum test

SEQ0 SEQ2
SEQ2 9e-11 -
SEQ3 9e-11 9e-11

P value adjustment method: holm

[1] "Instance pr2392"

Pairwise comparisons using Wilcoxon rank sum test

SEQ0 SEQ2
SEQ2 <2e-16 -
SEQ3 <2e-16 <2e-16

P value adjustment method: holm

TSPLIB Instance	Boxplot of normalized results (click to download eps format)
`lin318`
`rat783`
`nrw1379`
`pr2392`

Treatments with no local search:

In a first set of experiments, we compare those treatments with the local search factor set to no local search. The experiments are carried out on 7 TSPLIB instances: kroA100, eil101, kroA200, lin318, pcb442, att532, and rat783. In order to We use a stopping criterion based on a maximum number of iterations: 10000. The output of all the experiments for each algorithm is available in the following table:

CPUs	TSPLIB Instance	Algorithms
CPUs	TSPLIB Instance	xFCf0	xHCf0	xRWf0	xRf0	xFCi0	xHCi0	xRWi0	xRi0	xPIR0	xSEQ0
4	`kroA100`	326 KB	344 KB	343 KB	321 KB	392 KB	388 KB	393 KB	389 KB	358 KB	94 KB
	`eil101`	288 KB	296 KB	292 KB	283 KB	329 KB	350 KB	334 KB	325 KB	278 KB	75 KB
	`kroA200`	603 KB	614 KB	607 KB	577 KB	760 KB	757 KB	740 KB	749 KB	663 KB	177 KB
	`lin318`	806 KB	810 KB	723 KB	745 KB	972 KB	1003 KB	909 KB	932 KB	842 KB	233 KB
	`pcb442`	888 KB	869 KB	792 KB	806 KB	1.02 MB	1.03 MB	985 KB	1008 KB	866 KB	244 KB
	`att532`	1.36 MB	1.35 MB	1.18 MB	1.25 MB	1.48 MB	1.6 MB	1.47 MB	1.46 MB	1.36 MB	386 KB
	`rat783`	1.53 MB	1.52 MB	1.29 MB	1.44 MB	1.65 MB	1.71 MB	1.65 MB	1.66 MB	1.51 MB	441 KB

8	`kroA100`	633 KB	671 KB	634 KB	647 KB	790 KB	798 KB	799 KB	776 KB	712 KB	94 KB
	`eil101`	551 KB	558 KB	544 KB	552 KB	659 KB	678 KB	652 KB	652 KB	555 KB	76 KB
	`kroA200`	1.1 MB	1.16 MB	1.11 MB	1.1 MB	1.43 MB	1.49 MB	1.44 MB	1.43 MB	1.3 MB	180 KB
	`lin318`	1.46 MB	1.48 MB	1.31 MB	1.4 MB	1.86 MB	1.89 MB	1.83 MB	1.84 MB	1.63 MB	242 KB
	`pcb442`	1.61 MB	1.69 MB	1.4 MB	1.54 MB	2 MB	2.08 MB	1.91 MB	1.95 MB	1.69 MB	257 KB
	`att532`	2.52 MB	2.7 MB	2.13 MB	2.44 MB	2.89 MB	3.09 MB	2.89 MB	2.95 MB	2.71 MB	410 KB
	`rat783`	2.92 MB	2.96 MB	2.35 MB	2.77 MB	3.22 MB	3.43 MB	3.19 MB	3.31 MB	3.03 MB	470 KB

TSPLIB Instance	Solution quality over iterations (click to download eps format)
TSPLIB Instance	Algorithms
`kroA100`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`eil101`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`kroA200`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`lin318`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`pcb442`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`att532`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0
`rat783`	4FCf0, 8FCf0	4HCf0, 8HCf0	4RWf0, 8RWf0	4Rf0, 8Rf0	4FCi0, 8FCi0	4HCi0, 8HCi0	4RWi0, 8RWi0	4Ri0, 8Ri0

TSPLIB Instance	Boxplot of normalized results	Boxplot of normalized results
`kroA100`
`eil101`
`kroA200`
`lin318`
`pcb442`
`att532`
`rat783`

TSPLIB Instance	ANOVAs
`kroA100`	Numerical values, Graphics
`eil101`	Numerical values, Graphics
`kroA200`	Numerical values, Graphics
`lin318`	Numerical values, Graphics
`pcb442`	Numerical values, Graphics
`att532`	Numerical values, Graphics
`rat783`	Numerical values, Graphics

Treatments with 2-opt local search:

In a second set of experiments, we compare those treatments with the local search factor set to 2-opt local search. The experiments are carried out on 7 TSPLIB instances: pcb442, att532, rat783, u1060, d1655, and pr2392. We use a stopping criterion based on a maximum number of iterations: 10000. The output of all the experiments for each algorithm is available in the following table:

CPUs	TSPLIB Instance	Algorithms
CPUs	TSPLIB Instance	xFCf2	xHCf2	xRWf2	xRf2	xFCi2	xHCi2	xRWi2	xRi2	xPIR2	xSEQ2
4	`pcb442`	236 KB	242 KB	243 KB	243 KB	265 KB	264 Kb	285 KB	272 KB	245 KB	66 KB
	`att532`	334 KB	346 KB	347 KB	336 KB	404 KB	408 KB	414 KB	387 KB	389 KB	103 KB
	`rat783`	430 KB	425 KB	418 KB	407 KB	506 KB	525 KB	515 KB	504 KB	462 KB	123 KB
	`u1060`	663 KB	676 KB	689 KB	641 KB	817 KB	836 KB	810 KB	800 KB	772 KB	213 KB
	`nrw1379`	987 KB	998 KB	955 KB	921 KB	1.15 MB	1.18 MB	1.09 MB	1.13 MB	996 KB	273 KB
	`d1655`	788 KB	773 KB	794 KB	746 KB	926 KB	924 KB	930 KB	924 KB	895 KB	251 KB
	`pr2392`	1.31 MB	1.35 MB	1.33 MB	1.3 MB	1.61 MB	1.64 MB	1.59 MB	1.59 MB	1.5 MB	418 KB

8	`pcb442`	452 KB	480 KB	487 KB	448 KB	535 KB	560 KB	572 KB	543 KB	495 KB	68 KB
	`att532`	664 KB	672 KB	683 KB	655 KB	784 KB	814 KB	828 KB	814 KB	766 KB	105 KB
	`rat783`	820 KB	846 KB	856 KB	826 KB	1022 KB	1.02 MB	1.02 MB	1 MB	930 KB	127 KB
	`u1060`	1.22 MB	1.28 MB	1.29 MB	1.24 MB	1.58 MB	1.6 MB	1.6 MB	1.59 MB	1.49 MB	219 KB
	`nrw1379`	1.83 MB	1.82 MB	1.78 MB	1.76 MB	2.31 MB	2.35 MB	2.26 MB	2.28 MB	1.93 MB	289 KB
	`d1655`	1.41 MB	1.48 MB	1.42 MB	1.39 MB	1.82 MB	1.88 MB	1.84 MB	1.83 MB	1.73 MB	259 KB
	`pr2392`	2.57 MB	2.56 MB	2.57 MB	2.5 MB	3.14 MB	3.27 MB	3.22 MB	3.16 MB	2.98 MB	443 KB

TSPLIB Instance	Normalized results over iterations (click to download eps format)
TSPLIB Instance	Algorithms
`pcb442`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`att532`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`rat783`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`u1060`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`nrw1379`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`d1655`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2
`pr2392`	4FCf2, 8FCf2	4HCf2, 8HCf2	4RWf2, 8RWf2	4Rf2, 8Rf2	4FCi2, 8FCi2	4HCi2, 8HCi2	4RWi2, 8RWi2	4Ri2, 8Ri2

TSPLIB Instance	Boxplot of normalized results	Boxplot of normalized results
`pcb442`
`att532`
`rat783`
`u1060`
`nrw1379`
`d1655`
`pr2392`

TSPLIB Instance	ANOVAs
`pcb442`	Numerical values, Graphics
`att532`	Numerical values, Graphics
`rat783`	Numerical values, Graphics
`u1060`	Numerical values, Graphics
`nrw1379`	Numerical values, Graphics
`d1655`	Numerical values, Graphics
`pr2392`	Numerical values, Graphics

Treatments with 3-opt local search:

In a third set of experiments, we compare those treatments with the local search factor set to 3-opt local search. The experiments are carried out on 7 TSPLIB instances: rat783, u1060, nrw1379, d1655, pr2392, fnl4461, and rl5915. We use a stopping criterion based on a maximum number of iterations: 10000. The output of all the experiments for each algorithm is available in the following table:

CPUs	TSPLIB Instance	Algorithms
CPUs	TSPLIB Instance	xFCf3	xHCf3	xRWf3	xRf3	xFCi3	xHCi3	xRWi3	xRi3	xPIR3	xSEQ3
4	`rat783`	251 KB	255 KB	244 KB	244 KB	288 KB	289 KB	286 KB	287 KB	257 KB	63 KB
	`u1060`	294 KB	313 KB	317 KB	297 KB	374 KB	369 KB	375 KB	374 KB	337 KB	87 KB
	`nrw1379`	538 KB	516 KB	534 KB	495 KB	643 KB	650 KB	644 KB	632 KB	565 KB	152 KB
	`d1655`	363 KB	395 KB	405 KB	381 KB	485 KB	475 KB	506 KB	487 KB	470 KB	126 KB
	`pr2392`	601 KB	632 KB	648 KB	583 KB	755 KB	764 KB	771 KB	757 KB	715 KB	195 KB
	`fnl4461`	1.84 MB	1.78 MB	1.66 MB	1.68 MB	2.07 MB	2.15 MB	1.98 MB	2.01 MB	1.66 MB	481 KB
	`rl5915`	616 KB	667 KB	591 KB	643 KB	649 KB	683 KB	601 KB	630 KB	621 KB	189 KB

8	`rat783`	477 KB	483 KB	474 KB	476 KB	563 KB	570 KB	590 KB	561 KB	514 KB	63 KB
	`u1060`	587 KB	608 KB	614 KB	587 KB	731 KB	749 KB	781 KB	748 KB	669 KB	88 KB
	`nrw1379`	988 KB	1019 KB	1003 KB	985 KB	1.23 MB	1.25 MB	1.24 MB	1.26 MB	1.08 MB	155 KB
	`d1655`	708 KB	745 KB	770 KB	747 KB	936 KB	960 KB	1010 KB	959 KB	940 KB	127 KB
	`pr2392`	1.14 MB	1.11 MB	1.21 MB	1.11 MB	1.47 MB	1.48 MB	1.52 MB	1.48 MB	1.4 MB	200 KB
	`fnl4461`	3.45 MB	3.5 MB	3.22 MB	3.27 MB	4.17 MB	4.29 MB	3.96 MB	4.04 MB	3.33 MB	508 KB
	`rl5915`	1.2 MB	1.26 MB	1.04 MB	1.23 MB	1.23 MB	1.34 MB	1.12 MB	1.31 MB	1.22 MB	206 KB

TSPLIB Instance	Normalized results over iterations (click to download eps format)
TSPLIB Instance	Algorithms
`rat783`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`u1060`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`nrw1379`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`d1655`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`pr2392`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`fnl4461`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3
`rl5915`	4FCf3, 8FCf3	4HCf3, 8HCf3	4RWf3, 8RWf3	4Rf3, 8Rf3	4FCi3, 8FCi3	4HCi3, 8HCi3	4RWi3, 8RWi3	4Ri3, 8Ri3

TSPLIB Instance	Boxplot of normalized results	Boxplot of normalized results
`rat783`
`u1060`
`nrw1379`
`d1655`
`pr2392`
`fnl4461`
`rl5915`

TSPLIB Instance	ANOVAs
`rat783`	Numerical values, Graphics
`u1060`	Numerical values, Graphics
`nrw1379`	Numerical values, Graphics
`d1655`	Numerical values, Graphics
`pr2392`	Numerical values, Graphics
`fnl4461`	Numerical values, Graphics
`rl5915`	Numerical values, Graphics

Scalability test:

In a fourth set of experiments, the most successful policies identified previously are tested with three more levels for the factor number of colonies: 16, 32, and 64, respectively.
From the previous analyses we concluded that the overall better communication strategies are the replace-worst and the unidirectional ring, in both the cases with and without local search component. We found also that the increasing-frequency schedule has beneficial effects that tend to grow with increasing number of colonies. This fourth set of experiments is carried out on 3 TSPLIB instances: nrw1379, pr2392, and rl5915, performing 20 runs of 10000 iterations each. The output of all the experiments for each algorithm is available in the following table:

No local search

CPUs	TSPLIB instance	Algorithms
CPUs	TSPLIB instance	xRi0	xRWi0	xPIR0	xSEQ0
16	`nrw1379`	6.83 MB	5.91 MB	5.83 MB	570 KB
	`pr2392`	5.3 MB	3.25 MB	1.92 MB	825 KB
	`rl5915`	2.09 MB	1.48 MB	1.5 MB	286 KB

32	`nrw1379`	13.62 MB	11.9 MB	12.15 MB	N.A.
	`pr2392`	10.64 MB	5.12 MB	3.88 MB	N.A.
	`rl5915`	4.34 MB	2.99 MB	2.98 MB	N.A.

64	`nrw1379`	27.89 MB	25.63 MB	24.22 MB	N.A.
	`pr2392`	21.9 MB	9.96 MB	7.9 MB	N.A.
	`rl5915`	8.44 MB	6 MB	5.94 MB	N.A.

local search	CPUs	TSPLIB instance
no local search	16	`nrw1379`
		`pr2392`
		`rl5915`

no local search	32	`nrw1379`
		`pr2392`
		`rl5915`

no local search	64	`nrw1379`
		`pr2392`
		`rl5915`

2-opt local search

CPUs	TSPLIB instance	Algorithms
CPUs	TSPLIB instance	xRi2	xRWi2	xPIR2	xSEQ2
16	`nrw1379`	3.15 MB	3.1 MB	2.65 MB	198 KB
	`pr2392`	4.35 MB	4.47 MB	4 MB	306 KB
	`rl5915`	2.95 MB	2.62 MB	2.95 MB	295 KB

32	`nrw1379`	6.36 MB	6.22 MB	5.31 MB	N.A.
	`pr2392`	8.79 MB	9.11 MB	8.1 MB	N.A.
	`rl5915`	5.83 MB	5.57 MB	5.98 MB	N.A.

64	`nrw1379`	12.74 MB	12.71 MB	10.65 MB	N.A.
	`pr2392`	17.74 MB	18.65 MB	16.22 MB	N.A.
	`rl5915`	11.75 MB	11.91 MB	11.82 MB	N.A.

local search	CPUs	TSPLIB instance
2-opt	16	`nrw1379`
		`pr2392`
		`rl5915`

2-opt	32	`nrw1379`
		`pr2392`
		`rl5915`

2-opt	64	`nrw1379`
		`pr2392`
		`rl5915`

3-opt local search

CPUs	TSPLIB instance	Algorithms
CPUs	TSPLIB instance	xRi3	xRWi3	xPIR3	xSEQ3
16	`nrw1379`	1.71 MB	1.74 MB	1.46 MB	108 KB
	`pr2392`	2.05 MB	2.12 MB	1.89 MB	141 KB
	`rl5915`	1.75 MB	1.63 MB	1.62 MB	128 KB

32	`nrw1379`	3.51 MB	3.51 MB	2.94 MB	N.A.
	`pr2392`	4.14 MB	4.34 MB	3.83 MB	N.A.
	`rl5915`	3.59 MB	3.33 MB	3.28 MB	N.A.

64	`nrw1379`	6.98 MB	7.08 MB	5.96 MB	N.A.
	`pr2392`	8.35 MB	8.86 MB	7.66 MB	N.A.
	`rl5915`	7.16 MB	7.02 MB	6.57 MB	N.A.

local search	CPUs	TSPLIB instance
3-opt	16	`nrw1379`
		`pr2392`
		`rl5915`

3-opt	32	`nrw1379`
		`pr2392`
		`rl5915`

3-opt	64	`nrw1379`
		`pr2392`
		`rl5915`

Results of the two-sided pairwise Wilcoxon rank sum test for all the scalability tests can be found clicking here.

Test on unseen instances:

In this last test, we perform some experiments on randomly generated instances to verify our conjecture with unseen instances. We generated several instances of different sizes where cities are distributed uniformly in a square: 30 instances of size 316, 30 instances of size 1000, and 30 instances of size 3162. On each instance, 1 run of 10000 iterations is performed. The results are grouped by instance size and analyzed as if they were multiple runs of a single instance. To assess the statistical significance of the differences in solution costs obtained by the algorithms under analysis, we rely on a two-sided pairwise Wilcoxon rank sum test with p-values adjusted by Holm's method. (To find the optimal value, instances has been solved to the optimum using CONCORDE, and for the few ones for which the optimal solution was not found after 24 hours we took the best solution found by a publicly availabe implementation of the Iterated Lin-Kernigham Helsgaun algorithm)

To contain the computational budget, we limited our test to the treatments 8RWf2 and 8Rf2 versus 8PIR2 and 8SEQ2.

The output of all the experiments for each algorithm is available in the following table:

TSP Instance	Algorithms
`unif316`	8RWf2, 8Rf2, 8PIR2, 8SEQ2
`unif1000`	8RWf2, 8Rf2, 8PIR2, 8SEQ2
`unif3162`	8RWf2, 8Rf2, 8PIR2, 8SEQ2

TSPB Instance	Boxplot of normalized results (click to download eps format)
`unif316`
`unif1000`
`unif3162`

Two-sided pairwise Wilcoxon rank sum test with p-values adjusted by Holm's method

[1] "Instance unif3162 - 2-opt local search"
RWs2 Rs2 SEQ2
0 0 0

[1] "Instance unif1000 - 2-opt local search"
RWs2 Rs2 SEQ2
0.065 0.584 0.032

[1] "Instance unif316 - 2-opt local search"
RWs2 Rs2 SEQ2
0.227 0.227 0.227