Professional Documents
Culture Documents
Pres Vigo Unibo
Pres Vigo Unibo
into state-of-the-art
Vehicle Routing Heuristics
Daniele Vigo
• Research in this field is still at a very early stage but activity is large and
some promising results are coming
– see Benjo,Lodi,Prouvost https://arxiv.org/pdf/1811.06128.pdf
(𝑥" , 𝑦" )
Depot 0
∞ num. of vehicles
with capacity 𝑄
Euclidean distance
(cost)
Undirected and complete
graph
Route
Chapter 5
Gilbert Laporte
Frédéric Semet
i i
“mainchap2000”
2002/9/16
page 3 Chapter 4
i i
i 3 i
I1...I8, S1...S10
I1...I8, S1...S10
Features
Random forest
Random
75.16%
75.16%
forest Random forest
Support-Vector machine
Support-Vector
77.28%
77.28%
machine Support-Vector machine
S1...S10
S1...S10
I1...I8, S1...S10
76.10%
76.10% 77.16%
77.16%
75.16% 77.28%
S1...S10 76.10% 77.16%
acteristics. Each
acteristics. Each pair
pair Algorithm/Features
Algorithm/Features isis executed
executed 50 50 times,
times, with
with
different training
different training and
and test
test sets.
sets. Table
Table 33 details
details the
the average
average results.
results.
We can
We can observe
observe aa very
very low
low evolution
evolution ofof the
the performances
performances with with and
and
without knowledge
without knowledge of of I1...I8.
I1...I8. These
These features
features do do not
not affect
affect the
the qual-
qual-
ity of
ity of the
the estimations.
estimations. As As aa conclusion,
conclusion, features
features I1...I8
I1...I8 are
are useless
useless
for acteristics. Each pair Algorithm/Features is executed 50 times, with
for the
the set
set of
of instances
instances “gap2_large_center_highVariance.csv”.
“gap2_large_center_highVariance.csv”.
Integrating Machine Learning into state-of-the-art Vehicle Routing Heuristics 16
4. different
4. No need
No need forfor machinetraining
machine and
learning ?? How
learning How PCAtest
PCA enablessets.
enables aa Table 3 details the average results.
synthetic view
synthetic view of of the
the solutions
solutions characteristics
characteristics
We can observe a very low evolution of the performances with and
As seen
As seen inin Section
Section 33, , there
there are
are many
many methods
methods of of machine
machine learn-
learn-
ing
ing without
giving some
giving some keys knowledge
keys to understand
to understand each of
each feature
featureI1...I8.
significance.
significance. These
Nev-
Nev- features do not affect the qual-
ertheless, this
ertheless, this section
section will
will show
show that
that aa classical
classical factorial
factorial analysis,
analysis,
ity Principal
namely
namely of the
Principal estimations.
Component
Component Analysis (PCA)
Analysis As
(PCA) isis able
able a conclusion,
to address
to address most
most features I1...I8 are useless
of the
of the issues
issues tackled
tackled by by thethe authors.
authors.
for the set of instances “gap2_large_center_highVariance.csv”. Fig. 3.3. Correlation
Fig. Correlation plot
plot of
of the
the solution
solution metrics.
metrics.
Fig. 2. Importance plot of predictive variables.
Table 3
• they also studied the acteristics. Each pair Algorithm/Features is executed 50 times, with
correlation between S
different training and test sets. Table 3 details the average results.
We can observe a very low evolution of the performances with and
without knowledge of I1...I8. These features do not affect the qual-
features
ity of the estimations. As a conclusion, features I1...I8 are useless
for the set of instances “gap2_large_center_highVariance.csv”.
Component Analysis
namely Principal Component Analysis (PCA) is able to address most spectively 95% of the near-optimal a
of the issues tackled by the authors. mostly intersected.
Fig. 3. Correlation plot of the solution metrics.
By observing which features are
4.1. Preliminary step: correlation analysis mension, we can deduce the meanin
features with correlation larger than
Fig. 3 shows the Pearson correlation coefficient between all features S3 and S10 are anti-correlated. the This different
very simple descrip-
dimensions. Thus, the fi
• Ratio express what is the average improvement of any neighborhood on any solution
• BVND and PVND can identify local optima and thus stop prematurely the VND
local optimum
local optimum
… However:
• we performed a preliminary "In field" testing within a high-
quality ILS
• Current features extraction is O(n2) è the associated
overhead is not compensated by the potential saving of NN-
based VND
Aggregate Bounding
Joint work with L. Accorsi
Can we estimate the value of the Optimal Solution?
• To prematurely stop the search in heuristic algorithms once
we are close enough
• In (heuristic) B&B algorithms to cut unpromising paths?
2500000
Distinct Routes
6000000
2000000
5000000
1500000 4000000
3000000
1000000
2000000
500000
1000000
0 0
0 1 2 3 4 5 6 0 1 2 3 4 5 6
% Gap % Solution Gap
About 50% distinct solutions have gap <= 1%
• Applications:
– Guide a heuristic optimization processes
– Skip low-quality routes in SP-based matheuristics
• A binary classification
– Interesting routes
– Not interesting routes
500
450
400
350
Computing time (min)
300
250
200
150
100
50
0
101 201 301 401 501 601 701 801 901 1001
Number of vertices
450
400
350
Computing time (min)
300
250
200
150
100
50
0
101 201 301 401 501 601 701 801 901 1001
Number of vertices
• Pruning techniques
– Granular Neighborhoods and Selective Vertex Caching
• On standard instances
• X dataset by Uchoa et al. (2017)
ILS-SP - Subramanian,
0.6
Uchoa, and Ochi (2013) 450
0.55
KGLS - Arnold
400
and Sörensen
(2019)
0.5 350
0.45 300
0.4 250
0.35
FILO
200
HGSADC - Vidal
et al. (2012)
0.3 150
0.25 100
FILO (long)
0.2 SISR - ChrisMaens and 50
% gap
% gap
1 0 20 40 60 80 100 120 140
-1 -1
FILO
0.5 -1.5
-2 FILO
FILO (long) FILO (long) -2 FILO
0
0 50 100 150 200 250
-3
-2.5
FILO
(long)
-0.5 -4 -3
t (min) t (min) t (min)
Algorithms
• KGLS, KGLS (long) - Arnold, Gendreau, and Sörensen (2019)
• GVNS - Kytöjoky et al. (2007)
• PSMDA - Zachariadis and Kiranoudis (2010)
Integrating Machine Learning into state-of-the-art Vehicle Routing Heuristics 68
Plugging the idea into FILO
• Shaking performed in ruin-and-recreate fashion
– Select a "seed customer"
– remove a set of customers belonging to routes "close" to the seed
• Standard: shaking seed is a randomly selected customer
• Guided: bias the seed selection towards customers belonging
to low-quality routes
Standard: 0.36%
Guided: 0.34%
Both in 2.12 minutes!
During the algorithm we do
not recompute features if
not needed
Iterations
Integrating Machine Learning into state-of-the-art Vehicle Routing Heuristics 70
Preliminary Experiments
(on the 100 largest instances from the Extended Benchmark of Uchoa et al. 2017)
% gap wrt min found in our runs
BKS values not available
daniele.vigo@unibo.it
www.unibo.it