Professional Documents
Culture Documents
On The Value of Optimal Myopic Solutions For Dynamic Routing and Scheduling Problems in The Presence of User Noncompliance
On The Value of Optimal Myopic Solutions For Dynamic Routing and Scheduling Problems in The Presence of User Noncompliance
Transportation Science
Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.
The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.
© 2000 INFORMS
INFORMS is the largest professional society in the world for professionals in the fields of operations research, management
science, and analytics.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
On the Value of Optimal Myopic Solutions
for Dynamic Routing and Scheduling
Problems in the Presence of
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
User Noncompliance
WARREN B. POWELL, MICHAEL T. TOWNS, AND ARUN MARAR
Department of Civil Engineering and Operations Research, Princeton University, Princeton, New Jersey 08544
The most common approach for modeling and solving routing and scheduling problems in a
dynamic setting is to solve, as close to optimal as possible, a series of deterministic, myopic
models. The argument is most often made that, if the data changes, then we should simply
reoptimize. We use the setting of the load matching problem that arises in truckload trucking
to compare the value of optimal myopic solutions versus varying degrees of greedy, suboptimal
myopic solutions in the presence of three forms of uncertainty: customer demands, travel times,
and, of particular interest, user noncompliance. A simulation environment is used to test
different dispatching strategies under varying levels of system dynamism. An important issue
we consider is that of user noncompliance, which is the effect of optimizing when users do not
adopt all of the recommendations of the model. Our results show that (myopic) optimal
solutions only slightly outperform greedy solutions under relatively high levels of uncertainty,
and that a particular suboptimal solution actually outperforms optimal solutions under a wide
range of conditions.
ing the stochastic version somewhat easier to ana- performance by modeling randomness in travel
lyze. One problem in this class looks at managing times, which creates uncertainties around the ar-
fleets of vehicles over time, and the closely related rival time of a driver at the destination. And third,
problem of assigning drivers to full-load movements. we introduce and investigate the general issue of
One of the first deterministic models of a fleet man- user noncompliance. Because no model is perfect,
agement problem was posed by WHITE (1972), which users routinely override the solutions of driver as-
was followed by a series of papers suggesting and signment models. As a result, an additional level of
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
solving different approximations of a multistage, randomness arises in the actual use of the recom-
stochastic dynamic program (JORDAN and TURN- mendations of these models. In practice, several
QUIST, 1983; POWELL, 1986, 1987, 1988; and companies report that average usage of these mod-
FRANTZESKAKIS and POWELL, 1990). Recently, els is typically below 60%, and good performance is
CHEUNG and POWELL (1996) showed that, for this considered around 70%.
problem class, explicit stochastic models can signif- We believe that the issue of user noncompliance is
icantly outperform deterministic models in rolling fundamental to all models, static and dynamic, and
horizon experiments. Despite this research, most that it raises the question of whether so-called opti-
models in practice are basically deterministic and mal algorithms are indeed optimal. For example, if
myopic. All of these papers were oriented toward the real-world schedules do not use 100% of the solution
management of large fleets of vehicles. produced by a crew scheduling model (in a static,
Recent research (SCHRIJVER, 1993; Powell, 1996) planning environment), was the original solution
has focussed on the problem of matching individual even optimal? In this paper, we show that, in the
drivers to full-truckload movements, a problem presence of user noncompliance, optimal solutions
called the load matching problem in truckload truck- will be outperformed, over time, by algorithms that
ing. This problem class is much closer to vehicle
are more local in nature. We feel this has broad
routing problems in its basic characteristics, with
implications for the design of algorithms in other
the important exception that there is no in-vehicle
settings (vehicle routing, crew scheduling), which
consolidation. In addition, the loads are typically
seek to find optimal solutions to sequences of deter-
very long in duration, often requiring one or more
ministic, myopic problems.
days to complete. The advantage of this problem
To address these issues in a systematic fashion,
structure is that a single instance of the problem is
we have developed a simulation architecture to sim-
a network assignment problem, considering the
ulate truckload dispatching operations. This archi-
matching of drivers to loads. This problem is easy to
solve to optimality, and easy to update with real- tecture consists of three modules: a network optimi-
time information. Schrijver (1993) used a greedy zation model for assigning drivers to loads, a fleet
heuristic for matching drivers to loads in a study of simulator/dispatcher module, and a demand gener-
the value of real-time communications. The research ation module. The network optimization model is a
in Powell (1996) compared a myopic model to an production system that is in use at a major motor
approximation of the stochastic, dynamic problem, carrier. The modules communicate with each other
and showed that the stochastic, dynamic model out- through files, closely simulating the actual flow of
performed the myopic model in rolling horizon ex- data at a carrier (where the optimization model typ-
periments. ically runs on a separate workstation, communicat-
In this paper, we consider only myopic models of ing with the carrier’s primary management informa-
the load matching problem for truckload trucking. tion system through a database). Our experiments
Following common engineering practice, our plan is used the actual data from a large motor carrier,
to solve a given instance of the problem to optimal- scaled down to simulate a smaller fleet (of 400
ity, and then reoptimize as new information be- trucks), a step that accelerated the running of each
comes known. This is the approach that is widely simulation (which need to run much faster than real
used in commercial applications. Our interest is in time).
comparing optimal solutions of sequences of deter- The primary contributions of this paper are as
ministic, myopic models, obtained by solving the follows.
network assignment problem, to various approxima-
tions including greedy solutions that simply assign ● We show that different forms of system stochas-
drivers to the best load. We study these algorithms ticity have the effect of reducing the value of
under three forms of system stochasticity. First, we optimality when solving sequences of subprob-
consider the random process of shippers calling in lems in dynamic settings, and we show that a
loads. Second, we investigate the effects of network solution that is suboptimal for a particular prob-
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 69
lem instance actually provides higher solution rigorous standards. In the context of dynamic prob-
quality over an extended simulation. lems, it is necessary to solve a sequence of problems
● We introduce and study for the first time the over time as new information arises, thereby simu-
issue of user noncompliance, and show that this lating the real process.
form of uncertainty can, in particular, reduce In this setting, we must explicitly acknowledge
the value of optimal solutions. Noncompliance is three models: the sequences of subproblems that are
an easy to measure barometer of the quality of solved over time (each of which is a model with its
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
an optimization model as perceived by the user. own objective function), the larger simulation (with
Even relatively low levels of user noncompliance its own cost stream), and the even larger problem in
(say, 30% modification of model recommenda- the real world, which must reflect data errors and
tions) indicate that partially greedy solutions other biases.
can outperform optimal solutions. At the same Let ⫽ { 0 , 1 , . . . , t , . . . , T } be a sequence
time, similar levels of user noncompliance sig- of random outcomes over a horizon ᐀ ⫽ {0, 1, . . .,
nificantly reduced the gap between an optimal T}, where 僆 ⍀. We would then define a probabil-
solution and a greedy solution. ity space (⍀, Ᏺ, ᏼ) where Ᏺ is the -algebra on ⍀,
● We experimentally quantify the effects of differ- and ᏼ is a probability measure defined over Ᏺ. Let
ent forms of system uncertainty, providing a Ᏺ t represent the sequence of sub--algebras contain-
measure of the value of reducing these forms of ing the information known up to time t, where Ᏺ t 債
uncertainty on actual system performance. Ᏺ t⫹1 . Let f t (x t 兩Ᏺ t ) be the objective function at time
t given an information set Ᏺ t at time t. The super-
The paper is organized as follows. Section 1 de- script captures both the structure of the function f t
scribes different perspectives of optimality in the and any parameters used to create the function (in-
context of dynamic routing and scheduling. Section cluding, for example, planning horizons, discount
2 presents an overview of dispatching operations in factors, and any cost factors introduced so that solv-
the truckload motor carrier industry, which we use ing f t at time t will produce a solution with good
as our problem setting. Section 3 presents the opti- long-run behaviors). We can think of f t , 僆 ⌸ as
mization model that is used to solve the problem. the set of all possible functions (or policies, if you
Next, Section 4 discusses some of the issues that wish) that we could use at time t. Now let x t (Ᏺ t ) ⫽
arise in global versus local optimization. These is- arg minx t僆X f t (x t 兩Ᏺ t ) be a sequence of Ᏺ t -adapted
sues are central to any optimization problem where decisions (meaning that the function x can only use
the results will be implemented over time, during information in Ᏺ t ). (Note: The concept of Ᏺ t -adapted
which the status of the system may change. Then, decisions is similar, but not identical, to the concept
Section 5 gives an overview of our simulation archi- of nonanticipativity used in the stochastic program-
tecture, which takes advantage of a production load ming community, which is expressed as a constraint
matching system that is in use at a major motor requiring that one decision be chosen in time period
carrier. This is followed in Section 6 by a description 0 for all possible outcomes in the future. Instead of
of the specific modules. Section 7 describes the ex- saying that a decision is Ᏺ t -adapted, some authors
perimental design, and the results are reported in prefer to say that the decision is Ᏺ t -measurable.)
Section 8. Section 9 summarizes the major conclu- The decision function x t (Ᏺ t ) returns a set of deci-
sions of the paper. sions that are to be implemented at time t given the
events Ᏺ t . We write x ( ) to represent the decision
made for a particular realization , with the implicit
1. PERSPECTIVES ON OPTIMALITY
understanding that the function x is Ᏺ t -measur-
TO PLACE THIS PAPER in the proper context within able. In determining this vector, we need to consider
the research community, it is important to briefly both here-and-now costs at time t, and possibly ap-
discuss the meaning of the word “optimal.” It is proximate estimates of costs that may be incurred in
common engineering practice to formulate a mathe- the future, but which are subject to solving x t⬘ for t⬘
matical model, typically with numerous simplifica- ⬎ t. For example, we may decide that a vehicle
tions to ensure tractability, and then to seek an should go from customer i 1 to customer i 2 right now;
optimal solution to this model. We all accept that the this is because we have formed a tour that consists
word optimal is used in the narrow sense of solving of (i 1 , i 2 , i 3 , i 4 ). The objective function f t would be
a specific mathematical representation, without formulated to capture these future decisions, and
making any claims regarding the impact on the real may also include approximations of what would hap-
problem, because these impacts fall outside of what pen even farther in the future (as was done in Pow-
can be measured experimentally using scientifically ell, 1996). However, only the vector x t is actually
70 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
implemented. Let c t represent the cost of these de- represents a rejection. In the event of a rejection, it
cisions. Then, the costs incurred at time t would be would be possibly to exclude a solution and reopti-
c t x t , and the total costs over the entire simulation mize. In our work, we followed industry practice in
would be F(x, ) ⫽ 兺 t僆᐀ c t x t ( ), where x ⫽ this setting and used the dual variables from the
(x 0 , x 1 , . . . , x T
). optimal solution to suggest alternatives.
We are now ready to talk about different types of Because the physical process representing a user
optimality. Our goal as modelers is to solve accepting a decision is extremely complex, it is cus-
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
再冘 冎
tomary to model a process such as this as a random
max F共 x兲 ⫽ E c tx t共 兲 , variable. This modeling approach is comparable to
x僆X t僆᐀
adding a noise term at the end of a function to
capture all the nonmeasurable effects. We feel that
where X is the space of all feasible Ᏺ-adapted con- this representation of the user compliance issue rea-
trols x t , ᐀ is the planning horizon, and the expec- sonably captures the effects of the model adoption
tation operator is over all outcomes in ⍀. In our process, and offers the benefit of being simple and
hierarchy of optimization problems, this is our experimentally analyzable. Given that we cannot
larger problem. In practice, we solve F(x) using a observe the real process of why users reject model
series of subproblems f t (x t ) in an effort to find a rejections (and we have tried very hard to do just
solution that maximizes F(x). F(x), in turn, is noth- this), the next best alternative, and far superior
ing more than a proxy for the even larger problem in than ignoring the issue entirely, is to model the
the real world, which must also capture issues that acceptance process as a random variable.
we are unable to put into our model. In the presence of user noncompliance, as well as
In the arena of routing and scheduling, it is very other forms of uncertainty, it is reasonable to ask
common to use myopic solutions that are then up- the question: what is the value of an optimal solu-
dated as new information arrives. Although this is tion of the function f t (x t )? Our work focuses on a
the simplest possible option for most classes of rout- problem where finding the optimal solution, given
ing and scheduling problems, it remains a combina- conventional modeling practice, is actually quite
torically difficult problem. Thus, even finding the easy, allowing us to compare solutions that solve
optimal solution to this relatively simple problem is f t (x t ) optimally to those that solve it only approxi-
a nontrivial task. Because f t (x t ) represents a single mately. We consider only myopic models, in part
instance of a classical optimization problem, it is because these are the models that are most widely
understandable that a considerable amount of effort used in practice, and more importantly, because we
will be devoted toward optimizing this problem, re- do not want to get into the tremendously rich and
alizing that it is simply a single snapshot of a larger complex arena of developing approximations of fu-
problem. Of course, even the larger problem, repre- ture activities. Given the emerging research into the
sented by F(x), is little but an approximation of an application of vehicle routing models in dynamic
even larger problem that encompasses information settings, which focuses primarily on the heuristic
that is not captured within the model. Because dy- solution of myopic models, we feel that our choice of
namic routing models must run in production, this research methodology is appropriate and produces
even larger problem, which must reflect the insights interesting results. Most significantly, we feel that
and biases of real users, must begin to be addressed the issue should be considered when testing differ-
within the context of our research. ent search algorithms in the context of harder prob-
In this paper, we model the effect of the even lems such as dynamic vehicle routing. For example,
larger problem through a process we call user com- in the presence of user noncompliance, time-con-
pliance. In this process, we present a recommenda- suming search heuristics may prove to add little or
tion to the user, at which point the recommendation no value in a dynamic environment.
is accepted or rejected. We make the assumption We do not claim that our simple model of user
that the cause for rejecting a recommendation re- compliance precisely captures the complex physics
flects information outside the domain of the model, of human thought. In support of our approach, we
and, hence, we model it as an exogenous random draw on the long history of mathematical models
variable. Specifically, for a particular decision, say that use random variables to capture highly complex
x ij , we introduce a random upper bound u ij ( ) ⫽ 1 processes. We do claim, however, that the estimates
with probability p d , and 0 with probability (1 ⫺ p d ). that we obtain of the value of optimal solutions in a
Only when we choose x ij ⫽ 1 do we then learn the dynamic environment are more accurate using our
value of u ij ( ) where u ij ( ) ⫽ 1 represents the user model of user compliance than they would be if we
accepting the recommendation, whereas u ij ( ) ⫽ 0 ignored the issue altogether. We further claim that
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 71
subject to
冘x
d僆Ᏸt
a
dlt ⫹ y lt ⫽ 1 l 僆 ᏸt (2)
冘x a
dlt ⫹ x dt
h
⫽1 d 僆 Ᏸt (3)
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
l僆ᏸt
a
x dlt h
, y lt , x dt ⭓0 l 僆 ᏸ, d 僆 Ᏸt .
再
A successful carrier must minimize empty miles,
1 if driver d is assigned to cover maximize on-time service for loads, maintain a high
a
x dlt ⫽ load l at time t level of driver satisfaction, and maximize equipment
0 otherwise, productivity. In the truckload arena, where drivers
y lt ⫽ 再 10 ifotherwise,
load l is refused at time t often spend several weeks away from home, driver
satisfaction is measured by our ability to get drivers
back to their home by a certain time. Other issues
⫽ 再 0 otherwise.
h 1 if driver d is not assigned to any load include whether a driver has the right training to
x dt
handle the needs of a particular load; whether his
The objective function is then given by, for a given truck is too heavy to handle a particular load; and
point in time t, whether he is considered sufficiently reliable to
冉冘 冘 冘 c x ⫹冘c y 冊
meet the needs of an account.
min ft 共xt 兲 ⫽ a
cdlt a
xdlt ⫹ h h r We denote all these factors using a multidimen-
dt dt lt lt
x a, x h, y d僆Ᏸ t l僆ᏸ t d僆Ᏸ t l僆ᏸ sional cost function that is then collapsed into a
(1) single utility function. Suppressing the time param-
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 73
Fig. 5. Sequence of multimodular simulator actions occurring when a driver becomes available for assignment.
relevant driver, equipment, and load information, as the appropriate cost and assignment information
such as future driver time of availability, and load is given to the simulator module. The assignment
availability status. The assignment selection, infor- decision-making logic is also quite flexible in that
mation regarding any currently uncovered (rejected) the user can specify exactly how and what costing
loads, and any other desired information is then information is to be used. This, for example, allows
written to a log file for later analysis by the user (D). the user to specify that the simulated dispatcher
Driver and load status update information are writ- exactly implement the solution of the optimization
ten to flat files for input by the optimizer module (E), algorithm being used and then compare the result-
which then supplies a reoptimization of the network. ing solution to that obtained by using some arbi-
This driver assignment process is completely gen- trarily defined costing function. Such a function
eral in that it can be used with any optimization might be used to represent a human dispatcher’s
algorithm, or even with no algorithm at all, so long internal utility function.
76 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
6. SIMULATOR MODULES AND PROCEDURES these functions are presented in Figure 6. The fol-
IN THIS SECTION, we describe some relevant details lowing notation is used in this figure.
specific to the three modules that comprise the fleet t st ⫽the simulated start time (in seconds) of the
simulation architecture: the optimizer, the simula- simulation run
tor module (or interchangeably, the fleet simulator/ t end ⫽the simulated end time (in seconds) of the sim-
dispatcher module), and the demand generator. We ulation run
also discuss the use of the dispatcher selection prob-
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
6.2 Demand Generator c dlt ( ␣ ) ⫽the adjusted reduced cost for assigning
driver d to load l at time t
The demand generator uses an actual historical c dlt ⫽the arc cost for assigning driver d to load l
demand file, from which it draws a random sample. at time t
As the simulation progresses, each load is read in, dt ⫽the dual variable for the node for driver d at
and a random number is drawn to determine time t
whether it is to be included in this run. If it is, then lt ⫽the dual variable for the node for load l at
the generator waits until it is the right time, com- time t
paring the actual call-in time of the load to the ␣ ⫽the dual variable discount factor.
simulated wall-clock time. When it is the right time,
the generator sends the load over to the dispatch Potential assignments for a given driver are ranked
simulator. for selection purposes from lowest to highest cost
The use of a real dataset means that we have an using c dlt ( ␣ ). For full global optimization, we set ␣ ⫽
accurate picture of actual call-in processes, and real 1.0, which gives us the equivalent network simplex
data for the attributes of the loads, including pickup reduced cost. For full greedy optimization, we set
and delivery windows. By using only a sample of ␣ ⫽ 0.0, which gives a ranking according to arc costs.
these loads, we were able to run repeated simula- Values of ␣ in the interval (0, 1) are used to repre-
tions using different random number seeds, allow- sent intermediate strategies.
ing us to obtain repeated observations of a particu-
lar optimization process. In our work, we used a 30% 6.4 The Dispatch Function
sample of the load, reflecting the fact that our fleet
As discussed above, the presence of user noncom-
size was approximately 30% of the size of the actual
pliance with the optimization system’s recommen-
fleet used by the carrier. When we run repeated
dations is a major factor in the implementation of
simulations with the same set of parameters, then
optimization decision-support systems. To simulate
we are simply choosing different samples of loads.
this process, we define
6.3 Fleet Simulator/Dispatcher
u dlt共w兲
The simulator module performs two main func-
tions. The first is the actual simulation of a truck-
load fleet, and the second is the simulation of a
human dispatching operation. The main steps in
⫽ 再 1 if the user accepts the recommendation
x dlt ⫽ 1 at time t
0 otherwise
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 77
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
0 otherwise.
l⬘僆ᏸt
(6)
For a given realization w, we can determine which Equation 6, then, defines a class of dispatch func-
load a driver will be assigned to, allowing us to tions parameterized by ␣. If ␣ ⫽ 1, then we obtain
define the dispatch function for a driver d 僆 Ᏸ t by the optimal solution of f t ; if ␣ ⫽ 0, then we obtain a
78 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
再冘 冘 冘 冎
variable window,
tend w e ⫽the number of standard errors ( 公t eta) after the
F共 ␣ , p , 兲 ⫽ E
d pd
⫺c dlt共 兲 x dlt共 ␣ , w兲 , expected time to include in the random variable
t⫽tst d僆Ᏸta l僆ᏸt window,
(7)  ⫽noise intensity parameter ranging from 0.0 to
1.0. If  ⫽ 0, then there is no noise and we have
where we choose now to maximize profits over a
deterministic travel times.
simulation (instead of minimizing costs at a given
point in time). The expectation is over all random During the simulated dispatcher’s costing of driver–
events, including the likelihood of user acceptance, load pairs, real travel times for transit between load
given by the parameter p d (hence the explicit repre- origin and destination are obtained by the above
sentation of p d in the expectation). We approximate formulation and stored in the simulator module’s
the expectation by running several simulations us- data structures. When the simulator module selects
ing different random number seeds. Each run of the a pairing for assignment, it sends an updated ETA
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 79
for the driver to the optimizer using t eta. Thus, the obtain estimates of the cost of each source of uncer-
optimizer is planning future assignments using t eta, tainty.
whereas the simulator maintains the actual arrival We measure the value of global over local solu-
time t actual. tions through the dual utilization factor ␣. Thus, our
In the event that t actual ⬍ t eta, the driver will experiments will focus on determining the optimal
arrive at the load destination and become available value of ␣ under different degrees and sources of
for a new assignment earlier than expected. At this randomness.
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
time, the simulated dispatcher queries the optimizer The experimental data used for our runs was ob-
module to obtain costing information for feasible tained from a major truckload motor carrier. A sim-
loads for the given driver. This costing information ulated fleet consisting of 400 drivers was con-
will be based on t eta. The simulated dispatcher then structed by randomly picking from a data set
recosts potential assignment arcs from the optimizer consisting of the carrier’s full staff of about 1500
based on the early arrival, and makes an assign- drivers. An initial set of 100 loads was constructed
ment selection using the procedure described in Sec- for input into the system. Subsequent to these initial
tion 6.3. 100 loads, the demand generator was set to provide
Alternatively, if t actual ⬎ t eta, the driver will arrive approximately 225 loads per day of simulation time.
at the load destination and be available for a new Both the initial and subsequently generated load
assignment later than expected. When the simula- sets were constructed by sampling from a set of
tion clock reaches and then passes time t eta, the actual historical service demands called into the
optimizer continually resets t eta to be the current carrier over a one-week period. Both the driver and
time. load sets were distributed throughout the continen-
Randomness in travel times can have the effect of tal U.S. Each run simulated seven days of actual
encouraging greedy solutions. Consider the situa- dispatching, run at a speed of 60 times real time,
tion where drivers 1 and 2 are available now, and resulting in a run time of 2.8 hours per run. For each
driver 3 will become available later in the afternoon. combination of parameter values simulated, with
Driver 1 is 40 miles away from a load, driver 2 is 90 the exception of the advance demand booking exper-
miles away, and driver 3, when he arrives, would iment discussed in Section 8.1, we conducted five
only be 20 miles away. Assume that driver 3’s esti- simulation runs. For the advance demand booking
mated time of arrival is 3 P.M., and the load must be experiment, 10 runs were conducted for each param-
picked up before 5 P.M. A network model would eter value studied. This was done to clearly delin-
choose driver 3 to cover the load based on cost and eate solution behavior for this special case.
service considerations. In this case, we might assign For each run, we statistically estimated a second-
driver 1 to another load. Later, we might discover order polynomial relating total simulation profits
that the ETA for driver 3 is inaccurate, and that he (given by Eq. 7) to the dual discount factor ␣. We
will be delayed several hours. In this case, we have retained this second-order specification throughout
no choice but to assign driver 2 to the load, at a much the analysis.
higher cost. Our experimental design is based on running sim-
ulations with different values of ␣ and p d , along
with other sources of randomness, and drawing con-
7. EXPERIMENTAL DESIGN
clusions regarding the appropriate decision function
(as specified by the parameter ␣ in Eq. 6). Thus, we
OUR PRIMARY INTEREST is in determining the value need to think about what it means to compare val-
of optimal (myopic) solutions over local solutions ues of F( ␣ , p d , ) in Eq. 7 for different values of p d
under different sources and degrees of uncertainty. and ␣. Three issues need to be discussed. First is the
We measure the degree to which a decision is global role of the utility parameter vector , and what it
versus local through the dual discount parameter ␣. means to optimize over a mixture of soft and hard
We consider three sources of randomness: a) ran- costs. Second is the issue of comparing model runs
domness in the dynamic arrival of loads to the sys- for different values of the user acceptance parame-
tem; b) randomness in the travel times, captured as ter p d . Finally, we discuss the important issue of
errors in our estimates of driver ETA’s; and c) ran- comparing model runs for different values of ␣.
domness in the implementation of recommendations The utility vector controls the relative impor-
by the user, which we refer to as the user noncom- tance of different dimensions of the cost function. As
pliance problem. With each source of randomness, we discussed above, our costs include both quantifi-
we focus on finding the value of ␣ that minimizes the able transportation costs, and other soft costs re-
simulation cost function. In the process, we also lated to customer service and driver management.
80 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
Fig. 7. Objective function versus dispatcher dual discount factor with advance demand booking and perfect user compliance
(p d ⫽ 1.0).
Our research addresses the challenge of finding a using different values of the user acceptance proba-
decision function x that maximizes F( ␣ , p d , ) bility, p d . This question raises a very subtle issue
within the class of decision functions defined by Eq. with regard to interpreting the objective function
6, parameterized by ␣. At this point, we take as F( ␣ , p d , ). Although we have a well-defined objec-
given the utility function parameterized by the vec- tive function, the issue of user compliance brings
tor . Thus, a dollar of lateness counts the same as a into focus the difference between the objective func-
dollar from traveling empty. In the context of our tion we are maximizing, and the one the users are
research, this is the correct way to run the simula- maximizing. Assume that we wanted to compare
tion. The choice of the vector is set by management F( ␣ , p 1d , ) to F( ␣ , p 2d , ) where p 1d ⬎ p 2d . We might
(in our case, this is the vector running in production wish to estimate the increase in profits if the user
at a major motor carrier). It is not our job to question acceptance level increased. In fact, this comparison
whether we have the correct value of , which has is not meaningful, and we cannot use our results to
been chosen to balance the importance of issues such estimate the value of higher user compliance. To see
as on-time service and routing drivers through their this, consider a run with p 1d ⫽ 1 versus p 2d ⫽ 0.7. At
home with operating costs. Our simulations cover a p 1d ⫽ 1, we are accepting all the recommendations of
week of dispatching; tradeoffs of empty miles and the model. In general, these recommendations are
service, in contrast, are learned over years of watch- the highest profit options and will produce the high-
ing customers switch to competitors because of poor est profits over a simulation. If p 2d ⫽ 0.7, then we
service. Our challenge, then, is to maximize are, in effect, saying that 30% of the time, a top-
F( ␣ , p d , ) for a given choice of . For this reason, ranked recommendation is not acceptable. Thus, the
we have made no effort to run simulations with supposedly high profit loads that appear at the top
different values of . of the list for a driver are, in fact, low profit (mea-
The second issue concerns comparing model runs sured in terms of our utility function) for reasons
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 81
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
Fig. 8. Objective function versus dispatcher dual discount factor, deterministic and stochastic travel times (p d ⫽ 1.0).
Fig. 9. Objective function versus dispatcher dual discount factor, various probabilities of selection.
experiments, we performed 10 samples for each in the form of random travel times between load
value of ␣ that was tested. origin and destination locations. The random travel
The results are shown in Figure 7, which com- times were implemented as described in Section 6.6.
pares total profits versus ␣ for two sets of data: the The objective function values for these runs were
first assumes all the daily bookings are known at the compared to those obtained using only expected
beginning of the day, and the second retains the travel times.
original booking profile of loads. It is not surprising For these experimental runs, we set p d ⫽ 1.0. The
that we get higher profits when all the demands are noise intensity parameter, , was set to 1.0. The
known at the beginning of the day. Most significant time window start and end parameters, w s and w e ,
is that, when demands are known in advance, the respectively, were both set to 1.5. This means that a
curve steadily increases with ␣, leveling off for val- 600-mile move that takes an average of 16 hours
ues of ␣ over approximately 0.60. For this run, we might require anything from 10 to as much as 22
could not reject the hypothesis that the optimum hours. While drivers in a real-world situation tend
dual discount factor was equal to 1.0, as we would to run late more often than early, we wanted the
expect. When we simulate dynamic booking (the expectation of the random travel time to equal the
lower curve), performance noticeably dropped for expected travel time. This was done to avoid influ-
values of ␣ over 0.80, with an apparent optimum encing the objective function value by changing the
close to 0.75. This is our first evidence that, in the relative number of driver–load assignments made
presence of random customer booking, the global during a simulation. Had the window been set to
optimum solution does not give the best overall re- always make travel times at least as long as the
sults. expected time, for example, the fact that drivers
would always be at best on-time and usually late
8.2 Effect of Network Stochasticity would mean that fewer assignments could be made
The second set of experiments we performed ex- in the course of a simulation.
amined the effect of stochasticity in fleet operations The results are shown in Figure 8. These show
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 83
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
Fig. 10. Objective function versus dispatcher selection probability, various dual discounts.
that randomness in travel times has a relatively globally optimal solutions appear to slightly under-
modest (but significant to a motor carrier) impact on perform solutions using ␣ ⫽ 0.75.
overall profits. We note that the impact of ␣ is di- A different perspective on the results was ob-
minished in the presence of random travel times. tained by plotting total profits as a function of dif-
More importantly for our research, the shape of the ferent values of p d , using ␣ ⫽ 0, 0.75, and 1.0, shown
curve as a function of ␣ appears not to have an in Figure 10. The results show that, as the level of
impact on the optimum dual discount factor. Of user compliance drops, the value of a globally opti-
course, both runs were done in the process of dy- mal solution over a greedy solution drops dramati-
namic booking, so we are, in effect, observing that cally. In the truckload motor carrier industry, it is
the randomness in customer bookings is more im- not unusual to see values of p d ⫽ 0.6 or even lower.
portant than randomness in travel times. The opti- A value of p d ⫽ 0.7 is considered very high. These
mum value of ␣ still appears to be around 0.75. results show that, for these levels of user compli-
ance, greedy solutions work almost as well as glo-
8.3 Effect of User Noncompliance bally optimal solutions. Furthermore, using ␣ ⫽ 0.75
gives the best results over all levels of user compli-
Our last set of experiments looked at the effect of
ance, although, for the lowest levels, it really does
user noncompliance on system performance. We re-
not matter what you do. Reassuringly, if user com-
port these results in two ways. First, Figure 9 sum-
pliance is high (over 90%) then the value of global
marizes system profits as a function of ␣ for different
network information (either with ␣ ⫽ 1.0 or 0.75) is
values of p d . Our first conclusion from these runs is
quite high, and demonstrates the usefulness of op-
that the optimum value of ␣ is close to 0.75 for all
timization models.
three values of p d . Our second and more significant
conclusion is that, for the lowest levels of user com-
pliance, p d ⫽ 0.4, the dual discount has relatively
9. CONCLUSIONS
little effect on overall profits. Said differently, the
value in listing loads sorted in the order from best to THE EXPERIMENTS in this paper investigate the im-
worst (measured with respect to c dlt ( ␣ )) is signifi- pact of different forms of uncertainty on the value of
cantly reduced. As user compliance rises, the value optimal (myopic) solutions in a particular class of
of a proper listing of loads increases, although the routing and scheduling problems. The results show
84 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
that uncertainty does reduce the value of globally D. J. BERTSIMAS AND G. VAN RYZIN, “A Stochastic and
optimal solutions, bringing into question whether Dynamic Vehicle Routing Problem in the Euclidean
rigorously optimal solutions are useful in a dynamic Plane,” Opns. Res. 39, 601– 615 (1991).
D. J. BERTSIMAS, P. CHERVI, AND M. PETERSON, “Compu-
setting.
tational Approaches to Stochastic Vehicle Routing,”
Our research did not compare myopic and dy- Transp. Sci. 29, 342–352 (1995).
namic models, but rather looked only at models that R. CHEUNG AND W. B. POWELL, “An Algorithm for Multi-
consider only information that is known (or reason- stage Dynamic Networks with Random Arc Capacities,
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.
ably well known, in the case of random travel times) with an Application to Dynamic Fleet Management,”
at a given point in time. This said, even a myopic Opns. Res. 44, 951–963 (1996).
model plans for decisions in the future, as long as it J. DESROSIERS, M. SOLOMON, AND F. SOUMIS, “Time Con-
is based on known information. In our case, we strained Routing and Scheduling,” in Handbook in Op-
erations Research and Management Science, C.
would consider the global optimization of all drivers
Monma, T. Magnanti, and M. Ball (eds.), Volume on
to all loads, including drivers that would not arrive Networks, 35–139, North Holland, Amsterdam, 1995.
until later in the day or the next day. Our dual M. DROR, G. LAPORTE, AND P. TRUDEAU, “Vehicle Routing
discount, then, implicitly was putting a weight of 1.0 with Stochastic Demands: Properties and Solution
on the decision we were about to make, and a re- Frameworks,” Transp. Sci. 23, 166 –176 (1989).
duced weight on decisions that we might make at a M. FISHER, “Vehicle Routing,” in Handbook in Operations
later point in time (although it might only be a few Research and Management Science, C. Monma, T.
seconds later). Magnanti and M. Ball (eds.), Volume on Networks,
1–33, North Holland, Amsterdam, 1995.
We are unaware of any other research that has
L. FRANTZESKAKIS AND W. POWELL, “A Successive Linear
raised the issue of user compliance, and measured Approximation Procedure for Stochastic Dynamic Ve-
its impact on the choice of algorithm. Even static hicle Allocation Problems,” Transp. Sci. 24, 40 –57
models are subject to user editing and modification (1990).
prior to implementation. Given such postoptimality R. GRAHAM, “Bounds for Certain Multiprocessor Anoma-
tampering, it is clear that the original solution is no lies,” Bell Syst. Tech. J. 45, 1563–1581 (1966).
longer optimal, because it assumes that all the de- D. GUSFIELD AND R. IRVING, The Stable Marriage Algo-
cisions in the solution will be implemented. We rithm: Structure and Algorithms, MIT Press, Boston,
MA, 1989.
would argue that such considerations should be
W. JORDAN AND M. A. TURNQUIST, “A Stochastic Dynamic
built into the design and analysis of algorithms for Network Model for Railroad Car Distribution,” Transp.
all classes of routing and scheduling problems. The Sci. 17, 123–145 (1983).
result may be problems that are actually easy to G. LAPORTE AND F. LOUVEAUX, “Formulations and Bounds
solve, producing both faster run times and more for the Stochastic Capacitated Vehicle Routing Prob-
robust solutions. lem with Uncertain Supplies,“ in Economic Decision-
Making: Games, Econometrics and Optimization, J.
Gabzewicz, J. Richard, and L. Wolsey (eds.), North
Holland, Amsterdam, 1990.
ACKNOWLEDGMENT
M. PINEDO, Scheduling: Theory, Algorithm, and System,
THIS RESEARCH WAS SUPPORTED in part by grant Prentice Hall, Englewood Cliffs, NJ, 1995.
AFOSR-F49620-93-1-0098 from the Air Force Office W. B. POWELL, “A Stochastic Model of the Dynamic Vehi-
cle Allocation Problem,” Transp. Sci. 20, 117–129
of Scientific Research. The authors would also like to
(1986).
gratefully acknowledge the careful reading by an
W. B. POWELL, “An Operational Planning Model for the
anonymous referee who helped to highlight impor- Dynamic Vehicle Allocation Problem with Uncertain
tant issues in this study. Demands,” Transp. Res. 21B, 217–232 (1987).
W. B. POWELL, “A Comparative Review of Alternative
Algorithms for the Dynamic Vehicle Allocation Prob-
REFERENCES lem,” in Vehicle Routing: Methods and Studies, B.
Golden and A. Assad (eds.), 249 –292, North Holland,
S. ALBERS, “Competitive Online Algorithms,” OPTIMA: New York, 1988.
Math. Prog. Soc. Newsl. 54, 1– 8 (1997). W. B. POWELL, “A Stochastic Formulation of the Dynamic
D. BERTSIMAS AND G. V. RYZIN, “A Stochastic and Dynamic Assignment Problem, with an Application to Truckload
Vehicle Routing Problem in the Euclidean Plane,” Motor Carriers,” Transp. Sci. 30, 195–219 (1996).
Opns. Res. 39, 601– 615 (1991). H. PSARAFTIS, “Dynamic Vehicle Routing Problems,” in
D. J. BERTSIMAS AND D. SIMCHI-LEVI, “A New Generation Vehicle Routing: Methods and Studies, B. Golden and
of Vehicle Routing Research: Robust Algorithms, Ad- A. Assad (eds.) 223–248, North Holland, Amsterdam,
dressing Uncertainty,” Opns. Res. 44, 286 –304 (1996). 1988.
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 85
P. SCHRIJVER, Supporting Fleet Management by Mobile P. TRUDEAU AND M. DROR, “Stochastic Inventory Routing:
Communications, P. R. Schrijver, The Netherlands, Route Design with Stockouts and Route Failures,”
1993. Transp. Sci. 26, 171–184 (1992).
D. D. SLEATOR AND R. E. TARJAN, “Amortized Efficiency of W. WHITE, “Dynamic Transshipment Networks: An Algo-
List Update and Paging Rules,” Com. ACM 28, 202– rithm and Its Application to the Distribution of Empty
208 (1985). Containers,” Networks, 2, 211–236 (1972).
W. STEWART AND B. GOLDEN, “Stochastic Vehicle Routing:
A Comprehensive Approach,” Eur. J. Opns. Res. 14,
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.