You are on page 1of 20

This article was downloaded by: [130.160.4.

77] On: 15 September 2016, At: 11:34


Publisher: Institute for Operations Research and the Management Sciences (INFORMS)
INFORMS is located in Maryland, USA

Transportation Science
Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org

On the Value of Optimal Myopic Solutions for Dynamic


Routing and Scheduling Problems in the Presence of User
Noncompliance
Warren B. Powell, Michael T. Towns, Arun Marar,

To cite this article:


Warren B. Powell, Michael T. Towns, Arun Marar, (2000) On the Value of Optimal Myopic Solutions for Dynamic Routing and
Scheduling Problems in the Presence of User Noncompliance. Transportation Science 34(1):67-85. http://dx.doi.org/10.1287/
trsc.34.1.67.12283

Full terms and conditions of use: http://pubsonline.informs.org/page/terms-and-conditions

This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.

The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.

© 2000 INFORMS

Please scroll down for article—it is on subsequent pages

INFORMS is the largest professional society in the world for professionals in the fields of operations research, management
science, and analytics.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
On the Value of Optimal Myopic Solutions
for Dynamic Routing and Scheduling
Problems in the Presence of
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

User Noncompliance
WARREN B. POWELL, MICHAEL T. TOWNS, AND ARUN MARAR

Department of Civil Engineering and Operations Research, Princeton University, Princeton, New Jersey 08544

The most common approach for modeling and solving routing and scheduling problems in a
dynamic setting is to solve, as close to optimal as possible, a series of deterministic, myopic
models. The argument is most often made that, if the data changes, then we should simply
reoptimize. We use the setting of the load matching problem that arises in truckload trucking
to compare the value of optimal myopic solutions versus varying degrees of greedy, suboptimal
myopic solutions in the presence of three forms of uncertainty: customer demands, travel times,
and, of particular interest, user noncompliance. A simulation environment is used to test
different dispatching strategies under varying levels of system dynamism. An important issue
we consider is that of user noncompliance, which is the effect of optimizing when users do not
adopt all of the recommendations of the model. Our results show that (myopic) optimal
solutions only slightly outperform greedy solutions under relatively high levels of uncertainty,
and that a particular suboptimal solution actually outperforms optimal solutions under a wide
range of conditions.

T he challenge of optimizing routing and schedul-


ing problems in a real-time setting has been receiv-
community in the form of the study of on-line algo-
rithms (SLEATOR and TARJAN, 1985).
ing increased attention (see, for example, PSARAFTIS, Routing and scheduling in a dynamic environ-
1988). The most common approach to solving these ment has been studied by a number of authors. One
problems is to solve a model using the data as they line of research has sought to analyze the properties
are known at a certain point in time, and then re- of greedy heuristics for stochastic, dynamic prob-
optimize as new data become available. Researchers lems (BERTSIMAS and VAN RYZIN, 1991; BERTSIMAS
in logistics typically work to find the best possible and RYZIN, 1991; BERTSIMAS and SIMCHI-LEVI, 1996;
solution for a problem instance. In some cases, this BERTSIMAS, CHERVI, and PETERSON, 1995). Others
may involve solving little more than an assignment have sought to develop explicit stochastic, dynamic
problem or network problem (e.g., POWELL, 1996) or models that incorporate both here-and-now informa-
a more complex integer programming problem (such tion and some forecasts of future activities (see, for
as those described in DESROSIERS, SOLOMON, and example, DROR, LAPORTE, and TRUDEAU, 1989; LA-
SOUMIS, 1995). In harder problems, such as classical PORTE and LOUVEAUX, 1990; STEWART and GOLDEN,
vehicle routing, optimal solutions are very hard to 1983; TRUDEAU and DROR, 1992; POWELL, 1996).
find, raising the question of whether the additional One of the challenges of classical vehicle routing
effort is worth it. In related areas, such as machine problems is that it is looking to solve a stochastic
scheduling, it is common to solve these problems version of a problem where the deterministic version
using greedy heuristics (see GRAHAM, 1966; ALBERS, remains computationally intractable and, for most
1997; and the general book PINEDO, 1995, and the problems, impossible to solve to optimality. Related
references cited there). This field has received con- research has focussed on problems where the deter-
siderable recent attention from the computer science ministic versions are relatively easy to solve, mak-
67
Transportation Science, © 2000 INFORMS 0041-1655 / 00 / 3401-0067 $05.00
Vol. 34, No. 1, February 2000 pp. 67–85, 1526-5447 electronic ISSN
68 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

ing the stochastic version somewhat easier to ana- performance by modeling randomness in travel
lyze. One problem in this class looks at managing times, which creates uncertainties around the ar-
fleets of vehicles over time, and the closely related rival time of a driver at the destination. And third,
problem of assigning drivers to full-load movements. we introduce and investigate the general issue of
One of the first deterministic models of a fleet man- user noncompliance. Because no model is perfect,
agement problem was posed by WHITE (1972), which users routinely override the solutions of driver as-
was followed by a series of papers suggesting and signment models. As a result, an additional level of
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

solving different approximations of a multistage, randomness arises in the actual use of the recom-
stochastic dynamic program (JORDAN and TURN- mendations of these models. In practice, several
QUIST, 1983; POWELL, 1986, 1987, 1988; and companies report that average usage of these mod-
FRANTZESKAKIS and POWELL, 1990). Recently, els is typically below 60%, and good performance is
CHEUNG and POWELL (1996) showed that, for this considered around 70%.
problem class, explicit stochastic models can signif- We believe that the issue of user noncompliance is
icantly outperform deterministic models in rolling fundamental to all models, static and dynamic, and
horizon experiments. Despite this research, most that it raises the question of whether so-called opti-
models in practice are basically deterministic and mal algorithms are indeed optimal. For example, if
myopic. All of these papers were oriented toward the real-world schedules do not use 100% of the solution
management of large fleets of vehicles. produced by a crew scheduling model (in a static,
Recent research (SCHRIJVER, 1993; Powell, 1996) planning environment), was the original solution
has focussed on the problem of matching individual even optimal? In this paper, we show that, in the
drivers to full-truckload movements, a problem presence of user noncompliance, optimal solutions
called the load matching problem in truckload truck- will be outperformed, over time, by algorithms that
ing. This problem class is much closer to vehicle
are more local in nature. We feel this has broad
routing problems in its basic characteristics, with
implications for the design of algorithms in other
the important exception that there is no in-vehicle
settings (vehicle routing, crew scheduling), which
consolidation. In addition, the loads are typically
seek to find optimal solutions to sequences of deter-
very long in duration, often requiring one or more
ministic, myopic problems.
days to complete. The advantage of this problem
To address these issues in a systematic fashion,
structure is that a single instance of the problem is
we have developed a simulation architecture to sim-
a network assignment problem, considering the
ulate truckload dispatching operations. This archi-
matching of drivers to loads. This problem is easy to
solve to optimality, and easy to update with real- tecture consists of three modules: a network optimi-
time information. Schrijver (1993) used a greedy zation model for assigning drivers to loads, a fleet
heuristic for matching drivers to loads in a study of simulator/dispatcher module, and a demand gener-
the value of real-time communications. The research ation module. The network optimization model is a
in Powell (1996) compared a myopic model to an production system that is in use at a major motor
approximation of the stochastic, dynamic problem, carrier. The modules communicate with each other
and showed that the stochastic, dynamic model out- through files, closely simulating the actual flow of
performed the myopic model in rolling horizon ex- data at a carrier (where the optimization model typ-
periments. ically runs on a separate workstation, communicat-
In this paper, we consider only myopic models of ing with the carrier’s primary management informa-
the load matching problem for truckload trucking. tion system through a database). Our experiments
Following common engineering practice, our plan is used the actual data from a large motor carrier,
to solve a given instance of the problem to optimal- scaled down to simulate a smaller fleet (of 400
ity, and then reoptimize as new information be- trucks), a step that accelerated the running of each
comes known. This is the approach that is widely simulation (which need to run much faster than real
used in commercial applications. Our interest is in time).
comparing optimal solutions of sequences of deter- The primary contributions of this paper are as
ministic, myopic models, obtained by solving the follows.
network assignment problem, to various approxima-
tions including greedy solutions that simply assign ● We show that different forms of system stochas-
drivers to the best load. We study these algorithms ticity have the effect of reducing the value of
under three forms of system stochasticity. First, we optimality when solving sequences of subprob-
consider the random process of shippers calling in lems in dynamic settings, and we show that a
loads. Second, we investigate the effects of network solution that is suboptimal for a particular prob-
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 69

lem instance actually provides higher solution rigorous standards. In the context of dynamic prob-
quality over an extended simulation. lems, it is necessary to solve a sequence of problems
● We introduce and study for the first time the over time as new information arises, thereby simu-
issue of user noncompliance, and show that this lating the real process.
form of uncertainty can, in particular, reduce In this setting, we must explicitly acknowledge
the value of optimal solutions. Noncompliance is three models: the sequences of subproblems that are
an easy to measure barometer of the quality of solved over time (each of which is a model with its
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

an optimization model as perceived by the user. own objective function), the larger simulation (with
Even relatively low levels of user noncompliance its own cost stream), and the even larger problem in
(say, 30% modification of model recommenda- the real world, which must reflect data errors and
tions) indicate that partially greedy solutions other biases.
can outperform optimal solutions. At the same Let ␻ ⫽ { ␻ 0 , ␻ 1 , . . . , ␻ t , . . . , ␻ T } be a sequence
time, similar levels of user noncompliance sig- of random outcomes over a horizon ᐀ ⫽ {0, 1, . . .,
nificantly reduced the gap between an optimal T}, where ␻ 僆 ⍀. We would then define a probabil-
solution and a greedy solution. ity space (⍀, Ᏺ, ᏼ) where Ᏺ is the ␴-algebra on ⍀,
● We experimentally quantify the effects of differ- and ᏼ is a probability measure defined over Ᏺ. Let
ent forms of system uncertainty, providing a Ᏺ t represent the sequence of sub-␴-algebras contain-
measure of the value of reducing these forms of ing the information known up to time t, where Ᏺ t 債
uncertainty on actual system performance. Ᏺ t⫹1 . Let f t␲ (x t 兩Ᏺ t ) be the objective function at time
t given an information set Ᏺ t at time t. The super-
The paper is organized as follows. Section 1 de- script ␲ captures both the structure of the function f t
scribes different perspectives of optimality in the and any parameters used to create the function (in-
context of dynamic routing and scheduling. Section cluding, for example, planning horizons, discount
2 presents an overview of dispatching operations in factors, and any cost factors introduced so that solv-
the truckload motor carrier industry, which we use ing f t at time t will produce a solution with good
as our problem setting. Section 3 presents the opti- long-run behaviors). We can think of f t␲ , ␲ 僆 ⌸ as
mization model that is used to solve the problem. the set of all possible functions (or policies, if you
Next, Section 4 discusses some of the issues that wish) that we could use at time t. Now let x t␲ (Ᏺ t ) ⫽
arise in global versus local optimization. These is- arg minx t僆X f t␲ (x t 兩Ᏺ t ) be a sequence of Ᏺ t -adapted
sues are central to any optimization problem where decisions (meaning that the function x ␲ can only use
the results will be implemented over time, during information in Ᏺ t ). (Note: The concept of Ᏺ t -adapted
which the status of the system may change. Then, decisions is similar, but not identical, to the concept
Section 5 gives an overview of our simulation archi- of nonanticipativity used in the stochastic program-
tecture, which takes advantage of a production load ming community, which is expressed as a constraint
matching system that is in use at a major motor requiring that one decision be chosen in time period
carrier. This is followed in Section 6 by a description 0 for all possible outcomes in the future. Instead of
of the specific modules. Section 7 describes the ex- saying that a decision is Ᏺ t -adapted, some authors
perimental design, and the results are reported in prefer to say that the decision is Ᏺ t -measurable.)
Section 8. Section 9 summarizes the major conclu- The decision function x t␲ (Ᏺ t ) returns a set of deci-
sions of the paper. sions that are to be implemented at time t given the
events Ᏺ t . We write x ␲ ( ␻ ) to represent the decision
made for a particular realization ␻ , with the implicit
1. PERSPECTIVES ON OPTIMALITY
understanding that the function x ␲ is Ᏺ t -measur-
TO PLACE THIS PAPER in the proper context within able. In determining this vector, we need to consider
the research community, it is important to briefly both here-and-now costs at time t, and possibly ap-
discuss the meaning of the word “optimal.” It is proximate estimates of costs that may be incurred in

common engineering practice to formulate a mathe- the future, but which are subject to solving x t⬘ for t⬘
matical model, typically with numerous simplifica- ⬎ t. For example, we may decide that a vehicle
tions to ensure tractability, and then to seek an should go from customer i 1 to customer i 2 right now;
optimal solution to this model. We all accept that the this is because we have formed a tour that consists
word optimal is used in the narrow sense of solving of (i 1 , i 2 , i 3 , i 4 ). The objective function f t␲ would be
a specific mathematical representation, without formulated to capture these future decisions, and
making any claims regarding the impact on the real may also include approximations of what would hap-
problem, because these impacts fall outside of what pen even farther in the future (as was done in Pow-
can be measured experimentally using scientifically ell, 1996). However, only the vector x t␲ is actually
70 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

implemented. Let c t represent the cost of these de- represents a rejection. In the event of a rejection, it
cisions. Then, the costs incurred at time t would be would be possibly to exclude a solution and reopti-
c t x t␲ , and the total costs over the entire simulation mize. In our work, we followed industry practice in
would be F(x, ␻ ) ⫽ 兺 t僆᐀ c t x t␲ ( ␻ ), where x ⫽ this setting and used the dual variables from the
(x 0␲ , x 1␲ , . . . , x T

). optimal solution to suggest alternatives.
We are now ready to talk about different types of Because the physical process representing a user
optimality. Our goal as modelers is to solve accepting a decision is extremely complex, it is cus-
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

再冘 冎
tomary to model a process such as this as a random
max F共 x兲 ⫽ E c tx t␲共 ␻ 兲 , variable. This modeling approach is comparable to
x僆X t僆᐀
adding a noise term at the end of a function to
capture all the nonmeasurable effects. We feel that
where X is the space of all feasible Ᏺ-adapted con- this representation of the user compliance issue rea-
trols x t␲ , ᐀ is the planning horizon, and the expec- sonably captures the effects of the model adoption
tation operator is over all outcomes in ⍀. In our process, and offers the benefit of being simple and
hierarchy of optimization problems, this is our experimentally analyzable. Given that we cannot
larger problem. In practice, we solve F(x) using a observe the real process of why users reject model
series of subproblems f t␲ (x t ) in an effort to find a rejections (and we have tried very hard to do just
solution that maximizes F(x). F(x), in turn, is noth- this), the next best alternative, and far superior
ing more than a proxy for the even larger problem in than ignoring the issue entirely, is to model the
the real world, which must also capture issues that acceptance process as a random variable.
we are unable to put into our model. In the presence of user noncompliance, as well as
In the arena of routing and scheduling, it is very other forms of uncertainty, it is reasonable to ask
common to use myopic solutions that are then up- the question: what is the value of an optimal solu-
dated as new information arrives. Although this is tion of the function f t␲ (x t )? Our work focuses on a
the simplest possible option for most classes of rout- problem where finding the optimal solution, given
ing and scheduling problems, it remains a combina- conventional modeling practice, is actually quite
torically difficult problem. Thus, even finding the easy, allowing us to compare solutions that solve
optimal solution to this relatively simple problem is f t␲ (x t ) optimally to those that solve it only approxi-
a nontrivial task. Because f t␲ (x t ) represents a single mately. We consider only myopic models, in part
instance of a classical optimization problem, it is because these are the models that are most widely
understandable that a considerable amount of effort used in practice, and more importantly, because we
will be devoted toward optimizing this problem, re- do not want to get into the tremendously rich and
alizing that it is simply a single snapshot of a larger complex arena of developing approximations of fu-
problem. Of course, even the larger problem, repre- ture activities. Given the emerging research into the
sented by F(x), is little but an approximation of an application of vehicle routing models in dynamic
even larger problem that encompasses information settings, which focuses primarily on the heuristic
that is not captured within the model. Because dy- solution of myopic models, we feel that our choice of
namic routing models must run in production, this research methodology is appropriate and produces
even larger problem, which must reflect the insights interesting results. Most significantly, we feel that
and biases of real users, must begin to be addressed the issue should be considered when testing differ-
within the context of our research. ent search algorithms in the context of harder prob-
In this paper, we model the effect of the even lems such as dynamic vehicle routing. For example,
larger problem through a process we call user com- in the presence of user noncompliance, time-con-
pliance. In this process, we present a recommenda- suming search heuristics may prove to add little or
tion to the user, at which point the recommendation no value in a dynamic environment.
is accepted or rejected. We make the assumption We do not claim that our simple model of user
that the cause for rejecting a recommendation re- compliance precisely captures the complex physics
flects information outside the domain of the model, of human thought. In support of our approach, we
and, hence, we model it as an exogenous random draw on the long history of mathematical models
variable. Specifically, for a particular decision, say that use random variables to capture highly complex
x ij , we introduce a random upper bound u ij ( ␻ ) ⫽ 1 processes. We do claim, however, that the estimates
with probability p d , and 0 with probability (1 ⫺ p d ). that we obtain of the value of optimal solutions in a
Only when we choose x ij ⫽ 1 do we then learn the dynamic environment are more accurate using our
value of u ij ( ␻ ) where u ij ( ␻ ) ⫽ 1 represents the user model of user compliance than they would be if we
accepting the recommendation, whereas u ij ( ␻ ) ⫽ 0 ignored the issue altogether. We further claim that
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 71

possibility of loads that will be called in later in the


day.
The operations of a truckload carrier are rela-
tively simple in concept. A shipper calls the dis-
patching office to request the pickup of a load at a
specific location within a specific time window. This
time window may begin on the day of the call-in or it
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

may begin some time in the future. The load must be


delivered to a specified location within a specific
time window. Other relevant information, such as
special driver and trailer requirements, is also sup-
plied by the shipper.
The dispatching situation for a major truckload
carrier generally involves multiple human dispatch-
Fig. 1. A simplified spatial assignment network.
ers each working within a specified geographic re-
gion. In the absence of decision support systems,
assignment decisions are made more or less inde-
an algorithm that provides a better solution to our
pendently of the actions of other dispatchers. Such a
model of user compliance is likely to be a more
pattern of multiple dispatchers acting this way in
robust algorithm that will provide better results in
relative isolation from one another tends to produce
practice.
a geographic patchwork of locally greedy optimiza-
tions. These local optimizations mostly neglect the
2. THE DYNAMIC ASSIGNMENT PROBLEM IN effects downstream in space and time of assign-
TRUCKLOAD OPERATIONS
ments. The result is a solution that is suboptimal
WE FIRST PRESENT an introduction to the dynamic from a mathematical standpoint as compared to a
assignment problem and its relationship to truck- solution derived from an optimization decision sup-
load motor fleet management (see Schrijver, 1993 port system.
and Powell, 1996 for a much more complete discus-
sion). The dynamic assignment problem involves the
3. THE LOAD MATCHING OPTIMIZATION MODEL
allocation of resources to perform tasks over a dis-
crete-time dynamic network. Only some fraction of THE LOAD MATCHING problem assumes that at a
the relevant tasks in the network is known initially, given point in time t, we have a set of loads ᏸ t and
with additional tasks becoming known in real time a set of drivers Ᏸ t , and we need to match drivers to
as time progresses. This means that the optimal loads. Some of the loads may not be available for
solution for the problem is constantly changing as pickup until some time in the future (possibly a day
new information becomes known. The result may be or more) and some of the drivers are currently pull-
that the viability of a previously optimal solution is ing loads, and also will not be available until some
reduced significantly in light of new information. point in the future (the driver estimated time of
In truckload applications, drivers and loads are arrival (ETA)).
spread almost continuously over space. At a point in For each potential assignment of a driver d 僆 Ᏸ t
time, the challenge is to match drivers to loads, as to a load l 僆 ᏸ t we can assign a cost c dlt
a
(this cost
illustrated in Figure 1. The problem is complicated will change with time). In addition, we must con-
by the fact that there is much more to the problem sider the cost of refusing to service a load, and the
than simply finding the closest driver to a load. cost of holding a driver idle. The problem here is to
When finding a driver, a planner has to balance decide which drivers to assign to which loads to
deadhead (empty) miles, the availability of the minimize the total costs (maximize total profits)
driver (some of the drivers may only be available throughout the network. An example of the driver
late in the day), the pickup and delivery windows of assignment problem (using only assignment costs) is
the load, where the driver lives and whether the load shown in Figure 2.
will get him home on time, the skill and experience Loads are characterized by a) an origin and des-
of the driver (some shippers require top-rated driv- tination, b) a pickup time window and a delivery
ers), whether the driver is a single driver (which can time window, and c) a vector of attributes, which
only drive 10 hours at a stretch) or a team (which might include such items as type of driver required
can drive almost continuously), and other character- and load priority. Drivers are characterized by a) a
istics. In addition, the planner needs to consider the location at time of availability, b) a time of availabil-
72 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

subject to

冘x
d僆Ᏸt
a
dlt ⫹ y lt ⫽ 1 l 僆 ᏸt (2)

冘x a
dlt ⫹ x dt
h
⫽1 d 僆 Ᏸt (3)
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

l僆ᏸt

a
x dlt h
, y lt , x dt ⭓0 l 僆 ᏸ, d 僆 Ᏸt .

We let ␯ l be the dual variable for Eq. 2 and ␯ d be the


dual variable for Eq. 3. Note that f t (x t ) represents
the costs that would be incurred at time t if x t were
chosen. This function is purely greedy and makes no
effort to incorporate factors that might improve the
selection of x t to find solutions that would produce
Fig. 2. A simple driver assignment problem. The c dlt on se- higher values for the multistage problem. In Section
lected arcs represent the arc cost for the assignment of driver d to 1, the notation f t␲ (x t ) refers to a function that incor-
load l at time t.
porates not only the immediate effect of a solution
(represented by f t (x t ) above) but any other adjust-
ity, which may be in the future (if the driver is ments that might be included to produce better long-
currently assigned to a load) or the past (if the driver run solutions. In the remainder of this paper, we use
is waiting for assignment), and c) a vector of at- only f t (x t ) because we are considering only solutions
tributes including special information like time of myopic models.
spent on duty, experience level, and home domicile. The optimal solution of Eq. 1 can be found by
We can formulate this assignment problem as a solving a simple assignment problem as depicted in
simple static assignment model as follows. Figure 2. In more complex problems, such as those
ᏸ t ⫽the set of all loads known to the system at that involve routing a driver through a sequence of
time t points (for example, if the loads are short, or if there
Ᏸ t ⫽the set of all drivers in the network at time t is in-vehicle consolidation), we may have to resort to
a
c dlt ⫽the cost of assigning driver d to load l at heuristic solutions. In a dynamic setting, it is rea-
time t sonable to consider the use of suboptimal or even
r
c lt ⫽the cost of not assigning any drivers to load l at greedy solutions. For this reason, we let ⌸ be the
time t (the refusal cost) class of algorithmic strategies (or policies), and let
h
c dt ⫽the cost of not assigning driver d to any load at x t␲ , ␲ 僆 ⌸ be the solution obtained using policy ␲.
time t (the holding cost). The cost function c dlta
for a truckload motor carrier
Our decision variables are has to capture a number of performance measures.


A successful carrier must minimize empty miles,
1 if driver d is assigned to cover maximize on-time service for loads, maintain a high
a
x dlt ⫽ load l at time t level of driver satisfaction, and maximize equipment
0 otherwise, productivity. In the truckload arena, where drivers

y lt ⫽ 再 10 ifotherwise,
load l is refused at time t often spend several weeks away from home, driver
satisfaction is measured by our ability to get drivers
back to their home by a certain time. Other issues
⫽ 再 0 otherwise.
h 1 if driver d is not assigned to any load include whether a driver has the right training to
x dt
handle the needs of a particular load; whether his
The objective function is then given by, for a given truck is too heavy to handle a particular load; and
point in time t, whether he is considered sufficiently reliable to

冉冘 冘 冘 c x ⫹冘c y 冊
meet the needs of an account.
min ft 共xt 兲 ⫽ a
cdlt a
xdlt ⫹ h h r We denote all these factors using a multidimen-
dt dt lt lt
x a, x h, y d僆Ᏸ t l僆ᏸ t d僆Ᏸ t l僆ᏸ sional cost function that is then collapsed into a
(1) single utility function. Suppressing the time param-
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 73

eter t, we may define posed to a heuristic or suboptimal solution (as might


be used in harder problems such as vehicle routing).
ᏹ ⫽set of cost factors to be considered in Real-time problems offer a special challenge to the
the assignment of a driver (such as optimization community because it is difficult to get
empty miles, on-time service, and so quality data in a timely way. For example, humans
on) possess a certain amount of “head knowledge,”
ad ⫽vector of attributes of driver d which is information that they have acquired by
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

bl ⫽vector of attributes of load l telephone, conversations, visual inspection, and ex-


rl ⫽revenue, or reward from picking up a perience that is not in the computer. In addition,
load there is information that no one knows. The most
␳ ⫽vector of parameters that are used to significant form of uncertain information is the de-
translate a performance measure mands of the customers, but we would also include
(such as arriving late to pick up a weather delays, breakdowns, and failure of drivers
load) into a cost. to perform a given task.
a
c mt (ad , bl , ␳ ) ⫽cost of factor m 僆 ᏹ of assigning a We believe these factors are fundamental to any
driver with attribute vector. ad to a application. The result is that the recommendations
load with attributes bl at time t, made by a model are not always implemented. We
given utility parameter vector ␳ refer to this effect as user noncompliance, which
measures the degree to which users actually imple-
The net total contribution of assigning driver d to
ment the recommendations of a model. User non-
load l at time t can now be expressed as
compliance often reflects information the user has
a
c dlt 共␳兲 ⫽ rl ⫺ 冘c
m僆ᏹ
a
mt 共ad , bl , ␳ 兲. (4)
that the system does not. The presence of user non-
compliance implies that globally optimal solutions
are, in fact, not optimal at all. Humans reflect this
In this paper, we are using the cost function that is property intuitively by making decisions that are
in production at a major motor carrier. The param- good in a local sense (both spatial and temporal) but
eters have been carefully chosen to reflect what the which may not, in theory, properly take into account
carrier believes is an accurate measure of the factors their impact on other parts of the system (in space or
that impact the performance of the behavior. Thus, time). For this reason, software developers have ar-
when we translate a late pickup into a $50 cost, we gued that global optimization models can outper-
treat this as a real cost, comparable to incurring $50 form humans, overcoming the limitations humans
in transportation costs. have in dealing with large problems.
We can quickly illustrate the impact of uncertain
data on solution quality using a simple illustration
4. GLOBAL VERSUS LOCAL OPTIMIZATION OF THE shown in Figure 3. Here, we show driver A covering
SUBPROBLEM
load 1, driver B covering load 2, and driver C cover-
THE EASE WITH WHICH the driver assignment model ing load 3. Driver C would prefer to cover load 1, but
can be solved has spawned a cottage industry of if this were done, driver A would not have very
software vendors who supply models of this sort to attractive options. A few minutes later, a new load is
the truckload industry. A major selling point was called in that is better for driver A, allowing driver C
the ability to perform global optimization in an in- to switch to load 1 after all. At this point, we learn
dustry where the driver assignment problem was that driver B would rather cover load 3 instead of
solved manually, using methods that tended to pro- load 2.
duce locally good solutions. Researchers are actively Given the same example, it would not be uncom-
developing similar globally optimal solutions for mon for a human to assign driver C to load 1 and
other problem classes in vehicle routing and sched- driver B to load 3, leaving both driver A and load 2
uling (see Desrosiers et al., 1995 and FISHER, 1995 unassigned. Because we assumed that driver A did
for recent surveys of the field). Of course, the value not really have any attractive assignments, the
of a globally optimal solution depends, as always, on planner might keep driver A unassigned while hop-
the quality of the data. These references all refer to ing for something better. This would be an instance
global optimization in the context of providing a of using a greedy solution now to achieve an overall
mathematically optimal solution to an individual better solution later.
subproblem f t (x t ). For the remainder of this paper, In practice, humans making decisions on an oper-
all references to global optimization refer to our ational basis spend relatively little time trying to
ability to find the optimal solution to f t (x t ) as op- find the best solution for a given situation. Instead,
74 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 4. The multimodular fleet simulation system. The diagram


shows the main modules and communication pathways between
them.

dispatcher determines that a driver or load needs to


be matched, it sends a query to the optimizer asking
Fig. 3. Revised optimal assignment in light of new service
demand information. for information about possible assignments. The op-
timizer, in simulated real time, sends over a small
file containing a number of possible loads that a
they spend most of their time simply verifying that driver may be assigned to, along with the full set of
the data they are looking at is, in fact, correct. Given costs and duals. Depending on parameter settings,
the inherent uncertainty in the data (which may the dispatcher may implement exactly what the op-
have nothing to do with forecasts of future customer timizer recommends, or we may allow it to make
activities), it is perhaps not surprising that they other choices.
limit their searches for good solutions to small, ver- As the simulation progresses, two types of events
ifiable subsets of data. Below, we show that such a can occur: drivers become available for assignment
strategy can actually outperform a globally optimal to available loads, and new loads come into the net-
solution. work for assignment. A schematic of the driver as-
signment-availability process is shown in Figure 5.
The letter labels in the diagram refer to the various
5. SIMULATOR ARCHITECTURE AND DEVELOPMENT
modules used at each step in this process. As a
WE HAVE DEVELOPED a specialized simulator archi- driver becomes available (in simulation time) for
tecture to test hypotheses regarding the behavior of assignment, the simulator module (A) requests from
different dispatching strategies in a dynamic envi- the optimization module (B) driver–load arc-costing
ronment. A primary goal of our architecture was to information for all loads with which the driver can
take advantage of a production load-matching sys- be feasibly paired. This costing information is of the
tem that has been developed and implemented for a type required for the costing functions described in
large truckload motor carrier. For this reason, we Section 3 and includes travel distances, availability
adopted a modular architecture consisting of three times, and load characteristics. In addition, the most
components: a network optimization module, a fleet recent set of dual variables from the network algo-
simulator/dispatcher module, and a demand-gener- rithm is supplied from which the arc reduced costs
ator module. The interaction between these three may be calculated. The optimizer also supplies the
modules is represented graphically in Figure 4. flow on each arc, because more than one arc may
Each of the three modules runs independently, co- have zero reduced cost (in such an event, the simu-
ordinated through the use of a common clock object lator will always choose the arc to which the opti-
(just as a real system would operate). Of course, the mizer assigned flow). From this information, a dis-
simulation runs at a rate faster than real time. patching function inside the simulator module (C)
The simulator architecture allows a wide variety makes a driver–load assignment decision (which
of dispatching situations to be simulated. When the may include no assignment at all), and updates all
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 75
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 5. Sequence of multimodular simulator actions occurring when a driver becomes available for assignment.

relevant driver, equipment, and load information, as the appropriate cost and assignment information
such as future driver time of availability, and load is given to the simulator module. The assignment
availability status. The assignment selection, infor- decision-making logic is also quite flexible in that
mation regarding any currently uncovered (rejected) the user can specify exactly how and what costing
loads, and any other desired information is then information is to be used. This, for example, allows
written to a log file for later analysis by the user (D). the user to specify that the simulated dispatcher
Driver and load status update information are writ- exactly implement the solution of the optimization
ten to flat files for input by the optimizer module (E), algorithm being used and then compare the result-
which then supplies a reoptimization of the network. ing solution to that obtained by using some arbi-
This driver assignment process is completely gen- trarily defined costing function. Such a function
eral in that it can be used with any optimization might be used to represent a human dispatcher’s
algorithm, or even with no algorithm at all, so long internal utility function.
76 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

6. SIMULATOR MODULES AND PROCEDURES these functions are presented in Figure 6. The fol-
IN THIS SECTION, we describe some relevant details lowing notation is used in this figure.
specific to the three modules that comprise the fleet t st ⫽the simulated start time (in seconds) of the
simulation architecture: the optimizer, the simula- simulation run
tor module (or interchangeably, the fleet simulator/ t end ⫽the simulated end time (in seconds) of the sim-
dispatcher module), and the demand generator. We ulation run
also discuss the use of the dispatcher selection prob-
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

t cur ⫽the simulated current time (in seconds) of the


ability parameter, p d , and the implementation of simulation run
random travel times in the simulation architecture. d ⫽a given driver
Ᏸ t ⫽the set of all drivers in the network at time t
6.1 Optimizer
Ᏸ ta ⫽the set of all drivers available for assignment
The driver assignment optimizer consists of a net- at time t, (Ᏸ ta 傺 Ᏸ t ).
work generator and a network simplex code for solv-
ing the matching problem. The code is a production The costing of potential assignments described in
dispatch system that is in use at several carriers. this figure is done using a modified reduced cost
The cost functions that determine the penalties for formula. This formula uses a dual variable discount-
violated delivery windows or for not returning a ing factor to vary the degree of dual variable utili-
driver to his home on time are the same costs that zation in assignment decisions. The formula used is
are used in production. For this reason, we have as follows:
treated these costs as real costs, because they have
been chosen to accurately capture the tradeoff be- c៮ dlt共 ␣ 兲 ⫽ c dlt ⫹ ␣ 共 ␯ lt ⫺ ␯ dt兲, (5)
tween meeting these goals and running additional
empty miles (which is a hard, well-defined cost). where

6.2 Demand Generator c៮ dlt ( ␣ ) ⫽the adjusted reduced cost for assigning
driver d to load l at time t
The demand generator uses an actual historical c dlt ⫽the arc cost for assigning driver d to load l
demand file, from which it draws a random sample. at time t
As the simulation progresses, each load is read in, ␯ dt ⫽the dual variable for the node for driver d at
and a random number is drawn to determine time t
whether it is to be included in this run. If it is, then ␯ lt ⫽the dual variable for the node for load l at
the generator waits until it is the right time, com- time t
paring the actual call-in time of the load to the ␣ ⫽the dual variable discount factor.
simulated wall-clock time. When it is the right time,
the generator sends the load over to the dispatch Potential assignments for a given driver are ranked
simulator. for selection purposes from lowest to highest cost
The use of a real dataset means that we have an using c៮ dlt ( ␣ ). For full global optimization, we set ␣ ⫽
accurate picture of actual call-in processes, and real 1.0, which gives us the equivalent network simplex
data for the attributes of the loads, including pickup reduced cost. For full greedy optimization, we set
and delivery windows. By using only a sample of ␣ ⫽ 0.0, which gives a ranking according to arc costs.
these loads, we were able to run repeated simula- Values of ␣ in the interval (0, 1) are used to repre-
tions using different random number seeds, allow- sent intermediate strategies.
ing us to obtain repeated observations of a particu-
lar optimization process. In our work, we used a 30% 6.4 The Dispatch Function
sample of the load, reflecting the fact that our fleet
As discussed above, the presence of user noncom-
size was approximately 30% of the size of the actual
pliance with the optimization system’s recommen-
fleet used by the carrier. When we run repeated
dations is a major factor in the implementation of
simulations with the same set of parameters, then
optimization decision-support systems. To simulate
we are simply choosing different samples of loads.
this process, we define
6.3 Fleet Simulator/Dispatcher
u dlt共w兲
The simulator module performs two main func-
tions. The first is the actual simulation of a truck-
load fleet, and the second is the simulation of a
human dispatching operation. The main steps in
⫽ 再 1 if the user accepts the recommendation
x dlt ⫽ 1 at time t
0 otherwise
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 77
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 6. Main fleet simulator/dispatcher module procedure.

p d ⫽ the probability that a dispatcher will


accept a potential driver to load assignment
⫽ P关u dl共w兲 ⫽ 1兴.

x 共 ␣ , w兲 ⫽
dlt 再 1 if l ⫽ argmin兵c៮ dl⬘共 ␣ 兲兩u dl⬘共w兲 ⫽ 1其

0 otherwise.
l⬘僆ᏸt

(6)
For a given realization w, we can determine which Equation 6, then, defines a class of dispatch func-
load a driver will be assigned to, allowing us to tions parameterized by ␣. If ␣ ⫽ 1, then we obtain
define the dispatch function for a driver d 僆 Ᏸ t by the optimal solution of f t ; if ␣ ⫽ 0, then we obtain a
78 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

greedy solution. 0 ⬍ ␣ ⬍ 1 produces a range of simulation is a function of ␣ and p d . In addition, our


intermediate solutions. Of course, many other types simulations considered variations of other problem
of decision functions are also possible. For example, characteristics, notably the presence of randomness
we could use a stable marriage algorithm (see, for in travel times and the dynamics of the process by
example, GUSFIELD and IRVING, 1989) or a local which customer demands become known to the sys-
search heuristic. Our function provides an elegant tem.
knob that takes us smoothly from greedy local solu-
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

tions to optimal solutions. 6.6 Random Travel Times


For example, the first assignment will be selected In transportation applications, one of the most
by the simulated dispatcher function with probabil- important sources of noise arises in travel times. In
ity p d and rejected with probability (1 ⫺ p d ). If it is the load matching problem, travel times are impor-
rejected, the second-best assignment will then be tant because they impact our ability to estimate
selected with probability p d and rejected with prob- when a driver will become available in the future,
ability (1 ⫺ p d ). This process continues until either and whether this driver could be assigned to a load
an assignment selection is made, or the number of with a pickup window. To analyze the impact of
possible assignments rejected equals a specified errors in travel times, we first define
user parameter n. Should n potential loads be re-
jected (with probability (1 ⫺ p d ) n ), the given driver d od ⫽the distance in miles from load origin to desti-
is assigned to remain idle for a specified time period, nation
t idle. For our experiments, we set n ⫽ 5 and t idle ⫽ 60 t eta ⫽the expected travel time in hours
minutes of simulation time. ⫽60d od/r
A problem that exists when using this logic is the r ⫽the average rate of speed (typically 40 mph).
potential for the selection of excessively expensive We calculate a realization of an actual travel using
assignment arcs that would never actually be se-
lected by a real human dispatcher. For example, t actual ⫽ U共t min , t max兲
when using a value of p d ⫽ 0.4, the chance of
selecting the fourth-best cost-wise assignment is ⫽ the real travel time in hours,
about 9%, and the fifth-best is about 5%. Because, in where U(a, b) is a random variable uniformly dis-
reality, there are often not five realistic assignment tributed over the interval [a, b]. The bounds of our
arcs for a given driver, these low-ranked arcs may distribution are determined using
have very large arc costs and should be considered
infeasible from a common-sense standpoint. The in- t min ⫽ the minimum possible travel time,
clusion of even a small number of such arcs in the
simulated dispatcher’s solution can skew objective ⫽ t eta ⫺ ␤ w s 冑t eta
function values in an unrealistic fashion. To prevent t max ⫽ the maximum possible travel time,
selection of such arcs, a cost cap of $500 was placed
on arcs considered feasible for assignment. ⫽ t eta ⫹ ␤ w e 冑t eta ,

6.5 Objective Function where


We may now define the total net contribution (or w s ⫽the number of standard errors ( 公t eta) prior to
profit) incurred from running a simulation as the expected time to include in the random

再冘 冘 冘 冎
variable window,
tend w e ⫽the number of standard errors ( 公t eta) after the

F共 ␣ , p , ␳ 兲 ⫽ E
d pd
⫺c dlt共 ␳ 兲 x dlt共 ␣ , w兲 , expected time to include in the random variable
t⫽tst d僆Ᏸta l僆ᏸt window,
(7) ␤ ⫽noise intensity parameter ranging from 0.0 to
1.0. If ␤ ⫽ 0, then there is no noise and we have
where we choose now to maximize profits over a
deterministic travel times.
simulation (instead of minimizing costs at a given
point in time). The expectation is over all random During the simulated dispatcher’s costing of driver–
events, including the likelihood of user acceptance, load pairs, real travel times for transit between load
given by the parameter p d (hence the explicit repre- origin and destination are obtained by the above
sentation of p d in the expectation). We approximate formulation and stored in the simulator module’s
the expectation by running several simulations us- data structures. When the simulator module selects
ing different random number seeds. Each run of the a pairing for assignment, it sends an updated ETA
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 79

for the driver to the optimizer using t eta. Thus, the obtain estimates of the cost of each source of uncer-
optimizer is planning future assignments using t eta, tainty.
whereas the simulator maintains the actual arrival We measure the value of global over local solu-
time t actual. tions through the dual utilization factor ␣. Thus, our
In the event that t actual ⬍ t eta, the driver will experiments will focus on determining the optimal
arrive at the load destination and become available value of ␣ under different degrees and sources of
for a new assignment earlier than expected. At this randomness.
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

time, the simulated dispatcher queries the optimizer The experimental data used for our runs was ob-
module to obtain costing information for feasible tained from a major truckload motor carrier. A sim-
loads for the given driver. This costing information ulated fleet consisting of 400 drivers was con-
will be based on t eta. The simulated dispatcher then structed by randomly picking from a data set
recosts potential assignment arcs from the optimizer consisting of the carrier’s full staff of about 1500
based on the early arrival, and makes an assign- drivers. An initial set of 100 loads was constructed
ment selection using the procedure described in Sec- for input into the system. Subsequent to these initial
tion 6.3. 100 loads, the demand generator was set to provide
Alternatively, if t actual ⬎ t eta, the driver will arrive approximately 225 loads per day of simulation time.
at the load destination and be available for a new Both the initial and subsequently generated load
assignment later than expected. When the simula- sets were constructed by sampling from a set of
tion clock reaches and then passes time t eta, the actual historical service demands called into the
optimizer continually resets t eta to be the current carrier over a one-week period. Both the driver and
time. load sets were distributed throughout the continen-
Randomness in travel times can have the effect of tal U.S. Each run simulated seven days of actual
encouraging greedy solutions. Consider the situa- dispatching, run at a speed of 60 times real time,
tion where drivers 1 and 2 are available now, and resulting in a run time of 2.8 hours per run. For each
driver 3 will become available later in the afternoon. combination of parameter values simulated, with
Driver 1 is 40 miles away from a load, driver 2 is 90 the exception of the advance demand booking exper-
miles away, and driver 3, when he arrives, would iment discussed in Section 8.1, we conducted five
only be 20 miles away. Assume that driver 3’s esti- simulation runs. For the advance demand booking
mated time of arrival is 3 P.M., and the load must be experiment, 10 runs were conducted for each param-
picked up before 5 P.M. A network model would eter value studied. This was done to clearly delin-
choose driver 3 to cover the load based on cost and eate solution behavior for this special case.
service considerations. In this case, we might assign For each run, we statistically estimated a second-
driver 1 to another load. Later, we might discover order polynomial relating total simulation profits
that the ETA for driver 3 is inaccurate, and that he (given by Eq. 7) to the dual discount factor ␣. We
will be delayed several hours. In this case, we have retained this second-order specification throughout
no choice but to assign driver 2 to the load, at a much the analysis.
higher cost. Our experimental design is based on running sim-
ulations with different values of ␣ and p d , along
with other sources of randomness, and drawing con-
7. EXPERIMENTAL DESIGN
clusions regarding the appropriate decision function
(as specified by the parameter ␣ in Eq. 6). Thus, we
OUR PRIMARY INTEREST is in determining the value need to think about what it means to compare val-
of optimal (myopic) solutions over local solutions ues of F( ␣ , p d , ␳ ) in Eq. 7 for different values of p d
under different sources and degrees of uncertainty. and ␣. Three issues need to be discussed. First is the
We measure the degree to which a decision is global role of the utility parameter vector ␳, and what it
versus local through the dual discount parameter ␣. means to optimize over a mixture of soft and hard
We consider three sources of randomness: a) ran- costs. Second is the issue of comparing model runs
domness in the dynamic arrival of loads to the sys- for different values of the user acceptance parame-
tem; b) randomness in the travel times, captured as ter p d . Finally, we discuss the important issue of
errors in our estimates of driver ETA’s; and c) ran- comparing model runs for different values of ␣.
domness in the implementation of recommendations The utility vector ␳ controls the relative impor-
by the user, which we refer to as the user noncom- tance of different dimensions of the cost function. As
pliance problem. With each source of randomness, we discussed above, our costs include both quantifi-
we focus on finding the value of ␣ that minimizes the able transportation costs, and other soft costs re-
simulation cost function. In the process, we also lated to customer service and driver management.
80 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 7. Objective function versus dispatcher dual discount factor with advance demand booking and perfect user compliance
(p d ⫽ 1.0).

Our research addresses the challenge of finding a using different values of the user acceptance proba-
decision function x ␲ that maximizes F( ␣ , p d , ␳ ) bility, p d . This question raises a very subtle issue
within the class of decision functions defined by Eq. with regard to interpreting the objective function
6, parameterized by ␣. At this point, we take as F( ␣ , p d , ␳ ). Although we have a well-defined objec-
given the utility function parameterized by the vec- tive function, the issue of user compliance brings
tor ␳. Thus, a dollar of lateness counts the same as a into focus the difference between the objective func-
dollar from traveling empty. In the context of our tion we are maximizing, and the one the users are
research, this is the correct way to run the simula- maximizing. Assume that we wanted to compare
tion. The choice of the vector ␳ is set by management F( ␣ , p 1d , ␳ ) to F( ␣ , p 2d , ␳ ) where p 1d ⬎ p 2d . We might
(in our case, this is the vector running in production wish to estimate the increase in profits if the user
at a major motor carrier). It is not our job to question acceptance level increased. In fact, this comparison
whether we have the correct value of ␳, which has is not meaningful, and we cannot use our results to
been chosen to balance the importance of issues such estimate the value of higher user compliance. To see
as on-time service and routing drivers through their this, consider a run with p 1d ⫽ 1 versus p 2d ⫽ 0.7. At
home with operating costs. Our simulations cover a p 1d ⫽ 1, we are accepting all the recommendations of
week of dispatching; tradeoffs of empty miles and the model. In general, these recommendations are
service, in contrast, are learned over years of watch- the highest profit options and will produce the high-
ing customers switch to competitors because of poor est profits over a simulation. If p 2d ⫽ 0.7, then we
service. Our challenge, then, is to maximize are, in effect, saying that 30% of the time, a top-
F( ␣ , p d , ␳ ) for a given choice of ␳. For this reason, ranked recommendation is not acceptable. Thus, the
we have made no effort to run simulations with supposedly high profit loads that appear at the top
different values of ␳. of the list for a driver are, in fact, low profit (mea-
The second issue concerns comparing model runs sured in terms of our utility function) for reasons
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 81
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 8. Objective function versus dispatcher dual discount factor, deterministic and stochastic travel times (p d ⫽ 1.0).

that are not captured by the model. It would be 8. EXPERIMENTAL RESULTS


tempting to use our simulations to estimate the
WE NOW PRESENT the results of our simulations.
value of always using the model versus only some-
First, Section 8.1 describes the results of our exper-
times using the model. We feel that our simulations
iments that focus on the effects of randomness in the
do not allow us to draw any conclusions along these
booking process. It is in these runs that we produce
lines.
Finally, we address the third and most important results that show that, if the demands are known in
issue of comparing values of F( ␣ , p d , ␳ ) for different advance, and travel times are deterministic, and
values of ␣. Here, we are using the same user accep- user compliance is 100%, then the optimal myopic
tance probability p d . For a fixed value of p d we will, solution (␣ ⫽ 1) produces the best results. This is an
for any given value of ␣, be searching, on average, important piece of validation.
the same distance down any list of loads offered to a Next, Section 8.2 considers the impact of uncer-
driver. As long as p d is held fixed, we are capturing tain travel times. Finally, Section 8.3 presents the
the same degree of dissatisfaction with the higher experiments that summarize most carefully the
ranked options for a driver. When we change ␣ while value of greedy, optimal, and intermediate solutions
holding p d fixed, we are, in effect, solving the same in the presence of varying levels of user compliance.
problem with different algorithms.
The parameters ␳ and p d , then, represent charac- 8.1 Effect of Dynamic Service Demand
teristics of the problem. ␣, in contrast, is a charac- Booking
teristic of the algorithm. We may compare
F( ␣ , p d , ␳ ) directly for different values of ␣, and we Our first set of experiments included a set of runs
may also compare the optimal value of ␣ for different with the lowest level of uncertainty among all our
values of p d and ␳, but we may not compare experiments. In this run, we assumed all loads to be
F( ␣ , p d , ␳ ) directly for different values of p d or ␳. called in during the day were known at 7 A.M. on
Because a major focus of our research is the impact that day (but we do not know about tomorrow’s
of user noncompliance, it is interesting to estimate loads). We assumed deterministic travel times and
the best value of ␣ for different values of p d . We did perfect user compliance. We then ran simulations
not feel that it would be very interesting to ask the with different values of ␣, expecting to find that the
same question for different parameter vectors ␳. best value of ␣ was very close to 1.0. For this set of
82 / W. B. POWELL, M. T. TOWNS, AND A. MARAR
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 9. Objective function versus dispatcher dual discount factor, various probabilities of selection.

experiments, we performed 10 samples for each in the form of random travel times between load
value of ␣ that was tested. origin and destination locations. The random travel
The results are shown in Figure 7, which com- times were implemented as described in Section 6.6.
pares total profits versus ␣ for two sets of data: the The objective function values for these runs were
first assumes all the daily bookings are known at the compared to those obtained using only expected
beginning of the day, and the second retains the travel times.
original booking profile of loads. It is not surprising For these experimental runs, we set p d ⫽ 1.0. The
that we get higher profits when all the demands are noise intensity parameter, ␤, was set to 1.0. The
known at the beginning of the day. Most significant time window start and end parameters, w s and w e ,
is that, when demands are known in advance, the respectively, were both set to 1.5. This means that a
curve steadily increases with ␣, leveling off for val- 600-mile move that takes an average of 16 hours
ues of ␣ over approximately 0.60. For this run, we might require anything from 10 to as much as 22
could not reject the hypothesis that the optimum hours. While drivers in a real-world situation tend
dual discount factor was equal to 1.0, as we would to run late more often than early, we wanted the
expect. When we simulate dynamic booking (the expectation of the random travel time to equal the
lower curve), performance noticeably dropped for expected travel time. This was done to avoid influ-
values of ␣ over 0.80, with an apparent optimum encing the objective function value by changing the
close to 0.75. This is our first evidence that, in the relative number of driver–load assignments made
presence of random customer booking, the global during a simulation. Had the window been set to
optimum solution does not give the best overall re- always make travel times at least as long as the
sults. expected time, for example, the fact that drivers
would always be at best on-time and usually late
8.2 Effect of Network Stochasticity would mean that fewer assignments could be made
The second set of experiments we performed ex- in the course of a simulation.
amined the effect of stochasticity in fleet operations The results are shown in Figure 8. These show
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 83
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

Fig. 10. Objective function versus dispatcher selection probability, various dual discounts.

that randomness in travel times has a relatively globally optimal solutions appear to slightly under-
modest (but significant to a motor carrier) impact on perform solutions using ␣ ⫽ 0.75.
overall profits. We note that the impact of ␣ is di- A different perspective on the results was ob-
minished in the presence of random travel times. tained by plotting total profits as a function of dif-
More importantly for our research, the shape of the ferent values of p d , using ␣ ⫽ 0, 0.75, and 1.0, shown
curve as a function of ␣ appears not to have an in Figure 10. The results show that, as the level of
impact on the optimum dual discount factor. Of user compliance drops, the value of a globally opti-
course, both runs were done in the process of dy- mal solution over a greedy solution drops dramati-
namic booking, so we are, in effect, observing that cally. In the truckload motor carrier industry, it is
the randomness in customer bookings is more im- not unusual to see values of p d ⫽ 0.6 or even lower.
portant than randomness in travel times. The opti- A value of p d ⫽ 0.7 is considered very high. These
mum value of ␣ still appears to be around 0.75. results show that, for these levels of user compli-
ance, greedy solutions work almost as well as glo-
8.3 Effect of User Noncompliance bally optimal solutions. Furthermore, using ␣ ⫽ 0.75
gives the best results over all levels of user compli-
Our last set of experiments looked at the effect of
ance, although, for the lowest levels, it really does
user noncompliance on system performance. We re-
not matter what you do. Reassuringly, if user com-
port these results in two ways. First, Figure 9 sum-
pliance is high (over 90%) then the value of global
marizes system profits as a function of ␣ for different
network information (either with ␣ ⫽ 1.0 or 0.75) is
values of p d . Our first conclusion from these runs is
quite high, and demonstrates the usefulness of op-
that the optimum value of ␣ is close to 0.75 for all
timization models.
three values of p d . Our second and more significant
conclusion is that, for the lowest levels of user com-
pliance, p d ⫽ 0.4, the dual discount has relatively
9. CONCLUSIONS
little effect on overall profits. Said differently, the
value in listing loads sorted in the order from best to THE EXPERIMENTS in this paper investigate the im-
worst (measured with respect to c dlt ( ␣ )) is signifi- pact of different forms of uncertainty on the value of
cantly reduced. As user compliance rises, the value optimal (myopic) solutions in a particular class of
of a proper listing of loads increases, although the routing and scheduling problems. The results show
84 / W. B. POWELL, M. T. TOWNS, AND A. MARAR

that uncertainty does reduce the value of globally D. J. BERTSIMAS AND G. VAN RYZIN, “A Stochastic and
optimal solutions, bringing into question whether Dynamic Vehicle Routing Problem in the Euclidean
rigorously optimal solutions are useful in a dynamic Plane,” Opns. Res. 39, 601– 615 (1991).
D. J. BERTSIMAS, P. CHERVI, AND M. PETERSON, “Compu-
setting.
tational Approaches to Stochastic Vehicle Routing,”
Our research did not compare myopic and dy- Transp. Sci. 29, 342–352 (1995).
namic models, but rather looked only at models that R. CHEUNG AND W. B. POWELL, “An Algorithm for Multi-
consider only information that is known (or reason- stage Dynamic Networks with Random Arc Capacities,
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

ably well known, in the case of random travel times) with an Application to Dynamic Fleet Management,”
at a given point in time. This said, even a myopic Opns. Res. 44, 951–963 (1996).
model plans for decisions in the future, as long as it J. DESROSIERS, M. SOLOMON, AND F. SOUMIS, “Time Con-
is based on known information. In our case, we strained Routing and Scheduling,” in Handbook in Op-
erations Research and Management Science, C.
would consider the global optimization of all drivers
Monma, T. Magnanti, and M. Ball (eds.), Volume on
to all loads, including drivers that would not arrive Networks, 35–139, North Holland, Amsterdam, 1995.
until later in the day or the next day. Our dual M. DROR, G. LAPORTE, AND P. TRUDEAU, “Vehicle Routing
discount, then, implicitly was putting a weight of 1.0 with Stochastic Demands: Properties and Solution
on the decision we were about to make, and a re- Frameworks,” Transp. Sci. 23, 166 –176 (1989).
duced weight on decisions that we might make at a M. FISHER, “Vehicle Routing,” in Handbook in Operations
later point in time (although it might only be a few Research and Management Science, C. Monma, T.
seconds later). Magnanti and M. Ball (eds.), Volume on Networks,
1–33, North Holland, Amsterdam, 1995.
We are unaware of any other research that has
L. FRANTZESKAKIS AND W. POWELL, “A Successive Linear
raised the issue of user compliance, and measured Approximation Procedure for Stochastic Dynamic Ve-
its impact on the choice of algorithm. Even static hicle Allocation Problems,” Transp. Sci. 24, 40 –57
models are subject to user editing and modification (1990).
prior to implementation. Given such postoptimality R. GRAHAM, “Bounds for Certain Multiprocessor Anoma-
tampering, it is clear that the original solution is no lies,” Bell Syst. Tech. J. 45, 1563–1581 (1966).
longer optimal, because it assumes that all the de- D. GUSFIELD AND R. IRVING, The Stable Marriage Algo-
cisions in the solution will be implemented. We rithm: Structure and Algorithms, MIT Press, Boston,
MA, 1989.
would argue that such considerations should be
W. JORDAN AND M. A. TURNQUIST, “A Stochastic Dynamic
built into the design and analysis of algorithms for Network Model for Railroad Car Distribution,” Transp.
all classes of routing and scheduling problems. The Sci. 17, 123–145 (1983).
result may be problems that are actually easy to G. LAPORTE AND F. LOUVEAUX, “Formulations and Bounds
solve, producing both faster run times and more for the Stochastic Capacitated Vehicle Routing Prob-
robust solutions. lem with Uncertain Supplies,“ in Economic Decision-
Making: Games, Econometrics and Optimization, J.
Gabzewicz, J. Richard, and L. Wolsey (eds.), North
Holland, Amsterdam, 1990.
ACKNOWLEDGMENT
M. PINEDO, Scheduling: Theory, Algorithm, and System,
THIS RESEARCH WAS SUPPORTED in part by grant Prentice Hall, Englewood Cliffs, NJ, 1995.
AFOSR-F49620-93-1-0098 from the Air Force Office W. B. POWELL, “A Stochastic Model of the Dynamic Vehi-
cle Allocation Problem,” Transp. Sci. 20, 117–129
of Scientific Research. The authors would also like to
(1986).
gratefully acknowledge the careful reading by an
W. B. POWELL, “An Operational Planning Model for the
anonymous referee who helped to highlight impor- Dynamic Vehicle Allocation Problem with Uncertain
tant issues in this study. Demands,” Transp. Res. 21B, 217–232 (1987).
W. B. POWELL, “A Comparative Review of Alternative
Algorithms for the Dynamic Vehicle Allocation Prob-
REFERENCES lem,” in Vehicle Routing: Methods and Studies, B.
Golden and A. Assad (eds.), 249 –292, North Holland,
S. ALBERS, “Competitive Online Algorithms,” OPTIMA: New York, 1988.
Math. Prog. Soc. Newsl. 54, 1– 8 (1997). W. B. POWELL, “A Stochastic Formulation of the Dynamic
D. BERTSIMAS AND G. V. RYZIN, “A Stochastic and Dynamic Assignment Problem, with an Application to Truckload
Vehicle Routing Problem in the Euclidean Plane,” Motor Carriers,” Transp. Sci. 30, 195–219 (1996).
Opns. Res. 39, 601– 615 (1991). H. PSARAFTIS, “Dynamic Vehicle Routing Problems,” in
D. J. BERTSIMAS AND D. SIMCHI-LEVI, “A New Generation Vehicle Routing: Methods and Studies, B. Golden and
of Vehicle Routing Research: Robust Algorithms, Ad- A. Assad (eds.) 223–248, North Holland, Amsterdam,
dressing Uncertainty,” Opns. Res. 44, 286 –304 (1996). 1988.
MYOPIC SOLUTIONS FOR DYNAMIC ROUTING / 85
P. SCHRIJVER, Supporting Fleet Management by Mobile P. TRUDEAU AND M. DROR, “Stochastic Inventory Routing:
Communications, P. R. Schrijver, The Netherlands, Route Design with Stockouts and Route Failures,”
1993. Transp. Sci. 26, 171–184 (1992).
D. D. SLEATOR AND R. E. TARJAN, “Amortized Efficiency of W. WHITE, “Dynamic Transshipment Networks: An Algo-
List Update and Paging Rules,” Com. ACM 28, 202– rithm and Its Application to the Distribution of Empty
208 (1985). Containers,” Networks, 2, 211–236 (1972).
W. STEWART AND B. GOLDEN, “Stochastic Vehicle Routing:
A Comprehensive Approach,” Eur. J. Opns. Res. 14,
Downloaded from informs.org by [130.160.4.77] on 15 September 2016, at 11:34 . For personal use only, all rights reserved.

(Received: August 1997; revisions received: July 1998, September


371–385 (1983). 1998; accepted: October 1998)

You might also like