Professional Documents
Culture Documents
Operations Research
Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.
The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.
INFORMS is the largest professional society in the world for professionals in the fields of operations research, management
science, and analytics.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
OPERATIONS RESEARCH
Vol. 66, No. 3, May–June 2018, pp. 799–813
http://pubsonline.informs.org/journal/opre/ ISSN 0030-364X (print), ISSN 1526-5463 (online)
METHODS
Received: February 24, 2016 Abstract. We investigate the cost of the opportunity delayed by working on one project
Revised: May 23, 2016; March 24, 2017; with uncertain success rather than searching for a new project. We answer this ques-
August 3, 2017 tion: How long should a firm work on a research project with uncertain success before
Accepted: October 5, 2017 abandoning it if the only alternative is to search for a new project to work on? Rather
Published Online in Articles in Advance: than treating the opportunity as an exogenous alternative, this approach endogenizes the
May 23, 2018
opportunity value and the attendant cost of its delay. We consider cases with both a finite
Subject Classifications: dynamic programming: and an infinite number of potential projects. We derive the optimal stopping time and
application, Bayesian; decision analysis: show that it increases with the discount rate, with the rate of arrival of project failure,
sequential and with the prior probability that the project can be successfully completed. The optimal
Area of Review: Decision Analysis stopping time decreases with the rates of new project arrival and project completion.
https://doi.org/10.1287/opre.2017.1701
Keywords: project management • dynamic programming • uncertainty • Bayesian updating • optimal stopping • exponential Poisson bandit
interest in having the project continue and therefore statics on the abandonment time: for example, we find
may tend to paint an overly rosy picture. Typically, “the that the optimal stopping time is relatively insensitive
project manager knows more about the quality of the to changes in the prior probability of success, except
idea than the senior executives that do the screening” when that probability is quite large. On the other hand,
and “the project manager’s development effort is not the optimal stopping time is relatively sensitive to
entirely observable by senior executives.” (Chao et al. changes in the arrival rate of new projects, the arrival
2014, p. 1286). The decision maker often does not have rate of successful project completion, and the arrival
sufficient expertise to be able to verify the information rate of project dead ends.
in the progress reports. Thus, we feel it is important to Our contributions are twofold. Perhaps most impor-
derive a prescriptive model for the situation in which tant, because our model is parsimonious and our
there is no information about the project progress. This assumptions on the parameters of the model fairly gen-
is the focus of our paper. eral, we are able to give prescriptive advice in situa-
There is a large literature on whether to start and tions with relatively little information. For instance, if
when to stop investing time and effort on a research the probability of successful completion of a project is
project, as well as what the optimal path of implement- small (as it is in most research settings), then the rule
ing/investing in it is (see, e.g., the study by Childs for optimal stopping (see Equation (18)) requires know-
and Triantis 1999, which takes a contingent claims ap- ing only the rates at which successful completion and
proach, and the references therein). And there are con- failure normally arrive and the rate at which potential
sulting firms and managerial processes such as Stage- new projects are generated. Because we focus on pro-
Gate (Cooper and Edgett 2015) that assist firms in the cesses in which these rates can be assumed to be con-
task of dropping unsuccessful projects. These decisions stant, there is no interim information on the project’s
depend on the project characteristics (such as the prob- progress except for the Poisson arrival of success or
ability that the project can be successfully completed, failure. To the best of our knowledge, ours is the first
the rate at which success arrives if it is possible, the paper to provide this prescriptive advice absent infor-
fixed amount of time that must be spent before suc- mation about opportunity cost and some measure of
cess is possible, and the payoff if the project is success- the progress of the research project. In large part this is
fully completed) and on the opportunity cost (what because in our model, the opportunity cost is endoge-
the decision maker could earn instead of continuing nous. We also provide closed-form results for other
the project at hand). In that literature, the opportu- situations—ones in which, for instance, the number of
nity cost is mostly exogenous (often zero), and it seems available projects is limited or the decision maker is
that in many instances determining this opportunity risk averse. Our second major contribution is method-
cost would be quite challenging. The idea of our paper ological: we are able to solve for the value function and
is to make opportunity cost somewhat endogenous, optimal policies, again under fairly general conditions.
in the sense that if the current project is abandoned, Our closed-form solutions are more detailed than in
the only option is to search for similar projects. We earlier papers; explicit comparisons will be made as
study this problem through a variety of continuous- appropriate in the next section.
time models: single projects versus multiple projects The rest of this paper is organized as follows. We
versus an infinity of projects, single rewards versus begin in the next section with a brief literature review.
multiple rewards, and risk neutral versus risk averse. In Section 3, we develop a parsimonious model with an
One purpose of studying the various approaches is to exogenously specified opportunity value. We start by
explore the robustness of the results. assuming that there is only one available project and
Our focus is on processes in which time and effort are then extend to a finite number of projects, all of which
the major inputs, or where the flow rate of effort and could prove successful and beneficial. As long as there
expenditures while carrying out the research is nearly are projects remaining, the choice in each instance is
constant; the only information the firm has is that nei- between working on the project at hand or abandoning
ther success nor failure has arrived—until one or the it and searching for a new project. If the latter action
other does. Processes that fit this description include is chosen, we assume that new projects are generated
the early stage of pharmaceutical discovery or oil and following a Poisson process. We characterize the value
gas exploration, academic research, and new prod- function and the optimal policy. For instance, we show
uct development. More broadly, our model applies to that the optimal abandonment time is decreasing in the
any search activity that involves identifying potential number of projects remaining. In Section 4, we endo-
opportunities (e.g., prospective suppliers or business genize the opportunity value by considering the lim-
partners) and then closing the deal. We build a sim- iting case of an infinite number of available projects.
ple model of such processes and derive expressions We characterize the value function and determine the
for the optimal abandonment time that could be used optimal stationary policy. The special case in which
as automated decision rules. We provide comparative only one reward can be earned (though there are an
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS 801
infinite number of available projects) is considered in et al. 2005). In continuous time there could be intervals
Section 4.1. We show that the optimal abandonment for which both arms are pulled, and in the competi-
time is identical to the case in which an unlimited tive models, there are instances in which firms trade
number of rewards can be earned. Other variations of off pulling the risky arm.
the problem (for instance, allowing recall of previously The parallels of the two-armed bandit with our sim-
abandoned projects) and comparative statics are pro- plest model with only one research project, as detailed
vided in the remainder of Section 4. We conclude in in Section 3.1, are clear. The firm can either work on
Section 5. the research project (pull the risky arm), or choose
the alternative represented by the opportunity value
(pull the safe arm). The Bayesian evolution of beliefs
2. Literature Review
regarding whether or not the risky arm is good in an
Our paper is related to the vast literature on research
exponential/Poisson bandit problem (see, e.g., Keller
and development (R&D) and technology adoption,
et al. 2005) is identical to the evolution of beliefs in our
which in itself is related to the multiarmed bandit lit-
model, once adjusted for parameterization of the mod-
erature. First, we provide a brief review of the research
els. Thus, our setting with a single available project is
on multiarmed bandits, and then we restrict our atten-
a special case of the models considered in Keller et al.
tion to only a handful of the R&D papers most closely
(2005) and Keller and Rady (2015); beyond that, our
related to the current paper.
focus is on the choice between continuing to explore a
There is extensive literature on multiarmed ban-
risky arm and searching for a new arm. So our model
dits (see Berry and Fristedt 1985 for a primer on dis-
would not be a good representation of drug treatment
crete time and Mandelbaum 1987 for continuous time).
for a chronic disease as in Negoescu et al. (2017); rather,
In the simplest case, the bandit has two arms, one risky we see it as representative of the process of bringing
and one safe. The safe arm returns the same amount, S, new drugs to market.
each time it is pulled. The risky arm returns the amount Turning to the literature on R&D, McCardle (1985)
R or 0. There is a chance that the risky arm is no develops a dynamic-programming model of technol-
good—it will only produce 0. The firm starts with a ogy adoption: in each period a firm can either exper-
prior belief regarding the probability that the risky iment with (gather information about) a technology
arm is good (produces R with positive probability each of uncertain value, adopt the technology, or reject the
period according to some distribution). Once a value technology. As the firm gathers information about
of R is observed, the firm knows that the risky arm the possible success of the technology, it updates its
is good and will continue to pull that arm exclusively prior beliefs using Bayes rule. Eventually, the firm has
(assuming R is sufficiently larger than S). For example, enough information (its posterior distribution has low
in Negoescu et al. (2017), the arms represent drugs for enough variance) that it can make an adopt or reject
the treatment of a chronic disease, the safe one hav- decision. If the firm rejects the technology, it earns 0
ing a known efficacy and the risky one unknown. If and the problem comes to an end. The optimal pol-
the prior belief regarding the risky arm is above some icy is cone shaped: if the current belief is inside the
threshold, the firm (doctor or patient) pulls the risky cone, the firm continues to experiment, moving to
arm; otherwise, the safe arm is pulled. the right in the cone (positive results move the belief
In more general settings, there can be more than two upward, and negative results move the belief down-
arms (k-armed bandits, with k − 1 risky arms), and the ward); if the belief exits the cone on the upper side,
values generated by the risky arms can be random vari- the firm stops experimenting and adopts the technol-
ables, perhaps different across the arms. The essential ogy; and if the belief exits the cone on the lower side,
theorem establishes the existence of an index (Gittins the firm stops experimenting and quits. This work was
1979) for each arm. The index serves to rank order the extended to general distributions by Ulu and Smith
arms. If a risky arm is pulled and returns a value of R, (2009) and Smith and Ulu (2012) and to allow risk
the index increases; if it returns a value of 0, the index aversion by Smith and Ulu (2015). Rather than having
decreases. In the two-arm case, bang-bang solutions a two-dimensional state space (mean and precision),
are obtained: once the safe arm is pulled, it is pulled more general probability distributions are allowed.
exclusively; if the firm starts by pulling the risky arm, it In a set of clever papers, Alizamir et al. (2013, 2014)
continues to do so until the risky index falls below the model a related problem (diagnostic testing of patients
index of the safe arm, and then the safe arm is pulled in a queue) in which additional information is gath-
thereafter. In the multiarm case, at each step, the arm ered; that is, another test performed, or the patient is
with the largest index is pulled, and again, once the sent home or remanded for treatment. Their model has
safe arm is pulled, it is pulled exclusively. the added complication of congestion. Rather than a
The bandit problem has been extended in a myriad cone-shaped optimal policy, they establish the optimal-
of ways, most recently to competitive, continuous-time, ity of an M-shaped policy. The inverted cone of the “M”
multiarmed models (Bolton and Harris 1999, Keller is a result of the congestion.
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
802 Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS
Kornish and Keeney (2008) consider information such as time spent outfitting a laboratory or recruiting
acquisition in the context of designing a flu vaccine. subjects for a study—or, in the case of a Ph.D. stu-
This is a decision-making situation with a deadline and dent, the time spent searching the literature. Alterna-
with a finite number of alternatives among which to tively, ∆ can be thought of as the switching cost/time
choose. of transitioning from idea generation to idea execu-
The closest antecedent to our paper is the discrete- tion. No information, positive or negative, arrives dur-
time model of Lippman and McCardle (1991), which ing ∆. That is, if the project is undertaken, first, one
allows for repeated search: if the firm rejects one tech- has to work on it for an amount of time ∆; thereafter, if
nology, it returns to the pool of available technologies the project can be successfully completed, completion
and draws again. They establish generic results regard- arrives at rate λ s ; and if failure is preordained, it arrives
ing the monotonicity and convexity of the value func- at rate λ f . The project pays reward R > 0 if and when it
tion and characterize the optimal policy for probability is successfully completed. (Our results do not change
distributions satisfying certain Bayesian updating con- if the rewards have a distribution and R is interpreted
ditions. In the current paper, we are able to establish as an expected value.)
considerably tighter analytic results and more explicit There is an available nonproject activity yielding a
computational results by imposing slightly stronger deterministic cash flow that the decision maker could
conditions in a continuous-time model. Our model is earn if she works on that in lieu of working on the
sufficiently different in that our results are not an exten- project. Denote by π the net present value at time 0
sion of theirs. For example, while they show that an of the future cash flows from the nonproject activity;
optimal policy exists and is cone shaped, we are able to that is, if she chooses not to begin work on the project,
explicitly solve for and characterize the optimal aban- π is earned from the nonproject activity. By stationar-
donment time (see Equation (18) and the companion
ity, the net present value of future cash flows from the
Figure 3) knowing only the prior probability of suc-
nonproject activity if it is begun at some later time t
cessful project completion, the rates at which success
is πe −δt . Given the discount rate e −δ and the nonproject
and failure conditionally arrive, and the rate at which
activity value π, the opportunity cost of a one-period
a new project can be found.
delay of the nonproject activity is π(1 − e −δ ). For exam-
ple, if time is measured discretely and the nonproject
3. Setting the Stage: Exogenous activity yields an annuity that pays α per period, then
Opportunity Cost π is given by α/(1 − e −δ ), and the opportunity cost of a
We begin with a simple, infinite-horizon, continuous- one-period delay is π(1 − e −δ ) α.
time model with only one available project and an
assumption that the opportunity cost is exogenously 3.1. Abandoning the Project Once Started
specified. The current time period corresponds to t 0. Suppose the decision maker has started work on the
The instantaneous discount rate is e −δt for some fixed risky project. Because we assume that a project once
δ > 0. abandoned cannot be restarted (although we relax this
A risk-neutral decision maker has before her a risky assumption later in Section 4.2), the question before the
research project to which she could devote her time decision maker is when, if ever, to abandon work on the
and effort. Successful completion of the project, if pos- project and divert all her effort to the outside activity.
sible, arrives exponentially at rate λ s . The probability Let ∆ + t be the abandonment time. Note that because
that the project is completed by time t, given that it can there is no chance that the project can be completed
be successfully completed, is 1 − e −λ s t . However, there prior to ∆, it cannot be optimal to abandon the project
is a chance that any effort devoted to the project will prior to ∆ if it was beneficial to start. The expected
be fruitless—the project could be impossible or infeasi- value at time 0 of the current project to the researcher
ble to complete. For instance, the researcher might not if the plan is to abandon at ∆ + t is given by
have the necessary experience, she might discover a
counterexample to a primary theorem, or she might be ∫ t ∫ t
beaten to the solution by a competitor. If the project is v(t) p (R + π)e −δ(∆+z) λ s e −λ s z dz + (1 − p) πe −δ(∆+z)
0 0
undertaken and at some point is found to be fruitless,
we say that failure has arrived. Failure, if the project · λ f e −λ f z dz + (pe −λ s t + (1 − p)e −λ f t )πe −δ(∆+t)
is not possible to complete, arrives exponentially at p(R + π) λ s (1 − e −(λ s +δ)t )
rate λ f . (We will not formally develop an equilibrium λs + δ
λf
model of competition.) e −δ∆ −(λ f +δ)t . (1)
There is a minimum time ∆ that must be spent work- + (1 − p)π (1 − e )
λf +δ
ing on the project before completion is even possible.
+ (pe −λ s t + (1 − p)e −λ f t )πe −δt
This is meant to capture the fixed costs of the project,
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS 803
The first term on the right-hand side of the first line of To find the optimal abandonment time t ∗ , differenti-
(1) represents the expected value if the project is suc- ate v(t) in (1) with respect to t yielding
cessfully completed before time ∆ + t; the prior prob- dv(t)
ability of successful completion is p; and the return ∝ p(R + π)λ s e −(λ s +δ)t + (1 − p)πλ f e −(λ f +δ)t
dt
on completion is the reward to the project R plus
− π[(1 − p)(λ f + δ)e −(λ f +δ)t + p(λ s + δ)e −(λ s +δ)t )]
the value of the outside activity π, appropriately dis-
counted for the arrival time before ∆ + t. Note that ∝ p(R + π)λ s e −(λ s −λ f )t + (1 − p)πλ f
once the researcher successfully completes the project − π[(1 − p)(λ f + δ) + p(λ s + δ)e −(λ s −λ f )t ]
and earns R, she immediately turns her attention and
pRλ s e −(λ s −λ f )t − πδ(1 − p + pe −(λ s −λ f )t )
efforts to the outside activity and earns the present
value π. δ
∝ p(t)R − π . (3)
The second term on the right-hand side of the first λs
line of (1) represents the expected value if the project When pRλ s > πδ (which we assume henceforth),
fails before ∆ + t. Again, once the project fails (in which attempting to set the derivative in (3) equal to 0
case the researcher earns 0), she immediately turns her and solving yields a finite solution for the optimal
attention and efforts to the outside activity. abandonment time when λ s > λ f and yields infinity
The term in the second line of (1) represents the otherwise:
expected payoff if the project is neither successfully 1
1−p πδ
if λ s > λ f ,
completed nor ended by failure by time ∆ + t, and it is − ln
λs − λ f p Rλ s − πδ
then abandoned. This happens with prior probability t∗ (4)
(pe −λ s t + (1 − p)e −λ f t ). The researcher then turns to the
∞
otherwise.
nonproject activity and earns π. For example, suppose
a firm has a product it can produce and sell, earning If λ s < λ f , the posterior probability of success p(t)
π in net present value; or the firm could attempt to is increasing in t, so it is never optimal to stop once
started. Note that (4) equates the posterior probabil-
develop an improved product that would earn R + π in
ity of successfully completing the project and receiving
net present value, if the improvement can be success-
the associated reward R with the cost of the delay in
fully completed. How long, if at all, should the firm
receiving the sure π.
pursue the product improvement? And what is the
expected value of pursuing the improvement project? Lemma 2. For λ s > λ f , the optimal abandonment time,
∆ + t ∗ , decreases with the opportunity value π and the dis-
Lemma 1. The expected value of the project v(t) is bounded count rate δ, and it increases with the prior probability of
above by R + π. It is increasing in the prior p, the reward R, successful completion p, the reward from successful com-
and the opportunity value π; it is decreasing in the fixed pletion R, and the arrival of failure rate λ f . For λ f 0,
cost ∆ and the discount rate δ. the optimal abandonment time is increasing (decreasing) for
Proof. The firm earns R + π if there are no fixed costs,
small (large) values of λ s .
∆ 0; no time discounting, δ 1; and if the research is Proof. The assertions regarding π, δ, p, R, and λ f fol-
successful. Differentiating v(t) with respect to ∆ and δ low on taking the appropriate derivatives of t ∗
establishes that v(t) is decreasing in both. Hence, R + π −(1/(λ s − λ f )) ln[((1 − p)/p)(πδ/(Rλ s − πδ))].
is an upper bound. Differentiating v(t) with respect To establish the result for λ s , set λ f 0 and denote
to p and R completes the proof. p(t, λ s ) pe −λ s t /(1 − p(1 − e −λ s t )). The optimal stopping
time t ∗ is given by equation λ s p(t ∗ , λ s ) πδ/R. There-
Let p(t) represent the posterior probability at time fore,
∆ + t that the project can be successfully completed dt ∗ (∂/∂λ s )(λ s p(t, λ s ))
given that it has been worked on for ∆ + t without − .
dλ s (∂/∂t)(λ s p(t, λ s ))
success. Then p(t) is given by
As (∂/∂t)(λ s p(t, λ s )) < 0, dt ∗ /dλ s has the same sign as
pe −λ s t pe −(λ s −λ f )t (∂/∂λ s )(λ s p(t, λ s )). Observe that
p(t) . (2)
pe −λ s t + (1 − p)e ∂ ∂ pe −λ s t
−λ f t
pe −(λ s −λ f )t
+ (1 − p)
(λ s p(t, λ s )) λs
∂λ s ∂λ s 1 − p(1 − e −λ s t )
From (2) we see that the probability that the project −tλ s
e
can be successfully completed given that neither suc- p (pe −tλ s − tλ s − p + ptλ s + 1).
cess nor failure has arrived by t after ∆ is increasing (pe s − p + 1)2
−tλ
(decreasing) in t if the rate of successful arrival is less Therefore, (∂/∂λ s )(λ s p(t ∗ , λ s )) has the same sign as
than (greater than) the rate of failure arrival: λ s < λ f pe −t λ s − t ∗ λ s − p + pt ∗ λ s + 1 pe −t λ s + (1 − p)(1 − λ s t ∗ ),
∗ ∗
(λ s > λ f ). The posterior probability at time t that the which is positive (negative) if (λ s t ∗ − 1)e λ s t < (>)
∗
Alternatively, we can characterize the optimal deci- successfully completed, failed, or abandoned, the firm
sion by considering the posterior probability p(t). searches for a new project as long as there are projects
Because p(t) is monotonically decreasing for λ s > λ f , remaining.
p(t ∗ ) πδ/Rλ s represents the threshold probability at Let π k for k 6 n be the value of searching for a new
which the firm should abandon the project. As the for- project when there are k projects remaining that have
mula for p(t) does not involve R, π, or δ, it is much not been investigated and have not been either success-
easier to establish several of the assertions of Lemma 2: fully completed, failed, or abandoned. Without loss of
the firm should abandon the project when the poste- generality, fix π0 ; π0 represents the same value as π
rior probability of success falls below p(t ∗ ), and p(t ∗ ) is in the single project case, so, as in the previous sec-
increasing in π and δ and decreasing in R. tion, any small positive value will do. Let v k (t) be the
It is worth noting that because our model is rel- expected value when the firm has begun investigation
atively simple, controlled as it is by two exponen- of the kth remaining project, and it plans to abandon
tial/Poisson processes, the Bellman equation has an the project if success is not achieved or failure has not
analytically exact solution with a clear interpretation: arrived by time ∆ + t. We now set up an interleaved
continue researching a project until the instantaneous set of recursions: one for v k (t), suitably optimized, and
expected benefit, p(t)λ s R, falls below the discounted one for π k .
opportunity value, πδ. Analytic solutions generally do Begin by letting v1 (t) v(t) as given in (1), substi-
not obtain in more complicated models; there numeri- tuting π0 for π. Denote the optimal abandonment time
cal methods are used to approximate the dynamic pro- by t1∗ . Let v1 v1 (t1∗ ) be the optimal value of working on
gramming value function and the optimal policy. the last remaining project. Now let π1 be the value of
The only remaining question is whether or not the searching for the last remaining project:
researcher should begin work on the project, and the
answer is obvious: if v(t ∗ ) > π, start work; otherwise, ∫ ∞
λ0
do not. Observe that for ∆ 0 and λ s > λ f this reduces π1 λ 0 e −λ0 x e −δx v1 dx v .
λ0 + δ 1
to the condition pRλ s > πδ. 0
π k−1 alone; that is, without t k∗ , in k. The value function π k is monotonically increasing in k,
it is increasing in the prior p and the reward R, and it is
λf
p(R + π ) λ s + (1 − p)π decreasing in the discount rate δ and the fixed cost ∆.
k−1
λs + δ k−1
λ f + δ
Proof. A bound on the value function is given by
π k−1 δ(λ s − λ f )
pR(λ s /δ) + π0 . To establish that π k is increasing in k,
+(1 − p) (λ f + δ)(λ s + δ)
vk e −δ∆ . (7)
note that the firm could immediately abandon the kth
π k−1 δ
(λ f +δ)/(λ s −λ f ) project and earn π k−1 ; behaving optimally must earn
· 1−p
more than that. The comparative statics with regard to
p Rλ − π δ
p, R, δ, and ∆ are established through a straightforward
s k−1
Finally, let π k be the value of searching for the kth-to- recursion following Lemma 1.
last remaining project:
As in the single-project case, the remaining question
∞
λ0 is whether or not to start research. The condition is the
∫
πk λ0 e −λ0 x e −δx v k dx
vk same: if the value of the research process is greater than
0 λ 0+δ
λf the return to the outside option, v k (t ∗ ) > π k−1 , start the
p(R + π ) λ s +(1− p)π
research program. Also, as in the single-firm case, this
k−1
λs + δ k−1
λf +δ
condition reduces to pRλ s > π0 δ. The proof follows a
λ 0 −δ∆ π k−1 δ(λ s − λ f )
recursion argument.
+(1− p)
e ,
λ0 + δ
(λ f + δ)(λ s + δ)
Corollary 1. The optimal abandonment time t k∗ is decreas-
(λ f +δ)/(λ s −λ f )
π k−1 δ ing in k.
1− p
·
p Rλ s − π k−1 δ
Proof. The claim follows immediately from Proposi-
(8) tion 1 and Lemma 2.
which gives π k as a function of π k−1 . Figure 1 dis- Read another way, Corollary 1 says that the fewer
plays π k as a function of k for various values of the the number of projects remaining to be investigated,
prior probability of success p with λ s 1, λ f 0, λ0 2, the more time the firm should spend on each one prior
δ 0.1, ∆ 0, R 1, and π0 0. From the figure we see to abandoning it. The fewer the number of projects
that π k appears to be bounded, concave, and increas- remaining, the lower the opportunity value foregone
ing in k; it also appears to be increasing in p. The next by working on the project at hand, and the longer the
proposition establishes these observations and several firm should continue to work. Figure 2 displays t k∗ as a
others. function of k for different values of p with other param-
eters as in Figure 1.
Proposition 1. At each project stage, the optimization is
The foregoing material assumed that the arrival rate
well defined. The value function π k is bounded uniformly
of success is greater than the arrival rate of failure,
λ s > λ f . If the opposite is true, λ s < λ f , then failure
Figure 1. (Color online) Graph of π k as a Function of k for arrives more quickly than success, and in this case, no
Different Values of p news is good news: the fact that neither failure nor
6
p = 0.9 p = 0.7 p = 0.5
p = 0.3 p = 0.1 Figure 2. (Color online) Graph of t k∗ as a Function of k for
5 Different Values of p
3.5
4 p = 0.9 p = 0.7 p = 0.5
3.0
p = 0.3 p = 0.1
πk 3 2.5
2.0
t*k
2 1.5
1.0
1
0.5
0 0
0 5 10 15 20 25 30 35 40 0 5 10 15 20 25 30 35 40
k k
Note. Other parameters are λ s 1, λ f 0, λ0 2, δ 0.1, ∆ 0, R 1, Note. Other parameters are λ s 1, λ f 0, λ 0 2, δ 0.1, ∆ 0, R 1,
and π0 0. and π0 0.
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
806 Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS
success has arrived by time t increases the probability Alternatively, rather than solving the fixed point
that the project will be a success; that is, p(t) is increas- equation for π (i.e., π not as a function of t ∗ ), one could
ing in t, and the optimal abandonment time t k∗ ∞. solve a fixed point for t ∗ . A fair bit of algebra (see the
With an infinite abandonment time, the value function appendix) leads to the following:
becomes
e −(λ f +δ)t
∗ ∗
e (λ s −λ f )t λs + δ
λs λf
+
vk e −δ∆
p(R + π k−1 ) + (1 − p)π k−1 . λs − λ f (λ f + δ) δ(1 − p)(λ s − λ f )
λs + δ λf + δ λ + δ δ∆
λf
· 0 e − p − (1 − p) . (13)
The firm’s task is considerably easier: once work begins λ0 λf + δ
on a project, it continues until either success or fail- Proposition 2. The value function π∞ is well defined; it is
ure arrives. The manager never needs to intervene and the unique fixed point of (12), it is increasing in the prior p
cancel a project that is going nowhere. and the reward R, and it is decreasing in the discount rate δ
and the fixed cost ∆.
4. Endogenizing the Opportunity Value: An Proof. Proposition 1 establishes that π k is uniformly
Infinite Number of Potential Projects bounded in k and is monotonically increasing in k.
For the case with an infinite number of potential pro- Hence, by the monotone convergence theorem, π∞
jects, each of which if successful earns R, formally let limk→∞ π k is also bounded. The comparative statics are
k → ∞ in Equations (5) through (8). We need to confirm proved as before.
the legitimacy of that formality. In this case v∞ (t) is the
value of starting work on a project when there are an Corollary 2. If λ s > λ f , the optimal abandonment time t ∗ is
infinite number of projects, and the plan is to abandon increasing in p, δ, ∆, and λ f ; it is decreasing in λ 0 and λ s ;
the current project at time ∆ + t and then to behave and it is unaffected by changes in R.
optimally thereafter: Proof. The left-hand side of (13), which defines the
optimal abandonment time, is increasing in t ∗ . Straight-
p(R + π∞ ) λ s (1 − e −(λ s +δ)t ) forward differentiation shows that the right-hand side
λs + δ
of (13) is increasing in p and ∆, decreasing in λ 0 , and
λf
v∞ (t) e −δ∆ −(λ f +δ)t . (9) unaffected by changes in R, establishing the corollary
+ (1 − p)π (1 − e )
∞
λf + δ for those parameters.
+ (pe −λ s t + (1 − p)e −λ f t )π∞ e −δt
To establish the claim regarding λ s , define
δe (λ s −λ f )t + (λ s − λ f )(δ/(λ f + δ))e −(λ f +δ)t
Differentiating (9) with respect to the abandonment F(λ s , t)
time t and setting it equal to 0 yields the optimal aban- λs + δ
λ 0 + δ δ∆ λf
∗
donment time t∞ : 1
− e − p − (1 − p) .
1−p λ0 λf + δ
π∞ δ
1 1− p
if λ s > λ f ,
− ln
Then Equation (13) is equivalent to F(λ s , t ∗ ) 0, and
λs − λ f p Rλ s − π∞ δ
∗
t∞ (10)
dt ∗ ∂F/∂λ s
∞ otherwise. − .
dλ s ∂F/∂t ∗
Finally, π∞ is the value of searching for the next project:
The denominator is positive: ∂F/∂t ∗ ((λ s − λ f )δ/
∞
λ0 (λ s + δ))(e (λ s −λ f )t − e −(λ f +δ)t ) > 0. It remains to show that
∫
π∞ λ 0 e −λ0 x e −δx v∞ (t∞
∗
) dx v (t ∗ ). (11)
0 λ0 + δ ∞ ∞ the numerator is also positive:
If we let v(t) v∞ (t), t ∗ t∞
∗
, and π π∞ , then (δ/(λ f + δ))e −(λ f +δ)t + δte (λ s −λ f )t
∂F/∂λ s
(9) and (10) with an infinite number of projects re- λs + δ
duce to (1) and (4) with an exogenous opportunity δe (λ s −λ f )t
+ (λ s − λ f )(δ/(λ f + δ))e −(λ f +δ)t
value π, with the added condition that π satisfy a −
fixed point equation derived by substituting in (11) and (λ s + δ)2
λs + δ λ s − λ f −(λ +δ)t
simplifying:
δ −δ e f + δ(λ s + δ)
λf + δ λf + δ
λf
p(R + π) λ s + (1 − p)π
1
λs + δ λ f + δ · te (λ s −λ f )t
− δe (λ s −λ f )t
·
(λ s + δ)2
λ 0 −δ∆ πδ(λ s − λ f )
+ δe −δt + δ(λ s + δ)te λ s t − δe λ s t −λ f t
π e (1 − p) . (12)
λ0 + δ (λ f + δ)(λ s + δ) e
(λ f +δ)/(λ s −λ f ) (λ s + δ)2
πδ
δ
· 1−p e (λ s −λ f )t (e −t(λ s +δ) + t(λ s + δ) − 1) > 0.
p Rλ s − πδ
(λ s + δ)2
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS 807
To establish the claim regarding λ f , similarly define First observe that as p → 1, t ∗ (p) → t ∗ (1) ∞: as the
probability that the project can be successfully com-
λs + δ pleted approaches 1, the firm should spend more and
F(λ f , t) (λ f + δ)e (λ s −λ f )t + (λ s − λ f )e −(λ f +δ)t −
δ(1 − p) more time on it. If the firm knows with certainty that
λ 0 + δ δ∆
the project can be successfully completed (in which
· e − p (λ f + δ) − (1 − p)λ f case, it cannot fail), then the firm should work on
λ0
it until completion—assuming, of course, that it was
λs + δ
(λ f + δ)e (λ s −λ f )t + (λ s − λ f )e −(λ f +δ)t − worth starting work on it in the first place.
δ(1 − p) Now consider the behavior of t ∗ (p) as p goes to 0.
λ 0 + δ δ∆ Formally define t ∗ (0) limp→0 t ∗ (p). (Of course, if the
· e − 1 (λ f + δ) + (1 − p)δ
λ0 prior probability of successful completion is 0, the firm
(λ f + δ)e (λ s −λ f )t + (λ s − λ f )e −(λ f +δ)t should not start work on the project, as pRλ s needs to
be strictly greater than πδ.) Because t ∗ (0) solves (14),
λ + δ λ 0 + δ δ∆
− s e − 1 (λ f + δ) − (λ s + δ). t ∗ (0) > 0. We interpret t ∗ (0) as the limiting abandon-
δ(1 − p) λ0 ment time as p gets very small.
Then Equation (13) is equivalent to F(λ f , t ∗ ) 0, and Lemma 3. The optimal abandonment time satisfies t ∗ (0) <
t ∗ (p) < t ∗ (0)/(1 − p).
dt ∗ ∂F/∂λ f Proof. Corollary 2 establishes that t ∗ (p) is increasing
− .
dλ f ∂F/∂t ∗ in p, so t ∗ (0) < t ∗ (p).
Let
The denominator is positive: ∂F/∂t ∗ > 0. It remains to
λs + δ λ 0 + δ δ∆
show that the numerator is negative. See appendix for g(p) e −1 ,
δ(1 − p)(λ s − λ f ) λ0
details.
For proof of the claim that t ∗ is increasing in δ, see the right-hand side of (14). Observe that g(p)
the appendix. g(0)/(1 − p). The left-hand side of (14) is convex and
Note that by Corollary 2, with an infinite number increasing in t ∗ , and it is equal to 0 when t ∗ 0. There-
fore, g(p)/t ∗ (p) > g(0)/t ∗ (0), and
of potential projects, the optimal abandonment time
is monotonic in λ s . Contrast this with the nonmono- g(p) ∗
tonic behavior in the single-project case as established t ∗ (p) < t (0) t ∗ (0)/(1 − p).
g(0)
in Lemma 2. This difference arises because the oppor-
tunity cost in the single-project case does not depend Lemma 3 bounds the variation in t ∗ (p) for p small.
on λ s , while with an infinite number of projects, the For example, if p < 0.2, t ∗ (p) < 1.25t ∗ (0). In this sense,
opportunity cost increases with λ s . the optimal abandonment time is relatively insensitive
to changes in the prior probability of successful com-
4.1. Asymptotic Behavior of the Optimal Stopping pletion for small values of that probability.
Time as a Function of the Prior Probability of Lemma 3 can be extended to show that for two prior
Successful Completion probabilities p 1 and p2 with p 1 < p2 , the optimal aban-
Corollary 2 establishes the direction of the relationship donment times are such that t ∗ (p1 ) < t ∗ (p2 ) < t ∗ (p 1 ) ·
between changes in the prior probability of successful (1 − p1 )/(1 − p2 ).
completion p and the optimal abandonment time t ∗ .
4.2. Allowing Recall
In this section we address the relative magnitude of the
Thus far we have assumed that once a project that
changes in t ∗ as a function of p. The optimal abandon-
had neither been successfully completed nor failed was
ment time t ∗ solves (13), which can be rewritten as (for
abandoned; it could not be worked on again. For the
the steps in the process, see the proof in the appendix
case with an infinite number of probabilistically iden-
that t ∗ is increasing in δ):
tical potential projects, we can relax that assumption
and allow recall of previously abandoned projects. As
e (λ s −λ f )t∞ − 1 e −(λ f +δ)t∞ − 1
∗ ∗
λs + δ
+ is true in infinite-horizon models of search in gen-
λs − λ f (λ f + δ) δ(1 − p)(λ s − λ f ) eral, though, recalling a previously abandoned project
λ + δ δ∆
is never optimal—one can always do better in expec-
· 0 e − 1 . (14)
λ0 tation by drawing an unresearched project. A similar
result obtains in multiarmed bandits: once an arm is
Denote the solution to (13) (or equivalently, (14)) as set aside, it is never pulled again because the Gittins
a function of p, holding all of the other parameters index of the untried arm is greater than the index of
(δ, ∆, λ s , λ f , and λ 0 ) fixed, as t ∗ (p). the rejected arm.
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
808 Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS
In the finite case, however, these results fail to obtain. same, t11∗ t1∗ . However, for k > 1, k finite, Equations (5)
That is, there may be circumstances under which and (15) are different, and thus the equations for the
recalling a previously rejected item is optimal. Con- optimal stopping times are different. For instance, for
sider, for example, if there are only 10 projects to k 2,
be researched. Once the researcher has tested all 10
π11 δ
1 1−p
projects—successfully completing some, having some t21∗ − ln
fail, and abandoning the rest—it may be optimal to λs − λ f p (R + π0 )λ s − π11 (λ s + δ)
π1 δ
recall a previously abandoned project. Allowing recall 1 1−p
<− ln t2∗ .
generally makes the problem intractable. A variety λs − λ f p Rλ s − π1 δ
of numerical approaches have been suggested for the
bandit problem with recall (see Jun 2004 for a sur- That is, if a decision maker has just begun the second-
vey), which, as we have noted, is closely related to the to-last possible project, she will continue working on
dynamic programming approach we use. Because our that project longer if she is able to earn rewards from
interest is in analytic solutions, we do not consider the both of the remaining projects than if she can earn a
finite case with recall. reward from only one of the remaining two. It is worth
spending a little longer on the project at hand because
4.3. Only a Single Reward Can Be Earned its reward can be banked and added to the reward from
The model developed above and in Section 3 allows the last remaining project.
the firm to earn the reward R for as many projects If there are an infinite number of potential projects
as are successfully completed. Alternatively, one can but only a single reward can be earned, Equation (9)
consider the case wherein only one reward can be becomes
Corollary 3 follows immediately from Corollary 2 Figure 3. (Color online) Graph of t ∗ as a Function of p for
and the equivalence of the optimal abandonment time Different Values of λ0
solution (13) for both the one and many rewards cases. 6
The model with one reward also allows us to gain 0 = 0.25
insights into the behavior of a risk-averse searcher. For 5 0 = 0.5
0 = 1
simplicity, assume that the payoff from the non-project
4 0 = 2
activity π0 0—that is, completing the project in infi-
0 = 5
nite time and abandoning it immediately yield the t* 3
same payoff. Then the only uncertainty is the comple-
tion time. If the project is completed at time τ, the pay- 2
off is e −δτ R. Utility for wealth u w (e −δτ R) can be thought
of as utility for completion time u τ (τ) u w (e −δτ R). If 1
for i 1, . . . , m. A project of type i occurs with prob- and δ 0.1). For example, if it is optimal to stop the
ability q i . Once a project has been generated, the firm project with R 1 1 at t1∗ 1, then one should stop the
knows its type. The recursions build as in Section 2.2; project with R 2 5 at t2∗ 2.87, while the project with
we deal only with the limiting (k ∞) case here. If the R 2 0.5 should not be started at all.
firm has a project of type i at hand, the firm solves Observe that with multiple project types, it is possi-
(1) with the appropriate values of (p i , λ si , λ f i , ∆i , R i ). ble for search and development to be optimal (i.e., the
Denote this value function as v i (t), and denote the opti- firm should start the process) but that some individual
mal abandonment time as t i∗ . Then the fixed point equa- projects should be dropped without being explored.
tion for the expected value of searching for a project is This could not occur with a single project type, as in
that case either all (ex ante identical) projects should be
λ0 X m
π q v (t ∗ ), (19) explored or none of them should.
λ 0 + δ i1 i i i
where, for i 1, . . . , m,
5. Conclusion
πδ
1 1 − pi We developed a model of time and effort allocation
t i∗ − ln . (20)
λ si − λ f i p i R i λ si − πδ to the creative process, a process rife with uncertainty
regarding the chance of success of any individual
For example, university promotion committees often project, the length of time one might wait for a new
assign point values to journals, explicitly or implic- project to appear, and the amount of time required
itly: there are A journals and B journals. An academic
to bring a potentially successful project to completion.
should spend a longer amount of time before abandon-
Whenever a researcher is working on a project, she
ing a project for an A journal than on a project for a B
must consider alternative uses of her time and effort:
journal, other things being equal.
What is the opportunity cost of continuation? If the
In the case of a single project type, one needs to esti-
only alternative facing the researcher is to search out
mate λ 0 in order to derive the optimal stopping time.
For multiple types of projects, the stopping times given and begin devoting effort to some new project, ex ante
by (20) must be consistent. For example, with m 2, identical to the one she is currently working on, then
if the decision maker believes that projects of the first the opportunity cost becomes endogenous. Our focus
type have to be stopped at t1∗ , then she can derive the here is the timing decision: when to give up on the
corresponding π, which in turn will determine t2∗ , thus project at hand in order to search for a new project. Our
providing an extra consistency check. Figure 4 plots focus on the timing of the decision to quit yields several
the optimal stopping time t2∗ for different values of R 2 insights. For a large range of the probability of suc-
as a function of t1∗ , the optimal stopping time for a cessfully completing a project, the optimal solution is
project with R 1 1, giving a sense of the consistency relatively insensitive to this probability (see Section 4.1
that would be required (with the rest of the parameters and Lemma 3). That is, for 0 < p < 0.5, the optimal
at λ s1 λ s2 1, λ f 1 λ f 2 0, p1 p2 0.5, ∆1 ∆2 0, stopping time t ∗ is relatively flat in p. Most research
projects of the nature modeled here (e.g., pharmaceu-
tical discovery) do not have very high initial probabili-
Figure 4. (Color online) Graph of t2∗ as a Function of t1∗ for
Different Values of R2
ties of success, in which case precise estimation of p is
not important. Given that such probabilities are often
10 R2 = 10.0 difficult to assess, this result is important for prescrip-
9 R2 = 5.0
tive purposes. On the other hand, the optimal stopping
R2 = 2.0
8
R2 = 1.0 time t ∗ is sensitive to the arrival rate of new projects
7 R2 = 0.5 (when the firm is searching), the arrival rate of suc-
6 R2 = 0.2 cessful completion for a project that can be successfully
R2 = 0.1
t*2 5 completed, and the arrival rate of failure for projects
4 that are doomed to fail.
3 Given the empirical evidence of organizational iner-
2
tia and personal procrastination, which lead to con-
tinuing projects long after they should have been
1
abandoned, one practical application of our results
0
0 1 2 3 4 5 6 7 8 9 10 would be for setting an alarm clock: if a project has
t1* not been successfully completed by the time its alarm
Note. Other parameters are λ s1 λ s2 1, λ f 1 λ f 2 0, p1 p 2 0.5, goes off, the project should be terminated, and efforts
∆1 ∆2 0, δ 0.1, and R1 1. should be devoted to searching for something new.
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS 811
λs + δ π1 δ
1 1−p
(1 − e −(λ f +δ)t ) < t(λ s + δ), t − 1∗
ln
λf + δ λs − λ f p (R + π0 )λ s − π1 (λ s + δ)
1 1 − p (λ s −λ f )t 1∗ 1 (δ + λ s )
or, equivalently, 1 − e −(λ f +δ)t < t(λ f + δ), which holds for ⇔ (R + π0 ) e π δ + π1 .
λs p λs
t > 0.
Substitute for (R + π0 ) in (A.1):
Corollary 2: t ∗ Is Increasing in δ
p 1
1−p (λ s −λ f )t 1∗ 1
To finish the proof of Corollary 2, Equation (13), which solves e π δ+π 1
(δ+λ s +π
) 1
λs + δ λ 0 + δ δ∆
λf
Divide through by π1 and rearrange terms to yield
e − 1 + 1 − p − (1 − p)
δ(1 − p)(λ s − λ f ) λ0 λf + δ e (λ s −λ f )t
1∗
e −(λ f +δ)t
1∗
λs + δ λ 0 + δ δ∆ +
δ
λs − λ f (λ f + δ)
e − 1 + (1 − p)
δ(1 − p)(λ s − λ f ) λ0 λf + δ λs + δ λ0 + δ δ∆
λf
λs + δ λs + δ λ0 + δ δ∆
e − p − (1 − p) ,
+ e −1 δ(1 − p)(λ s − λ f ) λ0 λf + δ
(λ s − λ f )(λ f + δ) δ(1 − p)(λ s − λ f ) λ0
which matches (13), the equation for the optimal stopping
λs + δ λ0 + δ δ∆
1 1
+ + e −1 , time in the case of many rewards, exactly.
λ s − λ f (λ f + δ) δ(1 − p)(λ s − λ f ) λ0
Derivation of Equation (18)
from which it follows that
Multiply both sides of (13) by δ:
∗ ∗
e (λ s −λ f )t∞ − 1 e −(λ f +δ)t∞ − 1 λs + δ λ 0 + δ δ∆
∗ ∗
+ e −1 . δe (λ s −λ f )t δe −(λ f +δ)t
λs − λ f (λ f + δ) δ(1 − p)(λ s − λ f ) λ0 +
λs − λ f (λ f + δ)
λs + δ λ0 + δ δ∆ λf
∗
The left-hand side is increasing in t∞ and decreasing in δ: e − p − (1 − p) .
(1 − p)(λ s − λ f ) λ0 λf + δ
∂
1 ∗
(e −(λ f +δ)t∞ − 1) Apply a Taylor series expansion, and keep the terms that are
∂δ (λ f + δ)
linear in δ:
∗
1 ∗ t∞ ∗
− (e −(λ f +δ)t∞ − 1) − e −(λ f +δ)t∞ ∗
δe (λ s −λ f )t δe −λ f t
∗
λ s +δ δ
(λ f + δ) 2 (λ f + δ) + 1+ (1+δ∆)−p
1 λ s −λ f λf (1−p)(λ s −λ f ) λ0
∗
(1 − e −(λ f +δ)t∞ − (λ f + δ)t∞
∗
) < 0.
δ
(λ f + δ)2 −(1−p) 1− ⇔
λf
At the same time, the right-hand side is increasing in δ: ∗ ∗
λs
δe (λ s −λ f )t δe −λ f t δ δ
+ +δ∆+(1−p) ⇔
∂ 1 λ0 + δ δ∆
λ s −λ f λf (1−p)(λ s −λ f ) λ0 λf
e −1 ∗ ∗
∂δ δ λ0 e (λ s −λ f )t e −λ f t λs
1 1−p
+ +∆+ ⇔
1 λ s −λ f λf (1−p)(λ s −λ f ) λ 0 λf
2 (λ0 − λ0 e ∆δ + ∆δ2 e ∆δ + ∆δλ0 e ∆δ )
δ λ0 λ s −λ f −λ t ∗ λs λ
∗ 1
e (λ s −λ f )t + e f +∆ + s .
∝ λ0 e −∆δ − λ0 + ∆δ 2 + ∆δλ0 λ 0 (e −∆δ − 1 + ∆δ) + ∆δ2 > 0. λf (1−p) λ0 λf
∗
Therefore, t∞ is increasing in δ.
References
Establishing That t 1∗ t∞
∗
Alizamir S, de Vericourt F, Sun P (2013) Diagnostic accuracy under
The fixed point equation for π1 similar to (12) is congestion. Management Sci. 59(1):157–171.
Alizamir S, de Vericourt F, Sun P (2014) Dismissing decision tasks:
λs λf
The optimality of the M-shaped structure. Working paper,
p(R+π0 ) +(1−p)π 1
λ s +δ λ f +δ
Fuqua School of Business, Duke University.
Berry DA, Fristedt B (1985) Bandit Problems: Sequential Allocation of
λ π 1
δ(λ −λ )
0 s f Experiments (Chapman & Hall, London).
+(1−p)
π
1
e −δ∆ .
λ 0 +δ
(λ f +δ)(λ s +δ)
Bolton P, Harris C (1999) Strategic experimentation. Econometrica
(λ f +δ)/(λ s −λ f ) 67(2):349–374.
1−p
· π 1
δ
Chao RO, Lichtendahl KC Jr, Grushka-Cockayne Y (2014) Incen-
p (R+π )λ −π 1 (λ +δ) tives in a Stage-Gate process. Production Oper. Management 23(8):
0 s s
(A.1) 1286–1298.
McCardle, Tsetlin, and Winkler: When to Abandon a Research Project and Search for a New One
Operations Research, 2018, vol. 66, no. 3, pp. 799–813, © 2018 INFORMS 813
Childs PD, Triantis AJ (1999) Dynamic R&D investment policies. O’Riordan M (2012) Merck abandons oral vernakalant. Medscape
Management Sci. 45(10):1359–1377. (March 19), https://www.medscape.com/viewarticle/760527.
Cooper RG Edgett SJ (2015) Stage-Gate. Accessed July 2015, http:// Ryan R, Lippman SA (2005) Optimal exit from a deteriorating project
www.prod-dev.com/stage-gate.php. with noisy returns. Probab. Engrg. Inform. Sci. 19(3):327–343.
Elfenbein DW, Knott AM (2015) Time to exit: Rational, behav- Smith JE, Ulu C (2012) Technology adoption with uncertain future
ioral, and organizational delays. Strategic Management J. 36(7): costs and quality. Oper. Res. 60(2):262–274.
957–975. Smith JE, Ulu C (2017) Risk aversion, information acquisition, and
Gittins JC (1979) Bandit processes and dynamic allocation indices. technology adoption. Oper. Res. 65(4):1011–1028.
J. Roy. Statist. Soc. Ser. B 41(2):148–177. Staw BM, Ross J (1987) Knowing when to pull the plug. Harvard Bus.
Horn JT, Lovallo DP, Viguerie SP (2006) Learning to let go: Making Rev. 65(2):68–74.
better exit decisions. McKinsey Quart. 2006(2):64–75. Ulu C, Smith JE (2009) Uncertainty, information acquisition, and
Jun T (2004) A survey on the bandit problem with switching costs. technology adoption. Oper. Res. 57(3):740–752.
De Economist 152(4):513–541.
Keller G, Rady S (2015) Breakdowns. Theoret. Econom. 10(1):175–202. Kevin F. McCardle is a professor in the Decisions, Oper-
Keller G, Rady S, Cripps M (2005) Strategic experimentation with
exponential bandits. Econometrica 73(1):39–68. ations, and Technology Management Area at the UCLA
Kornish LJ, Keeney RL (2008) Repeated commit-or-defer decisions Anderson School of Management. His research interests lie
with a deadline: The influenza vaccine composition. Oper. Res. in decision analysis (broadly defined); he particularly focuses
56(3):527–541. on decisions that are made sequentially or competitively.
Lippman SA, McCardle KF (1991) Uncertain search: A model of Ilia Tsetlin is a professor of decision sciences at INSEAD.
search among technologies of uncertain values. Management Sci. His teaching and research interests are in prescriptive deci-
37(11):1474–1490.
Lorenzetti L (2015) World’s first malaria vaccine, from Glaxo-
sion making emerging from normative analysis. Two recent
SmithKline, wins approval from EU. Fortune (July 24), http:// research focuses are generic properties of preferences (multi-
fortune.com/2015/07/24/worlds-first-malaria-vaccine-wins attribute utility, stochastic dominance) and search, deadlines,
-recommendation/. and the role of uncertainty.
Mandelbaum A (1987) Continuous multi-armed bandits and multi- Robert L. Winkler is James B. Duke Professor at the Fuqua
parameter processes. Ann. Probab. 15(4):1527–1556. School of Business and the Department of Statistical Sci-
McCardle KF (1985) Information acquisition and the adoption of new
ence at Duke University. His research interests include deci-
technology. Management Sci. 31(11):1372–1389.
Negoescu DM, Bimpikis K, Brandeau ML, Iancu DA (2017) Dynamic sion analysis, Bayesian statistics, forecasting, and risk analy-
learning of patient response types: An application to treating sis. Recent work involves probability forecasting, combining
chronic disease. Management Sci., ePub ahead of print August 21, probabilities, stochastic dominance, sequential decision mak-
https://doi.org/10.1287/mnsc.2017.2793. ing, and multiattribute utility.