8 (Data Analytics in Operations Management A Review

This article was downloaded by: [183.222.235.
92] On: 05 March 2023, At: 01:12

Publisher: Institute for Operations Research and the Management Sciences (INFORMS)
INFORMS is located in Maryland, USA
Manufacturing & Service Operations Management

Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org
Data Analytics in Operations Management: A Review

Velibor V. Mišić, Georgia Perakis
To cite this article:

Velibor V. Mišić, Georgia Perakis (2020) Data Analytics in Operations Management: A Review. Manufacturing & Service
Operations Management 22(1):158-169. https://doi.org/10.1287/msom.2019.0805
Full terms and conditions of use: https://pubsonline.informs.org/Publications/Librarians-Portal/PubsOnLine-Terms-and-

Conditions
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.
The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.
Copyright © 2019, INFORMS
Please scroll down for article—it is on subsequent pages
With 12,500 members from nearly 90 countries, INFORMS is the largest international association of operations research (O.R.)
and analytics professionals and students. INFORMS provides unique networking and learning opportunities for individual
professionals, and organizations of all types and sizes, to better understand and use O.R. and analytics tools and methods to
transform strategic visions and achieve better outcomes.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
MANUFACTURING & SERVICE OPERATIONS MANAGEMENT
Vol. 22, No. 1, January–February 2020, pp. 158–169
http://pubsonline.informs.org/journal/msom ISSN 1523-4614 (print), ISSN 1526-5498 (online)
20th Anniversary Invited Article
Data Analytics in Operations Management: A Review

Downloaded from informs.org by [183.222.235.92] on 05 March 2023, at 01:12 . For personal use only, all rights reserved.
Velibor V. Mišić,a Georgia Perakisb

a
Anderson School of Management, University of California, Los Angeles, Los Angeles, California 90095; b Sloan School of Management,
Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Contact: velibor.misic@anderson.ucla.edu, https://orcid.org/0000-0002-8952-5617 (VVM); georgiap@mit.edu,
https://orcid.org/0000-0002-0888-9030 (GP)
Received: March 28, 2019 Abstract. Research in operations management has traditionally focused on models for
Revised: March 29, 2019 understanding, mostly at a strategic level, how firms should operate. Spurred by the
Accepted: March 29, 2019 growing availability of data and recent advances in machine learning and optimization
Published Online in Articles in Advance: methodologies, there has been an increasing application of data analytics to problems in
September 19, 2019 operations management. In this paper, we review recent applications of data analytics to
https://doi.org/10.1287/msom.2019.0805 operations management in three major areas—supply chain management, revenue man-
agement, and healthcare operations—and highlight some exciting directions for the future.
Copyright: © 2019 INFORMS
History: This paper has been accepted for the Manufacturing & Service Operations Management 20th
Anniversary Special Issue.
Funding: G. Perakis received financial support from the National Science Foundation [Grant CMMI-
1563343].
Keywords: data analytics • machine learning • operations management
1. Introduction optimization can be applied to and the scale at which

Historically, research in operations management (OM) it can be applied. In both machine learning and opti-
has focused on models. These models, based on micro- mization, researchers have benefited from the avail-
economic theory, game theory, optimization, and sto- ability of high-quality software for estimating machine-
chastic models, have been mostly used to generate learning models and for solving large-scale linear, conic,
strategic insights about how firms should operate. One of and mixed-integer optimization problems.
the reasons for the prevalence of this model-based ap- These advances have led to the development of the
proach in the past has been the relative scarcity of data, burgeoning field of data analytics (or analytics for
coupled with limitations in computing power. short). The field of analytics can most concisely be
Today, there has been a shift in research in OM. This described as using data to create models leading to
shift has been driven primarily by the increasing avail- decisions that create value. In this paper, in honor of the
ability of data. In various domains—such as retail, 20th anniversary of Manufacturing & Services Opera-
healthcare, and many more—richer data are becom- tions Management, we highlight recent work that
ing available that are more voluminous and more applies the analytics approach to problems in OM.
granular than ever before. At the same time, the in- We divide our review along three major application
creasing abundance of data has been accompanied by areas: supply chain management (Section 2), where
methodological advances in a number of fields. The we cover location, omnichannel, and inventory de-
field of machine learning, which exists at the in- cisions; revenue management (Section 3), where we
tersection of computer science and statistics, has cre- cover choice modeling and assortment optimization,
ated methods that allow one to obtain high-quality pricing and promotion planning, and personalized rev-
predictive models for high-dimensional data, such as enue management; and healthcare (Section 4), where we
L1 regularized regression [also known as least absolute cover applications at the policy, hospital, and patient
shrinkage and selection operator (LASSO) regression; levels. We conclude in Section 5 with a discussion
Tibshirani (1996)] and random forests (Breiman 2001). of some future directions—such as causal inference,
The field of optimization has similarly advanced. Nu- interpretability, and “small-data” methods—that we
merous scientific innovations in optimization modeling, believe will be increasingly important in the future of
such as robust optimization (Bertsimas et al. 2011), analytics in OM.
inverse optimization (Ahuja and Orlin 2001), and Because of the page limitations imposed by this
improved integer-optimization formulation techniques Anniversary Issue, our review is necessarily brief and
(Vielma 2015), have extended both the scope of what only covers a small set of representative examples in
158
Mišić and Perakis: Data Analytics in Operations Management: A Review
Manufacturing & Service Operations Management, 2020, vol. 22, no. 1, pp. 158–169, © 2019 INFORMS 159
each application area. As a result, there are many clairvoyance, leading to reductions in outbound ship-
other excellent papers that apply analytics to OM that ping costs on the order of 1%. In the same spirit,
we were unable to include in this review; we apolo- Avrahami et al. (2014) discuss their collaboration with
gize in advance to those authors whose papers we the distribution organization within the Yedioth
have not cited. Group in Israel to improve how Yedioth distributes
print magazines and newspapers. This paper uses
2. Analytics in Supply Chain Management real-time information on the newspaper sales at the
Many major problems in supply chain management different retail outlets to enable pooling of inventory
are being reexamined under the lens of analytics; in in the distribution network. The methods developed
this section, we focus on location and omnichannel were implemented at Yedioth and generated sub-
decisions (Section 2.1), as well as inventory decisions stantial cost savings as a result of both a reduction in
(Section 2.2). magazine-production levels and a reduction in return
levels.
2.1. Location and Omnichannel Operations In a different domain, He et al. (2017) consider the
In this subsection, we discuss the application of an- problem of how to design service regions for an electric-
alytics to omnichannel and, more generally, location vehicle-sharing service. A number of car-sharing firms,
problems. The term omnichannel refers to the inte- such as Car2Go, DriveNow, and Autolib, offer electric
gration of an e-commerce channel and a network of vehicles to customers. Providing such a service in an
brick-and-mortar stores. This integration allows a urban area requires the firm to decide the regions in
retailer to do cross-channel fulfillment—that is, fulfill which the service will be offered—that is, in which
online orders in any store location, leading to in- regions of the urban area are customers allowed to
teresting location and inventory-management prob- pick up and drop off a vehicle; this, in turn, informs
lems. Glaeser et al. (2018) consider a location problem major investment decisions (how large the fleet needs
faced by a real “buy online, pickup in store” retailer, to be and where charging stations should be located).
which fulfills online orders through delivery trucks The main challenge in this setting comes from the fact
parked at easily accessible locations (e.g., at schools or that there is uncertainty in customer adoption, which
parking lots). The retailer needs to decide at which depends on which regions are covered by the service.
locations and times to position its trucks in order to To solve this problem, the paper formulates an integer
maximize profit. To solve this problem, the paper programming problem where customer adoption is
first builds a random forest model (Breiman 2001) to represented through a utility model; because of the
predict demand at a given location at a given time, limited data from which such a utility model can be
using a diverse set of independent variables such as calibrated, the paper uses distributionally robust op-
demographic attributes of the location (total pop- timization (see, e.g., Delage and Ye 2010) to account
ulation, population with a postsecondary degree, for the uncertainty in customer adoption. Using data
median income, and so on), the retailer’s operational from Car2Go, the paper applies the approach to de-
attributes (such as whether the retailer offers home signing the service region in San Diego and shows
delivery in this location), and other location attributes how the approach leads to higher revenues than
(e.g., number of nearby competing businesses). Then several simpler, managerially motivated heuristic ap-
the paper uses fixed-effects regression to account for proaches to service region design.
cannibalization effects. Using the retailer’s data, the
paper shows how a heuristic approach based on greedy 2.2. Inventory Management
construction and interchange ideas together with the Ban and Rudin (2019) consider a data-driven ap-
combined random forest and fixed-effects model, leads proach to inventory management. The context that
to improvements in revenue of up to 36%. the paper studies is when one has observations of
Acimovic and Graves (2014) study how to manage demand together with features that may be predictive
the omnichannel operations of a large online retailer. of demand, such as weather forecasts or economic
They address the fundamental questions of what is indicators like the consumer price index. To make
the best way to fulfill each customer’s order in order inventory decisions in this setting, one might consider
to minimize average outbound shipping costs. The building a demand distribution that is feature de-
paper develops a heuristic that makes fulfillment pendent and then finding the optimal order quantity
decisions by minimizing the immediate outbound for the distribution corresponding to a given reali-
shipping cost, plus an estimate of future expected zation of the features. Instead, the paper by Ban and
outbound shipping costs. These estimates are derived Rudin (2019) proposes two alternate approaches. The
from the dual values of a transportation linear program. first, based on empirical risk minimization, involves
Through experiments on industry data, the model finding the order quantity by solving a single prob-
captures 36% of the opportunity gap assuming lem, where the decision variables is the decision rule
160 Manufacturing & Service Operations Management, 2020, vol. 22, no. 1, pp. 158–169, © 2019 INFORMS
that maps the features to an order quantity, and the to close the cost gap between the naive approach (which
objective is to minimize a sample-based estimate of does not consider contextual data) and the perfect
the cost. The second approach involves using kernel foresight approach (which knows demands before they
regression to model the conditional demand distri- are realized) by 88%.
bution and applying a sorting algorithm to determine Ban et al. (2018) consider the problem of dynamic
the optimal order quantity. The paper derives bounds procurement. In this problem, one has to decide how
on the out-of-sample performance of both approaches much of a product to order over a finite time horizon
and applies the approaches to the problem of nurse from different sources that have different lead times
staffing in a hospital emergency room using data from and different costs so as to meet uncertain demand
a large teaching hospital in the United Kingdom. The over the horizon. The problem setting studied in the
paper finds that the proposed approaches outperform paper is motivated by a practical problem faced by
the best-practice benchmark by up to 24% in out-of- Zara, one of the world’s largest fast-fashion retailers.
sample cost. The paper proposes an approach that involves using
In a similar vein, Bertsimas and Kallus (2019) de- linear and regularized linear regression to predict
velop a general framework for solving a collection of the demand for a given product using the historical
sample-average approximation (SAA) problems, where trajectories of other products. Then the paper for-
each SAA problem has some contextual information. mulates a multistage stochastic programming prob-
For example, in an inventory setting, a firm might lem where the scenario tree is generated by using the
be selling different products, which are described by previously estimated predictive model. Using real data
vectors of attributes, for which the firm has historical from Zara, the paper shows how the existing approach,
observations of demand; the firm would then like to which ignores covariate information, can lead to costs
determine an order quantity for a (potentially new) that are 6%–15% higher than the proposed approach.
product. A naive approach to this problem would be to At the core of many inventory problems—and OM
simply pool all the data together and ignore the con- problems in general—lies the problem of demand
textual information. Instead, Bertsimas and Kallus estimation. The results of this estimation can then be
(2019) propose building a machine learning model used, among others, to address pricing, distribution,
first to predict the uncertain quantity as a function of and inventory-management questions. Baardman et al.
contextual information; in our inventory example, (2018a) develop a data-driven approach for predict-
this would amount to predicting demand as a func- ing demand for new products. The paper develops
tion of the product attributes. Then, instead of solving a machine learning algorithm that identifies compa-
the SAA problem using all the data on the uncertain rable products in order to predict demand for the
parameter, one uses the machine-learning model new product. The paper demonstrates both analytically
to obtain a context-specific conditional distribution and empirically, through a collaboration with Johnson
and then solves the SAA problem with respect to & Johnson, the benefits of this approach.
this conditional distribution. In the inventory setting,
consider a regression tree (Breiman et al. 1984) for
3. Analytics in Revenue Management
predicting demand for a given product attribute
Analytics is having a major impact on how firms make
vector. If one runs the product’s attribute vector down
decisions about how to sell their products to customers;
the regression tree, one will obtain a point prediction,
in this section, we discuss applications in choice mod-
but by considering the historical demand observa-
eling and assortment optimization (Section 3.1), pricing
tions contained in the corresponding leaf, one also
and promotion planning (Section 3.2), and person-
obtains an estimate of the conditional distribution.
alized revenue management (Section 3.3).
The paper develops theoretical guarantees for
this general procedure for specific types of machine-
learning models, proving asymptotic optimality of the 3.1. Choice Modeling and Assortment Optimization
procedure as the number of observations grows. The Assortment optimization refers to the problem of de-
paper also applies the approach to a real distribution ciding which products to offer to a set of customers so
problem for a media company that needs to manage as to maximize the revenue earned when customers
inventories of different products (specifically, movie make purchase decisions from the offered products.
CDs, DVDs, and Blu-ray discs) across different retail A major complicating factor in assortment optimi-
locations. The paper builds machine-learning models of zation is the presence of substitution behavior, where
demand as a function of the store location, properties of customers may arrive with a particular product in
the movie (such as its genre, rating on Rotten Tomatoes, mind but purchase a different product if that first
box office revenue, and so on), and other large-scale product is not available.
information (such as localized Google search queries for The paper by Farias et al. (2013) proposed a non-
the movie). The paper shows how the approach is able parametric model for representing customer choice
behavior by modeling the customer population as a designer apparel items (called styles) at significantly
probability distribution over rankings of the prod- discounted prices in weekly limited-time sales (called
ucts. Such a model is able to represent any choice events). The paper develops a strategy for optimally
model based on random utility maximization, which pricing styles in an event that combines machine
includes the majority of popular choice models (e.g., learning and optimization. The machine-learning
the multinomial logit [MNL] model, the nested logit component of the approach involves developing a
model, and the latent-class MNL model). The paper bagged regression tree model (Breiman 1996) for
shows, using data from a major automobile manu- predicting the demand of a style based on the price of
facturer, how the proposed approach can provide more the style and the average price of other styles in the
accurate predictions than traditional parametric models event. The optimization component involves formulat-
such as the MNL model. In later work, a number of ing an integer optimization problem where the goal is to
papers have considered how to efficiently find assort- find the prices of the styles that maximize the total
ments that maximize expected revenue under such a revenue from all styles, which is just the product of
model. From a tractability standpoint, Aouad et al. the price of the style and the demand predicted by the
(2018) showed that the assortment problem is NP- bagged regression tree model. This optimization prob-
hard even to approximate (APX-hard) and developed lem is, in general, very difficult. However, if one fixes the
an approximation algorithm; Feldman and Topaloglu average price of all styles to a given value, then the
(2019) later provided an approximation algorithm problem simplifies, and one can solve a small number of
with an improved guarantee. For specific classes of smaller integer-optimization problems to find the opti-
ranking-based models, Aouad et al. (2015) developed mal collection of prices. The paper reports on a live
efficient approaches based on dynamic programming pilot experiment to test the methodology at Rue La La,
for solving the assortment-optimization problem ex- which resulted in an increase in revenues of about 10%.
actly. In a different direction, the paper by Bertsimas Another example where analytics has made a dif-
and Mišić (2019) proposed a new mixed-integer op- ference in the field of pricing relates to promotion
timization formulation for the problem as well as planning. Cohen et al. (2017a, b), as well as Cohen-
a specialized solution approach, based on Benders Hillel et al. (2019a, b), consider how analytics applies
decomposition, for solving large-scale instances ef- to price-promotion planning first for a single item
ficiently. Using a product line design problem based and, subsequently, for multiple items. Price-promotion
on a real conjoint analysis data set, the paper shows planning is concerned with when and how deeply to
how a large-scale product line design problem that promote each item over a time horizon. The problem is
required 1 week of computation time to solve in earlier challenging because sales of an item in a given time
work is readily solved by the proposed approach in period are affected not only by the price of that item in
10 minutes. that period but also by what the prices of that item were
In addition to the ranking-based model, the paper before then and what the prices of other items are. For
by Blanchet et al. (2016) recently introduced a Markov example, a promotion on a coffee stock-keeping unit
chain–based choice model. In this model, substitution (SKU) today may result in a large increase in sales if it
from one product to another is modeled as a state has not been recently promoted but may only result in a
transition in a Markov chain. The paper shows that modest change if the coffee SKU has been promoted
under some reasonable assumptions, the choice prob- heavily in the preceding weeks (because customers
abilities computed by the Markov chain–based model may, for example, have stockpiled the item during
are a good approximation of the true choice proba- earlier promotion periods). The paper by Cohen et al.
bilities for any random utility-based choice model (2017b) considers the single-item problem and in-
and are exact if the underlying model is a generalized troduces a general mixed-integer nonlinear optimi-
attraction model, of which the MNL model is a special zation formulation of the problem, where one first
case. In addition to the theoretical bounds, the au- learns a demand model from available data and then
thors conduct numerical experiments and observe uses that demand model in the objective of the op-
that the average maximum relative error of the choice timization model. The objective in this body of work
probabilities of their model with respect to the true corresponds to profit, and the constraints reflect com-
probabilities for any offer set is less than 3%, where mon business rules imposed by retailers in practice
the average is taken over different offer sets. (e.g., limits on the number of successive promotions
over any fixed window). Although the optimization
3.2. Pricing and Promotion Planning problem is in general theoretically intractable, Cohen
An exciting example of how analytics is being ap- et al. (2017b) show how an approximate formulation,
plied in pricing is the paper by Ferreira et al. (2015), based on linearizing the nonlinear objective, leads
which studies the pricing of Rue La La’s weekly to an efficiently solvable mixed-integer linear prob-
sales. Rue La La is an online fashion retailer that sells lem. The paper shows how the suboptimality gap
of the promotion-price schedule derived from the interactions in demand and supply, the paper pro-
approximation can be bounded for different classes of poses two pricing policies that are based on the
demand functions. This approach was also later ex- idea of partitions to the store inventory, which ap-
tended to multi-item price-promotion planning prob- proximate how it will be used by the two different
lem in Cohen et al. (2017a). channels. These two policies are, respectively, based
More recently, Cohen-Hillel et al. (2019a, b) have on solving either a deterministic or a robust mixed-
considered a demand model that only considers a integer optimization problem that incorporates sev-
limited set of pricing features; in particular, one con- eral business rules as constraints. Through an analysis
siders the price of the item in the current time period of a pilot implementation at a large U.S. retailer, the
and the price in the previous period, as well as the paper estimates that the methodology leads to a 13.7%
minimum price within a bounded set of past periods. increase in clearance-period revenue.
These papers consider the pricing problem under this
model, which is referred to as the bounded memory peak- 3.3. Personalized Revenue Management
end demand model. These papers show how one can Another major area of research is in personalized
exploit the structure arising from these price fea- revenue management. Retailers have become increas-
tures to introduce a compact dynamic programming ingly interested in personalizing their products and
formulation of the pricing problem, allowing the services, including promotions. Personalization refers
problem to be efficiently solved. In the multi-item to the ability to tailor decisions to the level of indi-
version of the problem, the papers show that the viduals. In revenue management, this entails offering
problem is NP-hard in the strong sense but establish different prices and changing the product assortment,
a polynomial-time approximation scheme. Further- as well as other decisions such as promotions and
more, Cohen-Hillel et al. (2019b) show that even if product bundling.
the true underlying demand model does not follow Many papers have recently considered the problem
the bounded memory peak-end demand model but of personalized pricing. Chen et al. (2015) consider a
rather follows some of the more traditional models setting in which a firm sells a single product to a
in the literature, the bounded memory peak-end customer population, where each customer chooses to
demand model still performs very well; the paper buy the product according to a logit model that de-
provides a theoretical bound for this result, which is pends on customer covariates and the price, and the
tested with data. Using data from a grocery retailer, firm has access to historical transaction records, where
these papers show how the optimization approaches each record contains the covariates of the customer, the
introduced can lead to profit increases of between 3% price at which the product was offered, and a binary
and 10%. indicator of whether the customer bought the prod-
The above-mentioned papers deal with price pro- uct or not. The paper proposes a simple approach to
motions, which constitute one type of promotion. personalized pricing: estimate the parameters of the
Retailers have access to many other methods of pro- logistic regression model using regularized maximum-
motion (termed promotion vehicles), such as flyers, likelihood estimation and then set the price for future
coupons, and announcements on social media web- customers so as to maximize the predicted expected
sites (e.g., Facebook). The paper by Baardman et al. revenue (price multiplied by purchase probability)
(2018c) considers how one should schedule such pro- under the estimated parameters. The paper bounds
motion vehicles so as to maximize profit. The paper the suboptimality gap between the expected revenue
shows that the problem is APX-hard and subsequently of this policy and that of an oracle pricing policy
introduces a nonlinear integer optimization formula- that knows the customer choice model perfectly. Using
tion for scheduling promotion vehicles under various airline sales data, the paper shows how the proposed
business constraints (such as limiting the promotion method provides an improvement of over 3% in rev-
vehicles to use each time period, among others). The enue over a pricing policy that offers all customers the
paper considers a greedy approach as well as a linear same price.
mixed-integer optimization approximation that is Ban and Keskin (2017) develop an algorithm for
more accurate than the greedy method. Using real dynamic pricing of a single product, where arriving
data, the paper illustrates the quality of the two ap- customers are described by a feature vector, and the
proximation methods as well as the potential profit demand in each period is given by a model that de-
improvement, which is on the order of 2%–9%. pends on the price and only a subset of the features;
In the omnichannel retail setting, consumers can the demand model allows for feature-dependent price
compare store prices with online prices, giving rise sensitivity. In this setting, the paper proves a lower
to challenging pricing problems. Harsha et al. (2019) bound on the regret of any dynamic pricing algorithm
study the price-optimization aspect of omnichannel in terms of the number of relevant features and de-
operations. Because of the presence of cross-channel velops algorithms with near-optimal regret. The paper
demonstrates, using data from an online automobile the market responds to the product (buy/no buy
loan company, how the proposed algorithms accrue decisions), the firm can refine a polyhedron repre-
more revenue than the company’s current practice, senting the uncertainty in the parameters of the
as well as other recently proposed dynamic-pricing market’s valuation model. The paper proposes a
algorithms. policy based on approximating this polyhedron with
Personalization of product offerings has also led an ellipsoid and setting prices in a way that balances
to the need to develop new personalized demand the revenue gained against the reduction in un-
models. Unfortunately, in practice (and especially in certainty and show that this policy achieves a worst-
the retail space), the data available are not necessarily case regret that scales favorably in the number of
at the level of granularity needed to develop mean- features and the time horizon.
ingful demand models at the personalized level. There has also been much interest in personalized
A potential path to developing personalized demand assortment planning. In addition to personalized
models is to use data on customers from social media, pricing, Chen et al. (2015) develop similar theoretical
which can directly inform a retailer on how one cus- guarantees as discussed earlier for the problem of
tomer’s purchases affect the purchases of a different assortment personalization based on a finite sample
customer. However, for most retailers, acquiring of data. Golrezaei et al. (2014) consider dynamic-
these data is costly and poses challenges in terms of assortment personalization when there is a limited
privacy concerns. Baardman et al. (2018b) focus on inventory of products. The difficulty in the problem
the problem of detecting customer relationships from arises from the limited inventory of products—given
transactional data and using them to offer targeted a customer now, we may choose to offer him or her a
price promotions to the right customers at the right particular product as part of an assortment, but if the
time. The paper develops a novel demand model that customer chooses to purchase that product, the in-
distinguishes between a base purchase probability, ventory of that product will be depleted, and we may
capturing factors such as price and seasonality, and run out of that product for later customers. The paper
a customer-trend probability, capturing customer-to-
proposes a heuristic, called inventory balancing, that
customer trend effects. The estimation procedure is
attains significantly improved revenues over simpler
based on regularized bounded-variables least squares
benchmarks. At the same time, there is also a growing
combined with the method of instrumental variables,
literature that studies assortment personalization
a commonly used method in econometrics/causal
in a dynamic setting, where one must learn the un-
inference (Angrist and Pischke 2008). The resulting
derlying choice model while making assortment de-
customer trend estimates subsequently feed into a
cisions. For example, Bernstein et al. (2019) consider a
dynamic promotion targeting optimization problem,
bandit approach to assortment personalization based
formulated as a nonlinear mixed-integer optimiza-
tion problem. Although the problem is NP-hard, the on dynamic clustering. This approach assumes that
paper also proposes an adaptive greedy algorithm there exists a clustering of customer profiles (vectors of
and proves that the customer-to-customer trend es- attributes); however, the number of clusters and the
timates are statistically consistent, but also that the clustering (mapping of profiles to clusters) are un-
adaptive greedy algorithm is provably good. Using known. The approach involves modeling the distri-
actual fashion data from a client of Oracle, the paper bution over clusterings by using the Dirichlet process
shows that the demand model reduces the prediction mixture model, which is updated in a Bayesian
error by 11% on average, and the corresponding fashion with each transaction.
optimal policy increases profits by 3%–11%. Personalization has also been applied to other types
Besides these papers, there has been other recent of revenue-management decisions. Ettl et al. (2019)
research in the area of dynamic pricing with learning. consider the problem of making personalized price
Javanmard and Nazerzadeh (2019) consider a single- and product-bundle recommendations over a finite
product setting where, in each period, there is only time horizon to individual customers so as to maxi-
one customer that purchases the product if the price is mize revenue with limited inventory. The paper de-
below that customer’s valuation and propose a reg- velops a number of approximation algorithms related
ularized maximum-likelihood policy that achieves to the inventory-balancing heuristic (Golrezaei et al.
near-optimal expected regret. Cohen et al. (2016) 2014) and Lagrangian relaxation methods for weakly
consider a firm selling a sequence of arriving prod- coupled stochastic dynamic programs (Hawkins
ucts, and each product is described by a feature 2003). Using two different data sets, one in airline
vector. Each product is sold if its price is lower than ticket sales and one from an online retailer, the paper
the market value of the product, which is assumed to shows how the proposed approach can improve
be linear in the features of the product. As more and revenues over pricing and bundle strategies used in
more products arrive and the firm observes how practice by 2%–7%.
4. Analytics in Healthcare Operations Clinical research studies often establish that a treat-
Machine-learning and big-data methodologies are ment has some sort of aggregate benefit in a large
beginning to also have a significant impact in health- population. However, there may be significant het-
care operations. We divide our discussion in this part of erogeneity in the treatment effect across different
the paper to problems at the policy level (Section 4.1), subgroups of that population. For example, an indi-
the hospital level (Section 4.2), and the patient level vidual who is very sick may benefit less from a treat-
(Section 4.3). ment than a less sick individual. A natural question,
then, is how should one allocate the intervention to

4.1. Policy-Level Problems individuals in a population to maximize the aggregate
A number of papers have considered the application benefit, subject to a limit on how many individuals may
of analytics to healthcare policy problems. The paper be treated (i.e., it is not possible to simply treat ev-
by Aswani et al. (2019) studies the Medicare Shared eryone) and a lack of knowledge of individual-level
Savings Program (MSSP). The MSSP was introduced treatment effects? The paper formulates this problem
by Medicare to combat rising healthcare costs: Medi- as a type of robust optimization problem where the
care providers that enroll in MSSP and that reduce their uncertainty set represents all heterogeneous treatment
costs below a certain financial benchmark receive bonus effects that are consistent with the aggregate charac-
payments from Medicare. However, few Medicare pro- teristics of the research study. The paper uses data on
viders have been able to successfully reduce their costs a case management intervention for reducing emer-
in order to receive the bonus payments because of the gency department utilization by adult Medicaid patients
investment required. The paper by Aswani et al. (2019) and demonstrates settings under which the proposed
models the interaction between Medicare and each robust optimization method provides a benefit over
provider as a principal-agent problem, where Medi- scoring rules typically used by medical practitioners.
care sets the financial benchmark for each provider, In a different direction, the paper by Bertsimas et al.
and each provider decides what investment to make (2016) considers the design of combination chemo-
to reduce costs. The authors use data from Medi- therapy clinical trials for gastric cancer. The authors
care, as well as an estimation methodology based of the paper constructed a large database of gastric
on inverse optimization (Ahuja and Orlin 2001), to cancer clinical trials and used this database to build
learn the parameters of their principal-agent model. two predictive models: one model to predict the
Using this estimated model, they propose an alter- median overall survival of patients in a trial based on
nate contract where the payment depends on both the characteristics of the trial patients and the dosages
reducing cost below a financial benchmark and the of different drugs and another model to predict
amount invested and show that this alternate contract whether the trial will exhibit an unacceptably high
can increase Medicare savings by up to 40%. fraction of patients with severe toxicities. Using these
Bertsimas et al. (2013) consider the design of a predictive models, the paper then formulates a mixed-
dynamic allocation policy for deceased donor kidneys integer optimization model for deciding the next com-
to waiting-list patients. In the design of an allocation bination of drugs to test for a given cohort of patients so
policy, a key objective is efficiency—that is, the benefit as to maximize the predicted median overall survival
garnered from allocating (transplanting) a kidney to a subject to limits on the predicted toxicity. The approach is
patient. However, another objective is fairness—that evaluated by using a method based on simulation and
is, making sure that patients with certain character- one based on matching a proposed clinical trial with the
istics are not systematically underserved by the al- one most similar to it in the data; both evaluation
location policy. The paper proposes formulating the methods suggest that the proposed approach can im-
problem as a linear optimization problem where prove the overall efficacy of regimens that are chosen
the objective is to maximize efficiency (the aggre- for testing in phase III clinical trials.
gate match quality) subject to a fairness constraint and
uses linear regression to derive a scoring rule from 4.2. Hospital-Level Problems
the solution of this linear optimization problem to At the hospital level, the paper by Rath et al. (2017)
predict the fairness-adjusted quality of a match using studies the problem of staffing and scheduling sur-
attributes of both the patient and the donor kidney. geries at the Ronald Reagan Medical Center (RRMC)
The paper shows that compared with the existing at the University of California, Los Angeles, one of the
allocation policy at the time, the proposed policy largest medical centers in the United States. Patients
can satisfy the same fairness criteria while resulting who undergo surgery at the RRMC require three dif-
in an 8% increase in aggregate quality-adjusted life ferent types of resources: an anesthesiologist, a sur-
years. geon, and an operating room. Prior to this research,
Gupta et al. (2017) consider the problem of tar- anesthesiologists at RRMC would manually schedule
geting individuals for a treatment or intervention. surgeries, resulting in high operating costs. The paper
presents a methodology, based on a two-stage robust rolling-average method that is commonly used in
mixed-integer optimization problem, that accounts for hospitals today.
the uncertainty in surgery duration. The proposed
method was implemented and is currently in use, 4.3. Patient-Level Problems
saving RRMC approximately $2.2 million per year Finally, the increasing availability of electronic health
through reduced anesthesiologist overtime. The follow- record data has led to the application of analytics
up paper by Rath and Rajaram (2018) focuses spe- methods to patient-level problems that are of a medical
cifically on staff planning at the RRMC (deciding how nature. For example, Bastani and Bayati (2015) con-
many anesthesiologists are needed on regular duty sider a multiarmed bandit problem where, at each
and how many on overtime). Although such de- period, the decision maker can choose one of finitely
cisions involve tangible, explicit costs (e.g., requiring many decisions, and the reward of each decision is a
an on-call anesthesiologist to come in requires a function of a high-dimensional covariate vector. The
payment), they also involve implicit costs (e.g., hav- problem is to find a policy that achieves minimal
ing an anesthesiologist on call but not calling him regret. To do this, the paper proposes a novel algo-
or her). The paper proposes an approach that learns rithm based on the LASSO method. The specific
such implicit costs from data on prior staffing de- healthcare application studied in the paper is that of
cisions and then uses these estimates within a two- learning to dose optimally: a physician encounters
stage integer stochastic dynamic program to make patients described by various features (e.g., patient’s
staffing decisions that lead to improvements in total gender, patient’s age, whether the patient has other
(implicit plus explicit) cost. preexisting conditions, whether the patient is taking
Another example of optimization being applied to other drugs, and so on) and must prescribe a dose to
hospital operations is the paper by Zenteno et al. each patient but must learn the effect of different
(2016). This paper describe the successful imple- dosage levels on the fly. Using a data set on warfarin
mentation by Massachusetts General Hospital (MGH) dosing, the paper demonstrates how the approach
of a large-scale optimization model in order to reduce can be used effectively to prescribe initial doses to
the surgical patient flow congestion in the peri- patients who are starting warfarin therapy.
operative environment without requiring additional Another example is the work of Bertsimas et al.
resources. The paper studies an optimization model (2017). This paper considers the problem of recom-
that rearranges the elective block schedule in order to mending a personalized drug treatment regimen to a
smooth the average inpatient census by reducing the patient with diabetes based on his or her medical
maximum average occupancy throughout the week. attributes so as to minimize the patient’s glycated
This model was implemented by MGH and resulted hemoglobin A1c (HbA1c) level, a measure of a pa-
in increasing the effective capacity of the surgical units tient’s baseline blood glucose level over the preceding
at MGH. Outside of scheduling and staffing, the paper two to three months. The paper proposes an approach
by Bravo et al. (2019) develops a linear optimization- that uses the k-nearest-neighbors algorithm to de-
based framework, inspired by network revenue man- termine, for a given patient, which patients are most
agement, for making resource allocation and case mix similar to that patient in terms of their demographics
decisions in a hospital network. Using data from an (such as age and sex), medical history (such as days
academic medical center, the paper shows how the since first diabetes diagnosis), and treatment history
approach can lead to better case mix decisions than (such as the number of regimens previously tried). By
current practice, which relies on prioritizing services focusing on patients similar to the one at hand, one
based on traditional cost-accounting methods. can obtain an estimate of what the patient’s HbA1c
Outside of optimization, the paper by Ang et al. level will be under each possible drug regimen
(2015) focuses on predictive modeling, specifically the and thereby recommend a good drug regimen. Using
problem of predicting emergency department (ED) data from Boston Medical Center, the paper shows
wait times in hospitals. Many approaches have been how the approach can lead to clinically significant re-
proposed to predict ED wait times, such as rolling- ductions in HbA1c level relative to the current stan-
average estimators, metrics based on fluid models, dard of care.
and quantile regression. The paper proposes a method
that combines the LASSO method of statistical learning 5. Conclusions and Future Directions
(Tibshirani 1996) with predictor variables that are In this paper, we have highlighted recent work in
derived from a generalized fluid model of the queue. OM, leveraging methodological advances in machine
The paper shows, using real hospital data, how the learning and optimization that allow large-scale data
proposed method outperforms other approaches to to be used for complex decision making. We conclude
predicting ED wait times, leading to reductions our review by discussing some methodological direc-
in mean squared error of up to 30% relative to the tions that we expect to grow in importance in the future.
5.1. Causal Inference regression model that allows them to find that a
Machine learning is usually concerned with predicting postpurchase price drop is another cause of returns
a dependent variable using a collection of independent because it gives rise to opportunistic returns.
variables. In many OM applications, the decision en- Recently, new approaches have also been emerging
ters into the machine learning model as an indepen- that combine machine learning with prescriptive an-
dent variable. For such settings, a more relevant per- alytics to deal with issues of heterogeneity. Alley et al.
spective to take is in understanding the causal impact (2019) provide a framework for estimating price
of the decision on the dependent variable. This con- sensitivity when pricing tickets in a secondary mar-
cern for the validity of decisions motivates the study ket. Because of the heterogeneous nature of tickets,
of causal inference methods, which is a major area of the unique market conditions at the time each ticket is
focus in statistics and econometrics. listed, and the sparsity of available tickets, demand
The relevance of causal inference to empirical OM estimation needs to be done at the individual-ticket
has been highlighted previously (Ho et al. 2017), but level. The paper introduces a double-orthogonalized
less research has highlighted the importance of con- machine-learning method for classification that iso-
sidering causality in prescriptive modeling. We have lates the causal effects of pricing on the outcome
already discussed one example, the work of Baardman by removing the conditional effects of the ticket and
et al. (2018b), which develops a specialized estimation market features. Furthermore, the paper shows how
method for detecting customer-to-customer trends this price sensitivity estimation procedure can be
that uses the instrumental variables approach. An- embedded in an optimization model for selling tickets
other example is the work of Bertsimas and Kallus in a secondary market. This double-orthogonalization
(2016), which considers the problem of pricing from procedure can also be applied to other settings, such
observational data. This paper shows that a standard as personalized pricing and estimating intervention
approach to data-driven pricing, which involves fit- effects in healthcare.
ting a regression model to price and then optimizing
revenue as price multiplied by predicted demand, can 5.2. Interpretability
actually be highly suboptimal when there exist con- Another exciting future direction is in the develop-
founding factors affecting both historical prices and ment of interpretable models in OM. In machine
demand. The paper develops a statistical hypothesis learning, an interpretable model is one in which a hu-
test for determining whether the price prescription man being can easily see and understand how the
from a predictive model is optimal and applies this model maps an observation into a prediction (Freitas
approach to a car loan data set. 2014). Interpretability is a major area of research in
Outside of revenue management, Fisher et al. (2016), machine learning for two reasons. First, interpret-
Bandi et al. (2018), and Chaurasia et al. (2019) study able models can provide insights into the prediction
causal inference and prescriptive modeling questions problem that black-box models, such as random for-
in supply chain management, particularly relating to ests and neural networks, cannot. Second, and more
the speed of delivery of online orders and subsequent important, machine-learning models often do not af-
product returns. Fisher et al. (2016) use a quasi-natural fect decisions directly but instead make predictions
experiment to demonstrate the economic value of faster or recommendations to a human decision maker; in
delivery for an online retailer. Chaurasia et al. (2019) many contexts (e.g., medicine), a decision maker is un-
take this one step further by analyzing a transaction likely to accept a recommendation made by a machine-
data set from a large fashion e-retailer and estab- learning model without the ability to understand how
lishing a causal relationship between product deliv- the recommendation was made. In addition, there is
ery gaps in the supply chain and product returns. also growing legislation, such as the General Data
Furthermore, the paper embeds this causal model Protection Rules in the European Union (Doshi-Velez
within a data-driven optimization framework in or- and Kim 2017), requiring that algorithms affecting
der to reduce product returns by optimizing product- users must be able to provide an explanation of their
delivery services. Using the e-retailer’s data, the decisions.
paper shows that a two-day delivery policy—that is, Although interpretable machine learning is still new
when all orders are attempted to be delivered within to OM, some recent papers have considered inter-
two days of order placement—can lead to cost savings pretability in the context of dynamic decision making.
of as much as $8.9 million every year through reduced Bravo and Shaposhnik (2018) develop an approach
product returns. Bandi et al. (2018) also investigate called mining optimal policies (MinOP), wherein one
the causes of product returns, with a focus on how analyzes exact solutions of stochastic dynamic pro-
dynamic pricing can lead to more product returns in grams using interpretable machine learning methods;
the online retail industry. In particular, using data this approach provides a modeler with insight into the
from the same online retailer, they develop a logistic structure of the optimal policy for a given family of
problems. Ciocan and Mišić (2018) propose an algo- the availability of side information. For example, Ama-
rithm for finding policies for optimal stopping prob- zon may see other types of user–product interactions—
lems in the form of a binary tree, similar to trees for example, when users view a product page, add a
commonly used in machine learning; the paper shows product to their wish list, add a product to their shop-
how the proposed tree policies outperform (non- ping cart, and so on. The data for each interaction can
interpretable) policies derived from approximate dy- be represented as a user–product matrix in the same
namic programming in the problem of option pricing way as the principal interaction (making a purchase);
while being significantly simpler to understand. With each matrix thus can be viewed as a slice of a three-
the continuing proliferation of machine learning- dimensional tensor. The paper develops a novel algo-
based decision support systems in modern business, we rithm for recovering slices of the data tensor based
expect interpretable decision making to become a major on singular value decomposition and linear regres-
area of research in OM. sion; using data from Xiami, an online music streaming
service in China, the paper shows how the proposed
5.3. “Small-Data” Methods method can more accurately predict user–song in-
One challenge that is being recognized with the interactions than existing methods that do not use side
creasing availability of data is that these data are often information.
small. This paradoxical statement refers to the fact In an entirely different application, the tensor com-
that, often, although the number of observations one pletion method of Farias and Li (2019) has been used
might have access to is large, the number of param- for the problem of designing a blood test for cancer in
eters that one must estimate is also large. For example, the paper by Mahmoudi et al. (2017). Specifically, an
imagine an online retailer offering a very large se- emerging idea for designing a blood test for cancer is
lection of products in a particular category; the re- to insert nanoparticles into a blood sample and mea-
tailer might be interested in estimating the demand sure the binding levels of these nanoparticles; this
rate of each product but may only see a handful of leads to data in the form of a three-dimensional ten-
purchases for each product in a year. The paper by sor. Although the raw data have too much noise to be
Gupta and Rusmevichientong (2017) considers how directly used in a predictive model, Mahmoudi et al.
to solve linear optimization problems in this small- (2017) leverage the method of Farias and Li (2019) to
data regime and shows that standard approaches in denoise the data. Using data from a longitudinal
stochastic optimization, such as sample-average ap- study, Mahmoudi et al. (2017) show how the result-
proximation, are suboptimal in this setting. The paper ing models can predict whether a patient will de-
develops two different solution approaches, based velop cancer with diagnostically significant accu-
on empirical Bayes methods and regularization, that racy seven to eight years after the blood sample
enjoy desirable theoretical guarantees as the number was drawn.
of uncertain parameters grows and showcases the
benefits of these approaches in an online advertising 5.4. New Approaches to “Predict-Then-Optimize”
portfolio application. Many of the papers described earlier rely on the
A different approach, which specifically considers “predict-then-optimize” paradigm: first, build a pre-
prediction in the small-data regime, can be found dictive model using data, and then embed that model
in the paper by Farias and Li (2019), which studies within the objective function of an optimization prob-
the following problem: a retailer offers a collection of lem. Machine learning models are typically estimated
products to a collection of users. For each product– so as to lead to good out-of-sample predictions; how-
user pair, we see whether the user has purchased that ever, such models may not necessarily lead to good out-
product or not, leading to a 0–1 matrix with users as of-sample decisions. Rather than estimating models
rows and products as columns. Based on these data, that minimize loss functions measuring predictive per-
we would like to estimate the probability of any given formance, the paper by Elmachtoub and Grigas (2017)
user purchasing any given product, leading to a matrix proposes minimizing a loss function that is related
of user–product purchase probabilities; such a ma- to the objective value of the decisions that ensue from
trix is useful because it can guide product recom- the model. The paper shows how this new estima-
mendations, for example. The problem is made chal- tion approach leads to decisions that outperform the
lenging by its scale (e.g., Amazon has on the order of traditional approach in fundamental problems, such
100 million products and 100 million users) and the as the shortest path, assignment, and portfolio selection
fact that there is a very small number of transactions per problems. Given the prescriptive focus of OM, this
user, which inhibits one’s ability to accurately esti- work highlights the potential opportunities for revis-
mate the user–product purchase–probability matrix. iting how machine-learning models are constructed
To solve the problem, Farias and Li (2019) leverage for prescriptive applications in OM.
References Bertsimas D, Farias VF, Trichakis N (2013) Fairness, efficiency, and

flexibility in organ allocation for kidney transplantation. Oper.
Acimovic J, Graves SC (2014) Making better fulfillment decisions
Res. 61(1):73–87.
on the fly in an online retail environment. Manufacturing Service
Oper. Management 17(1):34–51. Bertsimas D, O’Hair A, Relyea S, Silberholz J (2016) An analytics
Ahuja RK, Orlin JB (2001) Inverse optimization. Oper. Res. 49(5): approach to designing combination chemotherapy regimens for
771–783. cancer. Management Sci. 62(5):1511–1531.
Alley M, Biggs M, Hariss R, Herman C, Li M, Perakis G (2019) Pricing Bertsimas D, Kallus N, Weinstein AM, Zhuo YD (2017) Personalized
for heterogeneous products: Analytics for ticket reselling. diabetes management using electronic medical records. Diabetes
Working paper, Massachusetts Institute of Technology, Care 40(2):210–217.

Cambridge. Blanchet J, Gallego G, Goyal V (2016) A Markov chain approximation
Ang E, Kwasnick S, Bayati M, Plambeck EL, Aratow M (2015) Ac- to choice modeling. Oper. Res. 64(4):886–905.
curate emergency department wait time prediction. Manufactur- Bravo F, Shaposhnik Y (2018) Mining optimal policies: A pattern
ing Service Oper. Management 18(1):141–156. recognition approach to model analysis. Working paper, Uni-
Angrist JD, Pischke J-S (2008) Mostly Harmless Econometrics: An Em- versity of California, Los Angeles, Los Angeles.
piricist’s Companion (Princeton University Press, Princeton, NJ). Bravo F, Braun M, Farias V, Levi R, Lynch C, Tumolo J, Whyte R
Aouad A, Farias VF, Levi R (2015) Assortment optimization under (2019) Optimization-driven framework to understand healthcare
consider-then-choose choice models. Working paper, London network costs and resource allocation. Working paper, Univer-
Business School, London. sity of California, Los Angeles, Los Angeles.
Aouad A, Farias V, Levi R, Segev D (2018) The approximability of Breiman L (1996) Bagging predictors. Machine Learn. 24(2):123–140.
assortment optimization under ranking preferences. Oper. Res. Breiman L (2001) Random forests. Machine Learn. 45(1):5–32.
66(6):1661–1669. Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and
Aswani A, Shen Z-JM, Siddiq A (2019) Data-driven incentive de- Regression Trees (CRC Press, Boca Raton, FL).
sign in the Medicare shared savings program. Oper. Res. 67(4): Chaurasia M, Pandey S, Perakis G, Rathore HS, Singhvi D,
1002–1026. Spanditakis Y (2019) First delivery gaps: A supply chain level
Avrahami A, Herer YT, Levi R (2014) Matching supply and demand: to reduce product returns in online retail. Working paper,
Delayed two-phase distribution at Yedioth Group—models, al- Massachusetts Institute of Technology, Cambridge.
gorithms, and information technology. Interfaces 44(5):445–460. Chen X, Owen Z, Pixton C, Simchi-Levi D (2015) A statistical learning
Baardman L, Levin I, Perakis G, Singhvi D (2018a) Leveraging com- approach to personalization in revenue management. Working
parables for new product sales forecasting. Working paper, paper, Massachusetts Institute of Technology, Cambridge.
Massachusetts Institute of Technology, Cambridge. Ciocan DF, Mišić VV (2018) Interpretable optimal stopping. Working
Baardman L, Borjian Boroujeni S, Cohen-Hillel T, Panchamgam K, paper, INSEAD, Fontainebleau, France.
Perakis G (2018b) Detecting customer trends for optimal pro- Cohen M, Lobel I, Paes Leme R (2016) Feature-based dynamic pric-
motion targeting. Working paper, Massachusetts Institute of ing. Working paper, New York University, New York.
Technology, Cambridge. Cohen MC, Kalas J, Perakis G (2017a) Optimizing promotions for
Baardman L, Cohen M, Panchamgam K, Perakis G, Segev D (2018c) multiple items in supermarkets. Working paper, Massa-
Scheduling promotion vehicles to boost profits. Management Sci. chusetts Institute of Technology, Cambridge.
65(1):50–70.
Cohen MC, Leung N-HZ, Panchamgam K, Perakis G, Smith A (2017b)
Ban G-Y, Keskin NB (2017) Personalized dynamic pricing with ma-
The impact of linear optimization on promotion planning. Oper.
chine learning. Working paper, London Business School, London.
Res. 65(2):446–468.
Ban G-Y, Rudin C (2019) The big data newsvendor: Practical insights
Cohen-Hillel T, Panchamgam K, Perakis G (2019a) High-low pro-
from machine learning. Oper. Res. 67(1):90–108.
motion policies for peak-end demand models. Working paper,
Ban G-Y, Gallien J, Mersereau AJ (2018) Dynamic procurement
Massachusetts Institute of Technology, Cambridge.
of new products with covariate information: The residual tree
Cohen-Hillel T, Panchamgam K, Perakis G (2019b) Bounded memory
method. Manufacturing Service Oper. Management, ePub ahead of
peak end models can be surprisingly good. Working paper,
print December 10, https://doi.org/10.1287/msom.2018.0725.
Massachusetts Institute of Technology, Cambridge.
Bandi C, Moreno A, Ngwe D, Xu Z (2018) Opportunistic returns and
dynamic pricing: Empirical evidence from online retailing in Delage E, Ye Y (2010) Distributionally robust optimization under
emerging markets. Harvard Business School Working Paper moment uncertainty with application to data-driven problems.
19-030, Harvard University, Boston. Oper. Res. 58(3):595–612.
Bastani H, Bayati M (2015) Online decision-making with high- Doshi-Velez F, Kim B (2017) Toward a rigorous science of in-
dimensional covariates. Working paper, University of Pennsyl- terpretable machine learning. Working paper, Harvard Uni-
vania, Philadelphia. versity, Cambridge.
Bernstein F, Modaresi S, Sauré D (2019) A dynamic clustering ap- Elmachtoub AN, Grigas P (2017) Smart “predict, then optimize.”
proach to data-driven assortment personalization. Management Working paper, Columbia University, New York.
Sci. 65(5):2095–2115. Ettl M, Harsha P, Papush A, Perakis G (2019) A data-driven ap-
Bertsimas D, Kallus N (2016) The power and limits of predictive proach to personalized bundle pricing and recommendation.
approaches to observational-data-driven optimization. Working Manufacturing Service Oper. Management, ePub ahead of print
paper, Massachusetts Institute of Technology, Cambridge. July 19, https://doi.org/10.1287/msom.2018.0756.
Bertsimas D, Kallus N (2019) From predictive to prescriptive ana- Farias VF, Li AA (2019) Learning preferences with side informa-
lytics. Management Sci., ePub ahead of print August 23, https:// tion. Management Sci. 65(7):3131–3149.
doi.org/10.1287/mnsc.2018.3253. Farias VF, Jagabathula S, Shah D (2013) A nonparametric approach
Bertsimas D, Mišić VV (2019) Exact first-choice product line opti- to modeling choice with limited data. Management Sci. 59(2):
mization. Oper. Res. 67(3):651–670. 305–322.
Bertsimas D, Brown DB, Caramanis C (2011) Theory and applications Feldman J, Paul A, Topaloglu H (2019) Assortment optimization with
of robust optimization. SIAM Rev. 53(3):464–501. small consideration sets. Oper. Res. Forthcoming.
Ferreira KJ, Lee BHA, Simchi-Levi D (2015) Analytics for an online He L, Mak H-Y, Rong Y, Shen Z-JM (2017) Service region design for
retailer: Demand forecasting and price optimization. Manufacturing urban electric vehicle sharing systems. Manufacturing Service
Service Oper. Management 18(1):69–88. Oper. Management 19(2):309–327.
Fisher M, Gallino S, Xu J (2016) The value of rapid delivery in online Ho T-H, Lim N, Reza S, Xia X (2017) OM Forum: Causal inference
retailing. Working paper, University of Pennsylvania, Philadelphia. models in operations management. Manufacturing Service Oper.
Freitas AA (2014) Comprehensible classification models: A position Management 19(4):509–525.
paper. ACM SIGKDD Explorations Newslett. 15(1):1–10. Javanmard A, Nazerzadeh H (2019) Dynamic pricing in high-
Glaeser CK, Fisher M, Su X (2018) Optimal retail location: Empirical dimensions. J. Machine Learn. Res. 20(9):1–49.
methodology and application to practice. Manufacturing Service Mahmoudi M, Caracciolo G, Safavi-Sohi R, Poustchi H, Li AA,
Oper. Management 21(1):86–102. Vasighi M, Chiozzi RZ, et al. (2017) Multi-nanoparticle protein
corona characterization of human plasma and machine learning
Golrezaei N, Nazerzadeh H, Rusmevichientong P (2014) Real-time
enable accurate identification and discrimination of cancers
optimization of personalized assortments. Management Sci. 60(6):
at early stages. Working paper, Harvard Medical School,
1532–1551.
Cambridge, MA.
Gupta V, Rusmevichientong P (2017) Small-data, large-scale linear
Rath S, Rajaram K (2018) Staff planning for hospitals with cost es-
optimization with uncertain objectives. Working paper, Uni-
timation and optimization. Working paper, University of North
versity of Southern California, Los Angeles. Carolina at Chapel Hill, Chapel Hill.
Gupta V, Han BR, Kim S-H, Paek H (2017) Maximizing interven- Rath S, Rajaram K, Mahajan A (2017) Integrated anesthesiologist and
tion effectiveness. Working paper, University of Southern Cal- room scheduling for surgeries: Methodology and application.
ifornia, Los Angeles. Oper. Res. 65(6):1460–1478.
Harsha P, Subramanian S, Uichanco J (2019) Dynamic pricing of Tibshirani R (1996) Regression shrinkage and selection via the lasso.
omnichannel inventories. Manufacturing Service Oper. Manage- J. Roy. Statist. Soc. B. 58(1):267–288.
ment 21(1):47–65. Vielma JP (2015) Mixed integer linear programming formulation
Hawkins JT (2003) A Lagrangian decomposition approach to weakly techniques. SIAM Rev. 57(1):3–57.
coupled dynamic optimization problems and its applications. Zenteno AC, Carnes T, Levi R, Daily BJ, Dunn PF, Systematic OR
Unpublished PhD thesis, Massachusetts Institute of Technology, (2016) Block allocation at a large academic medical center. Ann.
Cambridge. Surgery 264(6):973–981.

8 (Data Analytics in Operations Management A Review

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

8 (Data Analytics in Operations Management A Review

Uploaded by

Copyright:

Available Formats

This article was downloaded by: [183.222.235.

92] On: 05 March 2023, At: 01:12

Manufacturing & Service Operations Management

Data Analytics in Operations Management: A Review

To cite this article:

Full terms and conditions of use: https://pubsonline.informs.org/Publications/Librarians-Portal/PubsOnLine-Terms-and-

Copyright © 2019, INFORMS

Please scroll down for article—it is on subsequent pages

20th Anniversary Invited Article

Data Analytics in Operations Management: A Review

Velibor V. Mišić,a Georgia Perakisb

Keywords: data analytics • machine learning • operations management

1. Introduction optimization can be applied to and the scale at which

then, is how should one allocate the intervention to

References Bertsimas D, Farias VF, Trichakis N (2013) Fairness, efﬁciency, and

Working paper, Massachusetts Institute of Technology, Care 40(2):210–217.

You might also like