Professional Documents
Culture Documents
Network DEAin Computersand Operations Research
Network DEAin Computersand Operations Research
net/publication/223159166
CITATIONS READS
237 706
2 authors:
Some of the authors of this publication are also working on these related projects:
Optimizing Single Vehicle Many-to-Many Operations with Desired Delivery Times View project
All content following this page was uploaded by Thomas R. Sexton on 24 June 2019.
Abstract
DEA models treat the DMU as a “black box.” Inputs enter and outputs exit, with no consideration of the
intervening steps. Consequently, it is di(cult, if not impossible, to provide individual DMU managers with
speci;c information regarding the sources of ine(ciency within their DMUs. We show how to use DEA to
look inside the DMU, allowing greater insight as to the sources of organizational ine(ciency. Our model
applies to DMUs that consist of a network of Sub-DMUs, some of which consume resources produced by
other Sub-DMUs and some of which produce resources consumed by other Sub-DMUs. Our Network DEA
Model allows for either an input orientation or an output orientation, any of the four standard assumptions
regarding returns to scale in any Sub-DMU, and adjustments for site characteristics in each Sub-DMU. We
demonstrate how to incorporate reverse quantities as inputs, intermediate products, or outputs. Thus, we can
apply the Network DEA Model presented here in many managerial contexts. We also prove some theoretical
properties of the Network DEA Model.
By applying the Network DEA Model to Major League Baseball, we demonstrate the advantages of the
Network DEA Model over the standard DEA Model. Speci;cally, the Network DEA Model can detect in-
e(ciencies that the standard DEA Model misses. Perhaps of greatest signi;cance, the Network DEA Model
allows individual DMU managers to focus e(ciency-enhancing strategies on the individual stages of the
production process.
? 2003 Elsevier Ltd. All rights reserved.
Keywords: Data envelopment analysis; Networks; E(ciency; Productivity; Major league baseball
∗
Corresponding author. Tel.: +1-631-632-7172; fax: +1-631-632-9412.
E-mail addresses: hlewis@notes.cc.sunysb.edu (H.F. Lewis), tsexton@notes.cc.sunysb.edu (T.R. Sexton).
0305-0548/$ - see front matter ? 2003 Elsevier Ltd. All rights reserved.
doi:10.1016/S0305-0548(03)00095-9
1366 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
1. Introduction
Data envelopment analysis (DEA) is a linear programming-based methodology for evaluating the
relative e(ciency of each member of a set of organizational units. The units, called decision-making
units (DMUs), consume various levels of each speci;ed input and produce various levels of each
speci;ed output. DEA evaluates the e(ciency of a DMU relative to an empirical production possi-
bility frontier determined by all DMUs under appropriate assumptions regarding returns to scale and
orientation.
DEA makes no assumptions concerning the internal operations of a DMU. Rather, DEA treats
each DMU as a “black box” by considering only the inputs consumed and outputs produced by
each DMU. This perspective is often appropriate and su(cient. For example, if the purpose of the
analysis is to identify ine(cient DMUs and evaluate the extent of their ine(ciency, then a “black
box” approach is adequate. However, such an approach provides no insight regarding the sources of
ine(ciency and cannot provide process-speci;c guidance to DMU managers to help them improve
the DMU’s e(ciency.
Castelli et al. [1] described a DEA-like model that evaluates the e(ciencies of each of a number
of interdependent sub-units within a larger DMU. Their analysis assesses sub-unit e(ciency relative
to other sub-units within the same DMU. This model is attractive because it allows a DMU manager
to assess e(ciencies within the DMU without access to data from other DMUs. However, their
approach may result in very small reference sets, thereby limiting the number of inputs and outputs
that can be included and possibly leading to inKated e(ciency scores.
In Sexton and Lewis [2], we considered an extension of the DEA model in which each DMU is
comprised of two Sub-DMUs connected in series. In that model, the Stage 1 Sub-DMU consumes
inputs to produce intermediate products. These, in turn, are the inputs to the Stage 2 Sub-DMU,
which uses them to produce the DMU’s outputs. The objective of the two-stage DEA Model is to
evaluate the relative e(ciencies of each DMU and each of its Sub-DMUs.
While we believe that the two-stage DEA Model describes many important settings, we recognize
that we may require more complexity in other cases. In this article, we develop a Network DEA
Model in which the internal design of the production process is more intricate. Speci;cally, we
assume that each DMU is comprised of a set of Sub-DMUs. Each input to a Sub-DMU is either exo-
genous to the DMU or is the output of another Sub-DMU. Similarly, each output from a Sub-DMU
either leaves the DMU or is an input to another Sub-DMU. See Fig. 1. Consider the directed graph
in which the nodes correspond to Sub-DMUs and the arcs correspond to Kows from one Sub-DMU
to another. We assume that this graph is acyclic.
We are not the ;rst to consider the network DEA Model. FLare and Whittaker [3] and FLare and
Grosskopf [4] have presented similar models. Those models de;ne the reference set for the DMU
as the set of all convex linear combinations of DMUs, as is the common practice in DEA. In our
model, we de;ne the reference set for the DMU based on the hypothetical Sub-DMUs identi;ed for
each Sub-DMU.
We apply the Network DEA Model to major league baseball (MLB). Two Sub-DMUs corre-
spond to the team’s front o(ce operation, which uses money in the form of player salaries to
acquire talent. Three Sub-DMUs correspond to the team’s on-;eld operation, which utilize the
talent to win games. This analysis is a generalization of that presented in Sexton and
Lewis [2].
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1367
DMU d
Multiple
Inputs
Multiple Multiple
Other Other
Intermediate Sub-DMU s Intermediate
Sub-DMUs Products Products Sub-DMUs
Multiple
Outputs
We argue that it is crucial for the team’s ownership to identify the extent of ine(ciency within
each Sub-DMU. Ine(ciency in a front o(ce Sub-DMU suggests that the team evaluates oNensive
or defensive talent poorly and pays players more than they are worth. Ine(ciency in an on-;eld
Sub-DMU, however, indicates that the team is unable to produce or prevent as many runs as war-
ranted by their talent level or produce as many victories as warranted by the runs they score and
prevent. (DEA is a deterministic method that ignores random variation such as players having ex-
ceptionally good or bad years. At the team level, DEA assumes that such random variations cancel
one another su(ciently to allow a deterministic analysis. Other methods, such as stochastic fron-
tier analysis (see Kumbhakar and Knox Lovell [5]), explicitly allow for such random variation).
Knowing the sources of ine(ciency within an organization allows management to develop focused
interventions designed to improve organizational performance.
We ;rst present the Network DEA Model using an output orientation, indicating the modi;cations
necessary for an input orientation. A DEA Model is output oriented if it seeks to increase outputs
without increasing inputs. Similarly, a DEA model is input oriented if it seeks to decrease inputs
without decreasing outputs. Our approach to the Network DEA Model is an extension of that used
in the two-stage DEA Model. In brief, we begin by solving a DEA problem for each Sub-DMU to
obtain its e(ciency score. Then we use the acyclic structure of the underlying graph to identify a
partial order of the Sub-DMUs based on the dependence of each Sub-DMU on the output of other
Sub-DMUs. Finally, we evaluate the e(ciency of the entire DMU by resolving the DEA problem
for each Sub-DMU in accordance with the partial order, assuming that all Sub-DMUs that precede
the Sub-DMU under analysis are e(cient.
1368 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Let k be the index of the DMU under analysis. For d = 1; : : : ; D, and s, t = 1; : : : ; S, s = t, we let
Xdsi = Level of input i consumed by Sub-DMU s in DMU d, for i = 1; : : : ; I ,
Ydstp = Level of intermediate product p produced by Sub-DMU s and consumed by Sub-DMU t in
DMU d, for p = 1; : : : ; P,
Zdsr = Level of output r produced by Sub-DMU s in DMU d for r = 1; : : : ; R,
dsk = Weight placed on Sub-DMU s in DMU d by Sub-DMU s in DMU k, and
sk = Inverse e(ciency of Sub-DMU s in DMU k.
For Sub-DMU s at DMU k, we solve
Max sk
s:t:
D
dsk Xdsi 6 Xksi ; i = 1; : : : ; I; (1)
d=1
S
D
S
dsk Ydtsp 6 Yktsp ; p = 1; : : : ; P; (2)
d=1 t=1 t=1
S
D
S
dsk Ydstp ¿ sk Ykstp ; p = 1; : : : ; P; (3)
d=1 t=1 t=1
D
dsk Zdsr ¿ sk Zksr ; r = 1; : : : ; R; (4)
d=1
dsk ¿ 0; d = 1; : : : ; D;
sk ¿ 0:
The optimal values of the weights, dsk ∗ , allow us to identify an e(cient hypothetical Sub-DMU that
serves as a reference point for Sub-DMU s at DMU k. This reference point is a linear combination
of e(cient Sub-DMUs in the sense that each of its inputs, intermediate products, and outputs is a
weighted sum of the corresponding
D quantities at the e(cient
D Sub-DMUs.
S For example, the reference
Sub-DMU consumes X of input i and ( t=1 dtsp )
Y of intermediate product
D d=1 dskdsi S
d=1 dsk
D
p, while producing d=1 dsk ( t=1 Ydstp ) of intermediate product p and d=1 dsk Zdsr of output
∗
r. We refer to the set of e(cient Sub-DMUs for which dsk ¿ 0 as the e;cient reference set for
Sub-DMU s at DMU k.
Inequalities (1) and (2) ensure that the reference Sub-DMU consumes no more of each input and
each intermediate product as does Sub-DMU s at DMU k. Inequalities (3) and (4) guarantee that
the reference Sub-DMU produces at least as much of each intermediate product and each output as
does Sub-DMU s at DMU k. In fact, the model selects the reference Sub-DMU that produces at
least sk times the levels of each intermediate product and output as does Sub-DMU k, where sk is
as large as possible. For example, if the maximal value of sk = 1:15, then the reference Sub-DMU
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1369
produces at least 15% more of each intermediate product and each output as does Sub-DMU k.
We refer to sk as the inverse e;ciency of Sub-DMU s at DMU k because of its analogy with the
de;nition of e(ciency that appears in the input-oriented model described below.
We de;ne
S
D
∗ ∗
Yksp = dsk Ydstp ; p = 1; : : : ; P
d=1 t=1
to be the level of intermediate product p produced by Sub-DMU s in DMU k if it were e(cient,
∗ is the optimal value of . In addition, we de;ne
where dsk dsk
D
∗ ∗
Zksr = dsk Zdsr ; r = 1; : : : ; R
d=1
to be the level of output r produced by Sub-DMU s in DMU k if it were e(cient.
To obtain DMU k’s organizational inverse e(ciency, k , we proceed as follows. Consider the
directed graph in which the nodes correspond to Sub-DMUs and the arcs correspond to Kows of
intermediate products from one Sub-DMU to another. Augment this graph by adding a source node
from which all inputs Kow and a sink node into which all outputs Kow. We assume that this graph
is acyclic.
Next, perform a depth-;rst search on the nodes of the directed acyclic graph and sequence the
Sub-DMUs in reverse order as reached by this search (see Skiena [6], Sedgewick [7], and Kruse
et al. [8]). Solve the model for each Sub-DMU in sequence using the levels of each intermediate
product that would arrive at the Sub-DMU if all previous Sub-DMUs were e(cient. We refer to
these optimal intermediate products as ∗ Yksp
∗ . The reference set for each Sub-DMU remains as it
was when the Sub-DMU model was solved the ;rst time. Let ∗ Zksr ∗ be the level of output r that
Sub-DMU s in DMU k would produce under these conditions if it were e(cient. 1 Then DMU k’s
organizational inverse e(ciency, k , is
S ∗Z ∗
ksr
k = min s=1
S
:
r=1;:::;R Z
s=1 ksr
We point out we may organize the Sub-DMUs in stages such that all the Sub-DMUs in a stage are
solved before any Sub-DMU in a subsequent stage is solved. This allows for a better understanding
of the Sub-DMU structure and faster computation if parallel processing is available. One way to
identify the stages is to apply the Critical Path Method (CPM) (see, for example, Wiest and Levy
[9]) treating each Sub-DMU as a task that requires one time unit and interpreting each Kow as a
precedence constraint. The earliest ;nish time of each task then represents its stage.
The formulation above corresponds to the output orientation because we applied the inverse
e(ciencies, sk and k , to the production levels associated with the corresponding Sub-DMUs. By
contrast, input oriented models apply the e(ciency scores, Esk , and Ek , to the consumption levels.
1
We will employ the convention of omitting pre-superscripts if the Sub-DMU producing the intermediate product or
output consumes only inputs and no intermediate products.
1370 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
dsk ¿ 0; d = 1; : : : ; D;
Esk ¿ 0:
The interpretations of the constraints in this model are parallel to those in the output-oriented model.
We refer to Esk as the e;ciency of Sub-DMU s at DMU k because it represents the ratio of the
lowest feasible level of each input or intermediate product at that Sub-DMU to its actual level.
In this case, we de;ne
S
D
∗ ∗
Yksp = dsk Ydtsp ; p = 1; : : : ; P
d=1 t=1
Sub-DMU model was solved the ;rst time. Let ∗ Xksi ∗ be the level of input i that Sub-DMU s in
DMU k would consume under these conditions if it were e(cient. 2 Then DMU k’s organizational
2
We will employ the convention of omitting post-superscripts if the Sub-DMU consuming the intermediate product or
input produces only outputs and no intermediate products.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1371
e(ciency, Ek , is
S ∗X ∗
ksi
Ek = max s=1
S
:
i=1;:::;I X
s=1 ksi
Returns to scale for a producing unit refers to the rate at which its outputs increase as its inputs
increase, assuming that the unit is and remains e(cient. See Seiford and Thrall [10] for a detailed
discussion of this topic. If increasing all of the unit’s inputs by a certain constant proportion leads
to an increase in all of its outputs by the same proportion, regardless of the initial input levels,
then we say that the production process demonstrates constant returns to scale. If this leads to an
increase in all of its outputs by at least the same proportion, with one or more increases greater
than this proportion, then we say that the production process demonstrates nondecreasing returns
to scale. If this leads to an increase in all of its outputs by at most the same proportion, with one
or more increases less than this proportion, then we say that the production process demonstrates
nonincreasing returns to scale. In all other cases, we say that the production process demonstrates
variable returns to scale.
Returns to scale is an inherent characteristic of the production process, independent of the sources
of the inputs or the destinations of the outputs. Therefore, in a Network DEA Model, we examine
each Sub-DMU to determine its most appropriate returns to scale assumption without regard to other
Sub-DMUs.
Alternative assumptions concerning returns to scale require the inclusion of an appropriate
constraint:
D
dsk 6 1 Nonincreasing returns to scale;
d=1
D
dsk ¿ 1 Nondecreasing returns to scale;
d=1
D
dsk = 1 Variable returns to scale:
d=1
D
No such constraint is required for constant returns to scale. Intuitively, d=1 dsk represents the
“relative size” of the reference Sub-DMU for Sub-DMU s at DMU k. If we believe that the pro-
duction process at Sub-DMU s is nonincreasing (nondecreasing) returns to scale, then the reference
Sub-DMU must have the same or smaller (larger) relative size. If we believe that the production
process at Sub-DMU s is variable returns to scale, then the reference Sub-DMU must have the
same relative size. Under constant returns to scale, the reference Sub-DMU may be of any relative
size.
1372 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
In some cases, we encounter natural measures of inputs, intermediate products, and outputs with
the property that larger numerical values represent lower levels of consumption or performance. We
refer to such measures as reverse quantities. Lewis and Sexton [11] show how to incorporate reverse
quantities into any DEA Model. The manner in which we incorporate a reverse quantity into a DEA
formulation depends on the DMU’s (or Sub-DMU’s) orientation and whether the reverse quantity is
consumed by or produced by that unit.
Consider ;rst an output-oriented unit. Suppose this unit consumes a reverse quantity. Then we
change the less-than-or-equal-to constraint corresponding to the reverse quantity to a greater-than-or-
equal-to constraint. Suppose this unit produces a reverse quantity. Then we change the greater-than-or-
equal-to constraint corresponding to the reverse quantity to a less-than-or-equal-to constraint. In
addition, we replace the inverse e(ciency variable (the -term) by its inverse, the e(ciency vari-
able (the E-term), that is E = 1=. Finally, if the unit produces both forward and reverse quantities,
so that the -term must remain in the model, then we add the quadratic constraint E = 1 to the for-
mulation. If all quantities produced are reverse, then we can eliminate the -term from the model by
changing the objective function to minimizing E, and there is no need to add the quadratic constraint.
Consider now an input-oriented unit. Suppose this unit produces a reverse quantity. Then we
change the greater-than-or-equal-to constraint corresponding to the reverse quantity to a less-than-or-
equal-to constraint. Suppose this unit consumes a reverse quantity. Then we change the less-than-or-
equal-to constraint corresponding to the reverse quantity to a greater-than-or-equal-to constraint. In
addition, we replace the e(ciency variable (the E-term) by its inverse, the inverse e(ciency variable
(the -term), that is =1=E. Finally, if the unit consumes both forward and reverse quantities, so that
the E-term must remain in the model, then we add the quadratic constraint E =1 to the formulation.
If all quantities consumed are reverse, then we can eliminate the E-term from the model by changing
the objective function to maximizing , and there is no need to add the quadratic constraint.
Unlike standard DEA, our Network DEA Model does not guarantee the existence of an orga-
nizationally e(cient DMU. It is possible that all DMUs under consideration are organizationally
ine(cient. Consider the output-oriented, constant returns to scale example shown in Fig. 2 and
Tables 1 and 2. The Network DEA Model computes the organizational inverse e(ciency for each
DMU to be 1.33. DMU A’s ine(ciency comes from the fact that it should have been able to
produce 10 rather than 5 units of intermediate product 1. DMU B’s ine(ciency comes from the
fact that it should have been able to produce 10 rather than 5 units of intermediate product 2. The
ine(ciencies in the Sub-DMUs propagate through the network and make the DMUs organizationally
ine(cient.
We present below a theorem that identi;es a necessary, but not su(cient, condition for a DMU
to be organizationally e(cient in our Network DEA Model. We de;ne an acyclic directed graph
G = (N; A) in which the node set, N , is the set of Sub-DMUs augmented by two nodes, V and W ,
which serve as origin and destination, respectively. The arc set, A, includes one node pair (s; t) for
each intermediate product produced by Sub-DMU s and consumed by Sub-DMU t. In addition, A
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1373
DMU
Sub
x1
DMU 1
y1
Sub
z1
DMU 3
y2
Sub
x2
DMU 2
Table 1
Data for the example
x1 x2 y1 y2 z1
A 1 1 5 10 20
B 1 1 10 5 20
Table 2
Inverse e(ciencies in the example
includes one node pair (V; t) for each input consumed by Sub-DMU t and one node pair (s; W ) for
each output produced by Sub-DMU s.
Lemma 1. Assume an output oriented model. If Sub-DMU s at DMU k is ine;cient, then it can
produce more of each intermediate product and more of each output.
Proof. This follows directly from the observation that sk ¿ 1, which implies that Sub-DMU s at
DMU k can increase each intermediate product and each output that it produces by at least this
factor.
Lemma 2. Assume an output oriented model and either constant or nondecreasing returns to scale.
If Sub-DMU s at DMU k is e;cient and if all of its inputs and incoming intermediate products
increase, then the Sub-DMU can increase all of its outputs and outgoing intermediate products.
1374 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Proof. The fact that Sub-DMU s at DMU k is e(cient implies that ksk = 1; dsk = 0 for all d = k,
and sk = 1 is an optimal solution for Sub-DMU s at DMU k. Construct DMU k such that DMU
k is identical to DMU k except that the inputs and incoming intermediate products at Sub-DMU s
at DMU k have all been increased. Therefore, we seek to show that Sub-DMU s at DMU k can
increase all of its outputs and outgoing intermediate products. Observe that Sub-DMU s at DMU
k remains e(cient and dominates Sub-DMU s at DMU k . Therefore, any optimal solution for
Sub-DMU s at DMU k is feasible for Sub-DMU s at DMU k . The assumption of either constant or
nondecreasing returns to scale implies that we can increase ksk without reducing any of the dsk for
all d = k. Therefore, we can increase ksk until one or more of the less-than-or-equal-to constraints
become binding, and such increases will not violate any of the greater-than-or-equal-to constraints
in the model. The increased left-hand sides of the greater-than-or-equal-to constraints represent the
levels of intermediate products and outputs produced by the hypothetical reference Sub-DMU. Thus,
Sub-DMU s at DMU k can increase all of its outputs and outgoing intermediate products.
Theorem. Assume an output oriented model and either constant or nondecreasing returns to scale.
If DMU k is organizationally e;cient, then there exists a path from V to W along which every
Sub-DMU is e;cient.
Proof. The proof is by contradiction. Suppose that there is no path from V to W along which
every Sub-DMU is e(cient. We will show that every output Zksr can be increased. For each Zksr ,
observe that Sub-DMU s at DMU k is either e(cient or ine(cient. If it is ine(cient, then Lemma 1
guarantees that Zdsr can be increased. Suppose it is e(cient. Then Sub-DMU s at DMU k can receive
only intermediate products, for if it received an input, we would have discovered a path from V to
W along which every Sub-DMU is e(cient. Consider all the Sub-DMUs that provide intermediate
products to Sub-DMU s at DMU k. If they are all ine(cient, then Lemma 1 guarantees that all the
intermediate products entering Sub-DMU s at DMU k can be increased, implying (by Lemma 2)
that Zdsr can be increased. If any Sub-DMUs that provide intermediate products to Sub-DMU s
at DMU k are e(cient, then continue to trace Kows back toward node V in the same manner.
Eventually, all such paths must lead to ine(cient Sub-DMUs before reaching node V , or else there
would exist a path from V to W along which every Sub-DMU is e(cient. Thus, every output Zksr
can be increased, implying that DMU k is organizationally ine(cient, which is a contradiction.
While the theorem provides a necessary condition for a DMU to be organizationally e(cient,
the example above illustrates that the condition is not su(cient. In DMU A, Sub-DMUs 2 and 3
are both e(cient and form a path of e(cient Sub-DMUs from V to W . However, DMU A is not
organizationally e(cient.
The following discussion of MLB follows that in Lewis and Sexton [11]. MLB is comprised of
30 teams divided into two leagues. The teams are the DMUs in our analysis. There are 16 teams in
the National League, and 14 in the American League. Each league is comprised of three divisions,
each containing between 4 and 6 teams. The leagues play under essentially identical rules with one
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1375
EG DMU d
TBG
POS$ Sub-DMU 1 Sub-DMU 3
WG
RG
RS
TBS
PIT$ Sub-DMU 2 Sub-DMU 4
WS
major exception: the American League allows the use of a designated hitter who bats in place of
the pitcher. This potentially leads to generally greater oNensive production in the American League
because pitchers are commonly poor batters and designated hitters are often very good oNensively.
This leads to possible systematic league-based diNerences in e(ciency scores. We will return to this
issue later.
Each team’s objective is to win the World Series. Each team plays 162 regular season games, 144
within its own league and 18 with teams in the other league. Within each league, the ;rst-place teams
in the three divisions and the second-place team with the most wins qualify for post-season play.
Two short series of games within each league determine the league champions, who then compete
in the World Series. Our analysis will examine only the regular season, including the interleague
games but excluding post-season games.
In Sexton and Lewis [2], we modeled a MLB team as a two-stage acquisition and production
operation. In Stage 1 of that model, the team’s front o(ce uses resources (total player salaries, the
sole input) to acquire talent (oNensive and defensive production, the two intermediate products). In
Stage 2, the talent produces games won on the ;eld, the sole output.
In the Network DEA Model, we look deeper into both the front o(ce and on-;eld operations.
See Fig. 3. We model the front o(ce as composed of two Sub-DMUs, one of which consumes
player salaries to produce position player talent while the other consumes player salaries to produce
pitching talent. We model the on-;eld operation as composed of three Sub-DMUs, one of which
consumes oNensive contributions to score runs, another which consumes defensive contributions to
1376 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
prevent runs by the other team, and a third which consumes runs gained and runs surrendered to
produce games won.
Sub-DMUs 1 and 2 in Fig. 3 constitute the front o(ce operation. Sub-DMU 1 spends money
on position player salaries (POS$). These players provide the team with oNensive production in
terms of total bases 3 gained (TBG) and walks gained (WG) while surrendering defensive errors
(ES). (We do not include stolen bases gained or extra bases gained by base runners as outputs of
Sub-DMU 1 because ;eld managers and base coaches directly inKuence these quantities. We chose
to restrict oNensive production to the ways in which batters reach base. Therefore, stolen bases and
extra bases gained by base runners contribute to the team’s e(ciency.) Sub-DMU 2 spends money
on pitcher salaries (PIT$). Pitchers provide the team with defensive production in terms of total
bases surrendered (TBS) and walks surrendered (WS).
Sub-DMUs 3, 4, and 5 in Fig. 3 constitute the on-;eld operation. Sub-DMU 3 converts TBG,
WG, and errors gained (EG) (committed by the team’s opponents) into runs gained (RG). A team
scores a run when one of its players successfully advances four bases. Sub-DMU 4 converts TBS,
WS, and ES into runs surrendered (RS). Finally, Sub-DMU 5 converts RG and RS into games won
(GW). A baseball team wins a game by scoring more runs than does their opponent after nine
innings of play. No game can end in a tie, so games that are tied after nine innings continue until
one team wins.
Note that TBS, WS, ES, and RS are reverse quantities.
We selected the output orientation for each MLB team and all of its sub-units because we felt
that the appropriate improvement for an ine(cient team would be to increase the number of games
it wins rather than decrease its total player salary. This orientation is consistent with each team’s
long-term goal of qualifying for post-season play.
The input orientation would imply that all teams seek to hold GW at current levels, an assumption
that, we believe, contradicts the fundamental competitive nature of baseball teams. We recognize that
individual teams may make economic decisions to spend less (or more) on player salaries. However,
we see this as a scale change, not evidence of an input orientation.
We selected a variable returns to scale model for Sub-DMUs 1, 2, and 5, and an increasing returns
to scale model for Sub-DMUs 3 and 4. We present our rationale in the following sub-sections.
3
Total bases are de;ned by MLB as S+2D+3T+4HR, where S, D, T, and HR are the numbers of singles, doubles,
triples, and home runs, respectively.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1377
Association contract with MLB. Below a certain threshold, therefore, we expect decreasing returns
to scale.
Eventually, as player salary increases beyond this threshold, the team is better able to sign superior
players who contribute signi;cantly on the ;eld. Here we expect increasing returns to scale.
At very high salary levels, we would again expect the marginal return to be less than the average
return. Superstar players who command the highest salaries are unlikely to provide oNensive and
defensive performance commensurate with their salaries. Above a second threshold, therefore, we
expect decreasing returns to scale.
Based on the discussions and assumptions described above, we employ the following Network
DEA Model formulation for MLB. We begin by solving two DEA models for the front o(ce
1378 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
of team k. The ;rst corresponds to Sub-DMU 1. Note that Sub-DMU 1 has an output orientation,
variable returns to scale, consumes one forward input (POS$), and produces two forward intermediate
products (TBG and WG) and one reverse intermediate product (ES). Therefore, we require the
quadratic constraint. The formulation is
Max 1k
s:t:
D
d1k (POS$)d 6 (POS$)k ;
d=1
D
d1k (TBG)d ¿ 1k (TBG)k ;
d=1
D
d1k (WG)d ¿ 1k (WG)k ;
d=1
D
d1k (ES)d 6 E1k (ES)k ;
d=1
1k E1k = 1;
D
d1k = 1;
d=1
d1k ¿ 0; d = 1; : : : ; D;
1k ; E1k ¿ 0:
The second corresponds to Sub-DMU 2. Note that Sub-DMU 2 has an output orientation, variable
returns to scale, consumes one forward input (PIT$), and produces two reverse intermediate products
(TBS and WS). Thus, we do not require a quadratic constraint. The formulation is
Min E2k
s:t:
D
d2k (PIT $)d 6 (PIT $)k ;
d=1
D
d2k (TBS)d 6 E2k (TBS)k ;
d=1
D
d2k (WS)d 6 E2k (WS)k ;
d=1
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1379
D
d2k = 1;
d=1
d2k ¿ 0; d = 1; : : : ; D;
E2k ¿ 0:
We next turn our attention to the on-;eld operations by solving the models for Sub-DMUs 3 and
4. Sub-DMU 3 has an output orientation, increasing returns to scale, consumes one forward input
(EG) and two forward intermediate products (TBG and WG), and produces one forward intermediate
product (RG). We do not require a quadratic constraint. The formulation is
Max 3k
s:t:
D
d3k (EG)d 6 (EG)k ;
d=1
D
d3k (TBG)d 6 (TBG)k ;
d=1
D
d3k (WG)d 6 (WG)k ;
d=1
D
d3k (RG)d ¿ 3k (RG)k ;
d=1
D
d3k ¿ 1;
d=1
d3k ¿ 0; d = 1; : : : ; D;
3k ¿ 0:
Sub-DMU 4 has an output orientation, increasing returns to scale, consumes three reverse inter-
mediate products (ES, TBS and WS), and produces one reverse intermediate product (RS). We do
not require a quadratic constraint. The formulation is
Min E4k
s:t:
D
d4k (ES)d ¿ (ES)k ;
d=1
D
d4k (TBS)d ¿ (TBS)k ;
d=1
1380 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
D
d4k (WS)d ¿ (WS)k ;
d=1
D
d4k (RS)d 6 E4k (RS)k ;
d=1
D
d4k ¿ 1;
d=1
d4k ¿ 0; d = 1; : : : ; D;
E4k ¿ 0:
We complete the analysis of the on-;eld operations by solving the model for Sub-DMU 5.
Sub-DMU 5 has an output orientation, variable returns to scale, consumes one forward interme-
diate product (RG) and one reverse intermediate product (RS), and produces one forward output
(GW). We do not require a quadratic constraint. The formulation is
Max 5k
s:t:
D
d5k (RG)d 6 (RG)k ;
d=1
D
d5k (RS)d ¿ (RS)k ;
d=1
D
d5k (GW )d ¿ 5k (GW )k ;
d=1
D
d5k = 1;
d=1
d5k ¿ 0; d = 1; : : : ; D;
5k ¿ 0:
Next, we compute the organizational inverse e(ciency of DMU k. We begin by constructing the
directed acyclic graph shown in the top portion of Fig. 4. The lower portion of Fig. 4 shows one
possible sequence of Sub-DMUs generated by a depth-;rst search. Other sequences are possible.
Fig. 5 illustrates how the Sub-DMUs may be grouped into stages using CPM.
We proceed through the sequence of Sub-DMUs shown in Fig. 4, or equivalently through the
stages shown in Fig. 5, resolving the model for each Sub-DMU using the levels of intermediate
products that the Sub-DMU would have received had all previous Sub-DMUs been e(cient. We note
that there is no need to resolve for the Sub-DMUs in Stage 1 since they consume no intermediate
products.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1381
TBG
Sub-DMU 1 Sub-DMU 3
WG
POS$ RG
EG
Source ES Sub-DMU 5 GW Sink
PIT$ RS
TBS
Sub-DMU 2 Sub-DMU 4
WS
EG
WG
POS$ TBG
TBS RS
WS
Fig. 4. Directed acyclic graph for MLB Network DEA Model (top). One possible sequence of Sub-DMUs generated by
a depth-;rst search of the directed acyclic graph (bottom).
EG
DMU d
Sub- 0 1 Sub- 1 2
DMU 1 TBG DMU 3
POS$
WG
1 0 1 1 1 2
RG
Sub- 2 3
DMU 5
Stage 1 ES Stage 2 GW
1 2 3
RS
Sub- Sub- 1 2 Stage 3
0 1 TBS DMU 4
DMU 2
PIT$
WS 1 1 2
1 0 1
Fig. 5. Grouping of Sub-DMUs in the MLB Network DEA Model into stages using CPM.
Thus, we resolve the models for Sub-DMUs 3 and 4 using the levels of TBG, WG, ES, TBS, and
WS that Sub-DMUs 1 and 2 would have produced had they been e(cient. These are the left-hand
sides of the corresponding constraints in the models for Sub-DMUs 1 and 2 at optimality, and we
1382 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
d3k ¿ 0; d = 1; : : : ; D;
3k ¿ 0:
and
Min E4k
s:t:
D
d4k (ES)d ¿ (ES)∗k ;
d=1
D
d4k (TBS)d ¿ (TBS)∗k ;
d=1
D
d4k (WS)d ¿ (WS)∗k ;
d=1
D
d4k (RS)d 6 E4k (RS)k ;
d=1
D
d4k ¿ 1;
d=1
d4k ¿ 0; d = 1; : : : ; D;
E4k ¿ 0:
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1383
Finally, we resolve the model for Sub-DMU 5 using the levels of RG and RS that Sub-DMUs 3
and 4 would have produced had Sub-DMUs 1– 4 been e(cient. These are the left-hand sides of the
corresponding constraints in the most recently stated models for Sub-DMUs 3 and 4 at optimality.
We denote these values with both pre- and post-superscript asterisks. Hence, we solve
Max 5k
s:t:
D
d5k (RG)d 6∗ (RG)∗k ;
d=1
D
d5k (RS)d ¿∗ (RS)∗k ;
d=1
D
d5k (GW )d ¿ 5k (GW )k ;
d=1
D
d5k = 1;
d=1
d5k ¿ 0; d = 1; : : : ; D;
5k ¿ 0:
The organizational inverse e(ciency of DMU k is
∗ (GW )∗
k
k = ;
(GW )k
where the numerator is the value of the left-hand side of the GW constraint in this last model at
optimality.
We identify one site characteristic, the availability of the designated hitter (DH), which has the
potential to aNect e(ciency in both Front O(ce Sub-DMUs and yet is beyond the control of team
management. Relative to National League teams, American League teams may be more interested
in signing a player with signi;cant oNensive abilities even if he is poor defensively. Such a player
can serve as DH on an American League team but not on a National League team because every
hitter in the National League must also play the ;eld. Conversely, pitchers may prefer to pitch in
the National League where they do not face the DH. Therefore, the front o(ces of MLB teams in
diNerent leagues face diNerent circumstances and, consequently, may have systematically diNerent
e(ciencies.
We do not believe, however, that league a(liation aNects the On-Field Sub-DMUs. Consider
Sub-DMU 3, for example. Suppose that two teams, one in each league, consume identical levels of
TBG, WG, and EG. Given that teams score runs according to the same processes in both leagues, we
see no reason why these teams should score diNerent numbers of runs. A similar argument applies to
1384 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Sub-DMU 4. Consider Sub-DMU 5, and again suppose that two teams, one in each league, consume
identical levels of RG and RS. Again, we see no reason to expect either team to win more games.
We also considered the size of the market in which the team plays as a potential site characteristic
because we recognize that market size can have a major impact on the team’s ability to pay player
salaries. However, we argue that, while beyond the control of management (at least in the short
term—teams can and do relocate), market size is not a site characteristic. We believe that a team’s
ability to pay player salaries aNects the team’s scale of operations, but not its ability to be e(cient. By
imposing variable returns to scale in Sub-DMUs 1 and 2, we believe that market size is inappropriate
as a site characteristic.
Finally, we considered ballpark features as site characteristics. For example, the Colorado Rockies
play at Coors Field, which is over 5000 ft above sea level. This presents signi;cant problems to
pitchers trying to throw pitches that curve and allows batters to hit the ball further than they would
otherwise at or near sea level. As a result, games played at Coors Field tend to be high scoring.
We believe that every team selects players who are likely to perform well at the team’s home ;eld,
where the team plays half of its games. Thus, the front o(ce’s ability to sign players who ;t well
with the idiosyncratic features of team’s ballpark is a component of the e(ciency of the front o(ce.
Therefore, we did not attempt to adjust inverse e(ciency scores for ballpark characteristics.
5. Computational results
We applied the model described above to evaluate the performance of MLB teams during the 1999
season. We obtained data (see Table 3) from a variety of sources. See Appendix A for complete
team names. We extracted player salary data from the USA Today web site [12]. We gathered
games won and the team performance data from the Baseball Archive Database [13]. We obtained
population data for the markets in which teams play from the US Census Bureau web site [14].
Table 4 shows the organizational inverse e(ciency scores for each MLB team in the 1999 season
along with the Sub-DMU inverse e(ciencies from the Network DEA Model. See Figs. 6 and 7 for
histograms of these results, and see Appendix B for detailed results for each team.
We mentioned earlier that the use of the designated hitter in the American League but not the
National League might lead to systematic diNerences in inverse e(ciency scores between the leagues.
We postulate that such eNects might occur in Sub-DMUs 1 and 2, and consequently in the overall
organization. We tested for diNerences in inverse e(ciency in Sub-DMUs 1 and 2 and in the overall
organization using the Wilcoxon Rank Sum test. The P-values were 0.3221 for Sub-DMU 1, 0.2183
for Sub-DMU 2, and 0.7083 for the overall organization. Thus, we found no evidence of systematic
diNerences in inverse e(ciency scores between the two leagues. Had we discovered such diNerences,
we would have applied an adjustment procedure similar to those described in Sexton and Silkman
[15], Norton et al. [16] or Fried et al. [17].
Table 5 repeats the organizational inverse e(ciencies shown in Table 4 supplemented with the
optimal weights placed on each e(cient organization. Note that several teams place positive weight
on CLE and SD even though these teams are not organizationally e(cient. This occurs because we
compute organizational e(ciency relative to the Sub-DMU 5 reference set so that teams that are
e(cient in Sub-DMU 5 may appear in the e(cient reference set of another team at the organizational
level. Tables 6–10 show comparable information for Sub-DMUs 1–5, respectively.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1385
Table 3
Data for the 1999 Major League Baseball season. See Appendix A for complete team names
ANA A $26,675,166 $25,155,000 114 2170 511 106 2338 624 711 826 70 15,781,273
ARZ N $32,359,000 $33,719,999 121 2595 588 104 2238 543 908 676 100 2,931,004
ATL N $39,982,500 $33,602,500 120 2427 608 111 2100 507 840 661 103 3,746,059
BAL A $48,499,139 $30,003,143 116 2522 615 89 2399 647 851 815 78 7,285,206
BOS A $29,817,500 $29,736,000 113 2497 597 127 2199 469 836 718 94 5,633,060
CHC N $37,805,500 $22,386,000 118 2303 571 139 2641 529 747 920 67 8,809,846
CHW A $13,860,000 $10,700,000 112 2421 499 136 2550 596 777 870 75 8,809,846
CIN N $23,213,761 $9,949,000 121 2549 569 105 2227 636 865 711 96 1,948,264
CLE A $46,285,461 $21,776,166 114 2629 743 106 2453 634 1009 860 97 2,911,683
COL N $32,729,190 $23,135,647 120 2696 508 118 2819 737 906 1028 72 2,365,345
DET A $22,098,000 $12,006,666 114 2426 458 106 2494 583 747 882 69 5,457,583
FLA N $7,510,000 $11,366,000 119 2203 479 127 2443 655 691 852 64 3,655,844
HOU N $35,732,500 $15,896,500 121 2306 728 106 2210 478 823 675 97 4,407,579
KC A $11,060,000 $12,646,000 113 2435 535 125 2533 643 856 921 64 1,737,025
LA N $44,651,667 $34,614,286 119 2340 594 137 2341 594 793 787 77 15,781,273
MIL N $26,814,595 $14,581,167 119 2378 658 127 2604 616 815 886 74 1,645,924
MIN A $9,002,500 $10,240,000 115 2110 500 92 2607 487 686 845 63 2,831,234
MON N $9,375,500 $6,800,000 117 2376 438 160 2328 572 718 853 68 3,326,510
NYM N $36,367,928 $26,082,499 123 2421 717 68 2281 617 853 711 97 20,126,150
NYY A $48,351,026 $36,683,666 114 2521 718 111 2204 581 900 731 98 20,126,150
OAK A $8,867,833 $14,366,500 113 2462 770 122 2406 569 893 846 87 6,816,047
PHI N $16,586,666 $13,710,834 121 2412 631 100 2483 627 841 846 77 5,988,348
PIT N $13,376,666 $8,821,000 118 2292 573 147 2303 633 775 782 78 2,346,153
SD N $21,964,680 $25,863,666 119 2119 631 129 2344 529 710 781 74 2,780,592
SF N $30,009,057 $14,934,500 121 2414 696 105 2376 655 872 831 86 6,816,047
SEA A $33,629,836 $16,333,667 114 2536 610 113 2556 684 859 905 79 3,424,361
STL N $25,563,333 $20,135,000 119 2371 613 132 2376 667 809 838 75 2,563,801
TB A $20,092,500 $13,860,000 112 2296 544 135 2508 695 772 913 69 2,256,559
TEX A $48,874,931 $25,960,000 113 2705 611 119 2606 509 945 859 95 4,802,463
TOR A $20,710,333 $23,799,000 114 2581 578 106 2546 575 883 862 84 4,263,757
A = American League, N = National League, POS$ = Position Player Salary, PIT$ = Pitcher Salary, EG = Errors Gained,
TBG = Total Bases Gained, WG = Walks Gained, ES = Errors Surrendered, TBS = Total Bases Surrendered, WS = Walks
Surrendered, RG = Runs Gained, RS = Runs Surrendered, and GW = Games Won.
5.1. The network versus the two-stage and one-stage DEA Models
Table 11 shows the inverse e(ciencies computed using the Network DEA Model, the Two-Stage
DEA Model presented in Sexton and Lewis [2], and the One-Stage DEA Model. The Two-Stage
DEA Model has total player salary as the input to the ;rst stage, total bases gained and total bases
surrendered as intermediate products Kowing from the ;rst stage to the second stage, and Games
Won as the output from the second stage. In the Two-Stage DEA Model, total player salary is the
sum of position player salary and pitcher salary. In addition, total bases gained included walks gained
and errors gained, while total bases surrendered included walks surrendered and errors surrendered.
The One-Stage DEA Model has total player salary as the input and games won as the output.
1386 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Table 4
Inverse e(ciency scores for each MLB team in the 1999 season. The table shows the organizational inverse e(ciencies
along with the inverse e(ciencies for each Sub-DMU
We observe that the mean organizational inverse e(ciency scores rise from 1.1943 for the
One-Stage DEA Model to 1.2102 for the Two-Stage DEA Model and to 1.2414 for the Network DEA
Model. The respective (population) standard deviations 4 are 0.1527, 0.1548, and 0.1742, yielding
coe(cients of variation of 12.8%, 12.8%, and 14.0%. Thus, organizational inverse e(ciencies tend
to rise and to become slightly more spread with increasing model complexity. This suggests that the
more complex models are more capable of identifying sources of ine(ciency, thereby potentially
yielding greater managerial insights into organizational improvements.
4
We use the population standard deviation rather than the sample version because we have included the entire population
of 30 teams.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1387
Fig. 6. Histogram of MLB organizational inverse e(ciencies from the Network DEA Model.
We see that in most cases (21 of 30, 70%), the organizational inverse e(ciency scores computed
using the One- and Two-Stage DEA Models diNer by less than 0.05 in absolute value, suggesting that
the two models provide reasonably similar results in these cases. In 7 of the 30 cases (23.3%), the
inverse e(ciency produced by the Two-Stage DEA Model exceeds that produced by the One-Stage
DEA Model by more than 0.05. Conversely, in 2 of the 30 cases (6.7%), the inverse e(ciency
produced by the One-Stage DEA Model exceeds that produced by the Two-Stage DEA Model by
more than 0.05. If we accept diNerences of 0.05 or greater as potentially meaningful, then the two
models produce such diNerences in 30% of the cases. We elaborate on these diNerences in Sexton
and Lewis [2].
By contrast, we ;nd considerably larger diNerences between the organizational inverse e(ciencies
computed using the Network DEA Model and those computed using the Two-Stage DEA Model.
We see that the absolute diNerence is less than 0.05 in only 13 of the 30 cases (43.3%). In only
four cases (13.3%) does the Network DEA Model produce a lower organizational inverse e(ciency
score. However, all these (absolute) diNerences are greater than 0.05. In particular, we note that
the Network DEA Model’s organizational inverse e(ciency for MIN is lower by more than 0.27,
that for FLA is lower by more than 0.19, and that for TB is lower by more than 0.13. In 13 cases
(43.3%), the Network DEA Model produced an organizational inverse e(ciency score that was at
least 0.05 higher than that produced by the Two-Stage DEA Model. In particular, we note that the
Network DEA Model’s organizational inverse e(ciency for MON is higher by more than 0.33, and
that for PIT is higher by more than 0.20.
We observe that only one team, ATL, is e(cient regardless of model, and that only 12 teams
(40%), including ATL, had all three organizational inverse e(ciencies within 0.05 of one another.
This indicates that the model complexity is an important consideration in evaluating organizational
e(ciency.
1388 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Fig. 7. Histograms of MLB Sub-DMU inverse e(ciencies from the Network DEA Model.
Consider the case of the Minnesota Twins (MIN). The One- and Two-Stage DEA Models evaluate
the organizational inverse e(ciency of MIN as 1.2104 and 1.2749, respectively, worse than average in
both cases. However, the Network DEA Model identi;es MIN as e(cient, running counter to the gen-
eral pattern cited above in which organizational inverse e(ciency increases with model complexity.
This suggests that the richer detail of the Network DEA Model allows a more comprehensive analysis
that reveals MIN’s e(ciency more accurately.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1389
Table 5
Organizational inverse e(ciencies for each MLB team in the 1999 season and the optimal reference set weights
Consider next the cases of the Montreal Expos (MON) and the Oakland Athletics (OAK). Both
were on the e(cient frontier in the One-Stage DEA Model. The Two-Stage DEA Model found some
ine(ciency, producing organizational inverse e(ciency scores of 1.1536 and 1.1105, respectively.
The Network DEA Model revealed more ine(ciency, elevating MON to 1.4841 and OAK to 1.1676.
Thus, the richer detail provided by the Two-Stage and Network DEA Models reveals that neither
team is e(cient and that MON in particular is highly ine(cient.
In this section, we refer the reader to Appendix B. We observe that the Atlanta Braves (ATL) and
the Minnesota Twins (MIN) are organizationally e(cient in the Network DEA Model. However, ATL
1390 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Table 6
Sub-DMU 1 inverse e(ciencies for each MLB team in the 1999 season and the optimal weights placed on each e(cient
Sub-DMU 1
has inverse e(ciency scores of 1.0769 and 1.0383 in Sub-DMUs 1 and 3, respectively. This suggests
that ATL could improve the on-;eld production of the position players it acquires in the front o(ce,
and that the on-;eld operation could gain more runs based on the oNensive performance of its
players. Similarly, MIN has an inverse e(ciency score of 1.0731 in Sub-DMU 4, suggesting that
its on-;eld operation could surrender fewer runs based on the defensive performance of its players.
Therefore, the Network DEA Model may permit an e(cient team to ;nd areas of improvement.
Consider again the Montreal Expos (MON). Their organizational inverse e(ciency is 1.4841,
indicating that the team could have won many more games than it did based on its inputs. How-
ever, the Sub-DMU inverse e(ciency scores, 1.0383, 1, 1, 1.0022, and 1.0083 for Sub-DMUs 1–5,
respectively, shed little light on how MON might improve. A closer examination of the Kows of the
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1391
Table 7
Sub-DMU 2 inverse e(ciencies for each MLB team in the 1999 season and the optimal weights placed on each e(cient
Sub-DMU 2
intermediate products reveals MON’s areas of improvement. We see that Sub-DMU 1 produced 2376
TBG, 438 WG, and 160 ES. The reference Sub-DMU 1 for MON produced 2467 TBG, 762 WG,
and 121 ES. The factor inverse e(ciency scores are 1.0383, 1.7392, and 1.3189, respectively, the
minimum of which is the Sub-DMU 1 inverse e(ciency. Therefore, Sub-DMU 1 at MON could have
gained almost 74% more walks than it did, and it surrendered almost 32% more errors than did its
reference Sub-DMU. This poor performance propagated through the organization causing Sub-DMU
3 to gain only 718 runs, rather than 915, and Sub-DMU 4 to surrender 853 runs rather than 730.
Had they gained 915 runs and surrendered 730, they could have won 101 games, rather than 68.
The Houston Astros (HOU) provide an interesting parallel with MON. The two teams have nearly
identical Sub-DMU inverse e(ciency scores. For HOU, they are 1.0317, 1, 1, 1.0013, and 1.0228
1392 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Table 8
Sub-DMU 3 inverse e(ciencies for each MLB team in the 1999 season and the optimal weights placed on each e(cient
Sub-DMU 3
Team Inverse Reference set weights
e(ciency
ANA CHW CLE COL DET HOU KC MIN MON OAK SD TB TEX
ANA 1 1 0 0 0 0 0 0 0 0 0 0 0 0
ARZ 1.0154 0 0 0.3050 0.38635 0 0 0.30869 0 0 0 0 0 0
ATL 1.0383 0 0 0.39475 0 0 0 0.3450 0.26025 0 0 0 0 0
BAL 1.0786 0 0 0.39023 0.04328 0 0 0.56649 0 0 0 0 0 0
BOS 1.0618 0 0 0.22106 0 0 0 0.37328 0 0 0 0 0.22106 0.1846
CHC 1.0720 0 0 0.26838 0 0 0 0.16527 0.56635 0 0 0 0 0
CHW 1 0 1 0 0 0 0 0 0 0 0 0 0 0
CIN 1.0417 0 0 0.20079 0.28754 0 0 0.51168 0 0 0 0 0 0
CLE 1 0 0 1 0 0 0 0 0 0 0 0 0 0
COL 1 0 0 0 1 0 0 0 0 0 0 0 0 0
DET 1 0 0 0 0 1 0 0 0 0 0 0 0 0
FLA 1.0101 0 0 0 0 0 0 0.00611 0.65173 0.34216 0 0 0 0
HOU 1 0 0 0 0 0 1 0 0 0 0 0 0 0
KC 1 0 0 0 0 0 0 1 0 0 0 0 0 0
LA 1.0408 0 0 0.3700 0 0 0 0.11682 0.51317 0 0 0 0 0
MIL 1.0522 0 0 0.51193 0 0 0 0 0.23157 0 0 0.2565 0 0
MIN 1 0 0 0 0 0 0 0 1 0 0 0 0 0
MON 1 0 0 0 0 0 0 0 0 1 0 0 0 0
NYM 1.0413 0 0 0.46316 0 0 0.35182 0 0 0 0 0.1850 0 0
NYY 1.0443 0 0 0.68513 0 0 0 0 0.15743 0 0.15743 0 0 0
OAK 1 0 0 0 0 0 0 0 0 0 1 0 0 0
PHI 1.0358 0 0 0.52631 0 0 0 0.08875 0.38494 0 0 0 0 0
PIT 1.0270 0 0 0.2854 0 0 0 0.10424 0.61036 0 0 0 0 0
SD 1 0 0 0 0 0 0 0 0 0 0 1 0 0
SF 1.0126 0 0 0.5770 0 0 0.00386 0 0 0 0 0.41913 0 0
SEA 1.0691 0 0 0.36319 0.0910 0 0 0.52066 0 0 0 0 0 0.02518
STL 1.0456 0 0 0.45371 0 0 0 0.07854 0.46775 0 0 0 0 0
TB 1 0 0 0 0 0 0 0 0 0 0 0 1 0
TEX 1 0 0 0 0 0 0 0 0 0 0 0 0 1
TOR 1.0284 0 0 0.09691 0.1290 0 0 0.42768 0 0 0 0 0 0.3464
for Sub-DMUs 1–5, respectively, versus 1.0383, 1, 1, 1.0022, and 1.0083 for MON. However,
HOU’s organizational inverse e(ciency is 1.0581, versus 1.4841 for MON. This is because HOU’s
Sub-DMU 1 lies very close to its reference Sub-DMU with respect to all three intermediate products
that it produces. Its factor inverse e(ciency scores are 1.0613, 1.0314, and 1.0317 for TBG, WG,
and ES, respectively. This illustrates the crucial role played by the slacks in a Network DEA Model.
Consider next the Colorado Rockies (COL). They are perfectly e(cient in Sub-DMUs 1, 3, and
5 while Sub-DMUs 2 and 4 have inverse e(ciency scores of 1.3074 and 1.1796, respectively.
Thus, they convert position player salary e(ciently into oNensive and ;elding performance, and
they convert oNensive performance e(ciently into runs gained. In addition, they convert runs gained
and runs surrendered e(ciently into games won. All of the organization’s ine(ciency lies in the
Sub-DMUs that deal with pitching. This reKects the conditions under which COL plays its home
games. Coors Field is located in Denver more than 5000 feet above sea level, an elevation that
creates signi;cant advantages for hitters and disadvantages for pitchers.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1393
Table 9
Sub-DMU 4 inverse e(ciencies for each MLB team in the 1999 season and the optimal weights placed on each e(cient
Sub-DMU 4
We gain a diNerent insight from the analysis of the Los Angeles Dodgers (LA). Here we ;nd
that the primary sources of ine(ciency are Sub-DMUs 1 (1.1356) and 2 (1.1148), which constitute
the team’s front o(ce operations, and Sub-DMU 5 (1.1070), which converts runs gained and runs
surrendered into games won. This suggests ;rst that LA could improve the quality of its player
evaluation processes. In addition, the team lost 8 games on the ;eld that they could have won based
on the runs they gained and surrendered. Poor e(ciency in Sub-DMU 5 may occur when a team
loses many close games.
Finally, we investigate the Florida Marlins (FLA). While FLA is slightly more ine(cient than
average in Sub-DMUs 2 (1.0984) and 4 (1.0908), it is e(cient in Sub-DMUs 1 and 5 and nearly
so (1.0101) in Sub-DMU 3. Its organizational inverse e(ciency is 1.0702, considerably better than
average (1.2414). FLA won only 64 games and, even if it had been organizationally e(cient, it still
1394 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
Table 10
Sub-DMU 5 inverse e(ciencies for each MLB team in the 1999 season and the optimal weights placed on each e(cient
Sub-DMU 5
ANA 1 1 0 0 0 0 0 0 0 0
ARZ 1.0255 0 0.92462 0 0.07538 0 0 0 0 0
ATL 1 0 1 0 0 0 0 0 0 0
BAL 1.1403 0 0.31674 0.30678 0 0 0 0 0 0.37648
BOS 1.0309 0 0.7579 0.14859 0 0 0 0 0 0.09351
CHC 1 0 0 1 0 0 0 0 0 0
CHW 1.0038 0 0.17694 0.75465 0 0 0 0 0 0.06841
CIN 1.0501 0 0.7495 0.00658 0 0 0 0 0 0.24392
CLE 1 0 0 0 1 0 0 0 0 0
COL 1 0 0 0 0 1 0 0 0 0
DET 1.0195 0 0.06246 0.78054 0 0 0 0 0.1570 0
FLA 1 0 0 0 0 0 1 0 0 0
HOU 1.0228 0 0.86923 0 0 0 0 0 0.13077 0
KC 1.2275 0 0 0.40611 0 0.22028 0 0 0 0.37361
LA 1.1070 0 0.50267 0.47712 0 0 0 0 0.02021 0
MIL 1.0528 0 0.05666 0.62652 0 0 0 0 0 0.31682
MIN 1 0 0 0 0 0 0 1 0 0
MON 1.0083 0.6321 0 0.25639 0 0 0.11151 0 0 0
NYM 1.0256 0 0.76555 0.05867 0 0 0 0 0 0.17578
NYY 1.0295 0 0.64824 0 0.35176 0 0 0 0 0
OAK 1.0405 0 0.1260 0.19582 0 0 0 0 0 0.6782
PHI 1.1008 0 0.19553 0.42156 0 0 0 0 0 0.38291
PIT 1.0657 0 0.39973 0.35229 0 0 0 0 0.24798 0
SD 1 0 0 0 0 0 0 0 1 0
SF 1.0428 0 0.21919 0.25245 0 0 0 0 0 0.52836
SEA 1.0211 0 0 0.40987 0 0.12425 0 0 0 0.46588
STL 1.0934 0 0.27306 0.54206 0 0 0 0 0 0.18488
TB 1.0211 0 0 0.87286 0 0.00447 0 0 0 0.12267
TEX 1 0 0 0 0 0 0 0 0 1
TOR 1.0456 0 0.0699 0.27606 0 0 0 0 0 0.6540
could have won only 68 games, less than 42% of the games it played. Therefore, we conclude that
FLA’s poor performance was the result of its low player salaries rather than signi;cant ine(ciencies
in its operations. Thus, the management solution for FLA is to spend more money to acquire better
players while maintaining its organizational e(ciency.
We conclude that the Network DEA Model can provide managerial insight into the sources of
ine(ciency within an organization that a standard DEA Model cannot provide. This insight allows
management to focus its attention on the relatively ine(cient Sub-DMUs and to identify particular
intermediate products that fall short of their target values. We believe that the Network DEA Model
oNers management the critical information it needs to improve operations, even if the DMU is
organizationally e(cient.
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1395
Table 11
Organizational inverse e(ciencies computed using the Network DEA Model, the Two-Stage DEA Model, and the
One-Stage DEA Model
DEA models treat the DMU as a “black box.” Inputs enter and outputs exit, with no consideration
of the intervening steps. Consequently, it is di(cult, if not impossible, to provide individual DMU
managers with speci;c information regarding the sources of ine(ciency within their DMUs.
1396 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
We have shown how to use DEA to look inside the DMU, allowing greater insight as to the
sources of organizational ine(ciency. Our model applies to DMUs that consist of two or more
Sub-DMUs, some of which consume resources produced by other Sub-DMUs and some of which
produce resources consumed by other Sub-DMUs. Our Network DEA Model allows for either an
input orientation or an output orientation, any of the four standard assumptions regarding returns to
scale in any Sub-DMU, and adjustments for site characteristics in each Sub-DMU. We have also
demonstrated how to incorporate reverse quantities as inputs, intermediate products, or outputs. Thus,
we can apply the Network DEA Model presented here in many managerial contexts.
By applying the Network DEA Model to MLB, we have demonstrated the advantages of the
Network DEA Model over simpler DEA models. Speci;cally, the Network DEA Model can detect
ine(ciencies that simpler DEA models miss. Perhaps of greatest signi;cance, the Network DEA
Model allows individual DMU managers to focus speci;c e(ciency-enhancing strategies on the
individual components of the production process.
Appendix A.
Complete team names and their designators are given in Table 12.
Table 12
Team Symbol
Anaheim Angels ANA
Arizona Diamondbacks ARZ
Atlanta Braves ATL
Baltimore Orioles BAL
Boston Red Sox BOS
Chicago Cubs CHC
Chicago White Sox CHW
Cincinnati Reds CIN
Cleveland Indians CLE
Colorado Rockies COL
Detroit Tigers DET
Florida Marlins FLA
Houston Astros HOU
Kansas City Royals KC
Los Angeles Dodgers LA
Milwaukee Brewers MIL
Minnesota Twins MIN
Montreal Expos MON
New York Mets NYM
New York Yankees NYY
Oakland Athletics OAK
Philadelphia Phillies PHI
Pittsburgh Pirates PIT
San Diego Padres SD
San Francisco Giants SF
Seattle Mariners SEA
St. Louis Cardinals STL
Tampa Bay Devil Rays TB
Texas Rangers TEX
Toronto Blue Jays TOR
H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410 1397
Appendix B.
Detailed results from the Network DEA Model for each MLB team are given in Table 13.
Table 13
Table 13 (continued )
Table 13 (continued )
Sub-DMU 2 1
EG 113 Team CLE KC TB TEX
RG 836 888 926 1.0618 Weight 0.2211 0.3733 0.2211 0.1846
Sub-DMU 3 1.0618
RS 718 716 692 1.0021 Team ATL PIT
Sub-DMU 4 1.0021 Weight 0.5799 0.4261
GW 94 97 102 1.0309 Team ATL CHC TEX
Sub-DMU 5 1.0309 Weight 0.7579 0.1486 0.0935
GW 94 102 1.0857 Team ATL CLE
Organization 1.0857 Weight 0.8428 0.1572
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
ES 92 92 1
Sub-DMU 1 1
PIT$ $10,240,000 Team MIN
TBS 2607 2607 1 Weight 1
WS 487 487 1
Sub-DMU 2 1
EG 115 Team MIN
RG 686 686 686 1 Weight 1
Sub-DMU 3 1
RS 845 787 787 1.0731 Team ARZ
Sub-DMU 4 1.0731 Weight 1.1649
GW 63 63 63 1 Team MIN
Sub-DMU 5 1 Weight 1
GW 63 63 1 Team MIN
Organization 1 Weight 1
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
Table 13 (continued )
The column labeled “*” shows the hypothetical values of TBG, WG, and ES if Sub-DMU 1
had been e(cient, the hypothetical values of TBS and WS if Sub-DMU 2 had been e(cient,
the hypothetical value of RG if Sub-DMU 3 had been e(cient, the hypothetical value of RS
if Sub-DMU 4 had been e(cient, and the hypothetical value of GW if Sub-DMU 5 had been
e(cient. The column labeled “**” shows the hypothetical values of RG, RS, and GW had all
preceding Sub-DMUs been e(cient. We have rounded all hypothetical values to the nearest in-
teger. The column labeled “Ratio” shows the ratio of the larger to the smaller of the hypo-
1410 H.F. Lewis, T.R. Sexton / Computers & Operations Research 31 (2004) 1365 – 1410
thetical and actual values. The column labeled “Inv EN” shows the inverse e(ciencies for each
Sub-DMU and for the organization. The columns labeled “E(cient reference sets” show the teams
in the e(cient reference sets at each Sub-DMU and for the organization, and their corresponding
weights.
References
[1] Castelli C, Pesenti R, Ukovich W. DEA-like models for e(ciency evaluations of specialized and interdependent
units. European Journal of Operational Research 2001;132:274–86.
[2] Sexton TR, Lewis HF. Two-stage DEA: an application to major league baseball. Journal of Productivity Analysis
2003;19(2/3):227–49.
[3] FLare R, Whittaker G. An intermediate input model of dairy production using complex survey data. Journal of
Agricultural Economics 1995;46(2):201–23.
[4] FLare R, Grosskopf S. Network DEA. Socio-Economic Planning Sciences 2000;34:35–49.
[5] Kumbhakar SC, Knox Lovell CA. Stochastic frontier analysis. Cambridge: Cambridge University Press, 2001.
[6] Skiena SS. The algorithm design manual. New York: Springer-Verlag, 1998. p. 94, 273– 4.
[7] Sedgewick R. Algorithms in C + +. Reading, MA: Addison-Wesley, 1992. p. 479 –81.
[8] Kruse RL, Leung BP, Tondo CL. Data structures and program design in C. Englewood CliNs, NJ: Prentice-Hall,
1991. p. 391–5.
[9] Wiest J, Levy F. Management guide to PERT-CPM, 2nd ed. Englewood CliNs, NJ: Prentice-Hall, 1977.
[10] Seiford LM, Thrall RM. Recent developments in DEA: the mathematical programming approach to frontier analysis.
Journal of Economics 1990;46:7–38.
[11] Lewis HF, Sexton TR. Data envelopment analysis with reverse inputs and outputs. Forthcoming in Journal of
Productivity Analysis.
[12] USA Today (2000). http://www.usatoday.com/sports/baseball/salary/index.htm
[13] Baseball Archive Database (1999). Version 3.0. http://www.baseball1.com/
[14] U.S. Census Bureau (2000). http://www.census.gov/population/estimates/metro-city/ma98-02.txt
[15] Sexton TR, Silkman R. Adjusting inputs for site characteristics. Harriman School Working Paper, Stony Brook
University, 1999.
[16] Norton RN, Sexton TR, Silkman RH. Firm-speci;c productive e(ciency oNsets in the development of a price cap
formula. The Electricity Journal 2002;15(10):43–52.
[17] Fried HO, Schmidt SS, Yaisawarng S. Incorporating the operating environment into a nonparametric measure of
technical e(ciency. Journal of Productivity Analysis 1999;12:249–67.
Herbert F. Lewis is a Lecturer at the W. Averell Harriman School for Management and Policy at the State University
of New York at Stony Brook. He received his Ph.D. in 1996 in operations research from the Department of Applied
Mathematics and Statistics at the State University of New York at Stony Brook. His research interests include productivity
and e(ciency analysis using data envelopment analysis, and solution methods for combinatorial problems in scheduling,
vehicle routing, and facility location.
Thomas R. Sexton is the Director of the W. Averell Harriman School for Management and Policy at the State Univer-
sity of New York at Stony Brook. He earned his Ph.D. in Applied Mathematics and Statistics from the State University
of New York at Stony Brook in 1979. He has published extensively on topics related to productivity, health care manage-
ment, and vehicle routing and scheduling. His recent research is in the areas of productivity and e(ciency measurement
methodologies, student perceptions of Internet learning, and the eNects of mandatory auditor rotation and retention on
audit market share.