Professional Documents
Culture Documents
ABSTRACT
This is a case study of a closely managed product. Its purpose is to
determine whether time-series methods can be appropriate for business
planning. By appropriate, we mean two things: whether these methods can
model and estimate the special events or features that are often present in
sales data; and whether they can forecast accurately enough one, two and
four quarters ahead to be useful for business planning. We use two time-
series methods, Box-Jenkins modeling and Holt-Winters adaptive
forecasting, to obtain forecasts of shipments of a closely managed
product. We show how Box- Jenkins transfer-function models can
account for the special events in the data. We develop criteria for choosing
a final model which differ from the usual methods and are specifically
directed towards maximizing the accuracy of next-quarter, next-half-year
and next-full-year forecasts. We find that the best Box-Jenkins models
give forecasts which are clearly better than those obtained from
Holt-Winters forecast functions, and are also better than the judgmental
forecasts of IBM’s own planners. In conclusion, we judge that
Box-Jenkins models can be appropriate for business planning, in
particular for determining at the end of the year baseline business-as-usual
annual and monthly forecasts for the next year, and in mid-year for
resetting the remaining monthly forecasts.
Business planning often starts with an annual financial goal from which annual targets or plans
for financial measurements and sales of individual products must be generated. Business
planners take each annual plan and further apportion it into monthly numbers which represent
their expectation and a feasible way of reaching the annual plan, in terms of manufacturing
schedules and marketing support. The monthly planning numbers are called the track. They
are based on historical patterns of seasonality in the data and on the planner’s judgment of
such matters as manufacturing capability, product announcements and price cuts.
One important aspect of business planning is assessing, as the year progresses, whether the
annual plan is attainable or whether action needs to be taken to achieve the target; and, when
0277-6693/91/060579- 17$08.50 Received July 1989
@ 1991 by John Wiley & Sons, Ltd. Revised December 1989
580 Journal of Forecasting Vol. 10, Iss. No. 6
action is required, judging the likely effect of feasible choices. Wu (1988) has described how
to model and quantify the uncertainties in this assessment process. She showed how a planning
model, the ‘Track Uncertainty Model’, can be used to construct planning charts (called the
WINEGLASS, SHIPWRECK and OUTLOOK charts) to determine whether we are on track,
to assess the attainability of the annual plan and to make outlooks for the current year; and,
if we are off track, to find where the problem is. In the Track Uncertainty Model outlooks are
based on the year-to-date estimate of the ratio of annual sales to the annual plan. This ratio
is assumed to vary from year to year and is estimated with data from the current year only.
Therefore outlooks made from this model are for the current year only.
Two other areas important to business planning are making forecasts across years and
measuring the effect of special marketing and manufacturing events on sales. Developing
forecasts for the next year is important for several reasons: for cross-checking the
reasonableness of annual plans developed from the financial goals; towards the end of a year,
for setting manufacturing schedules for the beginning of the next year; and for establishing
a baseline business-as-usual forecast for the next year. The modeling and forecasting of sales
data is difficult because of discontinuities caused by special events such as the introduction of
new products and marketing programs. Failure to account for these discontinuities will lead
to erratic and unreliable forecasts. However, if good forecasts are available then the effect of
a special event can be estimated, as the difference between the forecast and actuality. In this
way, one can obtain a reasonable value for the annual plan for the next year, and a track for
reaching it. Start with the baseline business-as-usual forecast. Next, consider any special events
which are expected in the next year, and estimate their effects to be what has been measured
for analogous events in the past. Add these effects to the baseline forecast to obtain the track.
In this paper we examine two time-series forecasting methods: the Box-Jenkins model and
the Holt-Winters forecasting function. We describe the criteria and steps used to construct our
forecasting models; we show how the effect of one particular special event, a constraint on
shipments, can be estimated; and we assess how well these methods forecast the next quarter,
the next half-year and the next full year. The methods are applied to a time series of shipments
of a closely managed product, which we call Boxes. By ‘closely managed’ we mean that there
is a monthly sales objective, the track, and that shipments are managed with the aim of
achieving it. At first, one might not expect time-series forecasts to be as accurate as the track
for predicting future shipments. However, we found that this was not the case. Box-Jenkins
modeling yielded better forecasts than the planner tracks, and we judge it to be an appropriate
method for planning and forecasting sales of Boxes.
The structure of the paper is as follows. The next section contains a description of the data.
Our Box-Jenkins and Holt-Winters modeling procedures are described in the third and fourth
sections, respectively, The final section contains a summary and conclusions, and compares
our forecasts with the planner track.
DATA DESCRIPTION
Data on shipments of Boxes consists of monthly observations from January 1982. The data
set is listed in Table I. T o maintain confidentiality, the numbers have been rescaled by a
constant which we do not disclose. This rescaling does not, of course, affect the dependence
structure of the time series. In our analysis we use the 72 observations between January 1982
and December 1987 to build a time-series model to best describe the data, and then we use this
period to forecast the number of shipments for 1988. We denote the 1982-7 Boxes data by
L . S.-Y. Wu, N . Ravishanker and J. R. M . Hosking IBM Product Sales 581
X,, t = 1,2, ... , 72. The series is shown in Figure 1. The mean of the observations is 6100 and
the variance is 1.82 x 10’.
The series displays a strong seasonal pattern, with peaks in the last month of a quarter and
a particularly large peak in December. There are also two special features in the data which
are important for building a time-series model:
The variability in the data increased at about the end of 1984: this period corresponds to
the introduction of a new model. The point at which the variance change occurs is difficult
to judge exactly; we take the change as occurring at time point t = 33, corresponding to
September 1984. The variance in the data after the introduction of the new model is about
five times the variance before its introduction. To account for this change in variance, we
transform the data to stabilize the variance and use the transformed data for modeling
and forecasting. The transformation is discussed in the next section.
The data in the third quarter of 1987 are affected by an external constraint on shipments
during July and August 1987. Boxes which would have been shipped in July and August
were not shipped until September 1987. This means that September 1987 was unusually
high at the expense of July and August 1987. To correct this anomaly, a reallocation of
shipments within these three months is called for. This is specially important, because
these observations greatly influence the values of the forecasts obtained for 1988.
582 Journal of Forecasting Vol. 10, Iss. No. 6
n
U
V
N
W
LY
3
+ w
CL
3
12 0
LL
LL
n!
<
,
0'1 S'O 0 S'O- 0'1- 0'1 S'O 0 S'O- O ' L - oz SL 01 s 0
N011Vl3tltl0301nV N 0 1 1 V l 3 t l k i 0 3 0 1 ~ V1VlltlVd MISN30 l W 1 3 3 d S
. -
/ -
h
n :
m
v
0
V a
V
N N v) N
; + W
CL
3
a
4 W
3
CL
3
12 12 12
LL LL LL
I LAGS
a'l
LAGS
ES
3 -
8
eo
0
-
w -
Eg
om
~
Let Y, = ( 1 - B 3 ) X fdenote the seasonally differenced Boxes series. Let the parameter k denote
the ratio of the variance after the change point c to the variance before the change point c,
where c corresponds to t = 33., Then Y, is assumed to follow an ARMA model:
(a: t>c
L . S.-Y. Wu, N . Ravishanker and J . R. M. Hosking IBM Product Sales 505
t l 1 1 1
! * . .. .,. 1
.. .
-t
I LAGS
Now let
In the Appendix we show that the log-likelihoods of model (2) and of an ARMA(p, q ) model
fitted with constant innovation variance to Zlk' are approximately equal, apart from a term
+ c log k . We use this result when fitting a full transfer-function model for the Boxes series,
described later.
t = 67, 68 and 69. To effect the reallocation from September into July and August, define
1 t=67
0 otherwise
and
1 t=68
0 otherwise
The series which we want to use as the basis for forecasting is not X I but X I + aUf.l + PUt,2.
The parameters a and P may be regarded as the numbers of Boxes reallocated from September
1987 into July and August, respectively, and are as yet undetermined.
4 (B)+ ( B ) Nf
~ = 8 (B)e( B )tf (8)
where Z j k ’ , Uf,l and Ut,2 are as defined in equations (4), (5) and (6), respectively. This is a
transfer-function model for Z l k ’ , a transformation of Y,. The input series Ut,,and U,,Zare
wholly deterministic, so the model resembles those used in intervention analysis (Box and Tiao,
1975). Estimation of the model involves the simultaneous estimation of all the model
parameters: k, a, and the p + q + P + Q coefficients of the 4, 8, 9 and 8 polynomials. This
is done using the following two-step procedure.
Step 1.
For each of a grid of values of k, in this case k = 1.0,1.2, ..., 5.0, the transformed series Zjk’
is formed and the parameters a! and and the coefficients of the 4, 8, 9,8 polynomials are
estimated by the method of maximum likelihood for a transfer-function model. This
estimation procedure is available in standard software, e.g. Genstat (Genstat 5 Committee,
1987). The maximized log-likelihood t ( k ) , is computed by substituting the maximum-
likelihood estimates for the parameters and the chosen value of k into the log-likelihood
function of the transfer function model.
Step 2.
The optimal value of k is taken to be that value which maximizes the function L ( k ) + 5 c log k ,
with c = 33. This value, %, and the estimates for the other parameters at this value of k, are
the final simultaneously estimated parameters of the transfer-function model.
Estimated parameters of transfer-function models, using each of the candidate seasonal
models identified above are shown in Table 11. Some comments on these estimated parameters
are called for. The estimates of cx and P in all three models are not statistically significant.
Nonetheless, we consider these parameters essential to the model because we know a priori that
there was a constraint on shipments: we do not need to diagnose the presence of a constraint
from the data alone. All the fitted models are stationary and invertible, though some roots of
L . S.-Y. Wu, N . Ravishanker and J. R. M . Hosking IBM Product Sales 587
Table 11. Estimates of transfer-function model parameters. Standard errors of the estimates are given in
parentheses
~~ ~ ~~~
the autoregressive and moving-average polynomials lie close to the unit circle. For example,
the seasonal autoregressive operator of Model2, 1 + 1 .016B3 + 0.954B6 + 0.930B9, factorizes
as (1 + 0.996B3)(1+ 0.020B3+ 0.934B6), which may be regarded as containing a factor of
1 + B3 or even (1 + B 3 ) ( l + B 6 ) . This would suggest, since the seasonal differencing operator
1 - B 3 is also present in this model, that seasonal differencing with period
6 (1 - B6 = (1 - B 3 ) ( l + B 3 ) ) or even period 12 ( 1 - B I Z = ( 1 - B 3 ) ( l + B 3 ) ( l + B 6 ) ) may be
appropriate. We found, however, that models containing seasonal differencing with periods 6
and 12 gave inferior forecasts compared with Model 2.
Autocorrelations of the residual series from fitting each of the models are given in Figure 6 .
Those of Model 1 are not significantly different from zero, but there is some dependence in
the residuals from Models 2 and 3 . Therefore, based on the residual diagnostics, it may be
doubtful whether Models 2 and 3 adequately fit the data. However, Model 1 differs from
Model 2 only by the inclusion of autoregressive terms at lags 1 and 2, parameterized by 4, and
$2, and neither of these parameters is strongly significant in Model 1 .
588 Journal of Forecasting Vol. 10, Iss. No. 6
I
LAGS
9.
I
LAGS
LAGS
c IR-xfl
QFC = Mean " x 100 (9)
it I
c XI
r t Q,
zf
where Qi denotes the ith quarter, and and X I the forecast and the actual value, respectively,
at time t . I is an index set over which the mean is taken. We prefer QFC to more familiar
criteria such as MAPE (Mean Absolute Percentage Error, used, for example, by Makridakis
el al., 1982) so as not to give too much influence to forecast errors in months when shipments
are low.
QFCs for the three candidate models are shown in TableIII. In these QFCs, the index set
I is the forecast period (one quarter for next-quarter forecasts, two quarters for next-half-year
L . S.-Y. Wu, N . Ravishanker and J. R . M. Hosking IBM Product Sales 589
Table 111. QFCs for next-quarter, next-half-year and next-full-year forecasts from Box-Jenkins models
(1,2.3) and Holt-Winters forecast functions (A, B, C)
Next
quarter
12/86 IQ87 27.0 23.1 18.7 40.6 40.9 44.4
3/87 2487 14.0 14.4 11.4 9.4 3.3 5.9
6/87 3Q87 - - - - - -
9/87 4Q87 27.3 28.7 33.7 22.4 16.0 20.2
12187 lQ88 13.4 6.9 16.2 27.9 15.5 27.3
3/88 2488 9.2 11.0 8.1 18.9 17.2 18.8
6/88 3488 19.8 17.4 21 .o 35.4 30.8 33.1
9/88 4488 7.8 7.8 5.6 22.8 15.5 21.8
Next
half-year
12/86 1H87 15.7 18.5 11.6 24.7 23.2 26.9
6/87 2H87 31.8 37. I 39.9 35.7 29.7 33.5
12/87 1H88 14.1 7.8 14.0 24.6 16.5 24.0
6/88 2H88 14.5 15.7 16.2 28.3 24.0 26.4
Next
full year
12/86 87 26.3 26.2 25.9 30.5 26.2 30.9
12/87 88 15.3 10.4 15.2 26.6 20.6 25.3
’1487 denotes first quarter of 1987, 1H87 denotes first half of 1987, erc.
bQFC is not given for the third quarter of 1987, since the monthly data for this quarter are unreliable.
forecasts, four quarters for next-full-year forecasts). For forecasts of 1988 shipments, Model
2 yields, on average, the lowest QFCs at all three lead times. For the 1987-8 forecasts as a
whole there is less to choose between the three models, but Model 2 still performs
competitively. We therefore choose Model 2, the transfer-function model with an
ARIMA(0, 0,O) x (3, 1,3)3 seasonal model, as the final model for forecasting the Boxes series.
where ( Y I , (112 and (113 are smoothing parameters which lie between 0 and 1. The forecast made
at month t for 1 months ahead xl(l),is computed as
%(I) = ( i ,+ fiIl)S,+,-l*, 1 = 1, ... ,12 (14)
To produce forecasts, three components are needed: initial values for the level, slope and
seasonal factors, a set of smoothing parameters, and a set of data over which to run the
updating equations to produce estimates of the current level, slope and seasonal factors. These
three components constitute a HW forecast function or HW function.
where x~(1)is the I-month-ahead forecast made at J and given by equation (14). We then
replace the shipments x J + I , xJ+2and xJ+3by the reallocated XJ+I,2J+2and xJ+3in the
L . S.-Y. Wu,N . Ravishanker and J. R. M . Hosking IBM Product Sales 591
updating equations. This reallocation is based on the forecast from June 1987 alone, an
extrapolation from the past data. In contrast, the transfer-function reallocation described in
the previous section is based not only on the forecast from June 1987 but also on how well
the reallocated data forecasts October through December 1987, and may be regarded as an
interpolation using all the available data. Because the transfer-function reallocation uses more
information than the HW reallocation, we expect it to be more accurate.
CONCLUSIONS
We have attempted to decide whether time-series methods can be appropriate for business
planning. By ‘appropriate’ we mean two things: (1) whether these methods can model and
estimate the special events or features often present in sales data; and (2) whether they can
forecast accurately enough one, two and four quarters ahead to be useful for business
planning.
The Boxes series studied in this report has some special features which standard time-series
models do not account for: a change in variance near the end of 1984 and a constraint on
shipments in 1987. We have shown that the Box-Jenkins transfer-function model provides an
excellent framework for modeling and simultaneously estimating all the parameters in the
model. The Holt-Winters forecast function can also deal with a constraint on shipments,
though in a somewhat ad hoc way. We found that the accuracy of our Holt-Winters forecasts
depended strongly on the amount of data used to build the forecast function. This shows that
when using a Holt-Winters procedure we cannot assume that sudden changes in the slope or
variance of the series can be taken care of by the adaptive updates. Instead, we need to pay
close attention to the amount of data used.
For the Boxes series, the forecasts obtained from our best Box-Jenkins models were clearly
better than those obtained from our best Holt-Winters forecast functions. This is true for all
of the forecasts that we considered-one, two and four quarters ahead. We therefore choose
a Box-Jenkins model, specifically Model 2 defined in the third section, as our best time-series
forecasting model.
Finally, we need to determine whether our Box-Jenkins model is accurate enough to be
useful for business planning. We have compared the 1988 performance of each of our
Box-Jenkins models with the 1988 track. The track was constructed by IBM planners as a
feasible path for reaching the annual plan, and shipments are managed with the aim of
achieving it. The track is therefore intended as a monthly sales objective and not as a direct
forecast of actual shipments. However, it is a reasonable yardstick by which to measure
whether the Box-Jenkins model can be useful for business planning. Figure 7 shows the track
for 1988, established in December 1987, together with the Box-Jenkins forecasts for 1988,
made at origin December 1987 using Model 2. The QFC of these next-full-year forecasts from
Model 2 for 1988 is 10.4. This is much lower than the corresponding QFC of the track, which
is 23.1. Indeed, QFCs of forecasts from Box-Jenkins Models 1,2 and 3 are all markedly lower
than the QFC of the track. We therefore conclude that Box-Jenkins models can be
appropriate for business planning. They should be particularly useful at the end of a year for
determining a baseline business-as-usual annual plan and track for the year ahead, and in mid-
year for correcting the annual plan and resetting the track.
L. S.-Y. Wu, N. Ravishanker and J. R. M . Hosking IBM Product Sales 593
25
......... Track
Model 2 forecasts
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nav Dec
Figure 7. Planner track and time-series forecasts (origin 12/87) for 1988
APPENDIX
(d t>c
where C is a constant. This is a function of the Y, through equation (Al), which relates at to
Y,. Our goal is t o find a way t o estimate k and the parameters of the Box-Jenkins model
simultaneously so as to maximize L,(k).For known k, u3 is estimated by
Now let
Lb(k) = c - -
n log u; - 7g b ?
2 2ub r = I
594 Journal of Forecasting Vol. 10, Iss. No. 6
But approximately
bt =
t>c
so that, by equation (A4),
Therefore the two likelihoods La(k)and &k) are identical apart from the term ic log k , i.e.
c
C ( k )= L*(k)+ - log k (A 12)
2
This implies that for a given value of k, if we fit a model (A7) that maximizes Lb(k), then the
same model also maximizes L,(k).If we choose k to maximize & , ( k ) + i c log k, then this
choice also maximizes L,(k), giving the maximum-likelihood estimate of model (Al).
Tsay (1988) has suggested a method of dealing with a variance change in a time series both
when the change point is known as well as when it is unknown. Tsay’s method estimates the
variance ratio from the residuals of a time-series model fitted to the original data, transforms
the data by this ratio and fits an ARIMA model to the transformed data. Like the method
described above, it relies on the approximation (A10) but the details of the calculations are
different, and the two methods would, in general, give slightly different results.
ACKNOWLEDGEMENTS
REFERENCES
Abraham, B. and Ledolter, J . Statistical Methods for Forecasting, New York: John Wiley (1983).
Box, G. E. P. and Jenkins G. M., Times Series Analysis Forecasting and Control, 2nd edn, San
Francisco: Holden-Day (1976).
Box, G. E. P. and Tiao, G. C. ‘Intervention analysis with applications to economic and environmental
problems’, Journal of the American Statistical Association, 70 (1979, 70-79.
Chatfield, C. and Prothero, D. L., ‘Box-Jenkins seasonal forecasting: problems in a case study’, Journal
of the Royal Statistical Society, Series A . 136 (1973). 295-336.
Chatfield, C. and Yar, M., ‘Holt-Winters forecasting: some practical issues’, The Statistician, 37 (1988),
129-40.
Genstat 5 Committee, Genstat 5 Reference Manual, Oxford: Clarendon Press (1987).
Makridakis, S., Andersen, A., Carbone, R., Fildes. R., Hibon, M., Lewandowski, R., Newton, J.,
Parzen, E. and Winkler, R., ‘The accuracy of extrapolation (time series) methods: results of a
forecasting competition’, Journal of Forecasting, 1 (1982), 111-53.
Makridakis, S., Andersen, A., Carbone, R., Fildes, R., Hibon, M.. Lewandowski. R., Newton, J.,
Parzen, E. and Winkler, R.. The Forecasting Accuracy of Major Time Series Methods, New York:
Wiley (1984).
L . S.-Y. Wu,N. Ravishanker and J. R. M. Hosking IBM Product Sales 595
Montgomery, D. C. and Johnson, L. A., Forecasting and Time and Series Analysis, New York:
McGraw-Hill (1976).
Tsay, R. S., ‘Outliers, level shifts and variance changes in time series’, Journal of Forecasting, 7 (1988),
1-20.
Wu, L. S.-Y., ‘Business planning under uncertainty: quantifying variability’, The Statistician, 37 (1988),
141-52.
Authors ’ biographies:
Lilian Shiao-Yen Wu has been a Research Staff Member at the T. J. Watson Research Center of IBM
Research Division since 1974. She received a PhD in Applied Mathematics from Cornell University in
1974. Her interests include forecasting, business planning and data analysis.
Nalini Ravishanker is an Assistant Professor in the Department of Statistics at the University of
Connecticut at Storrs. She received a PhD in Statistics from New York University in 1987. Her interests
include simultaneous confidence intervals and the differential geometry of time-series models.
J. R. M. Hosking has been a Research Staff Member at the T. J. Watson Research Center of IBM
Research Division since 1986. He received a PhD in Statistics from the University of Southampton in
1979. His interests include time-series analysis, distribution theory and flood frequency analysis.
Authors’ addresses:
Lilian Shiao-Yen Wu, IBM T. J. Watson Research Center, P O Box 218, Yorktown Heights, NY 10598,
USA.
Nalini Ravishanker. Department of Statistics, University of Connecticut, Storrs, CT 06268, USA.
J. R. M. Hosking, IBM T. J. Watson Research Center, PO Box 218, Yorktown Heights, NY 10598,
USA.