Professional Documents
Culture Documents
Table 2
Correlations between arrivals in consecutive half-hour periods on Wednesday mornings in a
Canadian call center.
Half-hour periods (10, 10:30) (10:30, 11) (11, 11:30) (11:30, 12) (12, 12:30)
Table 3
Correlations between Type A and Type B arrivals in consecutive half-hour periods on
Wednesdays in a Canadian call center.
Type B Type A
(10, 10:30) (10:30, 11) (11, 11:30) (11:30, 12) (12, 12:30)
In practice, arrival forecasts are needed for a range of The early work on the forecasting of call arrivals usually
purposes, such as long-term capacity planning and short- focused on modeling daily or even monthly total call
term scheduling. Therefore, the specifics of the forecasting volumes, due partly to the lack of relevant data. In addition,
only point forecasts of future arrival rates or counts were
procedure need to be determined carefully, including the
produced.
forecasting horizon (for time intervals, for a day, or for
One of the earliest papers on the forecasting of call
multiple days), and whether to combine arrivals from
arrivals was that by Thompson and Tiao (1971), who
separate queues. Typically, models that incorporate time
modeled monthly call arrivals for two different call
dependencies are useful for short-term decision making,
types. Interestingly, they noted that there may be an
since such dependencies tend to vanish over longer time interdependence between the arrival streams of these two
scales. Over longer time horizons, simple smoothing or call types, but they did not explore this issue further. They
moving average models seem to be sufficient to capture used seasonal autoregressive integrated moving average
the general trends in the data. Indeed, in practice, arrivals (ARIMA) models to forecast future call volumes, and relied
to different call types are often combined when there is solely on the past history of call arrivals in their models.
not sufficient data for the individual call types. However, Mabert (1985) relied on multiplicative and additive
typically, this is not done systematically, and is based solely regression models, including covariates for special events
on the experience of the call center managers. and different seasonalities, for forecasting daily call
Ideally, we want arrival models that seek to reconcile arrivals to an emergency call center. He also considered
several objectives. In order for an arrival model to be re- model adjustments which exploit previous forecasting
alistic, it needs to reproduce the properties that we de- errors in order to yield more accurate forecasts. He found
scribed in Section 2. Simultaneously, in order for an arrival that such models yielded the most accurate forecasts, and
model to be of practical use, it needs to be computation- were superior to ARIMA models.
ally tractable; that is, it needs to rely on a relatively small Other early papers also relied on standard time series
number of parameters so as to avoid overfitting. More- models. For example, Andrews and Cunningham (1995)
over, these parameters need to be easy to estimate from modeled daily call arrivals to the call center of a retailer.
historical data. Finally, parameter estimates should not be The authors considered ARIMA models with transfer
hard to update (e.g., via Bayesian methods) based on newly functions and incorporated covariates for advertising
available information, e.g., throughout the course of a day. and special-day effects. They showed that using such
information improved the accuracy of their forecasts
These updated estimates would then be used to update op-
dramatically, and could have a significant impact on
erational decisions in the call center.
operational decision-making in the call center. Similarly,
In this section, we review alternative models that have
Bianchi, Jarrett, and Hanumara (1998) used ARIMA models
been proposed in the literature with the aim of reconciling
with intervention analysis to forecast telemarketing call
these objectives. We begin by reviewing early papers, arrivals, and found that such models are superior to
which relied mostly on standard forecasting methods additive and multiplicative Holt–Winters’ exponentially
(Section 3.1). Next, we focus on more recent models weighted moving average models.
for arrivals over several days or months (Section 3.2). More recently, Antipov and Maede (2002) modeled the
Finally, we move to models for arrivals over a single day daily numbers of applications for loans at a financial ser-
(Section 3.3). vices telephone call center. The authors also went beyond
870 R. Ibrahim et al. / International Journal of Forecasting 32 (2016) 865–874
standard ARIMA models by including advertising re- arrivals in each period, and then generate the arrival times
sponses and special calender effects, through the addition by splitting the counts uniformly and independently over
of exogenous variables to a multiplicative model. Chan- the given time period. (This is consistent with the Poisson
nouf et al. (2007) developed simple additive models for assumption.) In contrast, given a distributional forecast
the (small) number of ambulance calls each hour in the for the rates, one can generate the arrival times directly.
city of Calgary. Their models capture daily, weekly, and Nevertheless, most arrival models in the literature are
yearly seasonalities, selected second-order interaction ef- for the counts Xi,j , rather than the rates Λi,j , because, in
fects (e.g., between the time-of-day and day-of-the-week), practice, we observe, not the arrival rates themselves, but
special-day effects (such as the Calgary Stampede, lead- only the counts, which give only partial information as to
ing to increased call volumes), and autocorrelation of the the rates. This makes the estimation of arrival rates a more
residuals between successive hours. Their best model out- complicated task.
performed a doubly-seasonal ARIMA model for the residu- Multiple papers, such as those of Ibrahim and L’Ecuyer
als of a model which captures only special-day effects. (2013); Shen and Huang (2008b); Taylor (2008), and Wein-
The existing literature has used several different berg et al. (2007), consider a linear fixed-effects (FE) model
measures of forecasting errors to evaluate the accuracy as a benchmark for comparison. To illustrate, let di be the
of forecasting models, including the root mean squared day-of-the-week of day i, where i = 1, 2, . . . , D. That is,
error (RMSE), mean squared error (MSE), mean absolute di ∈ {1, 2, 3, 4, 5}, where di = 1 denotes a Monday, di = 2
percentage error (MAPE), weighted absolute percentage error denotes a Tuesday,. . . , and di = 5 denotes a Friday. Ibrahim
(WAPE), etc. The answer to the question of which error and L’Ecuyer (2013) considered the following fixed-effects
measure is appropriate depends on the objective of the model for the square-root transformed arrival counts:
call center management. For example, Ding and Koole
(2015) studied a call center staffing problem where the yi,j = αdi + βj + θdi ,j + µi,j , (3)
costs are the initial staffing costs plus the intraday traffic where the coefficients αdi , βj , and θdi ,j are real-valued con-
management costs, and concluded that optimal forecast stants that need to be estimated from the data, and µi,j are
methods should be those that minimize the WAPE. independent and identically distributed (i.i.d.) normal ran-
dom variables with a mean of zero. Ibrahim and L’Ecuyer
3.2. Models over several days (2013) and Taylor (2008) have shown that it is difficult to
beat fixed-effect models in terms of the accuracy of long-
To describe more recent arrival modelling approaches, term point forecasting (e.g., two weeks or more). Neverthe-
we need some additional notation. Let Xi,j denote the less, for short-term forecasting, one can exploit interday
number of call arrivals during period j, 1 ≤ j ≤ P, of day and intraday dependencies in the data in order to obtain
i, 1 ≤ i ≤ D. The standard assumption is that call arrivals more accurate forecasts.
follow a Poisson process with a (potentially) random As an improvement, and based on an analysis of real
arrival rate Λi,j , which is taken to be constant over each call center data, Aldor-Noiman et al. (2009) proposed the
period j. The cumulative arrival rate over period j is also following linear mixed-effects (ME) model:
Λi,j if a unit time period is assumed. Thus, conditional on
the event Λi,j = λi,j , Xi,j is Poisson distributed with rate λi,j . Yi,j = αdi + βj + θdi ,j + γi + ϵi,j ,
Several papers (e.g., those of Aldor-Noiman et al., 2009; where γi denotes the daily volume deviation from the fixed
Brown et al., 2005; Ibrahim & L’Ecuyer, 2013; Weinberg weekday effect on day i. Then, γi is the random effect
et al., 2007) exploit the following ‘‘root-unroot’’ variance- on day i. Let G denote the D × D covariance matrix for
stabilizing data transformation: the sequence of random effects. The random effects γi
are identically normally distributed, with expected value
Yi,j ≡ (Xi,j + 1/4)1/2 . (2)
E [γ ] = 0 and variance Var[γ ] = σG2 . The authors
Conditional on the event Λi,j = λi,j , and for large values assume that these random effects follow an AR(1) process.
of λi,j , Yi,j is distributed approximately normally, with Considering an AR(1) covariance structure for G is both
mean λi,j and variance 1/4; see Brown, Cai, Zhang, Zhao,
useful and computationally effective, because it requires
and Zhou (2010). The unconditional distribution, with the estimation of only two parameters, σG and ρG . The
random Λi,j , is then a mixture of such normal distributions, residuals ϵi,j are also assumed to have an AR(1) structure
and therefore has a larger variance. Nevertheless, it within each day. As such, this model captures both interday
can be assumed (as an approximation) that the square- and intraday dependencies in the data. An earlier version
root transformed counts Yi,j are distributed normally, of this model, also based on call-center data, but without
particularly if Var[Λi,j ] is not too large. The resulting intraday correlations and without special-day effects, was
normality is very useful because it allows linear Gaussian proposed by Brown et al. (2005).
fixed-effects and mixed-effects models to be fitted to the Ibrahim and L’Ecuyer (2013) extended this ME model
square-root transformed data. to two bivariate ME models for the joint distribution of
A better alternative than modelling the arrival counts the arrival counts to two separate queues, which exploit
Xi,j would be to model the rates Λi,j directly, because correlations between different call types. These models
it is considerably easier to simulate the system with a account for the dependence between the two call types
distributional forecast for the rates than one for the counts. by assuming that either the vectors of random effects or
Indeed, to simulate arrivals based on a distributional the vectors of residuals across call types are correlated
forecast for counts, one has to generate the number of multinormal. This corresponds to using a normal copula;
R. Ibrahim et al. / International Journal of Forecasting 32 (2016) 865–874 871
see Kim et al. (2012). The choice of copula can have 3.3. Models over a single day
a significant impact on performance measures in call
centers, because of the strong effect of tail dependence on In this section, we focus on modeling arrivals over a
the quality of service (Jaoua et al., 2013). For example, a single day. The day is divided into p time periods. We
strong upper tail dependence for certain call types means denote the vector of arrival counts in those periods by
that very large call volumes tend to arrive together for X = (X1 , . . . , Xp ).
these call types. When this happens, this produces very It is commonly assumed that intraday arrivals follow a
Poisson process with a random arrival rate. Whitt (1999a)
large overloads.
proposed that this be done by starting with a deterministic
To reduce the dimensionality of the vectors (Yi,1 , . . . , arrival rate function {λ(t ), t0 ≤ t ≤ te }, where t0 and te
Yi,P ), Shen and Huang (2005) proposed the use of are the opening and closing times of the call center for the
singular-value decomposition to define a small number of considered day, and multiplying this function by a random
vectors whose linear transformations capture most of the variable W with mean E[W ] = 1, called the busyness factor
information that is of relevance for prediction. Based on for that day. The (random) arrival rate process for that day
this, Shen and Huang (2008b) then developed a dynamic is then Λ = {Λ(t ) = W λ(t ), t0 ≤ t ≤ te }.
updating method for the distributional forecasts of arrival Under this model, the arrival rates at any two given
rates. Shen and Huang (2008a) proposed a method for times are perfectly correlated, and Corr[Λj , Λk ] = 1 for
forecasting the latent rate profiles of a time series of all j, k. We also expect the Xj s to be correlated strongly.
inhomogeneous Poisson processes, to enable future arrival More specifically, let Ij denote the time interval of period
j, let λ̄j = I λ(t )dt, and let Xj be the number of arrivals
rates to be forecast based on a series of observed arrival j
counts. in Ij . Using the variance and expectation decompositions,
Aktekin and Soyer (2011) recently proposed a model one can find that Var[Xj ] = λ̄j (1 + (1 + λ̄j )Var[W ]), i.e. Eq.
based on a Poisson-Gamma process, where Λi,j = Wi,j λi,j (3.12) of Whitt (1999a), and for j ̸= k:
for fixed values of λi,j , and where the multiplicative
Corr[Xj , Xk ]
factors Wi,j have a gamma distribution and obey a gamma −1/2
= Var[W ] (Var[W ] + 1/λ̄j )(Var[W ] + 1/λ̄k ) .
process. Soyer and Tarimcilar (2008) analyzed the effects of
advertisement campaigns on call arrivals, using a Bayesian This correlation is zero when Var[W ] = 0 (a
analysis where the Poisson rate function is modeled using deterministic rate), and approaches one when Var[W ] →
a mixed model approach. This mixed model is shown to ∞. Avramidis et al. (2004) studied this model in the special
be superior to using a fixed-effects model. Weinberg et al. situation where W has a gamma distribution, with E[W ] =
(2007) propose an adaptation of the model of Brown et al. 1 and Var[W ] = 1/γ . Then, each Λj has a gamma
(2005) to enable it to update the forecasts of a day, as distribution, and the Xj s have a negative multinomial
defined from the previous days, using observations made distribution, with parameters that are easy to estimate.
available during this day. Furthermore, the variance of the arrival counts can be
Weinberg et al. (2007) also used Bayesian techniques made arbitrarily large by decreasing γ toward zero. The
in their forecasts. They exploited the (normal) square-root model’s flexibility is rather limited, because, given the λ̄j s,
transformed counts to include conjugate multivariate nor- Var[Xj ] and Corr[Xj , Xk ] for j ̸= k are all determined by
a single parameter value, namely Var[W ]. In an attempt
mal priors, with specific covariance structures. They used
to increase the flexibility of the covariance matrix Cov[X],
Gibbs sampling and the Metropolis–Hastings algorithm
and in particular to enable a reduction of the correlations,
to sample from the forecast distributions, which unfortu-
Avramidis et al. (2004) introduced two different models for
nately involves long computation times. Moreover, it is un- X, based on the multivariate Dirichlet distribution.
clear how exogenous covariates should be incorporated in Jongbloed and Koole (2001) examined a similar model,
such a model. but with independent busyness factors, one for each pe-
The empirical analysis of Taylor (2008) compared sev- riod of the day. Under their model, the Λj s are indepen-
eral time series models, including autoregressive moving dent, as are the Xj s, which is inconsistent with an intraday
average (ARMA) models and Holt–Winters’ exponential dependence of call center arrivals. Channouf (2008) con-
smoothing models with multiple seasonal patterns. The sidered a variant of the model where λ(t ) is defined by a
latter method was adapted by Taylor (2003) for modeling cubic spline over the day, with a fixed set of knots, and
both the intraday and intraweek cycles in intraday data. also shows how the model parameters can be estimated.
Taylor (2012) extended this model and considered the den- This can provide a smoother (and perhaps more realistic)
sity forecasting of call arrival rates. With this aim, he devel- model of the arrival rate. Channouf (2008) and Channouf
and L’Ecuyer (2012) proposed models that account for time
oped a new Holt–Winters’ Poisson count data model with
dependence, overdispersion, and intraday dependencies
a gamma-distributed stochastic arrival rate, and showed
with much more flexibility, in order to match the correla-
that this new model outperformed the basic Holt–Winters’
tions between the Xj s, by using a normal copula to specify
smoothing model. Shen (2010a) commented on Taylor’s the dependence structure between these counts. In princi-
work, highlighting the difference between modeling ar- ple, similar copula models could be developed for the vec-
rivals as a single time series and as a vector time series tor of arrival rates, (Λ1 , . . . , Λp ), instead of for the vector
where each day is modeled as a component of that vector. of counts. Oreshkin et al. (2016) examined the relationship
872 R. Ibrahim et al. / International Journal of Forecasting 32 (2016) 865–874
Table 4
Forecasting comparison among the five methods for call type A.
Type A
MU ME BME1 BME2
Table 5
Forecasting comparison among the five methods for call type B.
Type B
MU ME BME1 BME2
between modeling for the vector of counts and the vector the forecasting distribution by reporting the coverage
of rates. In particular, they gave explicit formulas for the re- probability for the 95% prediction interval, defined as:
lationship between the correlations between the rates and 1
the counts in two given periods, which implied that, for a Cover = I(Xi,j ∈ (L̂i,j , Ûi,j )),
K i,j
given correlation between rates, the correlation between
counts is much smaller in low traffic than in high traffic.
where (L̂i,j , Ûi,j ) is the 95% prediction interval for Xi,j given
by the model.
4. Case studies Tables 4 and 5 summarize the comparisons among the
four methods. For both call types, ME produces the most
In this section, we present empirical results from a case accurate point forecasts in most scenarios. BME1 and BME2
study using real data collected at a Canadian call center, as have better coverage probabilities when the leading period
described in Section 2. We use data based on two call types is one day or one week, and MU has a better coverage
and 200 consecutive workdays (excluding weekends). For probability when the leading period is two weeks.
each call type, the study implements four methods (those Our numerical results serve to illustrate the complexity
of Aldor-Noiman et al., 2009; Gans et al., 2015; Ibrahim & of the forecasting problem. Indeed, different models may
L’Ecuyer, 2013) for forecasting arrival counts based on six be most appropriate depending on the lead-time of interest
weeks of historical data: and on the specific criterion being considered, as is shown
here.
• MU: the multiplicative univariate forecasting model of Ultimately, selecting the ‘‘best’’ forecasting method
Gans et al. (2015); depends on the specifics of the problem at hand, and an
• ME: the univariate mixed-effects model of Aldor- adequate measurement error. Proposing alternative arrival
Noiman et al. (2009); models that capture different features of the data is key to
• BME1: the bivariate mixed-effects model of Ibrahim gaining a deeper understanding of the complexities of the
and L’Ecuyer (2013); and call-arrival process.
• BME2: the bivariate mixed-effects model of Ibrahim
and L’Ecuyer (2013). 5. Conclusions and discussion
Each method is applied to the data in turn, and we The forecasting of call center arrivals plays a crucial role
assess the out-of-sample forecasting accuracy of each. We in call center management, for example in determining ap-
compare the different models by using the root mean propriate staffing levels, scheduling plans and routing poli-
squared error (RMSE) to assess the point forecast accuracy, cies. The call center arrival process is complex and requires
as defined below: appropriate modeling in order to achieve better forecasting
accuracies, leading to more efficient operational decisions.
1
RMSE = (Xi,j − X̂i,j )2 , In this survey paper, we have reviewed the existing
K i,j literature on the modeling and forecasting of call center
arrivals. We have also conducted a case study to evaluate
where X̂i,j is the value of Xi,j predicted by the model, and several recently proposed forecasting methods using real-
K is the total number of predictions. We also evaluate life call center data.
R. Ibrahim et al. / International Journal of Forecasting 32 (2016) 865–874 873
Ibrahim, R., L’Ecuyer, P., Shen, H., & Thiongane, M. (2015). Inter- Shen, H., & Huang, J. Z. (2008b). Interday forecasting and intraday
dependent, heterogeneous, and time-varying service-time distribu- updating of call center arrivals. Manufacturing and Service Operations
tions in call centers. European Journal of Operational Research, in press. Management, 10(3), 391–410.
Jaoua, A., L’Ecuyer, P., & Delorme, L. (2013). Call type dependence in Soyer, R., & Tarimcilar, M. M. (2008). Modeling and analysis of call
multiskill call centers. Simulation, 89(6), 722–734. center arrival data: A Bayesian approach. Management Science, 54(2),
Jongbloed, G., & Koole, G. (2001). Managing uncertainty in call centers 266–278.
using Poisson mixtures. Applied Stochastic Models in Business and Steckley, S.G., Henderson, S.G., & Mehrotra, V. (2004). Service system
Industry, 17, 307–318. planning in the presence of a random arrival rate. Submitted for
Kim, T., Kenkel, P., & Brorsen, B. W. (2012). Forecasting hourly peak publication.
call volume for a rural electric cooperative call center. Journal of Steckley, S. G., Henderson, S. G., & Mehrotra, V. (2005). Performance
Forecasting, 31, 314–329. measures for service systems with a random arrival rate. In M. E. Kuhl,
Kim, S. H., & Whitt, W. (2014a). Are call center and hospital arrivals well N. M. Steiger, F. B. Armstrong, & J. A. Joines (Eds.), Proceedings of the
modeled by nonhomogeneous Poisson processes? Manufacturing and 2005 winter simulation conference (pp. 566–575). IEEE Press.
Service Operations Management, 16(3), 464–480. Steckley, S. G., Henderson, S. G., & Mehrotra, V. (2009). Forecast errors
Kim, S. H., & Whitt, W. (2014b). Choosing arrival process models for in service systems. Probability in the Engineering and Informational
service systems: Tests of a nonhomogeneous Poisson process. Naval Sciences, 23(2), 305–332.
Research Logistics, 61(1), 66–90. Tanir, O., & Booth, R. J. (1999). Call center simulation in bell Canada. In Pro-
Koole, G. (2013). Call center optimization. MG Books. ceedings of the 1999 winter simulation conference (pp. 1640–1647). Pis-
Landon, J., Ruggeri, F., Soyer, R., & Tarimcilar, M. M. (2010). Modeling cataway, New Jersey: IEEE Press.
latent sources in call center arrival data. European Journal of Taylor, J. W. (2003). Short-term electricity demand forecasting using
Operational Research, 204(3), 597–603. double seasonal exponential smoothing. Journal of Operational
Liao, S., Delft, C. V., Koole, G., & Jouini, O. (2012). Staffing a call center with Research Society, 54, 799–805.
uncertain non-stationary arrival rate and flexibility. OR Spectrum, Taylor, J. W. (2008). A comparison of univariate time series methods
34(3), 691–721. for forecasting intraday arrivals at a call center. Management Science,
Mabert, V. A. (1985). Short interval forecasting of emergency phone call 54(2), 253–265.
(911) work loads. Journal of Operations Management, 5(3), 259–271. Taylor, J. W. (2012). Density forecasting of intraday call center arrivals
Maman, S., Mandelbaum, A., Whitt, W., & Zeltyn, S. (2015). Queues using models based on exponential smoothing. Management Science,
with random arrival rates: Inference, modelling and asymptotics 58(3), 534–549.
(c-staffing). submitted for publication. Thompson, H. E., & Tiao, G. C. (1971). Analysis of telephone data: A case
Oreshkin, B. N., Regnard, N., & L’Ecuyer, P. (2016). Rate-based daily arrival study of forecasting seasonal time series. Bell Journal of Economics and
process models with application to call centers. Operations Research, Management Science, 2(2).
in press. Wallace, R. B., & Whitt, W. (2005). A staffing algorithm for call centers
Shen, H. (2010a). Exponentially weighted methods for forecasting with skill-based routing. Manufacturing and Service Operations
intraday time series with multiple seasonal cycles: Comments. Management, 7(4), 276–294.
International Journal of Forecasting, 58, 652–654. Weinberg, J., Brown, L. D., & Stroud, J. R. (2007). Bayesian forecasting of an
Shen, H. (2010b). Statistical analysis of call-center operational data: inhomogeneous Poisson process with applications to call center data.
Forecasting call arrivals, and analyzing customer patience and agent Journal of the American Statistical Association, 102(480), 1185–1198.
service. In J. J. Cochran (Ed.), Wiley encyclopedia of operations research Whitt, W. (1999a). Dynamic staffing in a telephone call center aiming
and management science. John Wiley. to immediately answer all calls. Operations Research Letters, 24,
Shen, H., & Huang, J. Z. (2005). Analysis of call centre arrival data using 205–212.
singular value decomposition. Applied Stochastic Models in Business Whitt, W. (1999b). Improving service by informing customers about
and Industry, 21, 251–263. anticipated delays. Management Science, 45(2), 192–207.
Shen, H., & Huang, J. Z. (2008a). Forecasting time series of inhomogeneous Zhang, X., Hong, L. J., & Zhang, J. (2014). Scaling and modeling of call
Poisson processes with application to call center workforce manage- center arrivals. In Proceedings of the 2014 winter simulation conference
ment. Annals of Applied Statistics, 2(2), 601–623. (pp. 476–485). IEEE.