You are on page 1of 15

International Journal of Forecasting 20 (2004) 169 183

www.elsevier.com/locate/ijforecast

Forecasting economic and financial time-series


with non-linear models
Michael P. Clements a, Philip Hans Franses b, Norman R. Swanson c,*
a
Department of Economics, University of Warwick, Warwick, RI, USA
b
Econometric Institute, Erasmus University Rotterdam, Rotterdam, NY, USA
c
Department of Economics, Rutgers University, 75 Hamilton Street, New Brunswick, NJ 08901-1248, USA

Abstract

In this paper we discuss the current state-of-the-art in estimating, evaluating, and selecting among non-linear forecasting
models for economic and financial time series. We review theoretical and empirical issues, including predictive density, interval
and point evaluation and model selection, loss functions, data-mining, and aggregation. In addition, we argue that although the
evidence in favor of constructing forecasts using non-linear models is rather sparse, there is reason to be optimistic. However,
much remains to be done. Finally, we outline a variety of topics for future research, and discuss a number of areas which have
received considerable attention in the recent literature, but where many questions remain.
D 2004 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.

Keywords: Economic; Financial; Non-linear models

1. Introduction non-linear over linear models in terms of forecast


performance, and we suspect that the situation has
Whilst non-linear models are often used for a not changed very much since then. It seems that we
variety of purposes, one of their prime uses is for have not come very far in the area of non-linear
forecasting, and it is in terms of their forecasting forecast model construction.
performance that they are most often judged. Howev- We argue that the relatively poor forecasting per-
er, a casual review of the literature suggests that often formance of non-linear models calls for substantive
the forecasting performance of such models is not further research in this area, given that one might feel
particularly good. Some studies find in favor, but uncomfortable asserting that non-linearities are unim-
equally there are studies in which their added com- portant in describing economic and financial phenom-
plexity relative to rival linear models does not result in ena. The problem may simply be that our non-linear
the expected gains in terms of forecast accuracy. Just models are not mimicing reality any better than
over a decade ago, in their review of non-linear time simpler linear approximations, and in the next section
series models, De Gooijer and Kumar (1992) con- we discuss this and related reasons why a good
cluded that there was no clear evidence in favor of forecast performance across the board may consti-
tute something of a holy grail for non-linear models.
* Corresponding author. We discuss the current state-of-the-art in non-linear
E-mail address: nswanson@econ.rutgers.edu (N.R. Swanson). modeling and forecasting, with particular emphasis

0169-2070/$ - see front matter D 2004 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
doi:10.1016/j.ijforecast.2003.10.004
170 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

placed on outlining a number of open issues. The highly non-linear. As well as which, bond pricing
topics we focus on include joint and conditional models, diffusion processes describing yield curves,
predictive density evaluation, loss functions, estima- and almost all other continuous time finance models
tion and specification, and data-mining, amongst are non-linear. The predominance of non-linear mod-
others. As such, this paper complements the rest of els in economics and finance is not inconsistent with
the papers in this special issue of the International the use of linear models by the applied practitioner, as
Journal of Forecasting. such models can be viewed as reasonable approxima-
The rest of the paper is organized as follows. In tions to the non-linear phenomenon of interest. Thus,
Section 2 we discuss why one might want to consider from the perspective of forecasting, there is ample
non-linear models, and a number of reasons why their reason to continue to look at non-linear models. As
forecasting ability relative to linear models may not our non-linear model estimation, selection and testing
be as good as expected. In Section 3 we discuss approaches become more sophisticated, one might
recent theoretical and methodological issues to do expect to see their forecast performance improve
with forecasting with non-linear models, many of commensurately. It is thus not surprising that non-
which go beyond the traditional preoccupation with linear models, ranging from regime-switching models,
point forecasts to consider the whole predictive den- to neural networks and genetic algorithms, are receiv-
sity. Section 4 highlights a number of empirical ing a great deal of attention in the literature.
issues, and how these are dealt with in the papers On a more cautionary note, we review a number of
collected in this issue. Concluding remarks are gath- factors, which might count against the aforemen-
ered in Section 5. tioned improvement in the relative performance of
non-linear models. Shortly after De Gooijer and
Kumar, Granger and Terasvirta (1993a); Granger
2. Why consider non-linear models? and Terasvirta (1993b), ch. 9 (see also Terasvirta &
Anderson, 1992) in their review of smooth transition
Many of us believe that linear models ought to be a autoregressive (STAR) models of US industrial pro-
relatively poor way of capturing certain types of duction, argue that the superior in-sample perfor-
economic behavior, or economic performance, at mance of such models will only be matched out-of-
certain times. The obvious example would be a linear sample if that period contains non-linear features.
(e.g. Box Jenkins ARMA) model of output growth in Similarly, Tong (1995), pp. 409 410 argues strongly
a Western economy subject to the business cycle, that for non-linear models how well we can predict
where the properties of output growth in recessions depends on where we are and that there are win-
are in some ways quite different from expansions (e.g. dows of opportunity for substantial reduction in
Hamilton, 1989; Sichel, 1994, but references are prediction errors. This suggests that an important
innumerable). Output growth non-linearities can be aspect of an evaluation of the forecasts from non-
characterized by the presence of two or more regimes linear models relative to the linear AR models is to
(e.g. recessions and expansions), as can financial make the comparison in a way which highlights the
variables (periods of high and low volatility). Other favorable performance of the former for certain states,
types of non-linearity might include the possibility especially if it is forecasts in those states which are
that the effects of shocks accumulate until a process most valuable to the user of the forecasts. Clements
explodes (self-exciting or catastrophic behavior), as and Smith (1999) compare the forecasting perfor-
well as the notion that some variables are relevant for mance of empirical self-exciting threshold autoregres-
forecasting only once in a while (e.g. only when oil sive (SETAR) models and AR models using
prices increase by a large amount do they have a simulation techniques which ensure that past non-
significant effect on output growth, and therefore, linearities are present in the forecast period. See Tiao
become useful for forecasting output growth). and Tsay (1994) for a four-regime TAR model applied
In macroeconomics and finance theory a host of to US GNP, and Boero and Marrocu (2004) for an
non-linear models are already in vogue. For example, application of regime-specific evaluation to exchange
almost all-real business cycle (RBC) models are rate forecasts.
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 171

In addition, in the context of exchange rate predic- quent sections of this paper review some of the recent
tion, Diebold and Nason (1990) give a number of developments in model selection and empirical strat-
reasons why non-linear models may fail to outperform egies, as well as the remaining papers in this issue,
linear models. One is that apparent non-linearities which take up the challenge of forecasting with non-
detected by tests for linearity are due to outliers or linear models.
structural breaks, which cannot be readily exploited to We end this section with two short illustrations of
improve out-of-sample performance, and may only be some of the difficulties that can arise. In the first, there
detected by careful analysis along the lines of Koop is distinct regime-switching behavior, but this does
and Potter (2000), for example. They also suggest that not contribute to an improved forecast performance.
conditional mean non-linearities may be a feature of In the second, we discuss the relationship between
the data generating process (DGP), but may not be output growth and the oil price.
large enough to yield much of an improvement to
forecasting, as well as the explanation that they are 2.1. Markov-switching models of US output growth
present and important, but that the wrong types of non-
linear models have been used to try and capture them. Clements and Krolzig (1998) present some theo-
There is a view that, because some aspects of the retical explanations for why Markov-switching (MS)
economy or financial markets do indeed display non- models may not forecast much better than AR models,
linear behavior, then neglecting these features in and apply their analysis to post war US output growth.
constructing forecasts would leave the end-user un- The focus is on a two-regime MS model:
easy, feeling that the forecasts are in some sense
second. This follows from the belief that a good Dyt  st aDyt1  st1 ut : 1
model for the in-sample data should also be a good
out-of-sample forecasting model. As an example, an where ut f IN[0, ru2]. The conditional mean l(st)
AR(2) model for US GNP growth may yield lower switches between two states:
average squared errors than an artificial neural net- 8
< l1 > 0 if st 1WexpansionW or WboomW;
work with one hidden unit, but, knowing that the lst
AR(2) model is incapable of capturing the distinct :
l2 < 0 if st 2WcontractionW or WrecessionW;
dynamics of expansions and contractions, is unlikely (2)
to engender confidence that one has a model that
captures the true dynamics of the economy. But this is The description of a MS AR model is completed by
the crux of the mattera good in-sample fit (here, the specification of a model for the stochastic and
capturing the distinct dynamics of the business cycle unobservable regimes on which the parameters of the
phases) does not necessarily translate in to a good out- conditional process depend. Once a law has been
of-sample performance relative to a model such as an specified for the states st the evolution of regimes
AR. The sub-section below illustrates, and see also can be inferred from the data. The regime-generating
Clements and Hendry (1999) for a more general process is assumed to be an ergodic Markov chain
discussion. with a finite number of states st = 1, 2, (for a two-
We take the view that, if one believes the under- regime model), defined by the transition probabilities:
lying phenomenon is non-linear, it is worth consider-
ing a non-linear model, but warn against the pij Prst1 jAst i;
expectation that such models will always do well
X
2
there are too many unknowns and the economic pij 1 bi; jaf1; 2g: 3
system is too complex to support the belief that j1
simply generalizing a linear model in one (simple!)
direction, such as adding another regime, will neces- The model can be rewritten as the sum of two
sarily improve matters. That said, it is natural to be independent processes:
unhappy with models, which are obviously deficient
in some respect, and to seek alternatives. The subse- Dyt  ly lt zt ;
172 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

where ly is the unconditional mean of Dyt, such that p11 + p22  1g a in (4), the conditional expectation
E[lt] = E[zt] = 0. While the process zt is Gaussian: collapses to a linear prediction rule. Heuristically, the
relative performance of non-linear regime-switching
zt azt1 e t ; e t fIN 0; r2e ; models would be expected to be better the more
persistent the regimes. When the regimes are unpre-
the other component, lt, represents the contribution of dictable, we can do no better than employing a simple
the Markov chain: linear model. See also Krolzig (2003); Dacco and
lt l2  l1 ft ; Satchell (1999) for an analysis of the effects of
wrongly classifying the regime the process will be in.
where ~ t = 1  Pr(st = 2) if st = 2 and  Pr(st = 2) oth- A number of authors, such as Sensier, Artis,
erwise. Pr(st = 2) = p12/( p12 + p21) is the unconditional Osborn and Birchenhall (2004), attempt to predict
probability of regime 2. Invoking the unrestricted business cycle regimes, and the transition probabilities
VAR(1) representation of a Markov chain: in (4) can be made to depend on leading indicator
variables, as a way of sharpening the forecasting
ft p11 p22  1ft1 tt ;
ability of these models. Franses, Paap and Vroomen
then predictions of the hidden Markov chain are given (2004) utilize information from extraneous variables
by: to determine the regime in a novel approach to
predicting the US unemployment rate.
fThAT p11 p22  1h fT AT
2.2. Output growth and the oil price
where fT AT = E[~ TAYT] = Pr(sT = 2AYT)  Pr(sT = 2) is
the filtered probability Pr(sT = 2AYT) of being in regime Of obvious interest in the literature is the relation-
2 corrected for the unconditional probability. Thus, the ship between oil prices and the macroeconomy. To
conditional mean of DyT + h is given by D yThAT  ly what extent did the OPEC oil price rises contribute to
which equals: the recessions in the 1970s, and might one expect
reductions in prices to stimulate growth, although
lT hAT zThT l2  l1 p11 p22  1h fT AT ah perhaps to a lesser extent? Hamilton (1983) originally

DyT  ly  l2  l1 fTAT proposed a linear relationship between oil prices and
output growth for the US. This was challenged by
ah DyT  ly l2  l1 Mork (1989), who suggested that the relationship was

p11 p22  1h  ah fT AT : 4 asymmetric, in that output growth responds negatively
to oil price increases, but is unaffected by oil price
The first term in (4) is the optimal prediction rule declines. With the advantage of several more years of
for a linear model, and the contribution of the data, Hooker (1996) showed that the linear relation-
Markov regime-switching structure is given by the ship proposed by Hamilton (1983) appears not to hold
term multiplied by fT AT , where fT AT contains the in- from 1973 onwards (the date of the first oil price
formation about the most recent regime at the time the hike!). However, he also cast doubt on the simple
forecast is made. Thus, the contribution of the non- asymmetry hypothesis suggested by Mork (1989).
linear part of (4) to the overall forecast depends on More recently, Hamilton (1996) proposes relating
both the magnitude of the regime shifts, Al2  l1A, output growth to the net increase in oil prices over
and on the persistence of regime shifts p11 + p22  1 the previous year, and constructs a variable that is the
relative to the persistence of the Gaussian process, percentage change in the oil price in the current
given by a. quarter over the previous years high, when this is
Clements and Krolzig estimate p11 + p22  1 = 0.65, positive, and otherwise takes on the value zero. Thus,
and the largest root of the AR polynomial to be 0.64, increases in the price of oil, which simply reverse
so that the second reason explains the success of the previous (within the preceding year) declines do not
linear AR model in forecasting. Since the predictive depress, output growth. Recently, Hamilton (2000)
power of detected regime shifts is extremely small, has used a new flexible non-linear approach (Dahl &
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 173

Hylleberg, 2004; Hamilton, 2001) to characterize the The choice of model might be suggested by eco-
appropriate non-linear transform of the oil price. nomic theory, and often by the requirement that the
Raymond and Rich (1997) have investigated the model is capable of generating the key characteristics
relationship between oil prices and the macroeconomy of the data at hand. Of course, the issue of which
by including the net increase in the oil price in an MS characteristics often arises. For example, Pagan
model of US output for the period 1951 1995. They (1997a); Pagan (1997b); Harding and Pagan (2001)
are interested in whether the recurrent shifts between argue that non-linear models should be evaluated in
expansion and contraction identified by the MS model terms of their ability to reproduce certain features of
remain when oil prices have been included as an the classical cycle, rather than their ability to match
explanatory variable for the mean of output. Raymond the stationary moments of the detrended growth
and Rich (1997) conclude that while the behavior of cycle.1
oil prices has been a contributing factor to the mean An approach to tackling these issues is to use
of low growth phases of output, movements in oil predictive density or distributional testing, as a means
prices generally have not been a principal determi- of establishing which of a number of candidate
nant in the historical incidence of these phases . . . forecasting models has distributional features that
(p. 196). Further, Clements and Krolzig (2002) inves- most closely match the historical record. This could
tigate whether oil prices can account for the asymme- include, for example, finding out which of the models
try in the business cycle using the tests proposed in yields the best distributional or interval predictions.
Clements and Krolzig (2003). For example, in financial risk management interest
Clearly, the process of discovery of (an approxi- often focuses on predicting a particular quantile (as
mation to) the form of the non-linearity in the rela- the Value-at-Risk, VaR) but alternatively the entire
tionship between output growth and oil price changes conditional distribution of a variable may be of
has taken place over two decades, is far from simple, interest. Over the last few years, a new strand of
and has involved the application of state of the art literature addressing the issue of predictive density
econometric techniques. Perhaps this warns against evaluation has arisen (see e.g. Bai, 2001; Christof-
expecting too much in the near future using canned fersen, 1998; Christoffersen, Hahn & Inoue, 2001;
routines and models. Clements & Smith, 2000, 2002; Diebold, Gunther &
Tay, 1998 (henceforth DGT), Diebold, Hahn & Tay,
1999; Giacomini & White, 2003; Hong, 2001). The
3. Theoretical and methodological issues literature on the evaluation of predictive densities is
largely concerned with testing the null of correct
There are various theoretical issues involved in dynamic specification of an individual conditional
constructing non-linear models for forecasting. In this distribution model. At the same time, the point fore-
section we outline a number of these. An obvious cast evaluation literature explicitly recognizes that all
starting point is which non-linear model to use, given the candidate models may be misspecified (see e.g.
the many possibilities that are available, even once we Corradi & Swanson, 2002; White, 2000). Corradi and
have determined the purpose to which it is to be put Swanson (2003a) draw on elements from both types
(here, forecasting). The different types of models of papers in order to provide a test for choosing
often require different theoretical and empirical tools among competing predictive density models which
(see e.g. the recent surveys by Franses & van Dijk, may be misspecified. Giacomini and White (2003)
2000; van Dijk, Franses & Terasvirta, 2002). For
example, closed form solutions exist for the condi-
1
tional mean forecast for an MS process, but not for a As an aside, these papers have shown that the durations and
threshold autoregressive process, requiring simulation amplitudes of expansions and contractions of the classical cycle can
or numerical methods in the latter case. Certain be reasonably well reproduced by simple random walk with drift
models, where the ratio of the drift to the variance of the disturbance
theoretical properties, such as stability and stationar- term is the crucial quantity. Non-linear models appear to add little
ity, and the persistence of shocks, are not always over and above that which can be explained by the random walk
immediately evident. with drift.
174 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

tackle a similar problem, developing a framework that and


allows for the evaluation of both nested and nonnested y
models. hi arg min Eqi yj ; Z j1 ; hi ; 6
hi aHi
Many of the above papers seek to evaluate predic-
tive densities by testing whether they have the prop- where qi denotes the objective function for model i.
erty of correct (dynamic) specification. By making use Following standard practice (such as in the real-time
of the probability integral transform, DGT suggest a forecasting literature), this estimator is first computed
simple and effective means by which predictive den- using R observations, then R + 1 observations, then
sities can be evaluated. Using the DGT terminology, if R + 2, and so on until the last estimator is constructed
pt( ytAVt  1) is the true conditional distribution of using T  1 observations, resulting in a sequence of
ytAVt  1, then the probability of observing a value of P = T  R estimators. In the current discussion, we
yt no larger than that actually observed is a uniform focus on 1-step ahead prediction, so these estimators
random variable on [0, 1]. Moreover, if we have a are then used to construct sequences of P 1-step ahead
sequence of predictive densities, then the resulting forecasts and associated forecast errors.
sequence of probabilities should be identically inde- Now, define the group of conditional distribution
pendently distributed. Goodness-of-fit statistics can models from which we want to make a selection as
then be constructed that compare the empirical distri- F1(uAZ t, h1y,), . . ., Fn(uAZ t, hny), and define the true
bution function of the probabilities to the 45j degree conditional distribution as F 0 (uAZ t , h0 ) = Pr
line, possibly taking into account that pt( ytAVt  1) ( yt + 1 V uAZ t). Hereafter, assume that qi ( yt, Z t  1, hi)
contains parameters that have been estimated (see also =  ln fi ( ytAZ t  1, hi), where fi ( A , hi) is the condi-
Bai, 2001; Diebold et al., 1999; Hong, 2001). tional density associated with Fi, i =1, . . ., n, so that hyi
The approach taken by Corradi and Swanson is the probability limit of a quasi maximum likelihood
(2003a) differs from the DGT approach as they do estimator (QMLE). If model i is correctly specified,
not assume that any of the competing models are then hiy = h0. Now, F1( A , h1y) is taken as the bench-
correctly specified. Thus, they posit that all models mark model, and the objective is to test whether some
should be viewed as approximations to some un- competitor model can provide a more accurate ap-
known underlying DGP. They proceed by selecting proximation of F0( A , h0) than the benchmark. As-
a conditional distribution model that provides the sume that accuracy is measured using a distributional
most accurate out-of-sample approximation of the true analog of mean square error. More precisely, the
conditional distribution, allowing for misspecification squared (approximation) error associated with model
under both the null and the alternative hypotheses. i, i = 1, . . ., n, is measured in terms of the average over
This is done using an extension of the conditional U of E(( Fi (uAZ t, hyi )  F0(uAZ t, h0))2), where uaU,
Kolmogorov test approach of Andrews (1997) due to and U is a possibly unbounded set on the real line. The
Corradi and Swanson (2003b) that allows for the in- hypotheses of interest are:
Z
sample comparison of multiple misspecified models. y
H0 : max EF1 uAZ t ; h1  F0 uAZ t ; h0 2
More specifically, assume that the objective is to form k2;...;n U
parametric conditional distributions for a scalar ran- y
 Fk uAZ t ; hk  F0 uAZ t ; h0 2 /uduV0
dom variable, yt + 1, given some vector of variables,
Z t=( yt, . . ., yt  s1 + 1, Xt, . . ., Xt  s2 + 1), t = 1, . . ., T, 7
where s = max{s1, s2}, and X is a vector. Additionally, versus
assume that i = 1, . . ., n models are estimated. Now, Z
y
define the recursive m-estimator for the parameter HA : max EF1 uAZ t ; h1  F0 uAZ t ; h0 2
k2;...;n U
vector associated with model i as:
y
 Fk uAZ t ; hk  F0 uAZ t ; h0 2 /udu > 0;
1 Xt
8
hi;t arg min qi yj ; Z j1 ; hi ;
hi aHi t
js
where /(u) z 0 and mU /u 1; uaU oR; U pos-
R V t V T  1; i 1; . . . ; n 5 sibly unbounded. Note that for a given u, we compare
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 175

conditional distributions in terms of their (mean square) CDF of current (and lagged) output and hours worked,
distance from the true distribution. The statistic is: say. In general, their testing framework can be used to
Z address questions of the following sort: (i) For a given
ZP max ZP;u 1; k/udu; 9 RBC model, what is the relative usefulness of different
k2;...;n U sets of calibrated parameters for mimicing different
where dynamic features of output growth? (ii) Given a fixed
set of calibrated parameters, what is the relative
1 X T 1
performance of RBC models driven by shocks with
ZP;u 1; k p 1fyt1 Vug
P tR a different marginal distribution?
More specifically, consider m RBC models, and
 F1 uAZ t ; h1;t 2  1fyt1 Vug
assume that the variables of interest are output and
 Fk uAZ t ; hk;t 2 : 10 lagged output. Now, set model 1 as the benchmark
model. Let Dlog Xt, t = 1, . . ., T denote actual histor-
ical output (growth rates), and let Dlog Xj,n, j = 1, . . .,
Here, each model is estimated via QMLE, so that m and n = 1, . . ., S, denote the output series simulated
in terms of the above notation, qi =  ln fi, where fi is under model j, where S denotes the length of the
the conditional density associated with P model i, and simulated sample. Denote Dlog Xj,n(hj;T ), n = 1, . . ., S,
t
hi;t is defined as hi;t arg max hi aHi 1t js ln fi yj ; j = 1, . . ., m to be a sample of length S drawn
j1
Z ; hi , R V t V T  1, i = 1, . . ., n. For further (simulated) from model j and evaluated at the param-
details, please refer to Corradi and Swanson (2003a). eters estimated under model j, where parameter esti-
Clearly, the above approach can be used to evaluate mation is done using the T available historical
multiple non-linear forecasting models. Now, assume observations. Further, let Yt = (Dlog Xt, Dlog Xt  1),
that focus centers on evaluating the joint dynamics of Yj,n(hj;T ) = (Dlog Xj,n(hj;T ), Dlog Xj,n  1(hj;T )), and let
a non-linear prediction model, say in the form of a F0(u; h0) denote the distribution of Yt evaluated at u
RBC model. Corradi and Swanson (2003c) develop a and Fj (u; hyj ) denote the distribution of Yj,n(hyj ), where
statistic based on comparison of historical and simu- hyj is the probability limit of hj;T , taken as T ! l, and
lated distributions. As the RBC data are simulated where uaUoR2 , possibly unbounded. As above,
using estimated parameters (as well as previously accuracy is measured in terms of squared error. The
calibrated parameters), the limiting distribution of squared (approximation) error associated with model
their test statistic is a Gaussian process with a covari- j, j = 1, . . ., m, is measured in terms of the (weighted)
ance kernel that reflects the contribution of parameter average over U of (( Fj(u; hjy)  F0(u; h0))2), where
estimation error. This limiting distribution is thus not uaU, and U is a possibly unbounded set on R2. Thus,
nuisance parameter free, and critical values cannot be the rule is to choose Model 1 over Model 2 if
tabulated. In order to obtain valid asymptotic critical Z
y
values, they suggest two block bootstrap procedures, F1 u; h1  F0 u; h0 2 /udu
each of which depends on the relative rate of growth U
Z
of the actual and simulated sample size. In addition, y
< F2 u; h2  F0 u; h0 2 /udu;
the standard issue of singularity that arises when U
testing RBC models is circumvented by considering
a subset of variables (and their lagged values) for where mU /udu 1 and /(u) z 0 for all uaU oR2.
which a non-singular distribution exists. For example, For any evaluation point, this measure defines a norm
in the case of RBC models driven by only one shock, and is a typical goodness-of-fit measure. Note that
say a technology shock, their approach can be used to within our context, the hypotheses of interest are:
evaluate the models joint CDF of current and lagged Z
output, including autocorrelation and second moment y
H0 : max F0 u; h0  F1 u; h1 2
structures, etc. In the case of models driven by two j2;...;m U
shocks, say a technology and a preference shock, the y
approach can be used for the comparison of the joint  F0 u  Fj u; hj 2 /uduV0
176 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

and and
Z
y Zj;T ;S u
HA : max F0 u  F1 u; h1 2
j2;...;m U !2
y 2
1 XT
1 XS
 F0 u  Fj u; hj /udu > 0: 1fYt Vug  1fY1;n h1;T Vug
T t1 S n1
Thus, under H0, no model can provide a better !2
1 XT
1 XS
approximation (in a squared error sense) to the distri-  1fYt Vug  1fYj;n hj;T Vug ;
bution of Yt than the approximation provided by model T t1 S n1
1. If interest focuses on confidence intervals, so that
the objective is to approximate Pr(u V Yt V u then with hj;T an estimator of hyj that satisfies Assumption 2
the null and alternative hypotheses can be stated as: below. See Corradi and Swanson (2003c) for further
y y details.
H0V : max F1 u; h1  F1 u; h1  F0 u; h0 Another measure of distributional accuracy avail-
j2;...;m
able in the literature (see e.g. White, 1982; Vuong,
y y
F0 u; h0 2  Fj u; hj  Fj u; hj 1989), is the KLIC, according to which we should
choose Model 1 over Model 2 if:3
F0 u; h0  F0 u; h0 2 V0:
y y
versus Elog f1 Yt ; h2  log f2 Yt ; h2 > 0:
y y
HAV: max F1 u; h1  F1 u; h1  F0 u; h0 The KLIC is a sensible measure of accuracy, as it
j2;...;m
y y chooses the model, which on average gives higher
F0 u; h0 2  Fj u; hj  Fj u; hj probability to events, which have actually occurred.

F0 u; h0  F0 u; h0 2 > 0: Also, it leads to simple Likelihood Ratio tests. Inter-
estingly, Fernandez-Villaverde and Rubio-Ramirez
If interest focuses on testing the null of equal (2001) have shown that the best model under the
accuracy of two distribution models (analogous to KLIC is also the model with the highest posterior
the pairwise conditional mean comparison setup of probability. The above approach is an alternative to
Diebold & Mariano, 1995), we can simply state the the KLIC that should be viewed as complementary in
hypotheses as: some cases, and preferred in others. For example, if
Z we are interested in measuring accuracy over a spe-
y
H 0VV: F0 u; h0  F1 u; h1 2 cific region, or in measuring accuracy for a given
U confidence interval, this cannot be done in an obvious
y
 F0 u  Fj u; hj 2 /udu 0 manner using the KLIC, while it can easily be done
using our measure. As an illustration, assume that we
versus wish to evaluate the accuracy of different models in
Z approximating the probability that the rate of growth
y
HW
A : F0 u  F1 u; h1 2  F0 u of output is say between 0.5% and 1.5%. We can do
U
so quite easily using the squared error criterion, but
y
 Fj u; hj 2 /udu p 0; not using the KLIC. Furthermore, we often do not
have an explicit form for the density implied by the
In p
order
to test H0 versus HA, the relevant test statistic various models we are comparing. Of course, model
is T ZT ;S where:2 comparison can be done using kernel density estima-
Z
ZT ;S max Zj;T ;S u/udu; 11
j2;...;m U 3
Recently, Giacomini (2002) proposes an extension, which
uses a weighted (over Yt) version of the KLIC, and Kitamura (2002)
2
H 0V versus HAVand H 0VVversus HAVVcan be tested in a similar suggests a generalization for choosing among models that satisfy
manner. some conditional moment restrictions.
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 177

tors, within the KLIC framework. However, this leads area, although progress has been made, as discussed
to tests with non-parametric rates (see e.g. Zheng, in Christoffersen and Diebold (1996); Christoffersen
2000). On the other hand, comparison via our squared and Diebold (1997); Clements and Hendry (1996);
error measure of accuracy is carried out using empir- Granger (1993); Granger (1999); Weiss (1996). That
ical distributions, so that resulting test statistics con- said, it is often difficult to come up with asymptoti-
verge at parametric rates. cally valid inferential strategies using standard esti-
Point forecast production and evaluation continues mation procedures (that essentially minimize one-step
to receive considerable attention, and can perhaps be ahead errors) for many varieties of non-linear models,
viewed as a leading indicator for the predictive from smooth transition models to projection pursuit
density literature. There are now a host of tests based and wavelet models. In some contexts it is even
on traditional squared error loss criteria, but in addi- difficult to establish the consistency of some econo-
tion tests based on directional forecast accuracy and metric parameter estimates, such as cointegrating
sign tests, tests of forecast encompassing, as well as vectors in certain non-linear cointegration models,
measures and tests based on other loss functions. and threshold parameters in some types of regime-
Swanson and White (1997a) survey a number of the switching models. In sum, estimation and in-sample
important contributions, which include Chao, Corradi inference of non-linear forecasting models remains a
and Swanson (2001); Chatfield (1993); Clark and potentially difficult task, with much work remaining
McCracken (2001); Clements and Hendry (1993); to be done. Nevertheless, there are many recent papers
Diebold and Chen (1996); Diebold and Mariano that propose novel approaches to estimation, such as
(1995); Hansen (2001); Hansen, Heaton and Luttmer the variety of new cross-validation related techniques
(1995); Hansen and Jeganathan (1997); Harvey, Ley- in the area of neural nets.
bourne and Newbold (1997); Harvey, Leybourne and A general problem with non-linear models is the
Newbold (1998); Linton, Maasoumi and Whang curse of dimensionality and the fact that such
(2003); McCracken (1999); Pesaran and Timmerman models tend to have a large number of parameters
(1992); Pesaran and Timmerman (1994); Pesaran and (at least relative to the available number of macro-
Timmerman (2000); Stekler (1991); Stekler (1994); economic data points)how to keep the number of
West (1996); West (2001); West and McCracken parameters at a tractable level? This sort of issue is
(2002), among others. A number of these papers relevant to the specification of many varieties of non-
emphasise the role of parameter estimation uncertain- linear models, including smooth transition models
ty in testing for equal forecast accuracy, or that one (see e.g. Granger & Terasvirta, 1993a,b) and neural
model forecast encompasses another, as well as the network models (see e.g. Swanson & White, 1995,
consequences of the models being nested. Neverthe- 1997a,b). A counter to the fear that such models may
less, much remains to be done with regard to making be overfitting in-samplein the sense of picking up
these tests applicable to non-linear models, and con- transient, accidental connections between variables
structing tests using non-linear and/or non-differentia- is of course to compare the models on out-of-sample
ble loss functions, as well as allowing for parameter performance, or on a hold-out sample. The data-
estimation error and misspecification when the com- snooping procedures developed by White (2000), and
parisons involve non-linear models. used in Sullivan, Timmerman and White (1999);
The above paragraph notes the role of the loss Sullivan, Timmerman and White (2001); Sullivan,
function in determining how accuracy is to be Timmerman and White (2003), can also be used, but
assessed. However, there is also the issue of whether this would appear to be an open area, with much room
the in-sample model estimation criterion and out-of- for advance. For example, the extension of the data-
sample forecast accuracy criterion should be matched. snooping methodology to multivariate models awaits
For example, it is only recently that much attention attention, where the sheer magnitude of the problem
has been given to the notion that the same loss can quickly grow out of hand.
function used in-sample for parameter estimation is Finally, although not germane to forecasting, some
often that which should be used out-of-sample for models have parameters that are readily interpretable,
forecast evaluation. Much remains to be done in this whilst others are less clear. The study by De Gooijer
178 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

and Vidiella-i-Anguera (2004) extends non-linear transition exponential smoothing approach. Parame-
cointegration to allow the equilibrium, cointegrating ters are estimated for the method by minimizing the
relationship to depend upon the regime. Difficult sum of squared deviations between realized and
issues arise when cointegration is no longer global, forecast volatility. Using stock index data, the new
and the theory-justification for the long-run relation- method gives encouraging results when compared to
ship is less clear. Clements and Galvao (2004) review fixed-parameter exponential smoothing and a variety
specification and estimation procedures in systems of GARCH models.
when cointegration is global, and evaluate the fore- Bradley and Jansen (2004) propose a model in
casting ability of non-linear systems in the context of which the dynamics that characterize stock returns are
interest rate prediction, building on earlier contribu- allowed to differ in periods following a large swing in
tions by Anderson (1997); van Dijk and Franses stock returnsthat is, a non-linear state-dependent
(2000); Kunst (1992), inter alia. Using a variety of model. Their approach allows them to test for the
forecast evaluation methods, De Gooijer and Vidiella- existence of non-linearities in returns, and to estimate
i-Anguera establish the superiority of their model the size of the shock that is required to cause the non-
from a forecasting perspective. linear behavior.
Another new model is developed by Franses, Paap
and Vroomen (2004), who propose a model in which a
key autoregressive parameter depends on a leading 4. Empirical issues
indicator variable. This model captures the notion that
some variables are relevant for forecasting only once In this section we discuss various practical issues.
in a while, and it mimics some of the ideas put In contrast to linear models, the design of non-linear
forward in Franses and Paap (2002). The autoregres- models for actual data and the estimation of parame-
sive parameter is constant unless a linear function of ters are less straightforward.
the leading indicator plus a disturbance term exceeds a How should we select a model?4 Should all the
certain threshold level. The authors discuss issues observations be used, or should models be estimated
relating to estimation, inference and forecasting, and and/or evaluated against specific (dynamic) features
the relationship of their model to existing non-linear of the historical record? Should we split the sample
models. The model is applied to forecasting unem- into in- and out-of-sample periods, and if so, where
ployment, and is shown to be capable of capturing the should the split occur?
sharp increases in unemployment in recessions, and to Boero and Marrocu (2004) provide evidence relat-
provide competitive forecasts compared to alternative ed to some of these questions. They analyze the out-
models. of-sample performance of SETAR models relative to a
A number of possible approaches are available to linear AR and a GARCH model using daily data for
account for the possibility that the parameters in the Euro effective exchange rate. Their evaluation is
forecasting models are changing over time. Taylor conducted on point, interval and density forecasts,
(2004) uses adaptive exponential smoothing methods unconditionally, over the whole forecast period, and
that allow smoothing parameters to change over time, conditional on specific regimes. Their results show
in order to adapt to changes in the characteristics of that the GARCH model is better able to capture the
the time series. More specifically, he presents a new
adaptive method for predicting the volatility in finan- 4
There is a growing literature on nonparametric and semi-
cial returns, where the smoothing parameter varies as parametric forecasting methods and models. Traditionally this
a logistic function of user-specified variables. The literature has focused on the conditional mean, but a number of
approach is analogous to that used to model time- recent papers have looked at nonparametric estimation of other
varying parameters in smooth transition GARCH aspects of conditional ditributions, such as quartiles, intervals and
models. These non-linear models allow the dynamics density regions: see e.g. De Gooijer, Gannoun and Zerom (2002);
Matzner-Lber, Gannoun and De Gooijer (1998); Samanta (1989),
of the conditional variance model to be influenced by and the references therein. Developments in these areas look set to
the sign and size of past shocks. These factors can also continue apace with the more parametric approaches considered in
be used as transition variables in the new smooth this issue.
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 179

distributional features of the series and to predict (2000), who consider daily, weekly and monthly data,
higher-order moments than the SETAR models. How- and find distinct models for different temporal aggre-
ever, their results also indicate that the performance of gation levels. But, what happens if we aggregate over
the SETAR models improves significantly conditional the cross-section dimension? Marcellino (2004)
on being in specific regimes. In a related study, argues that cross-section aggregation of countries,
Corradi and Swanson (2004) focus on various with constant weights over time, may produce
approaches to assessing predictive accuracy. One of smoother series better suited for linear models, while
the main conclusions of their study is that there are aggregation with time-varying weights (and the pres-
various easy to apply statistics that can be constructed ence of common shocks) is more likely to generate a
using out-of-sample conditional-moment conditions, role for non-linear modeling of the resultant series.
which are robust to the presence of dynamic misspe- Marcellino (2004) fits a variety of non-linear and
cification. Because estimated models are approxima- time-varying models to aggregate EMU macroeco-
tions to the DGP and likely to be mis-specified in nomic variables, and compares them with linear
unknown ways, tests that are robust in this sense are models. He assesses the quality of these models in a
obviously desirable. They provide an illustration of real-time forecasting framework. It is found that often
model selection via predictive ability testing involving non-linear models perform best.
the US money-income relation, and demonstrate the Of course the bottom line is: Are there clear-cut
relevance of the various testing methods. examples where actual forecast improvements are
Next, which non-linear models should be enter- delivered by non-linear models? Provision of such
tained in any specific instance? This question can be examples might serve to allay the fears held by many
answered by looking at the theoretical properties of sceptics. Three nice recent examples of this sort are
the models, as well as the specific properties of the Clements and Galvao (2004); Sensier et al. (2004);
data under scrutiny. For example, Dahl and Hylleberg Genc ay and Selc uk (2004), all of whom show the
(2004) give a nice review of four non-linear models, relevance of non-linear models for forecasting. The
namely, Hamiltons flexible non-linear regression latter paper suggests that the use of non-linearities is
model (Hamilton, 2001), artificial neural networks crucial for making sensible statements about the tail
and two versions of the projection pursuit regression behavior of asset returns. In particular, Genc ay and
model. The forecasting performance of these four Selcuk (2004) investigate the relative performance of
approaches is compared for US industrial production VaR models with the daily stock market returns of
and an unemployment rate series, in a real-time nine different emerging markets. In addition to well-
forecasting exercise, whereby the specification of known modeling approaches such as the variance
the model is chosen, and the parameters re-estimated covariance method and historical simulation, they
at each step, as the forecast origin moves through the employ extreme value theory (EVT) to generate
available sample. The paper provides some guidance VaR estimates and provide the tail forecasts of daily
for model selection, and evaluates the resulting fore- returns at the 0.999 percentile along with 95% con-
casts using standard MSE-related criteria as well as fidence intervals for stress testing purposes. The
direction-of-change tests. Interestingly, they find evi- results indicate that EVT based VaR estimates are
dence that some of the flexible non-linear regression more accurate at higher quantiles. According to
models perform well relative to the non-linear bench- estimated Generalized Pareto Distribution parameters,
mark. certain moments of the return distributions do not
Further, how can we reliably estimate the model exist in some countries. In addition, the daily return
parameters? Can we address the problem whereby we distributions have different moment properties in their
only find local optima? How can we select good right and left tails. Therefore, risk and reward are not
starting values in non-linear optimization? Can we equally likely in these economies. The other two
design methods to test the non-linear models, based papers consider macroeconomic variables. Sensier et
on in-sample estimation and in-sample data? How al. (2004) examine the role of domestic and interna-
should we aggregate and analyze our data? Some of tional variables for predicting classical business
these questions are examined in van Dijk and Franses cycles regimes in four European countries, where
180 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

the regimes are classified as binary variables. One tion, but developing coherent strategies for such data
finding is that composite leading indicators and remains an important task. It will be interesting to see
interest rates of Germany and the US have substantial to what extent developments in these areas give rise to
predictive value. Clements and Galvao (2004) test tangible gains in terms of forecast performancewe
whether there is non-linearity in the response of short remain hopeful that great strides will be made in the
and long-term interest rates to the spread. They assess near future.
the out-of-sample predictability of various models and
find some evidence that non-linear models lead to
more accurate short-horizon forecasts, especially of Acknowledgements
the spread. And, as mentioned, De Gooijer and
Vidiella-i-Anguera (2004) report more marked gains The authors wish to thank Valentina Corradi and
to allowing for non-linearities. Dick van Dijk for helpful conversations, and are
grateful to Jan De Gooijer for reading and comment-
ing on the manuscript. Swanson gratefully acknowl-
5. Concluding remarks edges financial support from Rutgers University in the
form of a Research Council grant.
In this paper we have summarized the state-of-the-
art in forecast construction and evaluation for non-
linear models, and in selection among alternative non-
References
linear prediction models. We conclude that the day is
still long off when simple, reliable and easy to use Anderson, H. M. (1997). Transaction costs and non-linear adjust-
non-linear model specification, estimation and fore- ment towards equilibrium in the US treasury bill market. Oxford
casting procedures will be readily available. Never- Bulletin of Economics and Statistics, 59, 465 484.
theless, there are grounds for optimism. The papers in Andrews, D. W. K. (1997). A conditional Kolmogorov test. Econ-
this issue suggest that careful application of existing ometrica, 65, 1097 1128.
Bai, J. (2001). Testing parametric conditional distributions of dy-
techniques, and new models and tests, can result in namic models. Review of Economics and Statistics, in press.
significant advances in our understanding. Supposing Boero, G., & Marrocu, E. (2004). The performance of SETAR
that the world is inherently non-linear, then as com- models: A regime conditional evaluation of point, interval and
putational capabilities increase, more complex models density forecasts. International Journal of Forecasting, 20,
305 320.
become amenable to analysis, allowing the possibility
Bradley, M. D., & Jansen, D. W. (2004). Forecasting with a non-
that future generations of models will significantly linear dynamic model of stock returns and industrial production.
outperform linear models, especially if such models International Journal of Forecasting, 20, 321 342.
become truly multivariate. Chatfield, C. (1993). Calculating interval forecasts. Journal of Busi-
Much remains to be done in the areas of specifi- ness and Economic Statistics, 11, 121 135.
cation, estimation, and testing, with important issues Christoffersen, P. F. (1998). Evaluating interval forecasts. Interna-
tional Economic Review, 39, 841 862.
of non-differentiability, parameter-estimation error Christoffersen, P. F., & Diebold, F. X. (1996). Further results on
and data-mining remaining to be addressed. In addi- forecasting and model selection under asymmetric loss. Journal
tion, a further area for research (both empirical and of Applied Econometrics, 11, 561 572.
theoretical) is the following. Suppose one has data Christoffersen, P. F., & Diebold, F. X. (1997). Optimal prediction
with trend components, seasonality, non-linearity and under asymmetric loss. Econometric Theory, 13, 808 817.
Christoffersen, P. F., Hahn, J., & Inoue, A. (2001). Testing and
outliers. How should one proceed? This is a complex comparing Value-at-Risk measures. Journal of Empirical Fi-
issue, due to the possible inter-reactions between these nance, 8, 325 342.
elements, e.g. neglecting outliers may suggest non- Clark, T. E., & McCracken, M. W. (2001). Tests of equal forecast
linearity (see van Dijk, Franses & Lucas, 1999) and accuracy and encompassing for nested models. Journal of
the trend can be intertwined with seasonality, as in Econometrics, 105, 85 110.
Clements, M. P., & Galvao, A. B. (2004). A comparison of tests of
periodic models of seasonality (see Franses & Paap, non-linear cointegration with an application to the predictability
2004 for an up-to-date survey). Much has been said of US interest rates using the term structure. International Jour-
about modeling each of these characteristics in isola- nal of Forecasting, 2004, 219 236.
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 181

Clements, M. P., & Hendry, D. F. (1993). On the limitations of threshold cointegrated systems. International Journal of Fore-
comparing mean squared forecast errors: comment. Journal of casting, 20, 237 253.
Forecasting, 12, 617 637. De Gooijer, J. G., Gannoun, A., & Zerom, D. (2002). Mean
Clements, M. P., & Hendry, D. F. (1996). Multi-step estimation for squared error properties of the kernel-based multi-stage median
forecasting. Oxford Bulletin of Economics and Statistics, 58, predictor for time series. Statistics and Probability Letters, 56,
657 684. 51 56.
Clements, M. P., & Hendry, D. F. (1999). Forecasting non-station- Diebold, F. X., & Chen, C. (1996). Testing structural stability with
ary economic time series. Cambridge, MA: MIT Press [The endogenous breakpoint: A size comparison of analytic and boot-
Zeuthen Lectures on Economic Forecasting]. strap procedures. Journal of Econometrics, 70, 221 241.
Clements, M. P., & Krolzig, H. -M. (1998). A comparison of the Diebold, F. X., & Mariano, R. S. (1995). Comparing predictive
forecast performance of Markov-switching and threshold au- accuracy. Journal of Business and Economic Statistics, 13,
toregressive models of US GNP. Econometrics Journal, 1, 253 263.
C47 C75. Diebold, F. X., & Nason, J. A. (1990). Nonparametric exchange rate
Clements, M. P., & Krolzig, H. -M. (2002). Can oil shocks explain prediction. Journal of International Economics, 28, 315 332.
asymmetries in the US Business Cycle? Empirical Economics, Diebold, F. X., Gunther, T., & Tay, A. S. (1998). Evaluating density
27, 185 204. forecasts with applications to finance and management. Interna-
Clements, M. P., & Krolzig, H. -M. (2003). Business cycle asym- tional Economic Review, 39, 863 883.
metries: characterisation and testing based on Markov-switching Diebold, F. X., Hahn, J., & Tay, A. S. (1999). Multivariate density
autoregressions. Journal of Business and Economic Statistics, forecast evaluation and calibration in financial risk manage-
21, 196 211. ment: High frequency returns on foreign exchange. Review of
Clements, M. P., & Smith, J. (1999). A Monte Carlo study of the Economics and Statistics, 81, 661 673.
forecasting performance of empirical SETAR models. Journal Fernandez-Villaverde, J., & Rubio-Ramirez, J. F. (2001). Compar-
of Applied Econometrics, 14, 124 141. ing dynamic equilibrium models to data, manuscript. University
Clements, M. P., & Smith, J. (2000). Evaluating the forecast of Pennsylvania.
densities of linear and non-linear models: Applications to out- Franses, P. H., & Paap, R. (2002). Censored latent effects autore-
put growth and unemployment. Journal of Forecasting, 19, gression, with an application to US unemployment. Journal of
255 276. Applied Econometrics, 17, 347 366.
Clements, M. P., & Smith, J. (2002). Evaluating multivariate fore- Franses, P. H., & Paap, R. (2004). Periodic time series models.
cast densities: A comparison of two approaches. International Oxford: Oxford University Press.
Journal of Forecasting, 18, 397 407. Franses, P. H., & van Dijk, D. J. C. (2000). Non-linear time series
Corradi, V., & Swanson, N. R. (2002). A consistent test for out of models in empirical finance. Cambridge: Cambridge University
sample non-linear predictive ability. Journal of Econometrics, Press.
110, 353 381. Franses, P. H., Paap, R., & Vroomen, B. (2004). Forecasting un-
Corradi, V., & Swanson, N. R. (2003a). The block bootstrap for employment using an autoregression with censored latent ef-
recursive m-estimators with applications to predictive evalua- fects parameters. International Journal of Forecasting, 20,
tion, Working Paper, Queen Mary, University of London, Uni- 255 271.
versity of Exeter and Rutgers University. Genc ay, R., & Selc uk, F. (2004). Extreme value theory and Value-
Corradi, V., & Swanson, N. R. (2003b). A test for comparing multi- at-Risk: Relative performance in emerging markets. Internation-
ple misspecified conditional distribution models, Working Pa- al Journal of Forecasting, 20, 287 303.
per, Queen Mary, University of London and Rutgers University. Giacomini, R. (2002). Comparing density forecasts via weighted
Corradi, V., & Swanson, N. R. (2003c). Evaluation of dynamic Likelihood Ratio tests: Asymptotic and bootstrap methods,
stochastic general equilibrium models based on a distributional manuscript. San Diego: University of California.
comparison of simulated and historical data. Journal of Econo- Giacomini, R., & White, H. (2003). Tests of conditional predictive
metrics (In press). ability, manuscript. San Diego: University of California.
Corradi, V., & Swanson, N. R. (2004). Some recent developments Granger, C. W. J. (1999). Outline of forecast theory using general-
in predictive accuracy testing with nested models and (generic) ized cost functions. Spanish Economic Review, 1, 161 173.
non-linear alternatives. International Journal of Forecasting, Granger, C. W. J., & Terasvirta, T. (1993a). Modelling non-linear
20, 185 199. economic relationships. New York: Oxford University Press.
Dacco, R., & Satchell, S. (1999). Why do regime-switching models Granger, C. W. J., & Terasvirta, T. (1993b). On the limitations of
forecast so badly. Journal of Forecasting, 18, 1 16. comparing mean squared forecast errors: Comment. Journal of
Dahl, C. M., & Hylleberg, S. (2004). Flexible regression models Forecasting, 12, 651 652.
and relative forecast performance. International Journal of Hamilton, J. D. (1983). Oil and the macroeconomy since World
Forecasting, 20, 201 217. War II. Journal of Political Economy, 91, 228 248.
De Gooijer, J. G., & Kumar, K. (1992). Some recent developments Hamilton, J. D. (1989). A new approach to the economic analysis of
in non-linear time series modelling, testing, and forecasting. nonstationary time series and the business cycle. Econometrica,
International Journal of Forecasting, 8, 135 156. 57, 357 384.
De Gooijer, J. G., & Vidiella-i-Anguera, A. (2004). Forecasting Hamilton, J. D. (1996). This is what happened to the oil price-
182 M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183

macroeconomy relationship. Journal of Monetary Economics, Pagan, A. R. (1997b). Towards an understanding of some Busi-
38, 215 220. ness Cycle characteristics. Australian Economic Review, 30,
Hamilton, J. D. (2000). What is an oil shock? Mimeo. La Jolla, CA: 1 15.
Department of Economics, UCSD. Pesaran, M. H., & Timmerman, A. G. (1992). A simple nonpara-
Hamilton, J. D. (2001). A parametric approach to flexible non- metric test of predictive performance. Journal of Business and
linear inference. Econometrica, 69, 537 573. Economic Statistics, 10, 461 465.
Hansen, P. R. (2001). An unbiased and powerful test for superior Pesaran, M. H., & Timmerman, A. G. (1994). A generalization of
predictive ability, manuscript. Brown University. the non-parametric Henriksson Merton test of market timing.
Hansen, L. P., & Jeganathan, R. (1997). Assessing specification Economics Letters, 44, 1 7.
errors in stochastic discount factor models. Journal of Finance, Pesaran, M. H., & Timmerman, A. G. (2000). A recursive model-
52, 557 590. ling approach to predicting UK stock returns. Economic Jour-
Hansen, L. P., Heaton, J., & Luttmer, E. G. J. (1995). Econometric nal, 159 191.
evaluation of asset pricing models. Review of Financial Studies, Raymond, J. E., & Rich, R. W. (1997). Oil and the macroeconomy:
8, 237 279. A Markov state-switching approach. Journal of Money, Credit
Harding, D., & Pagan, A. (2001). Dissecting the cycle: A meth- and Banking, 29, 193 213.
odological investigation. Journal of Monetary Economics, 49, Samanta, M. (1989). Nonparametric estimation of conditional quan-
365 381. tiles. Statistics and Probability Letters, 7, 407 412.
Harvey, D., Leybourne, S., & Newbold, P. (1997). Testing the Sensier, M., Artis, M., Osborn, D. R., & Birchenhall, C. (2004).
equality of prediction mean squared errors. International Jour- Domestic and international influences on business cycle re-
nal of Forecasting, 13, 281 291. gimes in Europe. International Journal of Forecasting, 20,
Harvey, D. I., Leybourne, S., & Newbold, P. (1998). Tests for 343 357.
forecast encompassing. Journal of Business and Economic Sta- Sichel, D. E. (1994). Inventories and the three phases of the busi-
tistics, 16, 254 259. ness cycle. Journal of Business and Economic Statistics, 12,
Hong, Y. (2001). Evaluation of out of sample probability density 269 277.
forecasts with applications to S&P 500 stock prices, mimeo. Stekler, H. O. (1991). Macroeconomic forecast evaluation techni-
Cornell University. ques. International Journal of Forecasting, 7, 375 384.
Hooker, M. A. (1996). Whatever happened to the oil price-macro- Stekler, H. O. (1994). Are economic forecasts valuable? Journal of
economy relationship? Journal of Monetary Economics, 38, Forecasting, 13, 495 505.
195 213. Sullivan, R., Timmerman, A. G., & White, H. (1999). Data-snoop-
Kitamura, Y. (2002). Econometric comparisons of conditional mod- ing, technical trading rules and the bootstrap. Journal of Fi-
els. University of Pennsylvania, Mimeo. nance, 54, 1647 1692.
Koop, G., & Potter, S. (2000). Non-linearity, structural breaks, or Sullivan, R., Timmerman, A. G., & White, H. (2001). Dangers of
outliers in economic time series. In W. A. Barnett, D. F. data-driven inference: The case of calendar effects in stock re-
Hendry, S. Hylleberg, T. Terasvirta, D. Tjostheim, & A. Wurtz turns. Journal of Econometrics, 249 286.
(Eds.), Non-linear econometric modelling in time series analy- Sullivan, R., Timmerman, A. G., & White, H. (2003). Forecast
sis ( pp. 61 78). Cambridge: Cambridge University Press. evaluation with shared data sets. International Journal of Fore-
Krolzig, H. -M. (2003). Predicting markov-switching vector autor- casting, 19, 217 227.
egressive processes. Journal of Forecasting (In press). Swanson, N. R., & White, H. (1995). A model selection approach
Kunst, R. M. (1992). Threshold cointegration in interest rates, Dis- to assessing the information in the term structure using linear
cussion Paper 92-26. Department of Economics, UC San Diego. models and artificial neural networks. Review of Economics and
Linton, O., Maasoumi, E., & Whang, Y. J. (2003). Consistent test- Statistics, 13, 265 279.
ing for stochastic dominance under general sampling schemes, Swanson, N. R., & White, H. (1997a). Forecasting economic time
manuscript. LSE, Southern Methodist University and Ewha series using adaptive versus nonadaptive and linear versus non-
University. linear econometric models. International Journal of Forecast-
Marcellino, M. (2004). Forecasting EMU macroeconomic varia- ing, 13, 439 461.
bles. International Journal of Forecasting, 20, 359 372. Swanson, N. R., & White, H. (1997b). A model selection approach
Matzner-Lber, E., Gannoun, A., & De Gooijer, J. G. (1998). Non- to real-time macroeconomic forecasting using linear models and
parametric forecasting: A comparison of three kernel-based artificial neural networks. Review of Economics and Statistics,
methods. Communications in Statistical Theory Methods, 27, 79, 540 550.
1593 1617. Taylor, J. W. (2004). Volatility forecasting with smooth transition
McCracken, M. W. (1999). Asymptotics for out of sample tests of exponential smoothing. International Journal of Forecasting,
causality, Working Paper. Louisiana State University. 20, 273 286.
Mork, K. A. (1989). Oil and the macroeconomy when prices go up Terasvirta, T., & Anderson, H. M. (1992). Characterizing non-
and down: An extension of Hamiltons results. Journal of Polit- linearities in business cycles using smooth transition autor-
ical Economy, 97, 740 744. egressive models. Journal of Applied Econometrics, 7,
Pagan, A. R. (1997a). Policy, theory and the cycle. Oxford Review 119 139.
of Economic Policy, 13, 19 33. Tiao, G. C., & Tsay, R. S. (1994). Some advances in non-linear and
M.P. Clements et al. / International Journal of Forecasting 20 (2004) 169183 183

adaptive modelling in time-series. Journal of Forecasting, 13, Biographies: Michael P. CLEMENTS is a Reader in the Depart-
109 131. ment of Economics at the University of Warwick. His research
Tong, H. (1995). A personal overview of non-linear time series interests include time-series modeling and forecasting. He has
analysis from a chaos perspective. Scandinavian Journal of Sta- published in a variety of international journals, has co-authored
tistics, 22, 399 445. two books on forecasting, and co-edited (with David F. Hendry) of
van Dijk, D. J. C., & Franses, P. H. (2000). Non-linear error cor- A Companion to Economic Forecasting 2002, Blackwells. He is
rection models for interest rates in The Netherlands. In W. Bar- also an editor of the International Journal of Forecasting.
nett, D. F. Hendry, S. Hylleberg, T. Terasvirta, D. Tjstheim, &
A. W. Wurtz (Eds.), Non-linear econometric modelling in time Philip Hans FRANSES is Professor of Applied Econometrics and
series analysis ( pp. 203 227). Cambridge: Cambridge Univer- Professor of Marketing Research, both at the Erasmus University
sity Press. Rotterdam. He publishes on his research interests, which are applied
van Dijk, D. J. C., Franses, P. H., & Lucas, A. (1999). Testing econometrics, time series, forecasting, marketing research and
for smooth transition non-linearity in the presence of out- empirical finance.
liers. Journal of Business and Economic Statistics, 17,
217 235. Norman SWANSON, a 1994 University of California, San Diego
van Dijk, D. J. C., Franses, P. H., & Terasvirta, T. (2002). Smooth Ph.D. is currently Associate Professor at Rutgers University. His
transition autoregressive modelsa survey of recent develop- research interests include forecasting, financial- and macro-econo-
ments. Econometric Reviews, 21, 1 47. metrics. He is currently an associate editor of the Journal of
Vuong, Q. (1989). Likelihood ratio tests for model selection and Business and Economic Statistics, the International Journal of
non-nested hypotheses. Econometrica, 57, 307 333. Forecasting, and Studies in Non-linear Dynamics and Economet-
Weiss, A. (1996). Estimating time series models using the rele- rics. He has recently been awarded a National Science Founda-
vant cost function. Journal of Applied Econometrics, 11, tion research grant entitled the Award for Young Researchers,
539 560. and is a member of various professional organizations, including
West, K. (1996). Asymptotic inference about predictive ability. the Econometric Society, the American Statistical Association,
Econometrica, 64, 1067 1084. and the American Economic Association. Swanson has recent
West, K. D., & McCracken, M. W. (2002). Inference about predic- publications in Journal of Development Economics, Journal of
tive ability. In M. P. Clements, & D. F. Hendry (Eds.), A com- Econometrics, Review of Economics and Statistics, Journal of
panion to economic forecasting ( pp. 299 321 ). Oxford: Business and Economic Statistics, Journal of the American
Blackwells. Statistical Association, Journal of Time Series Analysis, Journal
White, H. (2000). A reality check for data snooping. Econometrica, of Empirical Finance and International Journal of Forecasting,
68, 1097 1126. among others. Further details about Swanson, including copies of
Zheng, J. X. (2000). A consistent test of conditional parametric all papers cited above, are available at his website: http://
distributions. Econometric Theory, 16, 667 691. econweb.rutgers.edu/nswanson/.

You might also like