Professional Documents
Culture Documents
Threshold Cointegration
Author(s): Nathan S. Balke and Thomas B. Fomby
Source: International Economic Review, Vol. 38, No. 3 (Aug., 1997), pp. 627-645
Published by: Wiley for the Economics Department of the University of Pennsylvania and Institute of Social
and Economic Research -- Osaka University
Stable URL: http://www.jstor.org/stable/2527284 .
Accessed: 26/09/2013 09:24
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Wiley, Economics Department of the University of Pennsylvania, Institute of Social and Economic Research --
Osaka University are collaborating with JSTOR to digitize, preserve and extend access to International
Economic Review.
http://www.jstor.org
THRESHOLD COINTEGRATION*
1. INTRODUCTION
627
2
See, for example, Scarf (1960), Miller and Orr (1966), Bertola and Caballero (1990), Sheshinski
and Weiss (1983), and Bentolila and Bertola (1990).
For simplicity, let et and rmt be i.i.d. mean zero random variables. Equation (1)
represents the equilibriumrelationship between yt and xt, where zt is the deviation
from equilibrium and the cointegratingvector is given by (1, a). Bt, in equation (2),
represents the common stochastic trend of yt and xt.
Rather than a linear autoregression with constant parameters, as in Engle and
Granger (1987), suppose departures from the equilibrium, zt, follow a threshold
autoregression. In particular,
(4)(4) A
/\ x= = y(lZ1+2
2i, _ 1 +
x~~~~~~t 1)2t
2.2. A More General Model. We can generalize the dynamics of the equilib-
rium error by allowing nonzero intercepts and asymmetricthresholds. In this case,
where p('O(L)are lag polynomials and 04i) (i = 1, m, and u) are mean zero random
disturbances with standard deviation a (i).3 Here zt follows a different autoregres-
sion depending on the value of Zt-d. The integer d represents the delay in the error
correction process and reflects the possibility that economic agents or controllers
may react to deviations from the equilibriumwith a lag.
Unfortunately, necessary and sufficient conditions for tPi) and p(i)(L) to guaran-
tee stationarityare still not well understood. Chan and Tong (1985), and Brockwell
et al., (1992) develop a set of sufficient conditions for the general threshold
autoregression, yet they leave open the possibility of a weaker set of sufficient
conditions. For a symmetric threshold autoregression in which the autoregressive
parameters (but not necessarily the intercepts) in the outer regimes are the same,
Tj$stheim (1990) shows a sufficient condition for stationarityis that the roots of the
autoregression in the outer regimes are less than one in absolute value.
Chan et al., (1985) have, however, determined the necessary and sufficient
conditionsfor the case where p(i)(L) = p(') and d = 1. If Jl), au), p(l), and p(u)
satisfy one of the following conditions:
(i) p(l) < 1, p(O)< 1, and p(l)p(U)< 1;
(ii) p(u) < 1, p(l) - 1, and pufl)> 0;
(iii) p(l) < 1, p(u) - 1, and ju) < 0;
(iV) p(l) = p(u) = 1 and /1(u)< 0 < ,u(l);
(v) p(l)p(u)= 1, p(l) < 0, and A(U)+ p(u)A(l)> 0;
then zt is a stationary stochastic process.4 Regardless of the behavior of Z in the
interior regime (m) (zt can display unit root or explosive behavior in the interior)
the nature of zt in the upper and lower regimes determines whether it is stationary.
Note that even if the autoregressivecoefficient, p, is equal one everywhere Zt may
still be stationary. So long as the drift parameters act to push the series back
towardsthe equilibriumband (i.e., N(l) > 0 and 1Au) < 0) the series is stationaryeven
though it has a 'unit root.' This suggests that, in general, just examining the
autoregressiveparameters in the outer regimes is not enough to determine whether
the series is stationary.
Similarly,for the corresponding threshold error correction model the intercepts
as well as the error correction parameters may reflect adjustment towards the
equilibrium.To see this, let /3'X, = B, and a X,=;z where Xt=(ytxt)',ABt=t[B
+ B(L)WABt + mqtmtis a mean zero i.i.d. shock, and z, is the equilibrium error
given by equation (5). The resulting error correction model is given by
where y(l) is a vector containing the error correction parameters. The constant
terms in the error correction model, ,L4', are, in part, a function of the constant
parameter in the equilibriumerror.5Thus, along with the error correction parame-
ters, the constant terms in the error correction model can play a role in drivingthe
system back towards the equilibrium.
= Zt_1+ if Izt_1I<0
5The errorcorrectionparametersare:
AW 1 a ( )
a:( a(i) (i)(i) )
and
re
C(he(L) ((L - ( (L - L) BL)
where p*W(X(L)
=(p(i )(L) -p(i )(1))1(1-L).
How would one determine whether time series are threshold cointegrated?As we
suggested above, threshold cointegration has implications for 'local' as well as
'global'behavior of a time series. Cointegrationis a global characteristicof the time
series while the threshold regimes are local characteristics. The combination of
global and local behavior suggest four possible hypotheses summarized by the
following:
Take as the null hypothesis no cointegration and linearity (upper left cell). There
are three alternative hypotheses: no cointegration and nonlinearity, cointegration
and linearity, and cointegration and nonlinearity. Our particular focus is on this
third alternative hypothesis: threshold cointegration.6
Ideally, one would like to test the no cointegration/linearity null hypothesis
directly against the threshold cointegration alternative.7However, this poses several
difficulties. First, there is a nonstandard inference problem. Not only do we have
unit roots present under the null but there are nuisance parameters present under
the alternativehypothesis that are not present under the null, namely the thresholds
(0(u) and 0(l)). While there is a growing literature on testing when parameters are
not present under the null (see, for example, Davies 1977 and 1987, Andrews and
Ploberger 1994, and Hansen 1996), the theory has been developed for a stationary
null hypothesis. Second, and perhaps more importantly, the class of stationary
threshold models may be too large to permit testing parametricallythe no cointegra-
tion/linear null against the general threshold cointegration alternative. For exam-
ple, for the simple TAR(1) models described above, there are five separate condi-
tions involvingmultiple one-sided tests that must be examined in order to determine
whether the threshold model is stationary,and for a more general threshold model
we may not even know all the conditions that guarantee stationarity.As a conse-
quence, one is likely going to have to resort to setting a particular (and probably
simple) threshold model as the alternativehypothesis in order to test parametrically
the no cointegration/linear null directly;setting up general threshold cointegration
as an alternative is problematic.
As a result, we suggest breaking up the analysis into, first, an examination of the
global behavior of the time series-testing for no cointegration/cointegration-and,
second, an examination of the local behavior of the time series-testing for
nonlinear or threshold behavior. This approach rests on the fact that many of the
existing tools for testing for cointegration are likely to be of use even in the case of
threshold cointegration. If cointegration is found, then testing for nonlinearity and
threshold behavior can help determine the local behavior. In all the analysis below,
we essentially take the Engle-Granger single equation approachto cointegrationby
examining the equilibriumerror; first for stationarityand then for threshold behav-
ior.
6There may be other interesting stationary nonlinear alternatives to the linear no cointegration
null hypothesis and the methodology discussed below may have power against those alternatives.
7Or, alternatively, one may want to set up threshold cointegration as the null model.
(Stock 1987) holds as well.8 This suggests that standardtests for unit roots should be
able to detect the unit roots present in x, and yt and the cointegratingvector a can
be treated as known in the second stage of two stage procedures such as Engle and
Granger (1987).
Furthermore, standard tests for unit roots or of stationarity should be able to
detect the stationarityof zt. As we pointed out above, there are several alternative
parameter settings in which z; is stationary;a unit root could be present in the outer
regimes and z, could still be stationary(cases (ii), (iii), and (iv), above). Nonetheless,
standard linear unit root tests such as the augmented Dickey-Fuller test and the
Phillips-Perron (1988) test should still have power against these nonlinear alterna-
tives. Alternatively, if one takes the null hypothesis as stationarityas is the case in
Bierens and Guo (1993) and Kwiatkowskiet al., (1992) (KPSS), then as long as
certain mixing conditions hold for z; these tests apply to threshold as well as a linear
stationarytime series. Thus, techniques designed to detect unit roots and cointegra-
tion in the linear case should work for the threshold case.
To evaluate the performance of standard tests for unit roots/stationarity, we
conduct a Monte Carlo experiment in which data is generated by a model similar to
that described in equations (1) and (2). The cointegrating relationship is given by:
yt + axt =Zt, where zt follows one of the three threshold models described above.
The common stochastic trend is determined by: yt + ,8xt = Bt, where Bt is a random
walk without drift. In this experiment, a = -2, and ,3 = -3 while the innovationsin
zt and Bt have unit variance and are uncorrelatedwith one another. To be sure that
initial conditions did not unduly affect the nonlinear time series, we allowed a start
up period of 1,000 observationswhen generating the time series. In addition to the
three threshold models, we also generated linear first-order autoregressive series
that have the same first-orderautocorrelationas the related threshold model.9 This
allows us to generate a linear model with (roughly) the same persistence as the
nonlinear model. By comparing the results for the threshold model with that of its
associated linear model, we can better assess the effect of threshold nonlinearityon
the performance of the tests.10
Tables 1 and 2 apply the augmented Dickey-Fuller (ADF) and the Phillips-
Perron (PP) tests for unit roots to the null of no cointegration.1"In Table 1 the
cointegrating vector is assumed to be known, while in Table 2 the cointegrating
TABLE 1. APPROACH:
ENGLE-GRANGERSINGLE-EQUATION COINTEGRATING
VECTORKNOWN*;
NULLHYPOTHESISOF NO COINTEGRATION
(1) Power of ADF Test at 5% Nominal SignificanceLevel
6 EQ-TAR EQ-AR (1) Band-TAR Band-AR (1) RD-TAR RD-AR (1)
T 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10
100 49 16 8 67 30 8 16 11 8 19 10 6 27 13 8 34 11 6
250 98 64 15 99 91 17 61 19 12 75 27 7 89 28 12 94 39 8
500 100 99 54 100 100 54 99 66 16 100 78 17 100 87 18 100 91 20
1000 100 100 97 100 100 97 100 99 29 100 100 46 100 100 38 100 100 55
T 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10
100 95 33 16 94 53 10 25 18 12 32 12 6 50 20 13 55 14 7
250 100 98 22 100 99 23 95 28 15 93 36 9 100 47 16 99 51 10
500 100 100 55 100 100 63 100 91 19 100 88 20 100 99 20 100 97 24
1000 100 100 99 100 100 99 100 100 36 100 100 50 100 100 50 100 100 63
*Entries represent percentiles of 1000 replications. For EQ-TAR and Band-TAR, p = 0.4. For the
RD-TAR, I i I = 1.0. The EQ-AR (1), Band-AR (1), RD-AR (1) are linear AR(1) models that have the same
first order autocorrelations as the corresponding threshold autoregressive models. The number of lags in
the Augmented Dickey -Fuller and Phillips -Perron test =integer [4(T/100 )25] .Both tests also include a
constant term.
TABLE 2. ENGLE-GRANGERSINGLE-EQUATION
APPROACH: COINTEGRATING
VECTORUNKNOWN*;
NULLHYPOTHESIS OF NO COINTEGRATION
(1) Power of ADF Test at 5% Nominal Significance Level
0 EQ-TAR EQ-AR (1) Band-TAR Band-AR (1) RD-TAR RD-AR (1)
T 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10
100 28 13 9 42 18 6 12 11 8 13 9 6 15 11 7 19 10 6
250 90 37 10 98 65 9 34 13 8 46 15 7 61 17 9 73 24 7
500 100 96 17 100 100 26 96 31 10 98 43 9 100 54 11 100 70 10
1000 100 100 64 100 100 82 100 94 15 100 95 21 100 99 18 100 99 27
*Entries represent percentiles of 1000 replications. For EQ-TAR and Band-TAR, p = 0.4. For the
RD-TAR, I I = 1.0. The EQ-AR (1), Band-AR (1), RD-AR (1) are linear AR(1) models that have the same
first order autocorrelations as the corresponding threshold autoregressive models. The cointegrating vector
was estimated by DOLS. The lags in Augmented Dickey-Fuller and Phillips-Perron test
integer [4(T/100 )25]. Both tests include a constant term. Critical values are from MacKinnon (1991).
vector is estimated by the Dynamic OLS (DOLS) method proposed by Stock and
Watson (1993).12 Because the threshold models considered in this simulation are
relatively persistent, the power of the tests in small samples are relatively low. In
larger samples the power of the tests approaches 1.0, which suggests that the tests
are consistent against the threshold alternative. For larger samples and/or smaller
threshold values, both the ADF and the PP tests have more power against the
associated stationarylinear alternativethan against the stationarythreshold alterna-
tive. However, for large threshold values relative to the sample size, the tests
actually have more power against the threshold series than against the linear
autoregression. The reason is that over the course of the Monte Carlo experiment
there will be instances in which the sample fortuitously starts near the threshold
values which, in turn, allows the mean-revertingnature of the nonlinear model to
show up better in small samples.
Interestingly,for the examples considered in Tables 1 and 2, the PP test has more
power than the ADF test. While this is also true for the linear models considered, it
is particularlytrue for the threshold models. Apparently, the nonparametricnature
of the PP test gives it more power against threshold cointegration than the ADF
test. Finally, estimating the cointegratingvector results in a loss of power for both
the ADF and the PP versions of unit root tests. However, the power loss due to
estimating the cointegrating vector is not substantially different for the threshold
models compared with the linear autoregressions.
Table 3 shows results when cointegration is taken to be the null hypothesis and
the cointegratingvector is known. Here the nonlinear models tend to have a larger
size distortion than the correspondinglinear autoregressivemodels except for large
threshold values and small samples. This is similar to the power results for the no
cointegration null discussed above. While there is a substantial size distortion for
the KPSS test, this is true for both linear and nonlinear models and probably has
more to do with the need to increase the window width to account for the
persistence in these time series than it does with nonlinearity. The Bierens-Guo
Cauchy #3 (BG3) test has little size distortion except at large threshold values for
the Band-TAR and RD-TAR and their associated linear ARs.
An alternative to testing for cointegration using the Engle-Granger approach is
the Johansen (1991) full information, maximumlikelihood approach. The problem
with the Johansen method is that it is based on a Gaussian,linear vector autoregres-
sion (VAR) and, like the Dickey-Fuller regression,would be misspecified under the
threshold alternative; the estimates of the linear VAR would reflect the average
across the three regimes. Nonetheless, it may still yield the correct average rank of
Johansen'slong-run matrixand, hence, could detect threshold cointegration.Table 4
presents results for the Johansen trace test for one or more cointegratingvectors.
The implied VAR for the test is one in which an intercept is present in the
cointegratingvector but in which there is no drift in the stochastic trend(s). To the
12
We also calculated the average bias and mean squared error of OLS and DOLS estimates of
the cointegrating vector. For both OLS and DOLS, the bias is typically larger for threshold
cointegration compared with linear cointegration, but not substantially so. These results are
available upon request.
TABLE 3. APPROACH:COINTEGRATING
ENGLE-GRANGERSINGLE-EQUATION VECTORKNOWN*;
NULLHYPOTHESISOF COINTEGRATION
(1) Size of KPSS Test at 5% Nominal SignificanceLevel
0 EQ-TAR EQ-AR(1) Band-TAR Band-AR(1) RD-TAR RD-AR(1)
T 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10
100 21 37 56 17 26 53 38 51 60 34 50 63 30 48 58 26 46 63
250 16 33 65 12 21 55 33 55 73 27 50 73 24 48 72 19 44 71
500 14 31 73 11 20 61 30 59 86 26 54 82 21 51 84 19 47 80
1000 13 27 67 11 17 58 28 55 90 25 47 85 20 45 87 16 39 81
(2) Size of Bierens -Guo Cauchy Test (#3) at 5% Nominal Significance Level
*
Entries represent percentiles of 1000 replications. For EQ-TAR and Band-TAR, p = 0.4. For the
RD-TAR, Ij.t = 1.0. The EQ-AR (1), Band-AR (1), RD-AR (1) are linear AR(1) models that have the same
first order autocorrelations as the corresponding threshold autoregressive models. Null hypothesis is
stationarity around a constant mean. The number of lags used in the KPSS test = integer [4(T/100)25].
extent they can be directly compared (they have slightly different alternative
hypotheses), the power of the Johansen test is greater than the Engle-Granger
approach that uses the ADF test, but has less power than the Engle-Granger
approach with the PP unit root test. Like the Engle-Granger ADF approach, the
power loss is relatively larger when going from linear to threshold models than was
the case for the nonparametric PP test. This leaves open the possibility that more
nonparametric methods like Bierens (1997; this assumes only a linear infinite
moving average representation) may be more effective at detecting cointegration in
the threshold case than the Johansen approach.
TABLE4. JOHANSEN
TRACE
TESTFORCOINTEGRATION*;
NULLHYPOTHESIS
OFNOCOINTEGRATION
(1) Percent rejections at the 5% significance level
6 EQ-TAR EQ-AR(1) Band-TAR Band-AR(1) RD-TAR RD-AR(1)
T 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10 3 5 10
100 29 12 8 38 13 6 11 7 6 10 6 6 14 10 7 16 7 5
250 91 41 10 99 63 11 35 12 7 48 14 7 63 17 7 71 21 8
500 100 99 19 100 99 22 98 32 10 98 36 7 100 52 10 100 62 8
1000 100 100 60 100 100 67 100 92 13 100 94 14 100 99 16 100 99 20
* Entries represent percentiles of 500 replications. For EQ-TAR and Band-TAR, p = 0.4. For the
RD-TAR, I tLI = 1.0. The EQ-AR(1), Band-AR(1), RD-AR(1) are corresponding linear AR(1)
models that have same first order autocorrelations as the threshold autoregressive models. The
VAR includes a constant but with the restriction of no drift in the stochastic trend(s). The number
of lags in the VAR = integer[2*(T/100).25].
that minimize the sum of squared errors. The test statistic is simply the maximum
Wald (sup-Wald) statistic over the possible threshold values.14 For the case of
homoskedasticity,the break points that minimize the sum of squared errorswill also
yield the maximum Wald statistic. For the heteroskedastic case, general het-
eroskedasticity can be accounted for by using White's consistent covariance matrix
estimate for the Wald-statisticwhile the possibility of grouped heteroskedasticityin
which the variance of innovations differs across regimes can be accounted for by
weighted least squares.
Testing linearity against the two-threshold alternativeinvolves nonstandardinfer-
ence in that the threshold values are not identified under the null. Hansen (1996)
has developed asymptotic theory for cases such as this and proposes a simulation
method which can approximatethe asymptoticdistributionof the sup-Wald statistic.
Alternatively, the sampling distribution of the sup-Wald statistic can be calculated
by bootstrap methods.15Taking a linear autoregression as the null model, one can
resample the residuals from this model to create a bootstrap sample from which one
can calculate a sup-Wald statistic. This resamplingis repeated yielding a distribution
of sup-Wald statistics under the null of a linear autoregression.For the bootstrap,
we considered two null models: a linear autoregression in levels and a linear
autoregression with a unit root imposed. Finally, both the Hansen and Bootstrap
methods have the additional advantage over the other tests of permitting inference
when the delay parameter, d, is unknown.
All the tests share a common modelling strategy. First, a tentative AR model of
order p and a set of possible threshold variables (zt-d for various values of d) is
selected. In the Monte Carlo experiment presented below, we take the order of the
autoregression, p, and the delay as known at p = d = 1. The sample is then
arranged according to the value of Zt-d; here we arrangedthe sample from low to
high values of Zt-d. The start-up for the Tsay and CUSUM tests is taken to be 10%
of the sample. For the one-break sup-Wald test, we take the interior 80% of the
arrangedsample to be the set of possible break points. For the two-breaksup-Wald
test, we search from the 5th to 30th percentile of the arranged sample to find the
lower threshold and from the 70th to 95th percentile to find the upper threshold, so
that, at minimum, 40 percent of the sample is in the middle regime.
One complication present in our application is that an interesting null hypothesis
is one in which zt is a linear autoregression with a unit root. All the tests of
nonlinearity discussed above, with the exception of the difference stationary boot-
strap, assume the null hypothesis is a stationarylinear autoregression.To determine
whether there are significant size distortionspresent when the null model contains a
unit root, we conducted a Monte Carlo experiment in which the tests are applied to
a random walk as well as a stationary autoregression.Furthermore,in practice, the
14
In addition to the sup-Wald, Andrews and Ploberger (1994) have proposed averaging Wald
statistics over possible break points (their avg-Wald and exp-Wald statistics). Those could also be
applied to the double-threshold case.
15
Christiano (1992) has applied bootstraps for testing the trend-break hypothesis and Diebold
and Chen (1996) have shown that bootstrap methods can be quite useful for finite sample inference
in structural change problems.
cointegrating vector has to be estimated, so we also examine the size and power
consequences of having to estimate the cointegratingvector.
Table 5 presents the size and power properties of the various tests of nonlinearity,
both for the case where the cointegratingvector is known and the case where it is
estimated. With the possible exception of the one-break sup-Waldtest, there is little
size distortion for a linear unit root model relative to the linear stationary autore-
T = 250
Tsay 4 5 99 95 69 64 52 30 54 56 30
CUSUM 4 4 99 86 54 63 48 28 61 52 28
sup-Wald 6 7 99 99 81 70 59 39 76 65 43
(1-break)
T= 500
Tsay 5 5 100 100 92 92 84 52 76 87 56
CUSUM 4 6 100 99 76 92 75 44 89 81 48
sup-Wald 6 8 100 100 96 96 90 66 97 91 68
(1-break)
T= 250
Tsay 5 6 97 80 43 38 25 13 39 25 15
CUSUM 4 5 95 69 33 39 22 13 42 24 13
sup-Wald 6 6 98 82 46 43 26 16 51 31 17
(1-break)
T = 500
Tsay 4 5 100 97 64 71 43 17 65 53 17
CUSUM 4 6 99 95 53 72 42 16 76 50 16
sup-Wald 4 8 100 99 67 77 48 22 84 58 24
(1-break)
* Based on 1000 replications. For EQ-TAR and Band-TAR, p = 0.4. For RD-TAR,
AI = 1. For
AR, zt = .8zt_ 1 + et. For RW, zt = zt -1 + Et. For all models, Et - N(0, 1).
gression (see the AR and RW columns, respectively). This is very different from
testing for structural change over time where a unit root process can give the
appearance of a structural change in a stationary autoregression and vice versa.
Because the structural change here occurs in the arranged regression or across
threshold values, the linear unit root model is less likely to be confused with
structuralchange.16
Not surprisingly,estimating the cointegratingvector lowers the power of the tests,
but it does not dramaticallyalter the size properties of the tests. In addition, the
power of the tests diminishes substantiallyfor larger threshold values. The reason is
that for a given sample size, the larger the threshold values the smaller the number
of observations in the outer regimes.
Table 6 presents the results of a small Monte Carlo experiment to evaluate the
alternative methods of testing for the two-threshold model. Because the bootstrap
method involves numerous replications for each Monte Carlo replication, the
computational demands of an unconditional search for the two thresholds in the
Monte Carlo experiment was excessive, particularlyin larger samples.17As a result,
for the bootstrap experiment, we iteratively estimate the two thresholds conditional
on the other.18That is, we search for one threshold at a time taking the value of the
other threshold as given. Having found the location of the first threshold, we
searched for the second threshold conditional on the estimated location of the first.
We iterated back and forth until the same two thresholds were continually chosen.
While in general the iterative search will not alwaysyield the same thresholds as an
unconditional search, it appears to be quite cost effective in practice.19The iterative
search was applied both to the pseudo-data in the Monte Carlo experiment and to
the bootstrap replications. For the Hansen procedure, a standard grid search was
conducted.
On the basis of this small Monte Carlo study, it appears that the size and power
of the Hansen and Bootstrap methods are fairly similar. Not surprisingly,both the
Hansen and Bootstrap tests based on the two-threshold alternative have more
power against the threshold models considered here than the Tsay, CUSUM, and
sup-Wald tests based on a single threshold. However, they also seem to have slightly
larger measured size distortion although this may reflect the relativelysmall number
of Monte-Carlo replications. Once again having to estimate the cointegratingvectors
lowers the power of the tests, although it does not appear to dramaticallyaffect the
size of the tests. Finally, the size and the power of the bootstrap test is not markedly
different when the null model is specified as a linear randomwalk as compared to a
linear stationary autoregression.
16 If there is drift in the unit root series, however,size distortionsmay be more likely; the
arrangedsamplemaybe verymuchlike the time orderedsamplebecauseof the trendpresentin the
time series.
17 A singleBootstrapMonte Carloexperimentof 200 replicationswith 100 bootstrapreplications
and a samplesize of 100 involvescalculating1.35x 107 Waldstatistics.
18 We thankMarkWoharfor suggestingthis iterativeprocedure.
19In a limited Monte Carlo experiment,the iterative procedure picked exactly the same
thresholdsas the unconditionalgrid searchbut with substantiallyfewer calculations.
T= 250
Hansen 6 7 100 98 72 58 83 70
Boot-AR 8 6 100 99 69 66 86 79
Boot-RW 5 6 100 100 76 50 84 62
T= 250
Hansen 4 7 95 74 41 25 48 26
Boot-AR 5 5 99 87 44 23 53 35
Boot-RW 1 3 95 77 36 15 40 25
* Based on 200 Monte Carlo replications. For Boot-AR, the null model is a stationary linear AR.
For Boot-RW, the null model is a linear AR with unit root imposed. P-values for bootstrap and
Hansen methods are based on 100 simulations. For EQ-TAR and Band-TAR, p = 0.4. For
RD-TAR, IAI = 1. For AR, zt = .8zt-1 + et. For RW, zt =zt- + et. For all models, et N(0,1).
Given that the null hypothesis of linearity is rejected, we can estimate the
threshold and parameters of the respective autoregression by conditional least
squares. That is, estimate the threshold autoregression for given threshold values
(and delays) and then find the threshold values (and delays) by minimizingthe sum
of squared errors. Chan (1993) shows that for the case of a single threshold, the
estimated threshold value is (super)consistentin that it converges in distribution at
rate T where T represents the sample size. Chan also shows that the asymptotic
distributionof the estimates of the autoregressiveparametersis independent of the
estimated threshold values. Thus, having rejected linearity, asymptotic inference
about the threshold autoregressive parameters can proceed as if the threshold
values were known. These results are likely to hold for multiple threshold case as
well. How inference about the estimated thresholds themselves might be conducted
in practice is still an open question.20
20
Hansen(1995)developsthe asymptoticinferencefor a single-thresholdmodel.
REFERENCES
ANDERSON, H.M., "Transaction Costs and Nonlinear Adjustment Towards Equilibrium in the U.S.
Treasury Bill Market," mimeo, University of Texas, 1994.
ANDREWS, D.W.K. AND W. PLOBERGER,"Optimal Tests When A Nuisance Parameter is Present
Only Under the Alternative," Econometrica 62 (1994), 1383-1414.
ATHREYA, K.B. AND S.G. PANTULA, "Mixing Properties of Harris Chains and Autoregressive
Processes," Joumal of Applied Probability 23 (1986), 880-892.
BALKE, N.S., "Consumption Anomalies: Outliers, Nonlinear Dynamics, and Other 'Noise',"mimeo,
Southern Methodist University, 1992.
BENTOLILA, S. AND G. BERTOLA,"Firing Costs and Labor Demand: How Bad is Eurosclerosis?"
Review of Economic Studies 57 (1990), 381-402.
BERTOLA,G. AND R.J. CABALLERO, "Kinked Adjustment Costs and Aggregate Dynamics," Brookings
Papers on Economic Activity (1990), 237-295.
BIERENS, H., "Nonparametric Cointegration Analysis," forthcoming, Journal of Econometrics (1996).
AND S. Guo, "Testing Stationarity Against the Unit Root Hypothesis," Econometric Reviews
12 (1993), 1-32.
BROCKWELL, P.J., J. Liu, ANDR. TWEEDIE, "On the Existence of Stationary Threshold Autoregres-
sive Moving Average Processes," Journal of Time SeriesAnalysis 13 (1992), 95-107.
CAMPBELL,J.Y., "Does Saving Anticipate Declining Labor Income? An Alternative Test of the
Permanent Income Hypothesis," Econometrica 55 (1987), 1249-1274.
AND R.J. SHILLER,"Cointegration and Tests of Present Value Models," Journal of Political
Economy 95 (1987), 1062-1088.
CHAN, K.S., "Consistency and Limiting Distribution of the Least Squares Estimator of a Threshold
Autoregressive Model," The Annals of Statistics 21 (1993), 520-533.
, J. PETRUCELLI,H. TONG, AND S. WOOLFORD,"A Multiple Threshold Model AR(1) Model,"
Journal of Applied Probability 22 (1985), 267-279.
AND H. TONG, "On the Use of the Deterministic Lyapunov Functions for the Egodicity of
Stochastic Difference Equations," Advances in Applied Probability 17 (1985), 666-678.
CHRISTIANO,L., "Searching for a Break in GNP," Journal of Business and Economic Statistics 10
(1992), 237-249.
CORBAE, D. AND S. OULIARIS, "Cointegration and Tests of Purchasing Power Parity," Review of
Economics and Statistics 70 (1988), 508-511.
DAVIES, R.B., "Hypothesis Testing When a Nuisance Parameter is Present Only Under the
Alternative," Biometrika 64 (1977), 247-254.
- "Hypothesis Testing When a Nuisance Parameter is Present Only Under the Alternative,"
Biometrika 74 (1987), 33-43.
DIEBOLD, F. AND C. CHEN,"Testing Structural Stability with Endogenous Break Point: A Size
Comparison of Analytic and Bootstrap Procedures," Journalof Econometrics 70 (1996), 221-242.
ENGLE, R.F. AND C.W.J. GRANGER, "Co-integration and Error Correction: Representation, Estima-
tion, and Testing," Econometrica 55 (1987), 251-276.
GRANGER, C.W.J., "Developments in the Study of Cointegrated Economic Variables," Oxford
Bulletin of Economics and Statistics 46 (1986), 213-228.
HANSEN, B., "Inference When a Nuisance Parameter Is Not Identified Under the Null Hypothesis,"
Econometrica 64 (1996), 413-430.
, "Endogenous Sample Splitting and the Estimation of Threshold," mimeo, Boston College,
September 1995.
JOHANSEN, S., "Estimation and Hypothesis Testing of Cointegrating Vectors in Gaussian Vector
Autoregressive Models," Econometrica 59 (1991), 1551-1580.
AND K. JUSELIUS, "Maximum Likelihood Estimation and Inference on Cointegration-with
Applications to the Demand for Money," OxfordBulletin of Economics and Statistics 52 (1990),
169-210.
KwIATKOWSKI,D., P.C.B. PHILLIPS, P. SCHMIDT, AND Y. SHIN, "Testing the Null Hypothesis of
Stationarity Against the Alternative of a Unit Root," Journal of Econometrics 54 (1992),
159-178.
MACKINNON, J.G., "Critical Values for Cointegration Tests," in R.F. Engle and C.W.J. Granger,
eds., Long-Run Economic Relationships: Readings in Cointegration (Oxford University Press,
1991, pp. 267-276).
MILLER, M. AND D. ORR, "A Model of the Demand for Money by Firms," QuarterlyJournal of
Economics 100 (1966), 529-539.
PETRUCCELLI,J. AND N. DAVIES, "A Portmanteau Test for Self-Exciting Threshold Autoregressive-
Type Nonlinearity in Time Series," Biometrika 73 (1986), 687-694.
PHILLIPS,P.C.B., "Time Series Regression with a Unit Root," Econometrica 55 (1987), 277-302.
,"Econometric Model Determination," Econometrica 64 (1996), 763-812.
AND P. PERRON, "Testing for Unit Root In Times Series Regression," Biometrika 75 (1988),
335-346.
PIPPENGER, M. AND G. GOERING, "A Note on the Empirical Power of Unit Root Tests under
Threshold Processes," OxfordBulletin of Economics and Statistics 55 (1993), 473-481.
SCARF, H., "The Optimality of (S, s) Policies in the Dynamic Inventory Problem," in K. Arrow, S.
Karlin, and P. Suppes, eds., Mathematical Methods in the Social Sciences (Stanford University
Press, 1960, pp. 196-202).
SHESHINSKI,E. AND Y. WEISS, "Optimum Pricing Policy under Stochastic Inflation," Review of
Economic Studies 43 (1983), 513-529.
STOCK, J., "Asymptotic Properties of Least Squares Estimators of Cointegrating Vectors," Econo-
metrica 55 (1987), 1035-1056.
AND M. WATSON, "A Simple Estimator of Cointegrating Vectors in Higher Order Integrated
Systems," Econometrica 61 (1993), 783-820.
TJ0STHEIM, D., "Non-linear Time Series and Markov Chains," Advances in Applied Probability 22
(1990), 587-611.
TSAY,R.S., "Testing and Modeling Threshold Autoregressive Processes," Journal of the American
StatisticalAssociation 82 (1989), 590-604.