Pu Chen 2022

ed
Vector Error Correction Models with Stationary and

Nonstationary Variables
iew
Pu Chena
a Melbourne Institute of Technology, Australia
ev
Abstract
Vector error correction models (VECM) have become a standard tool in empirical
r
economics for analysing nonstationary time series data because they combine two key
concepts in economics: equilibrium and dynamic adjustment in one single model. The
er
current standard VECM procedure is restricted to time series data with the same
degree of integration, i.e. all I(1) variables. However, time series data with different
degrees of integration are common in empirical studies, necessitating the simultaneous
pe
handling of I(1) and I(0) time series. The standard VECM is extended in this paper to
accommodate mixed I(1) and I(0) variables. The mixed VECM conditions are derived,
and as a result, a test and estimation of the mixed VECM are presented.
Keywords: VECM, Cointegration, Stationary Variables

JEL: classification: E42, E52, E58.
ot
1. Introduction
tn
Vector error correction models (VECM), in which some linear combinations

of the nonstationary variables are stationary cointegration relations reflecting
equilibrium conditions, are the most popular method for modelling nonstation-
rin
ary macroeconomic variables. A VECM is formulated as follows:

p−1
X
′
∆Yt = αβ Yt−1 + Πl ∆Yt−l + Ut (1.1)
l=1
where Yt is an n dimensional vector, α and β are n × h dimensional matrices

ep
with rank h. β ′ Yt−1 = 0 represents h equilibrium relations. A deviation from
Email address: pchen@academic.mit.edu.au (Pu Chen)

Pr
Preprint submitted to Journal of LATEX Templates October 3, 2022
This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
the equilibrium β ′ Yt−1 ̸= 0 will cause the system variable Yt to adjust by ∆Yt .
iew
5 α is the adjustment coefficient that links the deviation from the equilibrium
β ′ Yt−1 ̸= 0 to the system adjustment ∆Yt .
The standard method for building a VECM is as follows: (1) Run unit root
tests for each time series in Yt . (2) If all of the series are I(1), use the Johansen
test to determine the cointegration rank h. (3) Use the Johansen procedure to
ev
10 estimate the VECM. This standard approach works well if all Yt components
are I(1) series. In empirical studies, however, not all components of Yt are
I(1) in all cases; some times Yt may contain both I(1) and I(0) components.
r
In this case, the Johansen procedure cannot be used directly. Cointegration
analysis with I(0) and I(1) variables can be treated by ARDL bound test.1 .
15
er
The bound test, however, is a single equation approach that can only test one
cointegration relation. A two-step heuristic approach to modelling both I(1) and
I(0) in a VECM is presented, for example, in Hamilton [5] p. 652 for a four-
pe
variable system. y1t is stationary. y2t = (y2t , y3t , y4t ) the components are each
individually I(1). One cointegration relation among the three I(1) variables
20 is concluded in the first step of cointegration analysis, and then a VECM is
presented as follows.
ot
         
(1) (1) (2) (2)
y1,t α1 ζ11 ζ 12 y1,t−1 ζ11 ζ 12 y1,t−2
  =  +
(1) (1)
 +
(2) (2)
  + ...
∆y2,t α2 ζ 21 ζ 22 ∆y2,t−1 ζ 21 ζ 22 ∆y2,t−2
tn
      
(p−1) (p−1) (0) (0)
ζ11 ζ 12 y1,t−p+1 ζ1 ϵ1
... +  (p−1) (p−1)
 +
(0)
 y2,t−1 +  (0)  (1.2)
ζ 21 ζ 22 ∆y2,t−p+1 ζ2 ϵ2
 
(0)
ζ1
where the (3 × 4) matrix 
(0)
 is restricted to be αβ ′ where α is (4 × 1) and
ζ2
rin
β is (3 × 1).
We will formalise the procedure proposed in equation (1.2) and provide a
25 system approach to VECM with mixed I(1) and I(0) variables in this paper. It
turns out that a VECM with mixed I(1) and I(0) variables can be thought of
ep
as a cointegrated VECM with a set of restrictions in the cointegration space,
1 See Pesaran et al. [9] for more details

Pr
ed
i.e. it is a special case of the conventional cointegrated VECM. As a result, the
iew
Johansen test can be used to determine the cointegration rank. Furthermore,
30 the Johansen procedure can be used to test for the presence of I(0) components
and to estimate the mixed VECM, as described in the following sections.
2. VECM and the Underlying Process
ev
VECM (1.1) presented in the previous section can be reformulated as a
vector autoregressive (VAR) model in level of Yt .
p
X
Yt = Φl Yt−l + ϵt (2.3)
r
l=1
35
Assumption 2.1. The roots of |In −
circle |z| = 1 or satisfying z = 1.
∞
Pp
er
i=1 Φi z i | = 0 are either outside the unit
pe
Assumption 2.2. The vector error process Ut=1 is IN (0, Ω), Ω is positive def-
inite.
VAR (2.3) with Assumptions 2.1 and 2.2 is the same VAR of equation (2.1)
and Assumption 1 considered in Johansen [7] p.11 and p.14 with suppressed
deterministic components to simplify the presentation. As a result, the Johansen
ot
40
procedure applies to VECM (1.1).

To motivate our approach to a mixed VECM, we first establish a link be-
tn
tween a cointegrated VECM and an underlying stationary VAR process that

will be defined later, and show that any cointegrated VECM can be seen as
45 being generated from an underlying stationary VAR, and any stationary VAR
can generate a cointegrated VECM that meets Assumptions 2.1 and 2.2. This
rin
constructing principle allows us to create a VECM that contains both I(1) and
I(0) components, to which the Johansen procedure applies.
Under Assumption 2.1 it is shown in Johansen [7] p.50 that β ′ Yt and β⊥
′
∆Yt
ep

′ ∆Zt
50 are stationary.2 Defining ∆Zt = β⊥ ∆Yt and Xt = β ′ Yt . We call   an
Xt
2 In Johansen [7] the system variable is denoted Xt , while in this paper Yt is used instead.
Pr
ed
underlying

process

of VECM (1.1) and show in the Appendix that the underlying
∆Zt
iew
process   is a stationary VAR process.
Xt
 
∆Zt ′
Definition 2.3. For a VECM (1.1),   with ∆Zt = β⊥ ∆Yt and Xt =
Xt
β ′ Yt is called an underlying process of VECM (1.1).
55 The following Lemma establishes the connection between a cointegrated
ev
VECM and an underlying VAR.
Lemma 2.4 (VECM and an underlying VAR).

 
r
∆Zt
a) For a VECM (1.1) satisfying Assumption 2.1, the underlying process  
Xt
is a stationary VAR process.
60
er
b) For an n dimensional stationary VAR, denoting the first r components by
− r components
pe
∆Z and the rest n
 
by Xt . Let Yt be a full rank linear
Zt Zt
transformation of  : Y t = B  . If B is unrestricted Yt is a VAR
Xt Xt
process satisfying Assumption 2.1 with r unit roots and n − r cointegration
65 relations and it can be formulated as VECM (1.1).
ot
Corollary 1. The Johansen procedure is applicable to a VECM generated from

an underlying stationary VAR.
According to Lemma 2.4, a cointegrated VECM is always created from an

tn
underlying stationary VAR, with the number of unit roots in the VECM being
the dimension of Zt and the number of cointegration relations being the dimen-
sion of X
 t
. How

can Yt have I(0) components? The transformation between
Zt
rin
Yt and   contains the answer to this question. To demonstrate this we

Xt
decompose B into a 3 × 3 block:
  
B11 B12 B13 Zt
  
Yt =  B21 , (2.4)
  
B22 B23   X1t
ep
  
B31 B32 B33 X2t
where the n × n matrix B is separated into three blocks each with r, n − r − k,

k dimensions with r + h = n. B11 is r × r, B22 is (n − r − k) × (n − r − k),
Pr
ed
70 B33 is k × k, and other blocks are defined accordingly. If B11 , B21 , and B31 are
iew
nonzero, then all components of Yt are I(1) because they are linear combinations
of Zt and Xt and Zt is I(1). If B31 = 0, the last k components of Yt are only
linear combinations of Xt , implying these k components of Yt are I(0). As a
result, we have a VECM of Yt with both I(1) and I(0) components. Clearly,
75 B31 = 0 is both a necessary and sufficient condition for a VECM of Yt to contain
ev
a mix of I(1) and I(0) components. If B21 = 0 and B31 = 0 there will be r I(1)
and h I(0) components but no cointegration relations. The following lemma
summarises the condition for an I(0) and I(1) mixed VECM.
r
Lemma 2.5. B31 = 0 is a necessary and sufficient condition for a VECM of
Yt to contain k I(0) components.
80
er
There is a problem with applying Lemma 2.5 to determine a mixed VECM
because B31 is a parameter of the underlying process but not of the correspond-
pe
ing VECM that will be estimated. To turn the conditions in Lemma 2.5 into
a testable hypothesis we must investigate the implications of B31 = 0 on the
85 parameters of VECM (1.1) and determine what restrictions on the parameters
of VECM (1.1) lead to B31 = 0.
ot
To that end, we establish a relationship between the transformation matrix

B and the parameter in VECM (1.1). We consider an underlying VAR of lag 2
without losing generality but simplifying the presentation.
tn
   (1) (1) (1)

   (2) (2) (2)
   
∆Zt θ11 θ12 θ13 ∆Zt−1 θ11 θ12 θ13 ∆Zt−2 ezt
 X1t  =  θ (1) (1) (1)   X1t−1 + θ (2) (2) (2)
         
θ22 θ23 θ22 θ23   X1t−2 + ex1t 
   21    21    
(1) (1) (1) (2) (2) (2)
X2t θ31 θ32 θ33 X2t−1 θ31 θ32 θ33 X2t−2 ex2t
rin
(2.5)
(2) (2) (2)
To keep the VECM’s lag length at one θ11 , θ21 , and θ31 are assumed to be
zero. Rewriting the underlying VAR (2.5) in error correction form we obtain:
(1) (2) (1) (2) (1) (2) (2)
         
∆Zt 0 θ12 + θ12 θ13 + θ13 Zt−1 θ11 −θ12 −θ13 ∆Zt−1 ezt
         

 = 
  (1) (2) (1) (2) + (1) (2) (2)  +
θ22 + θ22 − I
  
−θ22 −θ23
   
 ∆X1t 0 θ23 + θ23  X1t−1 θ21  ∆X1t−1 ex1t 
ep
         
∆X2t (1) (2) (1) (2) X2t−1 (1) (2) (2) ∆X2t−1 ex2t
0 θ32 + θ32 θ33 + θ33 − I θ31 −θ32 −θ33
(2.6)
Pr
ed

Zt
90 Transforming the underling variable to Yt we have:
Xt
iew
∆Yt
  
B11 B12 B13 ∆Zt−1
  
=
  
 B21 B22 B23  ∆X1t−1 
  
B31 B32 B33 ∆X2t−1
−1
(1) (2) (1) (2)
     
B11 B12 B13 0 θ12 + θ12 θ13 + θ13 B11 B12 B13 B11 B12 B13 Zt−1
      
=
  (1) (2) (1) (2)
θ22 + θ22 − I
    
 B21 B22 B23  0 θ23 + θ23   B21 B22 B23   B21 B22 B23  X1t−1 
      
B31 B32 B33 (1) (2) (1) (2) B B B B31 B32 B33 X2t−1
0 θ32 + θ32 θ33 + θ33 − I 31 32 33
ev
−1 
(1) (2) (2)
      
B11 B12 B13 θ11 −θ12 −θ13 B11 B12 B13 B11 B12 B13 ∆Zt−1 u1t
        
+
  (1) (2) (2)  +
−θ22 −θ23
      
 B21 B22 B23  θ21   B21 B22 B23   B21 B22 B23   ∆X1t−1 u2t 
        
B31 B32 B33 (1) (2) (2) B31 B32 B33 B31 B32 B33 ∆X2t−1 u3t
θ31 −θ32 −θ33
(1) (2) (1) (2)

  
B11 B12 B13 θ12 + θ12 θ13 + θ13  
21 B 22 B 23
 B
  
=
  (1) (2) (1) (2) Y
B21 B22 B23 θ22 + θ22 − I θ23 + θ23 t−1
B 31 B 32 B 33
  
  
B31 B32 B33 (1) (2) (1) (2)
θ33 + θ33 − I
r
θ32 + θ32
(1) (2) (2)

     
B11 B12 B13 θ11 −θ12 −θ13 B 11 B 12 B 13 u1t
     
(1) (2) (2)
+ B 21 B 22 B 23  ∆Yt−1 + 
 
−θ22 −θ23
   
 B21 B22 B23  θ21  u2t 
     
B31 B32 B33 (1) (2) (2) B 31 B 32 B 33 u3t
θ31 −θ32 −θ33
where 
B11 B12 B13
−1
er 
B 11 B 12 B 13

pe
   
=  B 21 B 22 B 23  . (2.7)
   
 B21 B22 B23 
   
B31 B32 B33 B 31 B 32 B 33
We have:
  
(1) (2) (1) (2)
B11 B12 B13 θ12 + θ12 θ13 + θ13
  
  (1) (2) (1) (2)
ot
α =  B21 B23   θ22 + θ22 − I (2.8)

 
B22 θ23 + θ23 
  
(1) (2) (1) (2)
B31 B32 B33 θ32 + θ32 θ33 + θ33 −I
 
B 21 B 22 B 23
tn
′
β =  (2.9)
B 31 B 32 B 33
What are the restrictions on β that imply B31 = 0? It is well known that β
is identified up to a full column rank transformation. According to Lemma 7.1
rin
in Johansen [7], the number of parameters in αβ ′ with unconstrained α and β is

n(n−r)+r(n−r). To find binding restrictions on β we must first make β exactly
95 identifiable otherwise parameter restrictions and identification restrictions will
be mixedup. Let β be identified in the following form.
ep
B 21 I 0
β′ = 31
. By this identification scheme, all elements in α, B 21 ,
B 0 I
and B 31 are free parameters, and the number of these free parameters are n(n −
r) + r(n − r).
Pr
ed
100 Since β has full column rank, this identification can be done by premultiply-
 −1
B 22 B 23
iew
ing β ′ by  
B 32 B 33
Identifying β ′ in this way is equivalent to make a full rank transformation
of the underlying process of Xt :
   −1  
B11 B12 B13 I 0 0 I 0 0 Zt−1
     
Y =
 B21 B22 B23  0
 T22 T23  0
 T22 T23 

 X1t−1 
  (2.10)
B31 B32 B33 0 T32 T33 0 T32 T33 X2t−1
ev
| {z }| {z }
B∗ ∗′ ,X ∗′ )′
(Yt′ ,X1t 2t
such that the inverse of B ∗ has the following form:

   −1
∗ ∗
B11 B12 B13 B 11∗ B 12∗ B 13∗
r
∗ 
∗ ∗
 
 =  B 21∗

B =
 B21 B22 B23   I 0 
 (2.11)
∗ ∗
B31 B32 B33 B 31∗ 0 I

A B
−1
=

A−1 + A−1 B(D − CA−1 B)−1 CA−1
er
What are the constrains on this identified β that implies B31 = 0? Applying
the following formula of partial matrix inverse
A−1 B(D − CA−1 B)−1


(2.12)
pe
  
C D (D − CA−1 B)−1 CA−1 (D − CA−1 B)−1
   
I 0 B 21∗
to equation (2.11) and taking   as the A block, and   as the B
0 I B 31∗
block, we have
  −1
B 21∗
ot
B31 = B 31∗ B 11∗ − [B 12∗ , B 13∗ ]   , (2.13)

B 31∗
and conclude B 31∗ = 0 implies B31 = 0 and vice versa. Inserting this restriction
tn
into β we have:  
B 21∗ I 0
β′ =   (2.14)
0 0 I
rin
If a k dimensional I(0) component exists in Yt , condition (2.14) implies that the

(n−r)-dimensional I(0) space can be decomposed into two orthogonal subspaces:
a k-dimensional I(0) subspace containing the k I(0) variables and an (n − r − k)-
dimensional cointegration space containing linear combinations of the (n − k)
ep
105
I(1) variables. The k I(0) components do not enter the cointegration space and
the cointegrating error terms do not mix with the k I(0) components. We notice
that this result keeps unchanged if the k-dimensional I(0) subspace and the (n−
Pr
ed
r − k)-dimensional cointegrating space are subject to a full rank transformation
iew
110 respectively. We summarise this result in the following lemma.
Lemma 2.6. The necessary and sufficient condition for a VECM


to contain k 
B 21 B 22 0
I(0) variables is the cointegrating matrix β cab be written as: ′
β = 
0 0 B 33
.
ev
The two step method given in equation (1.2) is partly justified by Lemma
115 2.6. The I(0) components space and the cointegration space are orthogonal,
indicating that only the three I(1) components are actually involved in the
r
cointegration relations. The justification is partly because the I(0) components
are not weakly exogenous for the conditional process of the I(1) components
120 er
generally. A valid inference based on a partial system can be made only when
the conditioning variables are weakly exogenous for the parameters of the partial
system.3 The cointegration relation must not affect the dynamic of the I(0)
pe
(0)
components in order for the weak exogeneity to hold,4 which would imply ζ1 =
0 in equation (1.2).
3. Generating and Testing mixed VECMs

ot
125 This section uses two examples to show that 1) a data generating process
that satisfies the condition in Lemma 2.5 B31 = 0 will generate VECM with
mixed I(0) and I(1) variables, and 2) a data generating process that satisfies
tn
 
B 21 B 22 0
the condition in Lemma 2.6 β ′ =   will also generate a VECM
0 0 B 33
with I(0) and I(0) components.
rin
130 3.1. Data Generating Process of Mixed VECMs

Example 3.1.
   (1) (1) (1)
   (2) (2) (2)
   
∆Zt θ θ12 θ13 ∆Zt−1 θ θ12 θ13 ∆Zt−2 ezt
 11   11
 X1t  =  θ (1) (1) (1)   X1t−1 + θ (2) (2) (2)
      
θ22 θ23 θ22 θ23   X1t−2 + ex1t 
   21    21    
(1) (1) (1) (2) (2) (2)
ep
X2t θ31 θ32 θ33 X2t−1 θ31 θ32 θ33 X2t−2 ex2t
(3.15)
3 See Engle et al. [4] for more details.

4 See I. Habro [6] and Chen and Hsiao [3] for more details.
Pr
ed
 
Zt
iew
 
 X1t  .
Yt = B   (3.16)
X2t
θ (1) θ (2)
-0.06 -0.52 -0.05 0.15 -0.05 -0.1 0 0 0.07 0.09 0 0.04
-0.52 0.25 0.28 -0.07 0.04 0.03 0 0 0.01 0.06 -0.04 0.01
-0.05 0.28 -0.33 -0.07 0.29 -0.2 0 0 0.03 0.04 0.08 0.1
ev
0.15 -0.07 -0.07 -0.44 0.15 -0.05 0 0 0.04 0.01 0.04 0.06
-0.05 0.04 0.29 0.15 -0.27 -0.44 0 0 0.08 0.04 0 0.07
-0.1 0.03 -0.2 -0.05 -0.44 0.26 0 0 0.01 0.06 0.07 0.05
B
-0.6 1.5 0.42 0.55 0.35 -1.83
r
0.32 0.15 -0.23 -0.5 -0.66 -0.35
1.87 -2.61 -1.6 -0.37 -0.05 1.22
-1.77 -0.11 0.73 0.2 -0.5 0.34
0
0
0
0
0.3
-0.4
-0.6
0.5
0.37
0.23
er
0.38
-0.79
pe
ot
Figure 3.1 shows one realisation of the simulated data. It is evident from the
tn
graphs in Figure 3.1 that there are 4 I(1) and 2 I(0) variables.
Example 3.2. The is an example in which data are generated from a VECM
with restricted β such that the last two components are I(0).
rin
∆Yt = C + αβ ′ Yt−1 + Ψ∆Yt−1 + ϵt ϵt ∼ IN (0, Ω)
α β Ψ
-0.09 0.54 6.05 14.74 1 1 0 0 -1.11 0.71 -0.54 0.04 7.42 -9.35
0.21 0.55 2.78 7.64 -1.34 -3.8 0 0 -0.52 0.4 -0.26 -0.13 4.55 -5.86
ep
135 0.3 -0.23 -0.11 -1.32 -0.27 0.59 0 0 0.73 -0.28 0.41 -0.33 -4.16 5.16
0.13 0.32 5.32 12.59 -0.29 0.89 0 0 -0.38 0.38 -0.44 -0.12 5.96 -7.33
-0.13 -0.06 0.12 -1.65 0 0 -2.13 -0.21 0.12 -0.13 0.06 0.05 -1.21 1.49
-0.19 0.06 -0.9 -3.39 0 0 1 1 0.26 -0.07 0.22 -0.04 -2.76 3.26
Pr
ed
iew
ev
ts(RR$Y)
150
150
100
100
r
50
Series 1
Series 4
50
0
0
−50
er −50
−100
−100
−150
10
10
5
pe
0
Series 2
Series 5
0
−10
−5
−20
−10
300
15
200
10
ot
100
5
Series 3
Series 6
0
0
−100
−5
−200
−10
−300
−15
tn
−400
0 50 100 150 200 0 50 100 150 200
Time Time
Figure 1: Time series generated from example 3.1

rin
ep
Pr
10
ed
iew
ev
ts(res_d$Y)
1500
1000
1000
r
Series 1
Series 4
500
500
0
er 10
600
400
5
Series 2
Series 5
pe
200
0
−5
0
100 −200
10−10
5
Series 3
Series 6
50
0
ot
−5
0
−10
−50
0 50 100 150 200 0 50 100 150 200

tn
Time Time
Figure 2: Time series generated from example 3.2

rin
.
ep
Pr
11
ed
One realization of the simulated data is depicted in Figure 3.2. The graphs in
iew
Figure 3.2 show clearly that there are 4 I(1) and 2 I(0) variables.
3.2. Testing Mixed VECMs and Parameter Estimation

As discussed in the previous section, VECM with mixed I(0) and I(1) vari-
140 ables can be seen as a special case of a conventional cointegrated VECM where
ev
the cointegrating vectors subject to a set of linear restrictions. Therefore the
standard Johansen procedure is applicable. We can apply the Johansen proce-
dure to determine the cointegration rank, to test the presence of I(0) variables
in the system, and to estimate the parameters of the mixed VECM.
r
The restrictions on the cointegrating vectors β that leads to a mixed VECM
vec(α′ ) = Gψ,
er
are linear restrictions on β. Following Boswijk and Doornik [1] linear restrictions
can be formulated in the following form.
vecβ = Hϕ + h0 , (3.17)
pe
Boswijk and Doornik [1] propose the following iteration process to calculate the
constrained parameter and thus to test the hypothesis related to the constrains.
ψ̃(ϕ, Ω) = [G′ (Ω−1 ⊗ β ′ S11 β)G]−1 (Ω−1 ⊗ β ′ S11 )vec(Πˆ′LS )

ot
h i
ϕ̃(ψ, Ω) = [H ′ (α′ Ω−1 α ⊗ S11 β)H]−1 (α′ Ω−1 ⊗ S11 vec Πˆ′LS − (α ⊗ Ip1 )h0 .

Ω̃(ψ, ϕ) = S00 − S01 βα′ − αβ ′ S10 + αβ ′ S11 βα′ .

tn
145 Starting from a set of initial values (ψ0 , ϕ0 , Ω0 ), the iterations then become
ϕ̃j = ϕ̃(ψj−1 , Ωj−1 ), ψ̃j = ψ̃(ϕj , Ωj−1 ), Ω̃j = Ω̃(ϕj , Ωj ). The iteration proce-
dure stops when the changes in estimated values between two steps are below
rin
a prescribed threshold. S01 , S11 , and S00 are the cross products of the two
auxiliary regressions in the Johansen procedure. We use this procedure to test
150 the presence of a mixed VECM.
According to Boswijk and Doornik [1], the following likelihood ratio is asymp-
ep
totically χ2 with degree of freedom rk

r
X d
log(1 − λ̂i ) −→ χ2 (rk),

LR = T log |Ω̃| − log |S00 | − (3.18)
i=1
Pr
12
ed
where Ω̃ is the constrained estimate obtained from the iteration above and
Pr
iew
log |S00 | − i=1 log(1 − λ̂i ) is the unconstrained estimate from the Johansen
procedure.
Examples (continue)
155 Applying Johansen’s test to the data generated in Example 3.1, we obtain
the following result.
ev
Johansen test of Example 1
teststatistic critical_value
crk <= 0 | 1287.128961 40.19
160 crk <= 1 | 631.081580 34.03
r
crk <= 2 | 145.332672 27.80
crk <= 3 | 113.392495 21.49
crk <= 4 | 8.438725 15.02
165
crk <= 5 | 1.121140 8.19
er
It is to note that the output of Johansen’s test above indicates the dimension
pe
of the I(0) space. It is the sum of the independent cointegration relations and
the number of I(0) components in the system. For data generated from the
parameters in Example 1, we run likelihood ratio tests of one, two three I(0)
components, respectively. The corresponding p-value are 0.43, 0.34, and 0.0.
170 We conclude there are two I(0) components in the system. It follows that there
ot
are two independent cointegration relations among the 4 I(1) variables.

For the data generated from the parameters in Example 3.2, we have similar
tn
results.
Johansen test of Example 2

175 teststatistic critical_value
crk <= 0 | 1022.555928 40.19
rin
crk <= 1 | 227.575481 34.03

crk <= 2 | 67.163218 27.80
crk <= 3 | 46.358663 21.49
180 crk <= 4 | 7.232841 15.02
crk <= 5 | 2.674875 8.19
ep
Johansen’s test results show that the dimension of the I(0) space is 4. The
p-values of the likelihood ratio tests of one, two, and three I(0) components are
0.62, 0.48, and 0.0 respectively. We conclude there are two I(0) components in
Pr
13
ed
185 Yt and two independent cointegration relations among the 4 I(1) variables in
iew
the system.
4. An Illustrative Empirical Application
In this section, we provide an example of how a mixed VECM is used in an

empirical investigation. We are interested in determining how the unemploy-
ev
190 ment rate (Ut ), productivity (ynt ), and capacity utilisation (Vct ) affect the real
wage (rwt ) in Germany.
The information comes from FRED Economic Data and OECD Statistics.
r
The data spans the quarters of 1990 Q1 and 2021 Q2. Figure 3 contains plots
of the time series.
ts(gData)er
pe
5.4
5.3
rw
5.2
5.1
11
9
U_l
ot
8
7
6
5
4.6
tn
4.5
yn
4.4
4.3
4.45
V_c
rin 4.35
4.25
0 20 40 60 80 100 120
Time
ep
Figure 3: Time series used in the empirical example
.
Pr
14
ed
variable Trnasformation Description of the untransformed series
w log(DEULCWRMN01IXOBQ) German wage index, source: FRED
iew
- log(DEUCPIALLMINMEIQ) German CPI, source: FRED
195
U l LMUNRRTTDEM156SQ German unemployment rate: FRED
V c log(BSCURT02DEQ160S) German capacity utilization rate, source: OECD
yn DEURGDPHq Germany, productivity index, source: FRED
Unit root tests of the 4 time series indicate that rw, Ul and yn are I(1) while
ev
Vc is I(0). Based on BIC criterion, we specify a VAR(2) for the 4 variables in
200 level. Johansen’s test gives the following results.
teststatistic critical_value
crk <= 0 | 31.785720 27.80
crk <= 1 | 23.184527 21.49
r
crk <= 2 | 7.320134 15.02
205 crk <= 3 | 1.031860 8.19
er
The output above indicates that I(0) space has a dimension of 2. P-values
of the likelihood ratio tests of one I(0) components is 8.2%. We conclude that
there is one I(0) component and one cointegration relation in the system. The
pe
estimated cointegrating vector β = (1, 0.002, −1.20, 0) can be reformulated as a
long-run real wage equation.
log wt = −0.002Ul,t + 1.20 log ynt . (4.19)

ot
This long-term equation shows that productivity has a positive effect on real
wages while unemployment has a negative effect. Real wages have an elasticity
of 1.2 to productivity, which means real wages will increase 1.2% for every 1%
tn
increase in productivity. The elastic response of the real wage to productive

210 growth implies that the wage share in total output is increasing, which con-
tradicts Bowley’s law.5 In contrast, the wage share is ”relatively stable” over
rin
the long run. Therefore we test the unit elasticity hypothesis. H0 : β3 = 1

H1 : β3 > 1. The likelihood ratio test has a p-value of 4%. The null hypoth-
esis is rejected at the significance level of 5% but not rejected a the level of
ep
215 1%. Because, despite heated debates, Bowley’s is a reliable long-run benchmark
(See Krämer [8] and Carter [2] for more detailed discussions.), we choose the
5 See Carter [2] for more details.

Pr
15
ed
significance level at 1% and do not reject the null hypothesis that the elasticity
iew
coefficient is 1. As a result, we have a long run real wage equation:
log wt = −0.007Ul,t + log ynt . (4.20)
This equation says if the unemployment rate increases by 1%, the real wage
will decrease by 0.7%. A deviation from this long run relation is measure by
ev
220
the error term β ′ Yt−1 which is depicted in the following diagram. The plot in
r
0.88
er
0.86
pe
0.84
Series 1
0.82
ot
0.80
tn
0.78
0 20 40 60 80 100 120
Time
rin
Figure 4: Time series generated from example
Figure 4 shows the real wage can deviate from the long run equilibrium level by
about ±5%. The cointegration relationship also shows that the rate of capacity
ep
utilisation has no long-run effect on the real wage. However, it may have a
225 short-term impact on the real wage. To evaluate the short run dynamics, we
calculate impulse response functions of the mixed VECM shown in Figure 5. In
Pr
16
ed
yn response to yn shock yn response to U_l shock yn response to rw shock yn response to V_c shock
1.00
0.05
0.00 0.75
iew
0.75
−0.05 0.00
0.50 0.50
−0.10
0.25 0.25 −0.05
−0.15
0.00
0.00 −0.10
−0.20
0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15
U_l response to yn shock U_l response to U_l shock U_l response to rw shock U_l response to V_c shock
0 40
−10 3
30 0
−20 0
ev
20
−5
−30 −3 10
−40 0
−6
0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15
rw response to yn shock rw response to U_l shock rw response to rw shock rw response to V_c shock
0.0 1.00
0.4 0.10
r
0.75
0.2 −0.1
0.05
0.50
0.0 −0.2
0.00
0.25
−0.3
−0.2
2
0 5 10 15
V_c response to yn shock
0.0
−0.1
−0.2
0 5 10 15
V_c response to U_l shock

er
0.00
1.0
0.5
0 5 10 15
V_c response to rw shock

−0.05
1.2
0.8
0 5 10 15
V_c response to V_c shock

pe
0.0
1 −0.3 0.4
−0.4 −0.5
0
−0.5 0.0
0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15
Figure 5: Time series generated from example

ot
this set of impulse response functions, the Cholezky decomposition is used in the
order (yn, U l, rw, V c). The shocks are one unit of the corresponding variables. A
tn
positive productivity shock, according to the impulse response function, reduces

230 unemployment rate, raises the real wage, and increases capacity utilisation. A
one standard deviation unemployment shock has no effect on productivity, but
it lowers the real wage and temporarily reduces capacity utilisation.
rin
5. Concluding remarks
In this paper, we extend the conventional cointegrated VECM to mixed

ep
235 cointegrated VECMs that can accommodate I(0) and I(1) variables. A testable
necessary and sufficient condition for a mixed cointegrated VECM is provided.
We show that a mixed cointegrated VECM is a special case of a conventional
Pr
17
ed
cointegrated VECM. As a result, Johansen’s test can be used to test the cointe-
iew
gration rank as well as the constraints on the cointegrating vectors that make a
240 VECM a mixed cointegrated VECM. Practical implication of the results in this
paper is that the exiting econometric software packages such as R, RATS, and
EVIEWS that can be used to test restrictions on β in a VECM can be directly
use to test a mixed VECM and estimate the parameters of the mixed VECM.
ev
6. Appendix
245 Proof of Lemma 2.4 a)
r
To simplify the presentation, the lag length p is set to 2 and the deterministic
components are suppressed in the following proof without loss of generality. This
results in the following VECM
∆Yt = αβ ′ Yt−1 + Θ∆yt−1 + ut

er
pe
′
β⊥
Premultiply the equation above by ′
we obtain:
β
       −1    
′ ′ ′ ′ ′ ′
β⊥ ∆Yt β⊥ β⊥ β⊥ β⊥ ∆Yt−1 β⊥
  =   αβ ′ Yt−1 +  Θ   +  ut
β ′ Yt − β ′ Yt−1 β′ β′ β ′ Yt−1 − β ′ Yt−2
β′ β′
ot
       −1   
β ′ ∆Yt β′ h i 0 0 β′ β ′ ∆Yt−1
 ⊥  =  ⊥  0 α   + Θ ⊥   ⊥ 
β ′ Yt β′ 0 I β′ β ′ Yt−1
   −1    
′ ′ ′
β⊥ β⊥ 0 β⊥
+   Θ     +   ut
tn
β′ β′ β ′ Yt−2 β′
Replacing β⊥ ∆Yt and β ′ Yt by ∆Zt and Xt respectively, we obtain

       −1   
′ ′
∆Zt β⊥ h i 0 0 β⊥ ∆Zt−1
  =   0 α   + Θ   
Xt β′ 0 I β′ Xt−1
rin
   −1     
β′ β′ 0 0 ∆Zt−2 β′
+ ⊥ Θ ⊥     +  ⊥  ut
β′ β′ 0 I Xt−2 β′
The equation above shows the underlying process is a VAR process. Follow-
ing Theorem 4.2 in Johansen [7] p.49 ∆Z = β⊥ ∆Yt and X = β ′ Yt are stationary,
ep
250 hence the underlying process is a stationary VAR process. This proves Lemma
2.4 a).
Pr
18
ed
To prove Lemma 2.4 b) we consider the following stationary VAR. Again,
iew
we choose p = 2 and suppress the deterministic components.
     
∆Zt ∆Zt−1 ∆Zt−2
  = ζ1   + ζ2   + vt , (6.21)
Xt Xt−1 Xt−2
To keep the lag length ofVECM being


1, we

assume in addition the coeffi-
(1) (1) (2)
ev
ζ11 ζ12 0 ζ12
cient of ∆Zt−2 is zero: ζ1 = (1) (1)
 ζ2 = (2)
 Rearrange equation
ζ21 ζ22 0 ζ22
(6.21):
       
(1) (1) (2)
∆Zt 0 ζ12 Zt−1 ζ11 ζ12 ∆Zt−1
= + +vt ,
r
  
(1) (2) (1) (2)
∆Xt 0 ζ22 + ζ22 − I Xt−1 ζ21 −ζ22 ∆Xt−1
 
(6.22)
The stationarity assumption of
characteristic polynomial equation

er
∆Zt
Xt
 implies the roots of the following
pe
I − ζ1 z − ζ2 z 2 = 0 (6.23)
lie outside the unit circle |z| = 1. Using lag operator the stationary VAR (6.21)
can be written as:  
∆Zt
(I − ζ1 L − ζ2 L2 )  =0
ot
Xt
    
∆Zt I−L 0 Zt
Inserting  =   into the equation above we have
Xt 0 I Xt
tn
  
I −L 0 Zt
(I − ζ1 L − ζ2 L2 )   =0 (6.24)
0 I Xt
Obviously
rin
   
I(1 − z) 0 I(1 − z) 0
(I − ζ1 z − ζ2 z 2 )   = (I − ζ1 z − ζ2 z 2 )   =0
0 I 0 I
ep
 
Zt
has only roots outside the unit circle and unit roots. Hence the VAR of  
Xt
satisfies Assumption 2.1. Next we show a full rank transformation of a VAR pro-
cess will not change the roots of the characteristic polynomials of the respective
Pr
19
ed
VAR processes. For a VAR process
iew
p
X
Xt = Φl Xt−l + ϵt (6.25)
l=1
the roots the characteristic polynomial of VAR (6.25) is defined by the following
equation
p
X
|In − Φi z i | = 0 (6.26)
ev
i=1
For a full rank transformation of Wt = BXt we have

p
X
Wt = BΦl B −1 Wt−l + Bϵt (6.27)
r
l=1
and its roots is defined by
|In −
p
X
i=1
er
BΦi B −1 z i | = |B(In −
p
X
i=1
Φi z i )B −1 | = 0
Because B has full rank, the roots of equation (6.28) is identical to the roots
(6.28)
pe
of equation
 
(6.26). Using this results and that Yt is a full rank transformation
Zt
of  , we conclude the VAR of Yt satisfies Assumption 2.1. This proves
Xt
255 Lemma 2.4 b). □
Proof of Lemma 2.5
ot
According to the relation between Yt and the underlying process in equation

(2.4), if B31 = 0, the last k components of Yt are linear combinations of I(0)
tn
variables, these k components are I(0). This proves the sufficiency. If the last
260 k components of Yt are I(0), the last component of Yt is I(0). From equation
(2.4) we have
rin
r
X n−r
X
Yn,t = bn,i Zi,t + bn,j+r Xj,t . (6.29)
i=1 j=1
Pr
Because Zi,t for i = 1, 2, ..., r are independent I(1) variables, i=1 bn,i Zi,t is an
I(1) variable unless bn,i = 0 for i = 1, 2, ..., r. Because Yn,t is I(0) by assumption
ep
it follows bn,i = 0 for i = 1, 2, ..., r, i.e. the last row in B31 is zero. In the same
265 way we can prove each row in B31 is zero. This proves the necessity. □
Proof of Lemma 2.6
Pr
20
ed

B 21 B 22 0
To prove the sufficiency, we can premultiply β ′ = by
0 0 B 33
iew
22 −1
B 0
33
and obtain
0 B
 
′ (B 22 )−1 B 21 I 0
β∗ =   (6.30)
0 0 I
ev
By the
formula of partial
matrix inverse (2.12) and equation (2.13) we know
′ B 21 B 22 0
β = 33
implies B31 = 0.
0 0 B
To prove the necessity, we assume without loss of generality that the last k
components of Yt are I(0). Then a cointegrating matrix β ∗ can be written as:
r
270
 
′ B ∗21 B ∗22 B ∗23
β∗ = 
0
er 0 B 33

because a full rank transformation of the last k I(0) components of Yt are

still I(0), thus the n × k matrix (0, 0, B 33 ) is a cointegrating matrix of Yt ,
(6.31)
pe
containing k independent
I(0) components. Premultiply equation (6.31) by
23∗ 33 −1
I −B (B )
0 I
we obtain  
B 21 B 22 0
β′ =  (6.32)
ot

0 0 B 33
275 This proves the necessity. It is to note that making B 23 = 0 is not the con-
sequence of k I(0) components in Yt but the convention of normalization of
tn
orthogonal cointegration relations and the I(0) components. □
References
rin
[1] Boswijk, H. P. and Doornik, J. A. (2004). Identifying, estimating

280 and testing restricted cointegrated systems: An overview. SEMANTIC
SCHOLAR, Corpus ID:9940494:DOI:10.1111/j.1467–9574.2004.00270.x.
ep
[2] Carter, S. (2007). Real wage productivity elasticity across advanced

economies, 1963-1996. Journal of Post Keynesian Economics, 29(4):573–
600.
Pr
21
ed
285 [3] Chen, P. and Hsiao, C. Y. (2004). Testing weak
iew
exogeneity in cointegrated system. REPEC , page
http://repec.org/esFEAM04/up.27567.1079363411.pdf.
[4] Engle, R., Hendry, D. F., and Richard, J.-F. (1983). Exogeneity.
Econometrica, 51:277–304.
ev
290 [5] Hamilton, J. D. (1994). Time Series Analysis. Princeton, 1st edition.
[6] I. Habro, S. Jahansen, B. N. A. R. (1998). Asymptotic inference on coin-

tegrating rank in partial systems. Journal of Business&Economic Statistics,
r
16:388–399.
295
er
[7] Johansen, S. (1995). Likelihood-Based Inference in Cointegrated Vector
Autoregressive Models. Oxford University Press, 1st edition.
[8] Krämer, H. M. (2011). Bowley’s law: The diffusion of an empirical sup-

pe
position into economic theory. Papers in Political Economy, 61,(2):19–49.
[9] Pesaran, M. H., Shin, Y., and Smith, R. J. (2001). Bounds testing
approaches to the analysis of level relationships. Journal of Econometrics,
16:289–326.
ot
300
tn
rin
ep
Pr
22

Pu Chen 2022

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Pu Chen 2022

Uploaded by

Copyright:

Available Formats

ed

Vector Error Correction Models with Stationary and

Keywords: VECM, Cointegration, Stationary Variables

Vector error correction models (VECM), in which some linear combinations

ary macroeconomic variables. A VECM is formulated as follows:

where Yt is an n dimensional vector, α and β are n × h dimensional matrices

with rank h. β ′ Yt−1 = 0 represents h equilibrium relations. A deviation from

Email address: pchen@academic.mit.edu.au (Pu Chen)

Preprint submitted to Journal of LATEX Templates October 3, 2022

as a cointegrated VECM with a set of restrictions in the cointegration space,

1 See Pesaran et al. [9] for more details

2. VECM and the Underlying Process

procedure applies to VECM (1.1).

tween a cointegrated VECM and an underlying stationary VAR process that

55 The following Lemma establishes the connection between a cointegrated

Lemma 2.4 (VECM and an underlying VAR).

Corollary 1. The Johansen procedure is applicable to a VECM generated from

According to Lemma 2.4, a cointegrated VECM is always created from an

Yt and   contains the answer to this question. To demonstrate this we

where the n × n matrix B is separated into three blocks each with r, n − r − k,

To that end, we establish a relationship between the transformation matrix

   (1) (1) (1)

(1) (2) (1) (2)

(1) (2) (2)

α =  B21 B23   θ22 + θ22 − I (2.8)

in Johansen [7], the number of parameters in αβ ′ with unconstrained α and β is

such that the inverse of B ∗ has the following form:

A−1 B(D − CA−1 B)−1

B31 = B 31∗ B 11∗ − [B 12∗ , B 13∗ ]   , (2.13)

If a k dimensional I(0) component exists in Yt , condition (2.14) implies that the

Lemma 2.6. The necessary and sufficient condition for a VECM

3. Generating and Testing mixed VECMs

130 3.1. Data Generating Process of Mixed VECMs

3 See Engle et al. [4] for more details.

∆Yt = C + αβ ′ Yt−1 + Ψ∆Yt−1 + ϵt ϵt ∼ IN (0, Ω)

0 50 100 150 200 0 50 100 150 200

Figure 1: Time series generated from example 3.1

0 50 100 150 200 0 50 100 150 200

Figure 2: Time series generated from example 3.2

3.2. Testing Mixed VECMs and Parameter Estimation

ψ̃(ϕ, Ω) = [G′ (Ω−1 ⊗ β ′ S11 β)G]−1 (Ω−1 ⊗ β ′ S11 )vec(Πˆ′LS )

Ω̃(ψ, ϕ) = S00 − S01 βα′ − αβ ′ S10 + αβ ′ S11 βα′ .

totically χ2 with degree of freedom rk

are two independent cointegration relations among the 4 I(1) variables.

Johansen test of Example 2

crk <= 1 | 227.575481 34.03

4. An Illustrative Empirical Application

In this section, we provide an example of how a mixed VECM is used in an

Figure 3: Time series used in the empirical example

log wt = −0.002Ul,t + 1.20 log ynt . (4.19)

increase in productivity. The elastic response of the real wage to productive

the long run. Therefore we test the unit elasticity hypothesis. H0 : β3 = 1

5 See Carter [2] for more details.

log wt = −0.007Ul,t + log ynt . (4.20)

Figure 4: Time series generated from example

V_c response to yn shock

V_c response to U_l shock

V_c response to rw shock

V_c response to V_c shock

Figure 5: Time series generated from example

positive productivity shock, according to the impulse response function, reduces

In this paper, we extend the conventional cointegrated VECM to mixed

245 Proof of Lemma 2.4 a)