You are on page 1of 22

ed

Vector Error Correction Models with Stationary and


Nonstationary Variables

iew
Pu Chena
a Melbourne Institute of Technology, Australia

ev
Abstract

Vector error correction models (VECM) have become a standard tool in empirical

r
economics for analysing nonstationary time series data because they combine two key
concepts in economics: equilibrium and dynamic adjustment in one single model. The

er
current standard VECM procedure is restricted to time series data with the same
degree of integration, i.e. all I(1) variables. However, time series data with different
degrees of integration are common in empirical studies, necessitating the simultaneous
pe
handling of I(1) and I(0) time series. The standard VECM is extended in this paper to
accommodate mixed I(1) and I(0) variables. The mixed VECM conditions are derived,
and as a result, a test and estimation of the mixed VECM are presented.

Keywords: VECM, Cointegration, Stationary Variables


JEL: classification: E42, E52, E58.
ot

1. Introduction
tn

Vector error correction models (VECM), in which some linear combinations


of the nonstationary variables are stationary cointegration relations reflecting
equilibrium conditions, are the most popular method for modelling nonstation-
rin

ary macroeconomic variables. A VECM is formulated as follows:


p−1
X

∆Yt = αβ Yt−1 + Πl ∆Yt−l + Ut (1.1)
l=1

where Yt is an n dimensional vector, α and β are n × h dimensional matrices


ep

with rank h. β ′ Yt−1 = 0 represents h equilibrium relations. A deviation from

Email address: pchen@academic.mit.edu.au (Pu Chen)


Pr

Preprint submitted to Journal of LATEX Templates October 3, 2022

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
the equilibrium β ′ Yt−1 ̸= 0 will cause the system variable Yt to adjust by ∆Yt .

iew
5 α is the adjustment coefficient that links the deviation from the equilibrium
β ′ Yt−1 ̸= 0 to the system adjustment ∆Yt .
The standard method for building a VECM is as follows: (1) Run unit root
tests for each time series in Yt . (2) If all of the series are I(1), use the Johansen
test to determine the cointegration rank h. (3) Use the Johansen procedure to

ev
10 estimate the VECM. This standard approach works well if all Yt components
are I(1) series. In empirical studies, however, not all components of Yt are
I(1) in all cases; some times Yt may contain both I(1) and I(0) components.

r
In this case, the Johansen procedure cannot be used directly. Cointegration
analysis with I(0) and I(1) variables can be treated by ARDL bound test.1 .
15

er
The bound test, however, is a single equation approach that can only test one
cointegration relation. A two-step heuristic approach to modelling both I(1) and
I(0) in a VECM is presented, for example, in Hamilton [5] p. 652 for a four-
pe
variable system. y1t is stationary. y2t = (y2t , y3t , y4t ) the components are each
individually I(1). One cointegration relation among the three I(1) variables
20 is concluded in the first step of cointegration analysis, and then a VECM is
presented as follows.
ot

         
(1) (1) (2) (2)
y1,t α1 ζ11 ζ 12 y1,t−1 ζ11 ζ 12 y1,t−2
  =  +
(1) (1)
 +
(2) (2)
  + ...
∆y2,t α2 ζ 21 ζ 22 ∆y2,t−1 ζ 21 ζ 22 ∆y2,t−2
tn

      
(p−1) (p−1) (0) (0)
ζ11 ζ 12 y1,t−p+1 ζ1 ϵ1
... +  (p−1) (p−1)
 +
(0)
 y2,t−1 +  (0)  (1.2)
ζ 21 ζ 22 ∆y2,t−p+1 ζ2 ϵ2

 
(0)
ζ1
where the (3 × 4) matrix 
(0)
 is restricted to be αβ ′ where α is (4 × 1) and
ζ2
rin

β is (3 × 1).
We will formalise the procedure proposed in equation (1.2) and provide a
25 system approach to VECM with mixed I(1) and I(0) variables in this paper. It
turns out that a VECM with mixed I(1) and I(0) variables can be thought of
ep

as a cointegrated VECM with a set of restrictions in the cointegration space,

1 See Pesaran et al. [9] for more details


Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
i.e. it is a special case of the conventional cointegrated VECM. As a result, the

iew
Johansen test can be used to determine the cointegration rank. Furthermore,
30 the Johansen procedure can be used to test for the presence of I(0) components
and to estimate the mixed VECM, as described in the following sections.

2. VECM and the Underlying Process

ev
VECM (1.1) presented in the previous section can be reformulated as a
vector autoregressive (VAR) model in level of Yt .
p
X
Yt = Φl Yt−l + ϵt (2.3)

r
l=1

35
Assumption 2.1. The roots of |In −
circle |z| = 1 or satisfying z = 1.

Pp

er
i=1 Φi z i | = 0 are either outside the unit
pe
Assumption 2.2. The vector error process Ut=1 is IN (0, Ω), Ω is positive def-
inite.

VAR (2.3) with Assumptions 2.1 and 2.2 is the same VAR of equation (2.1)
and Assumption 1 considered in Johansen [7] p.11 and p.14 with suppressed
deterministic components to simplify the presentation. As a result, the Johansen
ot

40

procedure applies to VECM (1.1).


To motivate our approach to a mixed VECM, we first establish a link be-
tn

tween a cointegrated VECM and an underlying stationary VAR process that


will be defined later, and show that any cointegrated VECM can be seen as
45 being generated from an underlying stationary VAR, and any stationary VAR
can generate a cointegrated VECM that meets Assumptions 2.1 and 2.2. This
rin

constructing principle allows us to create a VECM that contains both I(1) and
I(0) components, to which the Johansen procedure applies.
Under Assumption 2.1 it is shown in Johansen [7] p.50 that β ′ Yt and β⊥

∆Yt
ep


′ ∆Zt
50 are stationary.2 Defining ∆Zt = β⊥ ∆Yt and Xt = β ′ Yt . We call   an
Xt

2 In Johansen [7] the system variable is denoted Xt , while in this paper Yt is used instead.
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
underlying

process

of VECM (1.1) and show in the Appendix that the underlying
∆Zt

iew
process   is a stationary VAR process.
Xt
 
∆Zt ′
Definition 2.3. For a VECM (1.1),   with ∆Zt = β⊥ ∆Yt and Xt =
Xt
β ′ Yt is called an underlying process of VECM (1.1).

55 The following Lemma establishes the connection between a cointegrated

ev
VECM and an underlying VAR.

Lemma 2.4 (VECM and an underlying VAR).


 

r
∆Zt
a) For a VECM (1.1) satisfying Assumption 2.1, the underlying process  
Xt
is a stationary VAR process.
60

er
b) For an n dimensional stationary VAR, denoting the first r components by
− r components
pe
∆Z and the rest n
 
by Xt . Let Yt be a full rank linear
Zt Zt
transformation of  : Y t = B  . If B is unrestricted Yt is a VAR
Xt Xt
process satisfying Assumption 2.1 with r unit roots and n − r cointegration
65 relations and it can be formulated as VECM (1.1).
ot

Corollary 1. The Johansen procedure is applicable to a VECM generated from


an underlying stationary VAR.

According to Lemma 2.4, a cointegrated VECM is always created from an


tn

underlying stationary VAR, with the number of unit roots in the VECM being
the dimension of Zt and the number of cointegration relations being the dimen-
sion of X
 t
. How

can Yt have I(0) components? The transformation between
Zt
rin

Yt and   contains the answer to this question. To demonstrate this we


Xt
decompose B into a 3 × 3 block:
  
B11 B12 B13 Zt
  
Yt =  B21 , (2.4)
  
B22 B23   X1t
ep

  
B31 B32 B33 X2t

where the n × n matrix B is separated into three blocks each with r, n − r − k,


k dimensions with r + h = n. B11 is r × r, B22 is (n − r − k) × (n − r − k),
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
70 B33 is k × k, and other blocks are defined accordingly. If B11 , B21 , and B31 are

iew
nonzero, then all components of Yt are I(1) because they are linear combinations
of Zt and Xt and Zt is I(1). If B31 = 0, the last k components of Yt are only
linear combinations of Xt , implying these k components of Yt are I(0). As a
result, we have a VECM of Yt with both I(1) and I(0) components. Clearly,
75 B31 = 0 is both a necessary and sufficient condition for a VECM of Yt to contain

ev
a mix of I(1) and I(0) components. If B21 = 0 and B31 = 0 there will be r I(1)
and h I(0) components but no cointegration relations. The following lemma
summarises the condition for an I(0) and I(1) mixed VECM.

r
Lemma 2.5. B31 = 0 is a necessary and sufficient condition for a VECM of
Yt to contain k I(0) components.
80

er
There is a problem with applying Lemma 2.5 to determine a mixed VECM
because B31 is a parameter of the underlying process but not of the correspond-
pe
ing VECM that will be estimated. To turn the conditions in Lemma 2.5 into
a testable hypothesis we must investigate the implications of B31 = 0 on the
85 parameters of VECM (1.1) and determine what restrictions on the parameters
of VECM (1.1) lead to B31 = 0.
ot

To that end, we establish a relationship between the transformation matrix


B and the parameter in VECM (1.1). We consider an underlying VAR of lag 2
without losing generality but simplifying the presentation.
tn

   (1) (1) (1)


   (2) (2) (2)
   
∆Zt θ11 θ12 θ13 ∆Zt−1 θ11 θ12 θ13 ∆Zt−2 ezt
 X1t  =  θ (1) (1) (1)   X1t−1 + θ (2) (2) (2)
         
θ22 θ23 θ22 θ23   X1t−2 + ex1t 
   21    21    
(1) (1) (1) (2) (2) (2)
X2t θ31 θ32 θ33 X2t−1 θ31 θ32 θ33 X2t−2 ex2t
rin

(2.5)
(2) (2) (2)
To keep the VECM’s lag length at one θ11 , θ21 , and θ31 are assumed to be
zero. Rewriting the underlying VAR (2.5) in error correction form we obtain:
(1) (2) (1) (2) (1) (2) (2)
         
∆Zt 0 θ12 + θ12 θ13 + θ13 Zt−1 θ11 −θ12 −θ13 ∆Zt−1 ezt
         

 = 
  (1) (2) (1) (2) + (1) (2) (2)  +
θ22 + θ22 − I
  
−θ22 −θ23
   
 ∆X1t 0 θ23 + θ23  X1t−1 θ21  ∆X1t−1 ex1t 
ep

         
∆X2t (1) (2) (1) (2) X2t−1 (1) (2) (2) ∆X2t−1 ex2t
0 θ32 + θ32 θ33 + θ33 − I θ31 −θ32 −θ33
(2.6)
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
 
Zt
90 Transforming the underling variable to Yt we have:
Xt

iew
∆Yt
  
B11 B12 B13 ∆Zt−1
  
=
  
 B21 B22 B23  ∆X1t−1 
  
B31 B32 B33 ∆X2t−1
−1
(1) (2) (1) (2)
     
B11 B12 B13 0 θ12 + θ12 θ13 + θ13 B11 B12 B13 B11 B12 B13 Zt−1
      
=
  (1) (2) (1) (2)
θ22 + θ22 − I
    
 B21 B22 B23  0 θ23 + θ23   B21 B22 B23   B21 B22 B23  X1t−1 
      
B31 B32 B33 (1) (2) (1) (2) B B B B31 B32 B33 X2t−1
0 θ32 + θ32 θ33 + θ33 − I 31 32 33

ev
−1 
(1) (2) (2)
      
B11 B12 B13 θ11 −θ12 −θ13 B11 B12 B13 B11 B12 B13 ∆Zt−1 u1t
        
+
  (1) (2) (2)  +
−θ22 −θ23
      
 B21 B22 B23  θ21   B21 B22 B23   B21 B22 B23   ∆X1t−1 u2t 
        
B31 B32 B33 (1) (2) (2) B31 B32 B33 B31 B32 B33 ∆X2t−1 u3t
θ31 −θ32 −θ33

(1) (2) (1) (2)


  
B11 B12 B13 θ12 + θ12 θ13 + θ13  
21 B 22 B 23
 B
  
=
  (1) (2) (1) (2) Y
B21 B22 B23 θ22 + θ22 − I θ23 + θ23 t−1
B 31 B 32 B 33
  
  
B31 B32 B33 (1) (2) (1) (2)
θ33 + θ33 − I

r
θ32 + θ32

(1) (2) (2)


     
B11 B12 B13 θ11 −θ12 −θ13 B 11 B 12 B 13 u1t
     
(1) (2) (2)
+ B 21 B 22 B 23  ∆Yt−1 + 
 
−θ22 −θ23
   
 B21 B22 B23  θ21  u2t 
     
B31 B32 B33 (1) (2) (2) B 31 B 32 B 33 u3t
θ31 −θ32 −θ33

where 
B11 B12 B13
−1
er 
B 11 B 12 B 13

pe
   
=  B 21 B 22 B 23  . (2.7)
   
 B21 B22 B23 
   
B31 B32 B33 B 31 B 32 B 33
We have:
  
(1) (2) (1) (2)
B11 B12 B13 θ12 + θ12 θ13 + θ13
  
  (1) (2) (1) (2)
ot

α =  B21 B23   θ22 + θ22 − I (2.8)


 
B22 θ23 + θ23 
  
(1) (2) (1) (2)
B31 B32 B33 θ32 + θ32 θ33 + θ33 −I
 
B 21 B 22 B 23
tn


β =  (2.9)
B 31 B 32 B 33
What are the restrictions on β that imply B31 = 0? It is well known that β
is identified up to a full column rank transformation. According to Lemma 7.1
rin

in Johansen [7], the number of parameters in αβ ′ with unconstrained α and β is


n(n−r)+r(n−r). To find binding restrictions on β we must first make β exactly
95 identifiable otherwise parameter restrictions and identification restrictions will
be mixedup. Let β be identified in the following form.
ep

B 21 I 0
β′ = 31
. By this identification scheme, all elements in α, B 21 ,
B 0 I
and B 31 are free parameters, and the number of these free parameters are n(n −
r) + r(n − r).
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
100 Since β has full column rank, this identification can be done by premultiply-
 −1
B 22 B 23

iew
ing β ′ by  
B 32 B 33
Identifying β ′ in this way is equivalent to make a full rank transformation
of the underlying process of Xt :
   −1  
B11 B12 B13 I 0 0 I 0 0 Zt−1
     
Y =
 B21 B22 B23  0
 T22 T23  0
 T22 T23 

 X1t−1 
  (2.10)
B31 B32 B33 0 T32 T33 0 T32 T33 X2t−1

ev
| {z }| {z }
B∗ ∗′ ,X ∗′ )′
(Yt′ ,X1t 2t

such that the inverse of B ∗ has the following form:


   −1
∗ ∗
B11 B12 B13 B 11∗ B 12∗ B 13∗

r
∗ 
∗ ∗
 
 =  B 21∗

B =
 B21 B22 B23   I 0 
 (2.11)
∗ ∗
B31 B32 B33 B 31∗ 0 I


A B
−1

=

A−1 + A−1 B(D − CA−1 B)−1 CA−1
er
What are the constrains on this identified β that implies B31 = 0? Applying
the following formula of partial matrix inverse

A−1 B(D − CA−1 B)−1


(2.12)
pe
  
C D (D − CA−1 B)−1 CA−1 (D − CA−1 B)−1

   
I 0 B 21∗
to equation (2.11) and taking   as the A block, and   as the B
0 I B 31∗
block, we have
  −1
B 21∗
ot

B31 = B 31∗ B 11∗ − [B 12∗ , B 13∗ ]   , (2.13)


B 31∗

and conclude B 31∗ = 0 implies B31 = 0 and vice versa. Inserting this restriction
tn

into β we have:  
B 21∗ I 0
β′ =   (2.14)
0 0 I
rin

If a k dimensional I(0) component exists in Yt , condition (2.14) implies that the


(n−r)-dimensional I(0) space can be decomposed into two orthogonal subspaces:
a k-dimensional I(0) subspace containing the k I(0) variables and an (n − r − k)-
dimensional cointegration space containing linear combinations of the (n − k)
ep

105

I(1) variables. The k I(0) components do not enter the cointegration space and
the cointegrating error terms do not mix with the k I(0) components. We notice
that this result keeps unchanged if the k-dimensional I(0) subspace and the (n−
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
r − k)-dimensional cointegrating space are subject to a full rank transformation

iew
110 respectively. We summarise this result in the following lemma.

Lemma 2.6. The necessary and sufficient condition for a VECM



to contain k 
B 21 B 22 0
I(0) variables is the cointegrating matrix β cab be written as: ′
β = 
0 0 B 33
.

ev
The two step method given in equation (1.2) is partly justified by Lemma
115 2.6. The I(0) components space and the cointegration space are orthogonal,
indicating that only the three I(1) components are actually involved in the

r
cointegration relations. The justification is partly because the I(0) components
are not weakly exogenous for the conditional process of the I(1) components

120 er
generally. A valid inference based on a partial system can be made only when
the conditioning variables are weakly exogenous for the parameters of the partial
system.3 The cointegration relation must not affect the dynamic of the I(0)
pe
(0)
components in order for the weak exogeneity to hold,4 which would imply ζ1 =
0 in equation (1.2).

3. Generating and Testing mixed VECMs


ot

125 This section uses two examples to show that 1) a data generating process
that satisfies the condition in Lemma 2.5 B31 = 0 will generate VECM with
mixed I(0) and I(1) variables, and 2) a data generating process that satisfies
tn

 
B 21 B 22 0
the condition in Lemma 2.6 β ′ =   will also generate a VECM
0 0 B 33
with I(0) and I(0) components.
rin

130 3.1. Data Generating Process of Mixed VECMs


Example 3.1.
   (1) (1) (1)
   (2) (2) (2)
   
∆Zt θ θ12 θ13 ∆Zt−1 θ θ12 θ13 ∆Zt−2 ezt
 11   11
 X1t  =  θ (1) (1) (1)   X1t−1 + θ (2) (2) (2)
      
θ22 θ23 θ22 θ23   X1t−2 + ex1t 
   21    21    
(1) (1) (1) (2) (2) (2)
ep

X2t θ31 θ32 θ33 X2t−1 θ31 θ32 θ33 X2t−2 ex2t
(3.15)

3 See Engle et al. [4] for more details.


4 See I. Habro [6] and Chen and Hsiao [3] for more details.
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
 
Zt

iew
 
 X1t  .
Yt = B   (3.16)
X2t

θ (1) θ (2)
-0.06 -0.52 -0.05 0.15 -0.05 -0.1 0 0 0.07 0.09 0 0.04
-0.52 0.25 0.28 -0.07 0.04 0.03 0 0 0.01 0.06 -0.04 0.01
-0.05 0.28 -0.33 -0.07 0.29 -0.2 0 0 0.03 0.04 0.08 0.1

ev
0.15 -0.07 -0.07 -0.44 0.15 -0.05 0 0 0.04 0.01 0.04 0.06
-0.05 0.04 0.29 0.15 -0.27 -0.44 0 0 0.08 0.04 0 0.07
-0.1 0.03 -0.2 -0.05 -0.44 0.26 0 0 0.01 0.06 0.07 0.05
B
-0.6 1.5 0.42 0.55 0.35 -1.83

r
0.32 0.15 -0.23 -0.5 -0.66 -0.35
1.87 -2.61 -1.6 -0.37 -0.05 1.22
-1.77 -0.11 0.73 0.2 -0.5 0.34
0
0
0
0
0.3
-0.4
-0.6
0.5
0.37
0.23
er
0.38
-0.79
pe
ot

Figure 3.1 shows one realisation of the simulated data. It is evident from the
tn

graphs in Figure 3.1 that there are 4 I(1) and 2 I(0) variables.

Example 3.2. The is an example in which data are generated from a VECM
with restricted β such that the last two components are I(0).
rin

∆Yt = C + αβ ′ Yt−1 + Ψ∆Yt−1 + ϵt ϵt ∼ IN (0, Ω)

α β Ψ
-0.09 0.54 6.05 14.74 1 1 0 0 -1.11 0.71 -0.54 0.04 7.42 -9.35
0.21 0.55 2.78 7.64 -1.34 -3.8 0 0 -0.52 0.4 -0.26 -0.13 4.55 -5.86
ep

135 0.3 -0.23 -0.11 -1.32 -0.27 0.59 0 0 0.73 -0.28 0.41 -0.33 -4.16 5.16
0.13 0.32 5.32 12.59 -0.29 0.89 0 0 -0.38 0.38 -0.44 -0.12 5.96 -7.33
-0.13 -0.06 0.12 -1.65 0 0 -2.13 -0.21 0.12 -0.13 0.06 0.05 -1.21 1.49
-0.19 0.06 -0.9 -3.39 0 0 1 1 0.26 -0.07 0.22 -0.04 -2.76 3.26
Pr

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
iew
ev
ts(RR$Y)
150

150
100

100

r
50
Series 1

Series 4
50
0

0
−50

er −50
−100

−100
−150

10
10

5
pe
0
Series 2

Series 5
0
−10

−5
−20

−10
300

15
200

10
ot
100

5
Series 3

Series 6
0

0
−100

−5
−200

−10
−300

−15
tn
−400

0 50 100 150 200 0 50 100 150 200

Time Time

Figure 1: Time series generated from example 3.1


rin
ep
Pr

10

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
iew
ev
ts(res_d$Y)

1500
1000

1000

r
Series 1

Series 4
500

500
0

er 10
600
400

5
Series 2

Series 5
pe
200

0
−5
0
100 −200

10−10
5
Series 3

Series 6
50

0
ot
−5
0

−10
−50

0 50 100 150 200 0 50 100 150 200


tn

Time Time

Figure 2: Time series generated from example 3.2


rin

.
ep
Pr

11

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
One realization of the simulated data is depicted in Figure 3.2. The graphs in

iew
Figure 3.2 show clearly that there are 4 I(1) and 2 I(0) variables.

3.2. Testing Mixed VECMs and Parameter Estimation


As discussed in the previous section, VECM with mixed I(0) and I(1) vari-
140 ables can be seen as a special case of a conventional cointegrated VECM where

ev
the cointegrating vectors subject to a set of linear restrictions. Therefore the
standard Johansen procedure is applicable. We can apply the Johansen proce-
dure to determine the cointegration rank, to test the presence of I(0) variables
in the system, and to estimate the parameters of the mixed VECM.

r
The restrictions on the cointegrating vectors β that leads to a mixed VECM

vec(α′ ) = Gψ,
er
are linear restrictions on β. Following Boswijk and Doornik [1] linear restrictions
can be formulated in the following form.

vecβ = Hϕ + h0 , (3.17)
pe
Boswijk and Doornik [1] propose the following iteration process to calculate the
constrained parameter and thus to test the hypothesis related to the constrains.

ψ̃(ϕ, Ω) = [G′ (Ω−1 ⊗ β ′ S11 β)G]−1 (Ω−1 ⊗ β ′ S11 )vec(Πˆ′LS )


ot

h i
ϕ̃(ψ, Ω) = [H ′ (α′ Ω−1 α ⊗ S11 β)H]−1 (α′ Ω−1 ⊗ S11 vec Πˆ′LS − (α ⊗ Ip1 )h0 .


Ω̃(ψ, ϕ) = S00 − S01 βα′ − αβ ′ S10 + αβ ′ S11 βα′ .


tn

145 Starting from a set of initial values (ψ0 , ϕ0 , Ω0 ), the iterations then become
ϕ̃j = ϕ̃(ψj−1 , Ωj−1 ), ψ̃j = ψ̃(ϕj , Ωj−1 ), Ω̃j = Ω̃(ϕj , Ωj ). The iteration proce-
dure stops when the changes in estimated values between two steps are below
rin

a prescribed threshold. S01 , S11 , and S00 are the cross products of the two
auxiliary regressions in the Johansen procedure. We use this procedure to test
150 the presence of a mixed VECM.
According to Boswijk and Doornik [1], the following likelihood ratio is asymp-
ep

totically χ2 with degree of freedom rk


r
X  d
log(1 − λ̂i ) −→ χ2 (rk),

LR = T log |Ω̃| − log |S00 | − (3.18)
i=1
Pr

12

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
where Ω̃ is the constrained estimate obtained from the iteration above and
Pr

iew
log |S00 | − i=1 log(1 − λ̂i ) is the unconstrained estimate from the Johansen
procedure.
Examples (continue)
155 Applying Johansen’s test to the data generated in Example 3.1, we obtain
the following result.

ev
Johansen test of Example 1
teststatistic critical_value
crk <= 0 | 1287.128961 40.19
160 crk <= 1 | 631.081580 34.03

r
crk <= 2 | 145.332672 27.80
crk <= 3 | 113.392495 21.49
crk <= 4 | 8.438725 15.02

165
crk <= 5 | 1.121140 8.19

er
It is to note that the output of Johansen’s test above indicates the dimension
pe
of the I(0) space. It is the sum of the independent cointegration relations and
the number of I(0) components in the system. For data generated from the
parameters in Example 1, we run likelihood ratio tests of one, two three I(0)
components, respectively. The corresponding p-value are 0.43, 0.34, and 0.0.
170 We conclude there are two I(0) components in the system. It follows that there
ot

are two independent cointegration relations among the 4 I(1) variables.


For the data generated from the parameters in Example 3.2, we have similar
tn

results.

Johansen test of Example 2


175 teststatistic critical_value
crk <= 0 | 1022.555928 40.19
rin

crk <= 1 | 227.575481 34.03


crk <= 2 | 67.163218 27.80
crk <= 3 | 46.358663 21.49
180 crk <= 4 | 7.232841 15.02
crk <= 5 | 2.674875 8.19
ep

Johansen’s test results show that the dimension of the I(0) space is 4. The
p-values of the likelihood ratio tests of one, two, and three I(0) components are
0.62, 0.48, and 0.0 respectively. We conclude there are two I(0) components in
Pr

13

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
185 Yt and two independent cointegration relations among the 4 I(1) variables in

iew
the system.

4. An Illustrative Empirical Application

In this section, we provide an example of how a mixed VECM is used in an


empirical investigation. We are interested in determining how the unemploy-

ev
190 ment rate (Ut ), productivity (ynt ), and capacity utilisation (Vct ) affect the real
wage (rwt ) in Germany.
The information comes from FRED Economic Data and OECD Statistics.

r
The data spans the quarters of 1990 Q1 and 2021 Q2. Figure 3 contains plots
of the time series.

ts(gData)er
pe
5.4
5.3
rw
5.2
5.1
11
9
U_l

ot
8
7
6
5
4.6

tn
4.5
yn
4.4
4.3
4.45
V_c

rin 4.35
4.25

0 20 40 60 80 100 120

Time
ep

Figure 3: Time series used in the empirical example

.
Pr

14

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
variable Trnasformation Description of the untransformed series
w log(DEULCWRMN01IXOBQ) German wage index, source: FRED

iew
- log(DEUCPIALLMINMEIQ) German CPI, source: FRED
195
U l LMUNRRTTDEM156SQ German unemployment rate: FRED
V c log(BSCURT02DEQ160S) German capacity utilization rate, source: OECD
yn DEURGDPHq Germany, productivity index, source: FRED

Unit root tests of the 4 time series indicate that rw, Ul and yn are I(1) while

ev
Vc is I(0). Based on BIC criterion, we specify a VAR(2) for the 4 variables in
200 level. Johansen’s test gives the following results.
teststatistic critical_value
crk <= 0 | 31.785720 27.80
crk <= 1 | 23.184527 21.49

r
crk <= 2 | 7.320134 15.02
205 crk <= 3 | 1.031860 8.19

er
The output above indicates that I(0) space has a dimension of 2. P-values
of the likelihood ratio tests of one I(0) components is 8.2%. We conclude that
there is one I(0) component and one cointegration relation in the system. The
pe
estimated cointegrating vector β = (1, 0.002, −1.20, 0) can be reformulated as a
long-run real wage equation.

log wt = −0.002Ul,t + 1.20 log ynt . (4.19)


ot

This long-term equation shows that productivity has a positive effect on real
wages while unemployment has a negative effect. Real wages have an elasticity
of 1.2 to productivity, which means real wages will increase 1.2% for every 1%
tn

increase in productivity. The elastic response of the real wage to productive


210 growth implies that the wage share in total output is increasing, which con-
tradicts Bowley’s law.5 In contrast, the wage share is ”relatively stable” over
rin

the long run. Therefore we test the unit elasticity hypothesis. H0 : β3 = 1


H1 : β3 > 1. The likelihood ratio test has a p-value of 4%. The null hypoth-
esis is rejected at the significance level of 5% but not rejected a the level of
ep

215 1%. Because, despite heated debates, Bowley’s is a reliable long-run benchmark
(See Krämer [8] and Carter [2] for more detailed discussions.), we choose the

5 See Carter [2] for more details.


Pr

15

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
significance level at 1% and do not reject the null hypothesis that the elasticity

iew
coefficient is 1. As a result, we have a long run real wage equation:

log wt = −0.007Ul,t + log ynt . (4.20)

This equation says if the unemployment rate increases by 1%, the real wage
will decrease by 0.7%. A deviation from this long run relation is measure by

ev
220

the error term β ′ Yt−1 which is depicted in the following diagram. The plot in

r
0.88

er
0.86

pe
0.84
Series 1

0.82

ot
0.80

tn
0.78

0 20 40 60 80 100 120

Time
rin

Figure 4: Time series generated from example

Figure 4 shows the real wage can deviate from the long run equilibrium level by
about ±5%. The cointegration relationship also shows that the rate of capacity
ep

utilisation has no long-run effect on the real wage. However, it may have a
225 short-term impact on the real wage. To evaluate the short run dynamics, we
calculate impulse response functions of the mixed VECM shown in Figure 5. In
Pr

16

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
yn response to yn shock yn response to U_l shock yn response to rw shock yn response to V_c shock
1.00
0.05
0.00 0.75

iew
0.75
−0.05 0.00
0.50 0.50
−0.10
0.25 0.25 −0.05
−0.15
0.00
0.00 −0.10
−0.20
0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15

U_l response to yn shock U_l response to U_l shock U_l response to rw shock U_l response to V_c shock
0 40

−10 3
30 0

−20 0

ev
20

−5
−30 −3 10

−40 0
−6
0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15

rw response to yn shock rw response to U_l shock rw response to rw shock rw response to V_c shock
0.0 1.00
0.4 0.10

r
0.75
0.2 −0.1
0.05
0.50
0.0 −0.2
0.00
0.25

−0.3
−0.2

2
0 5 10 15

V_c response to yn shock

0.0

−0.1

−0.2
0 5 10 15

V_c response to U_l shock


er
0.00

1.0

0.5
0 5 10 15

V_c response to rw shock


−0.05

1.2

0.8
0 5 10 15

V_c response to V_c shock


pe
0.0
1 −0.3 0.4
−0.4 −0.5
0
−0.5 0.0

0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15

Figure 5: Time series generated from example


ot

this set of impulse response functions, the Cholezky decomposition is used in the
order (yn, U l, rw, V c). The shocks are one unit of the corresponding variables. A
tn

positive productivity shock, according to the impulse response function, reduces


230 unemployment rate, raises the real wage, and increases capacity utilisation. A
one standard deviation unemployment shock has no effect on productivity, but
it lowers the real wage and temporarily reduces capacity utilisation.
rin

5. Concluding remarks

In this paper, we extend the conventional cointegrated VECM to mixed


ep

235 cointegrated VECMs that can accommodate I(0) and I(1) variables. A testable
necessary and sufficient condition for a mixed cointegrated VECM is provided.
We show that a mixed cointegrated VECM is a special case of a conventional
Pr

17

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
cointegrated VECM. As a result, Johansen’s test can be used to test the cointe-

iew
gration rank as well as the constraints on the cointegrating vectors that make a
240 VECM a mixed cointegrated VECM. Practical implication of the results in this
paper is that the exiting econometric software packages such as R, RATS, and
EVIEWS that can be used to test restrictions on β in a VECM can be directly
use to test a mixed VECM and estimate the parameters of the mixed VECM.

ev
6. Appendix

245 Proof of Lemma 2.4 a)

r
To simplify the presentation, the lag length p is set to 2 and the deterministic
components are suppressed in the following proof without loss of generality. This
results in the following VECM

∆Yt = αβ ′ Yt−1 + Θ∆yt−1 + ut


er
pe
 ′ 
β⊥
Premultiply the equation above by ′
we obtain:
β

       −1    
′ ′ ′ ′ ′ ′
β⊥ ∆Yt β⊥ β⊥ β⊥ β⊥ ∆Yt−1 β⊥
  =   αβ ′ Yt−1 +  Θ   +  ut
β ′ Yt − β ′ Yt−1 β′ β′ β ′ Yt−1 − β ′ Yt−2
β′ β′
ot

       −1   
β ′ ∆Yt β′ h i 0 0 β′ β ′ ∆Yt−1
 ⊥  =  ⊥  0 α   + Θ ⊥   ⊥ 
β ′ Yt β′ 0 I β′ β ′ Yt−1
   −1    
′ ′ ′
β⊥ β⊥ 0 β⊥
+   Θ     +   ut
tn

β′ β′ β ′ Yt−2 β′

Replacing β⊥ ∆Yt and β ′ Yt by ∆Zt and Xt respectively, we obtain


       −1   
′ ′
∆Zt β⊥ h i 0 0 β⊥ ∆Zt−1
  =   0 α   + Θ   
Xt β′ 0 I β′ Xt−1
rin

   −1     
β′ β′ 0 0 ∆Zt−2 β′
+ ⊥ Θ ⊥     +  ⊥  ut
β′ β′ 0 I Xt−2 β′

The equation above shows the underlying process is a VAR process. Follow-
ing Theorem 4.2 in Johansen [7] p.49 ∆Z = β⊥ ∆Yt and X = β ′ Yt are stationary,
ep

250 hence the underlying process is a stationary VAR process. This proves Lemma
2.4 a).
Pr

18

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
To prove Lemma 2.4 b) we consider the following stationary VAR. Again,

iew
we choose p = 2 and suppress the deterministic components.
     
∆Zt ∆Zt−1 ∆Zt−2
  = ζ1   + ζ2   + vt , (6.21)
Xt Xt−1 Xt−2

To keep the lag length ofVECM being



1, we

assume in addition the coeffi-
(1) (1) (2)

ev
ζ11 ζ12 0 ζ12
cient of ∆Zt−2 is zero: ζ1 = (1) (1)
 ζ2 = (2)
 Rearrange equation
ζ21 ζ22 0 ζ22
(6.21):
       
(1) (1) (2)
∆Zt 0 ζ12 Zt−1 ζ11 ζ12 ∆Zt−1
= + +vt ,

r
  
(1) (2) (1) (2)
∆Xt 0 ζ22 + ζ22 − I Xt−1 ζ21 −ζ22 ∆Xt−1
 
(6.22)
The stationarity assumption of
characteristic polynomial equation

er
∆Zt
Xt
 implies the roots of the following
pe
I − ζ1 z − ζ2 z 2 = 0 (6.23)

lie outside the unit circle |z| = 1. Using lag operator the stationary VAR (6.21)
can be written as:  
∆Zt
(I − ζ1 L − ζ2 L2 )  =0
ot

Xt
    
∆Zt I−L 0 Zt
Inserting  =   into the equation above we have
Xt 0 I Xt
tn

  
I −L 0 Zt
(I − ζ1 L − ζ2 L2 )   =0 (6.24)
0 I Xt

Obviously
rin

   
I(1 − z) 0 I(1 − z) 0
(I − ζ1 z − ζ2 z 2 )   = (I − ζ1 z − ζ2 z 2 )   =0
0 I 0 I
ep

 
Zt
has only roots outside the unit circle and unit roots. Hence the VAR of  
Xt
satisfies Assumption 2.1. Next we show a full rank transformation of a VAR pro-
cess will not change the roots of the characteristic polynomials of the respective
Pr

19

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
VAR processes. For a VAR process

iew
p
X
Xt = Φl Xt−l + ϵt (6.25)
l=1

the roots the characteristic polynomial of VAR (6.25) is defined by the following
equation
p
X
|In − Φi z i | = 0 (6.26)

ev
i=1

For a full rank transformation of Wt = BXt we have


p
X
Wt = BΦl B −1 Wt−l + Bϵt (6.27)

r
l=1

and its roots is defined by

|In −
p
X

i=1
er
BΦi B −1 z i | = |B(In −
p
X

i=1
Φi z i )B −1 | = 0

Because B has full rank, the roots of equation (6.28) is identical to the roots
(6.28)
pe
of equation
 
(6.26). Using this results and that Yt is a full rank transformation
Zt
of  , we conclude the VAR of Yt satisfies Assumption 2.1. This proves
Xt
255 Lemma 2.4 b). □
Proof of Lemma 2.5
ot

According to the relation between Yt and the underlying process in equation


(2.4), if B31 = 0, the last k components of Yt are linear combinations of I(0)
tn

variables, these k components are I(0). This proves the sufficiency. If the last
260 k components of Yt are I(0), the last component of Yt is I(0). From equation
(2.4) we have
rin

r
X n−r
X
Yn,t = bn,i Zi,t + bn,j+r Xj,t . (6.29)
i=1 j=1
Pr
Because Zi,t for i = 1, 2, ..., r are independent I(1) variables, i=1 bn,i Zi,t is an
I(1) variable unless bn,i = 0 for i = 1, 2, ..., r. Because Yn,t is I(0) by assumption
ep

it follows bn,i = 0 for i = 1, 2, ..., r, i.e. the last row in B31 is zero. In the same
265 way we can prove each row in B31 is zero. This proves the necessity. □
Proof of Lemma 2.6
Pr

20

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
 
B 21 B 22 0
To prove the sufficiency, we can premultiply β ′ = by
0 0 B 33

iew
 22 −1
B 0
33
and obtain
0 B
 
′ (B 22 )−1 B 21 I 0
β∗ =   (6.30)
0 0 I

ev
By the
 formula of partial
 matrix inverse (2.12) and equation (2.13) we know
′ B 21 B 22 0
β = 33
implies B31 = 0.
0 0 B
To prove the necessity, we assume without loss of generality that the last k
components of Yt are I(0). Then a cointegrating matrix β ∗ can be written as:

r
270

 
′ B ∗21 B ∗22 B ∗23
β∗ = 
0
er 0 B 33

because a full rank transformation of the last k I(0) components of Yt are


still I(0), thus the n × k matrix (0, 0, B 33 ) is a cointegrating matrix of Yt ,
(6.31)
pe
containing k independent
  I(0) components. Premultiply equation (6.31) by
23∗ 33 −1
I −B (B )
0 I
we obtain  
B 21 B 22 0
β′ =  (6.32)
ot


0 0 B 33
275 This proves the necessity. It is to note that making B 23 = 0 is not the con-
sequence of k I(0) components in Yt but the convention of normalization of
tn

orthogonal cointegration relations and the I(0) components. □

References
rin

[1] Boswijk, H. P. and Doornik, J. A. (2004). Identifying, estimating


280 and testing restricted cointegrated systems: An overview. SEMANTIC
SCHOLAR, Corpus ID:9940494:DOI:10.1111/j.1467–9574.2004.00270.x.
ep

[2] Carter, S. (2007). Real wage productivity elasticity across advanced


economies, 1963-1996. Journal of Post Keynesian Economics, 29(4):573–
600.
Pr

21

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531
ed
285 [3] Chen, P. and Hsiao, C. Y. (2004). Testing weak

iew
exogeneity in cointegrated system. REPEC , page
http://repec.org/esFEAM04/up.27567.1079363411.pdf.

[4] Engle, R., Hendry, D. F., and Richard, J.-F. (1983). Exogeneity.
Econometrica, 51:277–304.

ev
290 [5] Hamilton, J. D. (1994). Time Series Analysis. Princeton, 1st edition.

[6] I. Habro, S. Jahansen, B. N. A. R. (1998). Asymptotic inference on coin-


tegrating rank in partial systems. Journal of Business&Economic Statistics,

r
16:388–399.

295
er
[7] Johansen, S. (1995). Likelihood-Based Inference in Cointegrated Vector
Autoregressive Models. Oxford University Press, 1st edition.

[8] Krämer, H. M. (2011). Bowley’s law: The diffusion of an empirical sup-


pe
position into economic theory. Papers in Political Economy, 61,(2):19–49.

[9] Pesaran, M. H., Shin, Y., and Smith, R. J. (2001). Bounds testing
approaches to the analysis of level relationships. Journal of Econometrics,
16:289–326.
ot

300
tn
rin
ep
Pr

22

This preprint research paper has not been peer reviewed. Electronic copy available at: https://ssrn.com/abstract=4239531

You might also like