Professional Documents
Culture Documents
Three-Stage Semi-Parametric Estimation of T-Copulas: Asymptotics, Finite-Samples Properties and Computational Aspects
Three-Stage Semi-Parametric Estimation of T-Copulas: Asymptotics, Finite-Samples Properties and Computational Aspects
Dean Fantazzini
Moscow School of Economics
• Conclusions
Under some regularity conditions, Genest et al. (1995) show that the
semiparametric estimator γ̂ CM L (for bivariate copula) has the following
asymptotic distribution:
√ 2
d σ
T (γ̂ CM L − γ0 ) → N 0, 2 (4)
h
where
and where Wi (xit ) can have this alternative expression too, upon
integrating by parts with respect to ui (i = 1, 2):
Z
Wi (xit ) = − 1l Fi (Xi )≤ui lγ (u1 , u2 ; γ) li (u1 , u2 ; γ) c(u1 , u2 ; γ)du1 du2
Since the seminal work by Genest et al. (1995), it has become common
practice to use semi-parametric methods with high-dimensional elliptical
Student’s T copulas too (see, e.g. Cherubini et al. (2004) and Mcneil et
al. (2005)):
− υ+n
n ζ ′ Σ−1 ζ 2
υ+n υ 1+
" #
Γ Γ υ
c(tυ (x1 ), . . . , tυ (xn )) = |Σ|−1/2 2
υ
2
υ+1
− υ+1
Γ 2 Γ 2 n
Q
ζi2 2
1+ 2
i=1
Definition 1.2 (Kendall’s tau). If we have (X1 ; X2 ) and (X̃1 ; X̃2 ) two
independent and identically distributed random vectors, the population
version of Kendall’s tau τ (X1 ; X2 ) is, (see Kruskal (1958)):
h i
τ (X1 , X2 ) = E sign (X1 − X̃1 )(X2 − X̃2 ) (5)
Lindskog, McNeil, and Schmock (2002) proved that Kendall’s tau for
elliptical distributions is given by
2
τ (X1 , X2 ) = arcsin ρX1 X2 (7)
π
where ρX1 X2 is the copula correlation parameter.
There are cases when the copula parameter vector has different kinds of
parameters and only some of them can be expressed as a function of the
Kendall’s tau. This is the case for the T-copula . In this situation, Bouyé
et al. (2001) and McNeil et al. (2005) have suggested the following
estimation procedure:
2. Collect all pairwise estimates of the sample Kendall’s tau given by (9)
in an empirical Kendall’s tau matrix R̂τ defined by
τ
R̂jk = τ (FjT (Xj ), FkT (Xk )), and then construct the correlation matrix
τ
using this relationship Σ̂j,k = sin( π2 R̂j,k ), where the estimated
parameters are the q = n · (n − 1)/2 correlations [ρ̂1 , . . . ρ̂q ]′ .
E [ψ1 (F1 (X1 ), F2 (X2 ); ρ1 )]
..
ψ (F1 (X1 ), . . . , Fn (Xn ); θ0 ) = =0
.
E [ψq (Fn−1 (Xn−1 ), Fn (Xn ); ρq )]
(11)
Just note that the CML estimator is defined by the derivative of the
log-likelihood function with respect to the degrees of freedom:
T
∂l(·; ν) X
= lν F1T (x1,t ), . . . , FnT (xn,t ); Σ̂ , ν̂ = 0 (12)
∂ν t=1
Let define the sample moments vector Ψ KM E−CM L for the parameter
vector Ξ̂ = [ρ̂1 , . . . ρ̂q , ν̂]′ as follows:
Ψ KM E−CM L F1T (x1,t ), . . . , FnT (xn,t ); Ξ̂ =
1 PT
i=1 ψ1 (F1T (x1,t ), F2T (x2,t ); ρ̂1 )
T
..
.
=
=0
1 T
P
i=1 ψq Fn−1,T (xn−1,t ), FnT (xn,t ); ρ̂q
T
1 PT
T i=1 ψν F1T (x1,t ), . . . , FnT (xn,t ); Σ̂, ν̂
Let also define the population moments vector with a correction to take
the non-parametric estimation of the marginals into account, together with
its variance (see Genest et al. (1995), § 4):
ψ1 (F1 (X1 ), F2 (X2 ); ρ1 )
..
.
∆0 = =0 (13)
ψq (Fn−1 (Xn−1 ), Fn (Xn ); ρq )
n
P
ψν (F1 (X1 ), . . . , Fn (Xn ); Σ0 , ν0 ) + Wi,ν (Xi )
i=1
′
Υ0 ≡ var [∆0 ] = E ∆ KM E−CM L ∆ KM E−CM L (14)
where
∂2
Z
Wi,ν (Xi ) = 1l Fi (Xi )≤ui log c(u1 , . . . un )dC(u1 , . . . un ) (15)
∂ν∂ui
7 Note that the population moments used to estimate the correlations are
→
not affected by the marginals empirical d.f., since the Kendall’s tau is
invariant under strictly increasing marginal transformations
• We found that the KME-CML estimator was more efficient and less
biased than the one-stage ML estimator when small samples and
t-copulas with low degrees of freedom ν were of concern.
• When small samples were of concern and ν was high, the number of
times when the numerical maximization of the log-likelihood failed to
converge was much higher for the ML method than for the KME-CML
method.
• Yet, while the coverage rates at the 95% level for the ML estimates for
ν did not show any particular bias or trend, the KME-CML estimates
showed very low rates when ν became close to 30 and the correlations
were not too strong.
• However, this drop in the coverage rates for ν was large with bivariate
t-copulas, only, while it was much lower with ten-variate copulas.
• The coverage rates for the correlations were quite close to the true
values.
• This fix induces a positive mean bias in the estimate of ν, but the
effects on the coverage rates are rather limited. Besides, the number of
times when this method has to be used quickly decreases when ν
increases.