Professional Documents
Culture Documents
1
General Framework
2
Outline
• Concluding Remarks
3
Sparse Principal Component Analysis
4
Leading Eigenvectors Estimation
5
Applications
• Equity data
• ...
6
Sparse Leading Eigenvectors
7
Assumption and Estimation
Estimation:
where Σn is the sample covariance matrix, Sd−1 is the unit sphere in Rd and
B0 (s) := {v ∈ Rd : ||v||0 ≤ s}.
8
Rates of Convergence
Theorem. Let ||θ1 ||0 = s. Then the parametric rate of convergence under the
`2 norm is
r !
1 s log d
OP ,
λ 1 − λ2 n
9
Constraints
The Gaussian or sub-Gaussian assumption is too constraint, appearing to be
rare in applications.
10
Equity Data
452 stocks that were consistently in the S&P 500 index between January 1,
2003 though January 1, 2008.
0.06
0.40
0.04
0.02
0.38
0.00
stock 2
stock 4
0.36
−0.02
−0.04
0.34
−0.06
0.32
0.36 0.37 0.38 0.39 0.40 0.41 0.06 0.08 0.10 0.12
stock 1 stock 3
11
Extensions to non-Gaussian (Abnormal)
12
Liu, Lafferty and Wasserman (2009, JMLR)
13
z
z
z
x
x
y
x
y y
3 3 3
2 2 2
1 1 1
0 0 0
−1 −1 −1
−2 −2 −2
−3 −3 −3
−3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3 −3 −2 −1 0 1 2 3
14
Liu et.al. Annals of Statistics (2012)
15
Good Enough?
Theorem. The tail dependence is zero in Nonparanormal (Gaussian Copula).
16
17
Elliptical Distribution
When the density exists, the elliptical distribution has the density:
Note: Elliptical distributions can be very heavy tailed and even have
infinite first or second moments.
18
Transelliptical Distribution
(f1 (X1 ), f2 (X2 ), . . . , fd (Xd ))T ∼ ECd (0, Σ, g), where diag(Σ) = 1.
19
Kendall’s tau and Its Invariance Property
τ (Xj , Xk ) = P((Xj − X
ej )(Xk − X
ek ) > 0) − P((Xj − X
ej )(Xk − X
ek ) < 0),
where (X
ej , X
ek ) is a independent copy of (Xj , Xk ).
20
Transelliptical Component Analysis
Let x1 , . . . , xn be n independent realizations of X. Using the Kendall’s tau
correlation coefficient estimate:
2 X
τbjk = sign (xij − xi0 j ) (xik − xi0 k ) .
n(n − 1)
1≤i<i0 ≤n
Let
h π i
bτ
R := sin τbjk .
2
21
Theoretical Results
In estimating θ1 and its support, we have
22
Equity Data Again
Prediction of the Market Trend:
86
85
Successful Matches %
84
83
Pearson
NS
Kendall
82
15 20 25 30 35 40 45
23
Beyond PCA
• Directed Graphs Estimation
• Discriminant Analysis
• ...
24
Remarks
• Model Flexibility
25
Thanks!
for more information, please come to the poster session (Tu66).
26