Professional Documents
Culture Documents
of High-Dimensional Data
Gunnar Carlsson
Department of Mathematics
Stanford University
Stanford, California 94305
gunnar@math.stanford.edu
Qualitative Properties of
Data Sets
• Clustering
• Dimension
• Presence of Flares
Idea: Study Probability
Density Functions
Qualitatively
• Examples:
E(f,+,r)
E(f,-,r)
Components of E(f, −, r)
vary with r
b = b =1
0 1
b = 0 for i>1
i
b =1 b = 2
0 1
b = 0 for i > 1
i
b = b =1
† 0 2
b = 0 otherwise
i
Sn b = bn =1
0
b = 0 otherwise
i
†
b = b =1, b = 2
0 2 1
b = 0 for i> 2
i
†
Remarks
• Comments:
1. Doesn’t involve embedding in Eu-
clidean space, which introduces dis-
tortion
2. Doesn’t involve fitting to a particu-
lar geometric model, which can be
deceiving
3. Gets directly at the core question,
which is whether the space has a
loop in it, given by periodicity