Professional Documents
Culture Documents
Email: nzabaras@gmail.com
URL: https://www.zabaras.com/
September 3, 2020
Test
point
Naïve classification based on division in cells & taking a “majority vote” on the
cell where a test point 𝑥 lies is not practical in high-dimensions.
Bishop, C. M. and G. D. James (1993). Analysis of multiphase flows using dual-energy gamma densitometry and neural networks. Nuclear Instruments
and Methods in Physics Research A327, 580–593
Statistical Computing and Machine Learning, Fall 2020, N. Zabaras 3
The Curse of Dimensionality
With an exponentially large number of cells, we will need an exponential large
training data set to be sure that the cells are not empty.
𝑀𝐷 # cubical regions
(here 𝑀 = 3)
D D D D D D
y ( x , w ) w0 wi xi wij xi x j wijk xi x j xk
i 1 i 1 i 1 i 1 i 1 i 1
How many 𝑤 −terms do we have in the 𝑀th term in this general polynomial
expansion? Using symmetry:
𝐷 𝐷 𝐷 𝐷 𝑖1 𝑖𝑀−1
… 𝑤
ഥ𝑖1𝑖2...𝑖𝑀 𝑥𝑖1 𝑥𝑖2 . . . 𝑥𝑖𝑀
𝑖1 =1 𝑖2 =1 𝑖𝑀 =1
We can estimate the number of terms in this expansion as:
D i1 iM 1 D i1 iM 1
D D
n( D, M ) ... 1 ... 1 n(i1 , M 1) n(i, M 1)
i1 1 i2 1 iM 1 i1 1 i2 1 iM 1 i1 1 i 1
By induction in 𝐷 (and any 𝑀) can directly show that (Hint: true for 𝐷 = 1,
assume true for 𝐷 and show true for 𝐷 + 1):
D
i M 2 ! D M 1!
i 1 i 1 ! M 1 ! D 1! M !
(1)
n ( D, M )
D M 1!
Using the above and by induction in 𝑀 and any 𝐷 can show that D 1!M !
𝐷+𝑀−2 !
For 𝑀 = 2, n( D, 2) D D 1 2 , assume true for 𝑀 − 1, i.e. n 𝐷, 𝑀 − 1 =
D
Using (1), we obtain: n( D, M ) n(i, M 1)
D M 1 !
𝐷−1 ! 𝑀−1 !
i 1 D 1! M !
Statistical Computing and Machine Learning, Fall 2020, N. Zabaras 6
Polynomial Expansion in High Dimensions
The total number of
M
the terms in a polynomial of order 𝑀 in a 𝐷 −dimensional
space is: N ( D, M ) n( D, m) . Prove that:
m0
N ( D, M )
D M !
D !M !
D M 1!
Using n ( D, m)
D 1! M !
and by induction in 𝑀 (for constant 𝐷):
M
N ( D, M 1) n( D, m) n( D, M 1)
D M ! D M !
D M 1!
m0 D !M ! ( D 1)!( M 1)! D !( M 1)!
Consider the case 𝑀 >> 𝐷 and use Stirling’s approximation for large 𝑛, n ! n n e n
you can show that 𝑁 is of the order 𝑀𝐷:
D M
DM M D
e( D M ) M D M e D D M D e D D (1 D )e
D
N ( D, M ) 1 1 ( M D) MD
D !M M e M D !M M M D! M D!
e i 1
xi2
dxi S D e r2
r D 1dr , dxi S D r D 1dr
i 1
0
Let us calculate the ratio of the volume of a sphere of radius 𝑎 to the volume
of the hypercube of size 2𝑎 (sphere touches the hypercube at the centers of
each side) Vol. of sphrere 2 1 D /2 D /2
a D
D 2a D
D
Vol. of cube
DG D 2 D 1 G
2 2
For high dimensions, use Stirling’s formula for the Gamma function
2 z 4 D
z D /2
D
G z G
z e 2 D 2e
Statistical Computing and Machine Learning, Fall 2020, N. Zabaras 10
Volume of a Sphere in High-Dimensions
Vol. of sphrere D /2 D 4 D
D /2
G
Vol. of cube D
D 2 D 1 G
2 D 2e
2
This implies that most of the volume of the hypercube must be concentrated
in its many corners which themselves become very long spikes (note distance
from the center of the cube to the corners a D )!
𝑝𝑚(𝑟)
We see that the probability is larger at the origin than at 𝑟Ƹ by a factor 𝑒 𝐷/2 , i.e.
most of the probability mass in high 𝐷 is concentrated at a different radius
than the density!
This distinction for Gaussians in high 𝐷 is important in Bayesian estimation.
Statistical Computing and Machine Learning, Fall 2020, N. Zabaras 14
Curse of Dimensionality
Real data are confined to a region of the space having lower effective
dimensionality.
Real data exhibit smoothness so that small changes in the input variables
produce small changes in the target variables.