Professional Documents
Culture Documents
A Nonparametric Measure of Inequality
A Nonparametric Measure of Inequality
1 5 2 3 4
3 4 1 5 2
Example: 3 4 1 5 2
5 2 1 4 3
n
2
d 1( , ) k k .
k 1
2. Spearman’s Footrule*:
n
d 2( , ) k k .
k 1
3. Kendall’s Distance*:
4. Cayley’s Distance*:
Cayley's Distance dC between two permutations σ and τ is
given by the minimum number of transpositions needed to
reach σ from τ.
(i) D1 A min d i, .
d
1i m
(ii ) D 2 A d , .
d
i j
1i j m
2
(ii) If d is a metric,
then
D 2 A m 1 D1 A .
d d
Step 1: 1 2 3 4
So,
Step 2: 2 1 3 4 d* (σ,τ) = 3.
Step 3:
2 1 4 3
We propose the following D-Measure:
D*= D2 d*
i.e. for a rank matrix Amxn ,
D A D A d , .
* d* *
2 i j
1i j m
Proposition 1: Let , ' S n .Then
d * , ' d K , ' .
(m 2 1) n
{
D* A
m2
4
n
,
2
if m is odd;
, if m is even.
4 2
Contd.
1 1
, if m is odd;
(ii)
{
I
1
2 2m
1
, if m is even.
2 2(m 1)
Theorem 5: (contd.)
For a rank matrix Amxn with Inequality Coefficient I, we have
•We use the results of 59th and 61st rounds of NSSO household
survey.
•The 59th round survey was carried out in 2003 and the 61st in
2004-05.
•We used the data for 17 states for which data on all
categories were available.
I=0.591
P-value=0.003
Inequality among States in India
Results: 61st Round
•Data were available for all the states.
I=0.605
P-value=0.00001
Inequality among States in India
Results: 61st Round
TRUNCATED
•Data were available for all the states in the 61 st round,
but only for 17 states in the 59th.
I=0.575
P-value=0.010
Comparison with other Statistics
Other statistics used for comparing are:
(1)Friedman Statistic:
2
(n 1)
D '' i j i ( j )
2
(2)Statistic used by Sarkar et al.:
D' D . dC
1
Comparison with D’
1 2 3 4 1 2 3 4
A1 A2
4 3 2 1 2 1 4 3
D '( A1 ) 1, D '( A2 ) 2
D* ( A1 ) 5, D* ( A2 ) 2
1 4 3 2 D* ( A1 ) 24, D* ( A2 ) 20
2 1 4 3
A2 has more inequality w.r.t. D*.
3 2 1 4
4 3 2 1
In real life scenarios we often end up with situations where ties
exist between the ranks of blocks, or where the data is
incomplete.
In such cases, we give a natural extension to our measure by
using the formula
1
I
m n i j k l 1{ i k i l j k j l 0}
.
2 2
In case of ties we replace the indicator function by 0.5.
In case of incomplete data, we ignore those cases and scale by
the number of meaningful observation pairs.
»» Large sample distribution of I
»» Further investigation of the combinatorial properties of
D*.
»» Effect of an outlier block.
»» Effect of different clusters in categories.
»» Exploring the case when all the categories are not of
equal importance.