Professional Documents
Culture Documents
L18 Cumulants PDF
L18 Cumulants PDF
TOPIC. Cumulants. Just as the generating function M of a random variable X generates its moments, the logarithm of M generates a sequence of numbers called the cumulants of X. Cumulants
are of interest for a variety of reasons, an especially important one
being the fact that the j th cumulant of a sum of independent random
variables is simply the sum of the j th cumulants of the summands.
Definitions and examples. To begin with, suppose that X is a
real random variable whose real moment generating function M (u) =
E(euX ) is finite for all us in an open interval about 0. Since M (0) =
1 6= 0 and since the composition of two functions which have power
series expansions itself has a power series expansion, we may write
X n un
(log M )(u) =
(1)
n=0 n!
for all u in some (possibly smaller) open interval about 0. The numbers 1 , 2 , . . . in this expansion are called the cumulants (for a
reason that will be explained subsequently). Notice that
n = (log M )(n) (0)
(2)
(5)
in
Section 11 K(t) denoted log M (t) , whereas here K(t) is log (t) .)
Formula (5) is used to define the first n cumulants of X when X
is only known to have an nth moment, i.e., when E(|X|n ) < . In
this situation is n-times continuously differentiable and close to 1
in an open interval about 0 R, and so K = log() is defined and
n-times continuously differentiable in that interval. Consequently the
j th cumulant
j := K (j) (0)/ij
(6)
(7)
(82 )
2 2
t /2
1 = , 2 = 2 , 3 = 4 = = 0.
(9)
for all n N. Since the functions log M , log G, and K = log generate the cumulants, they are called cumulant generating functions
(CGFs). (Some properties of cumulants and their generating func-
Since the first and second cumulants of any random variables are its
mean and variance (see (19) and (20)), the normal distribution has
the simplest possible cumulants.
18 1
18 2
Properties of cumulants. This section develops some useful properties of cumulants. The nth moment of cX is cn times the nth moment
of X; this scaling property is shared by the cumulants.
Theorem 1 (Homogeneity). Suppose X is a random variable with
an nth cumulant. Then for any c R, cX has an nth cumulant and
(10)
it
1)
and
K(t) = (eit 1) =
(13)
n (cX) = cn n (X).
X
n=1
in tn
n!
(n)
(n)
(n)
1 = 2 = 3 = = .
(11)
|t|2
|t|3
= |t| + |t|
+
+
(for |t| < 1)
2
3
it
i2 t2
it
i2 t2
t2
+ o(t2 ) = 1 + 2
+ o(t2 )
= + o(t2 ) = 0 + 1
2
1!
2!
1!
2!
as t 0. (82 ) gives
n (X) + b, if n = 1,
(14)
n (X + b) =
n (X),
if n > 1.
Proof For t near 0 we have
X+b (t) = E(eit(X+b) ) = eitb E(eitX ) = eitb X (t)
= KX+b (t) = itb + KX (t)
(n)
1 = 0 and 2 = 1.
(12)
3 , 4 , . . . are undefined.
(n)
= KX+b (t) =
dn
dtn itb
(n)
+ KX (t)
= (14) holds.
18 3
18 4
19:39 01/11/2001
Here is the reason for the name cumulants; note that (16) is
much simpler than the corresponding relation for moments.
Theorem 3 (Cumulants accumulate). Suppose X and Y are independent random variables, each having an nth cumulant. Then
S := X + Y has an nth cumulant, and
n (S) = n (X) + n (Y ).
(15)
(16)
Pr
(18)
j r+1j for r = 0, . . . , n 1.
r
j=0 j
= 1 ,
= 2 + 1 1 ,
= 3 + 21 2 + 2 1 ,
= 4 + 31 3 + 32 2 + 3 1 .
(19)
j := E (X E(X))j
and formulas (19) simplify to
2 = 2 ,
2 = 2 ,
3 = 3 ,
3 = 3
4 = 4 +
(17): r+1 =
322 ,
4 = 4
(20)
322 .
X Gamma(r)
X Poisson()
1 = r, 1 = r,
1 = , 1 = ,
2 = r
2 = , 2 = ,
3 = 0,
3 = 0,
4 = 0,
2 = , 2 = ,
2 = r,
3 = 2r, 3 = 2r,
18 6
3 = , 3 = ,
m2 times
mk times
:= m1 + m2 + + mk
3 = 3 + 32 1 + 31 ,
4 = 4 + 43 1 + 322 + 62 21 + 41 .
Note that the formula for 4 is a linear combination of products of j s
and that, including multiplicities, the subscripts j on the j s in those
products form the following lists:
These are all the possible lists of nonincreasing positive integers which
add to 4; theyre called the additive partitions of 4. In what follows
IPam going to show that for any n N, n is a sum of the form
partitions of n, c is a
c where ranges over the additive
Q
certain number depending on , and = j j .
We need some notation. For a positive integer n let
Pn be the collection of all additive partitions of n.
(21)
(221 )
mk times
(23)
n!
(j1 !)m1 (j2 !)m2
(jk
!)mk
1
m1 ! m2 ! mk !
d := c (1) 1 ( 1)!
m1 times
1 = 1 ,
m1 times
(222 )
(25)
(24)
(26)
the sums here are taken over the elements = [ j1m1, j2m2, . . . , jk k ]
of Pn and c , d , , and are defined by (24)(26) above.
Example 3. One of the additive partitions of 11 is = [41 , 22 , 13 ] =
[ j1m1 , j2m2 , j3m3 ] for j1 = 4, j2 = 2, j3 = 1, m1 = 1, m2 = 2, m3 = 3.
According to (27), the contribution this makes to 11 is c where
:= 4 22 31
and
c :=
11!
1
= 34650 .
4! (2!)2 (1!)3 1! 2! 3!
18 7
18 8
= 4 22 13
19:39 01/11/2001
(23): :=
Pk
i=1
mi
(24): c := n!/
Qk
mi
mi !
i=1 (ji !)
where c and are defined by (24) and (23) respectively, f (j) (x) is
the j th derivative of f at x, and g () (y) is the th derivative of g at y.
Example 4. For the additive partition [21 , 11 ] of n = 3 one has
= 1 + 1 = 2 and c = 3!/[(21 1!)(11 1!)] = 3. According to Fa`a di
Brunos formula,
derivative of h = g(f ) should contain the
0 the third
00
00
term 3g f (x) f (x)f (x). This can be verified by direct calculation:
h00 = g 00 (f )(f 0 )2 + g 0 (f )f 00 ,
Pn c
(j)
g ( ) f (x)
(x)
j f
X
Pn
hY
j
i
X
ij j = in
Pn
c ;
the RHS of
(27) follows by applying (28) for x = 0 to K(t) = g (t) with g(z) =
log(z) and using g () (z) = (1)1 ( 1)!/z = (1)1 ( 1)! for
z = y = (0) = 1.
Xn
j=0
h(j) (x)
( x)j + o | x|n as x.
j!
h0 = g 0 (f )f 0 ,
h = g(f )
Xn
j=0
Hj
( x)j + o | x|n as x
j!
h = g(f )
Pn c
(j)
g ( ) f (x)
(x)
j f
( x)j + o | x|n as x
(29)
f () =
j=0 j!
for fj = f (j) (x). Similarly, since g has an nth derivative at y = f (x),
Xn g
(30)
g() =
( y) + o | y|n as y
=0 !
for g = g () (y). Thus
Xn g
h() = g f () =
f () f (x) + o |f () f (x)|n
=0 !
Xn g Xn fi
=
( x)i + o | x|n
+ o | x|n
=0 !
i=1 i!
Xn g Xn fi
=
( x)i + o | x|n
=0 !
i=1 i!
X n Hj
=
( x)j + o | x|n
(31)
j=0 j!
=1
n
X
g h
=1
j=0
j=0
as x, where
Xn
Hn = n!
= n!
fi1 fi2 fi i
g hX
i1 ,i2 ,...,i 1
!
i1 +i2 ++i =n i1 ! i2 ! i !
m
fjm1 1 fjk k
X
m
[ j1 1,...,jk k ]Pn
m1 ++mk =
i
!
. (32)
m1 ! mk !
The last step uses the fact that the number of -tuples i1 , i2 , . . . , i
which contain m1 j1 s, m2 j2 s, . . . , and mk jk s (for
j1 , . . . , jk )
distinct
!
is given by the multinomial coefficient m1 m
= m1 !m
. This
k
k!
completes the proof of (28).
It is worth noting that (29) and (30) for arbitrary fj s and g s
imply (31) with Hn given by (32). This result doesnt require f and
g to be differentiable,
18 11
(34)
1 1
1
+
,
2
t
exp(t) 1
18 12
19:39 01/11/2001
(35)
in particular
B0 = 1,
B6 =
1
,
42
1
B1 = ,
2
1
B8 = ,
30
1
,
6
5
=
,
66
B2 =
B10
1
,
30
691
=
2730
B4 =
B12
(36)
if m = n,
1,
pn,m = Xmin(m,nm)
pnm,j , if m < n.
j=1
and that
Exercise 6. Let U and the Bernoulli numbers be as above. By
integrating K 0 (s) from 0 to t, show that
K(t) =
X Bk tk
k=2 k k!
(37)
pn =
Xn
m=1
pn,m .
for |t| < 2, and hence that the k th cumulant of U is Bk /k, for k 2;
in particular all cumulants of odd order equal zero, while
2 =
1
12
1
8 =
,
240
1
,
120
1
=
,
132
4 =
10
1
,
252
691
=
.
32760
6 =
12
(38)
18 14