Rosenthal

A NOTE ON BURKHOLDER-ROSENTHAL INEQUALITY
ADAM OSȨKOWSKI
Abstract. Let df be a Hilbert-space-valued martingale difference sequence.

The paper is devoted to a new, elementary proof of the estimate
 
∞  ∞
!1/2 ∞
!1/p 
X  X X 
dfk ≤ Cp E(|dfk |2 |Fk−1 ) + |dfk |p ,
 
k=0 p
 k=0 k=0 
p p
with Cp = O(p/ ln p) as p → ∞.
1. Introduction
Let (Ω, F, P) be a probability space, filtered by (Fn )n≥0 , a nondecreasing family
of sub-σ-algebras of F. Assume that f is an adapted martingale, taking values in
a certain separable Hilbert space H with norm | · | and scalar product h·, ·i. Then
df = (dfn )n≥0 , the difference sequence of f , is given by df0 = f0 and dfn = fn −fn−1 ,
n ≥ 1. We define the conditional square function of f by
"∞ #1/2
X
2
s(f ) = E(|dfk | |Fk−1 )
k=0
(here and below, F−1 = F0 ) and use the notation

" n #1/2
X
2
sn (f ) = E(|dfk | |Fk−1 ) , n = 0, 1, 2, . . . ,
k=0
for the truncated conditional square function of f .

The purpose of this note is to investigate Burkholder-Rosenthal inequality
 !1/p 
X∞
(1.1) ||f ||p ≤ cp ||s(f )||p + |dfk |p 
k=0
p
where p ≥ 2 and cp is a constant depending only on p. The special case in which

the martingale f is a sum of independent mean-zero random variables forms an
important extension of Khintchine inequality and was studied by Rosenthal in the
60’s . The proof from [11] gives the constant cp which grows exponentially in p as
p → ∞. Johnson, Schechtman and Zinn [4] refined the reasoning and showed that
the optimal order of cp as p → ∞ (still in the independent case) is p/ ln p. Applying
difficult isoperimetric techniques, Talagrand [12] extended this statement to the case
of independent Banach-space-valued random variables. Using hypercontractivity
2010 Mathematics Subject Classification. Primary: 60G42.

Key words and phrases. martingale, conditional square function.
Partially supported by MNiSW Grant N N201 397437.
1
2 ADAM OSȨKOWSKI
methods, Kwapień and Szulga [7] gave a completely elementary proof of Talagrand’s
result.
The inequality (1.1) for general real martingales (and some cp ) was established
by Burkholder in [1]. The validity of this estimate with cp = O(p/ ln p) was proved
by Hitczenko [5] (see also [6]). This result was further generalized to vector-valued
setting by Pinelis [10]. Consult also Nagaev [8] for a yet another approach.
The purpose of this paper is to present a new and elementary proof of (1.1) with
cp = O(p/ ln p). Precisely, we will establish the following statement.
Theorem 1.1. If f is a Hilbert-space-valued martingale, then for p ≥ 4 we have

∞
!1/p p 1/p
X
(1.2) ||f ||p ≤ Cp ||s(f )||pp + |dfk |p  ,
k=0
p
where
√ p 1/p
p
Cp = 2 2 +1 1+ .
4 ln(p/2)
In fact, using Davis’ decomposition, we will be able to prove a slightly stronger
estimate: see (2.10) and Remark 2.5 below.
A few words about the proof are in order. Hitczenko [5], [6] and Pinelis [10] apply
the extrapolation method (good λ-inequality) of Burkholder and Gundy, combined
with appropriate version of Prokhorov “ arcsinh” estimate for martingales. Na-
gaev [8] first establishes a certain exponential bound for the tail of f and deduces
Burkholder-Rosenthal estimate using a standard integration argument. Our ap-
proach is entirely different and exploits the properties of a certain special function;
this type of proof can be regarded as an application of Burkholder’s method (see
[2] and [9] for more on the subject).
2. Proof of Theorem 1.1

The starting point is the following technical estimate proved by Kwapień and
Szulga [7].
Lemma 2.1. Let p ≥ 4 and put
ln(p/2)/p
η = η(p) := .
1 + ln(p/2)/p
Then for any t ≥ 0 we have
p
(2.1) (1 + tη)p − ptη ≤ 1 + − 1 t2 + tp .
2
We shall require the following vector-valued version
√ of this bound. From now
on, we assume that p ≥ 4 and that σ = σ(p) = η(p)/ 2.
Lemma 2.2. For any y, d ∈ H we have
√ √ p
(2.2) |y + 2σd|p − p|y|p−2 y, 2σd ≤ |y|p + |y|p−2 |d|2 + |d|p .
2
√
Proof. The left-hand side can be rewritten in the form F (hy, 2σdi), where
p/2
F (s) = |y|2 + 2σ 2 |d|2 + 2s − p|y|p−2 s, s ∈ R.
Now keep |y| and√ |d| fixed; since
√ the function F is convex, it suffices to prove the
estimate for hy, 2σdi = ± 2σ|y||d|, i.e. in the case when y and d are linearly
BURKHOLDER-ROSENTHAL INEQUALITIES 3
√ √
dependent. If hy, √2σdi = √ 2σ|y||d|, then (2.2) follows directly from (2.1); on the
other hand, if hy, 2σdi = − 2σ|y||d|, we have
√ √ √ p √
|y + 2σd|p − p|y|p−2 y, 2σd = |y| − 2σ|d| + p 2σ|y|p−1 |d|
√ p √
≤ |y| + 2σ|d| − p 2σ|y|p−1 |d|,
so the claim again follows from (2.1).
The key ingredient of the proof is the special function U : [0, ∞)×H×[0, ∞) → R,
given by ( √
(|y|2 − x2 )p/2 − cxp − z if |y| ≥ 2x,
U (x, y, z) = √
|y|p − (2p/2 − 1 + c)xp − z if |y| < 2x,
where
c = p 2p/2−2 + 1.
Let us list some properties of this function.
Lemma 2.3. (i) For any (x, y, z) ∈ [0, ∞) × H × [0, ∞) we have
n o
p/2
(2.3) U (x, y, z) = min |y|2 − x2 − cxp − z, |y|p − (2p/2 − 1 + c)xp − z .
(ii) For any x ≥ 0 and y ∈ H we have
(2.4) U (x, σy, |y|p ) ≤ 0.
(iii) For all (x, y, z) ∈ [0, ∞) × H × [0, ∞) we have
U (x, y, z) ≥ 2−p/2 |y|p − σ p Cpp (xp + z) .

(2.5)
Proof. (i) For fixed x, z ≥ 0, the function
F (s) = sp − (2p/2 − 1 + c)xp − z − |s2 − x2 |p/2 − cxp − z ,

s ≥ 0,
√
vanishes at s = 2x and is strictly increasing:
h i
F 0 (s) = ps sp−2 − |s2 − x2 |(p−2)/2 sgn (s2 − x2 ) .
This yields (2.3).
(ii) This is obvious, since σ ≤ 1.
(iii) Using the definitions of Cp and σ, we see that we must prove the bound
h p i
U (x, y, z) ≥ 2−p/2 |y|p − + 1 2p (xp + z) .
4
√
Now, for |y| < 2x, we have
p h p i
U (x, y, z) = |y|p − + 1 2p/2 xp − z ≥ 2−p/2 |y|p − + 1 2p (xp + z) .
4 4
√
On the other hand, if |y| ≥ 2x, then |y|2 − x2 ≥ |y|2 /2 and hence
h i
U (x, y, z) ≥ 2−p/2 |y|p − 2p/2 cxp − 2p/2 z ,
so the majorization is clear.
We turn to the key property of the function U .
Lemma 2.4. For any x, z ≥ 0, y ∈ H and any H-valued, mean-zero random
variable d with ||d||p < ∞ we have
p
(2.6) EU x2 + E|d|2 , y + σd, z + |d|p ≤ U (x, y, z).
4 ADAM OSȨKOWSKI
Proof. We consider three cases separately.

1◦ The case |y|2 ≤ 2x2 . By (2.3), we have
p
EU x2 + E|d|2 , y + σd, z + |d|p
≤ E|y + σd|p − (2p/2 − 1 + c)(x2 + E|d|2 )p/2 − z − E|d|p
= E |y + σd|p − p|y|p−2 hy, σdi − |d|p − (2p/2 − 1 + c)(x2 + E|d|2 )p/2 − z.

By (2.2), the expression in the parentheses does not exceed |y|p + p|y|p−2 |d|2 /2;
furthermore, we have
p
(2p/2 − 1 + c)(x2 + E|d|2 )p/2 ≥ (2p/2 − 1 + c) xp + xp−2 E|d|2
2
p/2 p p (p−2)/2 p−2
≥ (2 − 1 + c)x + 2 x E|d|2
2
p
≥ (2p/2 − 1 + c)xp + |y|p−2 E|d|2 .
2
Combining these estimates, we obtain
p
EU x2 + E|d|2 , y + σd, z + |d|p ≤ |y|p − (2p/2 − 1 + c)xp − z,
which is precisely the desired bound.
2◦ The case 2x2 < |y|2 ≤ 2(x2 + E|d|2 ). We start as previously: by (2.3) and
then (2.2),
p
EU x2 + E|d|2 , y + σd, z + |d|p
p
≤ |y|p + |y|p−2 E|d|2 − (2p/2 − 1 + c)(x2 + E|d|2 )p/2 − z.
2
The latter expression decreases as E|d|2 increases; indeed, the function
p |y|2
F (s) = |y|p + |y|p−2 s − (2p/2 − 1 + c)(x2 + s)p/2 − z, s≥ − x2 ,
2 2
satisfies
p h p−2 i
F 0 (s) ≤ |y| − 2p/2−1 (x2 + s)p/2−1 ≤ 0.
2
In consequence, we have
p
EU x2 + E|d|2 , y + σd, z + |d|p
2
|y|
≤F − x2
2
2 2 p/2
p |y| y
= |y|p−2 − x2 − (c − 1) −z
2 2 2
p
= − |y|p−2 x2 − z
2
2 p/2−1
|y| p
= x2 − + 21−p/2 |y|p−2 x2 − z
2 2
2 p/2 2 p/2−1
|y| |y|
≤ −c x2 − z
2 2
≤ (|y|2 − x2 )p/2 − cxp − z,
and we are done.

3◦ The case |y|2 > 2(x2 + E|d|2 ). Here the reasoning is a bit more complicated.
First we show the pointwise estimate
p/2 p/2−1
|y + σd|2 − x2 − E|d|2 − p |y|2 − x2 − E|d|2 hy, σdi
(2.7)
1/2 √ p
(p−1)/2 √
≤ |y|2 − x2 − E|d|2 + 2σ|d| − p |y|2 − x2 − E|d|2 2σ|d|.
In fact, we will establish a slightly stronger inequality:
p/2 p/2−1
|y + σd|2 − x2 − E|d|2 − p |y|2 − x2 − E|d|2 hy, σdi
√ 1/2
p/2
≤ |y|2 − x2 − E|d|2 + σ 2 |d|2 + 2 2 |y|2 − x2 − E|d|2 σ|d|
(p−1)/2 √
− p |y|2 − x2 − E|d|2 2σ|d|.
To do this, divide throughout by ||y|2 − x2 − E|d|2 |p/2 and substitute
|y|2 − x2 − E|d|2 + σ 2 |d|2 y
A2 = , Y =
|y|2 − x2 − E|d|2 ||y|2 − x2 − E|d|2 |1/2
and
d
D= .
||y|2 − x2 − E|d|2 |1/2
The estimate becomes
p/2 √ p √
(2.8) A2 + 2hY, σDi − phY, σDi ≤ A2 + 2 2σ|D| − p 2σ|D|.
However, the reasoning presented in the proof of (2.2) gives
p/2 p/2
A2 + 2hY, σDi − phY, σDi ≤ A2 + 2σ|Y ||D| − pσ|Y ||D|.
√
It suffices to use the bounds |Y | ≤ 2 and A2 ≥ 1 to obtain (2.8), because the
function s 7→ (A2 + 2s)p/2 − ps is increasing on [0, ∞). Thus (2.7) follows. We turn
to (2.6): applying (2.3), we get
p
EU x2 + E|d|2 , y + σd, z + |d|p
p/2
≤ E |y + σd|2 − x2 − E|d|2 − c(x2 + E|d|2 )p/2 − z − E|d|p
n o
p/2 p/2−1
= E |y + σd|2 − x2 − E|d|2 − p |y|2 − x2 − E|d|2 hy, σdi
− c(x2 + E|d|2 )p/2 − z − E|d|p
n
1/2 √ p
(p−1)/2 √
o
≤E |y|2 − x2 − E|d|2 + 2σ|d| − p |y|2 − x2 − E|d|2 2σ|d|
− cxp − z − E|d|p .
Now we apply (2.2) (in the real case) to obtain
p
EU x2 + E|d|2 , y + σd, z + |d|p
p/2 p p/2−1
≤ |y|2 − x2 − E|d|2 + |y|2 − x2 − E|d|2 E|d|2 − cxp − z
2
p/2
≤ |y|2 − x2 − cxp − z = U (x, y, z).
This completes the proof.
6 ADAM OSȨKOWSKI
Proof of (1.2). It suffices to prove that for any nonnegative integer n,

n
!
X
p p p p
(2.9) E|fn | ≤ Cp E sn (f ) + |dfk | .
k=0
Of course, we may assume that df0 , df1 , . . ., dfn (and hence also fn ) belong to Lp ,
since otherwise there is nothing to prove. The key observation is that the process
n
X
U sn (f ), σfn , |dfk |p
n≥0
k=0
is a supermartingale with respect to (Fn )n≥0 . Indeed, the integrability follows from
the above assumption on df ; furthermore, for any n ≥ 0 we have
" n+1
! #
X
p
E U sn+1 (f ), σfn+1 , |dfk | Fn
k=0
" n
! #
p X
=E U s2n (f ) + E(|dfn+1 |2 |Fn ), σfn + σdfn+1 , |dfk |p + |dfn+1 |p Fn ,
k=0
Pn
which does not exceed U (sn (f ), σfn , k=0 |dfk |p ), by (2.6) applied conditionally
with respect to Fn−1 . Next, we have U (s0 (f ), σf0 , |df0 |p ) ≤ 0, in view of (2.4).
Combining these two facts with (2.5) yields the claim:
" n
!# n
!
p/2
X 2 X
E |fn |p − Cpp spn (f ) + |dfk |p ≤ p EU sn (f ), σfn , |dfk |p ≤ 0.
σ
k=0 k=0
Remark 2.5. Using Davis’ decomposition (see e.g. Davis [3] or Burkholder [1]), one
can deduce a slightly stronger form of (1.2). Namely, for all f as in the statement
of Theorem 1.1 and p ≥ 4 we have
1/p
(2.10) ||f ||p ≤ 2Cp ||s(f )||pp + ||df ∗ ||pp ,
where df ∗ = supn≥0 |dfn |. Indeed, fix a martingale f and consider the random
variables d0n = dfn 1{|dfn |<2dfn−1
∗
00
} , dn = dfn 1{|dfn |≥2dfn−1
∗ } , n = 0, 1, 2, . . .. Here, as
∗ ∗ ∗
usual, df−1 ≡ 0 and dfn = max0≤k≤n |dfk |. Note that on the set {|dfn | ≥ 2dfn−1 }
we have
(2p − 1)|d00n |p + (2dfn−1 ∗
)p ≤ (2|dfn |)p ≤ (2dfn∗ )p ,
which implies
n
X 2p
|d00k |p ≤ p (df ∗ )p , n = 0, 1, 2, . . . .
2 −1 n
k=0
Next, observe that for any n,
X n n
X
E |d0k |p ≤ E ∗
|dfk |2 (2dfk−1 )p−2
k=0 k=0
Xn
∗
=E E(|dfk |2 |Fk−1 )(2dfk−1 )p−2
k=0
≤ Es2n (f )(2dfn∗ )p−2
2 p−2
≤ ||sn (f )||pp + ||2dfn∗ ||pp ,
p p
where in the last line we have exploited Young’s inequality. Combining the above
estimates for the sums of d0n and d00n we get
n
X 2 1 p−2
E |dfk |p ≤ ||sn (f )||pp + 2p + ||dfn∗ ||pp
p 2p − 1 p
k=0
2
≤ ||sn (f )||pp + 2p ||dfn∗ ||pp .
p
Plugging this into (2.9) and using the fact that n is an arbitrary nonnegative integer,
we obtain (2.10).
References
[1] D. L. Burkholder, Distribution functions inequalities for martingales, Ann. Probab. 1 (1973),
19–42.
[2] D. L. Burkholder, Explorations in martingale theory and its applications, Ecole d’Ete de
Probabilités de Saint-Flour XIX—1989, pp. 1–66, Lecture Notes in Math., 1464, Springer,
Berlin, 1991.
[3] B. Davis, On the integrability of the martingale square function, Israel J. Math. 8, 187–190.
[4] W. B. Johnson, G. Schechtman and J. Zinn, Best constants in moment inequalities for linear
combination of independent and exchangeable random variables, Ann. Probab. 13 No. 1
(1985), 234–243.
[5] P. Hitczenko, Best constants in martingale version of Rosenthal inequality, Ann. Probab. 18
No. 4 (1990), 1656–1668.
[6] P. Hitczenko, On a domination of sums of random variables by sums of conditionally inde-
pendent ones, Ann. Probab. 22 No. 1 (1994), 453–468.
[7] S. Kwapień and J. Szulga, Hypercontraction methods in moment inequalities for series of
independent random variables in normed spaces, Ann. Probab. 19 No. 1 (1991), 369–379.
[8] S. V. Nagaev, On probability and moment inequalities for supermartingales and martingales,
Acta Appl. Math. 79 (2003), 35-46.
[9] A. Osȩkowski, Sharp martingale and semimartingale inequalities, to appear in Monografie
Matematyczne, Birkhäuser.
[10] I. Pinelis, Optimum bounds for the distributions of martingales in Banach spaces, Ann.
Probab. 22, No. 4 (1994), 1679–1706.
[11] H. P. Rosenthal, On the subspaces of Lp (p > 2) spanned by the sequences of independent
random variables, Israel J. Math. 8 (1970), 273–303.
[12] M. Talagrand, Isoperimetry and integrability of the sum of independent Banach-space-valued
random variables, Ann. Probab. 17 No. 4 (1989), 1546–1570.
Department of Mathematics, Informatics and Mechanics, University of Warsaw, Ba-

nacha 2, 02-097 Warsaw, Poland
E-mail address: ados@mimuw.edu.pl

Rosenthal

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Rosenthal

Uploaded by

Copyright:

Available Formats

A NOTE ON BURKHOLDER-ROSENTHAL INEQUALITY

Abstract. Let df be a Hilbert-space-valued martingale difference sequence.

(here and below, F−1 = F0 ) and use the notation

for the truncated conditional square function of f .

where p ≥ 2 and cp is a constant depending only on p. The special case in which

2010 Mathematics Subject Classification. Primary: 60G42.

2. Proof of Theorem 1.1

Proof. We consider three cases separately.

and we are done.

Proof of (1.2). It suffices to prove that for any nonnegative integer n,

Department of Mathematics, Informatics and Mechanics, University of Warsaw, Ba-

You might also like