Professional Documents
Culture Documents
GIANMARCO BROCCHI
Abstract. We show that any Littlewood–Paley square function S satisfying a minimal local
testing condition is dominated by a sparse form,
X
h(Sf )2 , gi ≤ C h|f |i2I h|g|iI |I|.
I∈S
This implies strong weighted Lp estimates for all Ap weights with sharp dependence on the Ap
characteristic. The proof uses random dyadic grids, decomposition in the Haar basis, and a
arXiv:2004.05365v1 [math.CA] 11 Apr 2020
1. Introduction
´
1.1. Setting. Let {θt }t>0 be a family of integral operators θt f (x) = Rd kt (x, y)f (y) dy for which
there exists C > 0 and α ∈ (0, 1] such that for all x, y, x′ , y ′ ∈ Rd and t > 0 the kernels kt satisfy
the following size and regularity conditions:
tα
|kt (x, y)| ≤ C (C1)
(t + |x − y|)α+d
|x − x′ |α + |y − y ′ |α
|kt (x, y) − kt (x′ , y ′ )| ≤ C if |x − x′ | + |y − y ′ | < t. (C2)
(t + |x − y|)α+d
Let S be the square function
ˆ ∞ 1/2
2 dt
Sf (x) := |θt f (x)| .
0 t
By the T (1) theorem of Christ and Journé [CJ87] it is known that S is bounded on L2 (Rd ) if θt
applied to the constant function 1 gives rise to a Carleson measure ν := |θt 1(x)|2 dt/t dx on the
upper half space Rd+1
+ . A Carleson measure on R+
d+1
is a measure which acts like a d-dimensional
measure in the following sense. Let Q be a cube in Rd with sides parallel to the coordinate axes.
Denote by ℓQ and |Q| the side length and the Lebesgue measure of Q, so that (ℓQ)d = |Q|.
Consider the Carleson box BQ := Q × (0, ℓQ). Then ν is a Carleson measure if ν(BQ )/|Q| is
finite for any cube Q.
Let 1Q be the indicator function on Q. It has been shown [Hof08; Hof10; LM17a] that S is
bounded in L2 (Rd ) if there exists a constant CT > 0 such that for any cube Q the following local
testing condition holds
ˆ ˆ ℓQ
dt
|θt 1Q (x)|2 dx ≤ CT |Q|. (T)
Q 0 t
A standard example for which these conditions hold is θt f = f ∗ ψt , where ψt (x) = t−d ψ(t−1 x)
and ψ is the mean zero Schwartz function which gives rise to the Littlewood–Paley square func-
tion [Gra14, §6.1]. In particular, conditions (C1) and (C2)
´ are off-diagonal conditions compatible
with the scaling while (T) is the cancellation condition ψ = 0.
The aim of this paper is to show that this Carleson condition (T) is enough to obtain something
better: a sparse domination.
1.2. Main result. A collection of dyadic cubes S is τ -sparse if for any Q ∈ S there exists a
subset EQ ⊂ Q with the property that {EQ }Q∈S are pairwise disjoint and the ratio |EQ |/|Q| ≥ τ
for a fixed τ ∈ (0, 1).
Our main result is an optimal sparse domination of S under the minimal condition (T).
Theorem 1.1. If S satisfies the testing condition (T) then for any pair of compactly supported
functions f, g ∈ L∞ (Rd ) there exists a sparse collection S such that
ˆ
X 1 ˆ 2
1
ˆ
2
(Sf ) g dx ≤ C |f | |g| |Q|
Rd |Q| Q |Q| Q
Q∈S
1.3. Structure of the paper. In §2 we introduce shifted random dyadic grids and the associ-
ated Haar basis. Furthermore we use the classical reduction to good cubes. In §3 we decompose
the operator into off-diagonal and diagonal parts. These are split further each one into two
terms
h(Sf )2 , gi . (I) + (II) + (IIIa ) + (IIIb ) .
| {z } | {z }
off-diagonal diagonal
The off-diagonal part is bounded by a dyadic form using standard techniques in §4 and off-
diagonal estimates in §5. The dyadic form is dominated by a sparse form in §8.
Terms (IIIa ) and (IIIb ) come from a Calderón–Zygmund decomposition g = a + b, where a is
the average part and b is the bad part of g.
In §6 we introduce the stopping cubes used to control the diagonal part. We reduce (IIIa ) to
a telescopic sum on stopping cubes plus off-diagonal terms. We remark that the stopping family
depends only on the functions f and g. Furthermore, the testing condition (T) is used only in
this section and only once.
In §7 we deal with (IIIb ). We exploit the zero average property of b together with the regularity
of the kernel (C2) to restore a setting in which off-diagonal estimates can be applied as in the
previous sections, see §7.1.
In §9 we collect some of the proofs postponed to ease the reading. In Appendix A we recall
some known results about conditional expectations and Haar projections which are used in §7.
Notation. For two positive quantities X and Y the notation X . Y means that there exists a
constant C > 0 such that X ≤ CY . The dependence of C on other parameters will be indicated
by subscripts X .d,r,α Y when appropriate.
Given a cube Q in Rd , the quantities ∂Q, ℓQ and |Q| denote, respectively, boundary, size
length, and the Lebesgue measure of Q. We also denote by 3Q the (non-dyadic) cube with the
same centre of Q and side length 3ℓQ.
The average of a function f over a cube Q will be denoted by
1
ˆ
hf iQ := f := f (y) dy.
Q |Q| Q
We consider Rd with the ℓ∞ metric |x| = maxi |xi |. The distance between two cubes P and R
will be denoted by d(P, R), while
D(P, R) := ℓP + d(P, R) + ℓR
is the “long distance”, as defined in [NTV03, Definition 6.3].
2. Preliminaries
2.1. Dyadic cubes. The standard dyadic grid D on Rd is a collection of nested cubes organised
in generations
Dj := {2−j ([0, 1)d + m), m ∈ Zd }.
Each generation Dj is a partition of the whole space and D = ∪j∈Z Dj . Any cube Q ∈ Dj has
2d children in Dj+1 and one parent in Dj−1 . For k ∈ N we denote by Q(k) the k-ancestor of Q,
that is the unique cube R in the same grid D such that R ⊃ Q and ℓR = 2k ℓQ. We also denote
by chk (Q) the set of the k-grandchildren of Q, so that if P ∈ chk (Q) then P (k) = Q.
2.2. Haar functions. Given a dyadic system D on Rd , Haar functions are an orthonormal basis
of L2 (Rd ) given by linear combinations of indicator functions supported on cubes in D.
On R, for a given interval I ∈ D let I − and I + be the left and the right dyadic child of
I. Consider the functions h0I := |I|−1/2 1I and h1I := (1I − − 1I + )|I|−1/2 . Then {h1I }I∈ D is an
orthonormal complete system of L2 (R). In higher dimensions, as a cube I is the product of
intervals I1 × · · · × Id , a non-constant Haar function hǫI is the product hǫI11 × · · · × hǫIdd , where
ǫ = (ǫi )i ∈ {0, 1}d \ {0}d .
4 GIANMARCO BROCCHI
In this paper the sum over ǫ is not important, so both the superscript and the sum will be
omitted and hI will denote a non-constant Haar function. Two bounds that will be used are
ˆ
k∆I f kL1 ≤ |hf, hI i||I|1/2 ≤ |f |, k∆I f kL∞ ≤ |hf, hI i||I|−1/2 ≤ |f |. (2.1)
I I
2.3. Good and bad cubes. A cube is called good if it is distant from the boundary of any
much larger cube. More precisely, we have the following
Definition 2.1 (Good cubes). Given two parameters r ∈ N and γ ∈ (0, 21 ), a cube R ∈ D is
r-good if d(R, ∂P ) > (ℓR)γ (ℓP )1−γ for any P ∈ D with ℓP ≥ 2r ℓR.
2.4. Shifted dyadic cubes. Given a sequence ω = {ωi }i∈Z ∈ ({0, 1}d )Z and a cube R ∈ Dj of
length 2−j , the translation of R by ω is defined by
X
R+̇ω := R + xj where xj := ωi 2−i .
i>j
For a fixed ω, let Dω be the collection of dyadic cubes in D translated by ω. The standard
dyadic grid corresponds to D0 where ωi = 0 for all i ∈ Z. Shifted dyadic grids enjoy the same
nested properties of the standard grid D0 , together with other properties that will be useful
later, see Remark 4.5. For more on dyadic grids, we refer the reader to the beautiful survey
[Per19, §3].
2.5. Random shifts. Let P be the unique probability measure on Ω := ({0, 1}d )Z such that the
coordinate projections are independent and uniformly distributed. Fix R ∈ D0 with ℓR = 2−j
and consider J ∈ D0 with ℓJ > ℓR. The translated cube J +̇ω is
X X
J +̇ω = J + ωi 2−i + ωi 2−i ,
2−i <ℓR ℓR≤2−i <ℓJ
X
R+̇ω = R + ωi 2−i .
2−i <ℓR
The position of R+̇ω depends on the i such that 2−i < ℓR while the goodness of R+̇ω, since R
and J are translated by the same ω, depends on the i such that 2−i ≥ ℓR. Then position and
goodness of a cube are independent random variables, see [Hyt12].
Let 1good be the function on Dω which takes value 0 on bad cubes and 1 on good cubes. The
probability of a cube R to be good is πgood = P(R+̇ω is good) = Eω [1good (R+̇ω)], where Eω is
the expectation with respect to P. The probability πgood > 0 provided to choose r large enough,
see [Hyt17, Lemma 2.3]. The indicator function 1R+̇ω ( · ) depends only on the position of R+̇ω,
so by the independence of goodness and position, for any cube R ∈ D0 we have
Remark 2.3. When r = 0, this is the usual Calderón–Zygmund decomposition of f , see [Gra14,
Theorem 5.3.1].
Proof. Given λ > 0, let L be the collection of maximal dyadic cubes L covering the set
n o [
E := x ∈ Rd : sup h|f |iQ 1Q (x) > λ = L
Q∈ D L∈L
The cubes in chr (L) are a partition of L. Since the cubes L in L are disjoint, we have
We shall sometimes omit the superscript ω in the following. Bound the operator:
X¨ X ¨ X 2 X 2 dt
2 dt
|θt f (x)| g dx ≤ 2 θ t ∆P f + θt ∆P f |g| dx. (3.1)
WR t WR t
R∈ D R∈ D P ∈ D\PR P ∈PR
6 GIANMARCO BROCCHI
Consider the second term in (3.1). Let PR be the dyadic child of P containing R. Then
∆P f 1P = ∆P f 1P \PR + h∆P f iPR 1PR and we split the operator accordingly as before to obtain:
X¨ dt
|θt f (x)|2 g dx .
t
R∈ D WR
X 2 dt
X
¨
θt ∆P f |g| dx (I)
W R
t
R∈ D P ∈ D\PR
X ¨ X 2 dt
+ θt ∆P f 1P \PR |g| dx (II)
W R
t
R∈ D P ∈PR
X ¨ X 2 dt
+ θt h∆P f iPR 1PR |g| dx. (III)
WR t
R∈ D P ∈PR
We remark that the function g barely plays any role in this section.
A SPARSE QUADRATIC T 1 THEOREM 7
4.1. Different cases for P . Given R ∈ D, the cubes P are grouped according to their length
and position with respect to R.
Table 4.1. Different cases for P given R according to their lengths (first row) and position.
ℓP ≥ 2r+1 ℓR ℓR ≤ ℓP ≤ 2r ℓR ℓP < ℓR
P ⊃R P 6⊃ R
Psubscale
3P \ P ⊃ R 3P 6⊃ R 3P 6⊃ R 3P ⊃ R P ⊂ 3R P 6⊂ 3R
Pnear Pfar Pclose inside far
PR D \ PR
for i ∈ {near, far, close, subscale}. When P and R are disjoint, it’s useful to rearrange the sums
using a common ancestor of P and R.
Lemma 4.4 (Common ancestor). Let R, P ∈ D be disjoint cubes with R good.
If d(R, P ) > max(ℓR, ℓP )1−γ min(ℓR, ℓP )γ then there exists K ⊇ P ∪ R such that
min(ℓP, ℓR) γ
ℓK ≤ 2r d(R, P ).
ℓK
A proof in the case ℓP ≥ ℓR can be found in [Hyt17, Lemma 3.7]. When ℓP < ℓR, the same
ideas carry over, see §9 for a proof of this case.
Remark 4.5. For any P, R ∈ Dω there exists (almost surely) a common ancestor K ∈ Dω .
Indeed, dyadic grids (like the standard grid D0 ) without this property have zero measure in the
probability space (Ω, P), see [Per19, §3.1.1 and Example 3.2].
4.3. P far from R. In this case d(P, R) > ℓP and ℓP = max(ℓP, ℓR), so the hypotheses of
Lemma 4.4 are satisfied. Let K be the common ancestor of P and R given by Lemma 4.4. Since
ℓP ≥ 2r+1 ℓR, let ℓP = 2−j ℓK and ℓR = 2−i−j ℓK for some i, j ∈ Z+ , with i ≥ r + 1. We have
√ √
Xˆ X ( ℓRℓP )α XX X ˆ X ( ℓRℓP )α
g hf, hP i2
α+d
= g hf, hP i2
.
R d(R, P ) R d(R, P )α+d
R∈ D P ∈Pfar K∈ D i,j R : R⊂K P : P ⊂K
ℓR=2−i−j ℓK ℓP =2−j ℓK
d(P,R)>ℓP
By using the lower bound d(P, R) &r (ℓK)1−γ (ℓR)γ with γ = α/(4α + 4d), we estimate
√ √
ℓP ℓR 2−j−i/2 ℓK ( ℓP ℓR)α 2−(j+i/2)α 2−(3j+i)α/4
.r so that . r,α,d = . (4.3)
d(P, R) ℓK2−(i+j)γ d(P, R)α+d 2−(i+j)γ(α+d) |K| |K|
For any fixed integer m, the set {R ⊂ K : ℓR = 2−m ℓK} is a partition of K, so we bound
√
XX X X α
2 ( ℓRℓP )
ˆ
g hf, hP i
R d(R, P )α+d
K∈ D i,j R : R⊂K P : P ⊂K
ℓR=2−i−j ℓK ℓP =2−j ℓK
d(P,R)>ℓP
X X X X
. 2−3jα/4 2−iα/4 |g| hf, hP i2 .
j∈N i≥r+1 K∈ D K P : P ⊂K
ℓP =2−j ℓK
4.4. P near R. Recall that P ∈ Pnear if 3P \ P ⊃ R and ℓP ≥ 2r+1 ℓR. By the goodness of R,
we have that d(P, R) > (ℓP )1−γ (ℓR)γ . So the hypotheses of Lemma 4.4 are satisfied and there
exists K ⊇ P ∪ R such that d(P, R) &r (ℓK)1−γ (ℓR)γ . Arguing as in the far term leads to
√ !
Xˆ X ( ℓRℓP )α X
g hf, hP i2 . 2−3jα/4 BjD(g, f ).
d(R, P )α+d
R∈ D R P ∈Pnear j∈N
4.5. P comparable and close to R. In this case ℓR ≤ ℓP ≤ ℓR(r) and 3P ⊃ R. Using the
trivial bound D(P, R) ≥ ℓR we have
√
Xˆ X α
( ℓRℓP ) Xˆ X 1
|g| hf, hP i2 . r,α |g| hf, hP i2 .
D(R, P )α+d |R|
R∈ D R P ∈Pclose R R∈ D P : 3P ⊃R
ℓR≤ℓP ≤2r ℓR
Rearrange the sum in groups of P such that ℓP = 2k ℓR for k ∈ {0, . . . , r}. Then
Xˆ r
X X r X
X X
1 2kd
ˆ
2
|g| hf, hP i = hf, hP i2 |g|
|R| |P |
R∈ D R k=0 P : 3P ⊃R k=0 P ∈ D R⊂3P R
ℓP =2k ℓR ℓR=2−k ℓP
r X
X kd
22
ˆ
≤ hf, hP i |g|
|P | 3P
k=0 P ∈ D
X 3d
ˆ X
2
.r,d hf, hP i |g| = 3d hf, hP i2 h|g|i3P .
|3P | 3P
P ∈D P ∈D
We define
X
B0D(g, f ) := hf, hP i2 h|g|i3P . (4.4)
P ∈D
4.6. Subscale. When ℓP < ℓR we distinguish two subcases, as shown in Table 4.1.
4.6.1. Inside : P ⊂ 3R. The leading term in the long-distance D(R, P ) is ℓR, so we bound
√ ! α/2
Xˆ X ( ℓRℓP )α X X ℓP
|g| hf, hP i2 ≤ |g| hf, hP i 2
D(R, P )α+d ℓR
R∈ D R P : ℓP <ℓR R∈ D R P : ℓP <ℓR
P ⊂3R P ⊂3R
X X X
= 2−jα/2 h|g|iR hf, hP i2
j∈N R∈ D P : P ⊂3R
ℓP =2−j ℓR
X
.d 2−jα/2 BjD(g, f ).
j∈N
4.6.2. Far : P 6⊂ 3R. In this case d(P, R) > ℓR > ℓP , so the hypotheses of Lemma 4.4 are
satisfied. After Cauchy–Schwarz, rearrange the sum using the common ancestor K, then let
10 GIANMARCO BROCCHI
The dyadic form B0D(g, f ) defined in (4.4) is controlled by a sparse form in §8.
Remark 5.1. The goodness of R gives the lower bound on the distance d(R, ∂P ) > (ℓP )1−γ (ℓR)γ .
As will be clear from the proof, inequality (5.1) holds if one replaces the indicator 1P \PR with
1K\PR where K is Rd or any other larger cube containing P .
To prove (5.1), we use a classical estimate for the Poisson kernel.
Lemma 5.2 (Poisson off-diagonal estimates). Let β ∈ (0, 1], r ∈ N and γ as in the introduction
and let Q, P ∈ D such that Q(r) ⊂ P and Q is r-good. Then
η
(ℓQ)β ℓQ
ˆ
β+d
dy .
Rd \P d(y, Q) ℓP
where η = β − γ(β + d).
Proof. Decompose Rd \ P in annuli Ak = 3k+1 P \ 3k P for k ∈ N. Then on each annulus
d(y, Q) > d(∂(3k P ), Q). Since ℓP > 2r ℓQ, use the goodness of Q to obtain the bound.
Proof of (5.1). When (x, t) ∈ WR the size condition (C1) and Lemma 5.2 give
(ℓR)α |hf, hP i| ℓR η
ˆ
θt (∆P f 1P \PR )(x) . k∆P f kL∞ α+d
dy .
P \PR (ℓR + d(y, R)) |P |1/2 ℓPR
P
where η = α − γ(α + d) > 0. The sum P ⊃R(r) (ℓR/ℓPR )η is a geometric series. An application
of Cauchy–Schwarz gives
X¨
2 X X hf, hP i2 ℓR η ˆ
X |hf, hP i| ℓR η dt
|g| dx ≤ |g(x)| dx
W R |P |1/2 ℓPR t |P | ℓPR R
R∈ D P ⊃R(r) R∈ D P ⊃R (r)
X X hf, hP i2 X ˆ
−iη
. 2 |g|
|P | R
i≥r+1 P ∈D R⊂P
ℓR=2−i ℓP
X X hf, hP i2 ˆ
= 2−iη |g|.
|P | P
i≥r+1 P ∈D
Fix Q0 in D containing the support of f and g. The stopping family S is defined iteratively:
[ [
S0 := Q0 , Sn+1 := A⋆ (Q), S := Sn .
Q∈Sn n∈N
S
Remark 6.2. The family S is 12 -sparse, since for any S ∈ S the set ES := S \ S ′ ∈A⋆ (S) S
′ has
measure |ES | > 21 |S| and {ES }S∈S are disjoint.
In the same way, taking A⋆ (Q) to be the maximal dyadic components of Ag (Q) produces a
sparse family that we denote with Sg . It will be used later when only the stopping cubes related
to g are needed.
For a given Q ∈ D, denote by Q b the minimal stopping cube S ∈ S such that S ⊇ Q.
For S ∈ S let Tree(S) be the family of dyadic cubes contained in S, but not in any S ′ ∈ A⋆ (S)
b = S}.
Tree(S) := {R ∈ D : R
d
Also, we define Treer (S) := {R ∈ D : R (r) = S}. Note that the maximal cubes in Tree (S) are
r
the r-grandchildren of S. See Figure 1 in the appendix.
The term h∆P f iPR 1Pc \P is supported away from R, so one can use off-diagonal estimates
R R
as in (5.1). Also notice that in the bound (5.1) and in its proof one can replace |g| by h|g|iR . In
the same way, off-diagonal estimates are used for 1Pb\Pc hf iP as shown in Lemma 6.4 below.
R
12 GIANMARCO BROCCHI
The terms h∆P f iPR 1Pc and 1Pc hf iPR − 1Pb hf iP left from (6.2) and (6.3) are rearranged to
R R
obtain a telescopic series. We have
1PcR h∆P f iPR = 1PcR hf iPR − 1PcR hf iP when PR 6∈ S
and 1Pc hf iPR − 1Pb hf iP when PR ∈ S .
R
If PR 6∈ S then P and PR are contained in the same minimal stopping cube Pb. So Pc b
R = P and
the two cases add up to 2(1Pc hf iPR − 1Pb hf iP ) which leads to the telescopic sum
R
X
1Pc hf iPR − 1Pb hf iP = 1 d
R (r)
hf iR(r) − 1Q
c0 hf iQ0 .
R
(6.4)
P ∈D
R(r) ⊂P ⊆Q 0
(n)
Since f is supported on a fixed Q0 , the average on larger cubes Q0 containing Q0 decreases:
1 1
ˆ
hf iQ(n) = (n) f ≤ (n) kf kL1 → 0 as n → ∞.
0
|Q0 | Q0 |Q0 |
Thus when the sum in (6.4) extends to all P ⊃ R(r) , the term 1 d hf iR(r) is the only one
R(r)
remaining.
We have then identified three terms
X X X X
h∆P f iPR 1PR = − +
P ∈D telescopic far sparse
P ⊃R(r)
where
X X X X
:= h∆P f iPR 1Pc \P , := 1Pb\PR hf iP
R R
far P : P ⊃R(r) sparse P : P ⊃R(r)
PR ∈S
X X
and := 2(1Pc hf iPR − 1Pb hf iP ) = 1 d
(r)
hf iR(r) .
R R
telescopic P ⊃R(r)
P
Since the case with far is done in (5.1), we show how to deal with the remaining two cases.
6.3. Bound by a sparse form. We bound the operator applied to 1 d
(r)
hf iR(r) and 1Pb\PR hf iP .
R
Lemma 6.3. Let S be the sparse collection defined in §6.1, then
X¨ dt X
h|g|iR |θt 1 d
(r)
(x)|2 hf i2R(r) dx . h|g|iS h|f |i2S |S|.
WR
R t
R∈ D S∈S
X X ¨ dt
.r,d h|g|iS h|f |iS2
|θt 1S (x)|2 dx
WR t
S∈S R : R⊂S
X ˆ ˆ ℓS
dt
= h|g|iS h|f |i2S |θt 1S (x)|2 dx
S 0 t
S∈S
X
≤ CT h|g|iS h|f |i2S |S|
S∈S
where we used the stopping conditions for f and g, and the testing condition (T).
Lemma 6.4. Let S be the sparse collection defined in §6.1, then
2
X¨ X dt X
θt (1Pb\PR )hf iP |g| dx . h|f |i2S h|g|iS |S| (6.5)
WR t
R∈ D (r) P :P ⊃R
′ S∈S
PR ∈S
A SPARSE QUADRATIC T 1 THEOREM 13
After applying Cauchy–Schwarz the sums are rearranged using P as the common ancestor:
Xˆ X
ℓR η X −kη X X ˆ
2 2
|g| hf iP = 2 hf iP |g|
R ℓPR R
R∈ D (r)
P : P ⊃R k≥r P ∈D R : R⊂P
with PR ∈S ℓR=2−k−1 ℓP
Pc R =PR
X X ˆ
−kη
= 2 hf i2P |g|
k≥r P :PR ∈S P
X ˆ
≤ h|f |i2P |g|.
P :PR ∈S P
In order to exploit the goodness of Q, for example via Poisson off-diagonal estimates as in
Lemma 5.2, we need a gap of at least r generations between Q and PR . This motivates the
Calderón–Zygmund decomposition in Proposition 2.2. In particular, since b is the bad part of g
at height λ = Ah|g|iR given by Proposition 2.2, we have that
X X X X
∆Q b = ∆Q bLr .
Q∈ D L∈L Lr ∈chr (L) Q∈ D
Q⊂R Q⊆Lr
Since A > 1, the cubes in L are strictly contained in R. If we choose the constant A as in the
construction of the stopping family in §6.1, then the cubes in L are also stopping cubes in Sg .
We can regroup the dyadic cubes Q ⊆ Lr in the stopping trees Treer (S) for all S ∈ Sg inside R.
X X X X X X
∆ Q bL r = ∆ Q bS r .
L∈L Lr ∈chr (L) Q∈ D S∈Sg Sr ∈chr (S) Q∈Treer (S)
Q⊆Lr S⊂R Q⊆Sr
The last sum is the Haar projection of b on Span{hQ : Q ∈ Treer (S), Q ⊆ Sr }. We denote this
quantity by X
PSr (b) := ∆ Q bS r .
Q∈Treer (S)
Q⊆Sr
Remark 7.1. The Haar projection PSr b is supported on Sr and equals PSr (|g|). Indeed bSr =
1Sr (|g| − h|g|iSr ) and for Q ⊆ Sr the Haar coefficient hbSr , hQ i = h|g|, hQ i.
We have then proved the following identity
X X X ¨ X 2 dt
(IIIb ) = θt h∆P f iPR 1PR PSr (|g|) dx.
WR t
S∈Sg Sr ∈chr (S) R :R⊃S P ∈PR
Sr good
With a slight abuse of notation, we omit the subscript in the stopping family Sg in the following.
14 GIANMARCO BROCCHI
Remark 7.2 (Estimates for the Haar projection). The Haar projection PSr (|g|) has zero average
and
kPSr gkL1 . |Sr |h|g|iS . (7.1)
A proof of (7.1) is in Appendix A.4. In particular, summing over all Sr ∈ chr (S) gives
X X ˆ
kPSr gkL1 . |Sr |h|g|iS ≤ |g|. (7.2)
Sr ∈chr (S) Sr ∈chr (S) S
7.1. Recover decay and telescopic sum. Let PS be the dyadic child of P containing S.
Then
X X X
h∆P f iPR 1PR = h∆P f iPS 1PS − h∆P f iPS 1PS .
P :P ⊃R(r) P :P ⊃S P :S⊂P ⊆R(r)
The second term can be handled as in the subscale case (§4.6), while the first can be reduced
to a telescopic sum which equals hf iS 1S .
If one tries to reduce h∆P f iPS 1PS to a telescopic term plus off-diagonal terms as in §6.2, the
off-diagonal factor which should provide decay is the quantity
(ℓR)α
ˆ
α+d
dy.
Rd \PS d(y, Sr )
Here the scale (numerator) and the distance (denominator) don’t match and Lemma 5.2 seems
unable to provide enough decay in order to handle the integral and the sum over R. But the
zero average property of PSr (g) comes to the rescue bringing a factor (ℓSr )α/2 at the numerator
by exploiting the smoothness condition of the kernel. We will explain how.
Let xSr be the centre of the Sr and consider the sublinear operator
Since the Haar projection PSr (g) is supported on Sr , we have the following bound.
Lemma 7.3. Let Sr and R be dyadic cubes with Sr ⊂ R, then
¨ 2 dt
2 dt
¨
|θt f (x)| PSr (g)(x) dx . KtSr f (x) |PSr (g)(x)| dx. (7.3)
WR t WR t
Proof. The idea is to use the zero average of PSr (g) to exploit the smoothness condition (C2).
We recall that PSr (g) is supported on Sr ⊂ R. Consider the operator
ˆ ℓR ˆ 2
dt
Kf (x) :=
kt (x, y)f (y) dy t
ℓR/2
´
so that the left hand side of (7.3) equals Kf (x)PSr g(x) dx. Let xSr be the centre of Sr . Then
ˆ ˆ
Kf (x)PSr g(x) dx = Kf (x) − Kf (xSr ) PSr g(x) dx
For x ∈ Sr , since Sr ⊂ R and t ∈ (ℓR/2, ℓR), the distance |x − xSr | ≤ ℓSr /2 < ℓR/2 < t, so by
conditions (C2) and (C1) we have
|x − xSr |α tα
ˆ ˆ
− +
KSr f (x) · KSr f (x) . |f (y)| dy · |f (y)| dy
(t + |x − y|)α+d (t + |x − y|)α+d
!2
(t|x − xSr |)α/2 2
ˆ
= α+d
|f (y)| dy =: KtSr f (x) .
(t + |x − y|)
The operator KtSr satisfies Poisson-like off-diagonal estimates.
Lemma 7.4 (Estimates for KtSr ). Let x ∈ Sr ⊂ R and t ∈ (ℓR/2, ℓR). Let Q ∈ D such that
Q ⊃ Sr . Then there exists η > 0 such that the following estimates hold:
η
ℓSr |Q| ℓSr α/2
KtSr 1Rd \Q (x) . , K t
Sr
1 Q (x) . .
max(ℓR(r) , ℓQ) |R| ℓR
Remark 7.5. Notice that the first estimate is better than the one in Lemma 5.2 on smaller scale
(when ℓQ < ℓR(r) ). For the second one, since ℓSr < ℓR, we can also estimate
|Q|
KtSr 1Q (x) .
|R|
provided that x ∈ Sr and t ∈ (ℓR/2, ℓR).
Proof of Lemma 7.4. For the second estimate, by forgetting the distance in the denominator,
we simply have
(ℓSr ℓR)α/2 |Q| ℓSr α/2
ˆ
Kt (1Q )(x) .
Sr
α+d
dy ≤ .
Q (ℓR + d(y, Sr )) |R| ℓR
For the first estimate, when Q ⊃ R(r) use (a + b)α = (a + b)2α/2 ≥ (2ab)α/2 in order to apply
off-diagonal estimates. For x ∈ Sr and t ∈ (ℓR/2, ℓR) we bound
(ℓSr ℓR)α/2
ˆ
KtSr 1Rd \Q (x) . α+d
dy
Rd \Q (ℓR + d(y, Sr ))
We split the sum in P to obtain a telescopic sum as in §6.2, with an extra subscale term:
X X X X X
h∆P f iPR 1PR = − + −
P ⊃R(r) telescopic far sparse subscale
where
X X X X
:= 2 (hf iPS 1Pc − hf iP 1Pb ) = 2hf iS 1S := h∆P f iPS 1PS
S
telescopic P : P ⊃S subscale P : P ⊆R(r)
P ⊃S
X X X X
:= h∆P f iPS 1Pc \P := hf iP 1Pb\PS .
S S
far P : P ⊃S sparse P : P ⊃S
PS ∈S
Then we bound
X X X X X X X X
− + − ≤ + + + .
telescopic far sparse subscale telescopic far sparse subscale
We estimate KtSr applied to each term by using sublinearity and Lemma 7.4. Then take the
´ ℓR
supremum in t on the Whitney region WR to bound the remaining integral ℓR/2 dt/t by 1.
We give the details in each case.
P
7.3. Telescopic term. This case is bounded by the sparse form ΛS (f, g) = S∈S h|f |i2S S |g|,
´
where S is the stopping family of g.
Lemma 7.6. It holds that
X X X ¨ 2 dt
hf i2S KtSr 1S |PSr g| dx . ΛS (f, g)
WR t
S∈S Sr ∈chr (S) R : R⊃S
Proof. For x ∈ Sr and t ∈ (ℓR/2, ℓR) we estimate KtSr (1S )(x) . |S|/|R| and
´ ℓR
ℓR/2 dt/t ≤ 1.
Then by using (7.2) for the Haar projection we have
X X X ¨ 2 dt
2
hf iS KtSr 1S (x) |PSr g(x)| dx
W R
t
S∈S Sr ∈chr (S) R:R⊃S
X X X |S| 2
2
. hf iS kPSr gkL1
|R|
S∈S Sr ∈chr (S) R:R⊃S
X X |S| 2 ˆ X ˆ
2 2
.r,d hf iS |g| ≤ h|f |iS |g|.
|R| S S
S∈S R:R⊃S S∈S
7.4. Subscale term. This term is bounded in a similar way as in the subscale case in §4.6.1.
Lemma 7.7. It holds that
X X X ¨ X 2 dt X
KtSr |PSr g| dx . 2−jα/4 BjD(g, f ).
WR t
S∈S Sr ∈chr (S) R : R⊃S subscale j∈N
A SPARSE QUADRATIC T 1 THEOREM 17
Then for x ∈ Sr and t ∈ (ℓR/2, ℓR) we estimate KtSr 1PS using Lemma 7.4
ℓPS d ℓSr α/2
Kt 1PS (x) .
Sr
.
ℓR ℓR
Bound ℓPS < ℓP and |h∆P f iPS | ≤ |hf, hP i||P |−1/2 , then we apply Cauchy–Schwarz
!2
X X X ˆ X |P |1/2 ℓSr α
|hf, hP i| |PSr g| dx
R |R| ℓR
S∈S Sr ∈chr (S) R : R⊃S (r) P : P ⊆R
P ⊃S
! ! α
X X X X hf, hP i2 X |P | ℓSr
≤ kPSr gkL1
|R| |R| ℓR
S∈S Sr ∈chr (S) R⊃S P : P ⊆R(r) (r)
P ⊆R
P ⊃S
!
X X X ℓS α/2 X hf, hP i2 ℓP α/4 X |P | ℓP α/4
≤ kPSr gkL1 .
ℓR |R| ℓR |R| ℓR
S∈S Sr ∈chr (S) R⊃S P :P ⊃S (r) P ⊆R
P ⊆R(r)
The second factor after Cauchy–Schwarz is controlled as in subscale case in Lemma 4.3 where
P ⊂ 3R. Then bound kPSr gkL1 as in (7.2) to obtain
Xˆ X ℓS α/2 X
hf, hP i2 ℓP α/4
|g|
S ℓR |R| ℓR
S∈S R : R⊃S (r) P : S⊂P ⊆R
X 1 Xˆ α/2 X α/4
ℓS 2 ℓP
= |g| hf, hP i .
|R| S ℓR ℓR
R∈ D S∈S P : S⊂P ⊆R(r)
S⊂R
For i, j ∈ N, let ℓP = 2−j ℓR(r) and ℓS = 2−i ℓR. Extend the sum over all P such that P ⊆ R(r)
and rearrange
X 1 Xˆ α/2 X α/4
ℓS 2 ℓP
|g| hf, hP i
|R| S ℓR ℓR
R∈ D S∈S (r) P : S⊂P ⊆R
S⊂R
XX X X α/4
1 ℓP
ˆ
−iα/2 2
= 2 |g| hf, hP i 2rα/4
i,j R∈ D
|R|
S⊂R S ℓR(r)
P ⊆R(r)
ℓS=2−i ℓR ℓP =2−j ℓR(r)
X X X
.r,d 2−iα/2 2−jα/4 |g| hf, hP i2
i,j R∈ D R P ⊆R(r)
ℓP =2−j ℓR(r)
X X X X
.r,d 2−jα/4 |g| hf, hP i2 ≤ 3d 2−jα/4 BjD(g, f ).
j∈N R(r) j∈N
R(r) ∈ D P ⊆R(r)
ℓP =2−j ℓR(r)
Then split the sum over P and consider the two cases:
X X X
= + =: (i) + (ii).
P : P ⊃S P :P ⊃R(r) P : P ⊆R(r)
P ⊃S
Lemma 7.8 (Bound for (i)). Let FP be either h∆P f iPR or hf iP 1{PS ∈S } . Then
!2
X X X ¨ X dt
|FP | · KtSr 1Rd \PS (x) |PSr g| dx . ΛS (f, g).
WR t
S∈S Sr ∈chr (S) R:R⊃S P : P ⊃R(r)
Lemma 7.9 (Bound for (ii)). Let FP be either h∆P f iPS or hf iP 1{PS ∈S } . Then
X X X ¨ X 2 dt
|FP | · KtSr 1Rd \PS (x) |PSr g| dx . ΛS (f, g).
WR t
S∈S Sr ∈chr (S) R : R⊃S P : P ⊃S
P ⊆R(r)
Proof of Lemma 7.8. In this case P ⊃ R ⊃ S, so the dyadic child PS equals PR . Using
Lemma 7.4, since ℓSr < ℓS, we have
η η η
ℓSr ℓS ℓR
1
KtSr Rd \PR (x) . ≤ .
ℓPR ℓR ℓPR
´ ℓR
We bound ℓR/2 dt/t ≤ 1 and then apply Cauchy–Schwarz
X X X X !2
ℓS η ℓR η
|FP | kPSr gkL1
ℓR ℓPR
S∈S Sr ∈chr (S) R:R⊃S P : P ⊃R(r)
X X X X 2η
ℓS ℓR η
≤ FP2 kPSr gkL1
ℓR ℓP R
S∈S Sr ∈chr (S) R:R⊃S P : P ⊃R
(r)
P η
since P : P ⊃R(r) (ℓR/ℓPR ) ≤ 1. Bound the sum of Haar projections as in (7.2)
X X X X
ℓS 2η ℓR η
FP2 kPSr gkL1
ℓR ℓPR
S∈S Sr ∈chr (S) R:R⊃S P :P ⊃R(r)
X X X 2η ˆ
2 ℓS ℓR η
. FP |g|.
ℓR ℓPR S
S∈S R:R⊃S P :P ⊃R(r)
X X η X X 2η ˆ
ℓR ℓS
FP2 |g|
ℓPR ℓR S
R∈ D P :P ⊃R(r) i∈N S∈S
S⊂R
ℓS=2−i ℓR
X X η X
ℓR
ˆ
≤ FP2 2 −iη
|g|
ℓPR R
R∈ D P :P ⊃R(r) i∈N
X X X η ˆ
ℓR
≤ FP2 |g|
ℓPR R
P ∈D k≥r R : R⊂P
ℓR=2−k−1 ℓP
X X ˆ X ˆ
≤ FP2 2−kη |g| ≤ FP2 |g|.
P ∈D k≥r P P ∈D P
X ˆ X hf, hP i2 ˆ X ˆ X ˆ
FP2 |g| ≤ |g| FP2 |g| = hf i2P |g|
P |P | P P ∈D P P : PR ∈S P
P ∈D P ∈D
≤ 3d B0D(g, f ). = ΛS ′ (f, g)
Then B0D(g, f ) is bounded by a sparse where S ′ is the sparse collection of
form in Lemma 8.4. dyadic parents of S .
Proof of Lemma 7.9. For x ∈ Sr and t ∈ (ℓR/2, ℓR), since Sr ⊂ S ⊂ P ⊆ R(r) , by Lemma 7.4
η
ℓSr
1
KtSr ( Rd \PS )(x) . .
ℓR(r)
´ ℓR
Estimate the integral ℓR/2 dt/t ≤ 1 and the sum of Haar projections as in (7.2).
X X X¨ X η !2
ℓSr dt
FP |PSr g| dx
WR ℓR(r) t
S∈S Sr ∈chr (S) R⊃S P :P ⊃S
P ⊆R(r)
X X X ℓS η X η/2 η/2 !2
ℓS ℓP
≤ kPSr gkL1 FP
S∈S Sr ∈chr (S) R : R⊃S
ℓR(r) P :P ⊃S
ℓP ℓR(r)
P ⊆R(r)
Xˆ X ℓS η X η/2 η/2 !2
ℓS ℓP
. |g| FP .
S∈S S R : R⊃S
ℓR(r) P :P ⊃S
ℓP ℓR(r)
P ⊆R(r)
20 GIANMARCO BROCCHI
Apply Cauchy–Schwarz.
Xˆ X ℓS η X η/2 η/2 !2
ℓS ℓP
|g| FP
S R : R⊃S
ℓR(r) P :P ⊃S
ℓP ℓR(r)
S∈S
P ⊆R(r)
Xˆ X ℓS η X
ℓS
η X ℓP η
≤ |g| FP2 ·
S R : R⊃S
ℓR(r) P :P ⊃S
ℓP
P :P ⊃S
ℓR(r)
S∈S
P ⊆R(r) P ⊆R(r)
The last sum is finite: since P ⊃ S there is only one ancestor for each generation. Since all
terms are non-negative, we bound by removing the restriction P ⊂ R(r) in the sum in P .
Xˆ X ℓS η X η
ℓS Xˆ X η
ℓS
2 2
|g| (r)
FP ≤ |g| FP
ℓR ℓP ℓP
S∈S S R : R⊃S P :P ⊃S S∈S S P :P ⊃S
X X X ˆ
= FP2 2−iη |g|
P ∈D i∈N S∈S S
S⊂P
ℓS=2−i ℓP
X ˆ X
≤ FP2 |g| 2−iη .
P ∈D P i∈N
The two cases for FP are as at the end of the proof of Lemma 7.8.
(k+1) (k)
)∁
Proof. Decompose (Q0 in the union of Q0 \ Q0 for k ∈ Z+ . The non-zero terms in
(k) (k)
Bj ↾Q∁ (g, f ) are the ones where P intersects Q0 and (Q0 )∁ . Then P ⊃ Q0 and in particular
D
0
(m)
P = Q0 for m > k. There is only one ancestor for each m, so we have
!2
ˆ X X hf, hP i
D
Bj ↾Q(k+1) \Q(k) (g, f ) = h|g|i3K 1P (x) dx
0 0 (k+1)
Q0
(k)
\Q0 K∈ D P ⊂3K
|P |1/2
ℓP =2−j ℓK
(k)
P ⊃Q0
∞
X (m)
. h|g|i3Q(m+j) h|f |i2 (m) |Q0 |
0 Q0
m=k+1
∞
X
= 3−d 2−(m+j)d h|g|iQ0 2−2md h|f |i2Q0 2md |Q0 |
m=k+1
∞
X
−jd
≤2 h|g|iQ0 h|f |i2Q0 |Q0 | 2−2md .
m=k+1
The last sum is bounded by 2−kd and summing over k ∈ Z+ concludes the proof.
A SPARSE QUADRATIC T 1 THEOREM 21
It’s enough to construct a sparse family inside Q0 . Taking the supremum of h|g|i3K over all
K ∈ D we have ˆ
Bj (g, f ) ≤ M 3 Dg(x) · (Sj3 Df (x))2 dx
D
where M 3 D and Sj3 D denote the maximal function and the square function given by
2 X X hf, hP i2
M 3 Df := sup h|f |i3Q 13Q , Sj3 Df (x) := 1P (x). (8.1)
Q∈ D |P |
R∈ D P ∈D
P ⊂3R
ℓP =2−j ℓR
As we see below, Sj3 D is pointwise controlled by the square function SjDf (x) given by
2 X X hf, hP i2
SjDf (x) := 1P (x).
|P |
Q∈ D P ∈D
P ⊂Q
ℓP =2−j ℓQ
Proposition 8.2 (Pointwise control). Let f ∈ L2 (Rd ) and j ∈ N0 . For all x ∈ Rd it holds that
SjDf (x) ≤ Sj3 Df (x) ≤ 3d/2 SjDf (x)
Proof. The enlarged cube 3R is the union of 3d cubes {Ra }a in the same dyadic grid D. So
2 X X 1P (x)
Sj3 Df (x) = hf, hP i2
|P |
R∈ D P : P ⊂3R
ℓP =2−j ℓR
d
3
XX X 1P (x)
= hf, hP i2
|P |
R∈ D a=1 P : P ⊂Ra
ℓP =2−j ℓRa
3d
X 2 2
= SjDf (x) ≤ 3d SjDf (x) .
a=1
We show that the square function SjD satisfies a weak (1, 1) bound. The proof follows the one
for dyadic shifts without separation of scales [Hyt+14, Theorem 5.2] and [LM17b, Lemma 4.4].
Proposition 8.3. Let j ∈ Z+ . There exists C > 0 such that for any f ∈ L1 (Rd ) it holds that
sup λ|{x ∈ Rd : SjDf (x) > λ}| ≤ C(1 + j)kf kL1 .
λ>0
The last bound follows by using Chebyshev’s inequality for the good part:
D 2
Sj g > λ ≤ 4 kSjDgk22 . kgk2 . kf k1
2 λ2 λ2 λ
and Markov’s inequality for the bad part. The sublinearity of SjD and the triangle inequality
imply that X
kSjDbkL1 (E ∁ ) ≤ kSjDbQ kL1 (E ∁ ) .
Q∈L
For each Q ∈ L, only dyadic cubes K ⊃ Q contribute to the norm kSjDbQ kL1 (E ∁ ) , since if K ⊆ Q,
then K would be inside E. Thus K is an ancestor of Q, so K = Q(k) for some integer k ≥ 1.
For k > j each j-child P ⊂ K contains Q, and so hbQ , hP i vanishes, by the zero average of bQ .
Thus we estimate
1P (x)
ˆ X X
D
kSj bQ kL1 (E ∁ ) ≤ |hbQ , hP i| 1/2 dx
E ∁ K∈ D P ⊂K |P |
ℓP =2−j ℓK
j
X X X
≤ |hbQ , hP i||P |1/2
k=1 K⊃Q P ⊂K
−j
ℓK=ℓQ(k) ℓP =2 ℓK
j
X X X
≤ kbQ kL1 (P )
k=1 K⊃Q P ⊂K
−j
ℓK=ℓQ(k) ℓP =2 ℓK
j
X X j
X
≤ kbQ kL1 (K) = kbQ kL1 .
k=1 K⊃Q k=1
ℓK=ℓQ(k)
Since kbQ kL1 (K) = kbQ kL1 . λ|Q| < Q |f |, and there is only one ancestor of Q for each k, we
´
have
Xj Xj ˆ ˆ
kbQ kL1 . |f | ≤ j |f |.
k=1 k=1 Q Q
The operator M 3D defined in (8.1) is also weak (1, 1) as it is bounded by the Hardy–Littlewood
maximal function, which is weakly bounded.
The following lemma exploits the weak boundedness of the operators M 3 D and SjD to construct
a sparse collection S . The proof adapts the one in [LM17b, Lemma 4.5] to our square function.
We include the details for the convenience of the reader.
Lemma 8.4 (Sparse domination of BjD). Let j ∈ Z+ . For any pair of compactly supported
functions f, g ∈ L∞ (Rd ) there exists a sparse collection S such that
ˆ X
BjD(g, f ) . M 3 Dg · (SjDf )2 . (1 + j)2 h|f |i2S h|g|iS |S|
S∈S
where the implicit constant does not depend on j.
Proof of Lemma 8.4. Fix a cube Q0 ∈ D containing the union of the supports of f and g. By
Lemma 8.1 it is enough to construct a sparse family inside Q0 . Consider the set
F (Q0 ) := {x ∈ Q0 : M 3 Dg(x) > Ch|g|iQ0 } ∪ {x ∈ Q0 : SjDf (x) > C(1 + j)h|f |iQ0 }.
A SPARSE QUADRATIC T 1 THEOREM 23
By the weak boundedness of M 3 D and SjD, there exists C > 0 such that |F (Q0 )| ≤ 12 |Q0 |. Then
ˆ ˆ ˆ
3D D 2 3D D 2
M g · (Sj f ) ≤ M g · (Sj f ) + M 3 Dg · (SjDf )2
Q0 Q0 \F (Q0 ) F (Q0 )
Xˆ
3 2 2
≤ C (1 + j) h|g|iQ0 h|f |iQ0 |Q0 | + M 3 Dg · (SjDf )2
Q∈ F Q
where F is the collection of maximal dyadic cubes covering F (Q0 ). Iterating on each Q ∈ F
produces a sparse family of cubes S , since {EQ := Q \ F (Q)}Q∈S are pairwise disjoint and
|EQ | > 12 |Q| for each Q in S .
This shows that the P in Pk (R) are exactly the cubes contained in 3R(k) with ℓP = ℓR(k) ,
and there are 3d of such cubes.
near : For P such that 3P \ P ⊃ R and ℓP ≥ 2r+1 ℓR, the decay comes from d(P, R),
which is bounded below by ℓP (ℓR/ℓP )γ , and γ = α/(4α + 4d). Then
√ ∞ √ !α ∞
X ( ℓRℓP )α X X |P | ℓP ℓR X
|P | = . d 2−kα/4
d(R, P )α+d d(P, R)d d(P, R)
P : 3P \P ⊃R k=r+1 P : 3P \P ⊃R k=r+1
ℓP ≥2r+1 ℓR ℓP =2k ℓR
where, by Lemma 9.1, the P in the sum are at most 3d for each k.
close : For ℓR ≤ ℓP ≤ ℓR(r) and 3P ⊃ R, the leading term in the long-distance is ℓR.
√
( ℓRℓP )α (2r/2 ℓR)α |P | |P |
|P | ≤ ≤ 2αr/2 .
D(R, P )α+d (ℓR)α |R| |R|
To estimate the term we will fix a scale k for P , such that 0 ≤ k ≤ r, then
√ r
X ( ℓRℓP )α X |P | X X
|P | . = 2kd
D(R, P )α+d |R|
P : 3P ⊃R P : 3P ⊃R k=0 P : 3P ⊃R
ℓR≤ℓP ≤ℓR(r) ℓR≤ℓP ≤ℓR(r) ℓP =2k ℓR
X r
rd
≤2 |{P : 3P ⊃ R, ℓP = 2k ℓR}| ≤ 2rd 3d (r + 1).
k=0
α+d
|P | ≤ = 2 2 2 .d 2−k 2 < ∞.
D(R, P ) ℓR
P : P ⊂3R P ⊂3R k=1 P ⊂3R k=1
ℓP =2−k ℓR
subscale, P 6⊂ 3R : In this case d(P, R) > ℓR > ℓP . Regroup the P according to length
and distance:
√ d α
X ( ℓRℓP )α X X ℓR ℓR
−kd −kα/2
|P | = 2 2
D(R, P )α+d D(P, R) D(P, R)
P : P 6⊂3R k∈N P : 2 ℓP =ℓR
k
ℓP <ℓR d(P,R)>ℓR
X X X
≤ 2−k(d+α/2) 2−md 2−mα ≤ 2−kα/2 2−mα .
k,m P : 2k ℓP =ℓR k,m
2m+1 ≥d(P,R)/ℓR>2m
This because there are at most 2md cubes R in the range given by the distance, which
means at most 2md · 2kd cubes P with ℓP = 2−k ℓR.
A SPARSE QUADRATIC T 1 THEOREM 25
Proof of Lemma 4.4 (for ℓP < ℓR). Recall that γ ∈ (0, 21 ). Let K be the minimal cube K ⊃ R
ℓP γ
such that ℓK ≥ 2r ℓR and d(P, R) ≤ ℓK ℓK . (The set of such cubes is not empty since
ℓP
γ 1−γ
ℓK ℓK equals ℓP ℓK
ℓP which goes to infinity as ℓK → ∞.) First, observe that P ⊂ K.
Suppose not, then
ℓP γ ℓR γ P ⊂K ∁
ℓK < ℓK < d(R, ∂K) ≤ d(R, P )
ℓK ℓK
which is absurd because of the second condition on K. It remains to show the upper bound for
ℓK. By minimality of K, one of the following conditions holds: either
!γ
ℓK ℓK ℓP
< 2r ℓR or 1 < d(P, R).
2 2 2 ℓK
A.1. Conditional expectation. Denote by E[ · |GS ] the projection on the space of measurable
functions with respect to the σ-algebra GS .
(
f (x) if x ∈ S \ A(S)
E[f |GS ](x) =
hf iS ′ if x ∈ S ′ for some S ′ ∈ A⋆ (S).
For more details about this operator, we refer the reader to [Hyt+16, §2.6]. Let S be a stopping
family for f . The supremum of E[f |GS ] in S is either f (x) (if A(S) is empty), or hf iS ′ for some
S ′ ∈ A⋆ (S). In both cases kE[f 1S |GS ]kL∞ (S) .d hf iS , since hf iS ′ ≤ 2d Ahf iS by the stopping
conditions.
where {hǫI }ǫ are the Haar functions on I. Being a sum of Haar functions on cubes in Tree(S),
the Haar projection PS f is constant on any S ′ ∈ A⋆ (S), so it’s measurable on GS . It also holds
that PS f = PS E[f 1S |GS ].
The Haar projection PS f can be seen as a martingale transform, and so it satisfies the following
Lemma A.1 (Lp bound for martingale transform [Bur84]). For 1 < p < ∞ we have
kPS E[f 1S |GS ]kp ≤ Cp kE[f 1S |GS ]kp . (A.1)
Combining (A.1) with the estimate for the supremum of E[f 1S |GS ] one obtains that
kPS f kp .p hf iS .
26 GIANMARCO BROCCHI
S
A⋆ (S)
Sr
Figure 1. An example of stopping tree Tree(S) and the maximal stopping cubes in
A⋆ (S). Below, shifted by r generations, there is the stopping tree Treer (S).
The cubes Q in Treer (S) contained in a specific r-grandchild Sr are high-
lighted.
A.3. Richer σ–algebras and r-Haar projections. The same idea works with slight modifi-
cations when S is the minimal stopping cube containing the r-ancestor of Q. Let Treer (S) be
d
the collection of cubes Q such that Q (r) = S. Define the r-Haar projection on S as
X
PrS f = ∆Q f.
Q∈Treer (S)
Proof of (7.1). The Haar projector PSr f is measurable with respect to the σ-algebra GSr , then
ˆ ˆ
|PSr f | = |PSr E[f 1Sr |GSr ]| ≤ k1Sr kLp′ kPSr E[f 1Sr |GSr ]kLp (Sr )
Sr Sr
by (A.1) .p k1Sr kLp′ kE[f 1Sr |GSr ]kLp (Sr )
1 1
≤ |Sr | p′ |Sr | p kE[f 1Sr |GSr ]k∞
by Lemma A.3 . |Sr |hf iS .
REFERENCES 27
Acknowledgements
This work will be part of the author’s PhD thesis supervised by Maria Carmen Reguera whose
patience, support and guidance is greatly appreciated. The author wishes to thank Gennady
Uraltsev for stimulating discussions on related topics.
References
[BBL17] C. Benea, F. Bernicot, and T. Luque. “Sparse bilinear forms for Bochner Riesz multipliers and
applications”. In: Trans. London Math. Soc. 4.1 (2017), pp. 110–128 (cit. on p. 2).
[BC17] D. Beltran and L. Cladek. “Sparse bounds for pseudodifferential operators”. Preprint. 2017. arXiv:
1711.02339 [math.CA] (cit. on p. 2).
[BFP16] F. Bernicot, D. Frey, and S. Petermichl. “Sharp weighted norm estimates beyond Calderón-Zygmund
theory”. In: Anal. PDE 9.5 (2016), pp. 1079–1113 (cit. on p. 2).
[Buc93] S. M. Buckley. “Estimates for operator norms on weighted spaces and reverse Jensen inequalities”.
In: Trans. Amer. Math. Soc. 340.1 (1993), pp. 253–272 (cit. on p. 2).
[Bur84] D. L. Burkholder. “Boundary value problems and sharp inequalities for martingale transforms”. In:
Ann. Probab. 12.3 (1984), pp. 647–702 (cit. on p. 25).
[CD17] M. J. Carro and C. Domingo-Salazar. “Stein’s square function Gα and sparse operators”. In: J. Geom.
Anal. 27.2 (2017), pp. 1624–1635 (cit. on p. 2).
[CDO18a] A. Culiuc, F. Di Plinio, and Y. Ou. “Domination of multilinear singular integrals by positive sparse
forms”. In: J. Lond. Math. Soc. (2) 98.2 (2018), pp. 369–392 (cit. on p. 2).
[CDO18b] A. Culiuc, F. Di Plinio, and Y. Ou. “Uniform sparse domination of singular integrals via dyadic
shifts”. In: Math. Res. Lett. 25.1 (2018), pp. 21–42 (cit. on p. 2).
[CJ87] M. Christ and J.-L. Journé. “Polynomial growth estimates for multilinear singular integral operators”.
In: Acta Math. 159.1-2 (1987), pp. 51–80 (cit. on p. 1).
[CMP12] D. Cruz-Uribe, J. M. Martell, and C. Pérez. “Sharp weighted estimates for classical operators.” In:
Adv. Math. 229.1 (2012), pp. 408–441 (cit. on p. 2).
[Con+17] J. M. Conde-Alonso, A. Culiuc, F. Di Plinio, and Y. Ou. “A sparse domination principle for rough
singular integrals”. In: Anal. PDE 10.5 (2017), pp. 1255–1284 (cit. on p. 2).
[CR16] J. M. Conde-Alonso and G. Rey. “A pointwise estimate for positive dyadic shifts and some applica-
tions”. In: Math. Ann. 365.3-4 (2016), pp. 1111–1135 (cit. on p. 2).
[DDU18] F. Di Plinio, Y. Q. Do, and G. N. Uraltsev. “Positive sparse domination of variational Carleson
operators”. In: Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 18.4 (2018), pp. 1443–1458 (cit. on p. 2).
[DLR16] C. Domingo-Salazar, M. Lacey, and G. Rey. “Borderline weak-type estimates for singular integrals
and square functions”. In: Bull. Lond. Math. Soc. 48.1 (2016), pp. 63–73 (cit. on p. 2).
[Dra+05] O. Dragičević, L. Grafakos, M. C. Pereyra, and S. Petermichl. “Extrapolation and sharp norm es-
timates for classical operators on weighted Lebesgue spaces”. In: Publ. Mat. 49.1 (2005), pp. 73–91
(cit. on p. 2).
[Gra14] L. Grafakos. Classical Fourier analysis. Third. Vol. 249. Graduate Texts in Mathematics. Springer,
New York, 2014, pp. xviii+638 (cit. on pp. 1, 2, 5, 25).
[Hof08] S. Hofmann. “A local T b theorem for square functions.” In: Perspectives in partial differential equa-
tions, harmonic analysis and applications. A volume in honor of Vladimir G. Maz’ya’s 70th birthday.
Providence, RI: American Mathematical Society (AMS), 2008, pp. 175–185 (cit. on p. 1).
[Hof10] S. Hofmann. “Local T (b) theorems and applications in PDE”. In: Harmonic analysis and partial
differential equations. Vol. 505. Contemp. Math. Amer. Math. Soc., Providence, RI, 2010, pp. 29–52
(cit. on p. 1).
[HTV00] S. Hukovic, S. Treil, and A. Volberg. “The Bellman functions and sharp weighted inequalities for
square functions”. In: Complex analysis, operators, and related topics. Vol. 113. Oper. Theory Adv.
Appl. Birkhäuser, Basel, 2000, pp. 97–113 (cit. on p. 2).
[Hyt+14] T. Hytönen, C. Pérez, S. Treil, and A. Volberg. “Sharp weighted estimates for dyadic shifts and
the A2 conjecture”. In: J. Reine Angew. Math. 687 (2014), pp. 43–86. arXiv: 1010.0755 [math.CA]
(cit. on p. 21).
[Hyt+16] T. Hytönen, J. van Neerven, M. Veraar, and L. Weis. Analysis in Banach spaces. Volume I. Martin-
gales and Littlewood-Paley theory. Vol. 63. Cham: Springer, 2016, pp. xvii + 614 (cit. on p. 25).
[Hyt12] T. P. Hytönen. “The sharp weighted bound for general Calderón-Zygmund operators.” In: Ann. Math.
(2) 175.3 (2012), pp. 1473–1506 (cit. on pp. 2, 4).
[Hyt17] T. P. Hytönen. “Representation of singular integrals by dyadic operators, and the A2 theorem.” In:
Expo. Math. 35.2 (2017), pp. 166–205. arXiv: 1108.5119 [math.CA] (cit. on pp. 4, 8).
[Lac17] M. T. Lacey. “An elementary proof of the A2 bound”. In: Israel J. Math. 217.1 (2017), pp. 181–195
(cit. on p. 2).
28 REFERENCES
[Ler06] A. K. Lerner. “On some sharp weighted norm inequalities”. In: J. Funct. Anal. 232.2 (2006), pp. 477–
494 (cit. on p. 2).
[Ler08] A. K. Lerner. “On some weighted norm inequalities for Littlewood–Paley operators”. In: Illinois
Journal of Mathematics 52.2 (2008), pp. 653–666 (cit. on p. 2).
[Ler11] A. K. Lerner. “Sharp weighted norm inequalities for Littlewood-Paley operators and singular inte-
grals”. In: Adv. Math. 226.5 (2011), pp. 3912–3926. arXiv: 1005.1422 [math.CA] (cit. on p. 2).
[Ler13] A. K. Lerner. “A simple proof of the A2 conjecture”. In: Int. Math. Res. Not. IMRN 14 (2013),
pp. 3159–3170 (cit. on p. 2).
[Ler16] A. K. Lerner. “On pointwise estimates involving sparse operators”. In: New York J. Math. 22 (2016),
pp. 341–349. arXiv: 1512.07247 [math.CA] (cit. on p. 2).
[LL16] M. T. Lacey and K. Li. “On Ap –A∞ type estimates for square functions”. In: Math. Z. 284.3-4 (2016),
pp. 1211–1222 (cit. on p. 2).
[LM17a] M. T. Lacey and H. Martikainen. “Local T b theorem with L2 testing conditions and general measures:
square functions”. In: J. Anal. Math. 133 (2017), pp. 71–89 (cit. on pp. 1, 7, 11).
[LM17b] M. T. Lacey and D. A. Mena. “The sparse T1 theorem.” In: Houston J. Math. 43.1 (2017), pp. 111–
127. arXiv: 1610.01531 [math.CA] (cit. on pp. 2, 21, 22).
[LMR19] M. T. Lacey, D. Mena, and M. C. Reguera. “Sparse bounds for Bochner-Riesz multipliers”. In: J.
Fourier Anal. Appl. 25.2 (2019), pp. 523–537 (cit. on p. 2).
[LN15] A. K. Lerner and F. Nazarov. “Intuitive dyadic calculus: the basics”. Preprint. Aug. 2015. arXiv:
1508.05639 [math.CA].
[LS12] M. T. Lacey and J. Scurry. “Weighted weak type estimates for square functions”. Preprint. 2012.
arXiv: 1211.4219 [math.CA] (cit. on p. 2).
[LS17] M. T. Lacey and S. Spencer. “Sparse bounds for oscillatory and random singular integrals.” In: New
York J. Math. 23 (2017), pp. 119–131 (cit. on p. 2).
[MM14] H. Martikainen and M. Mourgoglou. “Square functions with general measures.” In: Proc. Am. Math.
Soc. 142.11 (2014), pp. 3923–3931 (cit. on pp. 6, 7, 11).
[NTV03] F. Nazarov, S. Treil, and A. Volberg. “The T b-theorem on non-homogeneous spaces”. In: Acta Math.
190.2 (2003), pp. 151–239 (cit. on p. 3).
[Obe19] R. Oberlin. “Sparse bounds for a prototypical singular Radon transform”. In: Canad. Math. Bull.
62.2 (2019), pp. 405–415 (cit. on p. 2).
[Per19] M. C. Pereyra. “Dyadic harmonic analysis and weighted inequalities: the sparse revolution”. In: New
Trends in Applied Harmonic Analysis, Volume 2. Springer, 2019, pp. 159–239. arXiv: 1812.00850
[math.CA] (cit. on pp. 4, 8).
[Sem90] S. Semmes. “Square function estimates and the T (b) theorem”. In: Proc. Amer. Math. Soc. 110.3
(1990), pp. 721–726.
[Wil07] M. Wilson. “The intrinsic square function”. In: Rev. Mat. Iberoam. 23.3 (2007), pp. 771–791 (cit. on
p. 2).
[Wit02] J. Wittwer. “A sharp estimate on the norm of the continuous square function”. In: Proc. Amer. Math.
Soc. 130.8 (2002), pp. 2335–2342 (cit. on p. 2).