Professional Documents
Culture Documents
1. Introduction
1.1. Main results. An integer n ≥ 1 is squarefree if it is not divisible by the
square of a prime. By analogy with questions about prime numbers, a basic problem
in analytic number theory is to understand the distribution of squarefree numbers
in arithmetic progressions and in short intervals. Squarefree numbers ought to be a
simpler, more regular sequence than primes, and yet they present distinct challenges;
for instance we can determine whether n is prime in polynomial time [AKS04], but
there is no known polynomial time algorithm to determine whether n is squarefree.
It was conjectured by Montgomery (see [Cro75]) that for any given ε ∈ (0, 1/100),
and (a, q) = 1,
X 6 x Y 1
(1) µ2 (n) = 2 · 1 − 2 + Oε (x/q)1/4+ε .
n≤x
π q p
p|q
n≡a (mod q)
1In fact establishing that the left-hand side is positive for q = x1−ε is open!
1
2 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
with
ζ(3/2) Y 3 2
(4) C := 1− 2 + 3 .
π p
p p
Assuming the Lindelöf Hypothesis (3) holds in the wider range H ≤ X 2/3−ε .
VARIANCE OF SQUAREFREE INTEGERS 3
We recall that the Lindelöf Hypothesis follows from the Riemann Hypothesis and
asserts that for any given ε > 0 we have |ζ( 21 + it)| ≪ε 1 + |t|ε for all t ∈ R. In
Theorem 3 we will show that if we had (3) in the full range H ≤ X 1−ε then the
Riemann Hypothesis would ensue.
Theorem 1 extends a theorem of Hall [Hal82] who showed that the asymptotic
formula (3) holds in the range H ≤ X 2/9−ε . We will now explain why the range H =
X 1/2 can be considered a threshold in this problem. It is reasonable to conjecture
that given ε > 0, for any 1 ≤ h ≤ x1−ε ,
X
(5) µ2 (n)µ2 (n + h) − C(h)x = Oε (x1/4+ε )
n≤x
with C(h) a constant depending only on h. Summing this conjectural estimate over
h recovers Theorem 1 but only in the range H < X 1/2−ε . Thus Theorem 1 exploits
(unconditionally!) additional cancellations between the error terms in (5).2
We now describe the analogue of Theorem 1 for the distribution of squarefree
numbers in arithmetic progressions with a given modulus. In this case for a given
modulus q the parameter x/q has the same role as the length H of the short interval
in Theorem 1. While the results are analogous they are harder to prove, as is often
the case with q-analogues.
1
Theorem 2. Let ε ∈ (0, 100 ) be given. Let x ≥ q ≥ x5/11+ε be a prime. Then
1 X X
2 6 x Y 1 −1 2
µ (m) − 2 · 1− 2
ϕ(q) m≤x
π q p
(a,q)=1 p|q
m ≡ a (mod q)
(6) r
Y 2 −1 x
=C 1+ · + Oε ((x/q)1/2−ε/16 ),
p q
p|q
(see also [Par19]) and Le Boudec [LB18] who considered individual averages over
(a, q) = 1 as we do in Theorem 2. In particular Nunes showed that (6) holds in the
range x31/41+ε ≤ q = o(x) and Le Boudec showed that the left-hand side of (6) is
Oε ((x/q)1/2+ε ) for all ε > 0 in the range x1/2 ≤ q ≤ x.
Keating and Rudnick [KR16] obtained Theorem 1 and Theorem 2 in the context
of function fields in the limit of a large field size. Their results hold in the (analogues
of) the ranges X ε ≤ H ≤ X 1−ε and xε ≤ q ≤ x1−ε . Our proofs of Theorem 1 and
Theorem 2 can be adapted in the setting of a fixed base field and large degree limit.
In fact our proofs of Theorems 1 and 2 were originally motivated by analogies with
the function field setting. Since we ended up obtaining equally strong results in the
setting of number fields we do not include the proofs in the function field setting.
Finally the next result shows that obtaining nearly optimal upper bounds for (3)
in a complete range is equivalent to the Riemann Hypothesis.
1
Theorem 3. The Riemann Hypothesis holds if and only if for every ε ∈ (0, 100 ) and
1−ε
every 1 ≤ H ≤ X ,
Z
1 2X X 2 6H 2
(7) µ (m) − 2
dx ≪ε,δ H 1/2+δ
X X x<m≤x+H
π
for every δ > 0.
Following the proof of Theorem 3 one can show that for any smooth compactly
supported Φ, conditionally on the Generalized Riemann Hypothesis
1 X X n 6 X m 2
(8) µ2 (n)Φ − 2 Φ ≪ε,δ (x/q)1/2+δ
ϕ(q) x π ϕ(q) x
(a,q)=1 m≡a (mod q) (m,q)=1
1−ε 1
for all δ > 0 and uniformly in 1 ≤ q ≤ x for any given ε ∈ (0, 100 ). However
it is not clear whether (8) implies the Generalized Riemann Hypothesis. Moreover
replacing the smoothing Φ by sharp cut-offs appears to be difficult. For these reasons
we decided not to pursue this further in the present paper.
Finally, we note that we have made no effort to optimize the exponents of error
terms Oε (H 1/2−ε/16 ) and Oε ((x/q)1/2−ε/16 ) in Theorems 1 and 2. Better power saving
estimates, in more restricted ranges, can be found in the papers [Hal82] and [Nun15].
1.2. Fractional Brownian motion. One notable feature of Theorems 1 and 2 is
that while the expected count of squarefrees in a short interval (or likewise arithmetic
progression) is of order H, the variance of these counts is of order H 1/2 . For many
other natural arithmetic sequences (e.g. primes) one conjectures that the variance
of counts is of the same order of magnitude as the expected value of counts.
That the variance is of order H 1/2 in Theorems 1 and 2 speaks to the idea that the
squarefree numbers are “less random” than (for example) the primes (cf. [CS13]).
VARIANCE OF SQUAREFREE INTEGERS 5
One may conjecture that higher moments are gaussian (see [ACS17] for numerical
evidence). For x drawn uniformly at random from [X, 2X], one may even make the
stronger conjecture that the process
1 X
(9) t 7→ 1/4 (µ2 (n) − 1/ζ(2))
H x<n≤x+tH
is depicted in Figure 2. Both Figure 1 and Figure 2 depict the same range of param-
eters to make the comparison easier. The dots on Figure 1 and Figure 2 correspond
to lattice points on the positive x-axis and on the (positive and negative) y-axis and
indicate the difference in scales.
1.4. Conventions and Notations. Throughout the rest of the paper we will allow
the implicit constants in ≪ and O(·) to depend on ε. Furthermore the notation
n ∼ N in the subscript of a sum will mean that N ≤ n < 2N.
with C as in (4).
1
Proposition 2. Let ε ∈ (0, 100 ) be given. Let X ≥ 1 and X ε ≤ H ≤ X 4/7−ε . Let
z ≥ H 4/3+ε . Then
Z
1 2X X X µ(d) 2
(12) µ(d) − H 2
dx ≪ H 1/2−ε/8 .
X X 2 2
d
x<nd ≤x+H 2X≥d >z
d2 >z
Assuming the Lindelöf Hypothesis, the claim holds in the wider range X ε ≤ H ≤
X 2/3−ε and z ≥ H 1+ε .
VARIANCE OF SQUAREFREE INTEGERS 7
Under the assumption of the Lindelöf Hypothesis, the above propositions cover all
the possible values of d2 for X ε ≤ H ≤ X 2/3−ε . However, unconditionally they cover
all the possible values of d2 only for X ε ≤ H ≤ X 6/(11+12ε) . It would be possible to
improve on the exponent 4/7 in Proposition 2, but this would not help. Similarly
it should be possible to prove Proposition 1 only with the condition H 1+ε ≤ z ≤
X/H 1/2+ε by adapting the proof of Proposition 3 below.
√We note that only the terms d with d2 ∈ [H 1−ε , H 1+ε ] contribute to the main term
C H in Proposition 1.
Roughly speaking Proposition 1 depends only on “convex” inputs such as a Fourier
expansion and a point-counting lemma, whereas Proposition 2 exploits large value
estimates of Huxley and subconvexity and fourth moment estimates for the Riemann
zeta-function.
1
Proof of Theorem 1 assuming Proposition 1 and Proposition 2. Let ε ∈ (0, 100 ). If
ε
H ≤ X then the result already follows from Hall’s theorem. We can therefore
assume that H > X ε .
For H ∈ [X ε , X 6/11−ε ], take z = min{X/H 1/2+ε , H 1/2−ε X 1/2 }. Note that z ≥
H 4/3+ε . Denoting by I1 the left-hand side of (11) and by I2 the left-hand side of
(12), we get, using Cauchy-Schwarz, that
Z
1 2X X X µ(d) 2 p
µ(d) − H dx = I 1 + O( I1 I2 + I2 ).
X X 2 2
d2
x<nd ≤x+H d ≤2X
P √
Notice that the tail H d2 >2X µ(d)/d2 is ≪ H/ X. Hence the claim reduces to
showing that
Z
1 2X X X µ(d) H H 2
µ(d) − H 2
· √ + √ dx ≪ H 1/2−ε/16 .
X X 2 2
d X X
x<nd ≤x+H d ≤2X
√
Applying Cauchy-Schwarz and (13) we see that the left hand side is ≪ H 1/4 (H/ X)+
H 2 /X ≪ H 1/2−ε/16 since H ≤ X 2/3−ε .
Likewise the proof of Theorem 2 splits into two steps and depends on the following
propositions.
8 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
1
Proposition 3. Let ε ∈ (0, 100 ). Let q be prime with x1/3+30ε ≤ q ≤ x1−ε and let
1+ε −ε √
(x/q) ≤ z ≤ x · qx. Then
(14)
1 X X 1 X 2
p
µ(d) − µ(d) = C x/q + O (x/q)1/2−ε/16
ϕ(q) 2 2
ϕ(q) 2 2
(a,q)=1 d n≤x, d <z d n≤x, d <z
d2 n ≡ a (mod q) (d2 n,q)=1
with C as in (4).
1
Proposition 4. Let ε ∈ (0, 100 ). Let x ≥ 1 and x3/7+ε ≤ q ≤ x1−ε . Let z ≥
(x/q)4/3+ε . Then
1 X X 1 X 2
(15) µ(d) − µ(d) ≪ (x/q)1/2−ε/8 .
ϕ(q) 2 2
ϕ(q) 2 2
(a,q)=1 d n≤x, d ≥z d n≤x, d ≥z
d2 n ≡ a (mod q) (d2 n,q)=1
Assuming the Generalized Lindelöf Hypothesis, the claim holds in the wider range
x1/3+ε ≤ q ≤ x1−ε and z ≥ (x/q)1+ε .
The proof of Proposition 3 depends once again only on “convex” inputs: in this
case Poisson summation and results on integer solutions to binary quadratic forms
with positive discriminant. However the proof of Proposition 3 is more intricate than
that of Proposition 1 due to a number of technical issues. The proof of Proposition 4
is similar to the proof of Proposition 2 and uses hybrid versions of Huxley’s large value
estimates, subconvexity estimates for L(s, χ) and a hybrid fourth moment estimate.
The deduction of Theorem 2 from the above two proposition is identical to the
deduction of Theorem 1 from Proposition 1 and Proposition 2. The only difference
is that we use the result of Nunes to handle the case when q > x1−ε and we notice
that for prime q,
1 X 1 X j x k j x k
µ(d) = µ(d) −
ϕ(q) ϕ(q) 2 d2 qd2
d2 n≤x d ≤x
(d2 n,q)=1 (d,q)=1
x X µ(d) √x 6 x 1 −1 √x
= +O = 2 1− 2 +O
q 2 d2 q π q q q
d ≤x
(d,q)=1
p
and the total error term incurred is x/q 2 which is ≤ x−ε x/q for q > x1/3+ε .
Theorem 3 depends upon similar principles as Propositions 2 and 4. We prove
Theorem 3 in section 8.
VARIANCE OF SQUAREFREE INTEGERS 9
Finally let us make a few remarks on the bottleneck that prevents us from pushing
our result further. Taking H = X 6/11 , we are unable to show the following estimate,
Z √
1 2X X X µ(d) 2 H
µ(d) − H 2
dx ≪A A
X X d log X
x≤nd2 ≤x+H 2 8/11
d ∼X
d2 ∼X 8/11
Specifically opening µ(d) using Heath-Brown’s identity (see [HB82]) the only situ-
ation that we are not able to estimate is the one in which µ(d) is replaced by two
smooth sums of equal length. Roughly speaking this corresponds to estimating,
Z √
1 2X X X 1 2 H
1−H dx ≪
X X 2 2 2/11
a2 b2 logA X
x≤na b ≤x+H a,b∼X
a,b∼X 2/11
Opening the above expression into Dirichlet polynomials this is roughly equivalent
to Z X 1 X 1 X 1 2 X 6/11
dt ≪
|t|≤X 5/11 3/11
n1/2+it 2/11
a1/2+2it 2/11
b1/2+2it logA X
n∼X a∼X b∼X
Applying the functional equation on the Dirichlet polynomial over n, and setting
Y = X 12/11 we then see that obtaining the above estimate is equivalent to showing
that,
Z X
1 X 1 X 1 2 Y 1/2
1/2+it 1/2+2it 1/2+2it
dt ≪ A
.
|t|≤Y 5/12 1/6
n 1/6
a 1/6
b log Y
n∼Y a∼Y b∼Y
If the 2it in the Dirichlet polynomial over a and b were replaced by it then we would
be facing exactly the same bottleneck as in the case of improving Huxley’s prime
number theorem in short intervals (by a variant of the computations in [HB82], see
also [Har07, Chapter 7]). In particular to make further progress we either need to
find a way to improve Huxley’s estimate or find a way to exploit the fact that the
phases in two of the Dirichlet polynomials are 2it and not it. Unfortunately we do
not see how to make progress on either of these questions.
3. Lemmas
3.1. Dirichlet polynomials and L-functions. Let us first collect some standard
results on large values of Dirichlet polynomials and L-functions.
P
Lemma 1 (Large-value theorem). Let N, T ≥ 1 and V > 0. Let F (s) = n≤N an n−s
P
be a Dirichlet polynomial and let G = n≤N |an |2 . Let T be a set of 1-spaced points
tr ∈ [−T, T ] such that |F (itr )| ≥ V . Then
|T | ≪ (GNV −2 + T min{GV −2 , G3 NV −6 })(log 2NT )6
10 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
Proof. This follows from the mean-value theorem and Huxley’s large value theorem,
see e.g. [IK04, Theorem 9.7 and Corollary 9.9].
We will say that a set of tuples (t, χ) with χ a Dirichlet character and t a real
number is well-spaced whenever it holds that if (t, χ) 6= (u, χ′ ) then either χ 6= χ′ or
|t − u| ≥ 1.
Lemma 2P(Hybrid large-value theorem). Let q ∈ N, N, T ≥ 1 and P V > 0.2 Let
−s
F (s, χ) = n≤N an χ(n)n be a Dirichlet polynomial, and let G = n≤N |an | . Let
T be a set of well-spaced tuples (tr , χ) with tr ∈ [−T, T ] and with χ a primitive
character of modulus q such that |F (itr , χ)| ≥ V . Then
|T | ≪ (GNV −2 + qT min{GV −2 , G3 NV −6 }) · (log 2qNT )18 .
Proof. This follows e.g. from [IK04, Theorems 9.16 and 9.18 with k = q and Q =
1].
Lemma 3 (Fourth moment estimate). Let T ≥ 2. Then
Z
|ζ( 12 + it)|4 dt ≪ T (log T )4 .
|t|≤T
Proof. We notice that given a character ψ (mod q) there are at most ≪ q ε characters
χ such that χ2 = ψ. Therefore the left-hand side of the claim is bounded by
X Z X
2
ε
≪q a(n)ψ(n)nit dt
ψ (mod q) |t|≤T n≤N
and the result follows from the standard hybrid mean-value theorem, see e.g. [Mon71,
Theorem 6.4].
where C is as in (4) and where the implied constant depends only on K0 and ε.
Proof. The proof consists of two steps.
The first step is to show that the sum in (17) can be completed into a sum over
all d1 , d2 without affecting the claimed asymptotic. We use the information (16) for
k = 0 and ℓ = 0, 1 to see that, for any ν > 0,
X X X H ε/2 ν 2
(18) |W (λ/ν)|2 ≪ 1+ 2
≪ νH ε/4 .
λ≥1 ε/4 ε/4
λ
1≤λ≤νH λ>νH
Hence
X µ(d1 )µ(d2 ) X Hλ 2 X (d1 , d2)2
2 1+ε/4
2H W ≪H .
1/2
d21 d22 λ≥1
(d21 , d22 ) 1/2
d21 d22
d1 >z d1 >z
or or
d2 >z 1/2 d2 >z 1/2
12 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
Writing (d1 , d2 ) = d0 and di = δi d0 and utilizing symmetry and the lower bound for
z, we see that this is
X 1 X X 1
1+ε/4 1 1+ε/4 d0
≪H 2 2 2
≪H 2
min 1, (1+ε)/2 ≪ H 1/2−ε/5 .
d ≥1 0
d (1+ε)/2
δ δ
1 2 d
d ≥1 0
H
0 δ1 ≥H /d0 0
δ2 ≥1
The second step is to use contour integration to simplify (19). Define g(x) =
|W (ex )|2 ex , which is smooth and decays exponentially as |x| → ∞. Now
Z ∞ Z ∞
x 2 x
ĝ(ξ) = |W (e )| e e(−xξ)dx = |W (y)|2y −2πiξ dy,
−∞ 0
and standard partial integration arguments show that (i) ĝ(ξ) is entire and (ii) ĝ(ξ) =
O(H 2ε /(|ξ| + 1)3) uniformly for |ℑ(ξ)| < 1/(2π). Fourier inversion implies, for r > 0,
Z s
2 −1 1 s
|W (r)| = r r ĝ ds,
2πi (c) 2πi
where the integral is over ℜ(s) = c, and −1 < c < 1.
Hence, taking c = −1/4,
X µ(d1)µ(d2 ) X Hλ 2
2H 2 2 2 W
d1 ,d2 ≥1
d 1 d 2 λ≥1
(d21 , d22 )
Z s
H X µ(d1)µ(d2 ) X (d1 , d2 )2 s s −2s
= H λ (d ,
1 2d ) ĝ ds.
iπ d ,d d21 d22 λ≥1
λ (−1/4) 2πi
1 2
The range of s is such that the sums over both λ and d1 , d2 can be taken inside the
integral, and the above simplifies to
Z Y
H 2 1 s
H s ζ(1 − s) 1 − 2 + 2+2s ĝ ds
iπ (−1/4) p
p p 2πi
Z Y
H s 2 2 1 s
= H ζ(1 − s)ζ(2 + 2s) 1 − 2 + 4+2s − 4+4s ĝ ds.
iπ (−1/4) p
p p p 2πi
The Euler product in the last line converges absolutely for ℜs > −3/4. Therefore
using Lemma 5 (noting that the same bound holds also for ζ(c + it) with c ≥ 1/2)
and bounds on ĝ we can push the contour integral above to the left to an integral
VARIANCE OF SQUAREFREE INTEGERS 13
over ℜ(s) = −3/4 + ε. Because of the singularity from ζ(2 + 2s) at s = −1/2 the
above then simplifies to
Y 3 2 1
1/2
H ζ(3/2) 1 − 2 + 3 ĝ − + O(H 1/4+3ε )
p
p p 4πi
Z ∞ Y
√ 3 2
= H 1/2 |W (y)|2 y dyζ(3/2) 1 − 2 + 3 + O(H 1/2−ε/5 ).
0 p
p p
We split the sum over d1 and d2 into the complementary ranges (d21 , d22 ) ≤ H 1−ε/4
and (d21 , d22 ) > H 1−ε/4 . In the second case we utilize that λ > H ε/4−1 (d21 , d22 ), and we
see that the above is
X (d1 , d2)4 X (d1 , d2 )2
1−ε/4
≪ 2 2
+ H 2 2
.
2 1−ε/4
d 1 d 2 2 1−ε/4
d 1 d 2
(d1 ,d2 ) ≤H (d1 ,d2 ) >H
14 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
Proof. The proof can be found in a paper by Saffari and Vaughan [SV77, Page 25]
but for the convenience of the reader we include the proof here.
First note that by the triangle inequality we have, for any v ≥ H,
|F (x + H) − F (x)|2 ≪ |F (x + v) − F (x)|2 + |F (x + v) − F (x + H)|2.
Integrating this over x ∈ [X, 2X] and v ∈ [2H, 3H],
Z 2X Z 3H Z 2X
2
H |F (x + H) − F (x)| dx ≪ |F (x + v) − F (x)|2 dxdv
X 2H X
Z 3H Z 2X
+ |F (x + v) − F (x + H)|2 dxdv.
2H X
By a change of variables the right-hand side is equal to
Z 3H Z 2X Z 2H Z 2X+H
2
|F (x + v) − F (x)| dxdv + |F (y + w) − F (y)|2 dydw
2H X H X+H
Z 3H Z 3X Z 3X Z 3H
2
≤ |F (x + v) − F (x)| dxdv = |F (x + v) − F (x)|2 dvdx.
H X X H
Changing the order of integration was justified by Fubini’s theorem. Letting v = θx
in the inner integral of the last expression above, we see the right-hand side is equal
to
Z 3X Z 3H/x Z 3X Z 3H/X
2
|F (x + θx) − F (x)| x dθdx ≪ X |F (x + θx) − F (x)|2 dθdx.
X H/x X H/3X
VARIANCE OF SQUAREFREE INTEGERS 15
and
2 2
1 X X
X X X
bn χ(n) = bn1 bn2 = bn .
(24) ϕ(q)
χ (mod q) n n1 ≡ n2 (mod q) a (mod q) n=a (mod q)
(n1 n2 ,q)=1 (a,q)=1
If M2 < T , then once m1 is chosen there are at most two choices for m2 . Therefore
it suffices to count the number of integers m1 ∼ M1 such that
r
b
M2
m1
≤ .
a
T
The result now follows from Lemma 11.
Here
X
(28) 1 = H/d2 + ψ(x/d2 ) − ψ((x + H)/d2),
x/d2 <n≤(x+H)/d2
where ψ(y) = y − [y] − 1/2 with [y] the integral part of y. For ψ we have the Fourier
expansion (see e.g. [IK04, (4.18)])
1 X 1
(29) ψ(y) = − e(yn) + O(min{1, 1/(Nkyk)}).
2πi n
0<|n|≤N
18 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
We take N = X 10 and plug (29) into (28). The arising error term is O(1/X 5) unless
kx/d2 k < X −5 or k(x + H)/d2k < X −5 . Given this, it is easy to see that the error
term leads to acceptable contribution to the left hand side (27).
Hence, the left hand side of (27) can be replaced by
1
Z ∞ X
x X 1 nx
nH
2
σ µ(d) e 1 − e dx.
4π 2 X −∞ X 2
n d2 d2
d ≤z 0<|n|≤N
Owing to the support of σ̂ this implies that we may restrict the sum in (30) to those
integers for which
n
1 n2 BH ε/2
(31) 2 − 2 ≤ .
d1 d2 X
We consider separately those (n1 , n2 , d1 , d2 ) for which n1 d22 = n2 d21 and those for
which this does not hold. In the first case parameterizing solutions in n1 and n2 by
n1 = λd21 /(d21 , d22 ) and n2 = λd22 /(d21, d22 ) for λ ∈ Z \ {0}, we obtain
2
σ̂(0) X X (d1 , d2 )4
1 − e λH
+O 1 ,
µ(d )µ(d ) (d1 , d2 )2
1 2
4π 2 2 2 λ6=0
d21 d22 λ2 X5
d1 ,d2 ≤z
where the error term comes from adding |ni | > N (for which surely |λ| > X 8 ). Here
λH
1 − e = 2 sin λπH ,
(d1 , d2 )2 (d21 , d22 )
Summing over m ≪ min{N1 , N2 } and using this bound in (35), the maximum in the
resulting bound for (35) is attained for Nj = Dj2 /H. Hence we obtain that (35) is at
most H 2+ε/2+ε/500 /X ≤ H 1/2−ε/2 since H ≤ X 2/3−ε .
Let us now dispose of the smoothing σ: Take B to be a sufficiently large absolute
constant that there exist integrable functions σ− and σ+ such that σb− and σb+ have
ε/2 ε/2
support [−BH , BH ], and
Z
σ− ≤ 1[1,2] ≤ σ+ , and σ± (x) dx − 1 ≤ H −ε/2.
Splitting into dyadic ranges according to the size of d, it essentially suffices to show
that, for each D ∈ [z 1/2 , (2X)1/2 ], we have
Z
1 2X X X µ(d) 2
(36) µ(d) − H 2
dx ≪ H 1/2−ε/4 .
X X 2 d∼D
d
x<nd ≤x+H
d∼D
Let
X X µ(d)
A(x) := µ(d) − x .
d2
nd2 ≤x d∼D
d∼D
Using this definition and Lemma 10, we see that the left-hand side of (36) is
Z Z
1 2X 2 1 3X
(37) |A(x + H) − A(x)| dx ≪ |A(u(1 + θ)) − A(u)|2 du
X X X X
VARIANCE OF SQUAREFREE INTEGERS 21
H 3H H
for some θ ∈ [ 3X , X ]. Choose w such that ew = 1 + θ, so that w ≍ X
. By contour
integration
Z 2+i∞ X µ(d)
y 1 eys
(38) A(e ) = ζ(s)M(2s)ds − ey ,
2πi 2−i∞ s d2
d∼D
where
X µ(d)
M(s) := s
.
d∼D
d
Moving the contour to the line ℜs = 1/2 we notice that the residue from s = 1
cancels with the second term on the right-hand side of (38), and we obtain
Z 1
A(ew+x ) − A(ex ) 1 ew( 2 +it) − 1 itx 1
= 1 e ζ( 2 + it)M(1 + 2it)dt.
ex/2 2π R 2
+ it
Therefore, by Plancherel,
Z Z w( 1 +it)
∞
u+w u du
2 e 2 − 1 2
(39) |A(e ) − A(e )| · u ≪ 1 · |ζ( 21 + it)M(1 + 2it)|2 dt.
0 e R 2
+ it
(40)
Z Z ∞
1 2X 2 du
|A(x + H) − A(x)| dx ≪ X |A(u(1 + θ)) − A(u)|2 2
X X 0 u
Z w( 1 +it)
e 2 − 1 2
≪X 1 · |ζ( 12 + it)M(1 + 2it)|2 dt
R 2
+ it
Z n H 2 1 o
≪ X min , 2 · |ζ( 21 + it)M(1 + 2it)|2 dt.
R X |t|
On the other hand, the contribution of |t| ≤ X 2 to the right-hand side of (40) is at
most
Z
H2
≪ |ζ( 21 + it)M(1 + 2it)|2 dt
X |t|≤2X/H
Z X2 Z
1 1
+X 2
· |ζ( 21 + it)M(1 + 2it)|2 dtdT
X/H T T T ≤|t|≤2T
Z
1
(41) ≪H sup |ζ( 21 + it)M(1 + 2it)|2 dt + O(1).
X/H≤T ≤X 2 T |t|≤T
Let us now prove the claim on the assumption of the Lindelöf Hypothesis. Applying
Lindelöf and then the mean-value theorem (Lemma 7 with q = 1), we have for any
choice of δ > 0,
Z Z
H 2 HT δ
|ζ(1/2 + it)M(1 + i2t)| dt ≪ |M(1 + i2t)|2 dt
T |t|≤T T |t|≤T
HT δ 1 HT δ HT δ
≪ (T + D) · ≪ + .
T D D T
Recall we have D ≥ z 1/2 ≥ H (1+ε)/2 , H ≤ X 2/3−ε and X/H ≤ T ≤ X 2 . Hence the
above is
H 2−δ H 2−δ
≪ H 1/2−ε/2 T δ + ≪ H 1/2−ε/4
+ ≪ H 1/2−ε/4 ,
X 1−δ H (1−δ)/(2/3−ε)
for δ sufficiently small. Applying this bound to (41) yields the claim.
Let us now prove the unconditional part of the proposition. First notice that the
values of t for which |M(1 + 2it)| ≤ D −1/2+ε/16 contribute to (41) by Cauchy-Schwarz
and the fourth moment bound (Lemma 3) O(H 1+ε/16 D −1+ε/8 ) = O(H 1/2−ε/4 ), and
therefore their contribution is always acceptable. Writing
S(V ) = {t ∈ [−T, T ] : V ≤ |M(1 + 2it)| < 2V },
Consider first the case when the first term dominates here. Then by Lemma 5 we
have
Z
H 2 H H H 5/3
V |ζ(1/2 + it)|2 dt ≪ T 1/3+ε/2 ≪ 2/3−ε/2 ≪ 2/3−ε/2 ≤ H 1/2−ε/3
T S(V ) T T X
since H ≤ X 4/7−ε .
Consider now the case that the second term dominates in (42). Then, by Cauchy-
Schwarz and the fourth moment estimate (Lemma 3),
Z Z 1/2
H 2 2 H 2 1/2 1 4
V |ζ(1/2 + it)| dt ≪ V |S(V )| |ζ( 2 + it)| dt
T S(V ) T |t|≤T
with C as in (4).
The proof of Proposition 6 is based on two Propositions that we now describe.
Proposition 7 below will be used to introduce a smoothing into (43). Note that it
√ √
gives an upper bound that is o( qx) whenever z = o( qx/(log x)6 ) and the interval
I has length o(x/(log x)12 ).
Proposition 7. Let q be prime with q ≤ x, let z ≤ x. Let I ⊂ [1, 2x] be an interval.
Then
1 X X 2 p
2 6
(44) µ(d)χ(d )χ(n) ≪ (log x) · z + |I|q .
ϕ(q)
χ (mod q) d2 ≤z
χ6=χ0 nd2 ∈I
We will use the following proposition to evaluate the smoothed analogue of (43).
24 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
where C is as in (4).
One way to construct f satisfying the assumptions of the proposition is to take
φ(t) to be a smooth function which vanishes for negative t and has φ(t) = 1 for t
greater than 1, and then set f (u) = φ((x/q)ε/4 u)φ((x/q)ε/4(1 − u)).
With these two propositions at hand we are ready to prove Proposition 6.
where
1 X X n 2
S1 := An χ(n)f
ϕ(q) n
x
χ (mod q)
χ6=χ0
and
1 X X n 2
S2 := An χ(n) 1 − f
ϕ(q) x
χ (mod q) n∈I
χ6=χ0
By partial summation,
X n Z t
An χ(n) 1 − f = 1−f dB2 (χ; t)
n∈I2
x I 2
x
Z t
1
= f′ B2 (χ; t)dt + B2 (χ; x).
x I2 x
Hence,
1 X X n 2
A n χ(n) 1 − f
ϕ(q) χ6=χ x
0 n∈I2
Z
1 X 1
′ t
2
1 X
≪ f B2 (χ; t)dt + B2 (χ; x)|2
ϕ(q) χ6=χ x I2 x ϕ(q) χ6=χ
0 0
1 X 1Z 1 X
(45) ≤ (x/q)ε/4 · |B2 (χ; t)|2 dt + B2 (χ; x)|2 .
ϕ(q) x I2 ϕ(q)
χ6=χ0 χ6=χ0
Therefore (45) is
1 √ √
≪ (log x)6 · (x/q)ε/4 · · x · (x/q)−ε/4 · x−ε qx + x3/2 (x/q)−3ε/8 q
x
√ √
+ (log x)6 (x/q)−ε/8 qx ≪ (log x)6 (x/q)−ε/8 · qx.
A similar argument shows that
1 X X n 2
√
An χ(n) 1 − f ≪ (log x)6 (x/q)−ε/8 · qx.
ϕ(q) χ6=χ n∈I x
0 1
where
(
1 na nb |I| 2
d2 q
if |n| ≤ d|I|q ,
fI/d2 (n) = · e 2 − e 2 ≪ gI/d2 (n) := 1
n dq dq n
otherwise .
We split d and n into dyadic intervals and bound the left-hand side of (44) by
X X X 2
(46) (log x)2 sup µ(d)χ(d2) χ(n)fI/d2 (n) + O(z(log q)2 ).
D≤z 1/2 χ (mod q) d∼D n∼N
1≤N ≤q
The error term is clearly acceptable. We bound the main term of (46) using a
majorant principle — by going through the first equality in (24) we can replace
coefficients µ(d) and fI/d2 (n) by their majorants. Hence we get the bound
X X d X 2
≪ (log x)2 sup χ2 (d)V · χ(n)V (n/N)gI/D2 (N)
D≤z 1/2 χ D n
d
1≤N ≤q
We claim that
(48)
X X n 4
X X n 4
χ2 (n)V ≪ qD 2 ·(log x)4 and χ(n)V ≪ qN 2 ·(log x)4 .
2 n
D n
N
χ 6=χ0 χ6=χ0
We explain the second bound in (48); the first bound is similar. Let Ve be the Mellin
transform of V . Using contour integration, the decay of Ve , and Hölder, we get, for
every A ≥ 1,
X X n 4
X Z 4
χ(n)V = e
L(1/2 + it, χ)V (1/2 + it)N 1/2+it
dt
χ6=χ0 n
N χ6=χ0 R
X Z
4
≪A N 2 L(1/2 + it, χ)(1 + |t|)−A dt
χ6=χ0 R
X Z 4
2
≪A N L(1/2 + it, χ) (1 + |t|)−A dt.
χ6=χ0 R
VARIANCE OF SQUAREFREE INTEGERS 27
A dyadic decomposition of the integration range and the fourth moment bound for
Dirichlet L-functions (Lemma 4) yield the second part of (48).
Using (48) in (47), we obtain an upper bound
≪ (log x)6 sup gI/D2 (N)2 qDN
D≤z 1/2
1≤N ≤q
Recalling the definition of gI/D2 (N), we see that on the last line, the first N-
supremum is attained for N = 1 and the second N-supremum is attained for
N = D 2 q/|I|, and we get the bound
p
≪ (log x)6 sup
√ qD + (log x) 6
√ sup |I|/D ≪ (log x) 6
|I|q
D≤ |I|/q 1/2 |I|/q<D≤z
6.3. Proof of Proposition 8. We apply Poisson summation (see e.g. [IK04, formula
(4.26)]) in the sum over n, getting
X nd2 x X xℓ
χ(n)f = τ (χ) · 2 χ(ℓ)fˆ 2 .
n
x qd ℓ dq
Therefore we have to asymptotically estimate
q x2 X X µ(d) 2 ˆ
xℓ 2
(49) · 2 2
χ(d )χ(ℓ)f 2 =
ϕ(q) q χ6=χ 2 d dq
0 d ≤z
ℓ∈Z
x2 X µ(d1 ) µ(d2) ˆ xn2 ˆ xn1
(50) 2 2
·f 2 f 2 + O(zx2ε/3 ),
q n1 ,n2 ∈Z
d1 d2 d1 q d2 q
d21 ,d22 ≤z
d21 n1 ≡ d22 n2 (mod q)
(d1 d2 n1 n2 ,q)=1
and where O(zx2ε/3 ) comes from the principal character and from replacing ϕ(q) by
√
q. We note that since z ≤ x−ε qx this contribution is acceptable. Notice that we
√
can add and remove the restrictions d1 , d2 > x1/2−ε/6 / q and |n1 |, |n2 | ≤ xε/3 · zq/x
at will because they cost us a negligible error term that is ≪A x−A for any given
A > 0. Moreover note that n1 and n2 now traverse all of Z.
We now separate the set of tuples (n1 , n2 ) into
M := {(k12 m, k22 m) : m ∈ Z squarefree, k1 , k2 ∈ N}
28 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
and the complement. The (n1 , n2 ) ∈ M contribute to a main term that is relatively
easy to compute. On the other hand we will bound the contribution of (n1 , n2 ) 6∈ M.
6.3.1. The main term (n1 , n2 ) ∈ M. The conditions d21 n1 ≡ d22 n2 (mod q) and (n1 n2 , q) =
in the sum in (50) imply that if (n1 , n2 ) ∈ M then d21 k12 ≡ d22 k22 (mod q) p
and therefore
√
d1 k1 ≡p±d2 k2 (mod q). This implies that d1 k1 = d2 k2 since dj kj ≤ z · xε/3 zq/x =
√
xε/6 z · q/x and this is ≤ q/3 because z ≤ x−ε qx. We conclude that the contri-
bution of (n1 , n2 ) ∈ M is given by
x2 X X µ(d1 )µ(d2) X 2 2 2
(51) µ (m) ˆ xk2 m fˆ xk1 m .
f
q k ,k d1 k1 =d2 k2
d21 · d22 d21 q d22 q
1 2 (m,q)=1
d21 ,d22 ≤z
(d1 d2 k1 k2 ,q)=1
Note that we can drop the condition (d1 d2 , q) = 1 as q is prime and d1 , d2 < q.
Likewise since ℓ ≥ q contribute OA (x−A ), we can drop the condition (ℓ, q) = 1 and
apply Lemma 8 with W = fˆ and H = x/q to see that the above is
Z ∞
√ √ √
C qx · π |fˆ(y)|2 y dy + O((x/q)−ε/8 xq).
0
Hence
Z ∞ Z ∞
ˆ 2√ √
|f (y)| y dy − |F̂ (y)|2 y dy
0 0
Z ∞
√
≪ min{(x/q)−ε/4 , y −1}(1 + y)−1 y dy ≪ (x/q)−ε/8 .
0
Putting these estimates together, and using the relation |F̂ (ξ)| = |S(ξ)| and the
integral identity (21), we see that (51) is
√ √
C qx + O((x/q)−ε/8 xq).
with V a smooth non-negative compactly supported function such that V (x) ≥ 1 for
x ∈ [−2, 2] and D1 , D2 > x1/2−ε/6 /q 1/2 and N1 ≤ xε/3 D22 q/x and N2 ≤ xε/3 D12 q/x.
We now split into two cases according to the size of D1 D2 :
6.3.3. Case D1 D2 ≥ x1+2ε /q. In this case we do not use the condition (n1 , n2 ) 6∈ M.
Dropping this condition and using Dirichlet characters we can re-write (52) as
(53)
x2 1 X X n X
1
n
2
2 2
χ(n1 )V χ(n2 )V
qϕ(q) D1 D2 2 n
N1 n
N2
χ 6=χ0 1 2
X d X d x2N1 N2
2 1 2 2
× χ (d1 )V χ (d2 )V +O 2 ·
d1
D1 d2
D2 q D1 D2
and where the O(·) term corresponds to the contribution of the characters with
χ2 = χ0 . Note that this contribution is acceptable since
x2 N1 N2 √
2
· ≪ x2ε/3 D1 D2 ≪ x2ε/3 z ≪ x−ε/3 qx.
q D1 D2
30 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
Now we express each of the sums in (53) using a contour integral, and using Hölder’s
inequality this allows us to bound (53) by
√ Z
x2 N1 N2 D1 D2 X √
2
· 2 2
|L( 21 + iu, χ)|4du + x−ε/3 qx.
q D1 D2 χ |u|≤xε/3
where
n √ √ √n x + √n y o
1 2
Tm+ 2 2m−2
= (x, y) ∈ Z : f (x, y) = qℓ and n1 x > n2 y and εn1 n2 ≤ √ √ < ε2m
n1 n2
n1 x − n2 y
and
n √ √ √n x + √n y o
1 2
Tm− 2 2m−2
= (x, y) ∈ Z : f (x, y) = qℓ and n1 x < n2 y and εn1 n2 ≤ √ √ < ε2m
n1 n2
n1 x − n2 y
= {(x, y) ∈ Z2 : (−x, −y) ∈ Tm+ }.
VARIANCE OF SQUAREFREE INTEGERS 31
√ √
The reason for this is that n1 x1 + n2 y1 ≪ x7ε (x/q)1/2 and
√ √ qℓ q 3/2
| n1 x1 − n2 y1 | = √ √ ≫ 1/2+7ε ≫ 1.
n1 x1 + n2 y1 x
Moreover by Lemma 13 of [MW02] we have #Tm+ = #T1+ for all m ≥ 1, and trivially
#Tm− = #Tm+ for all m ≥ 1.
The solutions belonging to T1+ are primary for the quadratic form n1 x21 − n2 x22 of
discriminant 4n1 n2 (see p. 101 of [SW06] for the definition of primary). By Theorem
4.1 of [SW06] the number of (x1 , y1 ) for which there exists a quadratic form g of
discriminant 4n1 n2 such that g(x1 , y1) = qℓ and such that (x1 , y1 ) is primary for g,
is either 0 or given by
Y
1 4n1 n2 /m2
X
d0
m 1− · ,
p p qℓ
k
p|m k|
m2
2
for particular integers m and d0 with m | (qℓ, 4n1 n2 ). Using the divisor bound
#{k : k | n} ≪ε nε/100 , we find that this is
≪ (n1 n2 )1/2+ε/100 (qℓ)ε/100 ≪ x8ε .
We conclude therefore that #T1+ ≪ x8ε and therefore the number of solutions (x1 , y1 )
with |x1 |, |y1| ≪ (x/q)1/2 x3ε to the equation f (x1 , y1 ) = qℓ is bounded by ≪ log x ·
#T1+ ≪ x9ε as claimed. It follows therefore that the total number of solutions to
n1 d21 − n2 d22 = qℓ with ni ∼ Ni , di ∼ Di for i = 1, 2 is ≪ x1+36ε /q 2 .
We conclude therefore that (52) is
x2 1 x1+36ε x1+40ε √
≪ · 2 2· 2
≪ ≪ x−ε qx
q D1 D2 q q
since q > x1/3+30ε .
Expressing the condition nd2 ≤ x using a contour integral (see [MV07, Cor. 5.3])
the above is bounded by
Z
1 X 2 x1/2+it 2
≪ log x √ sup √ 2
1
L( 2 +it, χ)M(1+2it, χ )· dt +O((x/q)1/2−ε/8 ),
z≤D≤ x ϕ(q) χ6=χ |t|≤x 1/2 + it
0
32 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
(in fact a better error term can be obtained but we do not need to keep track of it)
where
X µ(d)χ2 (d)
M(1 + 2it, χ2 ) = .
d∼D
d1+2it
Applying Cauchy-Schwarz and splitting according to the values of t we can bound
the main term above as
Z
3 1 X 1 T
(54) ≪ x(log x) √ sup √ 2
|L( 12 + it, χ)|2 · |M(1 + 2it, χ2 )|2 dt.
z≤D≤ x ϕ(q) χ6=χ T −T
0
1≤T ≤x
Let us now prove the claim on the assumption of the Generalized Lindelöf Hy-
pothesis. Applying Generalized Lindelöf and then the hybrid mean-value theorem
(Lemma 7) we have for any choice of δ > 0,
Z
x X 1 T
|L( 12 + it, χ)|2 |M(1 + 2it, χ2 )|2 dt
ϕ(q)2 χ6=χ T −T
0
Z
x(qT )δ X T
(55) ≪ 2 |M(1 + 2it, χ2 )|2 dt
q T χ6=χ −T
0
δ 2δ
xT q 1 δ 2δ x x
≪ 2 (qT + D) · ≪T q + .
q T D qD q 2 T
Since q, T ≤ x ≤ (x/q)O(1) , for sufficently small δ we have T δ q 2δ ≤ (x/q)ε/100 .
Recalling also that D ≥ z 1/2 ≥ (x/q)(1+ε)/2 and q ≥ x1/3+ε , we see that (55) is
ε/100 x 1/2−ε/2 x
≪x + 2 ≪ (x/q)1/2−ε/3 .
q q
Applying this estimate to (54) yields the claim.
Let us now consider the unconditional part of the claim. Let
ST,q (V ) := {(t, χ) : V ≤ |M(1 + 2it, χ2 )| ≤ 2V , |t| ≤ T , χ (mod q)}.
√
Note that for D ≥ z ≥ (x/q)1/2+ε , the values of t ∈ [−T, T ] for which |M(1 +
2it, χ2 )| ≤ D −1/2+ε/4 contribute to (54) by Cauchy-Schwarz and the fourth moment
bound (Lemma
P 4) O((log x)5 xD −1+ε/2 /q) = O((x/q)1/2−ε/2 ). Additionally,√ |M(1
√+
2it)| ≤ d∼D 1/d ≤ 2. Therefore it suffices to show that for each x ≥ D ≥ z,
V ∈ [D −1/2 , 1], and T ∈ [1, x], we have
Z
xV 2 X 1 √
(56) |L( 12 + it, χ)|2 dt ≪ (x/q)−ε/8 (log x)−4 · qx.
ϕ(q) χ6=χ T t : (t,χ)∈ST,q (V )
0
VARIANCE OF SQUAREFREE INTEGERS 33
By Lemma 2 we have,
Here |ST,q (V )| is the measure of ST,q (V ), where the set of χ (mod q) is endowed with
the counting measure. Consider first the case when the first term dominates. Then,
by Lemma 6, we see that the left-hand side of (56) is
xV 2 |ST,q (V )| x 1 x √
≪ · ·(qT )1/3+ε/4 ≪ · ·(qT )1/3+ε/3 ≪ 2/3−ε/2 ≪ (x/q)−ε/8 (log x)−4 · qx
ϕ(q) T ϕ(q) T q
since q > x3/7+ε . Note that the factor (log x)18 in (57) was absorbed in the exponent
of qT .
Consider now the case that the second term dominates in (57). Then by Cauchy-
Schwarz and the hybrid fourth moment estimate (Lemma 4),
Z
xV 2 X 1
|L( 12 + it, χ)|2 dt
ϕ(q) χ6=χ T t : (t,χ)∈ST,q (V )
0
xV 2 XZ T 1/2
1/2 1 4
≪ · |ST,q (V )| · |L( 2 + it, χ)| dt
T ϕ(q) χ6=χ −T
0
8.1. Proof that the Riemann Hypothesis implies (7). The proof follows the
same ideas as the proof of Proposition 2. The claim (7) is already proved for H ≤
X 2/3−ε , so we may assume H > X 2/3−ε . We return to (41) and consider first the
case D ≥ H (1−δ)/2 . Note that the Riemann Hypothesis implies
for |t| ≤ X 2 . Now (41), Cauchy-Schwarz and the fourth moment bound for the
Riemann zeta function (Lemma 3) imply
Z
1 2X X X µ(d) 2
µ(d) − H 2
dx ≪δ (log X)2 H/D 1−δ .
X X 2
d
x<nd ≤x+H d∼D
d∼D
2 /2
For D ≥ H (1−δ)/2 , the right-hand side is ≪δ (log X)H 1/2+δ−δ . Splitting dyadically
for D ∈ [H (1−δ)/2 , X 1/2 ] and using the tail bound
X µ(d) 1
2
≪δ 3/4−δ/10 ,
2
d X
d >2X
valid under Riemann Hypothesis, we see that the Riemann Hypothesis implies
Z
1 2X X X µ(d) 2
µ(d) − H dx
X X 2 2 1−δ
d2
x<nd ≤x+H d ≥H
d2 ≥H 1−δ
Z 2X X X µ(d) 2
2 1
(59) ≪ (log X) sup µ(d) − H dx
H (1−δ)/2 ≤D≤X 1/2 X X d2
x<nd2 ≤x+H d∼D
d∼D
Z 2X X µ(d) 2
1
+ H 2
dx ≪δ H 1/2+δ .
X X 2
d
d >2X
On the other hand, estimating the n-sum on the left-hand side by H/d2 + O(1),
we see that
Z
1 2X X X µ(d) 2
µ(d) − H 2
dx ≪ H 1/2 .
X X 2 2 1/2
d
x<nd ≤x+H d ≤H
d2 ≤H 1/2
Hence the claim follows once we have shown that, for any D ∈ [H 1/4 , H (1−δ)/2 ], we
have Z
1 2X X X µ(d) 2
µ(d) − H 2
dx ≪δ H 1/2 .
X X 2
d
x<nd ≤x+H d∼D
d∼D
Notice that we can attach to the n variable a dummy function f (nD 2 /X) with f a
smooth function supported in [1/20, 20] and such that f (y) = 1 for y ∈ [1/10, 10].
Similarly to the proof of Propositon 2, write
X n X µ(d)
A(x) := f µ(d) − x .
2
X/D 2 d2
nd ≤x d∼D
d∼D
VARIANCE OF SQUAREFREE INTEGERS 35
Plugging this into (60) and arguing as in the proof of Proposition 2, we see that, for
some w ≍ H/X, we have
Z
1 2X
|A(x + H) − A(x)|2 dx
X X
(61) Z w( 1 +it)
e 2 − 1 2 2
≪X 1 · N1 ( 21 + it) − N2 ( 12 + it) M(1 + 2it) dt.
R 2
+ it
By Poisson summation
X 1 m XZ ∞ 1 u
1
N1 ( 2 + it) = 1/2+it
· f 2
= 1/2+it
· f 2
e(ℓu)du
m
m X/D −∞ u X/D
ℓ
Z 1/2+it
1 X X 20
D2 ℓyX
= N2 ( 2 + it) + 2 f (y)e dy.
D ℓ6=0 1/20 yX D2
By partial integration (taking antiderivatives of e(ℓyX/D 2)), this implies that, for
|t| < X/D 2+δ/100 ,
|N1 ( 12 + it) − N2 ( 21 + it)| ≪A X −A ,
for any A > 0. Therefore the part of the integral (61) with |t| < X/D 2+δ/100 is
completely negligible.
On the other hand the part with |t| ≥ X 10 contributes only O(1) to the left-hand
side of (61) by estimating |Nj (1/2 + it)| and |M(1 + it)| trivially.
36 OFIR GORODETSKY, KAISA MATOMÄKI, MAKSYM RADZIWILL, BRAD RODGERS
and
|M(1 + 2it)| ≪δ D −1/2+δ/100 .
Furthermore, by partial integration we have, for |t| ∈ [X/D 2+δ/100 , X 10 ],
|N2 ( 12 + it)| ≪δ X δ/100
Hence the part with |t| ∈ [X/D 2+δ/100 , X 10 ] contributes to (61)
Z
1 δ/50 −1+δ/50
≪δ X 2
X D dt ≪δ DX δ/10 ≪δ H 1/2
X/D 2+δ/100 ≤|t|≤X 10 |t|
since D ≤ H (1−δ)/2 and H ≥ X 1/2 .
8.2. Proof that (7) implies the Riemann Hypothesis. Suppose that (7) holds
for H = X 1−δ . Then, by Cauchy-Schwarz,
Z X b
x 1 2 6X Φ(0)
Φ µ (m) dx = 2
+ Oδ (X 1/4+3δ ).
R X H x<m≤x+H π
with Φ an arbitrary, but not identically zero smooth function compactly supported
b
in [1/2, 3] (one could even enforce that Φ(0) = 0 to simplify the above expression
but we didn’t find any significant advantage in doing this). Therefore,
Z 2+i∞ b
1 ζ(s) s 6X Φ(0)
(62) X · ΨH/X (s)ds − = Oδ (X 1/4+3δ ),
2πi 2−i∞ ζ(2s) π2
where uniformly in 1/100 < ℜs < 100, for any given A > 1,
Z
1 X H
ΨH/X (s) := · Φ x− − Φ(x) xs dx
s H R X
Z
1 X (−1) j
H j−1
(63) = · Φ(j) (x)xs dx + OA (X −δA )
s 1≤j≤A j! X R
Z
1 ′ s H −A −δA
=− Φ (x)x dx + OA · (1 + |ℑs|) + X .
s R X
By integration by parts the main term is equal to Φ(s)e e
where Φ(s) is the Mellin
transform of Φ. The reader may also verify that we have the exact relation ΨH/X (1) =
e
Φ(1).
VARIANCE OF SQUAREFREE INTEGERS 37
Suppose that the Riemann Hypothesis fails. Then ζ(s)/ζ(2s) has a pole in the
strip 14 < σ < 21 (e.g. s = ρ/2 with ρ = β + iγ the zeros of ζ(s) with smallest γ > 0
among all zeros of ζ(s) with β ∈ ( 12 , 1)). Let Θ > 14 denote the supremum of the
real part of poles of ζ(s)/ζ(2s) lying in the strip 14 < σ < 21 . Choose δ > 0 to be
sufficiently small so that 14 + 3δ ≤ Θ − δ/2.
Pick now s0 a pole of ζ(s)/ζ(2s) with ℜs0 ∈ (Θ−δ/50, Θ] and the smallest positive
imaginary part. We can assume without loss of generality that Φ is chosen so that
e 0 ) 6= 0. Indeed if it were the case that Φ(s
Φ(s e 0 ) = 0 then pick a c ∈ (0, 1) such that
e + s0 ) 6= 0 and consider xc Φ(x) in place of Φ(x).
Φ(c
We shall shift the contour in (62) to the line σ = Θ + δ/8. Note that for any
fixed values of X and H, from the definition (63) and integration by parts, we have
ΨH/X (s) ≪A (1+|ℑs|)−A uniformly for 1/100 < ℜs < 100 for all A ≥ 1. Furthermore
for ℜs ∈ [σ, 2] and s bounded away from 1 we get the bound ζ(s)/ζ(2s) ≪δ (1 +
|ℑs|)C , where C is a constant which depends only on δ. (This follows because we
may bound ζ(s) using a convexity bound (see e.g. [Tit86, Sec. 5.1]) and we may
bound 1/ζ(2s) using the estimate log ζ(2s) ≪δ log(|ℑ(2s)| + 2) in this region, which
follows from a well-known estimate on the logarithmic derivative of the zeta function
(e.g. [Tit86, Thm 9.6 (A)]) and the fact that in this region |2s − ρ′ | ≥ δ/8 whenever
ζ(ρ′ ) = 0.) Thus
Z Z
1 ζ(s) s 1 ζ(s) s 6
X · ΨH/X (s)ds = X · ΨH/X (s)ds + 2 XΨH/X (1).
2πi (2) ζ(2s) 2πi (σ) ζ(2s) π
we have for X ≥ 1,
Z
1 ζ(s) e
A(X) = · X s Φ(s)ds = Oδ (X Θ−δ/2 ).
2πi (σ) ζ(2s)
for all x ≥ 0 (note that for 0 < x < 1/100 we have that A(x) vanishes). Let us start
by observing that for ℜs > 1,
Z ∞ X Z ∞ b
−s−1 2 n −s−1 6Φ(0) 1
A(x)x dx = µ (n) Φ x dx − 2
·
0 n≥1 0 x π s−1
X e
6Φ(1) 1
(65) = e
µ2 (n) · n−s Φ(s) − ·
π 2 s−1
n≥1
ζ(s) e e
6Φ(1) 1
= · Φ(s) − 2
· .
ζ(2s) π s−1
R∞
The function 0 A(x)x−s−1 dx is analytic in ℜs > Θ − δ/50 by (64). Therefore, by
(65) and analytic continuation,
ζ(s) e e
6Φ(1) 1
(66) · Φ(s) − 2
·
ζ(2s) π s−1
is analytic in the region ℜs > Θ − δ/50. This however contradicts that (66) has a
pole at s0 and ℜs0 ∈ (Θ − δ/50, Θ].
References
[ACS17] M. Avdeeva, F. Cellarosi, and Y. G Sinai. Ergodic and statistical properties of B-free
numbers. Theory of Probability & Its Applications, 61(4):569–589, 2017.
[AKS04] M. Agrawal, N. Kayal, and N. Saxena. PRIMES is in P. Ann. of Math. (2), 160(2):781–
793, 2004.
[Blo08] V. Blomer. The average value of divisor sums in arithmetic progressions. Q. J. Math.,
59(3):275–286, 2008.
[CE19] D. Carmon and A. Entin. On square-free values of large polynomials over the rational
function field. Mathematical Proceedings of the Cambridge Philosophical Society, page
1–17, 2019.
[Cro75] M. J. Croft. Square-free numbers in arithmetic progressions. Proc. London Math. Soc. (3),
30:143–159, 1975.
[CS13] F. Cellarosi and Y. G. Sinai. Ergodic properties of square-free numbers. Journal of the
European Mathematical Society, 15(4):1343–1374, 2013.
[FT92] M. Filaseta and O. Trifonov. On gaps between squarefree numbers. II. J. London Math.
Soc. (2), 45(2):215–221, 1992.
[GH91] G. R. Grimmett and R. R. Hall. The asymptotics of random sieves. Mathematika,
38(2):285–302, 1991.
[GR14] I. S. Gradshteyn and I. M. Ryzhik. Table of integrals, series, and products. Academic
press, 2014.
[Gra98] A. Granville. ABC allows us to count squarefrees. Internat. Math. Res. Notices, (19):991–
1009, 1998.
[Hal82] R. R. Hall. Squarefree numbers on short intervals. Mathematika, 29(1):7–17, 1982.
VARIANCE OF SQUAREFREE INTEGERS 39