Professional Documents
Culture Documents
Non-vanishing
of L-Functions
and Applications
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned,
specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on
microfilms or in other ways, and storage in data banks. For any kind of use, permission of the copyright
owner must be obtained.
degree and was awarded the doctorate after the requisite period of two years as a
student. Although he had been a part-time employee of the Mathematical Seminar
since 1948, he was not allowed to become a full member of the scientific staff until
1962. This despite his actually heading the department rather than just being a
staff member.
His own papers regularly appeared in the journals of the Barcelona Seminar,
Collectanea Mathematica, to which he was also an eminent reviewer and advisor.
On several occasions, he was consulted by the Proceedings of the American Society
of Mathematics as an advisor. He always participated in and supported guest
lectures in Barcelona, many of them having been prepared or promoted by him. On
the occasion of a conference in 1966, H. Mascart of Toulouse publicly pronounced
his feeling of beeing honoured by the presence of M. Sunyer Balaguer, “the first,
by far, of Spanish mathematicians”.
At all times, Sunyer Balaguer felt a strong attachment to the scientific ac-
tivities of his country and modestly accepted the limitations resulting from his
attitude, resisting several calls from abroad, in particular from France and some
institutions in the USA. In 1963 he was contracted by the US Navy, and in the
following years he earned much respect for the results of his investigations. “His
value to the prestige of the Spanish scientific community was outstanding and his
work in mathematics of a steady excellence that makes his loss difficult to accept”
(letter of condolence from T.B. Owen, Rear Admiral of the US Navy).
Twice, Sunyer Balaguer was approached by young foreign students who want-
ed to write their thesis under his supervision, but he had to decline because he was
unable to raise the necessary scholarship money. Many times he reviewed doctoral
theses for Indian universities, on one occasion as the president of a distinguished
international board. The circumstances under which Sunyer attained his scientific
achievements, also testify to his remarkable human qualities. Indeed, his manner
was friendly and his way of conversation reflected his gift for friendship as well
as enjoyment of life and work which went far beyond a mere acceptance of the
situation into which he had been born. His opinions were as firm as they were
cautious, and at the same time he had a deep respect for the opinion and work
of others. Though modest by nature, he achieved due credit for his work, but his
petitions were free of any trace of exaggeration or undue self-importance. The most
surprising of his qualities was, above all, his absolute lack of preoccupation with his
physical condition, which can largely be ascribed to the sensible education given
by his mother and can be seen as an indication of the integration of the disabled
into our society.
On December 27, 1967, still fully active, Ferran Sunyer Balaguer unexpectedly
passed away. The memory of his remarkable personality is a constant source of
stimulation for our own efforts.
Translated from Juan Augé: Fernando Sunyer Balaguer. Gazeta Matematica,
1.a Serie – Tomo XX – Nums. 3 y 4, 1968, where a complete bibliography can be
found.
Satyam Jnanam Anantam Brahma
Table of Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
ix
x Table of Contents
xi
Introduction
Since the time of Dirichlet and Riemann, the analytic properties of L-functions
have been used to establish theorems of a purely arithmetic nature. The distri-
bution of prime numbers in arithmetic progressions is intimately connected with
non-vanishing properties of various L-functions. With the subsequent advent of
the Tauberian theory as developed by Wiener and Ikehara, these arithmetical the-
orems have been shown to be equivalent to the non-vanishing of these L-functions
on the line Re(s) = 1.
In the 1950’s, a new theme was introduced by Birch and Swinnerton-Dyer.
Given an elliptic curve E over a number field K of finite degree over Q, they
associated an L-function to E and conjectured that this L-function extends to an
entire function and has a zero at s = 1 of order equal to the Z-rank of the group
of K-rational points of E. In particular, the L-function vanishes at s = 1 if and
only if E has infinitely many K-rational points.
The analytic continuation of the L-series associated to E has now been es-
tablished in the work of Wiles and his school for all elliptic curves which have
semistable reduction at 3 and 5. So it now makes sense to talk about the values of
the L-function of such curves for any point of the complex plane. In recent work of
V. Kolyvagin, it was necessary to have a quadratic twist of a given elliptic curve
whose L-function has a simple zero at s = 1.
This monograph is concerned with the non-vanishing of a general L-function,
with special emphasis on classical Dirichlet L-functions, Artin L-functions and
L-functions attached to modular forms.
The first technique to prove a theorem on non-vanishing arose in the work
of Hadamard and de la Vallée Poussin. It is based on the simple trigonometric
inequality
3 + 4 cos θ + cos 2θ ≥ 0.
The prime number theorem is a special case of the more general Chebotarev
density theorem. In Chapters 1 and 2, we trace the development of these ideas
and discuss in some detail analytic properties of Artin L-functions. The effective
Chebotarev density theorem, which plays such an essential role in many questions
of an arithmetic and diophantine nature, is also described.
In Chapter 3, we discuss a general formalism due to Serre. It becomes clear
that questions of uniform distribution reduce to questions about analytic contin-
uation of L-functions through the Tauberian theorems and an appropriate use of
trigonometric inequalities.
The subject of modular forms and their associated L-functions has its origin
in the works of Ramanujan and Hecke. After reviewing quickly some basic notions,
we discuss in Chapter 4 the theme of non-vanishing of modular L-functions and
their symmetric power analogues. This is done in the context of the Sato-Tate
conjecture. We also discuss the application of these ideas to questions of oscillation
of Fourier coefficients of cusp forms.
It is a ‘folklore’ conjecture that the classical Dirichlet L-function L(s, χ),
associated to a Dirichlet character χ(mod q) does not vanish at the central critical
point s = 1/2. As of 1995, this is still unproved. In Chapter 5, we discuss this
question from a variety of methods. First, one can consider averages such as
1 1
L( , χ) and |L( , χ)|2 .
2 2
χ mod q χ mod q
By developing asymptotic formulas for these averages, one can obtain the existence
of many characters χ for which L( 12 , χ)
= 0. To get stronger results, one considers
not the above averages, but sums which are weighted with an auxilliary function. In
work of Balasubramanian and K. Murty, it is shown that for each sufficiently large
prime q, the number of Dirichlet characters χ(mod q) such that L(1/2, χ) = 0 is at
least ≥ (.04)φ(q). This is the content of Theorem 5.1 of Chapter 5. The methods
are involved and based on the study of the averages
1 1
L( , χ)Mz ( , χ)
2 2
χ mod q
which converges absolutely in some half plane and extends to an analytic function
in the region Re(s) > c. Suppose further that all the twists
∞
an χ(n)
f (s, χ) =
n=1
ns
by Dirichlet characters χ(mod q) have the same property. Given s0 ∈ C such that
Re(s0 ) > c, does there exist χ(mod q) such that f (s0 , χ)
= 0? To answer this, it is
natural to study
f (s0 , χ)
χ mod q
and determine its asymptotic behaviour. More generally, one can study
cχ f (s0 , χ).
q≤Q χ mod q
Such a study was necessary in the recent work of V. Kolyvagin on the Birch and
Swinnerton-Dyer conjecture. In his situation f (s) was the L-function of a modular
elliptic curve, s0 = 1 and cχ = 0 unless χ is of order 2. We derive asymptotic
formulas for such sums in Chapter 6.
This technique has been amplified and expanded in many works such as that
of Iwaniec, Luo-Rudnick-Sarnak, Barthel-Ramakrishnan, K. Murty-Stefanicki, and
Y. Zhang.
There are at least two more important techniques of non-vanishing of L-
functions that are not discussed in this book. One is the method of Rohrlich which
can be termed ‘Galois theoretic’. The other is the ‘automorphic method’ of Bump,
Friedberg and Hoffstein.
The important topic of general automorphic L-functions is not touched in this
monograph. In this connection, we refer the reader to the monograph of Gelbart
and Shahidi.
Finally, in Chapter 7, we discuss Selberg’s conjectures concerning Dirichlet
series with Euler products and functional equations. These conjectures imply that
no element of the Selberg class vanishes on the line Re(s) = 1. Most likely, the
Selberg class coincides with the class of automorphic L-functions. An intriguing
pathway of research is to compare and contrast these two points of view.
Chapter 1
The Prime Number Theorem
and Generalizations
π(x)
lim = 1.
x→∞ x/ log x
Their method had its origins in a fundamental paper of Riemann written by him
in 1860. That paper outlines a ‘program’ for proving the prime number theorem.
It begins by introducing the ζ function which is defined for Re(s) > 1 as
∞
1
ζ(s) = s
.
n=1
n
Riemann then proceeds to show that (s − 1)ζ(s) extends to an entire function and
satisfies a functional equation
s 1−s
π −s/2 Γ( )ζ(s) = π −(1−s)/2 Γ( )ζ(1 − s).
2 2
In addition, ζ(s) can be written as an infinite product over the prime numbers p:
1
−1
ζ(s) = 1− s Re(s) > 1. (1)
p
p
This equality is an analytic reformulation of the fact that every natural number is
a product of prime numbers in an (essentially) unique way. Because the product
is absolutely convergent in Re(s) > 1, equation (1) also reveals that ζ(s) does not
vanish in this half-plane. Earlier, Euler1) had noticed that unique factorization of
the integers could be written in this way as well as the functional equation for
the zeta function, but he treated ζ(s) as a function of a real variable. Riemann
emphasized that many intricate questions about the distribution of prime numbers
can, by virtue of the above identity, be translated into complex analytic questions
involving the ζ function.
It took several decades to vindicate Riemann’s approach and put it on rigor-
ous footing. Many new ideas of complex analysis were discovered and developed as
a consequence. By the time Hadamard and de la Vallée Poussin completed their
proof, there was a general method in place for tackling all such questions. At the
heart of their proof is the fact that the zeta function does not vanish on the line
Re(s) = 1. As it later transpired, this non-vanishing theorem is equivalent to the
prime number theorem.
It is interesting to note that Hadamard, in his characteristic humility, writes,
“Stieltjes avait démontré que tous les zéros imaginaires de ζ(s) sont (conformément
aux prévisions de Riemann) de la forme 1/2+it, t étant réel; mais sa démonstration
n’a jamais été publiée. Je me propose simplement de fair voir que ζ(s) ne saurait
avoir de zero dont la partie réele soit égale à 1.” (Oeuvres, p. 183).
We now understand this in a better light. The dominant theme that arises
from the papers of Riemann, Hadamard, and de la Vallée Poussin is the following.
Suppose we are given a sequence of complex numbers an and we would like to
know the behaviour of
an .
n≤x
∞
an
f (s) =
n=1
ns
as a function of a complex variable and infer from the analytic properties the
desired behaviour of the summatory function. Indeed, suppose that all the an ’s
are bounded by some constant C. Then the associated Dirichlet series defines an
analytic function for Re(s) > 1. Suppose further that the series can be continued
analytically to Re(s) > 1 − δ where δ > 0. Beginning with the fundamental line
integral
c+i∞ s 1 if x > 1
1 x
ds = 1/2 if x = 1
2πi c−i∞ s
0 if x < 1
1) Equation (1) is referred to as an Euler product. More general Euler products will
be introduced later in the chapter.
§1 The Prime Number Theorem 7
as x→∞.
We relegate the proof of this theorem to later in this chapter. For the mo-
ment, we will quickly proceed to deduce the prime number theorem from the
non-vanishing of ζ(s) on Re(s) = 1.
8 Chapter 1 The Prime Number Theorem and Generalizations
∞ ∞
an A(n) − A(n − 1)
s
=
n=1
n n=1
ns
∞
1 1
= A(n) −
n=1
ns (n + 1)s
∞ n+1
dx
=s A(n) s+1
n=1 n x
∞
A(x)
=s dx.
1 xs+1
In particular,
∞ ∞
[x] s {x}
ζ(s) = s dx = −s dx (2)
1 xs+1 s−1 1 xs+1
where [x] denote the greatest integer less than or equal to x and {x} = x − [x].
Since the fractional part of x is less than 1, the integral
∞
{x}
dx
1 xs+1
where
log p if n is a power of the prime p
Λ(n) =
0 otherwise
denotes the von Mangoldt function (after the mathematician who introduced the
notation). From the second equation in (2), we see that −ζ (s)/ζ(s) has a simple
pole at s = 1 with residue 1. If in addition we knew that ζ(s) does not vanish
on Re(s) = 1, then −ζ (s)/ζ(s) is represented in the region Re(s) > 1, by a
Dirichlet series with non-negative coefficients and has a meromorphic continuation
to Re(s) ≥ 1 with only a simple pole at s = 1 with residue 1. Applying the Wiener-
Ikehara Tauberian theorem, we deduce that
Λ(n) = x + o(x)
n≤x
§1 The Prime Number Theorem 9
as x→∞. It is now an easy exercise to deduce the prime number theorem from
this asymptotic formula (see exercise 1).
Hadamard’s proof that ζ(1 + it) = 0 for t ∈ R, is exceedingly simple and can
be explained intuitively as follows (see [Ka]). Let us write,
∞
an
log ζ(s) =
n=1
ns
Since
3 + 4 cos θ + cos 2θ = 2(1 + cos θ)2 ,
we see that the right side of equation (3) is non-negative. Hence,
2) The traditional proof will use the traditional notation, a curious combination of
Greek and Roman letters, in writing s = σ + it with
√ σ denoting the real part of s
and t the imaginary part. i will of course denote −1.
10 Chapter 1 The Prime Number Theorem and Generalizations
Proof. Suppose f has a zero at 1 + it0 of order k > e/2. Then, e ≤ 2k − 1. Consider
the function
2k
g(s) = f (s)2k+1 f (s + ijt0 )2(2k+1−j)
j=1
|g(σ)| ≥ 1.
as x→∞. The famous Riemann hypothesis asserts that ζ(s) = 0 for Re(s) > 1/2.
If we assume this hypothesis, one can prove by methods of contour integration the
formula
ψ(x) = x + O(x1/2 log2 x).
After this brief discussion, we are now ready to prove Theorem 1.1.
Proof of Theorem 1.1. If the an are real, it suffices to prove the theorem for F (s),
for then one can apply such a result to F (s) − f (s) which is a Dirichlet series with
non-negative coefficients in the region Re(s) > 1.
If not all the an are real, then set
∞
∗
f (s) = an /ns
n=1
sin2 λx
Kλ (x) =
λx2
has Fourier transform
√
|x|
K̂λ (x) = 2 2π 1 − 2λ if |x| ≤ 2λ
0 otherwise.
∞
Now let F (s) = s
≥ 0 be as above and define an analytic
n=1 bn /n , bn
function for Re(s) > 1. Put B(x) = n≤x bn . Replacing bn by bn /R, we can
suppose without loss of generality that R = 1. Then, by partial summation, we
see that for Re(s) > 1, ∞
B(x)
F (s) = s dx.
1 xs+1
Set x = eu . Then ∞
F (s)
= B(eu ) e−us du.
s 0
Note that ∞
1
e−u(s−1) du = .
0 s−1
Hence, putting s = 1 + δ + it, δ > 0, we get
∞
F (1 + δ + it) 1
− = (B(eu ) e−u − 1) e−uδ e−iut du.
1 + δ + it s−1 0
Set
F (1 + δ + it) 1
g(u) = B(eu )e−u , hδ (t) = − ,
1 + δ + it s−1
and
F (1 + it) 1
h(t) = − (s = 1 + it),
1 + it s−1
which is regular for t in R.
Our goal is to prove√that g(u) → 1 as u → ∞. The above formula says that
the Fourier transform of 2π(g(u) − 1)e−uδ is hδ (t). Applying Parseval’s formula,
we deduce ∞ ∞
(g(u) − 1)e−uδ Kλ (u) du = hδ (t)K̂λ (t) dt.
−∞ −∞
§1 The Prime Number Theorem 13
Since K̂λ has compact support, the limit as δ → 0 of the right hand side exists.
The same is true of the left hand side. Hence
∞ ∞
(g(u) − 1))Kλ (u − v) du = h(t)K̂λ (t)eitv dt.
−∞ −∞
Thus, ∞
lim g(u)Kλ (u − v) du = π.
v→∞ −∞
We can now prove the theorem. Since B(x) is monotone increasing, we see
that
g(u2 ) ≥ g(u1 )eu1 −u2 , u1 ≤ u 2 .
√
Thus, for |α| ≤ λ, we have
α 1 − √1 + α 1 − √2
g v− ≥g v− √ e λ λ ≥g v− √ e λ.
λ λ λ
Since √
λ
α sin2 α
lim sup √ g v− dα ≤ π,
v→∞ − λ λ α2
we deduce,
1 − √2 π
lim sup g v − √ e λ ≤ √ .
v→∞ λ √λ sin2α
2 dα − λ α
Since
α 1 1
v+ √1 −v+ α √2
g v− ≤g v+ √ e λ λ
≤g v+√ e λ,
λ λ λ
we obtain
√λ
1 √2 sin2 α 1
lim inf g v + √ e λ √ 2
dα ≥ π + O( √ ),
v→∞ λ − λ α λ
so that
lim inf g(v) ≥ 1,
v→∞
we deduce limv→∞ g(v) = 1 as needed. This completes the proof of the theorem.
Theorem 1.3. Suppose that the function F (s) has the following properties:
(a) there exists β > 0 such that for Re(s) > β,
∞
F (s) = s B(u)e−us du
0
H(s)
F (s) = and H(β) = cΓ(α + 1)
(s − β)α+1
as u → ∞.
§2 Primes in Arithmetic Progression 15
as x → ∞.
This formulation is useful in most applications. There are
other variations
(e.g. see Ellison [p. 64–65]) where upper and lower bounds for n≤x bn can be
obtained from the knowledge of analytic continuation for Re(s) ≥ β, |Im(s)| ≤ T .
We can apply this to the Riemann zeta function. Indeed,
∞ ∞
[x] s {x}
ζ(s) = s dx = − s dx
1 xs+1 s − 1 1 xs+1
gives a meromorphic continuation of ζ(s) for Re(s) > 0 with only a simple pole at
s = 1 and residue 1. If dk (n) denotes the number of ways of writing n as a product
of k positive numbers, it is easily seen that
∞
k dk (n)
ζ (s) = .
n=1
ns
as x → ∞.
where the summation is over primes p ≡ a(mod q). He first proved his theorem
for prime modulus q and then a year later, treated the general case. In the course
of this discovery, he contributed two fundamental ideas: (a) the beginnings of the
theory of group characters and (b) the celebrated class number formula. The first
was essential in ‘sifting’ the primes in a given arithmetic progression. The second
was used to establish the non-vanishing of certain of his L-functions needed in his
proof.
If (Z/qZ)∗ is the multiplicative group of coprime residue classes, let
χ : (Z/qZ)∗ →C∗
be a homomorphism. (These are called Dirichlet characters.) One now defines for
any n ∈ Z,
χ(n) = χ(n mod q) if (n, q) = 1
0 otherwise
and (by abuse of language) we also call these Dirichlet characters. There are φ(q)
such characters where φ is Euler’s function and we denote by χ0 the trivial char-
acter. Analogous to the Riemann zeta function, we define
∞
χ(n)
L(s, χ) = .
n=1
ns
Since |χ(n)| ≤ 1 for all values of n, the series converges absolutely for Re(s) > 1.
If we write
S(x) = χ(n),
n≤x
If χ
= χ0 , then
χ(n) = 0
n mod q
so that we easily see |S(x)| ≤ q upon partitioning the interval [1, x] into subinter-
vals equal to or at most of length q. Hence, (4) converges for Re(s) > 0 and this
gives us an analytic continuation for L(s, χ) when χ
= χ0 , in this half-plane.
As in the case of the Riemann zeta function, the multiplicativity of χ(n) and
the unique factorization of the natural numbers combine to give the Euler product:
χ(p)
−1
L(s, χ) = 1− s .
p
p
§2 Primes in Arithmetic Progression 17
and hence
lim log L(s, χ0 ) = +∞
s→1+
since the contribution for k ≥ 2 in the penultimate sum remains bounded as s→1+ .
We will now establish the non-vanishing of L(s, χ) on Re(s) = 1 by appeal-
ing to Theorem 1.2. But before we do, we need the following lemma, which is a
variation on a well-known result of Landau (see exercise 5).
∞ s
Lemma 2.1. Let g(s) = n=1 an /n be a Dirichlet series where a1 = 1 and
an ≥ 0. Suppose that the series is absolutely convergent in Re(s) > 1. Suppose
further that g(s) admits an analytic continuation to Re(s) ≥ 1/2. Then g(1/2)
= 0.
Proof. We can expand g(s) as a Laurent series about the point s = 2 in a disc
|s − 2| < 3/2:
∞
g (m) (2)
g(s) = (s − 2)m .
m=0
m!
where bm ≥ 0. Hence,
∞
bm
g(s) = (2 − s)m
m=0
m!
g(s) ≥ g(2) ≥ 1
since all the terms are non-negative. Taking lims→1/2+ g(s) gives g(1/2) ≥ 1. This
establishes the result.
Theorem 2.2. L(1 + it, χ)
= 0 for all t ∈ R and all χ
= χ0 .
Proof. We will apply Theorem 1.2 to the function
f (s) = L(s, χ).
χ
which is a Dirichlet series with non-negative coefficients because the Euler product
is supported only at primes where χ(p) = +1. Note that if L(1, χ) = 0, then this
cancels the pole of L(s, χ0 ) at s = 1 so that g(s) is regular there. In fact, g(s)
is regular for Re(s) ≥ 1/2 since the numerator is regular for Re(s) ≥ 0 and
the denominator does not vanish for Re(s) ≥ 1/2 essentially by Theorem 1.1.
Moreover, since L(2s, χ0 ) has a simple pole at s = 1/2, g(s) has a zero at s = 1/2
which contradicts Lemma 2.1. This completes the proof.
§3 Dedekind’s zeta function 19
where the product is over all prime ideals of K. Notice that this product shows
ζK (s)
= 0 for Re(s) > 1. Hecke proved in 1917, that (s − 1)ζK (s) extends to an
entire function such that
2r1 (2π)r2 hR
lim (s − 1)ζK (s) = κ =
s→1+ w |dK |
where r1 is the number of real conjugate fields, 2r2 is the number of complex
conjugate fields, h is the class number, R is the regulator, w is the number of roots
of unity and dK is the discriminant of K. Moreover, ζK (s) satisfies a functional
equation
ξK (s) = ξK (1 − s)
where s
|dK |
ξK (s) = Γ(s/2)r1 Γ(s)r2 ζK (s).
2r2 π n/2
The functional equation allows one to write the residue in a slightly elegant form
ζK (s) hR
lim r
=−
s→0 s w
where r = r1 + r2 − 1. By the Tauberian Theorem 1.1, we immediately deduce
20 Chapter 1 The Prime Number Theorem and Generalizations
as x→∞. We now deduce the result by partial summation (see exercise 1).
Let us consider the special case K = Q(i). If r(n) denotes the number of
ways of writing n as a sum of two integer squares, then Theorem 3.1 gives
r(n) ∼ πx
n≤x
as x→∞, since Z[i] is a unique factorization domain and thus has class number 1.
It is also not difficult to see that a rational prime p is the norm of a prime ideal
Z[i] if and only if p can be written as the sum of two integral squares. If p can
be so written, and p is odd, there are exactly two prime ideals of Z[i] of norm p.
Thus, Theorem 3.2 in this case proves that the number of primes p ≤ x which can
be written as the sum of two squares is
1 x
∼
2 log x
as x→∞.
§4 Hecke’s L-functions 21
§4 Hecke’s L-functions
We begin by constructing the analogues of Dirichlet’s L-functions. We first need
to define the notion of “ideal classes” and then define characters of these classes.
Let K be an algebraic number field and f an ideal of OK . A natural starting
point is to consider the ideal class group and to define characters of this group.
One can generalize this to obtain a notion of ideal classes mod f as follows. The
multiplicative group generated by all ideals coprime to f will be denoted by I(f).
The principal ray class P (f) (mod f) is the subgroup of principal ideals of the form
(α/β) with
(i) α, β ∈ OK and coprime to f;
(ii) α ≡ β(mod f);
(iii) α/β is totally positive (that is, all its real conjugates are positive).
The quotient group G(f) = I(f)/P (f) is called the ray class group mod f. The
elements of this group are called ray classes. These will be considered as analogues
of the groups (Z/mZ)∗ in the rational number field case. Let us note that without
the totally positive condition, the construction leads to (Z/mZ)∗ /{±1} if K = Q
and f = (m).
Let χ be a character of the abelian group G(f). Define
χ(a)
L(s, χ) =
a
Nas
where the sum is over integral ideals a of K. This series converges absolutely for
Re(s) > 1 as is seen by comparing with the Dedekind zeta function which converges
absolutely in that region. We again have the Euler product:
χ(p)
−1
L(s, χ) = 1−
p
Nps
which is valid for Re(s) > 1. This product shows the non-vanishing of L(s, χ) in
that region. If χ
= χ0 , the trivial character, Hecke showed that L(s, χ) extends to
an entire function.
By considering
f (s) = ζK (s)L(s, χ)L(s, χ)L(s, χχ)
and applying Theorem 1.2, we deduce that f (s) does not vanish on Re(s) = 1 for
s
= 1. In the latter case, we consider
g(s) = ζK (s)3 L(s, χ)4 L(s, χ2 )
which has non-negative coefficients. We can apply Theorem 1.2 again to get the
non-vanishing of L(1, χ) provided χ2
= 1. In the last case, we need to consider as
before
ζK (s)L(s, χ)
h(s) =
ζK (2s)
and apply the reasoning as before. This allows us to deduce
22 Chapter 1 The Prime Number Theorem and Generalizations
Theorem 4.1. The number of prime ideals with norm less than x, in a given ray
class is
1 x
|G(f)| log x
as x→∞.
We can consider a more general situation. Recall that each finite prime p
defines a valuation vp : K→Z given by vp (α) = the exponent of p in the prime
ideal decomposition of the principal ideal (α). We can extend this definition to
ideals. A generalized ideal a of K is an ideal af together with a set of embeddings
{σ1 , . . . , σi } of K into R. We will say that α ∈ K satisfies the congruence α ≡
1(mod a) if α is a unit at all the primes dividing af , vp (α − 1) ≥ vp (af ) and
σj (α) > 0 for j = 1, . . . , i. We can now define G(a) as the quotient group of
fractional ideals coprime to af modulo principal ideals (α) with α ≡ 1(mod a).
Hecke defined L-series for characters of these generalized ideal class groups and an
analogue of Theorem 4.1 is true for prime ideals in a given ideal class.
We recomend that the reader study the theory of Hecke L-functions as ex-
plained, for example, in Lang [L], both from the classical and adelic points of view
(Tate’s thesis).
Exercises
1. Let f (t) have a continuous derivative f (t), for t ≥ 1. Let cn for n ≥ 1 be
constants and let
C(u) = cn .
n≤u
and that
x x
f (n) = f (t)dt + (t − [t])f (t)dt + f (1) − (x − [x])f (x).
n≤x 1 1
Deduce that
Λ(n) = x + o(x)
n≤x
2. Prove that
⎛ ⎞2
2k
k
2k + 1 + 2(2k + 1 − j) cos(jθ) = ⎝1 + 2 cos jθ⎠ .
j=1 j=1
a0 + a1 cos θ + · · · an cos nθ ≥ 0.
References
[Ellison] W. Ellison, Prime numbers, John Wiley and Sons, Paris, Hermann, 1985.
[Fe] L. Féjer, Über trigonometrische Polynome, J. Reine Angew. Math., 146
(1916), pp. 53–82.
[Ka] Jean-Pierre Kahane, Jacques Hadamard, Math. Intelligencer, Vol. 13, No. 1,
(1991), pp. 23–29.
[L] S. Lang, Algebraic Number Theory, Springer-Verlag, 1986.
[VKM] V. Kumar Murty, On the Sato-Tate conjecture, in Number Theory related
to Fermat’s Last Theorem, (ed. N. Koblitz), Progress in Mathematics, Vol.
26, (1982), pp. 195–205.
[Rudin] W. Rudin, Real and Complex analysis, Bombay, Tata Mcgraw-Hill Pub-
lishing Co. Ltd., 1976.
Chapter 2
Artin L-Functions
§1 Group-theoretic background
In this section, we shall collect together a few group theoretic preliminaries. We
begin by reviewing the basic aspects of characters and class functions.
Let G be a finite group. If f1 , f2 : G → C are two C-valued functions on G,
we define their inner product by
1
(f1 , f2 ) = f1 (g)f2 (g).
|G|
g∈G
Then
r
1 ˙ −1
(IndG
H f )(g) = f˙(gi−1 ggi ) = f (s gs).
i=1
|H|
s∈G
(f1 , f2 |H ) = (IndG
H f1 , f2 ).
H1 H2
H1 H2
V = ⊕g G Cxg .
regG : G → GL(V )
is defined by
→ (xg
σ → xσg ) .
Its character will be denoted by the same letter and we easily see that
n σ = e (identity)
regG (σ) =
0 σ= e.
In terms of characters
regG = χ(1)χ
χ
regG = IndG
{e} 1
where the product is over primes p of K with Iq = 1 for any q of L with q|p.
To obtain an L-function which has good analytic properties (such as functional
equation), it is necessary to also define Euler factors at the primes p which are
ramified in L and also at infinite primes of K.
Let p be a prime of K which is ramified in L, and q a prime of L above p. Let
V be the underlying complex vector space on which ρ acts. Then we may restrict
this action to the decomposition group Dq and we see that the quotient Dq /Iq
acts on the subspace V Iq of V on which Iq acts trivially. Now we see that any σq
will have the same characteristic polynomial on this subspace and we define the
Euler factor at p to be this polynomial:
Lp (s, χ, K) = det(I − ρ(σq )|V Iq (Np)−s )−1 .
This is well-defined and gives the Euler factors at all finite primes.
Remark. Since G is a finite group, once ρ is given, there are only a finite number of
characteristic polynomials that can occur. For example, if we take the trivial one-
dimensional representation, only the polynomial (1 − T ) occurs. But the subtlety
in the Artin L-function is the assignment p → σp . In other words, which one
of the finite number of characteristic polynomials is assigned to a given prime p
determines and is determined by the arithmetic of the field extension, in particular
the splitting of primes.
28 Chapter 2 Artin L-Functions
We have also to define the Archimedean Euler factors. For each Archimedean
prime v of K we set
((2π)−s Γ(s))χ(1) if v is complex
Lv (s, χ, K) =
((π −s/2 Γ(s/2))a (π −(s+1)/2 Γ((s + 1)/2))b if v is real.
Here
a + b = χ(1)
and a (resp. b) is the dimension of the +1 eigenspace (resp. −1 eigenspace) of
complex conjugation.
We shall write
γ(s, χ, K) = Lv (s, χ, K).
v infinite
G0 ⊇ G1 ⊇ · · · .
We also set
χ(1)
Aχ = dK NK/Q fχ .
Let us set
Λ(s, χ, K) = As/2
χ γ(s, χ, K)L(s, χ, K).
The number W (χ) itself carries deep arithmetic information. For example,
it is related to Galois module structure. The reader is referred to the monograph
[Fr] of Fröhlich for an introduction to this subject.
We now recall some of the formalism of Artin L-functions and their basic
properties. It is summarized in the two properties:
L(s, aχ χ, K) = L(s, χ, K)aχ for any aχ ∈ Z (1)
χ χ
L(s, IndG
H χ, K) = L(s, χ, LH ) where LH is the subfield of L fixed by H. (2)
There is a theorem of Brauer which says that for any irreducible χ, there are
subgroups {Hi }, one-dimensional characters ψi of Hi and integers mi Z with
χ= mi IndGHi ψi .
i
that Artin’s conjecture implies that ζL (s)/ζK (s) is entire. In fact, let L/K be a
(not necessarily Galois) finite extension and let K̃/K be its normal closure. Say
G = Gal(K̃/K) and H = Gal(K̃/L). Then
L(s, IndG
H (1H ), K) = L(s, 1H , L) = ζL (s).
with 0 ≤ aχ ∈ Z. So,
L(s, IndG
H 1H , K) = ζK (s) L(s, χ, K)aχ .
Putting these together, we see that Artin’s conjecture implies that ζL (s)/ζK (s)
is entire, whether L/K is Galois or not. This special case of Artin’s conjecture
is called Dedekind’s conjecture. Below, we shall discuss it in several cases. In
particular, it is known to hold in the case L/K is Galois (Aramata-Brauer) and
in case L̃/K is solvable (Uchida-van der Waall).
Now
φ(|A|) = 1= 1 = |G|.
A A σ∈A σ G
<σ>=A
By Mackey’s theorem,
Thus,
IndG
H (1H )|A =
where ranges over all the irreducible characters of A. Thus, for all χ,
mχ = 0 or 1 and (χ|A , ) = 0 or 1 for any ∈ Irr(A).
Now, take an ∈ Irr(A) such that there is a χ ∈ Irr(G) with mχ
= 0 and
(χ|A , ) = 1. Let T be the inertia group of :
T = {σ ∈ G : σ = }.
→ (σaσ−1 ).) Of course, T ⊇ A and we can write it
(Here, σ is the character a
as T = H A where H ⊆ H. We can extend to a character ˜ of T by setting
˜(ha) = (a) for any h ∈ H , a ∈ A.
Let us write
IndTA = IndTA ˜|A = mψ ψ.
ψ∈Irr(T )
Notice that
IndTA (g) = [T : A] (g) if g ∈ A .
0 otherwise
Thus,
[T : A] = (IndTA )|A = mψ ψ|A .
Thus for every ψ ∈ Irr(T ), with mψ
= 0, ψ|A is a multiple of . In fact,
mψ = (ψ, IndTA ) = (ψ|A , ),
and so ψ|A = mψ . It follows that
m2ψ = [T : A].
From this, we deduce that the characters {IndGT ψ} are distinct and irreducible.
Indeed, we have
[T : A] = m2ψ ≤ (IndG G G
A , IndA ) = ( , (IndA )|A )
and
IndG
A |A = [T : A] g
where the sum on the right is over a set of coset representatives for T in G. By
definition of T , the conjugates g are distinct and our claim follows. Now,
1 = (χ|A , ) = (χ, IndG
A ) = mψ (χ, IndGT ψ).
ψ∈Irr(T )
Corollary 4.2 (Uchida, van der Waall) Let K/F be an extension of number
fields and K̃ a normal closure of K/F . Suppose that Gal(K̃/F ) is solvable. Then
ζK (s)/ζF (s) is entire.
Proof. As above, we set G = Gal(K̃/F ) and H = Gal(K̃/K), We proceed by
induction on the order of |G|. We may assume that H is a maximal subgroup of
G. For if J is a maximal subgroup of G with H ⊂ J ⊂ G, and M is the fixed field
of J, then
ζK (s)/ζF (s) = (ζK (s)/ζM (s)) (ζM (s)/ζF (s))
where the first factor on the right is entire by the induction hypothesis and the
second by the maximality of J.
Also, since G corresponds to the normal closure of K/F, we may assume
that H does not contain any proper non-trivial normal subgroup of G. Now let
A be a minimal normal subgroup of G. As G is solvable, such an A exists and
is (elementary) abelian. Moreover, A is not contained in H. Then HA = G and
H ∩ A = {1}. Indeed, the first equality is just the maximality of H and the second
follows from the minimality of A and the observation that H ∩ A is again a normal
subgroup. Thus, G is the semidirect product of A by H and Theorem 4.1 applies.
Finally in this section, we can ask the following variant of Dedekind’s con-
jecture. Let L/K be an extension with group G, and let H be a subgroup. Let ρ
be an irreducible representation of G. Then, is the quotient
L(s, IndG
H (ρ|H ), K)/L(s, ρ, K) (‡)
entire? This includes the general case of Dedekind’s conjecture (if we take ρ = 1G ).
(‡) can be proved by the method of the Proposition above, if G is solvable and ρ
is an abelian character. Indeed, we need only make two observations. First, if we
write
H (ρ|H ) − ρ =
IndG mχ χ
and as before, this implies that there is an i with mi = 1 and χ = IndG T ψi . Even
G
without assuming ρ(1) = 1, we get χ = IndT ψi for some and i. But we may
not know the holomorphy of L(s, ψi ). (Notice also, that we can restrict to the case
H is a maximal subgroup. For if J ⊇ H is a maximal subgroup, (M = fixed field
of J)
L(s, IndG G J
H (ρ|H ), K)/L(s, IndJ (ρ|J ), K) = L(s, IndH (ρ|H ), M )/L(s, ρ|J , M ).
and so
L(s, IndG
H (ρ|H ), K) L(s, IndJH (ρ|H ), M ) L(s, IndG
J (ρ|J ), K)
= · .
L(s, ρ, K) L(s, ρ|J , M ) L(s, ρ, K)
is proved. From this, it follows for example that ζK (s)/L(s, χ, F ) is entire except
possibly at s = 1, and that the same holds for the product ζK (s)L(s, χ, F ). This
raises the question of whether regG −χ can be decomposed as a non-negative sum
of monomial characters. This was answered in the affirmative by Rhoades [R].
Some special cases were computed in [Mu1].
Our approach applies in a wider context of an L-function formalism which is
satisfied by a variety of objects in number theory and algebraic geometry. Let G
be a finite group. For every subgroup H of G and complex character ψ of H, we
attach a complex number n(H, ψ) satisfying the following properties:
(1) Additivity: n(H, ψ + ψ ) = n(H, ψ) + n(H, ψ ),
(2) Invariance under induction: n(G, IndGH ψ) = n(H, ψ).
The formalism can be applied to the above case when G is the Galois group
of a normal extension K/k and n(H, ψ) is the order of the zero at s = s0 of the
Artin L-series attached to ψ corresponding to the Galois extension K/K H . It can
also be applied to the situation when E is an elliptic curve over k and n(H, ψ)
corresponds to the order of the zero at s = s0 of the “twist” by ψ of the L-function
of E over K H (see [MM] for definitions and details).
36 Chapter 2 Artin L-Functions
where the sum is over all irreducible characters ψ of H. Our first step is to show
that
Proposition 5.1 θG |H = θH .
Proof.
θG |H = n(G, χ)χ|H
χ
⎛ ⎞
= n(G, χ) ⎝ (χ|H , ψ)ψ ⎠
χ ψ
where the inner sum is over all irreducible characters of H and the outer sum is over
all irreducible characters of G. By Frobenius reciprocity, (χ|H , ψ) = (χ, IndG H ψ)
and so
θG |H = G
n(G, χ)(χ, IndH ψ) ψ.
ψ χ
By Proposition 5.1,
θG (g) = θg
(g) = n(g, ψ)ψ(g)
ψ
which is bounded by n(G, reg) in absolute value by our hypothesis and property
(1). This completes the proof.
Similar reasoning implies
Proposition 5.3 Let ρ be an arbitrary character of G. Suppose for every cyclic
subgroup H of G, and irreducible character ψ of H, we have n(H, ρ|H ⊗ ψ) ≥ 0,
then n(G, ρ ⊗ χ) is real for every irreducible character χ of G and
n(G, ρ ⊗ χ)2 ≤ n(G, ρ ⊗ reg)2 .
χ
nχ χ(1)
Also, if M
= Q, ζM has at most one zero in the region
1 1
σ ≥1− , |t| ≤ .
4 log |dm | 4 log |dm |
38 Chapter 2 Artin L-Functions
1 1 1
= + log |dm |
ρ
s−ρ s−1 2
1 n r1 Γ s Γ ζ
+ − log π + ( ) + r2 (s) − log 2 + M (s).
s 2 2 Γ 2 Γ ζM
The sum on the left runs over zeros ρ of ζM (s) in the strip o < σ < 1 and the
terms with ρ and ρ̄ are grouped together.
For s = σ > 1 we have
1 1
+ > 0.
σ − ρ σ − ρ̄
where the sum on the left denotes summation over any convenient subset of the
zeros ρ which is closed under complex conjugation. In particular, the sum
1
σ−ρ
1 1 1
< + log |dm |.
ρ
σ−ρ σ−1 2
0) then ρ̄ is
If ρ = β + iγ is in the rectangle specified in the statement, (with γ =
also in the same rectangle and taking the contribution from ρ and ρ̄ only, we get
the inequality
2(σ − β) 1 1
< + log |dm |.
(σ − β) + γ
2 2 σ−1 2
But this is false for M = Q at σ = 1 + log 1|dm | < 2. The same value of σ gives a
contradiction if there are two real zeroes in this rectangle (or a single real multiple
zero). This completes the proof of the Proposition.
The following consequence is also due to Stark [St].
§6 Low order zeros of Dedekind zeta functions 39
Corollary 6.2 Let K/F be a Galois extension. For any Artin L-function L(s, χ, F )
of this extension, the region
1 1
σ ≥1− , |t| ≤ .
4 log |dk | 4 log |dk |
is free of zeros except possibly for a simple zero. This zero exists only if χ is a real
Abelian character of a quadratic subfield of K.
Next, we examine the case when the Dedekind zeta function may in fact
vanish, but the order of zero is small. We shall study this under the assumption
that K/F itself is a solvable Galois extension. If we are at a point s = s0 where
ζK (s) has a “small” order zero, then it is possible to show more than just the
analyticity of ζK (s)/ζF (s) at s0 . We have the following result due to Foote and
K. Murty [FM].
Theorem 6.3 Let K/F be a solvable extension and write
1 · · · pt ,
[K : F ] = pα p1 < p2 < · · · < pt
1 αt
for the prime power decomposition of the degree. Suppose that at s = s0 , we have
r = ords=s0 ζK (s) ≤ p2 − 2.
and
θG (1) = ords=s0 ζK (s).
The first follows from Proposition 5.1 and the second follows from the factorization
of ζK into the L(s, χ, F ). Moreover, by our assumption of minimality, we may
suppose that θH is a character for every proper subgroup H of G. In addition, the
40 Chapter 2 Artin L-Functions
θG = θ1 − θ2 + θ3 .
(θ2 |N , θ3 |N ) = 0.
Since
θG |N = θ1 |N − θ2 |N + θ3 |N
is a character of N, it follows that
either θ1 |N = θ2 |N or θ1 |N = θ2 |N + φ
and hence also in ψ|N . It follows that the sum of the distinct conjugates form a
character of degree ≤ φ(1). Clifford’s theorem tells us that ψ|N is equal to this
sum and so ψ(1) ≤ φ(1) ≤ r. Also, ψ is a constituent of θ1 and so it is faithful by
definition.
Now, a theorem of Ito tells us that in a solvable group, a p-Sylow subgroup
is abelian and normal if there is a faithful character of degree smaller than p − 1.
Thus, the conclusion of the previous paragraph and our assumption that r ≤ p2 −2
imply that G has an abelian normal subgroup of order n/pα 1 . This would force G
1
to be nilpotent and Artin’s conjecture is known to hold for such groups as every
irreducible character is monomial. This again contradicts the nontriviality of θ2 .
We conclude that θ1 |N = θ2 |N . This is the only step in which the assumed bound
on r is used.
The final contradiction now comes by showing that θ1 = θ2 . To do this, take
x ∈ G\N . Denote by H the subgroup generated by x and the center of G. As H is
abelian, it is a proper subgroup of G. As we observed earlier, every irreducible com-
ponent λ of θ3 has the property that its kernel Ker λ meets the center non-trivially.
Thus, the same holds for IndG H (λ|H ). Now, taking an irreducible component χ of
θ2 , we know that χ is faithful and so
(χ, IndG
H (λ|H )) = 0.
(θ2 |H , θ3 |H ) = 0.
Now,
θG |H = θ1 |H − θ2 |H + θ3 |H
and again θG |H is a character of H. Thus, θ1 |H − θ2 |H is either zero or a character.
By our earlier argument, we know that θ1 (1) = θ2 (1) and so we must have θ1 |H =
θ2 |H . Combined with our earlier result for N it follows that
θ1 = θ2 .
|C|
πC (x) ∼ πF (x)
|G|
Theorem 7.1 Suppose the Dedekind zeta function ζK (s) satisfies the Riemann
hypothesis. Then
|C| |C| 1
πC (x) = πF (x) + O( · x 2 (log dL + nL log x)).
|G| |G|
This version of their result is only slightly more refined than the statement
given in [LO] and is due to Serre [Se2, p. 133]. The proof of Theorem 7.1 is very
analogous to the classical proof of the prime number theorem in arithmetic pro-
gressions, as presented, for example, in the monograph of Davenport [D]. However,
there are some points of difference and we now briefly discuss them.
As in the classical case, the proof begins by expressing the characteristic
function of the conjugacy class C in terms of characters of G. However, we have
to deal with the fact that G is non-abelian and that we do not know the analytic
properties of Artin L-functions. In particular, we do not know Artin’s conjecture.
We have
|C|
δC = χ(gC )χ
|G| χ
where δC denotes the characteristic function of the class C and gC is any element
in this class. Hence
|C|
π(x, δC ) = χ(gC )π(x, χ)
|G| χ
δ : H −→ {0, 1}
φ = IndG
H δ.
Now the right hand side is written as a sum involving the characters of H. In
particular, if we are given C and we let H be the cyclic subgroup generated by gC
then we are able to express π̃(x, δC ) in terms of Hecke L-functions. As we know the
analytic properties of these L-functions, we are now able to follow rather closely
the classical method as developed in [D] to prove Theorem 7.1.
Though the above technique has the advantage of replacing the non-abelian
L-functions with abelian ones, it does so at some cost. The estimates will now
involve the field constants (that is, degree, discriminant, etc.) of the fixed field M
(say) of H. In general, as we do not have any information about M we are forced
to majorize its field constants by those of K and this magnifies the error terms
significantly.
This problem could be avoided if we were able to deal directly with the Artin
L-functions. This theme is developed in the next section. We conclude this section
by stating some unconditional results developed in [LO] and in [LMO].
44 Chapter 2 Artin L-Functions
This bound does not assume that K/F is Galois. If we know in addition that K/F
is Galois, the following slightly stronger estimate holds:
log NF/Q dK/F ≤ (nK − nF ) log p + nK (log n).
p∈P (K/F )
There is an analogue of this for Artin conductors also. This analogue is needed
in the proofs of the results of the next section.
§7 Chebotarev density theorem 45
Proposition 7.4 Suppose that K/F is Galois with group G. Let χ denote an
irreducible character of G and denote by fχ its Artin conductor. Then
log NF/Q fχ ≤ 2χ(1)nF { log p + log n}.
p∈P (K/F )
1
dim V Gi = χ(a),
|Gi |
a∈Gi
where Gi is as in Section 2.
Thus, for each finite ν,
|Gi | 1
n(χ, ν) = χ(1) − χ(a) .
i
|G0 | |Gi |
a∈Gi
χ(1) 1
n(χ, ν) = (|Gi | − 1) − χ(a)iG (a).
|G0 | i |G0 |
1=a∈G0
Applying this formula for χ the trivial character, and the character of the regular
representation of G0 , we find that
iG (a) = (|Gi | − 1) = w(DK/F ).
1=a∈G0 i
Hence,
1 2χ(1)w(DK/F )
n(χ, ν) = iG (a)(χ(1) − χ(a)) ≤ .
|G0 | ew/ν
1=a∈G0
Now using the above stated estimate for w(DK/F ) we deduce that
1
log Nfχ ≤ 2χ(1) (ew/ν − 1 + sw/ν )fν log pν
ew/ν
46 Chapter 2 Artin L-Functions
and this is
eν eν
≤ 2χ(1) fν 1− log pν + fν w(ew/ν ) log pν
ew ew
Proposition 8.1 Suppose that the Artin L-series L(s, χ) is analytic for all s
=1
and is nonzero for Re(s)
= 12 , 0 < Re(s) < 1. Then
1
π(x, χ) = δ(χ) Li(x) + O(x 2 ((log Aχ ) + χ(1)nF log x)) + O(χ(1)nF log M (K/F ))
where
1/nF
M (K/F ) = ndF p.
p∈P (K/F )
§8 Consequences of Artin’s conjecture 47
Proof. The argument proceeds along standard lines and so we just sketch it here.
Artin proved the functional equation
where W (χ) is a complex number of absolute value 1 and χ̄ is the complex conju-
gate of χ. We know that
(s(s − 1))δ(χ) Λ(s, χ)
is entire and we have the Hadamard factorization
s
Λ(s, χ) = ea(χ)+b(χ)s (1 − )es/ρ (s(s − 1))−δ(χ)
ρ
where a(χ), b(χ) ∈ C and the product runs over all zeroes ρ of Λ(s, χ) (ncecessarily
0 ≤ Re(ρ) ≤ 1.) From the equality
Λ Λ
(s, χ) = − (1 − s, χ̄).
Λ Λ
From these two relations, we deduce that
Λ 1
Re ( , χ) = 0.
Λ 2
Also, if ρ is a zero of Λ(s, χ) then so is 1 − ρ̄. Hence,
1
Re ( − ρ)−1 = 0
2
as is seen by grouping together the terms corresponding to ρ and 1 − ρ̄ in the
absolutely convergent sum. Logarithmically differentiating the product formula at
s = 12 and taking real parts, we deduce that
1
Re(b(χ) + ) = 0.
ρ
Hence,
Λ 1 1 1
Re (s, χ) = Re( ) − δ(χ) Re( + ).
Λ ρ
s − ρ s s − 1
48 Chapter 2 Artin L-Functions
Let N (t, χ) denote the number of zeros ρ = β + iγ of L(s, χ) with 0 < β < 1 and
|γ − t| ≤ 1. Evaluating the above formula at s = 2 + it and observing that
1 2−β
Re( )=
2 + it − ρ (2 − β)2 + (t − γ)2
Λ
N (t, χ) Re (2 + it, χ).
Λ
Since the Dirichlet series for L(s, χ) converges at 2 + it, the right hand side is
easily estimated, the essential contribution coming from log Aχ and the number of
Γ factors. We get
xρ
χ(σν ) log Nν = δ(χ)x− + O(χ(1)nF log M (K/F ))
ρ
Nν≤x |γ|<x
1
+ O(x 2 (log x)(log Aχ + χ(1)nF log x)),
where the prime on the sum indicates that we only include places ν that are
unramified in K. The sum over zeros can be estimated by observing that
1 N (j, χ)
ρ j
|γ|<x j<x
and using the above estimate for N (t, χ). The estimate for π(x, χ) can be deduced
by partial summation.
Proposition 8.2 Suppose that all Artin L-series of the extension K/F are analytic
at s
= 1 and that GRH holds. Then
1 |C|
2
πC (x) − Li x xn2F (log M (K/F )x)2 .
|C| |G|
C
1 |C| |C|
2
1
π(x, 1G ) − Li x = (π(x, 1G ) − Li x)2 .
|C| |G| |G| |G|
C
§8 Consequences of Artin’s conjecture 49
Proposition 8.3 Let D be a union of conjugacy classes. Under the same hypotheses
as in Proposition 8.2, we have
|D| 1 1
πD (x) = Li x + O(|D| 2 x 2 nF log M (K/F )x).
|G|
Proof. We have
|D| |C|
πD (x) − Li x = πC (x) − Li x
|G| |G|
C
where the sum is taken over all conjugacy classes C contained in D. Now applying
the Cauchy-Schwarz inequality we deduce that
2 12
|C|
1 |C|
πC (x) − Li x (
1
|C|) 2 πC (x) − Li x .
|G| |C| |G|
C C C
Remark. Using Hensel’s estimate for the discriminant, it is possible to write the
error term in Theorem 7.1 as
1
O(|C|x 2 nF log M (K/F )x).
1
Thus Artin’s conjecture allows us to replace |C| with |C| 2 . In some cases, we can
also improve Theorem 7.1 even without assuming Artin’s conjecture. We give two
such results below.
50 Chapter 2 Artin L-Functions
Now,
|C| |H|
· (π̃CH (x) − πCH (x))
|G| |CH |
C⊆D
⎛ ⎞
|H| |C| ⎜ ⎟
≤ ⎝ δC (σm ) + δCH (σν )⎠
|G| |CH | Nν m ≤x H ν
C⊆D Nν≤x
m≥2 ν ramified in L/K
|H| |C| |G| 1 2 1
≤ maxC⊆D · nF x +
2 log dK
|G| |CH | |H| log 2 |H|
|C| 1
≤ max (nF x 2 + nF log M x)
|CH |
and this can be absorbed into the error term. Therefore, we can replace π̃CH by
πCH in the equation (8.1). Now,
|C| |H|
· πC (x)
|G| |CH | H
C⊆D
⎛ ⎞
|C|
|D| |H| 1 |C |
= Li x + O ⎝ · πCH (x) − H
Li x⎠ .
|G| |G| |CH | 2 |CH | 2 |H|
1 1
C⊆D
§8 Consequences of Artin’s conjecture 51
Now applying the Cauchy-Schwarz inequality and using Proposition 8.2, we find
that the error term above is
⎛ ⎞ 12
|H| ⎝ |C|2 ⎠ 1 |G|
· x 2 nF log M (K/F )x
|G| |CH | |H|
C⊆D
where F is the fixed field of H. This proves the proposition since M (K/F )
M (K/F ).
We state one immediate corollary of this result.
Corollary 8.5 Under the same hypotheses as above,
1
|D| 1 1 |G| 2
πD (x) = Li x + O |D| 2 x 2 nF log M x .
|G| |H|
|C| |G|
≤ .
|CH | |H|
We now present one further result in this direction. This estimate has the
feature that in some cases, it gives a better result than what one deduces from
Artin’s conjecture.
Proposition 8.6 Suppose the GRH holds. Let D be a nonempty union of conjugacy
classes in G and let H be a normal subgroup of G such that Artin’s conjecture is
true for the irreducible characters of G/H and HD ⊆ D. Then
1
|D| |D| 2 12
πD (x) = Li x + O x nF log M x
|G| |H|
and
πD (x) = πD̄ (x) + O((log dK )/|G|).
Also, M (F /F ) M (K/F ). The result follows.
52 Chapter 2 Artin L-Functions
Finally, we can ask what the true order of the error term in the Chebotarev
theorem should be. Let α(G) denote the number of conjugacy classes of G.
Question. Is it true that for any conjugacy set D ⊆ G,
12
|D| |D| 1
πD (x) = Li x + O x nF log M x ?
2
|G| α(G)
This would be implied by the Proposition 8.2 if all the terms are of the same order.
In the case F = Q and K/F is Abelian, our question is a well-known conjecture
of Montgomery.
In fact, the term (log |G|)2 may also be removed by using a more detailed argument.
The purpose of this section is to show, assuming the GRH, that in the special case
C = G − {1}, there is a prime ideal p of K of degree 1 which is unramified in L
which does not split completely and which satisfies
2 2
log |dL | nK
NK/Q p log |dL | . (9.3)
|G| − 1 nL
where nK = [K : Q] and nL = [L : Q]. Thus, the estimate (9.3) shows that for the
special set C = G − {1}, one can do substantially better than (9.1).
§9 The least prime in a conjugacy class 53
Next, we shall show that for certain subgroups H of G the bound (9.3) may
be extended for the least norm of a prime ideal p for which σp does not intersect
H. The precise statement is given in Theorem 9.3.
We apply these last results to the group of points on an elliptic curve over a
finite field. Let E be an elliptic curve without complex multiplication and defined
over Q. Denote by N the conductor of E.
Let us set
T = lcmE |E (Q)tors |
where the lcm ranges over elliptic curves E which are defined over Q and are
Q-isogenous to E. In [K, Th. 2] Katz proved that
gcd |E(Fp )| = T
where the gcd is taken over primes p of good reduction. It is well known and easily
proved that both sides are divisible by the same primes. Using our results, we can
make this effective in the following sense. Let l ≥ 5 be a prime and assume the
GRH. If l does not divide T then we show (Theorem 9.4) that there is a prime p
so that
p ( log N )2
and E(Fp ) does not have a point of order .
We begin by proving the estimate (9.3). We recall that L/K is a non-trivial
Galois extension.
Theorem 9.1 Assume the GRH. Then, there exists a prime ideal p of K
(i) p is of degree 1 over Q and unramified in L
(ii) p does not split completely in L
and
nK
NK/Q p ( log |dL |)2
nL
where nK = [K : Q] and nL = [L : Q].
Proof. We consider the kernel function of [LMO, §2], namely
s−1 2
y − xs−1
k(s) = k(s; x, y) = .
s−1
For y > x > 1 and u > 0, it has the property that the inverse Mellin transform
1
k̂(u) = k(s)u−s ds
2πi (2)
is given by the formulae
⎧
⎪ 0 if u > y2
⎨1 y2
k̂(u; x, y) = u log u if xy < u < y 2
⎪ 1
⎩ u log
u
if x2 < u < xy
x2
0 if u < x2 .
54 Chapter 2 Artin L-Functions
where ρ runs over all zeroes of ζK (s). Write ρ = β + iγ. If NK (r; s0 ) denotes the
number of zeroes ρ of ζK (s) with |ρ − s0 | ≤ r then ([LMO, Lemma 2.2])
Since
x−2(1−β)
|k(ρ; x, y)| ≤
|ρ − 1|2
it follows that
∞
1
k(ρ; x, y) x−2δ dNK (r; 1)
δ r2
β≤1−δ
−2δ
x (δ −2 + δ −1 log |dK |).
1
As we are assuming the GRH, we may take δ = 2 and we see that
y
JK = (log )2 + O(x−1 log |dK |).
x
On the other hand, the integral is equal to the sum
Λ((N p)n )k̂((N p)n ; x, y).
p,m
The contribution to this sum of ideals pn for which N pn is not a rational prime is
nK (log y)(log y/x)
x log x
as in [LMO, (2.6)]. Moreover, the contribution of primes p which ramify in L is
y
(log N p)x−2 log
x
p|dL/K
as in [LMO, (2.27)]. (Recall that dL/K is the norm to Q of the discriminant of the
extension L/K.) Since [Se2, p. 129]
2
log N p ≤ log |dL |,
n
p|dL/K
§9 The least prime in a conjugacy class 55
Indeed, choose for L the Hilbert class field of K, and use the fact that dL = dhK .
56 Chapter 2 Artin L-Functions
L1
L
2 ...Lr
K
M
Q
m = min[Li : K]
f = [F : K]
where r )
i=1 (log |dLi |) m
B = max , log |dF | .
m−r f (m − r)
Proof. Let S denote the set of degree 1 prime ideals p of K with N p ≤ y 2 for which
p ∩ OM does not ramify in F . Suppose that every element of S splits completely
in some Li . Then, with notation as in the proof of Theorem 9.1, we have
JLi ≥ m (log N p)k̂(N p; x, y).
p∈S
§9 The least prime in a conjugacy class 57
Using the estimate for JL and J˜K given in the proof of Theorem 9.1, we deduce
that
y 1
r(log )2 + O( log |dLi |)
x x i
NK/Q p BH
2
and satisfying
(a) p is of degree one and does not ramify in L
(b) σp is disjoint from H.
Here,
)
1 1 m
BH = max log |dL |, (log |dL |) .
m−r |Hi | |N |(m − r)
Proof. Each Li is a Galois extension of R and L is a Galois extension of R containing
all the Li . By Theorem 9.2, we can find a prime ideal P of R of degree one (over
Q) so that p = NR/K P does not ramify in L, P does not split completely in any
of the Li and
NR/Q P B 2
where ) *
1
r
m
B = max log |dLi |, (log |dL |) .
m − r i=1 [L : R](m − r)
58 Chapter 2 Artin L-Functions
Theorem 9.5 Suppose that E does not have complex multiplication and let ≥ 5
be a prime which does not divide T . Denote by N the conductor of E. Assume the
GRH. Then, there is a prime
p ( log N )2
NG (H) = NG (H ) = B.
gHg −1 ∩ B ⊆ H or H.
We apply Theorem 9.3 to get a prime p which is unramified in L, the fixed field
of the kernel of ρ and which has the property that σp ∩ H = φ and
p x2
where )
1 2 −1
x= (log |dL |), (log |dL |) .
− 1 − 2 ( − 1) 4
60 Chapter 2 Artin L-Functions
and so
p ( log N )2 .
Now consider
D = {g ∈ G : trg = 1 + det g}.
Clearly, every conjugacy class in D intersects H non-trivially. Hence σp is not
contained in D, or in other words σp ∩ D = φ. Thus
a(p)
≡ 1 + p(mod )
|E(Fp )| = p + 1 − a(p)
≡ 0(mod ).
' 1 (2
p log dF
[G : H]
p (log N )2 .
(iii): This is impossible if > 2 since G contains the image of complex conjuga-
tion, a matrix with distinct F -rational eigenvalues (namely +1, −1), whereas the
eigenvalues of every element of a nonsplit Cartan subgroup are either equal or lie
in F2 \F .
Exercises 61
(iv): In this case |G| . By the result of Lagarias and Odlyzko , quoted
as (9.1)
2
at the beginning of this section, there exists a prime p whose σp is
2
(say) and
p < (|G| log N )2 ( log N )2 .
Such a prime has
a(p) ≡ 4
≡ 1 + 4 ≡ 1 + p (mod ).
(v): In this case, there is a quadratic character with the property that
Exercises
1. Let χ be an irreducible character of a finite group G. If χ is a linear com-
bination with positive real coefficients of monomial characters, then mχ is
monomial for some integer m ≥ 1.
2. Let A be a normal subgroup of the group G and let χ be an irreducible
character of G. Then either the restriction of χ to A is isotypic (that is, a
multiple of one character) or there is a subgroup H containing A and an
irreducible character σ of H such that χ = IndGH σ. (See [Se1, Prop. 24]).
{1} = G0 ⊆ G1 ⊆ · · · Gn = G
4. Let F be a set of characters of the finite group G. We say that a class function
θ is semi-orthogonal to F if (θ, φ) ≥ 0 for all φ ∈ F.
(a) If F is the set of all characters of G, then a generalized character θ is
semi-orthogonal to F if and only if θ is a character.
(b) Let
F̃ = { xφ φ : 0 < xφ ∈ R, φ ∈ F}.
Then a class function θ is semi-orthogonal to F if and only if it is semi-
orthogonal to F̃.
5. If
F = {IndG
H ψ : H an Abelian subgroup of G}
then
(a) the generalized character θG is semi-orthogonal to F.
(b) if a generalized character θ = mχ χ is semi-orthogonal to F then
|mχ | ≤ |θ(1)|.
6. Let F be a subset of Rk and define
and
C(F ) = { xi fi : 0 < xi ∈ R, fi ∈ F }
9. Let F/Q be a finite Galois extension with group G and let H, H be two
subgroups. Denote by K and K the corresponding fixed fields.
(a) Show that ζK (s) = ζK (s) if and only if for every conjugacy class C of
G, we have #(H ∩ C) = #(H ∩ C).
(b) Let G = S6 (the symmetric group on 6 letters) and consider the sub-
groups
H = {(1), (12)(34), (12)(56), (34)(56)}
and
H = {(1), (12)(34), (13)(24), (14)(23)}.
Prove that the above condition is satisfied and deduce that the Dedekind
zeta functions of the corresponding fixed fields coincide. (This is due to
Gassman, 1926.)
10. Let 1 < a ∈ Z be a squarefree integer and q a prime. Set K = Q(a1/q ) and
prove directly that ζK (s)/ζ(s) is entire.
11. Let f (T ) ∈ Z[T ] be an irreducible polynomial of degree larger than 1. Show
that the set
{p : f (T ) ≡ 0(mod p) has a solution}
has positive density.
12. Let E be a biquadratic extension of Q and let K1 , K2 , K3 be the three
quadratic subfields. Show that
References
[CF] J. Cassels and A. Fröhlich, Algebraic Number Theory, Academic Press, 1967.
[D] H. Davenport, Multiplicative Number Theory, Springer-Verlag, 1980.
[Fo] R. Foote, Non-monomial characters and Artin’s conjecture, Trans. Amer.
Math. Soc., 321 (1990), 261–272.
[FM] R. Foote and V. Kumar Murty, Zeros and poles of Artin L-series, Math.
Proc. Camb. Phil. Soc., 105 (1989), 5–11.
[FW] R. Foote and D. Wales, Zeros of order 2 of Dedekind zeta functions and
Artin’s conjecture, J. Algebra, 131 (1990), 226–257.
[Fr] A. Fröhlich, Galois Module Structure of algebraic integers, Ergebnisse der
Mathematik und ihrer Grenzgebiete, Springer-Verlag, 1983.
64 Chapter 2 Artin L-Functions
[K] N. Katz, Galois properties of torsion points of abelian varieties, Invent. Math.,
62 (1981), 481–502.
[La] S. Lang, Algebraic Number Theory, Springer-Verlag, 1986.
[LO] J. Lagarias and A. M. Odlyzko, Effective versions of the Chebotarev Density
Theorem, Algebraic Number Fields, ed. A. Fröhlich, 409–464, Academic Press,
New York, 1977.
[LMO] J. Lagarias, H. Montgomery and A. M. Odlyzko, A bound for the least
prime ideal in the Chebotarev Density Theorem, Invent. Math., 54 (1979),
271–296.
[Mu1] V. Kumar Murty, Holomorphy of Artin L-functions, in: Proc. Ramanujan
Centennial Conference, pp. 55–66, Ramanujan Mathematical Society, Chi-
dambaram, 1987.
[Mu2] V. Kumar Murty, Explicit formulae and the Lang-Trotter conjecture, Rocky
Mountain J. Math., 15 (1985), 535–551.
[Mu3] V. Kumar Murty, The least prime which does not split completely, Forum
Math., 6 (1994), 555–565.
[MM] M. Ram Murty and V. Kumar Murty, Base change and the Birch and
Swinnerton-Dyer conjecture, in: A tribute to Emil Grosswald: number theory
and analysis, Contemp. Math., 143 (1993), 481–494.
[MMS] M. Ram Murty, V. Kumar Murty and N. Saradha, Modular forms and the
Chebotarev Density Theorem, Amer. J. Math., 110 (1988), 253–281.
[MS] V. Kumar Murty and J. Scherk, Effective versions of the Chebotarev Density
Theorem in the function field case, C.R. Acad. Sci. Paris, 319 (1994), 523–
528.
[R] S. Rhoades, A generalization of the Aramata-Brauer theorem, Proc. Amer.
Math. Soc., 119 (1993), 357–364.
[Se1] J.-P. Serre, Linear representations of finite groups, Springer-Verlag, New
York, 1977.
[Se2] J.-P. Serre, Quelques applications du Théorème de Densité de Chebotarev,
Publ. Math. IHES, 54 (1981), 123–201.
[St] H. M. Stark, Some effective cases of the Brauer-Siegel theorem, Invent. Math.,
23 (1974), 135–152.
[Uc] K. Uchida, On Artin L-functions, Tohoku Math. J., 27 (1975), 75–81.
[vdW] R. W. van der Waall, On a conjecture of Dedekind on zeta functions, Indag.
Math., 37 (1975), 83–86.
Chapter 3
Equidistribution and L-Functions
1
n
μ(f ) = lim f (xi ).
n→ ∞ n
i=1
1
n
μn (φα ) = φα (xi ),
n i=1
from which we obtain lim inf nU /n ≥ μ(U ). The same argument applied to
X − U shows that
lim inf (n − nU )/n ≥ μ(X − U ).
n→ ∞
Hence,
lim sup nU /n ≤ μ(U ) ≤ lim inf nU /n.
n→ ∞ n→ ∞
1
n
lim χ(xi ) = μ(χ).
n→ ∞ n
i=1
Proof. The map C(X) → C(G) is an isomorphism of C(X) onto the space of
class functions on G; by the Peter-Weyl Theorem, the irreducible characters χ of
G generate a dense subspace of C(X). Hence the proposition follows from Lemma
1.1.
§3 L-functions on G 67
Corollary 2.2 Let μ be the Haar measure of G with μ(G) = 1. Then the sequence
∞
{xn }n=1 of elements of X is μ-equidistributed if and only if for any irreducible
character χ of G, χ
= 1, we have,
1
n
lim χ(xi ) = 0.
n→ ∞ n
i=1
Proof. This follows from Proposition 2.1 and (a special case of) the orthogonality
relations: μ(χ) = 0 if χ is irreducible
= 1 and μ(1) = 1.
Corollary 2.3 (H. Weyl) Let G = R/Z, and let μ be the normalized Haar measure
on G. Then {xn }∞
n=1 is μ-equidistributed if and only if for any integer m
= 0 we
have
e2πimxn = ◦(N )
n≤N
as N → ∞.
Proof. It suffices to remark that the irreducible characters of R/Z are the maps
x
→ e2πimx , (m ∈ Z).
In [BTD], the image of the Haar measure of SU (2, C) in the space of conju-
gacy classes is calculated. Each conjugacy class has a representative of the form
eiθ 0
,0 ≤ θ ≤ π
0 e−iθ
2 2
which has measure π sin θdθ.
§3 L-functions on G
Let G be a compact group, X its space of conjugacy classes as above. Suppose
that for each prime p, we associate a conjugacy class Xp in X. As p varies, how
are the Xp distributed (say, with respect to the normalised Haar measure on X)?
To answer this, we define the L-function associated to each irreducible complex
linear representation ρ of G in the following way: Let
ρ : G −→ GLn (C)
and define
L(s, ρ) = det(1 − ρ(Xp )p−s )−1 .
p
68 Chapter 3 Equidistribution and L-Functions
Example. Let ζn be a primitive n-th root of unity and set K = Q(ζn ), be the
usual n-th cyclotomic field. Then
Gal(K/Q) (Z/nZ)∗ .
σp (ζn ) = ζnp .
1
π(x; n, a) ∼ i x
φ(n)
as x → ∞.
Hence, the prime number theorem for arithmetic progressions, and more generally,
the Chebotarev density theorem, fit into this general formalism. In [Se2], Serre
formulates a ‘motivic’ generalization of the Chebotarev density theorem.
Theorem 4.1 Let G be a compact group. Assume that for every non-trivial, irre-
ducible representation ρ of G, L(s, ρ) is holomorphic at s = 1. If χ is quadratic we
suppose that L(s, χ) is holomorphic on [1/2, 1]. Then L(1, ρ)
= 0 for all irreducible
ρ= 1.
§4 Deligne’s Prime Number Theorem 69
Remark. We want to point out that the second condition above is essential. For
instance, consider the group G = ±1 of order 2. For each prime p, we will define
Xp = −1. G has only two irreducible characters, the trivial one which gives the
Riemann zeta function and the non-trivial one which gives
1 1 − p−s ζ(2s)
= = .
p
1+p −s
p
1 − p−2s ζ(s)
In the case of an arbitrary compact group, we will try and approximate this
construction.
Given any > 0, there exists a delta like function F on G such that F is
continuous, real-valued, non-negative, invariant under conjugation, and F (x) =
F (x−1 ) satisfying
F (x)dx = 1,
G
and
F (x)χ(x)dx > d −
G
70 Chapter 3 Equidistribution and L-Functions
for > 0. Note that the latter quantity is a real number since it is invariant under
complex conjugation. Now choose a finite sum that approximates F :
cψ ψ.
ψ
We may assume without loss that cψ are rational numbers. Thus we have a function
F = cψ ψ
ψ
which is a finite sum, with cψ rational numbers, real valued and non-negative,
c1 = 1 and cχ > d − . Now take f = F ∗ F to get non-negative coefficients so that
f (x) = F (xy −1 )F (y)dy.
G
1
ψ∗ψ = ψ
ψ(1)
so that
c2ψ
f= ψ.
ψ(1)
The coefficients are still rational numbers. The coefficient of the trivial character
is 1 and of χ is > (d − )2 /d. Clearing denominators gives us the desired function.
Proof of Theorem 4.1 Assume that ρ is not one or quadratic. Choose f = ψ cψ ψ
as in the lemma with c1 < cχ . Assume L(1, χ) = 0. Then,
L(s, f ) = L(s, ψ)cψ
ψ
= L(s, 1)c1 L(s, χ)cχ L(s, ψ)cψ ,
ψ=1,χ
−L −m
(s, f ) = + O(1)
L s−1
as s→1+ . Hence,
L
lim − Re( (s, f )) < 0 .
s→1+ L
Exercises 71
L
− Re( (s, f )) ≥ 0
L
which implies
L
lim − Re( (s, f ) ≥ 0
s→1+ L
which is a contradiction. This completes the proof in this case. If χ is quadratic,
consider
f (s) = L(s, χ)L(s/2, 1)L(s/2, χ)
and suppose that L(1, χ) = 0. By hypothesis, f (s) is holomorphic for s in [1, 2]
with a zero at s = 1. It is easily verified by looking at the Euler factors that the
coefficients of −f /f are non-negative. Call β the zero of f on [1, 2] closest to 2.
Such a zero exists since f (1) = 0. Therefore, for some positive integer m,
f m
− (s) = − + O(1)
f s−β
as s→β. But Landau’s lemma (Exercise 5 in Chapter 1)implies
f
− →∞
f
as s→β. This is a contradiction. Therefore L(1, χ)
= 0 and this completes the
proof.
Exercises
1. Prove that the sequence {log p} as p varies over the prime numbers is not
uniformly distributed mod 1.
2. (Erdös-Turán inequality): Let {xj }nj=1 be a finite sequence of real numbers.
For 0 ≤ α < β ≤ 1. Let A([α, β] : n) be the counting function
M n
1 1 1 1
Dn ≤ +2 + min(β − α, ) e2πihxj .
M +1 M +1 πh n j=1
h=1
*3. Let Fp denote the finite field of p elements. Let ψ : Fp →C∗ be a fixed non-
trivial additive character. For each character χ of the multiplicative group
F∗p , we define a Gauss sum
G(χ, ψ) = χ(x)ψ(x).
x∈F∗
p
If χ
= 1, we know that |G(χ, ψ)| = p1/2 so we can write
Show that the sequence obtained by listing θp (χ) as p ranges over all the prime
numbers and for each fixed p, χ ranges over all the non-trivial characters
mod p, is uniformly distributed mod 1.
4. Suppose that
1
n
cχ = lim χ(xi ).
n→∞ n
i=1
Define
Φ(g) = cχ χ(g),
χ
where the sum is over all irreducible characters arranged by increasing de-
grees. Then determine conditions when {xn }∞
n=1 is uniformly distributed with
respect to the measure Φ(g)dμ where μ is the normalized Haar measure of G.
5. In Theorem 4.1, let us suppose further that each L(s, ρ) is holomorphic on
Re(s) = 1 with s
= 1. Show that L(1 + it, ρ) = 0 for t ∈ R, t
= 0. (Hint: for
t ∈ R and non-zero, consider St = R/tZ and the group G × St . Define for
each prime p the conjugacy class (Xp , log p) and now apply Theorem 4.1 to
the L-functions associated to G × St .)
6. For each prime p ≡ 1(mod 4), let χ mod p be a character of order 4 and define
J(χ, χ) = χ(x)χ(1 − x).
x mod p
x =0,1
x2 + xy + 6y 2
√
with x, y ∈ Z has density 1/6. (Hint: consider Q( −23).)
*8. In the above question, define φp by
√
y 23
tan φp = .
2x + y
References
[BTD] T. Bröcker, T. tom Dieck, Representations of compact Lie groups, Graduate
texts in mathematics, 98, Springer-Verlag, 1985.
[Mo] H. Montgomery, Ten lectures on the interface between analytic number theory
and harmonic analysis, No. 84, CBMS, Amer. Math. Soc. 1994.
[Se] J.-P. Serre, Abelian -adic representations and elliptic curves, Benjamin, New
York, 1968; second edition, Addison-Wesley, Redwood City, 1989.
[Se2] J.-P. Serre, Propriétés conjecturales des groupes de Galois motiviques et des
représentations -adiques in Proc. Symp. Pure Math., 55 (1994) p. 377–400.
Chapter 4
Modular Forms and Dirichlet Series
Clearly
Γ(N ) ⊆ Γ1 (N ) ⊆ Γ0 (N ) ⊆ SL2 (Z)
A congruence subgroup of SL2 (Z), is by definition, a subgroup which contains
Γ(N ) for some N . Γ0 (N ), Γ1 (N ) are examples of congruence subgroups.
An element γ ∈ SL2 (Z) is called elliptic if |tr γ| < 2, parabolic if |tr γ| = 2
and hyperbolic if |tr γ| > 2.
Let GL+2 (R) be the group of 2 × 2 matrices with real entries and positive
determinant. Then GL+ 2 (R) acts on h as a group of holomorphic automorphisms:
az + b a b
γ:z
→ , γ= ∈ GL+
2 (R)
cz + d c d
Let h∗ denote the union of h and the rational numbers Q together with a
symbol ∞ (or more suggestively i∞). The action of SL2 (Z) on h can be extended
to h∗ by defining
a b a
·∞= for c =0
c d c
§3 Modular forms and cusp forms 77
and
a b
· ∞ = ∞;
0 d
r
for rational numbers s with (r, s) = 1, we define
a b r ar + bs
· =
c d s cr + ds
with the understanding that when cr + ds = 0, the right side of the above equation
is the symbol ∞. The rational numbers together with ∞ are called cusps.
If Γ is a discrete subgroup of SL2 (R), then the orbit space h∗ /Γ can be given
the structure of a compact Riemann surface, and is denoted XΓ .
We will be interested in the case Γ is a congruence subgroup of SL2 (Z). In
that case, the algebraic curve corresponding to XΓ is called a modular curve.
In case Γ = Γ(N ), Γ1 (N ) or Γ0 (N ), the corresponding modular curve is de-
noted X(N ), X1 (N ) and X0 (N ) respectively. For further details, the reader may
consult [VKM].
define
−k az + b
(f |k γ)(z) = (detγ) k/2
(cz + d) f .
cz + d
For fixed k, the map γ : f → f |k γ defines an action of GL+ 2 (R) on the space of
holomorphic functions on h. (Sometimes, we simply write f |γ for f |k γ.)
Let Γ be a subgroup of finite index in SL2 (Z). Let f be a holomorphic function
on h such that f |k γ = f for all γ ∈ Γ. Since Γ has finite index,
M
1 1 1 M
= ∈Γ
0 1 0 1
for some positive integer M . Hence f (z + M ) = f (z) for all z ∈ h. So, f has a
“Fourier expansion at infinity”:
∞
n
f (z) = an qM , qM = e2πiz/M
n=−∞
78 Chapter 4 Modular Forms and Dirichlet Series
∞
f (z) = an qn , q = e2πiz
n=0
The integer k is called the weight of f . Such a modular form is called a cusp form
if it vanishes at the cusps.
The modular forms on Γ0 (N ) of type (k, ) form a complex vector space
Mk (Γ0 (N ), ) and this has a subspace Sk (Γ0 (N ), ) consisting of cusp forms. The
subspace has a canonical complement:
-
Mk (Γ0 (N ), ) = Ek (Γ0 (N ), ) Sk (Γ0 (N ), )
and the space Ek is called the space spanned by Eisenstein series. These spaces
are finite dimensional. Moreover, one can define an inner product (Petersson) on
Sk (Γ0 (N ), ) by
dxdy
f, g = f (z)g(z)y k 2
h/Γ0 (N) y
Examples
1. Let k ≥ 4 be even. Then
Gk (z) = (mz + n)−k
m,n∈Z
(m,n) =(0,0)
§3 Modular forms and cusp forms 79
Then ∞
Δ(z) = τ (n) q n
n=1
is a cusp form of weight 12 for SL2 (Z); τ (n) is called the Ramanujan function.
Ramanujan conjectured in 1916 that
(i) τ (nm) = τ (n)τ (m) (n, m) = 1
(ii) |τ (p)| ≤ 2p11/2 , p prime
(i) was proved by Mordell in 1928 and (ii) by Deligne in 1974. More generally,
if f ∈ Sk (Γ0 (N ), ), then its Fourier coefficients satisfy
k−1
an = O(n 2 +δ ) for any δ > 0.
If χ is odd ( i.e., χ(−1) = −1), then E1,χ ∈ M1 (Γ0 (c), χ). One can also show that
if χ is even (i.e., χ(−1) = 1) then
⎛ ⎞
∞
4 ⎝
E2,χ =1− χ(d)d⎠ q n
B2,χ n=1
d|n
For higher weights, one has analogous results. See for instance [La].
i(N ) i2 (N ) i3 (N ) i∞ (N )
dim S2 (N ) = 1 + − − −
12 4 3 2
where
1
i(N ) = N 1+
p
p|N
0 if 4 | N
i2 (N ) = . −4
p|N 1 + p otherwise
0 if 2 | N or 9 | N
i3 (N ) = . −3
p|N 1 + p otherwise
i∞ (N ) = φ ((d, N/d))
d|N
This formula occurs in [Kn, p. 272] with a misprint which has been corrected
above. One can write down analogous formulas for dim Sk (N ) for k ≥ 2. (See for
example, [Shi] or [CO].) For explicit examples, see Frey [F]. For k = 1, no such
simple formula is known. However, for N prime, it is conjectured [D] that
1
dimS1 (N ) = h(−N ) + O(N )
2
√
where h(−N ) denotes the class number of Q( −N ).
§4 L-functions and Hecke’s theorem 81
Since y k/2 |f (z)| is invariant under Γ1 (N ), and hence represents a function on the
compact Riemann surface X1 (N ), it is bounded on h. Therefore
1/2
−2πny
e an = f (x + iy)e−2πinx dx
−1/2
y −k/2
WN Γ0 (N )WN−1 ⊂ Γ0 (N )
In the first term, we replace y by 1/N y and use the modular relation to get
∞ ∞
−s/2
N Λ(s, f ) = ±N k/2 k
i √ f (iy)y k−s−1
dy + √ f (iy)y s−1 dy
1/ N 1/ N
§5 Hecke operators
∞
Let p denote a prime number and f (z) = n=0 an q n be a modular form on Γ0 (N )
of type (k, ). The Hecke operators Tp and Up are defined by
∞
∞
f |Tp = anp qn + (p)pk−1 an q np if p N
n=0 n=0
∞
f |Up = anp qn if p | N
n=0
It is not difficult to show that f |Tp , f |Up are also modular forms on Γ0 (N ) of type
(k, ), and they are cusp forms if f is a cusp form.
Theorem 5.1 (Hecke) The Tp ’s are commuting linear transformations of
Sk (Γ0 (N ), ). As such, the space can be decomposed as a direct sum of eigenspaces.
Let f ∈ Sk (Γ0 (N ), ). We will say that f is an eigenform if f is an eigen-
function for all the Hecke operators Tp ’s and Up ’s.
If ∞
f (z) = an e2πinz
n=1
ap
−1
ap (p)
−1
L(s, f ) = 1− s 1 − s + 2s+1−k
p p p
p|N pN
k+2
which converges absolutely for Re s > 2 .
k+1
Remark. The product converges for Re s > 2 . See [Ogg] for further details.
§7 The Sato-Tate conjecture 83
be a newform of weight k and level N which is also a cusp form. Let us write for
each prime p N ,
k−1
a(p) = 2p 2 cos θp .
Since we know the Ramanujan conjecture, the θp ’s are real. Inspired by the Sato-
Tate conjecture for elliptic curves, Serre [Se] conjectured that if f is not of CM-
type, then θp ’s are uniformly distributed with respect to the Sato-Tate measure
2
sin2 θ.
π
84 Chapter 4 Modular Forms and Dirichlet Series
If we consider the group SU (2, C) and consider its space of conjugacy classes, we
can make the following assignment:
iθ
e p 0
p→ conjugacy class of .
0 e−iθp
Clearly, Lm (s) converges for Re(s) > 1. It is in fact conjectured that each Lm (s)
extends to an entire function. By Theorem III.3.1, we see that the Sato-Tate
conjecture is true if and only if Lm (1 + it)
= 0 for every real t and m ≥ 1. The
fact that L1 (s) extends to an entire function follows from Hecke’s theory. In 1939,
Rankin and Selberg (independently) introduced a powerful method into the theory
of numbers and as a consequence established that
(s − 1)ζ(s)L2 (s)
extends to an entire function. It was Shimura who using the theory of modular
forms of half-integral weight managed to isolate L2 (s) and established an analytic
continuation for all s ∈ C. By the powerful methods of the Langlands program,
Shahidi has established the analytic continuation for L3 (s), and L4 (s) to Re(s) ≥ 1.
(See [Sh].) In some cases, he has obtained better results establishing a meromorphic
continuation and defining sets where possible poles may exist. Since we do not need
these here, we will not go into these details.
Ogg [Ogg2 ] has shown that if for each r ≤ 2m, Lr (s) has an analytic con-
tinuation to Re(s) > 1/2 − δ for some δ > 0, then Lm (1 + it) = 0. K. Murty
[VKM2] showed that it suffices to have analytic continuation of each Lm (s) up to
Re(s) ≥ 1.
where d(n) is the divisor function. It is known that the maximum order of d(n)
satisfies
d(n) = O(exp(c log n/ log log n))
for some constant c > 0. Therefore,
k−1 c log n
an = O(n 2 exp ).
log log n
We would like to know if this is best possible. This question has a long history.
Before we discuss the past accomplishments on this question, we recall the Ω
notation.
Let g be a positive function and f any function. We say
f (x) = Ω(g(x))
if there is some constant c > 0 such that |f (x)| > cg(x) for infinitely many x→∞.
We also write
f (x) = Ω± (g(x))
if there exists a constant c > 0 such that
Theorem 8.1 For any normalized newform f of weight k and level N , there is a
constant c > 0 such that
k−1 c log n
an = Ω± n 2 exp .
log log n
In view of the above discussion, this is best possible apart from the value of
the constant c > 0.
For any cusp form of weight k, he also established the following result:
Theorem 8.2 For any cusp form f of weight k and level N , there is a constant
c > 0 such that
k−1 c log n
an = Ω n 2 exp .
log log n
To prove these, we first need the non-vanishing of L3 (s) and L4 (s) on Re(s) =
1. To do this, we prove a slightly more general theorem:
Theorem 8.3 If Lr (s) has an analytic continuation up to Re(s) ≥ 1/2 for 1 ≤
r ≤ 2m, then L2m−1 (1 + it)
= 0 for t ∈ R. If L2r (s) has an analytic continuation
up to Re(s) = 1 for 1 ≤ r ≤ m, then L2m (1 + it) = 0.
Corollary 8.4 L3 (1 + it)
= 0 and L4 (1 + it)
= 0 for all t ∈ R.
Proof of the Corollary. By the history described at the end of the previous sec-
tion, we know that L1 (s), L2 (s), L3 (s) and L4 (s) extend to analytic functions for
Re(s) ≥ 1. The result now follows from the theorem.
Proof of Theorem 8.3. We first show that L2m (1 + it)
= 0. Consider
f (s) = L0 L2 · · · L2m .
1 sin(n + 12 )θ
+ cos θ + cos 2θ + · · · cos nθ =
2 2 sin(θ/2)
and
sin nθ
cos θ + cos 3θ + · · · cos(2n − 1)θ =
2 sin θ
sin(r + 1)nθp
we see that
1
log Lr (s) = .
n,p
sin nθp npns
Therefore, as
2
sin 3θ sin 5θ sin(2n − 1)θ sin nθ
1+ + + ··· + = ,
sin θ sin θ sin θ sin θ
§8 Oscillations of Fourier coefficients of newforms 87
we find that log f (s) is a Dirichlet series with non-negative coefficients. Moreover,
the Euler product shows that f (s) does not vanish in σ > 1. An application of
Theorem I.1.2 with e ≤ 1 gives the result. Now consider,
Since
∞
2
sin(m + 1/2)θ
2m + 1 + 2(j + 1) cos(2m − j)θ = ,
j=0
sin(θ/2)
we see that log g(s) is a Dirichlet series with non-negative coefficients. If L2m−1 (1+
it) = 0, with t
= 0, then g(s) has a zero of order ≥ 2 on Re(s) = 1, s = 1. As g(s)
has a pole of order 2 at s = 1, we get a contradiction again by Theorem I.1.2. We
need to consider L2m−1 (1) = 0. If this happens, then g(s) is regular. By a well-
known theorem of Landau (see exercise 5 in Chapter 1), we find that log g(s) has a
singularity at its abscissa of convergence. As L0 (s) = ζ(s) has zeros in Re(s) ≥ 1/2,
g(s) has zeros in this half-plane. Therefore the abscissa of convergence of log g(s)
is σ0 ≥ 1/2, and as g(s) is analytic in Re(s) ≥ 1/2, σ0 is a zero of g(s). But then
g(σ) ≥ 1 for σ > σ0 . We get a contradiction by letting σ→σ0+ . This completes the
proof of the theorem.
Theorem 8.4 Suppose Lr (s) has an analytic continuation up to Re(s) ≥ 1/2 for
all r ≤ 2m + 2. Then
(i) for r ≤ m + 1,
1 2r x
(2 cos θp )2r = (1 + o(1))
r+1 r log x
p≤x
as x→∞, and
(ii) for r ≤ m,
(2 cos θp )2r+1 = o(x/ log x)
p≤x
as x→∞.
88 Chapter 4 Modular Forms and Dirichlet Series
Proof. By Theorem 8.3, we know that Lr (s) does not vanish on the line σ = 1.
Observe further that
sin(r + 1)θp i(r−2j)θ
r
= e .
sin θp j=0
and
T1 (cos θp ) = o(x/ log x),
p≤x
By the above results, the inner sum is o(x/ log x) unless r − 2k = 2 or 0. Hence,
(ii) is deduced. In the case of (i), we find that
2r 2r x
(2 cos θp ) = −
2r
+ (1 + o(1))
r−1 r log x
p≤x
as x→∞. The term in the brackets is easily seen to be the coefficient stated in (i).
To prove the omega theorem, we will need a few preliminary combinatorial
identities. We collect them below and leave them as exercises.
§8 Oscillations of Fourier coefficients of newforms 89
Proof. Exercise.
Theorem 8.6 Suppose that Lr (s) has an analytic continuation up to Re(s) ≥ 1/2
for all r ≤ 2m + 2. Then, each of the statements holds for a set of primes of
positive density:
2
(i) for any δ > 0, −δ < 2 cos θp < ,
δ(m + 2)
)
4m + 2
(ii) for any > 0, |2 cos θp | > − ,
m+2
2m+1
1
1 2m + 2
(iii) for any > 0, 2 cos θp > βm − , where βm = .
4(m + 2) m + 1
There is a corresponding result for negative values of ap .
Proof. For (i), consider the polynomial
where α, β will be suitably chosen later. By Theorem 8.4 and Lemma 8.5, we
deduce
log x 2m + 2 αβ 1
Pm (2 cos θp ) ∼ (−1)m + .
x m+1 2 m+2
p≤x
2 2
αβ > − , if m is even and αβ < − ,
m+2 m+2
§9 Rankin’s theorem
Rankin [R] proved the following theorem:
Theorem 9.1 Let f be a normalized newform which is a cusp form of weight k
with respect to Γ0 (N ). Let an denote the n-th Fourier coefficient and write
a(n) = an /n(k−1)/2 .
Given β ≥ 0, let
2β−1 β
F (β) = (2 + 32−β ) − 1.
5
Then
|a(n)|2β x(log x)F (β) .
n≤x
In particular, if k = 2, then
|a(n)| x(log x)−1/18 .
n≤x
Exercises
1. Show that
1
[SL2 (Z) : Γ(N )] = N 3 1− 2
p|N
p
p prime
2. Show that:
(i) Γ0 (N ) is not a normal subgroup of SL2 (Z) if N > 1.
.
(ii) Γ0 (N ) has index N p|N 1 + p1 in SL2 (Z).
(iii) Γ1 (N ) is not normal in SL2 (Z) but is normal in Γ0 (N ). Compute its
index.
(iv) γ ∈ SL2 (Z) has finite order if and only if γ is elliptic.
|a(n)| x(log x)−c
n≤x
for some positive constant c. Deduce a corresponding result for negative os-
cillation.
7. Given any cusp form f of level 1, show that there are numbers mi which are
not all zero so that
r
mi Ti (f )
i=1
is an eigenform. Here, Ti denotes the i-th Hecke operator and r is the dimen-
sion of the space of cusp forms of weight k. Now deduce Theorem 8.2 for any
cusp form of level 1.
92 Chapter 4 Modular Forms and Dirichlet Series
References
[CO] H. Cohen and J. Oesterlé, Dimensions des espaces de formes modulaires,
Lecture Notes in Mathematics, 627(1977), 69–78.
[D] W. Duke, The dimension of the space of cusp forms of weight one, Int. Math.
Res. Not., 1995, pp. 99–109.
[F] G. Frey, Construction and arithmetical application of modular forms of low
weight, in: Elliptic curves and related objects, pp. 1–21, eds. H. Kisilevsky
and R. Murty, CRM Proceedings and Lecture Notes, vol. 4, Amer. Math.
Soc., Providence, 1994.
[Kn] A. Knapp, Elliptic Curves, Princeton University Press, 1992.
[La] S. Lang, Introduction to modular forms, Grundl. Math. Wiss. 222, Springer-
Verlag, Berlin, New York 1976.
[Ogg] A. Ogg, Survey of modular functions of one variable, in: ‘Modular functions
of one variable I’, (Ed. W. Kuijk), Lecture Notes in Math., 320 (1972) pp.
1–36, Springer-Verlag.
[Ogg2] A. Ogg, A remark on the Sato-Tate conjecture, Inventiones Math., 9 (1970)
p. 198–200.
[RM] M. Ram Murty, Oscillations of Fourier coefficients of modular forms, Math.
Annalen, 262 (1983) p. 431–446.
[R] R. Rankin, Sums of powers of cusp form coefficients II, Math. Ann., 272
(1985) p. 593–600.
[VKM] V. Kumar Murty, Introduction to abelian varieties, CRM Monograph
Series, Volume 3, 1993, American Math. Society, Providence, USA. Chapters
9–11.
[VKM2] V. Kumar Murty, On the Sato-Tate conjecture, in: Number Theory re-
lated to Fermat’s Last Theorem, (ed. N. Koblitz), Birkhäuser-Verlag, Boston,
1982, pp. 195–205.
[Sh] F. Shahidi, Symmetric power L-functions for GL(2), in: Elliptic curves and
related objects, (ed. H. Kisilevsky and M. Ram Murty), CRM Proceedings
and Lecture Notes, Vol. 4 (1994) pp. 159–182.
[Shi] G. Shimura, Introduction to the arithmetic theory of automorphic functions,
Publ. Math. Soc. Japan, vol. 11, Iwanami Shoten, 1977.
Chapter 5
Dirichlet L-Functions
§1 Introduction
Let χ denote a Dirichlet character and L(s, χ) the associated Dirichlet L-function.
Let us begin by considering how one would approach the problem of showing that
L( 12 , χ)
= 0. In the following, we assume that χ is defined modulo a prime q. We
first study the average
1
L( , χ).
2
χ(mod q)
1 χ(n)
L( , χ) ∼ √ .
2 √ n
n< q
Hence,
1 χ(n)
L( , χ) ∼ √ ∼ φ(q).
2 χ
√ n
χ(mod q) n< q
Similarly,
1 χ(n1 )χ̄(n2 )
|L( , χ)|2 ∼ √
2 χ n ,n <q
n1 n2
χ(mod q) 1 2
1
∼ φ(q)
n<q
n
∼ φ(q) log q.
Therefore,
2
1
φ(q)
2
L( , χ)
2
⎛ ⎞
1
|L( , χ)|2 ⎝ 1⎠
2 1
L( 2 ,χ)=0
⎛ ⎞
φ(q)(log q) ⎝ 1⎠
L( 12 ,χ)=0
and so
1 φ(q)
#{χ mod q : L( , χ)
= 0} .
2 log q
This kind of argument can in fact be made precise. The reader is referred to the
paper [B] of Balasubramanian where only a slightly weaker result is proved.
We may try to consider higher moments as well. In general, one might expect
that for some c > 0 depending on k,
1 dk (r)2
|L( , χ)|2k ∼ cφ(q)
2 k
r
r<q
ζ(s)k ∼ (s − 1)−k .
2 2
r<X
If we assume this and try to apply the above argument, we do not seem to get
anything better. Of course, (∗) is still very highly conjectural. The largest value
for which an asymptotic formula is known is k = 2, and prime q. This is a result
of Heath-Brown [HB].
In spite of our difficulty in showing that L( 12 , χ)
= 0 holds often, no example
is known for which L( 12 , χ) = 0. Siegel [Sie] has shown that any point on the
line σ = 12 is a limit point of zeroes of the L(s, χ) as χ ranges over all Dirichlet
characters. More precisely, he shows that for any [T1 , T2 ], with T2 −T1 (log q)− 2 ,
1
1 1 1
σ∈[ , + ], t ∈ [T1 , T2 ].
2 2 log log q
1
L( , χD )
2
' (
where χD is the real character given by D. . From this, one can deduce the non-
vanishing of infinitely many such L-functions. In §5, we describe the work of R.
Balasubramanian and V. K. Murty [BM] which considers Dirichlet L-functions to a
prime modulus q. Here, it is proved that for a positive proportion of the χ(mod q),
we have L( 12 , χ)
= 0. Finally in §6, we briefly describe the work of R. Murty [RM]
which establishes a strengthening of §5 assuming the Riemann Hypothesis .
§2 Polya-Vinogradov estimate
In this section, we shall describe an estimate for character sums involving a Dirich-
let character. Let q ≥ 1 be an integer and let χ denote a character mod q. We shall
consider a sum of the form
χ(n)
n≤x
or more generally,
χ(n).
y≤n≤x
96 Chapter 5 Dirichlet L-Functions
If χ is the trivial character, then this is best possible. However, for nontrivial χ it
is possible to do substantially better.
Theorem 2.1 We have the estimate
1
χ(n) q 2 log q.
y<n≤x
1
q
an
χ̄(a) e( ).
g(χ̄) a=1 q
y<n≤x
Since the inner sum is a geometric series, it can be evaluated exactly and we find
it to be
(M + 12 N + 12 )a sin πN a/q
e
q sin πa/q
where M = [y] + 1 and N = [x] − [y] − 1. Hence, using the fact that the Gauss
1
sum g(χ̄) has absolute value q 2 we deduce that
q−1
1
χ(n)| ≤ q− 2
1
| .
a=1
sin πa/q
y≤n≤x
q−1
1 1
[q/2]
<1+q < 1 + q log q/2.
a=1
sin πa/q a=1
a
§3 Jutila’s character sum estimate 97
Thus 1
| χ(n)| < q 2 log q.
y≤n≤x
The proof of the estimate in the case that χ is not primitive is left as an exercise.
One can ask whether the estimate provided by the Polya-Vinogradov inequal-
ity is best possible. Montgomery and Vaughan [MV] have shown that assuming
the Riemann Hypothesis for Dirichlet L-functions, we have the sharper estimate
1
χ(n) q 2 log log q.
n≤x
It is possible to remove the factor log q entirely if we work with even characters
and integrate over the summation parameter. For a character of any parity, it is
possible to eliminate the log q factor if we integrate and take the mean square.
These observations play a crucial role in Chapter 6.
where the outer sum is over D which belong to the set D of integers satisfying:
(i) D is not a square and (ii) D ≡ 1(mod 4) or D = 4N with N
≡ 0(mod 4). Note
that if (D/·) is a non-principal character, then D ∈ D.
If we apply the Polya-Vinogradov estimate to the inner sum, we find that it
is
|D|(log |D|).
From this, we deduce that
The main tool in the proof of Theorem 3.1 is a Lemma of Vinogradov [Vino]
which gives an explicit and effective smooth approximation to the characteristic
function of an interval. We state the lemma as follows.
1
0<δ< , δ ≤ β − α ≤ 1 − δ.
2
Then there exists a periodic function ψ(x) with period 1, and satisfying
(i) ψ(x) = 1 in the interval [α + 12 δ, β − 12 δ]
(ii) ψ(x) = 0 in the interval [β + 12 δ, 1 + α − 12 δ]
(iii) 0 ≤ ψ(x) ≤ 1 in the intervals [α − 12 δ, α + 12 δ] and [β − 12 δ, β + 12 δ]
(iv) ψ(x) has the Fourier expansion
∞
ψ(x) = β − α + (am e(mx) + ām e(−mx))
m=1
where
sin( 12 πmδ) 2
am = (2πim)−1 (e(−mα) − e(−mβ)){ 1 }
2 πmδ
and
|am | min(β − α, m−1 , δ −2 m−3 ).
In the proof of Jutila’s estimate, one uses not a single function ψ but a
family of such functions. Denote by ψ(x, u) the function ψ(x) of Lemma 3.2 with
the parameters
1 1
δ = X − 2 , α = δ, β = Y u−1 + δ.
1
2 2
Then one has the estimates
∂ψ(x, u)
δ −1
∂x
and
dψ(n/u, u) X
2.
du u
Moreover
dam,u Y
2 min(1, m−2 δ −2 ).
du u
§3 Jutila’s character sum estimate 99
Proof of Theorem 3.1. First, we observe that the estimate is easy in certain ranges
of the parameters. For example, the case Y ≥ X/4 follows from the Polya-
1
Vinogradov estimate and the case Y ≤ X 2 also follows easily and is left as an
1
exercise. We are now left with the essential range X 2 ≤ Y ≤ X/4. Moreover, the
Polya-Vinogradov estimate also implies that
√
S( XY , Y ) XY (log XY )2 .
To estimate this, first consider the case when rs is a square. Using the estimates
provided by Vinogradov’s lemma, we have to consider
Y 1 Y 1
2 |D| min min .
|D| r |D| s
r<s≤X 2
1 |D|≤X
rs=t2
The inner sum over D is split into three subsums depending on the value of the
minima, namely sY < |D| ≤ X, rY < |D| ≤ sY and |D| ≤ rY . Consider the first
range. The sum is
Y2
|D|
1
|D|2
r<s≤X 2
sY <|D|≤X
rs=t2
X 1
Y 2
log + O( ) (3.2)
r<s<X/Y
Ys Ys
rs=t2
X 1
Y2 log 1 + O(Y 2 ).
Ys t<s
Ys
s≤X/Y s≤X/Y
s0 |t
§3 Jutila’s character sum estimate 101
Here s0 is defined as follows: if pa ||s then pb ||s0 where b is the least integer ≥ a/2.
It is easy to show that
∞
n/n0
= ζ(2s − 1)ζ(s)/ζ(2s).
n=1
ns
Hence,
n/n0 x(log x).
n≤x
Inserting this into (3.2), using partial summation, and observing that the O term
is O(Y log X), we deduce that (3.2) is
s X
Y2 log XY (log X/Y ).
s0 Ys
s≤X/Y
For the sum in the range rY < |D| < sY we have an estimate
Y 1 1 1 s
|D| XY XY
|D| s s s s0
r<s≤X 2
1 |D|≤X r<s<X 2
1 1
s<X 2
rs=t2 rs=t2
XY log X
by estimates similar to the ones described above. Finally, in the remaining range
|D| ≤ rY , we have that the sum is
1 Y (log X)2
|D| 1 |D|
t2 |D|
|D|≤X |D| 1 2
rs=t |D|≤X
Y <t<X
2
XY (log X) . 2
Let p(n) = 0 if n is a square and p(n) = 1 otherwise. Then the part in consideration
is by partial summation
X *
p(rs) gr,s (X)fr,s (X) − √ gr,s (u)fr,s (u)du .
1 XY
r,s≤X 2
102 Chapter 5 Dirichlet L-Functions
When rs is not a square, the estimates above for fr,s and fr,s are replaced by
and
X 2 Y −1
fr,s (u) (r + s−1 ) + X 2 (rs)−3 .
ur2 s2
Inserting these, one finds that the contribution of S12 is
(rs)1/2 (log X)(X 3 (rs)−3 + X 2 Y (log X/Y )(rs)−2 (r−1 + s−1 ))
√
r,s> X
and this is
XY (log X)2 .
For the nonsquare terms, we have by partial summation that the sum is
*
r s X r s
p(rs) ψ( ,X)ψ( ,X)gr,s (X) − √ gr,s (u)d ψ( ,u)ψ( ,u) .
1
X X XY u u
Y <r,s<Y +X 2
Now √
s
ψ( , u) = 0 if u < (s − Y ) X.
u
By the estimate
dψ( nu , u) X
2
du u
and the Polya-Vinogradov estimate, we see that
X
X
r s √ X
√ gr,s (u)d ψ( , u)ψ( , u) rs log X √ √ 2
du.
XY u u max((s−Y ) X, XY ) u
√
√ √ s
r X log X √
√ √ max(s − Y, Y )
Y <r<Y + X Y <s<Y + X
and this is
⎧ ⎫
⎨ √ √ ⎬
√ √ s s
r X log X √ +
√ ⎩ √ Y √ √ s−Y ⎭
Y <r<Y + X Y <s<Y + Y Y + Y <s<Y + X
XY (log X)2 .
d d
=| μ(δ) |2
δ m
δ|e m≤Y δ −1
d
≤ d(e) | |2 .
−1
n
δ|e n≤Y δ
Hence, for fixed values of d and δ with |d|δ ≤ X we have a contribution to S(X, Y )
an amount which is
√ √
d d X X
≤| |2 d(e)| |2 d(δ) log .
n≤Y δ −1
n
e2 ≤X|d|−1 n≤Y δ −1
n |d|δ |d|δ
δ|e
Applying the result for fundamental discriminants and using partial summation,
we deduce that this is √ )
X XY X
d(δ) (log )2 .
δ δ δ δ
Now a summation over δ accomplishes the proof of Theorem 3.1.
§4 Average value of L 12 , χD
In this section, we outline the proof of Jutila [Jut2] of the mean and mean-square
of L( 12 , χD ). The results are as follows.
Theorem 4.1 We have
∗ 1
L( , χd ) = c1 Y log Y + c2 Y + O(Y 3/4+ )
2
0<d≤Y
where
3 1
cn = 2
(1 + )−1 .
π p
p|n
Y 1+s 1
fY (n, s) = cn + O((|s| + 1)d(n)Y 2 +σ ).
1+s
We apply this first to S. Indeed, in S the squares contribute an amount
∞
(cm Y + O(Y 2 d(m2 ))) exp(−m2 /X)m−1
1
m=1
∞
cm 1
=Y exp(−m2 /X) + O(Y 2 (log X)3 ).
m=1
m
for some constant c. For non-square values of n, we apply the estimate of Exercise
3, namely 2
∗ d
N Y (log N )4 . (4.1)
n
1<n≤N 0<d≤Y
Here the outer sum ranges over non-square integers and the inner sum over fun-
damental discriminants. By Cauchy’s inequality, we get
∞ ∞
−1/2 1 1
fY (n) exp(−n/X)n ( fY (n)2 exp(−n/X)) 2 (log X) 2 .
n=1 n=1
Now using (4.1) and partial summation, we deduce that the right hand side is
Thus
S = c1 Y log X + c Y + O(Y 2 X 2 (log X)5/2 )
1 1
where
3 1
c1 = 1 −
=0
2π 2 p p(p + 1)
'1 (
§4 Average value of L 2 , χD 107
where
1
P (s) = 1− ,
p
(p + 1)ps
we see that the above integral gives
s Γ( 4 − 2 )
1 s
1 Y 1−s 3
Γ(s)X s ds + O(Y 1+ X − 2 − ).
1
− ζ(1 − 2s)P (1 − 2s)π
2πi (− 12 − ) 1 − s π 2 Γ( 4 + 2s )
1
π 2
for some constant c . Next, we observe that when summation is restricted to the
non-squares, the integrand is
∞
−1/2−
exp(−|t|)X |fY (n, −s)|n−1−
n=1
n =m2
where t is the imaginary part of s. Using the above character sum estimate (4.1)
and partial summation, it follows that for σ ≥ 0 we have
|fY (n, s)|2 (|s| + 1)2 N Y 1+2σ (log N )4 .
n≤N
n =m2
2
π 2
1
Now choosing X = Y 2 gives the result. This proves Theorem 4.1.
For the proof of Theorem 4.2, we need several lemmas.
108 Chapter 5 Dirichlet L-Functions
where the sum is taken over integers n which are not squares and fundamental
discriminants d.
Proof. Let us consider the double sum where the inner sum is taken over d ≡
1(mod 4). The other cases are similar. The sum in question is
⎛ ⎞
m
d(n) ⎝ μ(a)⎠
n
n≤X m≤Y 2 a |m
m≡1(mod 4)
(2) (a) h
= μ(a)χ0 (a) d(n)χ0 (n)
1 h≤Y /a2
n
n≤X
a≤Y 2
h≡1(mod 4)
= S1 + S2
where the sum S1 is over a ≤ Z = (log XY )B for a suitable constant B > 0 and
S2 is the remainder. Applying Cauchy’s inequality twice, we get
⎛ ⎞ 12 ⎛ 2 ⎞ 12
⎜ h ⎟
S1 ≤ ⎝ a−1 ⎠ ⎝
(a)
a d(n)χ0 (n) ⎠
n
a≤Z a≤Z n≤X h≤Y a−2
h≡1(mod 4)
⎧ ⎛ 2 ⎞⎫ 12
⎨ h ⎬
d(n)2 ⎝ ⎠
1
(log Z) 2 a
⎩ a n n
n ⎭
h
⎛ ⎞1
2 2
h ⎠
X 2 (log X)3/2 (log log XY ) 2 ⎝
1 1
a .
a n
n
h
In S2 we sum first over n using Lemma 4.4. Taking into account also the values of
h which are squares, we get
(Y a−2 X 2 d(a)2 (log XY )17 + Y 2 a−1 X(log X))
1 1
S2
1
Z<a≤Y 2
1 1
Y X (log XY )20−B + Y 2 X(log XY )2 .
2
For the integral, the sum over the non-square values of n is split at a point U ≥ Y
say. For the initial segment, the line of integration is moved to −η (say) for some
0 < η < 12 . This gives an estimate
η
1 1 Y2
U Y 2 2 (log U Y )5/2 (log log U Y ).
UX
For the tail, one again uses Lemma 4.5 and partial summation to get an estimate
3/4
1 1 Y2
U 2Y 2 (log U Y )5/2 log log U Y.
UX
110 Chapter 5 Dirichlet L-Functions
Finally, it is easy to check that the squares in the sum over n in S(X, Y ) and in
I(X, Y ) give a total contribution of
where
1 4p2 − 3p + 1
c= 2 1−
= 0.
8π p p4 + p3
#{χ(mod q) : L(1/2, χ)
= 0} ≥ cφ(q)
where c ≥ .04.
The assumption on q can be weakened significantly but only at the cost of
making the proof more intricate due to the presence of imprimitive characters.
To prove the theorem, we have to consider a mollifier polynomial
M (s, χ) = λ(n)χ(n)n−s
n≤Z
Observe that
a(1) = 1, a(n) = 0 f or 1 < n ≤ Y.
§5 Non-vanishing for a positive proportion of characters, I 111
On the other hand, moving back the line of integration, we see that it is
1 1 1 1 1
L( , χ)M ( , χ) + L( + w, χ)M ( + w, χ)X w Γ(w)dw.
2 2 2πi (−η) 2 2
Applying the functional equation in the integral, we get
1 1 1 1
L( − w, χ̄)M ( + w, χ)γ( + w, χ)X w Γ(w)dw
2πi (−η) 2 2 2
where 12 s− 12
g(χ) 2 2π π
γ(s, χ) = 1 sin (a + s) Γ(1 − s)
ia q 2 π q 2
where a = 0, 1 and χ(−1) = (−1)a and g(χ) is the Gauss sum determined by χ.
In applying the functional equation, we are assuming that χ is primitive. Now, we
can expand L( 12 − w, χ̄) as a Dirichlet series provided η > 12 . We split this series
at Z and so we get two integrals and the formula
1 1
S(χ) = L( , χ)M ( , χ) + I(χ) + J(χ).
2 2
Now we compare mean and mean square estimates for S(χ), I(χ) and J(χ) and
show that they are incompatible with the frequent vanishing of L( 12 , χ). Choosing
X = q, we prove the following estimates:
S(χ) ∼ φ(q) (1)
5
|S(χ)|2 < φ(q) (2)
2
|I(χ)| ∼ cφ(q)
2
(3)
where
4 2 1
c= 1 + 1 − ∼ .374750
π 2 p>2 (p − 2)(p + 1) p>2 (p − 1)2
and q
|J(χ)| . (4)
log q
112 Chapter 5 Dirichlet L-Functions
What is the role of the specific mollifier weights in all of the above estimates? The
key points are that by an important result of S. Graham [Gra], we have
N log N/Y
Y <N <Z
log2 Z/Y
|a(n)|2 ∼
n≤N
N
log Z/Y Z≤N
and one uses this as well as a partial generalization of it to arithmetic progressions.
A second key point is that if we sum instead
a(n)a(n − k)
we save an extra logarithm over the above estimates.
Using (1)–(4), let us see how to deduce Theorem 5.1. We have
S(χ) = (I(χ) + J(χ)) + S(χ) ∼ φ(q).
χ:L( 12 ,χ)=0 χ:L( 12 ,χ)=0
Now,
| I(χ) + J(χ)| ≤ |I(χ)| + |J(χ)|
χ:L( 12 ,χ)=0
1/2 q
≤ φ(q)1/2 |I(χ)|2 + O( )
log q
√
cφ(q).
Therefore, √
S(χ) (1 − c)φ(q).
χ:L( 12 ,χ)=0
Thus,
1 2 √
#{χ : L( , χ)
= 0} (1 − c)2 φ(q).
2 5
Now we shall give a few more details on this outline. We begin with two
results on the Barban-Vehov weights .
Let 1 ≤ z1 ≤ z2 . Following Barban and Vehov [BV], we introduce the func-
tions
μ(n) log(zi /n) if n ≤ zi
Λi (n) =
0 if n > zi ,
for i = 1, 2. We also define
⎧
1 ≤ n ≤ z1
Λ2 (n) − Λ1 (n) ⎨
μ(n)
log(z2 /n)
λ(n) = = μ(n) log(z z1 ≤ n ≤ z2 (5.1)
log(z2 /z1 ) ⎩ 2 /z1 )
0 n > z2 .
§5 Non-vanishing for a positive proportion of characters, I 113
Let us define
a(n) = λ(d).
d|n
Graham [Gra] has found asymptotic estimates for the mean square of the a(n).
We recall his main result.
Proposition 5.2 We have
⎧
⎨ N log(N/z
2
1)
+ O 2
N
if z1 < N < z2
|a(n)|2 =
log (z2 /z1 )
log (z2 /z1)
⎩ N N
if z2 ≤ N.
n≤N log(z2 /z1 ) + O log2 (z2 /z1 )
The proof will require two preliminary results. We begin by recalling a result
from Graham [Gra, Lemma 2].
Lemma 5.5 For any integer r, and any c > 0,
μ(n)
Q r ' (
log = + Oc σ− 12 (r) log−c (2Q) .
n≤Q
n n φ(r)
(n,r)=1
The same estimate holds even if we drop the condition that (j1 , j2 ) = 1.
114 Chapter 5 Dirichlet L-Functions
⎧ ⎫
⎪ ⎪
μ(e) ⎨ μ(1 ) log(z1 /d1 e1 ) ⎬
= μ(d1 )μ(d2 )
e≤z1 /d1
e2 ⎪
⎩ 1 ≤z1 /d1 e
1 ⎪
⎭
(e,dr)=1 (1 ,d1 er1 )=1
⎧ ⎫
⎪
⎨ ⎪
μ(2 ) log(z2 /d2 e2 ) ⎬
×
⎪
⎩ 2 ≤z2 /d2 e
2 ⎪
⎭
(2 ,d2 er2 )=1
2
μ(e) dk erk −c 2zk
= μ(d1 )μ(d2 ) + Oc σ−2
1 (d er
k k ) log
e2 φ(dk erk ) dk e
e≤z1 /d1 k=1
(e,dr)=1
We see that the inner sum is zero unless (d, e)|k and if (d, e) divides k it is
N −M
+ O(1).
[d, e]
Thus, (5.3) is
Λ1 (d)Λ2 (e)
(N − M ) + +O( |Λ1 (d)Λ2 (e)|) (5.4)
d,e
[d, e] d,e
(d,e)|k (d,e)|k
Also note that in the outer sum m must be squarefree for otherwise Λ1 (md0 ) =
Λ2 (me0 ) = 0. Thus, invoking Lemma 5.6, we find that the main term in (5.5) is
μ2 (m)φ(m) m 2
+ σ− 12 (m)
m2 φ(m)
m|k
k
.
φ(k)
Hence the main term in (5.4) is
k
N.
φ(k)
Summarizing, (5.3) is
k
N + z1 z2 .
φ(k)
The proposition follows.
Next, we discuss the mollifier polynomial. We introduce the parameters
Y = (log q)
.
Z = q 1/2
Corresponding to the choices z1 = Y and z2 = Z, we have the weights
Λ2 (n) − Λ1 (n)
λ(n) = .
log(Z/Y )
We define the Dirichlet polynomial
λ(n)χ(n)
M (s, χ) =
ns
n≤Z
where
a(n) = λ(d)
d|n
satisfies
a(1) = 1
a(n) = 0 for 1 < n ≤ Y.
We record the following estimate.
§5 Non-vanishing for a positive proportion of characters, I 117
(q + Z) 1
· (q 2 −σ ·
1
|M (s, χ)|2 + Y 1−2σ ).
(1 − 2σ) (log q)2
χ(mod q)
|λ(n)|2
|M (s, χ)|2 (Z + q)
n2σ
χ(mod q) n≤Z
⎧ ⎫
⎨ 1 log Z/n 2 1 ⎬
(q + Z) + · 2σ
⎩ n2σ log Z/Y n ⎭
n≤Y Y <n≤Z
1−2σ
Y Z 1−2σ 1
(q + Z) + · .
1 − 2σ 1 − 2σ (log Z/Y )2
where 12 s− 12
g(χ) 2 2π π
γ(s, χ) = 1 sin (a + s) Γ(1 − s).
ia q 2 π q 2
(Here g(χ) is the Gauss sum, a = 0, 1 and χ(−1) = (−1)a .) Then we truncate the
Dirichlet series expansion of L(1 − s − w, χ) at Z. Let us set
*
1 χ(n)
I(s, χ) = γ(s + w, χ) M (s + w, χ)q w Γ(w)dw
2πi (−η) n1−s−w
n<Z
and
⎧ ⎫
⎨
1 χ(n) ⎬
J(s, χ) = γ(s + w, χ) M (s + w, χ)q w Γ(w)dw
2πi (−η) ⎩ n1−s−w ⎭
n≥Z
Thus, we get
S(s, χ) = L(s, χ)M (s, χ) + I(s, χ) + J(s, χ). (5.7)
If L(s, χ) = 0, then S(s, χ) and I(s, χ) + J(s, χ) are equal. We will therefore
try to show that, in general, they are not equal and for this purpose we study their
mean values. We begin with J(s, χ) which is the easiest of the three to estimate.
Proposition 5.8 For | Im s| < 1 and 0 ≤ σ ≤ 1, we have
q 2 −σ
3
|J(s, χ)|
log q
1=χ(mod q)
χ(n)
| ||M (s + w, χ)||Γ(w)||dw|
n1−s−w
n≥Z
Using the large sieve inequality and Lemma 5.7, we find that
⎧ ⎫ 12
⎨ ⎬
|J(s, χ)| q 2 −σ
1
(q + n)n2(σ−η−1) ×
⎩ ⎭
1=χ(mod q) n≥Z
* 12
q 2 −σ+η
1
(q + Z)
× ·( +Y 1−2(σ−η)
)
1 − 2(σ − η) (log q)2
12
1
−σ q 1 1
q2 Z σ−η
+ ×
Z |2(σ − η) − 1| |σ − η|
*
q 2 ( 2 −σ+η)
1 1 1
(q + Z) 2
× 1 .
|2(σ − η) − 1| 2 log q
Moreover, the same estimate holds if we sum only over non-trivial characters.
120 Chapter 5 Dirichlet L-Functions
∞
a(n) −n/q
S(s, χ) = e χ(n)
n=1
ns
χ(mod q) χ(mod q)
∞
a(n) −n/q
= φ(q) e .
n=1
ns
n≡1(mod q)
The O-term is
q −σ+ .
It thus follows that
S(s, χ) = φ(q) + O (q 1−σ+ ).
χ(mod q)
Finally,
a(n)
S(s, 1) = e−n/q q 1−σ+
ns
(n,q)=1
2 2
χ(mod q)
which is seen to be
∞
a(n1 )a(n2 )
φ(q) 1 exp(−(n1 + n2 )/q), (6.1)
n1 ,n2 =1 (n1 n2 ) 2
§6 Non-vanishing for a positive proportion, II 121
We begin by considering the sum over n2 . We must necessarily have n2 > q for
if n2 ≤ q, then n1 ≤ q also and so the congruence n2 ≡ n1 (mod q) would force
n1 = n2 . We split Σ1 into three subsums Σ11 , Σ12 and Σ13 where
in Σ11 we have n2 ≥ q log q
in Σ12 we have q ≤ n1 < q log q and n1 < n2 < q log q.
in Σ13 we have n1 < q and q < n2 < q log q.
In Σ11 , we see, by partial summation, that the sum over n2 is
⎧ ⎫
∞ ⎪ ⎨ ⎪
⎬ 1
q−1 |a(n)| u− 2 e−u/q du.
q log q ⎪
⎩ n1 <n≤u ⎪
⎭
n≡n1 (mod q)
and this is ∞
(log q)−1/2 v 2 e−v dv
1
log q
−1
q .
Inserting this into the n1 -sum, using Proposition 5.2, the Cauchy-Schwarz inequal-
ity and partial summation, we have
1
q2 1
Σ11 1
(log q) q 2
q− 2 (log q)− 2 .
1 1
122 Chapter 5 Dirichlet L-Functions
Let us write n2 = n1 + jq. The above double sum may therefore be written
as
a(n1 )a(n1 + jq)
e−j e−2n1 /q 1 1
(6.3)
j<log q U <n1 ≤2U n12 (n1 + jq) 2
(n1 ,q)=1
a(kq)a((k + j)q)
e−j e−2k 1 1 . (6.4)
j<log q U <qk≤2U
(kq) 2 ((k + j)q) 2
|a(kq)| ≤ d(k) k .
A similar estimate holds for a((k + j)q). Using this in (6.4), we see that it is
e−2k
q−1 e−j
2 − (k + j) 2 −
1 1
j<log q U <qk≤2U
k
and this is
q−1 .
The sum in (6.3) may thus be replaced by
Let us set
G(u) = a(n1 )a(n1 + jq).
U <n1 ≤u
The sum over n1 in (6.5) can be estimated using partial summation. We find that
it is equal to
2U
G(u)e− q e− q
2u 2u
2U
1 + G(u)d .
u 2 (u + jq) 2 U
1 1 1
U u 2 (u + jq) 2
Using the estimate for G(u) quoted above, we see that this is
1
j (U + jq) 2 U
e−2U/q 1 (log q)−2 .
φ(j) U 2 q
Incorporating these estimates into the sum over j, we find that (6.5) is for σ
=1
(U + jq) 12 U 12 j −j
e−2U/q (log q)−2 e
q φ(j)
j<log q
which is
U 2 e−2U/q −j j
1
1
e (U + jq) 2
q(log q)2 φ(j)
j<log q
1
U 2 −2U/q
q (log q)−3/2
1
2 e .
q
Now summing this over U, we find it is
(log q)−3/2 .
The first factor above is O(1) as can be seen from our discussion of Σ3 below. As
for the second factor, we see that it is equal to
a(n2 )e−n2 /q a(n2 )e−n2 /q
.
(n2 ) 2
1 1
q<n2 ,n <q log q
2
n22
n2 ≡n (mod q)
2
Again, we split this sum into three sums according as n2 < n2 , n2 = n2 , and
n2 > n2 . The third is the same as the first. Also, we note that the first sum is just
Σ12 which we have estimated above as being
(log q)−3/2 .
(log q)−1 .
1 1
q
q log(z2 /z1 )
(log q)−1 .
The main term is equal to
a(n)2
.
n
Y <n<q
The first sum is equal to 1 since a(n) = 0 for 1 < n ≤ Y . Using Proposition 5.2
and partial summation, we see that the second sum is
(log n/Y ) 1
1
· + O .
(log Z/Y )2 n log Z/Y
Y <n≤Z
1 1
= +O .
2 log q
Similarly, the third sum is
1 1 1
+O
log Z/Y n log q
Z<n<q
which is
1
=1+O .
log q
Putting these together we deduce that
a(n)2
5 1
= 1 + O( ) .
n<q
n 2 log q
Now we can put together the above results to prove Theorem 5.1.
Proof of Theorem 5.1. Let s0 ∈ C satisfy 1
2 ≤ Re s0 < 1 − 1
logq . We return now to
(5.7). For χ
= 1, we have
Thus,
' (
S(s0 , χ) = I(s0 , χ) + J(s0 , χ) + S(s0 , χ)
χ=1
where ranges over χ
= 1 such that L(s0 , χ) = 0 and over the remaining
non-trivial χ(mod q). By Proposition 6.1, we have
' (
S(s0 , χ) = φ(q) + O q 1−σ+ .
χ=1
Thus, we have
' ( ' (
S(s0 , χ) = φ(q) − I(s0 , χ) + J(s0 , χ) + O q 1−σ0 +
and consequently,
' (
S(s0 , χ) ≥ φ(q) − | I(s0 , χ) + J(s0 , χ) |
' (
+ O q 1−σ+
Thus,
1
1 √ 1 log log q 2
| S( , χ)| ≥ (1 − c)φ(q) + O(q 2 ) + O q
+
2 log q
+ O(q(log q)−1 ).
2 2
χ(mod q)
We deduce that
/
2 √ 2 log log q 1
≤ N ( , q) 1 + O((log q)− 2 ) .
1
φ(q)(1 − c) + O q
5 log q 2
Thus, /
1 2 √ 2 log log q
N ( , q) ≥ φ(q)(1 − c) + O q .
2 5 log q
The methods of this section in fact prove a significantly stronger result [BM,
Theorem 11.1]. We state it below.
Theorem 6.4 Fix a σ in the interval 1
2 ≤ σ < 1. Then, for all sufficiently large
primes q,
L(σ, χ)
=0
for a positive proportion of the characters χ(mod q).
Remark. The proof produces a lower bound for this proportion. How large q
must be taken will depend on σ.
Finally, we state another result [BM, Theorem 12.1] which can be proved by
refining the techniques described above.
Theorem 6.5 Let q be a sufficiently large prime. There is an absolute constant
c > 0 such that for a positive proportion of the characters χ mod q, L(σ, χ)
= 0 in
the interval
1 c
+ ≤ σ ≤ 1.
2 log q
128 Chapter 5 Dirichlet L-Functions
§7 A conditional improvement
It is an interesting question to ask whether Theorem 5.1 can be strengthened by
assuming the Riemann Hypothesis. Of course, the Riemann Hypothesis tells us
that there are no zeroes for Re(s) > 12 , but it gives no direct information about
s = 12 . The following result is due to R. Murty [RM].
Theorem 7.1 Let q be a prime. Assume the Riemann Hypothesis for all the
Dirichlet L-functions L(s, χ). The number of characters χ mod q with L( 12 , χ)
=0
is at least 12 φ(q).
The proof depends on the explicit formula method. Let us write
∞
L
− (s, χ) = Λ(n)χ(n)n−s .
L n=1
Then
√
q 1 ∞ Γ
φ(γ) = 2F (0) log + φ(−i/2)δχ − (1 + it)φ(t)dt
γ
2π π −∞ Γ
∞
Λ(n)χ(n)
−2 F (log n)
n=1
n
where the sum on the left hand side is over γ such that L( 12 + iγ, χ) = 0 and
2 ≤ Re( 2 + iγ) ≤ 1 and
1 1
1 if χ is the principal character
δχ =
0 otherwise.
The following two lemmas are proved easily by straightforward calculations.
Lemma 7.3 Let T > 0 and define
F (x) = 2T − |x| if |x| ≤ 2T
0 otherwise.
Then F satisfies the conditions of Proposition 7.2 and
2
2 sin(γT )
φ(γ) = .
γ
Exercises 129
As we are assuming the Riemann Hypothesis, all of the γ are real. Let us set
1
T = log x.
2
Let rχ denote the order of zero of L(s, χ) at the point s = 12 . Then we have the
inequality
√ 1
rχ (log x)2 ≤ 2φ(q)(log x)(log q/2π) + 4x 2 + O(φ(q) log x) − 2φ(q)
χ
1 x
× Λ(n) log .
n≤x
n n
n≡1(mod q)
We can discard the last sum on the right as it is non-negative. Now choosing
x = φ(q)2
gives
1
rχ ≤ φ(q) + O(φ(q)/ log q).
χ
2
Exercises
1. Prove the Polya-Vinogradov estimate for imprimitive characters as follows.
Let χ mod q be induced from χ1 mod q1 and write q = q1 r. Then
χ(n) = χ1 (n).
n≤x n≤x
(n,r)=1
Now apply the Polya-Vinogradov estimate for the inner sum and deduce that
the above quantity is
1 1
q12 log q1 |μ(d)| q 2 log q1 .
d|r
1
2. Prove the estimate of Theorem 3.1 in the case Y ≤ X 2 as follows.
(i) First,
D
S(X, Y ) ≤ + d(n2 ) 1.
rs
r,s≤Y |D|≤X
2 2 n ≤Y |D|≤X
rs =a2
Show that
d(n2 ) Y (log Y )2
n≤Y
and rearrange as
D
μ(d) .
1 n≤Y
n
d≤Y 2
d2 |n
∞
cm 3 1 1
( log X + c) + O(X − 2 + ).
1
2
exp(−m /X) = 2 1−
m=1
m π p
p(p + 1) 2
T
1
|L( + it, χ)|2 XT (log XT )16
−T 2
|D|≤X
References
[B] R. Balasubramanian, A note on Dirichlet’s L-functions Acta Arith., 38
(1980), 273–283
[BM] R. Balasubramanian and V. Kumar Murty, Zeros of Dirichlet L-functions,
Ann. Scient. Ecole Norm. Sup., 25 (1992), 567–615
[BV] M.B. Barban and P.P. Vehov, On an extremal problem Trans. Moscow Math.
Soc., 18 (1968), 91–99
[D] H. Davenport, Multiplicative Number Theory, Springer-Verlag, 1980.
[Gra] S. Graham, An asymptotic estimate related to Selberg’s sieve J. Numb.
Thy., 10 (1978), 83–94
[HB] R. Heath-Brown, The fourth power mean of Dirichlet’s L-functions, Analysis,
1 (1981), 25–32
[Hil] A. Hildebrand, Large values of character sums, J. Numb. Thy., 29 (1988),
271–296
[Jut1] M. Jutila, On character sums and class numbers, J. Numb. Thy., 5 (1973),
203–214
132 Chapter 5 Dirichlet L-Functions
§1 Introduction
Statement of results
Let f be a holomorphic cusp form for Γ0 (N ) of weight 2 and character . We assume
that f is a normalized newform for the Hecke operators. Denote by L(s, f ) the
L-function attached to f. For Re(s) > 3/2, it is given by an absolutely convergent
Dirichlet series ∞
a(n)
L(s, f ) = .
n=1
ns
For L(s, f ) we have the functional equation
As Γ(s)L(s, f ) = ωA2−s Γ(2 − s)L(2 − s, f¯),
√
where A = N /2π, ω is a complex number of absolute value 1 and L(s, f¯) is
defined by
∞
ā(n)
L(s, f¯) =
n=1
ns
for Re(s) > 3/2 and by analytic continuation for all values of s. (This series
actually converges (conditionally) for Re(s) > 5/6. Thus, the functional equation
and the Dirichlet series serve to define L(s, f ) for all values of s.)
' (
For any D, let χD denote the quadratic character D. mod D. Let us set
∞
a(n)χD (n)
LD (s, f ) = .
n=1
ns
M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, 133
Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_7, © Springer Basel AG 1997
134 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
Notation
We begin by introducing some notation. If D0 is a fundamental discriminant which
is coprime to N, then we may write the functional equation in the unsymmetric
form
Γ(1 − s)
LD0 (1 + s, f ) = ωχD0 (−N ) (D0 )|D0 |−2s A−2s LD (1 − s, f¯).
Γ(1 + s) 0
Let us define μ̃ by
∞
1 μ̃(n, f )
= .
L(s, f ) n=1 ns
D
f˜Yβ (n, s; a) = |D|s , D unrestricted
0<βD≤Y
n
D≡a(mod 8N )
D0
fYβ (n, s; a) = |D0 |s , D0 fundamental
0<βD0 ≤Y
n
D0 ≡a(mod 8N )
f˜Yβ (n; a) = f˜Yβ (n, 0; a), fYβ (n; a) = fYβ (n, 0; a).
and Y
1
gYβ (n, s; a) = ft (n, s; a)dt.
Y 1
We shall write d(n) for the number of positive divisors of n. Also, b stands for a
generic integer. Thus the statement nd = b2 means that nd is not the square of
an integer. For an integer n we shall often write n = n1 n2 where (n2 , 2N ) = 1
and p|n1 =⇒ p|2N. The complex numbers s0 and s1 have real parts σ0 and σ1
(respectively). We set
log+ x = log x if x > 1
1 otherwise.
136 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
This is equal to
∞
a(m) D
exp(−m/X).
m=1
m m
On the other hand, moving the line of integration to the left, we obtain a residue
at s = 0 equal to
LD (1, f )
and an integral
1
LD (1 + s, f )X s Γ(s)ds.
2πi (−η)
Here, 0 < η < 1 and the integration is on the line Re(s) = −η.
In the above notation,
μ̃(d, f ) D0
LD (s, f ) = LD0 (s, f ) .
ds d
d|δ 2
Applying the functional equation for LD0 (s, f ) we see that the integral is
1
μ̃(d, f ) D0 Γ(1 − s) X s
−2s
× |D0 | LD0 (1 − s, f¯) Γ(s)ds.
2πi (−η) d1+s d Γ(1 + s) A2
d|δ 2
where
a(m) ˜+ 0
∞
S(X, Y ) = fY (m; a) + f˜Y− (m; a) exp(−m/X)
m=1
m
§1 Introduction 137
and
a μ̃(d,f )
I(X,Y ) = ω (a) ¯(δ 2 ) ×
N √
2
d
δ≤ Y d|δ
(δ,2N )=1
1
ā(n) 0 Γ(1 − s) X s
−
× f 2 (nd,2s;a) − fY /δ2 (nd,2s;a).
+
Γ(s)ds.
2πi (−η) n1−s Y /δ Γ(1 + s) A2 d
where
1 a(n1 n22 ) a φ(n2 )
C(f ) = .
8N n ,n n1 n22 n1 n2
1 2
This is similar to the constant that occurs in [MM, Theorem 1] and an analogous
argument shows that it is nonzero. Actually, we work with a more general series.
For (h, 2N j) = 1, and Re(s0 ), Re(s1 ) ≥ 0, define
1 a(n) a φ(n2 h)
C(s0 , s1 , j, h) = .
8N (1 + 2s0 )(1 + s0 ) n=n1 n2 n1+s1 n1 n2 h
n2 h=b2
(n2 h,j)=1
That the series converges in this domain follows from estimates in §3. Moreover,
we note that
C(f ) = C(0, 0, 1, 1).
The error estimate above is obtained by applying the integrated Polya-Vino-
gradov inequality and an estimate of Rankin. For the integral, we show that a
smoothened version of it is
4/5
1 1 Y
X Y 2 2 (log Y )ν
X
for any 0 < ν < ρ/10. Choosing X = Y (log Y )10ν in the above estimates, the main
theorem follows.
The above estimate for the mean value of I(X, t) is the technical heart of
this chapter. It requires an integrated and refined version of the Polya-Vinogradov
estimate (§2, §4), together with an iterated argument to estimate certain weighted
sums of Fourier coefficients and Dirichlet characters (§6). Preliminary estimates
for such weighted sums are obtained in §5. The treatment of the main terms is
discussed in §3 and the theorems are proved in §7.
138 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
There are several estimates which we shall repeatedly use. Firstly, for Fourier
coefficients, we have the estimate of Rankin
|a(n)|2 X 2 .
n≤X
Secondly, for character sums, there is the estimate of Fainleib and Saparnijazov
[FS] (see also [MM, Lemma 1]) which is a generalization of Theorem V.3.1 of Jutila
[J]:
2
h
(N 2 /φ(N ))dXY (log Xd)2 .
n≤X
nd
2
0<βh≤Y
n2 =bh≡a(mod 8N )
Remarks on the proof. To prove Theorem 1.2, we consider averages over both posi-
tive and negative discriminants, and in addition, we introduce a further smoothing
factor. Thus, we consider
1 Y
|D|
LD (1, f ) dt = 1− LD (1, f ).
Y 1 |D|≤t |D|≤t
Y
D≡a(mod 8N ) D≡a(mod 8N )
the root number does not give an obstruction to the choice of the sign of D. By
including both values of sgn D, therefore, we get a statement valid in all cases.
There is a second, and deeper, reason for including both positive and negative
values of D. In order to handle the error terms that come from S(X, Y ), we need
to estimate character sums of the form
D
|D|≤Y
m
D≡a(mod 8N )
for β = ±1. If in addition, χ is even, Hua has shown (see Lemma 2.1 below) that
1 Y
√
χ(D)dt q.
Y 1 0<βD≤t
Thus, for even characters, the extra averaging allows us to save a factor of log q.
Now, by summing over both positive and negative values of D, and integrating over
t, we are able to filter out odd characters (see Lemma 2.2) and use this improved
estimate.
To estimate the integral
Y
1
I(X, t)dt
Y 1
we find that the device which filtered out odd characters in the sum has exactly
the opposite effect in the integral. (This is because of the sgn character that is
introduced by the functional equation). Therefore, we need an analogue of Hua’s
estimate which is valid for all characters. In §4, we give such an estimate in mean
square. Indeed, a special case of Lemma 4.6 gives
2
1 Y D
X
dt X 2 (log+ )2
Y 0 n Y
n≤X 0<βD≤t
D≡a(mod 8N )
where
log+ a = log a if a > 1
1 otherwise
and β = ±1.
140 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
It turns out that the required estimate is intimately connected with estimates for
two other quantities. We shall now briefly describe these.
For the purpose of this exposition, we state them approximately as follows.
(The reader who wishes a more precise statement is referred to the relevant sec-
tions.) Let α > 0, β = ±1. We consider the following statements:
1 Y D
Aβ (α) : |D|2s0 LD (1 + s1 , f )dt = main term +
Y 1 0<βD≤t
h
D≡a(mod 8N )
√ 0
+ O(Y 1−σ1 +2σ0
h(1 + |s0 | + |s0 − s1 |)2 (log Y h)α log(h log Y ) )
Lemma 2.1. Let β = ±1, and let χ be an even, nontrivial Dirichlet character
modulo q. Then,
1 Y √
χ(D) dt q.
Y 1
0<βD≤t
Lemma 2.2 If n2 h
= b2 , and (ah, 2N ) = 1, then
Y
1 D 1
|D|2s0 dt (|s0 | + 1)(nh) 2 Y 2σ0 .
Y 1 |D|≤t
nh
D≡a(mod 8N )
Proof. We have
D
|D|2s0
|D|≤t
nh
D≡a(mod 8N )
1 D
= ψ̄(a) ψ(D) |D|2s0
φ(8N ) nh
ψ(mod 8N) |D|≤t
1 D −1
= ψ̄(a) ψ(D) |D| (1 + ψ(−1)
2s0
).
φ(8N ) nh nh
ψ 0<D≤t
' . (
Each character ψ nh is nontrivial as n2. h is. Thus, we may apply Lemma 2.1
and partial summation to the inner sum to complete the proof.
1 φ(8N n2 h) t1+2s0
|D|2s0 = + O(t2σ0 d(n2 h)(|s0 | + 1)).
0<βD≤t
φ(8N ) 8N n2 h 1 + 2s0
D≡a(mod 8N ),(D,n2 h)=1
We remark that using the results of §6, it is possible to refine the error terms.
To establish asymptotics for values of s1 inside the critical strip, we shall
need a more refined approach. Moreover, we shall need a substitute for Lemma 2.1
that holds for odd characters. These two problems are addressed in the next two
sections.
This is
a(n) D
exp(−n/X).
n1+s1 n
(n,j)=1
§3 The main terms 143
1
Suppose now that η > 2 + σ1 . Then, on applying the functional equation, the
integral becomes
1 μ̃(d, f )
ā(n) D0
ωχD0 (−N ) (D0 ) ×
2πi d|δ2 d1+s1 (−η) n1−s1 −w nd
j|δ
Γ(1 − s1 − w)
×(A|D0 |)−2(s1 +w) (X/d)w Γ(w)dw.
Γ(1 + s1 + w)
' (
Multiplying through by |D|2s0 D
h , summing over
0 < βD ≤ t, D ≡ a(mod 8N ),
and integrating over t, we see that
1 Y D
|D|2s0 LDj 2 (1 + s1 , f ) dt
Y 1 0<βD≤t
h
D≡a(mod 8N )
and
a
δ2 μ̃(d, f )
− βω (a) A−2s1 2
¯(δ )δ 4s0
N δ2 ≤Y
h d1+s1
d|δ 2
j|δ
(δ,2N )=1
1 ā(n) 1 Y /δ 2
× ftβ (ndh, 2s0 − 2s1 − 2w; aδ̄ 2 )dt
2πi (−η) n1−s1 −w Y /δ 2 1
Γ(1 − s1 − w)
× (X/dA2 )w Γ(w)dw.
Γ(1 + s1 + w)
144 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
We have now to determine the size of the sum and the integral. For the sum,
we expect that the main term will come from the sum over those values of n for
which n2 h = b2 . This “main term” therefore is
⎛ ⎞
a(n) a ⎜ ⎜1
Y ⎟
⎟
M̃β (s0 , s1 , j, h) = ⎜ |D| 2s0
dt⎟ exp(−n/X).
n h=b2
n1+s1 n1 ⎝ Y 1 0<βD≤t ⎠
2
(n2 ,j)=1 D≡a(mod 8N )
(D,n2 h)=1
The two series M̃β and Mβ are related by the following estimate.
Lemma 3.1 We have
1
3
X −σ1 − 8 Y 1+2σ0 ).
1
M̃β (s0 , s1 , j, h) = Mβ (s0 , s1 , j, h) + O( 1 + 3/4
p
p|j
uniformly in h.
§3 The main terms 145
μ̃(d, f ) 1
ā(n) a
× 1+s1 2πi 1−s1 −w
d (−η) n dh=b2 n n1
d|δ 2 2
⎛ ⎞
⎜ Y /δ2 ⎟
⎜ 1 ⎟
×⎜
⎜ Y /δ 2 |D0 |−2(s1 +w−s0 ) dt⎟
⎟
⎝ 1 0<βD0 ≤t ⎠
D0 δ2 ≡a(mod 8N )
(D0 ,2N n2 dh)=1
Γ(1 − s1 − w)
× (X/dA2 )w Γ(w)dw.
Γ(1 + s1 + w)
For η ≥ σ1 , the “main term” may be written in terms of the function Fd as
follows:
a 2
δ μ̃(d, f )
N = −βω (a) A−2s1 ¯(δ 2 )δ 4s0
N δ2 ≤Y
h 2
d1+s1
d|δ
j|δ
(δ,2N )=1
Y /δ2
1
× |D0 |2s0 −2s1
Y /δ 2 1
w
1 X ¯ Γ(1 − s1 − w)
× Fdh (−s1 − w, f , D0 ) Γ(w)dw dt.
2πi (−η) dA2 |D0 |2 Γ(1 + s1 + w)
If 0 < σ1 < 18 , we move the line of integration to the right, and get a residue at
w = 0 equal to
a 2
def −2s Γ(1 − s1 ) 2 4s δ μ̃(d,f )
Nβ (s0 ,s1 ,j,h) = βω (a) A 1
¯(δ )δ 0
N Γ(1 + s1 ) δ2 ≤Y h 2
d1+s1
d|δ
j|δ
(δ,2N )=1
⎛ ⎞
Y /δ 2
⎜ 1 ⎟
×⎝ |D0 |2(s0 −s1 ) Fdh (−s1 , f¯,D0 )dt⎠ .
Y /δ 2 1 0<βD0 ≤t
D0 δ2 ≡a(mod8N )
We also set
1 1
Nβ (s0 , s1 , j, h) = 0 if < σ1 < .
8 2
146 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
Remark. The point here is that the residue is insignificant if σ1 is bounded away
from zero. The choice of 1/8 as a breaking point was only for convenience, and
any other point will work just as well.
Lemma 3.2 For 0 ≤ σ1 < 1
8, we have
X 16 −σ1 ).
5 3
N = Nβ (s0 , s1 , j, h) + O(Y 8 +2σ0
For 1
8 ≤ σ1 < 12 , we have
is asymptotic to
Mβ (s0 , s1 , j, h) + Nβ (s0 , s1 , j, h).
(p)p
1
−1
1 + (p)p
−1
Fd0 (s,f ) = L(2s,Sym )ζ(4s − 2) 2
1 − 2s 1− .
p p4s−2 ps
p|d0
and this gives the analytic continuation of Fd0 (s) for Re(s) > 3/4.
§3 The main terms 147
The first factor is entire for σ > − 12 , and uniformly bounded for σ ≥ −3/16. Now,
let us write d = d0 d21 with d0 squarefree. As for the second factor, we see then that
it is equal to
∞ −1
a(n2 d0 ) a(p2 )
= Fd0 (1 + s, f ) .
(n2 d0 )1+s p2(1+s)
(n,2ND) = 1 =0
p|2ND
It is clear that G(s, f ) is analytic for Re(s) > − 21 . Combining this with what was
stated earlier, it follows that Fd (s, f, D) is analytic for Re(s) > −1/4.
Lemma 3.3 Write d = d0 d21 with d0 squarefree, and suppose that (d, 2N D) = 1
and (D, 2N ) = 1. Then, for σ ≥ −3/16, and an absolute constant c1 > 0,
1
Fd (s, f, D) c1 0 (d0 )−σ− 2 |L(2s + 2, Sym2 )ζ(2 + 4s)−1 | (1 + 1+2σ )3 .
ν(d ) 1
p
p|D
and this is 3
1
1+ .
p1+2σ
It follows that the desired result holds with a possibly larger value of c2 .
148 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
Remark. It should be clear that it is only essential that σ be larger than and
bounded away from −1/4.
Now we give the proofs of results stated earlier.
Proof of Lemma 3.1. We have
⎛ ⎞
⎜ Y ⎟
1 a(n) a ⎜1 ⎟
M= ⎜ |D|2s0 dt⎟ X w Γ(w)dw.
2πi (2) n2 h=b2
n1+s1 +w n1 ⎝ Y 1 0<βD≤t ⎠
(n2 ,j)=1 D≡a(mod 8N )
(D,n2 h)=1
1
−3
1
X −σ1 − 8 Y 1+2σ0 |Γ(−σ1 − )|.
1
1−
p3/4 8
p|j
N − Nβ (s0 , s1 , j, h)
is equal to
a 2
−2s1 δ μ̃(d, f )
−βω (a) A 2 4s0
¯(δ )δ
N δ2 ≤Y
h 2
d1+s1
d|δ
j|δ
Y /δ 2
1
|D0 |2s0 −2s1
Y /δ 2 1
w
1 X Γ(1 − s1 − w)
Fdh (−s1 − w, f¯, D0 ) Γ(w)dw dt
2πi (η) dA2 |D0 |2 Γ(1 + s1 + w)
1
for some 4 > η > 0. Using the factorization
Fd (s, f, D) = Fd0 (1 + s, f )G(s, f ) Bp (1 + s)−1
p|D
§3 The main terms 149
Y /δ2
1
|D0 |2s0 −2s1
Y /δ 2 1
w
1 X ¯ Γ(1 − s1 − w)
Fdh (−s1 − w, f , D0 ) Γ(w)dw dt
2πi (η) dA2 |D0 |2 Γ(1 + s1 + w)
is equal to
1 ¯ ¯
Fd (1 − s1 − w, f)G(−s 1 − w, f )
2πi (η) 0
⎛ ⎞
Y /δ2
⎝ 1
|D0 |2s0 −2s1 −2w Bp (1 − s1 − w)−1 dt⎠
Y /δ 2 1
0<βD0 ≤t p|D0
w
X Γ(1 − s1 − w)
Γ(w)dw.
dA2 Γ(1 + s1 + w)
Thus,
|D0 |2s0 −2s1 −2w Bp (1 − s1 − w)−1
0<βD0 ≤t p|D0
3
1 1
−1
2σ0 −2σ1 −2η
|D0 | 1 + 1−2σ1 −2η 1 − 2−4σ1 −4η
p p
0<βD0 ≤t p|D0
and choosing η = 3
16 − σ1 (say), this is
|D0 |2σ0 − 8 σ−5/8 (D0 )3 t2σ0 + 8 .
3 5
0<βD0 ≤t
Also,
Inserting all this into the integral, we see that the entire expression that we are
trying to estimate is
d 12 d(d)
δ 4σ0
δ2 ≤Y
d1+σ1
d|δ 2
j|δ
−1 *
2σ0 + 58 3 −σ
p 1 Y X 16 1
× 1+ 1 − p3/16
p13/8 p5/4 δ2 d
p|d0
which is
d 12 d(d)
Y 2σ0 + 8 X 16 −σ1 δ −5/4
5 3
d3/16 σ−5/8 (d)
δ2 ≤Y
d19/16
d|δ 2
j|δ
Y 2σ0 + 8 X 16 −σ1 .
5 3
For 1
2 > σ1 > 18 , Lemma 3.3 implies that for σ1 ≤ η < 1,
which is c
X
t1+2(σ0 −σ1 −c) |Γ(c)|.
dA2
Inserting this into the big expression, we see that
1
Nβ (s0 , s1 + w, j, h)X w Γ(w)dw
2πi (−σ1 )
|μ̃(d, f )| X c Y 1+2(σ0 −σ1 −c)
δ 4σ0
|Γ(c)|
δ2 ≤Y 2
d1+σ1 d δ2
d|δ
j|δ
d(d)
|Γ(c)|Y 1+2(σ0 −σ1 −c) X c δ −2+4(σ1 +c) 1
d 2 +σ1 +c
δ 2 ≤Y d|δ 2
|a(p)| |a(p2 )|
d(b2 )
1 + + + · · · exp(−b2 /X).
p p2 b
p|2N b
The first factor depends only on N and can be absorbed into the implied constant.
For the sum over b, we see by standard estimates that it is
(log X)3 .
152 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
Now consider the contribution of the main term. We see that it is equal to
Y a(n) a φ(n2 )
exp(−n/X)
16N 2
n n1 n2
n2 =b
which is
Y 1
F̃1 (w, f, 1)X w Γ(w)dw
16N 2πi (2)
a(m) a φ(m2 d)
F̃d (s, f, D) = .
m2 d=b2
m1+s m1 m2 d
(m,D)=1
Using the properties of F̃d given in Exercise 4, and moving the line of integration
to the left, we see that this is
⎧ ⎫
⎨ ⎬
Y a(n) a φ(n2 ) 1
+ F̃1 (w, f, 1)X w Γ(w)dw .
16N ⎩ 2
n n1 n2 2πi (−1/8) ⎭
n2 =b
Now applying the estimate for F̃1 of Exercise 4, we see that this is
Y
F̃1 (0, f, 1) + O(Y X −1/8 )
16N
as required.
' . (
where the sum over n only includes values such that χ hn is a nontrivial char-
acter, and the implied constant depends only on the conductor of χ.
§4 Estimates for real character sums 153
Proof. This follows by a small modification in the proof of Jutila [J, Theorem 3]
and partial summation. Indeed, we have
. χ(m) m log X
L(1, χ ) = + O( √ )
hn m hn X
m≤X
' . (
for fixed χ and all n ≤ X such that χ hn = 1. It follows that
2
.
χ(m) m
∗
n|L(1, χ )|2 = ∗
n 3/2 2
+ O(X (log X) ).
hn m hn
n≤X n≤X m≤X
and m m
χ(m1 m2 ) ∗ 1 2
B = n .
m1 ,m2 ≤X
m1 m2 hn
n≤X
m1 m2 =b2
Let us set
d
S(d) = n .
n
n≤X
As r is fixed, we deduce that
∗ d d
n = S(d) + O(X 3/2 ).
hn h
n≤X
Therefore,
d(m)
B X 3/2 (log X)2 + |S(m)|.
m≤X 2
m
m =b2
If we set
d
T (Z, d) = , then S(d) = XT (X, d) − T (n, d)
n
n≤Z n≤X−1
and [J, Theorem 1] (see also [MM, Lemma 1]) asserts that
|T (Z, d)|2 ZW (log W )2 .
d≤W
d =b2
154 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
(log X) n 2 1/2
(log X) 3/2
= n1/2 (log X)7/2 .
It follows that
d(m)
|S(m)| X 3/2 (log X)7/2 .
m≤X 2
m
m =b2
It follows that B = O(X 3/2 (log X)7/2 ) and this proves the lemma.
Recall that we have set
log a if a > 1 .
log+ a =
1 otherwise
Lemma 4.2. Let χ be a fixed Dirichlet character and (h, 2N ) = 1. Then
1 Y 2
D Xh 2
∗
Y χ(D) dt hX 2 (log+ )
0 hn Y
n≤X 0<D≤t
(See [BC] or [MV].) Here g(ψ) is the Gauss sum attached to ψ. Integrating with
respect to t, we deduce that
⎛ ⎞
Y
1 ⎝ D ⎠
χ(D) dt
Y 0 hn
0<D≤t
ψ̄(m) qlogq qg(ψ) ψ̄(m) −2πimY /q
= g(ψ) + O(1 + )+ 1−e .
2πim H 4π 2 Y m2
0<|m|≤H 0<|m|≤H
§4 Estimates for real character sums 155
Now,
ψ̄(m) 1 q
= (1 − ψ̄(−1))(L(1, ψ̄) + O( )).
2πim 2πi H
0<|m|≤H
is equal to
1 qg(ψ)
g(ψ) (1 − ψ̄(−1))L(1, ψ̄) + O(1) + (1 + ψ̄(−1))L(2, ψ̄)
2πi 4π 2 Y
qg(ψ) ψ̄(m) −2πimY /q
∞
− e + ψ̄(−1)e 2πimY /q
.
4π 2 Y m=1 m2
g(ψ) 1 q
L(1, ψ̄) + O(1) + O(q 2 log+ ).
πi Y
If ψ(−1) = +1, then by Lemma 2.1
⎛ ⎞
1 Y⎝ D ⎠ √
χ(D) dt q.
Y 1 hn
0<D≤t
hX 2 .
and this is
Xh 2
hX 2 + hX 2 (log
) .
Y
If ψ is not primitive, we let ψ∗ denote the primitive character mod q ∗ say, that
induces ψ. Let us write q = q ∗ r. Then, we have
⎛ ⎞ ⎛ ⎞
Y Y
1 ⎝ D ⎠ 1 ⎜ ⎟
χ(D) dt = ⎝ ψ∗ (D)⎠ dt
Y 0 hn Y 0
0<D≤t 0<D≤t
(D,r)=1
Thus, ⎛ ⎞ 2
1 Y
D
∗
Y
⎝ χ(D) ⎠ dt
0 hn
n≤X 0<D≤t
⎛ ⎞ 2
1 Y /d
∗
d(r)2 ⎝ ψ ∗
(D)⎠ dt .
Y /d
n≤X d|r 0 0<D≤t
Since
r
|L(1, ψ∗ )| L(1, ψ)
φ(r)
it follows that the above is
⎛ ⎞ ⎛ ⎞
3
d(r) r ∗
q 2⎠
∗
q|L(1, ψ̄)|2 + O⎝ d(r)3 ⎠ + O ⎝ d(r)3 q ∗ (log+ ) .
φ(r)2 Y
n≤X n≤X n≤X
Since d(r) r and φ(r) ≥ r/(log log r), the quantity d(r)3 r/φ(r)2 is bound-
ed. Thus, by Lemma 4.1, the first term above is hX 2 . The second O term
above is clearly X(log X)7 . Finally, as d(r)3 q ∗ ≤ q, the final O term is
hX 2 (log+ Xh/Y )2 ).
Lemma 4.3. With notation as above and σ ≥ 0, we have
⎛ ⎞ 2
1 Y
D ⎠ s
∗ ⎝ + Xh 2
h(1 + |s| )Y X (log Y )
2 2σ 2
Y χ(D) t dt
1 hn
n≤X 0<D≤t
Proof. This follows from Lemma 4.2 by integration by parts and an application of
the Cauchy-Schwarz inequality.
Lemma 4.4 With notation as above and σ ≥ 0, we have
⎛ ⎞ 2
1 Y
D
Xh 2
∗ ⎝ s⎠
Y χ(D) |D| dt (1 + |s(s − 1)|2 )hY 2σ X 2 (log+ )
1 hn Y
n≤X 0<D≤t
2
Y
1
∗1 D s
χ(D) |D| dt
φ(8N ) χ Y 0 hn
n≤X 0<D≤t
Lemma 4.6 Let a, h be integers with (ah, 2N ) = 1 and β = ±1. With notation as
above and σ ≥ 0, we have
2
1 Y
D s Xh 2
|D| dt (1 + |s(s − 1)|2 )hY 2σ X 2 (log+ ) .
Y 0 hn Y
n≤X
2
0<βD≤t
hn2 =b D≡a(mod 8N )
Proof. This follows immediately by noting that the previous result holds
' even
( if
we sum over 0 < −D ≤ t. (This is clear since we can factor out χ(−1) −1hn from
the sum over D.)
a(n) (j)
χ (nh)g̃Yβ (nh, 2s0 ; a)
n1+s1 0
n≤U,n2 h=b2
If σ1 > 12 , we have
a(n) (j)
χ (nh)g̃Yβ (nh, 2s0 ; a)
n1+s1 0
n>U,n2 h=b2
Proof. This is essentially Lemma 4 and Lemma 8 of [MM]. For example, to prove
the first estimate, one checks that
is
(|s0 | + 1)h 2 U 2 −σ1 t 2 +2σ0 (log U h).
1 1 1
a(n) β
g (nh, 2s0 ; a) (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 (log Y )(log U h).
n1+s1 Y
n≤U,n2 h=b2
If σ1 > 12 , then
a(n) β
g (nh, 2s0 ; a) (|s0 | + 1)h1/2 U 1/2−σ1 Y 1/2+2σ0 (log Y )(log U h).
n1+s1 Y
n>U,n2 h=b2
Proof. This is due to Rankin [Ra] (see [MM, Lemma 17] or Theorem IV.9.1).
X 1−σ1
(log X)−ρ .
1
(|s0 | + 1)h 2 Y 2σ0
1 − σ1
Proof. Use Lemma 2.2 and Lemma 5.3.
160 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
= Mβ (s0 , s1 , j, h) + Nβ (s0 , s1 , j, h)
√
+ O(Y 1−σ1 +2σ0 h(1 + |s0 | + |s0 − s1 |)2
,
× (log Y h)α log(h log Y ) + σ−3/4 (j)3
as usual.) Let us estimate the contribution of those n for which n2 h is not a square.
Applying the Cauchy-Schwarz inequality, we see that the terms with n ≤ X are
⎛ ⎞ 12 ⎛ ⎞ 12
|a(n)|2 2
⎝ ⎜ 1 β ⎟
5 exp(−n/X)⎠ × ⎝ 3 g̃Y (nh,2s0 ;a) exp(−n/X)⎠ .
n 4 +σ1 n 4 +σ1
n≤X n≤X
n2 h =b2
X 2 ( 4 −σ1 ) .
1 3
§6 The statements A± (α) and C ± (α) 161
For the second factor, we apply Lemma 4.6, and find that it is
Xh
(1 + |s0 (2s0 − 1)|)h 2 Y 2σ0 X 2 ( 4 −σ1 ) log+
1 1 5
.
Y
Putting these estimates together, we get an estimate of
def 1 Xh
E1 = (1 + |s0 (2s0 − 1)|)h 2 Y 2σ0 X 1−σ1 log+
Y
Similarly, the sum over terms n > X is majorized by
⎛ ⎞ 12
12
|a(n)| 2
⎜ 1 β ⎟
exp (−n/X) ⎝ |g̃ (nh, 2s0 ; a)|2 exp (−n/X)⎠
n2σ1 n>X
n2 Y
n>X
n2 h =b2
On the other hand, moving the line of integration to the line Re(w) = −η,
we get a residue at w = 0 equal to
1 Y D
|D|2s0 LDj 2 (1 + s1 , f )dt.
Y 1 0<βD≤t
h
D ≡ a(mod 8N )
plus an integral along the line Re(w) = −η. We proceed as in §3 and rewrite it as
a μ̃(d, f ) 1 ā(n)
βω A−2s1 (a) δ 4s0 ¯(δ 2 )
N δ2 ≤Y 2
1+s
d 1 2πi (−η) n1−s1 −w
d|δ
(δ,2N h)=1
j|δ
Γ(1 − s1 − w)
× gY /δ2 (ndh, 2(s0 − s1 − w); aδ̄ 2 ) (X/A2 d)w Γ(w)dw.
Γ(1 + s1 + w)
162 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
To estimate this integral, we split the sum over n into those for which n2 dh
is a square, and the remaining values. If 18 ≤ σ1 < 12 , then by Lemma 3.2, we see
that the first set contribute an amount which is
Y 1+2σ0 X −σ1 .
If 0 ≤ σ1 < 18 , then it is
X 16 −σ1 ).
5 3
Nβ (s0 , s1 , j, h) + O(Y 8 +2σ0
Using Lemma 5.2, we shall estimate the contribution of the second set. Split the
sum over n at an intermediate point U to be specified later. For the terms with
n < U, we move the line of integration to a line Re(w) = −η1 , with
1
σ1 ≤ η1 < σ1 + ,
2
and for those with n > U, we move the line of integration to a line Re(w) = −η2 ,
with
2 > η2 > 1.
Let us set Z = Y /δ 2 . The terms with n < U then contribute an amount
d 12 d(d) X −η1
δ 4σ0 |Γ(−η1 )|(1 + |s0 − s1 + η1 |)
d1+σ1 A2 d
δ 2 ≤Y 2
d|δ
We choose
1
η1 = σ1 + −ν
2
for a ν satisfying
1 1 3
< ν < min( , σ1 + )
4 2 8
say. (This choice ensures that η1 > 1/8 and in particular is bounded away from
zero.) Then the sum over δ above is
δ −1+4(ν− 2 ) d 2 −ν d(d)
1 1
δ 2 ≤Y d|δ 2
§6 The statements A± (α) and C ± (α) 163
and this is
δ 4ν−3 (δ 2 ) 2 −ν d(δ 2 )2
1
δ 2 ≤Y
δ 2ν−2 d(δ 2 )2 1.
δ 2 ≤Y
The contribution of the terms with n > U is entirely similar to the above with η2
replacing η1 . In moving to the line −η2 , we also encounter a residue at w = −1.
Using Lemma 5.2, this residue is easily shown to be
√
(1 + |s0 − s1 |) hX − 2 −σ1 Y 3/2+2σ0 (log Y )(log Y h).
1
Lemma 6.2 Assume Aβ (α) and suppose σ0 ≥ 0. If 0 ≤ σ1 < 12 , then we have for
U ≥Y
√ 0
IU4 (A) Y 1−σ1 +2σ0 h(1+|s0 |+|s0 −s1 |)2 (logY h)α log(hlogY ) + σ−3/4 (j)3 .
Then
IU4 (A∗ (u))
⎧ ⎫
⎪
⎨1 Y ⎪
⎬
1 D dw
= |D|2so LDj 2 (1 + s1 + w, f )dt IU4 (uw )
2πi (c) ⎪
⎩Y 1 0<βD≤t
h ⎪
⎭ w
D≡a(mod 8N )
gives the contribution of squares in IU4 (A∗ (u)) (that is, those n for which n2 h = b2 ).
This proves the first statement of the lemma. For the second part, consider the
sum
a(n) β
S = IU3 ( g̃ (nh, 2s0 ; a)).
n>u
n1+s1 Y
(n2 ,j)=1
§6 The statements A± (α) and C ± (α) 165
We have ∞ ∞
1
t− 2 dC(t)) = IU3 ( C(t)t−3/2 dt)
1
S = IU3 (
u 2 u
provided σ1 +c > 1 and 0 < c < 12 . Now move the line of integration to Re(w) = c1
where 12 ≤ σ1 + c1 < 1. (This is possible because 12 < σ1 < 1.) Now apply Aβ (α)
on the line c1 . The “main term” in Aβ (α) gives
1 1 1 1 2dw
{Mβ (s0 , s1 + w − ) + Nβ (s0 , s1 + w − )}IU3 (uw− 2 ) .
2πi (c1 ) 2 2 1 − 2w
The Mβ term gives the square terms in S. The Nβ term is zero if σ1 > 5/8. If
σ1 ≤ 5/8 then it is
c +σ1 − 12
(|s1 | + 1)1−2(σ1 +c1 − 2 ) U c1 − 2 Y
1 1 1
2 +2σ0 h01 d(h0 ).
Choosing c1 = 1
2 − σ1 gives the result.
Next we consider the following statement which provides estimates for func-
tions similar to A(u) and B(u) in which we restrict the sum to fundamental dis-
criminants.
a(n) β
4
C β (α) : IX ( g (nh, 2s0 ; a))
n≤u
n1+s1 Y
n2 h =b2
(n,j)=1
Y 2 +2σ0 X 2 −σ1
1 1
1 √ 0
h(1 + |s0 | + |s0 − s1 |)2 (log Y h)α log(h log Y ) + σ−3/4 (j)3
2σ1 − 1
166 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
g 2 ≤Y n≤u
n1+s1 Y /g
(g,2N h)=1 n2 h =b2
(n2 ,jg)=1
and this is )
√ Y
h(1 + |s0 (2s0 − 1)|)Y 2σ0 1−σ1
X (log h)
X
√
h(1 + |s0 (2s0 − 1)|)Y 2 +2σ0 X 2 −σ1 (log h).
1 1
§6 The statements A± (α) and C ± (α) 167
1−σ1 +2σ0
Y
g 4σ0
√ g2
Y >g>V
√ 0
(1 + |s0 | + |s0 − s1 |)2 h(log Y h)α (log(h log Y )) + σ−3/4 (j)3
) 2σ1 −1
−1 Y
Y 1−σ1 +2σ0
(1 − 2σ1 )
X
√ 0
(1 + |s0 | + |s0 − s1 |)2 h(log Y h)α log(h log Y ) + σ−3/4 (j)3
and this is
For the second half, we can apply Lemma 6.2 directly to the entire range of
the sum over g.
Proof. We proceed exactly as in the proof of Lemma 6.1, only in place of Lemma 5.2,
we use C β (α) to estimate the contribution of the second set. For each fixed value
of δ, we will split the sum over n at an intermediate point uδ to be specified later.
For the terms with n < uδ , we move the line of integration to a line Re(w) = −η1 ,
and bounded away from zero, with
1
σ1 < η1 < σ1 + .
2
For those with n > uδ , we move the line of integration to a line Re(w) = −η2 ,
with
σ1 + 12 < η2 < 1 if σ1 < 1/8
1 < η2 < 1 + σ1 if 1/8 < σ1 < 12 .
When η2 > 1, we also pick up a residue R from the pole at w = −1. (We remark
that the point 1/8 was only used as a convenient breaking point. The essential
difference between the two cases is whether σ1 is bounded away from zero or not.)
Let us set Z = Y /δ 2 and
U = Uδ = Z(log(Zh + 100))γ
168 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
for some 2 > γ > 0 to be specified later. By the first part of C β (α) and the mean
value theorem, there exists V = Vδ in the interval (Uδ , 2Uδ ) such that
ā(n)
IV3 ( g β (ndh, 2(s0 − s1 − w); aδ̄ 2 ))
1−s1 −w Z
n<u
n
n2 h =b2
(n2 ,j)=1
1
−(η −σ )
Z 2 +2(σ0 −σ1 +η1 ) Uδ2 1 1
1
1 √
× dh(1 + |s0 − s1 − w| + |s0 |)2
(2(η1 − σ1 ) − 1)
(
× (log Y dh)α log(dh log Y ) + σ−3/4 (j)3
Moreover, as V ≥ Z
ā(n)
IV3 ( g β (ndh, 2(s0 − s1 − w); aδ̄ 2 ))
1−s1 −w Z
n>u
n
n2 h =b2
(n2 ,j)=1
1
−(η −σ )
Z 2 +2(σ0 −σ1 +η2 ) Uδ2 2 1
1
√
× dh(1 + |s0 − s1 − w| + |s0 |)2 (log Y dh)α log(dh log Y ) + σ−3/4 (j)3 .
For each δ we apply the operator
IV3δ
with respect to the variable uδ to both sides of the basic equation. Then, the left
hand side and the residue do not change (as they are independent of uδ ). And the
n < uδ summed over all δ contribute an amount
d 12 d(d)
log d|Γ(−η1 )|(X/d)−η1 Z 2 +2(σ0 −σ1 +η1 ) U 2 −(−σ1 +η1 )
1 1
4σ0
δ 1+σ1
d
δ 2 ≤Y d|δ 2
1 √ 0
dh(|s0 | + |s0 − s1 | + 1)2
(log Zdh)α
log(h log Y ) + σ−3/4 (j)3
(η1 − σ1 − 12 )
2 +2(σ0 −σ1 +η1 )
X −η1 (Y (log Y h)γ ) 2 +σ1 −η1
1 1
Y
1 d 12 +η1 d(d)
−2+2σ1 −2η1
× δ
(η1 − σ1 − 12 ) δ2 ≤Y d1+σ1
d|δ 2
√
(|s0 | + |s0 − s1 | + 1)2 dh(log Y h)α log(h log Y ) + σ−3/4 (j)3
The above may be simplified to
1
Y 1+2σ0 −σ1 +η1 X −η1 (log Y h)γ( 2 +σ1 −η1 )
1
(η1 − σ1 − 12 )
√
(|s0 | + |s0 − s1 + η1 | + 1)2 h(log Y h)α log(h log Y ) + σ−3/4 (j)3
δ −2+2σ1 −2η1 d(d)dη1 −σ1 log d.
δ 2 ≤Y d|δ 2
§6 The statements A± (α) and C ± (α) 169
This is seen to be
η1
def Y
E2 = Y 1+2σ0 −σ1 (logY h)γ( 2 +σ1 −η1 )
1
X
1 √
(|s0 | + |s0 − s1 + η1 | + 1)2
h(logY h)α
log(hlogY ) + σ−3/4 (j)3
(η1 − σ1 − 12 )
Now, using the second part of C β (α), we see that the sum over δ of the terms with
n > uδ contribute an amount
η2
def −σ Y
(log Y h)γ( 2 +σ1 −η2 )
1
E3 = Y 1+2σ 0 1
X
√ 0
h(|s0 | + |s0 − s1 + η2 | + 1)2 (log Y h)α log(h log Y ) + σ−3/4 (j)3 .
This is ensured if
α
γ = 1
2 + 2η1 − 2σ1
and
X = Y (log Y h)γ .
With this choice, (and with E1 as in Lemma 6.1),
α(1−σ1 )
1 √ 0
(|s0 | + |s0 − s1 + η1 | + 1)2
h log(h log Y ) + σ−3/4 (j)3
.
(η1 − σ1 − 12 )
1
{(|s0 | + |s0 − s1 | + 1)2 h 2 (log(h log Y )) + σ−3/4 (j)3 }.
170 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
The error term E3 gives a similar quantity with the exponent of log Y h equal to
1 α
α − (2η2 − σ1 − ) .
2 2η1 − 2σ1 + 12
We choose
σ1 + 5/8 if σ1 < 1/8
η2 =
17/16 if 1/8 < σ1 < 1/2.
Then, the above is
4
≤ α.
5
Hence,
E3 Y 1+2σ0 −σ1 {(|s0 | + |s0 − s1 | + 1)2 h 2 (log Y h) 5 α log(h log Y ) + σ−3/4 (j)3 }.
1 4
The second part of C β (α) easily implies a similar estimate holds for R. We conclude
that
E1 + E2 + E3 + R
Y 1+2σ0 −σ1 {(|s0 | + |s0 − s1 | + 1)2 h 2
1
4
(log Y h) 5 α log(h log Y ) + σ−3/4 (j)3 }.
and so Aβ ( 45 α) holds.
Proof. From Lemma 6.1, we know that Aβ (2) is true. From Proposition 6.3, we
deduce that C β (2) holds. Suppose we have established C β (α). By Proposition 6.4,
we deduce that Aβ ( 4α β 4α
5 ) is true. By Proposition 6.3, we deduce that C ( 5 ) holds.
Iterating this proves the result.
Remark. We see also that given λ > 0, the statement Aβ (λ) holds.
Proof of Theorem 1.2. Let a ≡ 1(mod 4), (a, 2N ) = 1. Consider the integral
Y
1 1
LD (1 + w, f )X w Γ(w)dwdt.
Y 1 |D|≤t
2πi (2)
D≡a(mod 8N )
§7 Proof of main result 171
Let us estimate the contribution of those n for which n2 is not a square. We apply
Lemma 5.4 and find that the sum is
X(log X)−ρ .
Thus, if we choose X ≤ Y (log Y )ν with 0 < ν < ρ, we see that this is
Y (log Y )−κ
for some κ > 0. Now, the contribution of those values of n for which n2 is a square
is seen to be equal to
⎛ ⎞
a(n) a ⎜ ⎜1
Y ⎟
⎟
⎜ 1 dt⎟ exp(−n/X)
2
n n1 ⎝ Y 1 |D|≤t
⎠
n2 =b
D≡a(mod 8N )
(D,n)=1
and an integral
a μ̃(d, f ) 1 ā(n)
ω (a) 2
¯(δ ) ×
N δ2 ≤Y 2
d 2πi (−η) n1−w
d|δ
(δ,2N )=1
⎛ ⎞
⎜ δ2 ⎟
2
Y /δ
D0 (∗)
×⎜
⎝Y |D0 |−2w (sgnD0 ) dt⎟
⎠×
1 |D0 |≤t
nd
D0 δ2 ≡a(mod 8N )
Γ(1 − w)
× (X/A2 d)w Γ(w)dw
Γ(1 + w)
172 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
which has been simplified in the (by now) familiar way. Let us write it as
Σ1 + Σ2
1
μ(m)χ0 2 (m2 )ψ(m2 )m−4w
(n d)
= ψ̄(aδ̄ 2 )
φ(8N ) √
ψ m≤ t/δ
χ0 2 (h)ψ(h)|h|−2w (sgn h)
(n d)
×
|h|≤t/δ 2 m2
Σ2 Y X −1/4 .
1
We now estimate Σ1 . By C β (λ) we know that for 0 < η1 < 2 and Re(w) =
−η1 , there exists U ∈ (X, 2X) such that
⎛ ⎞
2 Y /δ ā(n) β
2
⎜δ ⎟
IU3 ⎝ f (nd, −2w; aδ̄ 2 )dt⎠
1−w t
Y 1 n≤u
n
n2 d =b2
' (
(log Y d/δ 2 )λ (log d log Y /δ 2 ) .
Also, for 1
2 < η2 < 1, and Re(w) = −η2 , 2X ≥ U ≥ X ≥ Y ,
⎛ ⎞
Y /δ 2
⎜ δ2 ā(n) β ⎟
IU3 ⎝ f (nd, −2w; aδ̄ 2 )dt⎠
Y 1 n>u
n1−w t
n2 d =b2
We shall choose η1 and η2 bounded away from 0 and 1 respectively. Using the
estimate
1
|μ̃(d, f )| d(d)d 2
we deduce that
2 1 d 12 d(d)
X 2 −2ηi (log Y )λ (log log Y ) ×
1 1 1
Σ1 Y 2 +2ηi dηi d 2 log d
δ 1+4ηi d
i=1 δ 2 ≤Y d|δ 2
and this is
2η1 2η2 *
1 1
d(δ 2 )2 Y Y
X Y (log Y ) (log log Y )
2 2 λ
log δ + .
δ Xδ Xδ
δ
X = Y (log Y )ν
then
Σ1 Y (log Y )ν/2+λ−2η1 ν (log log Y ).
Now, if we choose λ = ν/10 for any 0 < ν < ρ, and η1 = 2/5(say) then
Together with our earlier estimate of Σ2 , this proves the main theorem.
Exercises
1. Deduce Theorem 1.1 from Theorem 1.2 by showing that if there are only a
finite number of fundamental discriminants D with LD (1, f )
= 0 then
√
LD (1, f ) Y (log Y ).
|D|≤Y
D≡a(mod 8N )
174 Chapter 6 Non-Vanishing of Quadratic Twists of Modular L-Functions
∞
∞
a(p2j ) a(p2j+1 )
Bp (s) = and Cp (s) = .
j=0
p2js j=0
p(2j+1)s
and
−1 −1
αp2 βp2
Cp (s) = p−s (1 + (p)p) 1 − 2s 1 − 2s
p p
1
where a(p) = αp + βp and |αp | = |βp | = p 2 .
∞
Fd (s, f ) = a(dn2 )(dn2 )−s .
n=1
Then,
∞
a(n2 )
Fd0 (s, f ) = Bp (s)−1 Cp (s).
n=1
n2s
p|d0
∞
a(n2 )
2s
= Bp (s) = L(2s, Sym2 )ζ(4s − 2)−1 ,
n=1
n p
deduce that
Show that F̃d (s, f, D) is defined for Re(s) ≥ 0 and has an analytic continu-
ation for Re(s) > −1/4. Moreover, for Re(s) ≥ −3/16 (say), it satisfies the
estimate of Lemma 3.3: in the notation of that lemma,
1 3
(d0 )−σ− 2 |L(2s + 2, Sym2 )ζ(2 + 4s)−1 |
ν(d0 ) 1
F̃d (s, f, D) c1 (1 + ) .
p1+2σ
p|D
∞
a(n) −2πn/√N
L(1, f ) = 2 e .
n=1
n
Deduce that
LD (1, f ) Y
|D|≤Y
where the sum ranges over D which are prime to N . State and prove a similar
result for forms f with non-trivial character.
6. Let ∞
A(X, χ) = a(n)χ(n)n−1 e−2πn/X .
n=1
Here, the sum over χ ranges over primitive characters. Deduce the estimate
of Iwaniec [I]
|LD (1, f )|4 Y 2+ .
|D|≤Y
References
[BC] P. T. Bateman and S. Chowla, Averages of character sums, Proc. Amer.
Math. Soc., 1 (1950), 781–787.
[FH] S. Friedberg and J. Hoffstein, Non-vanishing theorems for automorphic L-
functions on GL(2), Annals of Math., 142 (1995), 385–423.
[FS] A. S. Fainleib and O. Saparnijazov, Dispersion of real character sums and the
moments of L(1, χ), (Russian) Izv. Akad. Nauk. USSR, Ser. Fiz.-Mat. Nauk,
19 (1975), 24–29.
[I] H. Iwaniec, On the order of vanishing of modular L-functions at the critical
point, Sém. de Théorie des Nombres Bordeaux, 2 (1990), 365–376.
[J] M. Jutila, On character sums and class numbers, J. Number Theory, 5 (1973),
203–214.
[MV] H.L. Montgomery and R.C. Vaughan, Mean values of character sums, Cana-
dian J. Math., 31 (1979), 476–487.
[M] V. Kumar Murty, A non-vanishing theorem for quadratic twists of modular
L-functions, preprint, 1991.
[MM] M. Ram Murty and V. Kumar Murty, Mean values of derivatives of modular
L-series, Annals of Math., 133 (1991), 447–475.
[MS] V. Kumar Murty and T. Stefanicki, Non-vanishing of quadratic twists of L-
functions attached to automorphic representations of GL(2) over Q, preprint,
1994.
[Ra] R. Rankin, Sums of powers of cusp form coefficients II, Math. Ann., 272
(1985), 593–600.
[Ro] D. Rohrlich, Non-vanishing of L-functions for GL2 , Invent. Math., 97 (1989),
381–403.
[Sh] G. Shimura, On the periods of modular forms, Math. Ann., 229 (1977), 211–
221.
[W1] J. Waldspurger, Sur les valeurs de certaines fonctions L automorphe en leur
centre de symétrie, Comp. Math., 54 (1985), 173–242.
[W2] J. Waldspurger, Correspondances de Shimura et quaternions, Forum Math.,
3 (1991), 219–307.
Chapter 7
Selberg’s Conjectures
∞
an
F (s) = ,
n=1
ns
d
Φ(s) = Qs Γ(αi s + ri )F (s)
i=1
satisfies
Φ(s) = wΦ(1 − s̄)
for some complex number w with |w| = 1;
(iv) (Euler product)
F (s) = Fp (s)
p
M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, 177
Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_8, © Springer Basel AG 1997
178 Chapter 7 Selberg’s Conjectures
where ∞
bpk
Fp (s) = exp
pks
k=1
kθ
where bpk = O(p ) for some θ < 1/2, where p runs over prime numbers.
(v) (Ramanujan hypothesis) an = O(n ) for any fixed > 0.
Note that the family S is multiplicatively closed, and so is a multiplicative
monoid.
All known examples of elements in S are automorphic L-functions. In all of
these cases, Fp (s) is an inverse of a polynomial in p−s of bounded degree.
Selberg [S] introduced this family to study the value distribution of finite lin-
ear combinations of Dirichlet series with Euler products and functional equations.
For this purpose, he introduced the important concept of a primitive function and
made significant conjectures about them.
A function F ∈ S is called primitive if the equation F = F1 F2 with F1 , F2 ∈ S
implies F = F1 or F = F2 . As we shall see below, one of the most serious conse-
quences of the Selberg conjectures is that S has unique factorization into primitive
elements. It is not difficult to show that every element of S can be factored into
primitive elements. This is a consequence of an old theorem of Bochner [B], though
Selberg [S] and more recently Conrey and Ghosh [CG] seem to have found it in-
dependently.
Selberg conjectures:
Conjecture A: For all F ∈ S, there exists a positive integer nF such that
|ap (F )|2
= nF log log x + O(1).
p
p≤x
|ap (F )|2
= log log x + O(1);
p
p≤x
ap (F )ap (F )
= O(1).
p
p≤x
dim F = 2αF
where
d
αF = αi .
i=1
(Sometimes, we write L(s, ρ) if the field extension is clear. Since L(s, ρ) depends
only on the character χ of ρ, we will also sometimes write L(s, χ) or L(s, χ, K/k)
for L(s, ρ, K/k).) Clearly, the Artin L-function L(s, ρ) is a product of L-functions
attached to irreducible constituents of ρ. We have:
Artin’s conjecture: If ρ is irreducible
= 1, then L(s, ρ, K/k) extends to an entire
function of s.
180 Chapter 7 Selberg’s Conjectures
§2 Basic consequences
We record in this section the results of Bochner [B], Selberg [S] and Conrey-Ghosh
[CG].
Proposition 2.1 (Bochner [B]) If F ∈ S, and αF > 0, then αF ≥ 1/2.
Remark. In their paper, Conrey and Ghosh [CG] give a simple proof of this and
also treat the case αF = 0. They prove that the constraint bn = O(nθ ) for some
θ < 1/2 implies there is no element in S with αF = 0 except the constant function
1.
Proposition 2.2 (Selberg [S]) Let NF (T ) be the number of zeroes ρ = β + iγ of
F (s) satisfying 0 < γ ≤ T . Then,
αF
NF (T ) = T (log T + c) + SF (T ) + O(1),
π
where c is a constant and SF (T ) = O(log T ).
If F = F1 F2 , then clearly NF (T ) = NF1 (T )+NF2 (T ) so that αF = αF1 +αF2 .
Thus, if F is such that αF < 1, then F is necessarily primitive. The following is
now immediate.
Proposition 2.3 (Conrey-Ghosh [CG]) Every F ∈ S has a factorization into
primitive functions.
Proof. If F is not primitive, then F = F1 F2 and by the above, αF = αF1 + αF2 .
By Proposition 2.1, each of αF1 , αF2 is strictly less than αF . Continuing this
process, we find that the process terminates because Proposition 2.1 implies the
number of factors is ≤ 2αF . This completes the proof.
We see immediately that the Riemann zeta function and the classical Dirichlet
functions L(s, χ) with χ a primitive character are primitive in the sense of Selberg.
Indeed, the Γ-factor appearing in the functional equation is Γ(s/2) or Γ((s + 1)/2)
and the result is now clear from Proposition 2.1.
Conjecture B forces the factorization in Proposition 2.3 to be unique. Indeed,
suppose that F had two factorizations into primitive functions:
F = F1 . . . Fr = G1 . . . Gt
where F1 , . . . , Fr , G1 , . . . , Gt are primitive functions. Without loss of generality,
we may suppose that no Gi is an F1 . But then,
ap (F1 ) + · · · + ap (Fr ) = ap (G1 ) + · · · + ap (Gt )
so that
ap (F1 )(ap (F1 ) + · · · + ap (Fr )) ap (F1 )(ap (G1 ) + · · · + ap (Gt ))
= .
p p
p≤x p≤x
As x → ∞, the left hand side tends to infinity, whereas the right hand side is
bounded since no Gi is an F1 . This contradiction proves:
§3 Artin’s conjecture and Selberg’s conjectures 181
where the product is over irreducible characters φ of Gal(K̃/Q) and m(φ) are non-
negative integers. To prove Artin’s conjecture, it suffices to show that L(s, φ, K̃/Q)
is entire for each irreducible character φ of Gal(K̃/Q). By Brauer’s induction
theorem and the Artin reciprocity law, we can write
L(s, χ1 )
L(s, φ, K̃/Q) =
L(s, χ2 )
where χ1 and χ2 are characters of Gal(K̃/Q) and L(s, χ1 ), L(s, χ2 ) are entire
functions, being products of Hecke L-functions. Thus, they belong to S and hence,
by Proposition 2.4, have a unique factorization into primitive functions. We can
therefore write
m
L(s, φ) = Fi (s)ei , ei ∈ Z.
i=1
Decompose the sum on the left hand side according to the conjugacy class C of
Gal(K̃/Q) to which the Frobenius automorphism σp belongs:
|φ(p)|2 1
= |φ(gC )|2 ,
p p
p≤x C
p≤x
σp ∈C
Hence,
|φ(p)|2 |C|
= |φ(gC )|2 log log x + O(1).
p |G|
p≤x C
§3 Artin’s conjecture and Selberg’s conjectures 183
from which follows m = 1 and e1 = ±1. Thus, L(s, φ) = F (s) or 1/F (s), where
F (s) is primitive and analytic everywhere except possibly at s = 1. However,
L(s, φ) has trivial zeroes and so the latter possibility cannot arise. We conclude
that L(s, φ) = F (s) is primitive and entire.
Corollary 3.2 Let K/Q be Galois and let χ be an irreducible character of
Gal(K/Q). Conjecture B implies that L(s, χ) is primitive.
Proof. This is evident from the last line in the proof of the previous lemma. Or
we can derive it as follows. By the previous theorem, F = L(s, χ) ∈ S and by the
Chebotarev density theorem, nF = 1. The result now follows from Proposition
2.5 (b).
Of course, Dedekind’s conjecture that the zeta function of a number field
is always divisible by the Riemann zeta function follows from Artin’s conjecture.
However, it is rather interesting to note that the unique factorisation conjecture is
sufficient to deduce this. Indeed, if K is a number field, K̃ its Galois closure, and
ζK (s) is the Dedekind zeta function of K, then
is also entire. Since ζ(s) is primitive, it appears as a primitive factor in ζK̃ (s) =
ζ(s)G(s). Since ζK̃ (s) = ζK (s)F (s) and F is entire, ζ(s) must appear in the unique
factorization of ζK (s). This is Dedekind’s conjecture.
Conjecture B is that if F is any primitive function which is not the Riemann zeta
function, then
ap (F )
= O(1).
p1+it
p≤x
Exercises
1. If F and G ∈ S, define
∞
−ks
F ×G= Hp (s) where Hp (s) = exp kbpk (F )bpk (G)p .
p k=1
4. Show that the Dedekind zeta function of Q(21/3 ) factors into a product of
two distinct primitive functions, one of which is the Riemann zeta function
and the other of dimension 2.
5. If F, G ∈ S are such that ap (F ) = ap (G) and ap2 (F ) = ap2 (G) for all but
finitely many primes p, show that F = G.
6. For F ∈ S, denote by ZF (T ) the multiset of zeros of F (s) in the region
Re(s) ≥ 1/2 and | Im(s)| ≤ T. For F, G ∈ S, suppose the symmetric difference
satisfies
|ZF (T )ΔZG (T )| = o(T )
as T →∞. Show that F = G.
References
[A] E. Artin, Collected papers, Springer-Verlag, New York-Berlin, 1982.
[B] S. Bochner, On Riemann’s functional equation with multiple gamma factors,
Annals of Mathematics, 67 (1958) 29–41.
[CG] B. Conrey and A. Ghosh, On the Selberg class of Dirichlet series, Duke Math.
Journal, 72 No. 3, (1993) 673–693.
[JS] H. Jacquet and J.A. Shalika, A non-vanishing theorem for zeta functions of
GLn , Inventiones Math., 38 (1976) p. 1–16.
[M] M. Ram Murty, A motivated introduction to the Langlands program, in Ad-
vances in Number Theory (eds. F. Gouvea and N. Yui), pp. 37–66, Clarendon
Press, Oxford, 1993.
[M1] M. Ram Murty, Selberg’s conjectures and Artin L-functions, Bulletin of the
Amer. Math. Soc., 31 (1) (1994) p. 1–14.
[MM] M. Ram Murty and V. Kumar Murty, Strong multiplicity one for Selberg’s
class, C.R. Acad. Sci. Paris, 319 (Series I) (1994) p. 315–320.
[Mu] M. Ram Murty, Selberg conjectures and Artin L-functions, II, in Current
Trends in Mathematics and Physics, A tribute to Harish-Chandra, (edited
by S. D. Adhikari), Narosa Publishing House, 1995.
[S] A. Selberg, Old and new conjectures and results about a class of Dirichlet
series, Collected Papers, Volume II, pp. 47–63, Springer-Verlag.
[V] M.F. Vignéras, Facteurs gamma et équations fonctionelles, Lecture notes in
mathematics, 627 Springer-Verlag, Berlin-New York, 1976.
Chapter 8
Suggestions for Further Reading
with some constants α = 0 and β which depend on f and the test function F .
From this, he is able to deduce that given any > 0, and Y > C( ), the number
of fundamental discriminants of D ≤ Y such that LD (1, f )
= 0 is at least Y 3 − .
2
The method is capable of generalization and extension. For instance, see Murty
and Stefanicki [MS] and Stefanicki [St], as well as [PP].
In [GV], Goldfeld and Viola formulate the following conjecture. Let us sup-
pose that we have a Dirichlet series
∞
an
L1 (s) =
n=1
ns
M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, 187
Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1_9, © Springer Basel AG 1997
188 Chapter 8 Suggestions for Further Reading
∞
an2
L2 (s) = .
n=1
ns
∞
an χ(n)
L1 (s, χ) =
n=1
ns
is an entire function of s.
Here Tχ (s) denotes a product of gamma factors
.
J+
Γ(s + αi+ ) if D > 0
Tχ (s) = .i=1
J−
i=1 Γ(s + αi− ) if D < 0
for positive integers J + , J − , and real numbers αi+ , αi− > −k/2 depending only on
L1 (s). For convenience, we will write
J
Tχ (s) = Γ(s + αi )
i=1
it being clear that J and the αi depend on the sign of D. We also assume that
Aχ = f (|D|)
2λ
' (−δ
L2 (s) = 1 − γp,i p−s i
p i=1
Chapter 8 Suggestions for Further Reading 189
1 ρ 2λ
= (1 + o(1)) [z L2 (k + 2z)]z=0 (1 + wχ ) logρ f (|D|) (1 − γp,i p−k )δi ,
ρ!
|D|≤X i=1
p|D
where the dash on the sum means that we sum over fundamental discriminants.
In Chapters 5 and 6, we have established special cases of this conjecture.
It is natural to consider this conjecture for the quadratic twists of a fixed
automorphic L-function on GLr . (See [Mu] for an introduction to terminology and
notation.) It is then possible to prove upper bounds of general r. This has recently
been done by Y. Zhang [Z, pp. 54–60]. She obtains that if π is an irreducible
cuspidal automorphic representation of GLr (AQ ), where AQ denotes the adele
ring of the rational number field, and π has trivial central character, then
D
L(1/2, π ⊗ ) X (r+1)/2 log2 X
·
|D|≤X
2))), there are infinitely many ray class characters χ of F such that L(s0 , π⊗χ)
= 0.
Non-vanishing near Re(s) = 1 is the subject of investigation in [HR]. Non-vanishing
at s = 1/2 would have consequences for the construction of p-adic L-functions
associated to cuspidal automorphic representation of GL2r as in the work of Ash
and Ginzburg [AG].
Rohrlich [R1], proved the following non-vanishing theorem on GL2 . Let π
be an irreducible cuspidal automorphic representation of GL2 over any number
field F and let s0 be a complex number. Then, there are infinitely many ray class
characters of F (of finite order) such that L(s0 , π ⊗ χ) = 0. Some applications are
given in [R2]. Friedberg and Hoffstein [FH] give necessary and sufficient conditions
for the existence of a quadratic ray class character with this property.
There are other related results such as the recent work of Luo, Rudnick and
Sarnak [LRS] on the Selberg eigenvalue conjecture. This again is an extension of
the methods outlined in the previous chapters.
Recent work of Böcherer, Furusawa and Schulze-Pillot [BFS] raises the ques-
tion of the simultaneous non-vanishing of quadratic twists of two Hecke eigenforms.
There is the problem of Merel [Me] which asks for non-vanishing at s = 1/2
of the L-functions of the twists of a given eigenform by even Dirichlet characters.
Such a result will have applications in determining good upper bounds for torsion
of elliptic curves over cyclotomic fields.
References
[AG] A. Ash and D. Ginzburg, p-adic L-functions for GL(2n), Invent. Math.,
116(1994), 27–73.
[BFS] S. Böcherer, M. Furusawa and R. Schulze-Pillot, On Whittaker coefficients
of some metaplectic forms, Duke Math. J., 76(1994), 761–772.
[BR] L. Barthel and D. Ramakrishnan, A nonvanishing result for twists of L-
functions of GL(n), Duke Math. J., 74(1994), 681–700.
[FH] S. Friedberg and J. Hoffstein, Non-vanishing theorems for automorphic L-
functions on GL(2), Annals of Math., 142 (1995) pp. 385–423.
[GV] D. Goldfeld and C. Viola, Mean values of L-functions associated to elliptic,
Fermat, and other curves at the center of the critical strip, Journal of Number
Theory, 11 (1979) pp. 305–320.
[HR] J. Hoffstein and D. Ramakrishnan, Siegel zeros and cusp forms, Int. Math.
Res. Not., 1995, pp. 279–308.
[Iw] H. Iwaniec, On the order of vanishing of modular L-functions at the critical
point, Séminaire de Théorie des Nombres, Bordeaux, 2 (1990) pp. 365–376.
[LRS] W. Luo, Z. Rudnick and P. Sarnak, On Selberg’s eigenvalue conjecture,
Geom. and Func. Anal., 5(1995), 387–401.
References 191
M.R. Murty and V.K. Murty, Non-vanishing of L-Functions and Applications, 192
Modern Birkhäuser Classics, DOI 10.1007/978-3-0348-0274-1, © Springer Basel AG 1997
Name Index 193
194
Subject Index 195