Professional Documents
Culture Documents
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
is collaborating with JSTOR to digitize, preserve and extend access to Annals of Mathematics
By PAUL VOJTA*
p 1
-qq
-a < <s +
Of course, the best possible estimate of this form is due to Roth: the
right-hand side can be sharpened to 1/q2 + . But the weaker estimate above
suffices to prove Siegel's theorem for finiteness of integral points on certain
affine curves. We show in this paper how similar methods imply finiteness of
rational points on certain projective curves, namely those of genus g > 1. Thus,
this paper provides another proof of the following theorem:
As is the case with Faltings' proof, this proof does not provide an effectively
computable upper bound for the heights of rational points, only for the number
of rational points. Also, this proof does not contain Dyson's theorem. Subsequent
work will address the possibility of incorporating the non-compact case into this
proof.
We introduce the following notation.
0.3. Throughout this paper, let R be the ring of integers of a number field,
and let B be the arithmetic curve corresponding to Spec R (defined in (0.5),
below). Let K(B) denote the field of rational functions on B. Let w: X -> B be
regular semi-stable family of curves over B with smooth generic fibre C of genus
g. Again, X will be regarded as an arithmetic variety. We also assume that all
nodes occurring on the (non-archimedean) geometric fibres of X are rational over
the residue field, and that at each node, the tangents for both branches of the
fibre (in the completed local ring) are also rational over the residue field.
The semi-stability and rationality assumptions will hold after a finite base
change. Since a base change does not weaken the conclusion of Theorem 0.2, no
generality is lost.
Our proof of Theorem 0.2 starts with the assumption that there are
infinitely many rational points on C and eventually derives a contradiction. The
proof uses the notion of the index of a section of an invertible sheaf: the
contradiction follows from incompatible upper and lower bounds for the index.
ai 12 =0
for all pairs (il. i2) of natural numbers satisfying,
i Ii ~2
- +-< t.
d1 d2
(3) Choose rational points P1, P2 on C. These are chosen such that (a) the
height h(PF) is sufficiently large, (b) the ratio h(P2)/h(P1) is sufficiently large,
and (c) a certain sphere-packing condition holds.
(4) Prove a lower bound for the index of the divisor Z of step 2, at the
point f = (P1, P2), where r is close to h(P2)/h(P1).
(5) This lower bound contradicts the upper bound (Dyson's lemma) proved
in [V2].
Note that steps 1 and 5 take place on the generic fibre, and that the
intermediate steps are where the arithmetic takes place.
The above steps will be described in Sections 1-5, respectively. Sections 6
and 7 describe complementary results regarding bounds on the number of
rational points, and regarding algebraic points of bounded degree, respectively.
U~~_ P
CH(X)Q $= ( CH (I9) ?z Q.
p?0
deg: CH ) - R.
( A(,_)) On R
For each complex embedding 0a: K(B) - C let AYt, be the (Arakelov)
canonical volume form on CQ':= C X, C for which the adjunction formula holds
(see [L2, Ch. II, ?2]). Let w0X/B denote the relative dualizing sheaf, in the sense
of Arakelov theory. Points P on C rational over K(B) correspond to sections
sp: B -- X of wr: X -* B; th
Ep on X. Let the height of
(Ep (OXI/B)/(2 g -
We note that the singula
points lying over double points of the fibres on each factor. Thus (by the
rationality assumptions of (0.3)), they are of the form x1y1 = x2y2, where xi an
yi locally define branches of the fibre at the origin. These singularities can
removed by introducing new variables tl = x1/x2 (= Y2/Y1) and t2 = x2/x
This replaces the singular point with a projective line; in particular, the
desingularization has fibre dimension one. If the double points are the same on
each factor (i.e., if the singular point lies on the diagonal), then we also require
that x1 = x2 and Yi = Y2 (as functions on Xfin); hence the strict transform of th
diagonal is a smooth section of the projection onto each factor. (More generally,
one obtains a different desingularization by interchanging xl and yl; these tw
desingularizations are "flops" of each other, in the sense of Mori.)
0.6. We conclude this introduction with a brief historical note. The main
method of this paper dates back to 1909, when Thue [Th] proved a result similar
to Theorem 0.1, with the exponent 2m + e replaced by m/2 + 1 + e. This
was proved using the index as above, with the general outline roughly similar to
steps 1-5 above. Instead of sections of line bundles, however, his proof used
polynomials in two variables of bidegree (1, r) for sufficiently large r E Z. He
used this estimate to prove finiteness of pairs (x, y) E Z2 satisfying the Thue
equation
f(x, y) = a,
where f is an irreducible homogeneous polynomial of degree at least 3. Siegel
[Siel] generalized Thue's method to polynomials of bidegree (s, r), 1 < s < m,
sharpening the exponent to
m
min + s + e.
1<s<m S + 1
For simplicity we will also take 8 < 1. Let r > 1 be any rational number such
so that (Al. F1) = (At. F2) =O. Also, F2 = F2 = O. A,2 = 2g, and (F1.F2) = 1.
Then let
( 1 .4) yr = 28 > 0.
PROPOSITION 1.5. If
Definition 2.4.3. The coherent sheaf Rqcr *E on B is the coherent sheaf for
which
(Rq7W*E)fil = Rq(7rrfi)*Efil
and which has hermitian metric obtained from H0 q(g,; E,) via the Dolbeault
theorem.
Aq :=ad* + d*D
has eigenvalues 0 < A1 < A2 < ... and an associated "zeta function"
vqor(s) EAi-S
i>O
which converges absolutely for Re s > n. (This zeta function also depends on E,
its metric, and the KUhler metric.) It is known [Se] that this extends meromor-
phically to the complex plane and is holomorphic at s = 0. Then, motivated by
the definition of the Ray-Singer "analytic torsion" [R-S], let
LEMMA 2.6.2. For i = 1,2, let ri: Xi -> Spec R be the finite part of an
arithmetic surface with generic fibre Ci. Let q: W -> Spec R be a regular
three-fold over Spec R, such that there exists a birational morphism onto
Xi XSpec R X2. Let Y be a divisor on W. for which:
(a). The restriction of Y to the generic fibre C1 X C2 of q is ample;
(b). The desingularization W -> X1 X X2 has fibre dimension at most one;
and
(c). For all closed points X E X2 and all components Z of the geometric
fibre of P2 at x, M(Y) I Z has nonnegative degree.
Also let 0 be a fixed invertible sheaf on W. Then
deg R2q * (-Y0 @ (dY)) = 0(d2)
as d -> oo.
Remark. For the application to Theorem 2.2, one could instead use [V3,
Lemma 2.3].
Y = L + EDi,
i=l
To prove (2.6.2.1), it will suffice to show that this grows at most linearly in d.
By more exact-sequence-in-cohomology arguments, it is possible to show
that
The proofs in this subsection closely follow the methods of Bismut and
Vasserot ([Bi-Vi], [Bi-V2]).
ts Tr[exp( tAqd)]
d-n + 0O0 t dt
(2.7.1) tqd(s) = r f tS Tr exp dqd)
Then clearly
d -q AdS() = d sqdd(s)
and therefore
d- [ Tr ( -xdp - )] ak +
j= -n
Proof: Part (a) follows from [Bi-V2, (14)] and the remark immediately
following (19). Part (b) follows from op. cit. (19). Note that these parts of [Bi-V2]
do not depend on the assumption that the curvature of y/ is positive. M
Also, by (b), one can replace the o(tk) in (a) by O(tk+1) and still have the
uniformity statement. It then follows from (2.7.1) that
s + 0O0 t dt
+ FF(s
-+1 +
tsdn' Tr exp(
1)11P d -q,
dAf~)
d t
+ q~dS EadJ+ aq,d,o
t 0 dt~~
(2.75) '() d- "Tr expf - -~ ) Af- a i
(2.7.5) gq~ ~~~~~~pdd(O)
+j d-nrTr exP( - -A d)]-
+ E q, da j _qa d OF(1)
i =-n Xi
before except that an upper bound on the second term is needed. For this, it
suffices to find a lower bound on the first eigenvalue, as the following lemma
shows.
LEMMA 2.7.7. Let M, n, q, d, and A'qd be as above, and assume that the
first eigenvalue Aq d of A'q d satisfies the inequality
(2.7.7.1) Aq d db- bo
By Proposition 2.7.3, the first factor on the right is bounded; let C' equal this
bound times ebi. The lemma is then immediate. M
complex vector bundle with hermitian metric (resp. metrized complex line
bundle) on C1. Let N0 be the curvature VE2 of the holomorphic hermitian
connection VE on E and assume that the curvature of -/ is -donut for some
fixed do > 0. Fix d E N. let f be a non-zero section of AO' 0(E ? _/d), and let
AOd denote the Laplacian on AO '(E ? _Zd). Choose a basis w for the tangent
bundle TC1, such that ,uj = -1dw A diT. Let
A - - Ki is
Then
[t1, A] = -1.
The lemma then follows by the definition of I[No, A] I. U
Aod ? 2wdd0 - C.
AOd = (K*Dfdfd)
Kfdx afd).
di =dii A d-
we have
a = di + d2'
Now, by the choice of the KUhler metric, we have dz1 orthogonal to dz2
(pointwise, in fact) and therefore
Now, note that (by the choice of KUhler metric on M) the hermitian
pairings on AO q(E ? _Z d) (on M) and AO q((E ? _Zfd)IFx) are related (respec-
tively) by the formula
(2.7.8.4.3) f fd IFxXIdIFX) y2 = 1. 2
as before by Proposition 2.7.3 (b). Thus ;4 d(?) is bounded from above; the result
then follows by (2.7.2) and (2.7.4). El
(2.9.1) IIZII12 ,r ? 1.
Before giving the proof, we note that this result implies Theorem 2.2.
Indeed, let do be an integer cancelling the denominators in Yr and b; then
Y = do(Yr + by).
By Lemma 2.8, condition (a) holds, and the other conditions hold by construc-
tion.
We also note that the semi-stability assumption on X is used to show that
condition (d) holds. Moreover, note that condition (e) is an archimedean
analogue of (d).
d3
degL2 q*,Zd ? -(Y3)6 + 0(d2 log d).
Therefore,
degL 2 q* -d,r, b
- +00
rank q +r0
as d becomes large. By a generalization of Minkowski's second theorem (e.g.,
[Bo-V, Th. 3] or [VI, 6.1.10]), there exists a section of qua dr b (and thus a
section of y4,r b) satisfying (2.9.1).
Except for a few minor changes in normalization, the first half of this
section is identical to the function field case, but we will repeat it for the
convenience of the reader.
(Ep1.Ep )
(3.1.1) [K(B): Q] 1 2
h = (E..X/B)/(2 g - 2) [ K(B): Q]
is the height. If P1 and P2 are two rational points, then we also write F
D1, D2, h1, and h2 for the quantities associated to E1 = EP, and E2
(respectively) via (3.1.2).
By (3.1.2), it follows that
Now let us consider the divisors Fi'. Since the intersection numb
lies in the range [0, 1] for all components Cj of singular (non-archimedea
0 = arccos(( g + 8 + -y + Ig
(cf. (1.1)). Then (3.1.1) follows immediately from (3.1.10), (3.1.11), and (3.1.12.1).
1cEmin
2[K(B):Q] c~xc
If P1 * P2 and
h2 2 hi > 2 Vg + 8 + y + a)C2 + co + C3
2h1 2 2(gg+8 + y+8-)I
then the ratio h2/hl is boundedfrom below by a constant > 1 dep
X, 'y, 8, and hi.
E = pp*El.p2*E2
On the generic fibre C X C, E corresponds to the point (P1, P2). Also, let s:
B -* W be the section of q whose image is E.
Remark 3.5. Note that neither divisor Ej passes through a double point of a
fibre of XT; therefore W is isomorphic to X XB X in a neighborhood of E.
divisor D on B: let
(Cf. (2.3.4.1).) In pa
in the support of some D E Z(W), then deg j * D is defined. If moreover 1 is
the canonical section of &( D), then also j * D = (j * 1) and the degrees of both
coincide with deg j * ( D). We define the corresponding notions for arithmetic
Q-divisors by linearity.
From this discussion and from Proposition 3.1, it follows that
degs*A (E1.E2)
[K(B):Q]= [K(B):Q]
[K(B):Q]
deg s*V
[K(B):Q]=
Therefore
deg s *A' -
[K(B):Q] ? 2 (+ + h)h2 + C3.
Let -r 1 be any positive number, and let r be a number satisfying the
rationality assumptions of Section 1 as well as the condition
h2 h2
(3.6) <_<
h1 h (1 -
Thus
h 1h2 ? Vrhl(l - q
Let 72 be any positive number
role of the numbers mqj is not cr
will be taking the limit as j -*
From Theorem 2.2, it then follows that the divisor d(Yr + bV) is linearly
equivalent to an effective arithmetic divisor whose Green's form satisfies (2.2.1).
But also
We have, trivially,
on C such that
1 ?(z) < 2.
In the sequel, we will write such statements as
LEMMA 3.9.2. Letf be a real function on DP, which can be written in the
form f(z) = If(z) 12f2(z), where f1 is holomorphic and f2 is positive, real, and
smooth. Also assume that for some integer m there exist integers il,... im and
real constants b ,..., b., such that
(3.9.2.1) ddc log f < (b1IzI2i,-2 + +b mIZi2im-2)
v_11
x dz A di.
2
X hlm f(z)
Z-*O IZI2n
where n = ordzo(fl).
Proof: Let fo(z) = f(Z)/ I Z 12n; by assumption this can be extended contin-
uously to z = 0. Let
Let nf(s) be defined as the number of zeros (counted with multiplicity) of the
holomorphic function f1 in DS, and let n0(s) = nf(s) - n. Then, for all but
finitely many values of s,
h'(s) = - dc log fo
2~~~
by Stokes' theorem. Thus
2 IZ2i- +1i T2
< -f (b1IzI2i'2 + ... +bmIZI2im 2) 2 dz A dz
S
il t~~~~m
and therefore
Now let
S 27r T. dO
g(s) = fo(se26) 2
By concavity of the logarithm,
therefore
all P E C,
11111L2 > 1 2n
- exp(-2wn IIl(P)II.
f(z) = IIl(j(Z))112.
By assumption,
Fl-f
< 2n dz A dz.
2
2W T. dO
g(s) = ff(se26) 2
Then
f =2wfsg(s) ds
2wTf(O)f se-27ns2(ds
2 Jo
2 f() 22e-2rrnp2
2n
But also
I|lIIL2 ? fj*/2|f
Dp Dp
Proof of Proposition 3.9. This now follows directly from two applications of
Lemma 3.9.2, once on each factor Ci. The value of c4 is given by
2
4 1 - exp(- 2wTdOP2)
Returning now to the section y, assume that y does not vanish along E.
Then s * y is a nonzero section of s *fj(d(Y, + bV)) with nonnegative multi
ities at all the finite places. At the infinite places, (2.2.2) and Proposition 3.9
imply that
In this section, we will use similar techniques to produce a lower bound for
the index. Instead of using a covering construction as in [V3], however, it is
easier to work directly with certain coefficients. I thank the referee for suggest-
ing this approach.
IIYIIL2,cr < 1
for all u: K(B) - C. Let t = t(y, (P1, P2), dal(r), da2(r)). Then there exis
constants c5, C6, and p > 0, depending only on C; integers al, a2 satisfying
a1 a2
+ = t.
(4.1.1) dal(r) da2(r)
For the rest of this proof, we assume that all objects C, At, y, etc. are
implicitly complexified via v.
Since there are only finitely many a, the constants c5, c6, and p resulting
from the following argument can be made independent of ar.
From the proof of Proposition 3.9, recall the holomorphic coordinate
patches Jp: DP -* C (P E C); also let j1 = jpl and j2 = jP2 Let yo be a fixe
holomorphic local section of -Z in a neighborhood of A, with llyo(f) I = 1. Then
in such a neighborhood,
O(Z1,z2) = E bizt12
(i)
for some b(E) e C. Let el, e be positive even integers with e2/e1 = r. Then, for
-y E C, let
1 An
where
Jb(n-i)/ejii/e2 if el I n - i and
O otherwise.
In particular,
P e2a2 (a)'
1 an 2
(4.1.5) 11b(,l < n!)2 f0n*(O) |
There exists a constant c5 such that for all P E C,
For i = 1, 2, one can regard zi via ji as a local section of e( - Pd), having norm
? e2c.
Therefore,
e 2c5dal(r)t an 2
(n!)2 -Z n ?Y(0)
by (4.1.5). Letting
fy(z) 1 an 2
(4.1.6) lim Iz 2n = (!)2
where
By (3.9.1),
a2
dzld2 2 g (jP1(zJ) iP?(z2)) < 7Fc6.
(Using the known expression for ddcg [L2, II, Th. 3.3], one can take c6 = 4g,
but we will still use c6 to preserve generality.) Then
Recall that, by assumption, e1 and e2 are even; also, since e1 < e2 and VI = 1,
f, is defined (and the above bounds hold) for Iz I < p' = pl/ei. By Lemma 3.9.2
and (4.1.6), it follows that
21T.io dO
? lim sup ff(se?6)
*Ila(a)112 exp(-2c5dal(r)t)
- 2nlogp' + logc2d2. E
IIYIIL2,u ? 1
for all C: K(B) - C. Then there exist constants c5, C6, and p > 0,
only on C, such that the index
+ (log c4d)/d)/1 .
In order to make this expression a little more palatable, we note th
r, h1, and d become large and - 1 becomes small, the bound for t appro
(g - 1)Vg + 8
In Step 5 it will follow from Dyson's lemma that one must have
/g+8 lrj
+ (log c4d)/d).
Combining these inequalities and solving for t give (4.2.1). E
We continue with the notation of the preceding section; also, for simplicity,
we now assume that 8 < g/8.
PROPOSITION 5.1. The index t = t(s, (PF, P2), da1, da2) satisfies the
equality
2 25 2g - I
(5.1.1) 8g + 2r
Proof It follows from the m
(dYr)2 a2
(5.1.2) f(t) f~<2d
2 + 2 2ala2
(2g-1)
2a1
where f(t) is defined by
f(t) f dzdz2
Z1 +Z2< t
therefore f(t) < 1/2 and so t < 1, f(t) = t2/2. This then gives (5.1.1). E
Notice that the term (log c4d)/d has dropped out of this expression:
nothing else depends on d, and we may take d arbitrarily large. Moreover,
nothing depends on 77l or -2 except r, which tends to h2/h1 as -* 0.
Therefore, taking limits of both sides as i -* 0 gives
C-5log P( 4y2
2g - 2 (g _ 1)25 }
Then, provided the quantity inside the square is positive, (5.3) becomes
2
y-(g-_1)28/2y Y 285
(5.5) <(g-1 g+8 g+ + 2
2 + (2gl 7h
(g-_1)2(g +8) <g +85+(2g h2
Let
(5.6) 8 = (2g -2 )2
that subset; let h1 denote its height. Then any other point P2 in that subset has
height h2 satisfying
Indeed, for all -rj1 > 0 there exists a value of r satisfying (3.6) as well as the
rationality restrictions of Section 1. If (5.7) does not hold, then (1.5.1) holds,
and one then finds a global section y of &)(d(Yr + bV)) for some d > 0 and
b = c1Vr/8 + r,2, deriving a contradiction in steps 4-5 if mi are suffici
small. Thus the points in each subset have bounded heights; since there are
finitely many such subsets, the same is true of C(K(B)). Theorem 0.2 then
follows since a curve has only finitely many rational points of bounded height.
This proof is ineffective because, for any given subset, we are unable to
determine whether or not it is empty.
THEOREM 6.1. Let r denote the rank of the Mordell-Weil group of the
Jacobian Jac(C) of C. Also let
hm
in(C2(=-max,
+ 8 + y + ) 2y(C1/8
(g - + C3+
1)28wp2(2Vg + + C6))
15(c5 - log p
2g-2 8)'
rma
max 2max (2 g -1
2r log rmax
sinri 0 cos 0 log rmin
The values of rmax and rmin can be given explicitly; for exam
one can set
g - Vg
2
7. Algebraic points
In the function field case, this is related to the arithmetic genus of Hp; si
in the number field case, it is related to the discriminant of K(P) plus
contributions coming from the singularities of Hp. For points on P', it e
(2sp - 2)h(P) + 0(1).
THEOREM 7.1. Fix E > 0 and s E N. Then, as P varies over all algebraic
points of C of degree sp < s,
REFERENCES