Professional Documents
Culture Documents
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at .
http://www.jstor.org/action/showPublisher?publisherCode=annals. .
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Annals of Mathematics is collaborating with JSTOR to digitize, preserve and extend access to The Annals of
Mathematics.
http://www.jstor.org
On heightsof algebraicsubspacesand
diophantine approximations
By WOLFGANG M. SCHMIDT
Introduction
The basic problemof diophantineapproximationscould be formulatedas
follows. Assumea Cartesiancoordinatesystemin euclidean E2 to be given.
Let L be a line throughthe origin0. Find a line L' through0 and defined
over therationals Q (definedby a linear equation withrationalcoefficients)
whichis "as closeas possible"to L. As a measureoftheircloseness,introduce
/r(L,L') = sinAp,where p is the angle betweenL and L', 0 < p < r/2. Now
obviouslyforeverys > 0, therewill be infinitely manysuch linesL' satisfy-
ing J(L,L') < s. This closenessof L and L' will in generalonlybe possible
if L' is fairly"complicated". As a measureof the complicatedness of L', we
defineits heightH(L') as follows. If L' goes through0 and is definedover Q,
there is a unique pair of points (p, q), (- p, - q) on L' so that p, q are
relativelyprimeintegers. Now put
H(L') = (p2 + q2)1/2
one has
F,(x) < FE(x) < F,(x) < n"12FE(x)< nF.(x).
Let K be an algebraic numberfieldof degree [K: Q] = p and I the ring
of integersof K. An ideal will always be a subringof K closed undermulti-
plication by elements of I, and finitelygenerated as an I-module. Let a, ***, a,
be the different of K intothe fieldC of complexnumbers. For
isomorphisms
C K, put d(i) for the image of e under at and vj(d) = I 1'(i lo
Let Kn be the vector space of n-tuples(2,, *.., * i , j e K. Let L be a
432 WOLFGANG M. SCHMIDT
one-dimensional
subspaceof K% Choosea pointX (2, ** , i) + 0 of L and
writea a(X) forthe ideal generatedby 2, * , (a consistsof all elements
i.
the permutation
/1 ...d d
+i1 ... n
Vijl ***i jd i jl ijn-d/
'a)'(S*) for subspacesSd of Kn. Using this and (5) (appliedon KN) one finds
(6) HF(oj(Sd)) ?< C d(o?)HF(Sd) .
From now on, H(S) will always mean HF(S) with F= FE.
2. Generalized determinants
Given vectors X1, * , Xm in euclidean or unitary vector space, denote by
<X,, ***, Xm> the subspace generated by these vectors. Also, if A, B, *** are
subspaces, denote the subspace generated by them by <A, B, * * *>. Given
vectors X1, ... , Xm, form the determinant I XiXj I (1 < i, j < m). This de-
terminant is zeroif X1,** ,Xmare linearlydependent.Otherwise,if Z1, ... ,Zm
434 WOLFGANG M. SCHMIDT
LEMMA 5. The index of A(X1, ***, Xd) in A(S) is N(a), where a is the
ideal of K generated by r, ***, ONE
Now Lemma 5 impliesd(A(S)) N(a)-1d(A(X1, * , Xd)), andthetheorem
followsfrom(7) andLemma4. Henceit will sufficeto showthese two lemmas.
PROOF OF LEMMA 4. Let Xt = (itjl ..., *t), and let f1, **., ,S be an
integral basis of K over Q. Now
*, Zpl, .., ..
Zily .. Zldi Zpd
where
..
Zjk =((Sjekl)[] ..
* * (*S ( iek)[P' * (a $ )[1], (a i )[p])
0
OY(2) ... o X
(1 .) * . Y(P) /
where Y'i) is the matrix ($'i)) (1 < k < d 1 < i < n) (1 < j < p). A typical
not triviallyvanishingdeterminant of a pd x pd-submatrixis nowof the form
'1' *** Y)d where 1 < ii < N and (y), ** ,) are the Grassmann coordinates
introducedjust beforeLemma 4. This gives
J(I (j X) 12 + ?+ I y(j) 12)
whence
det(A(X, * **, Xd))2 - 2-2r2dd* A2d
HP (| ((i) 12 + ...+ (j) 12)
= A2d D2(Xl(j, ... Xj))
and Lemma4 follows.
PROOF OF LEMMA 5. The indexof A(X1, * , Xd) in A(S) equals the index
of I(X1, ..., Xd) in I(S). Let aX1 + .* + adXd be in I(S). Then airj must
be integral,ai~j E 1(1 < i < d, 1 < i _ N). Now let p E Ibe fixedand a common
multipleof l, * **,By.. Then/i = aip E I(1 < i < d). It will sufficeto show
that the number of d-tuples 81,. . ., /d of I modulo (p) such that
15lXl + *** + CdXd is p timesan elementof I(S), equals N(a).
Writea and the principalideal (p) as a productof prime-ideals;
a=pi .P.. 4ps (p) = 4pf'... Ppfs (ej < fj)
What one has to show is that the numberof integral solutions 81,** d
modulo4/i of
( 9) Sit $~k + ***+ iSdedk -=0 (modptfi) (k = 1, * n* )
equals N(pQ)ei foreach i (1 < i < s). Now let i be fixed,and set 4p,e, f for
Pi, et,fi, respectively. Write 81$lk + ... + /didk as an innerproductBRk,
whereB = (,81,* ,d), Rk = ($lk, *dk).
$ The congruences(9) now become
(10) BRk =0(mod Op) (k = 1, ,n).
Withoutloss of generalitywe may assume thedeterminant of R1, * , Rd
(it is one of the rj) to be in pe but not in pe~l Any Rk may be written
ciRl + *** + Cdd. Since each determinantIR1, **, Rj>, Rk, R+1, ... , Rd I is
in pe, each (c) musthave p with non-negativeexponentin its factorization.
Thus the congruences(10) will be satisfiedif and onlyif theyare satisfiedfor
k= 1,. .,d.
Lemma 5 now followsfromthe case g, 9d = f of
438 WOLFGANG M. SCHMIDT
R1, , T, Rd. ,
G' is the projection of some G E A(Kn), where G is not in the subspace spanned
by A(Sd). Say G is the image of X under p, G = p(X). Now Xe I(Kn), but
XC Sd.
Let Sd+l be the (d + l)-dimensional subspace of Kn spanned by Sd and by
X. Now A(Sd+l) contains the lattice A* spanned by A(Sd) and by G1, ***, G,
where Gj = p(,SjX) (j = 1, * * *, p) and where ,81, .**, ,Sp is an integral basis of
K/Q. Thus it will suffice to show
By Lemma 7,
|p(,SjX') |< CH(Sd)-IIV(n_ d) (j=1***SA
Let H1, ... , Hpd be basis vectorsof A(Sd). By (iv) and Lemma 2 of ? 2,
d(A*) = D(H1q ...*, Hvdq G1, *** d) =D(Hiq**,Hd G1,*, )
<D(H1q ..** Hvd) I Gf Gf = 'IdH(Sd) |Gf Gf
?< c4IH(Sd)Il(Sd)l/I(n-d) - _(Sd)(n-d-1)/(n-d)
LEMMA 11. Assume 1 < d < n, and assume N(d - 1, H) < c11H* is true
for all H > 1. Also assume H' > Hf" > 1 and
H " = cHf(d-1)Id
442 WOLFGANG M. SCHMIDT
By partialsummation,
N(d, H')
? C13H n~ld((Hff ? l)dn E 1 + Elch-H" hdn1 ES1 1)
H(S") < X" h integral H(S1)h /h
? ? 8d/n c16H't ?
8c1l5H'n c128d/nHn
The inner products xi, *., Xf are not quite arbitrary. If d + e > n,
at leastd + e
Ad n Be hasdimension - n, whencex, =d+e-n = 1. Put
g = d + e-n,
(2) t = min(d, e, n - d, n - e) = min(d, e, e - g, d - g) .
We definev , v, as follows. If d + e < n, then t=f, and set v*=
Xj(Ad,Be) (i = ..., t). If d + e> n, thent = f-g, and set of = Xi+,(Ad,Bd)
(i 1, .*,. 9One has 1 > vi >-... >-vt >-?.
Call two pairs of subspaces Ad, Be and A*d, B'e similar if there is a
unitarytransformation T withz(Ad) = A*d, z(B8) = B*e. This is an equivalence
relation. Similar pairs have equal vj, , Vt.
THEOREM 5. Given 0 < d < n, 0 < e < n and 1 >-vi > .. _ vt >-?
wheret is definedby(2), thereis up to similarity exactly one pair of sub-
spaces Ad, Be having 2? = >*(Ad,Be) (i = 1, ***, t).
PROOF. (a) Existence. Assume d < e. Choose an orthonormalbasis
El, ***, E. of Un. First case: d + e < n, whence t = d. Put s* (1 2)1/2 -
(i = . . .
1, ., d). Set Ad= <X1, .., Xd>, Be = <Y1, ..., Ye> where
X* = E(i= 1, * *,d)y Y*= >*E*+(o)Ed+i(i= 1, * * * ,d), Yj = Ed+j(j= d +1,. * A
Then X1, *.**, Xd are orthonormal, Y1, **, Ye are orthonormal, and Xi Yj =
v*a*j(1 < i < d, 1 < j < e). Second case: d + e > n, whencet = n-e. Set
HEIGHTS AND DIOPHANTINE APPROXIMATIONS 445
=Xqt = 1 (g = d + e-n),
Xi = *** X+ = (i = 1, ** *,t), and put (i)=
(1-_\)12(i = 1s ,d). Set Ad = <X1, ... , Xd>, Be = <Y, *.**, Ye> where
X*= E*(i = 1s ...,d) and
VJA1, B 1) = vi(AgB) (i = 1, . ..
t).
PROOF. The relation for t is obvious. Dim A+ dim B+ dim A' + dim B'
2n. By symmetrywe may assume d + e = dim A + dim B < n. Without loss
of generality assume d < e, whence t = d.
Now we make a simple observation. Suppose A is the orthogonal sum of
V and A1, B the orthogonal sum of V and B1. Then the successive inner
products of A, B consist of the successive inner products of A1, B1, plus r
dim V times the number 1.
To prove the theorem, set V= A n B and A1, B1 for the orthogonal
complement of V in A, B, respectively. Our observation applies. Since
A1rnl = 0, 1 does not occur among the innerproducts of Al, B1. If A1,...* *,d-r
are the inner products of A1, B1, then the inner products of A, B are 1, *..., 1
(r times), Xi ..., _d-r. There are orthonormal bases X1, ... , Xd-r and
...
Y, * Ye-r of A1 B1, respectively, such that Xi Yj = Xi&ij(1 ? i < d -
(V1(A , B), *..., It(A', B')) = (1, *..., 1 (r times), X1,*. . , d-r)
whereX1, *., Xd and Y1, **, Ye are arbitrary bases of Ad, Be, respectively.
If X is a vector 7 0, B a subspace # 0, put aj(X, B) = aol(<X>, B). As
is easy to see,
(8) aj2(X, B) + a()2(X, B') = 1
foreverysubspaceBe, 0 < e < n.
LEMMA 13. Let 0 < d < n, 0 < e < n, f =min (d, e), and a a non-
HEIGHTS AND DIOPHANTINE APPROXIMATIONS 447
Then
(0 (Ad, B e) -< (216)i't2 (O
using inductionon j. For j = 1, (10) is obvious. We next deal with the case
448 WOLFGANG M. SCHMIDT
Now write the given vectors X1,X2 in the form X1 = c11Z, + C12Z2,IX =
c21Z1 + C22Z2,and set s = max(I C12 I I C2 1). Then I cjj | _ 1 - (j 1, 2) and
1 - ~> \(XI, X2) =X1X21 = I C11C21 + c12c22? 1 26, whences > 6/2. If,
say, 6s =c C121 > (X1,B) > 62wD ? G2/2, whence (11). This proves(10)
forj = 2.
Now suppose(10) has beenshownforj - 1. We have to show co(X,Be) <
(2/0)(6-11'/o2 foreveryX e Ai. This is true forX = Xi, whencewe mayassume
X, Xi to be linearlyindependent.Set A2(X) = <X, Xi>, and let Xj* # 0 be in
the one-dimensionalsubspace A2(X) n Ai-1. Then X(Xj, Xj*) < 1 - and
ao(Xj, B) < w, o(Xj*,B) < (2/0)(i-2)'2w by induction.
Since (2/1)(2/1)j2 '2wo <1,
we may apply the case j 2 of (10) on Aj(X) and obtain (02(A2(X), B) <
(2/1)(lj-1)2aw, whence w)(X,B) < (216)'i-!'/2
COROLLARY. Let Ad, Be,I, i be as in the theorem. Let X1, ... , XiG Ad,
Y I* Y, e Be be vectors,none of them0. Assume
Xk) < 4-i
X(Xjy (1 < j < k < i),
9X3 I Y3) < 60 U
( ,*oi
Then
wJ(A,B) < 2iwo.
PROOF. We have to show X1, ..., X3 are linearly independent,and Ai
<Xi I ... , Xj> satisfies X(X,, Ai-') < I (j = 2, ... , i). We may assume X1 =
* Xi = 1.
d
(, *... in) As before, I will denote the ring of integers of K. With
X e Kn we associate
(3) X = p(X) = (XE1], ... YX[P])
for sufficiently large C2. This shows the existence of a Xj e A satisfying (i)-(iv).
We choose such a Xj with D(XA, ... , XjA) as small as possible. This finishes
our construction of X1, ... , Xu.
There are Xi e I(Kn) with XY = p(Xi). We set Bi = <X1, ** *, Xi> and are
going to verify that Bi has all the required properties. First we will show
_
(5) X1 ? 2nC2Hn/c+I/P
if H is large. If this were not, (i) and the negation of (5) gives
- 2n+lC2H n/c+l/P
I Xl I , Ilk ? 22'+1C2H-Ic+1'p
(7) w Xl Aby>i
th Xs t(Xkio
/2, eie1 )
XjD (4by!)2 (1 k< . ).
We will do this by induction on j. Define Yj by XjA =iXjA I Yj (j= 1,*ly ,
452 WOLFGANG M. SCHMIDT
This proves (7). By the argument of the corollary of Theorem 7, X1A, ... ,YXA
are linearly independent. Hence also X1, ... , X. are linearly independent,and
dim Bi = i (i = 1, ..* * ).
Our next aim is to verify (4). By (iv) it sufficesto take j 1. By (i), (5)
and (7)
|Xi* | < I XiA I (i=1 .. )
At the end of the proof one has I Xi w(o(A, Xi) < C2H-n2c+lIP, whence
The desired inequality (2) now follows from (7) and the corollary to
Theorem 7.
HEIGHTS AND DIOPHANTINE APPROXIMATIONS 453
whence
c)w(Ad, B e-l) < CgcH(B"')-ys (i = 1, ..., h)
On the other hand, if instead of (9),
(11) 0) (Ad, Be) = 0 (i = 1s ... 9h))
put
Y = e(qh)-l .
Then, for given H' > C9H, there is a subspace B'e- ci B', defined over K, of
height H(B'e-) < H', having
H(Be-l)ai(Ad, B'') < CjoHsY'HJ-(Yo-l) (i = 1, ..., h) ,
whence
w-)i(Ad, Be-') < C,,HY'H(Be-l)y6 (i = 1, , h)
h)
454 WOLFGANG M. SCHMIDT
(j =1,..,h)
HEIGHTS AND DIOPHANTINE APPROXIMATIONS 455
whereX*e S *, I,, e Th and X,is orthogonalto both S * and Th. The set of X
satisfying
( ib) X*e H,
(iib) XJi < HI-(Yj-(2Y-l)/(ep)) (H/H(Be))112h (j = 1 h; i 1, 2),
(iiib) Ilo I < C12HI(2y-l)/(ep)
is convex,symmetric, and of volume > 2"An if C12is large. Thereis an I e A,
X # O,in thisset. Let X = p(W).
IWY, yJl]
1,4W[1] _ W[2] yr2] + i(W[1] y[2] + W [2]Y41])
-
IX?)j + iXW,7 I < IX
Y-JHI + I4-J7T'I 21Xji,
where Xj is the orthogonal projection of XT on Tj. By (iib), XIj I <
C13H-
(yj-(2y-1)/(ep)) (H/H(B e))1/2h , whence
(14) WYj I < 2C13H-(Yji-2Y(l)I'eP')
(H/H(Be)) ((j = 1 , h)
Now let either(a) or (b) hold. (ii), (iii) impliesin eithercase
IY- * I < C14H(qy-[)1(ep)[
This implies IWVRW I < 2C14H'qy-l)l(ep) R WI, hence, since R Wis arbitrary
K(j) and thereforedense in Bedj),
in Bed() nK
(15) VWjI < 2C14H(qy-1'1'ep)
This is trueforj =1, *..., p.
Let a be the ideal of K generatedby all determinantsof m x m-sub-
matricesof the m x n-matrixwith row vectors Z1, * , Zm, and b the ideal
456 WOLFGANG M. SCHMIDT
= p&2( (B e-l)1)I Y)
-2
= D2(Yi, WJ Zi, ,
Zm) I *2
i|2D((I, Z ,
= 1
-iY~Wi 12D2(Z1, ** Zm) (D(W, Z1, *,
Now by (II. 8),
co)(B -l Yj) = YW
j ID(Z1, * * Zm)/D(Wg Z19 .. *I* Zm) I
Dq(W, Zi, ***, Zm) ap(B'e-, Yi) = qyWtqDq(Z1, *, Zm)
This gives
H(B e-~~(B e i)
= (e (i= 1,., h)
Also, by hypothesis,
H(B e)soq(Xi, Yj) _ cqH-qyi+l
whence
H(Be l)q(Xig Yi
1) CH-qi+l+(qy-l)le = C5H-(qyi-')(qy+e-l)le
(iv) I Zi
-H
l~lep (j = 2y ... . p).
This set is convex,symmetric,
and of volume
> C23H(B e) (H 11ep(H1H(Be))11d)d( C22H- (np-n+d+e)/(ep(n-d-e)) n-d-e (H ilep) (n-e) (p-1)
> 2nPAn
for large enough C22. Let X 0,. X e A be in this set. If X p( W), then
WX Be because X*E I. Set
B + = <Be, W>.
Now if Be = <X1 **, Xe>, whence
H(B e) = N(a)-l II`=, D(Xj), **, Xe( ))
? cC22IJ -(d+e)I(e(n-d-e))
Small H are dealt withas at the end of part (a) of the proofof Theorem9.
*qj(A
d
Bi) < H(B') lCH -dl (i(n-d)) < CHH-d/(j(n-d))
This takes care of the case e = j.
For H > 1, define H1 by H = C3Hlin-e-l)n-ewhere C3 is the constant of
the Going-up-theorem.If H is large, H1 > 1. Assumingthe truth of our
theoremfora particulare, therewill be a Be, definedover K, with H(Be) <
H1, q(Ad Be) < C24H-d(n-i)(i(n-d)(n-e)). thereis a
By the Going-up-theorem,
Be+1, definedover K, with
H(Be+1) < Hf *q(A d Be+1) C24H-dn-j)(j(n-d)(n-e))[(n-e)(-e-1)]
y = (i + n - t)/(q(j + n
d - e - 1)). -
whence
pq(A d9 Bi) < C27H(Bi)-(n/i)(l/(n-d)+l/(n-d _l)+ -+l/(n- d-i+l)) (i =S ** U).
c Bi
PROOF. The existence of B1 follows fromTheorem 8. Suppose B' c ..
whence
pq(Ad , B e) < C28H(B
e)-n(n-t)/(t(n-e))(l/(n-d)+- -+l/(n-d-t+l))
d~~~~a
= . . . 9 Yd>ad. (? d
and set Ad 1<Y, *,
Now let B1 be an arbitrary 1-dimensional subspace defined over K. Say
B' = <X>, X = (1, ...*, E%) with ~i e I (integers of K). We may, because of
the finitenessof the class-number, assume the ideal a generated by 1, ...,
to have norm < C30= C30(K). Define c0 by
I Y11k+ + .+* eddk - ed+k <? colX ID( Y1, **, Yd)H(B l) nIc
(C31, ** C39 depend on Ad). For c < i < d, I LrTi(X)I < C321
, X I, whence
On the other hand, the left side, being the absolute value of the normof
L(X) # 0, whereX e I(Kn) and the coefficients of L(X) are fixedelementsof
the field L, is > C39. This proves c0 C29,
> and hence our theorem for t =
e = 1.
We next take care of the case wheree < d, whencet = e, n + t- d - e =
n - d. The same subspaceAdas beforewill do. This followsby inductionon
e. Suppose
*t(Adq B e) = c~oH(B e)-nlqtc
so thatL is real in case (a) and complexin case (b). Thereare subspacesAd
of Gl, definedoverL, so thatfor everyBe, definedover K,
pq(Ad, B e) > C4oH(Be) I S
of the matrix with row vectors X1, *.., Xd, Y, ..., Ye has absolute value at
mostthis large. Since Ad n Be # 0, one of the determinats,say D0, is # 0.
D. is of the formD. = -ik + XiYk, wherethe xi are d x d-subdeterminants of
the matrixwithrow vectorsX1, ... , Xd, and the Yk are similarlydefined.
J1_R
I < coC46H(Be)-lIqD(Yi, , Ye)
fll~~lRl2= [Dij I oCqH(B6)-1D( Yj, .. * .e)lq9
J1=J=1I oij ? c(CH(Be)-l IL D( Yf1i,* *,eY
< C Cq5H(Be)-1 H(Be)lN(b)l < CqC46
For some fixed t # 0, t E L, xit are integers 1 < i < (n)), hence tDo is
integral. The leftside of the last inequality,being the normof Do, is there-
fore > N(t)-1P,and this yields c3 > N(t)-'PC4-1,
whencecq > C40. This proves
our lemma.
LEMMA15. Let K c L c &?befields,whereK is infinite,L/K a separable
finite algebraic extension of degree 1, and &i algebraically closed. Let
U**, a be theisomorphismsof L/K in &2. Finally, let h(X('', *. . , X'V) =
(x),* .., x...
***, , ***.., x() be a non-vanishing polynomial in sl variables
and coefficients in K. There is an X E L3 so thath(X01,.* *, Xci) + 0.
PROOF. Let 8l,9 ..., f be a basis of LIK. Let X = (x1, ... , x8) e Ls and
Xij = (xoi, *...,x). There are y'j) in K(1 < i < s, 1 < i < 1) with xi
y1)581 + *. + y('),S5, whence xzi = ylf)lij + *a + Let Y(j) = (yj*i),
y'd') (j =1 *.., 1). Then h(X01, *..., Xlz) Y (l)), where k is a
- k((1), *...,
non-vanishing polynomial.This is true since the transitionXn'e,..., X01
s
Y(1)..., Y(1) is accomplishedby a linear transformation of determinant
lf3 j Is # 0. Now since K is infinite,one may choose Y(l), ** , Y(1) in Ks such
that k(Y(1), ..., Y(1)) # 0.
We now turnto the proofof Theorem17. The basic notionsof algebraic
geometryneeded can be found in Artin[1] or Lang [12]. An expositionof
Grassmanvarieties can he found in Hodge-Pedoe[5, ch. VII]. Given fields
K1 c K2, let Td(K2/K1)denotethe transcendency degree of K2 over K1. For
therestof thissection,Q willdenotetheprimefieldof the fieldK of Theorem
17, &?(universal domain) an algebraicallyclosed fieldof infinitedegree of
transcendencyover K, L. All the fieldsconsideredwill be subfieldsof &a.
Givenan algebraicset S, dimS will be the maximumof the dimensionsof its
componentvarieties.
To everysubspaceAdof &P, we let correspondthe one-dimensional sub-
=
space y(Ad) <Y> of &WN where Y (C... , N) N =() and (, **, rN
is a set of Grassmancoordinatesof Ad. The union of all the subspacesy(Ad)
HEIGHTS AND DIOPHANTINE APPROXIMATIONS 467
REFERENCES