Professional Documents
Culture Documents
Ha Huy Vui
Hanspeter Kraft
Peter Russell
David Wright
Polynomial Automorphisms
and Related Topics
Lecture Notes of the International School and Workshop
ICPA2006, October 2006, Institute of Mathematics
Hanoi, Vietnam
Editors:
Hymann Bass
Nguyen Van Chau
Stefan Maubach
Authors:
Arno van den Essen
Ha Huy Vui
Hanspeter Kraft
Peter Russell
David Wright
Editors:
Hyman Bass
Nguyen Van Chau
Stefan Maubach
Preface III
Suggested Notation for Groups of Polynomial Automorphisms IX
Chapter 1. Polynomial automorphism groups 1
1. What is a polynomial automorphism? 1
2. Recognizing an automorphism 2
3. The case n = 1, R a domain 2
4. Notation 3
5. Canonical homomorphisms 3
6. Subgroups 4
7. Formal inverse 5
8. More subgroups 6
9. More on n = 1 6
10. Elementary and tame automorphisms 7
11. Stable tameness 9
12. The case n = 2, R a field 10
13. Structure of GA2 (k), k a field 16
14. Structure of T2(R), R a domain 18
15. The case n = 3 19
Chapter 2. Perhaps the Jacobian Conjecture is simple 21
Introduction 21
1. The classical formulation 21
2. The Rolle formulation 22
3. The degree of the inverse 26
4. Zhao’s vanishing conjecture 30
5. The Dixmier Conjecture 33
6. Endomorphisms of canonical Poisson algebras over commutative
rings 40
Chapter 3. Basics from algebraic geometry 43
1. Affine varieties 43
I
II CONTENTS
2. Morphisms 60
3. Dimension 69
4. Tangent Spaces and Differentials 81
5. Normal Varieties and Divisors 101
Chapter 4. Embedding problems in affine algebraic geometry 113
1. The problem and first examples 113
2. The case of affine spaces 114
3. The high codimension case 115
4. General and generic fibers 117
5. More on n=3 121
6. Curves with one place at infinity 123
Appendix chapter 4 part 1: Notation 127
Appendix chapter 4 part 2: More on general and generic fibers 129
Appendix chapter 4 part 3: Plane algebroid curves 131
Chapter 5. Bifurcation set of the global Milnor fibration 137
1. Introduction and definitions 137
2. Case n = 2 138
3. Rabier-Jelonek Fibration Theorem 144
4. Jelonek set 148
Bibliography 153
List of notations 157
Index 159
Preface
III
IV
Organizations
Organizers
Scientific Committee
List of mini-courses
List of talks1
David Wright
The notation R[n] for the polynomial ring in n variables over a commuta-
tive ring R has now become standard. As the group of R-algebra automor-
phisms of R[n] and certain of its subgroups have become the focus of much
attention, some unifying notation is greatly needed.
We often see notations such as AutR (R[X1 , . . . , Xn ]) or AutR (R[n] ) for the
full group of automorphisms, but this is akin to writing AutR (Re1 ⊕· · ·⊕Ren )
or AutR(Rn ) for the group of R-module automorphisms of the free R-module
Re1 ⊕ · · · ⊕ Ren = Rn . Instead, linear algebra and algebraic K-theory call it
the general linear group and use the simpler functorial notation GLn (R) or
GL(n, R), which presumes that a basis {e1, . . . , en } has been chosen.
A polynomial automorphism is given by a vector of polynomials F =
(F1, . . . , Fn ), with X = (X1 , . . . , Xn ) being the identity. We first observe that
in order to have the more natural group law
F ◦ G = (F1(G1 , . . . , Gn ), . . . , Fn (G1 , . . . , Gn ))
The general linear group. We then have the general linear group GLn (R)
naturally contained in GAn (R) as the subgroup of all F = (F1, . . . , Fn ) for
which each Fi is a linear form. Here an invertible matrix A is identified with
the automorphism F given by the vector (A · X)t , where X = (X1 , . . . , Xn )t.
We can view GLn (R) anti-isomorphically as the stabilizer in R[n] of the R-
module RX1 ⊕ · · · ⊕ RXn .
The affine group. The affine group consists of those automorphisms of the
form F = (L1 + c1 , . . . , Ln + cn ) where L1 , . . . , Ln are linear forms in R[n] and
c1 , . . . , cn ∈ R. We denote this group by Afn (R). Note that GLn (R) ⊆ Afn (R),
and, in fact, GLn (R) = Afn (R) ∩ GA0n (R).
The group of translations. Also contained in the affine group is the group
of translations, i.e., those automorphisms of the form
F = (X1 + c1 , . . . , Xn + cn )
with c1 , . . . , cn ∈ R. We denote this subgroup by Trn (R). It is isomorphic to
the additive group Rn .
The triangular group. Unlike the subgroups defined above, the trian-
gular subgroup depends on the order of the variables X1 , . . . , Xn , as does
the linear triangular group B2(R), sometimes called the Borel group. We
will take it to be the group of lower triangular matrices. An automorphism
F = (F1 , . . . , Fn ) is called triangular if, for i = 1, . . . , n, Fi has the form
ui Xi + f (X1 , . . . , Xi−1 ), with ui a unit in R. Such automorphisms form
a group which can be viewed as the non-linear extension of Bn (R). We
will call it the nonlinear Borel group, and denote it by BAn (R). Note that
BAn (R) ∩ GLn (R) = Bn (R).
The affine triangular group. The affine triangular group, affine Borel
group, Bfn (R) is defined as the intersection BAn (R) ∩ Afn (R). It consists of
those affine automorphisms whose linear homogeneous part is upper trian-
gular. The Jung-van der Kulk Theorem can be stated as
GA2 (k) = Af2 (k) ∗Bf2 (k) BA2 (k)
for k a field.
We note that all the other groups defined here stabilize and we similarly
denote their direct limits, e.g., by T∞ (R), EA∞ (R), Ws,∞ (R).
CHAPTER 1
David Wright
ϕ=ϕF
R[X1 , . . . , Xn ] −−−→ R[X1 , . . . , Xn ]
F =Fϕ
AnR ←−−− AnR
Fϕ is given by F = (F1, . . . , Fn ) ∈ R[X]n where Fi = ϕ(Xi ).
Contravariance means:
Fψ◦ϕ = Fϕ ◦ Fψ and ϕG◦F = ϕF ◦ ϕG .
We will adopt 2 as our definition, so that composition is given by
F ◦ G = (F1(G1 , . . . , Gn ), . . . , Fn (G1 , . . . , Gn ))
and the familiar Chain Rule holds:
J (F ◦ G) = J (F )|X=G · J (G)
where
D1 F1 · · · Dn F1
J (F ) = ... .. .. , D = ∂ .
. . i
∂Xi
D1 Fn · · · Dn Fn
The group of automorphisms (over R in dimension n) is taken to be the
set of such maps which are invertible, under this law of composition.
1
2 1. POLYNOMIAL AUTOMORPHISM GROUPS
2. Recognizing an automorphism
How does one tell if F = (F1, . . . , Fn) ∈ R[X]n determines an automor-
phism?
If so then F −1 ◦ F = (X1 , . . . , Xn ), so
J (F −1 ◦ F ) = J (F −1)|X=F · J (F ) = In ,
hence a necessary condition is that J (F ) ∈ GLn (R[X]), i.e., det J (F ) ∈
R[X]∗ .
Sufficient? For R a field of characteristic zero (or just a Q-algebra), this
is the famous Jacobian Conjecture.
Note: If F is an automorphism, then J F |X=0 ∈ GLn (R),
JF |X=0 = (aij ),
where ai1X1 + · · · + ain Xn is the linear homogeneous summand of Fi .
So a necessary condition for F to be an automorphism is that “the linear
part of F is good”.
Proposition 1.1. F = (F1, . . . , Fn ) is an automorphism ⇐⇒
R[X1 , . . . , Xn ] = R[F1, . . . , Fn ].
Proof. =⇒ : Clear, since for G = F −1 we have X1 = Gi (F1 , . . . , Fn ).
X 7→Fi
⇐=: ϕF : R[X1 , . . . , Xn ] −−i−−→ R[X1 , . . . , Xn ] is surjective, i.e., F has a
left inverse. Show F injective. If not, choose H ∈ Ker ϕf and replace R by
the Noetherian ring R0 generated by the coefficients of F , G, and H. The
proposition now follows from the proposition below.
ϕ
Proposition 1.2. A surjective ring endomorphism A → A, where A is
Noetherian, is injective.
Proof. If not, we have
0 $ Ker ϕ $ Ker ϕ2 $ Ker ϕ3 $ · · ·
violating the Noetherian property.
4. Notation
We will sometimes write R[n] for the polynomial ring R[X1 , . . . , Xn ].
For G a group we write h · · · i for the subgroup of G generated by the
elements of G or subsets of G listed, and we write moniod [ · · · ] for the sub-
monoid so generated.
GAn (R) = group of polynomial automorphisms in dimension n over R,
i.e., automorphisms of AnR = Spec R[X1 , . . . , Xn ] over Spec R.
It is the group of invertible elements in MAn (R) = monoid of all algebraic
endomorphims of AnR over Spec R.
For F = (F1, . . . , Fn ) ∈ GAn (R) (or even MAn (R)) the degree of F ,
denoted by deg F , is the maximum of the total degrees of Fi in the variables
X1 , . . . , X n .
5. Canonical homomorphisms
(1) Functoriality (base change): A ring homomorphism α : R → R0 extends
to α̂ : R[X] → R0 [X], sending Xi to Xi , inducing a group homomorphism
GAn (R) → GAn (R0 )
F = (F1, . . . , Fn ) 7−→ (α(F1 ), . . . , α(Fn )).
(2) Stability: We have GAn (R) ,→ GAn+m (R) identifying F = (F1 , . . . , Fn )
with
F [m] = (F1, . . . , Fn , Xn+1 , . . . , Xn+m ).
We define GA∞ as the direct limit (union) of
GA1 (R) ,→ GA2(R) ,→ GA3(R) ,→ · · ·
(3) Restriction of scalars: We have GAm (R[Y1 , . . . , Yr ]) ,→ GAr+m (R) defined
by
(F1, . . . , Fm ) 7−→ (Y1 , . . . , Yr , F1, . . . , Fm).
4 1. POLYNOMIAL AUTOMORPHISM GROUPS
6. Subgroups
• General linear group: GLn ,→ GAn as the stabilizer of RX1 ⊕· · ·⊕ RXn .
t
a11 · · · a1n a11 · · · a1n X1
... ..
.
.. 7−→ ..
. .
..
.
.. ..
. .
an1 · · · ann an1 · · · ann Xn
• Translations: Rn ,→ GAn defined by
(b1 , . . . , bn ) 7−→ (X1 + b1, · · · , Xn + bn )
k k
b X +b
defines a homomorphism whose image is denoted by Trn (R), the subgroup
of translations.
• Affine group: It is defined as
Afn (R) = {F ∈ GAn (R) | deg F = 1}.
It consists of all F = (F1, . . . , Fn ) where
Fi = ai1 X1 + · · · + ain Xn + bi
with ai1, · · · , ain , bi ∈ R.
Moreover,
a11 · · · a1n b1
.. .. .. ..
Afn (R) ∼ . . . .
=
a · · · a b ⊆ GLn+1 (R)
n1 nn n
0 ··· 0 1
and
Afn = GLn nTrn
n
where GLn acts on Trn = R in the usual way.
• Origin preserving automorphisms:
GAon (R) = {F ∈ GAn (R) | F (0) = 0}.
This is (anti-isomorphically) the stabilizer of the ideal
(X1 , . . . , Xn )R[X1 , . . . , Xn ].
We have GLn ⊆ GAon with retraction
F 7−→ J F | X=0
7. FORMAL INVERSE 5
7. Formal inverse
Suppose P ∈ MAn (R) with “good” linear part, i.e., J P |X=0 ∈ GLn (R).
Then, there exists A ∈ Afn (R) such that F = A ◦ P = X + higher terms .
P will be an automorphism if and only if F is.
Such an F determines an automorphism of R[[X1 , . . . , Xn ]], so there exists
a unique system of power series G = (G1 , . . . , Gn ) of the form X + higher
such that G ◦ F = F ◦ G = X. G is called the formal inverse of F .
8. More subgroups
• Fixator of {X1 , . . . , Xs }: For 1 ≤ s ≤ n, the restriction of scalars map
gives
GAn−s (R[s] ) ,→ GAn (R)
identified as the subgroup of all F of the form
F = (X1 , . . . , Xs , Fs+1 , . . . , Fn).
Example: (X, Y +X 2 ) ∈ GA2 (R) can be viewed as an element of GA1 (R[X]).
Aside: Note that GAon (R) ∩ GAn−s (R[s] ) is not contained in
GAon−s (R[s] ). (see above)
This restricts to GLn−s (R[s] ) ⊆ GAon (R).
Example: The “Cohn matrix”
1 + XY Y2
∈ GL2(R[X, Y ])
−X 2 1 − XY
gets identified with (X, Y, (1 + XY )Z + Y 2 W, −X 2Z + (1 − XY )W ) ∈
GAo4 (R).
• Stabilizer of R ⊕ RX1 ⊕ · · · ⊕ RXi : For i = 1, . . . , n, let Wi,n (R) be this
stabilizer. The containment Afi (R) ⊆ Wi,n (R) via the stability homomor-
phism (F 7→ F [n−i] ) admits the retraction Wi,n → Afi which is the restriction
from R[n] to R[i] , whose kernel is GAn−i (R[i] ). Therefore
9. More on n = 1
Suppose R is a reduced ring, F = (a0 +a1 X+· · ·+ad X d ) ∈ GA1 (R), p ⊆ R
a prime ideal. Base change gives F = (a0 + a1 X + · · · + ad X d ) ∈ GA1 (R/p),
so a2 , . . . , ad = 0 and a1 6= 0, since R/p is a domain. Since ∩p = {0} we must
have a2 , . . . , ad = 0 and a1 ∈ R∗ , so F = (a0 + a1 X). This shows:
GA1 (R) = Af1(R) for R reduced.
p
If R is not reduced the above shows a2, . . . , ad ∈ Nil (R) = {0} and
a1 ∈ R∗ .
10. ELEMENTARY AND TAME AUTOMORPHISMS 7
N = A ◦ B ◦ A−1
2 !
1 1 1 1
= X + t2 (Y − X 2 ), Y − X 2 + X + t2 (Y − X 2 ) .
t t t t
We have:
N ∈ GA2 (R) (overring principle),
N ∈ T2 (K),
/ T2 (R) (we’ll see why later).
N∈
Moreover, N is stably tame over R. In fact:
N [1] ∈ T3 (R).
Specifically, letting
A = (X, Y, Z + (tY − X 2 )),
B = (X − tZ, Y, Z)
we have
N [1] = ABA−1B −1 .
However, if R = k[t] a polynomial ring, then
N∈
/ T3 (k)
by restriction of scalars. This will be discussed later.
Question: Is T∞ (R) = GA∞ (R) ?
10 1. POLYNOMIAL AUTOMORPHISM GROUPS
#$
*'#+)!,
%
!"
&'!$()#"(
Theorem 1.6 (Jung, van der Kulk). For k a field (of any characteristic),
T2 (k) = GA2 (k).
The original proofs of this theorem can be found in [43] and [50], the
latter extending the result from characteristic zero to arbitrary characteristic.
We will outline a proof given by Makar-Limanov,
P as modified by Dicks [24].
Given (F, G) ∈ GA2 (k) we write F = αij X i Y j and set m = degX F ,
n = degY F . We view the monomials X i Y j as elements of the free abelian
group hX, Y i which we identify with Z2 via X i Y j ↔ (i, j). We define
supp F = {X i Y j | αij 6= 0}.
and note that supp F lies within the rectangle Γ having vertices 1, X m , X m Y n
and Y n , which contains the triangle ∆ determined by 1, X m , Y n (Figure 1).
12. THE CASE n = 2, R A FIELD 11
& $
#$%& ' (
! "
Set Z = kAj − µB i k, and observe that Z and W ij must have the same
V -degree (since A and B are both V -homogeneous). Therefore Z and W ij
are independent, so Z and W are independent.
Let A0 = Ac B d , B 0 = Aj B −i − µ. From (2) above it follows that
k[A±1, B ±1 ] = k[A0±1, (B 0 + µ)±1 ]
(since A = (A0)i (B 0 + µ)d , B = (A0)j (B 0 + µ)−c ). So
kk[A±1,B ±1 ]× k = kk[A0±1, (B 0 + µ)±1 ]× k
⊆ kk[A0, B 0 + µ]× k × hW i (see below)
= kk[A0, B 0]× k × hW i
⊆ moniod [W ±1 , Z].
To justify the second line, note that kA0k = W and kB 0 + µk = 1. Let
H ∈ k[A0±1, (B 0 + µ)±1 ]× and write H = A0M (B 00 + µ)N h(A0 , B 0 + µ) where
h is a polynomial. Then
kHk =kA0kM kB 0 + µkN kh(A0, B 0 + µ)k
= W M kh(A0, B 0 + µ)k.
Thus we have X, Y ∈ moniod [W ±1, Z], and hW i = hkAk, kBki, complet-
ing Step 2.
Step 3 We have
• X m , Y n ∈ supp F ,
• supp F ⊆ ∆,
• kF k = X m ,
• hW i = hXi (i.e. W = ±X).
Proof: By Step 2, kAk = W i , and we can replace W by W −1 to assume
i ≥ 0. By Step 1, |F | = λA` and this implies kF k = kAk` = W i` . Now
kF k ∈ moniod [X, Y ] (being in the parallelogram Γ), and this forces W to
be in Γ as well; in particular, W ∈ moniod [X, Y ].
Now note that since X, Y ∈ moniod [W ±1 , Z], we must have X and Y
lying in the same half plane determined by W , hence W lies on one of the
lower two sides of Γ, which says W , and hence kF k, is a power of X or a power
of Y . But supp F meets the X and Y axes only within ∆, so kF k = X m or
Y n . This implies supp F ⊆ ∆, since any monomial M in Γ − ∆ would have
kMk > kF k.
Also, it is not possible that kF k = Y n as this would say there are no other
points in supp F along the line segment joining Y n and X m , contradicting
the fact that m = degX F . Therefore kF k = X m .
16 1. POLYNOMIAL AUTOMORPHISM GROUPS
or
ϕ(bc1a1c2 a2 · · · cr (ar )) = 1
(with ai , cj , b as before) then r = 0 and b = 1.
Now let R be a domain, and set G = T2 (R), A = Af2 (R), C = BA2 (R).
Then B = A ∩ C is the group Bf2 (R) consisting of automorphisms of the
form
(uX + a, vY + bX + c)
with u, v ∈ R∗ , a, b, c ∈ R. As before, let H = A ∗B C and consider the map
ϕ : A ∗B C → T2 (R).
This is surjective, by (1). Injectivity follows from (2) and the following
lemma:
Lemma 1.9. Let R be a domain. Given
A1 , . . . , Ar ∈ Af2 (R) − Bf2(R),
C1, . . . , Cr ∈ BA2(R) − Bf2 (R),
write
Ci = (ui X + ei , vi Y + fi (X))
with di = deg fi (= deg Ci ). (Note that di ≥ 2 since Ci ∈
/ Bf2 (R).) Then:
(1) Writing
A1 ◦ C2 ◦ A2 ◦ · · · ◦ Cr (◦Ar ) = (P, Q)
we have d2 · · · dr = deg P ≥ deg Q.
(2) Writing
C1 ◦ A1 ◦ · · · ◦ Cr (◦Ar ) = (S, T )
we have d1 · · · dr = deg T > deg S.
Proof. When r = 1 both statements are obvious. Assume r ≥ 2 and
that the lemma holds for r − 1. Let C2 ◦ A2 ◦ · · · ◦ Cr (◦Ar ) = (S 0, T 0), A1 =
aX + bY + s, cX + dY + t). Note that b 6= 0, since A1 ∈ / Bf2 (R). Inductively
we may assume d2 · · · dr = deg T 0 > deg S 0 , by (2). Then the P and Q of (1)
are determined by (P, Q) = A1 ◦ (S 0, T 0), hence
P = aS 0 + bT 0 + s
Q = cS 0 + dT 0 + t
from which it follows (since b 6= 0) that d2 · · · dr = deg P ≥ deg Q, which
proves (1).
18 1. POLYNOMIAL AUTOMORPHISM GROUPS
in T(R), which results from the proposition upon noting that the top forms
of N are tX 2 and tX 4 .
Question: For R a domain (even R a UFD, or R = k [1], or R a DVR), is
GA2 (R) ⊆ T∞ (R)? (In other words, are elements of GA2(R) stably tame?)
15. The case n = 3
In 2002 Shestakov and Umirbaev proved this landmark theorem [69]:
Introduction
In his Nobel lecture on December 11, 1965, the physicist Richard Feyn-
man, after having given various different descriptions of electrodynamics,
came to the following conclusion:
“Perhaps a thing is simple if you can describe it fully in sev-
eral different ways, without immediately knowing that you are
describing the same thing.”
It is the aim of this paper to show that the Jacobian Conjecture un-
doubtedly satisfies the hypothesis of Feynman’s statement, by giving five
completely different, fully equivalent desciptions of it. Only the future will
tell if indeed the Jacobian Conjecture is simple!
F 7→ (F ∗ : g 7→ g(F )).
So describing invertible polynomial maps from Cn to Cn is the same as de-
scribing C-automorphisms of C[x]. Sometimes we use F to denote the n-
tuple (F1, . . . , Fn ) and sometimes we also use it to denote the corresponding
C-automorphism F ∗.
21
22 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
(J G)(F (x))J F = In ,
where JF denotes the Jacobian matrix ∂F i
∂xj
of F . Taking determi-
1≤i,j≤n
nants gives
det((J G)(F (x))detJF = 1,
which implies that detJF ∈ C∗ .
Now one can wonder if the converse is true. This problem became famous
under the name Jacobian Conjecture:
The conjecture was first formulated by O.Keller, in [48] 1939, for the
special case that the polynomials Fi have integer coefficients and became
famous as “Keller’s Problem”. However it can be shown that Keller’s problem
is equivalent to the Jacobian Conjecture ([27], Chapter 1).
Apart from the trivial case n = 1, the conjecture remains open for all
n ≥ 2, inspite of the efforts of many mathematicians during six decades.
More recently the conjecture appeared as “problem 16” on a list of 18 famous
open problems in the paper Mathematical problems for the next century by
the fields medalist Steve Smale [70]. For an extensive history of the conjecture
and its relations with many other open problems, we refer the reader to [9]
and [27].
G(x) = F (x + a) − F (a).
Then degGi ≤ 2 for all i. Furthermore G(0) = 0 and, putting c = b − a
we have G(c) = 0 and c is non-zero. Observe that JG(x) = J F (x + a), so
detJG ∈ C∗ . Now write G = G(1) + G(2), its decomposition in homogeneous
components, and consider G(tc) = tG(1)(c) + t2G(2) (c). Differentiating gives
d
G(1) (c) + 2tG(2) (c) = G(tc) = (J G)(tc).c 6= 0 for all t ∈ C
dt
since detJG ∈ C∗ and c is non-zero. Finally, substituting t = 1/2 gives
G(c) 6= 0, which contradicts G(c) = 0. So F is injective.
24 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
Remark 2.4. Although Proposition 2.3 looks rather far away from the
general case, it was shown in [9] and [78] that it suffices to investigate the
Jacobian Conjecture “only” for all polynomial maps F with degFi ≤ 3 for all
i, in each dimension. So one fails to settle the conjecture by just one degree!
Now let’s turn to the proof of Theorem 2.2. The proof we will give below
consists of the following two parts.
1) First we show that an injective polynomial map F : Cn → Cn is automat-
ically surjective.
2) Then we deduce that the inverse, which by 1) exists as a map of sets, is
in fact a polynomial map.
So let’s look at the first statement. The proof we will present now uses
a very useful technique, namely solving a problem over C, by reducing it to
a problem over a finite field. To be more precise, we will show that every
injective polynomial map F : Cn → Cn is surjective, by reducing it to the
obvious statement that every injective (polynomial) map F : k n → k n is
surjective, in case k is a finite field!
The proof is based on the following general fact (see [27], 4.1.5)
Proposition 2.5. Let A be a finitely generated Z-algebra which is a
subalgebra of some Q-algebra. Then for almost all prime numbers p there
exists a maximal ideal m of A, containing p, such that A/m is a finite field.
Lemma 2.6. Every injective polynomial map F : Cn → Cn is surjective.
Proof. Let F : Cn → Cn be a polynomial map.
i) Then, F is injective if and only if for all (a, b) ∈ Cn × Cn the equations
F1 (a) − F1(b) = 0, . . . , Fn(a) − Fn (b) = 0
imply that
a1 − b1 = 0, . . . , an − bn = 0.
In other words, F is injective if and only if every zero (a, b) ∈ C2n of the ideal
I := (F1(x) − F1(y), . . . , Fn (x) − Fn (y))
in C[x, y] is a zero of the ideal J := (x1 − y1 , . . . , xn − yn ), or equivalently
(by the Nullstellensatz), there exist m ∈ N and aij (x, y) in C[x, y] such that
for all i
X
(1) (xi − yi )m = aij (x, y)(Fj (x) − Fj (y)).
j
2. THE ROLLE FORMULATION 25
ii) Suppose that F is not surjective. Then there exists a point (c1 , . . . , cn ) ∈
Cn which does not belong to F (Cn ), in other words such that the system
F1(x) − c1 = 0, . . . , Fn(x) − cn = 0
has no solution in Cn . By the Nullstellensatz this is equivalent to: there exist
a1(x), . . . , an (x) in C[x] such that
X
(2) 1= aj (x)(Fj (x) − cj ).
j
iii) Now let S be the subring of C generated over Z by all coefficients ap-
pearing in the equations (1) and (2). So S is a finitely generated Z-algebra.
By Proposition 2.5 it then follows that there exists a maximal ideal m of S
such that k := S/m is a finite field. Now passing from S to k we get k n and
an induced map F0 : k n → k n . From (1) it follows (by reduction modulo m)
that F0 is injective and by (2) we get that c does not belong to F0(k n ). So
F0 : k n → k n is not surjective. However an injective map from a finite set to
itself (in our case F0 : k n → k n is such a map) is automatically surjective. So
F0 is surjective, and we have a contradiction!
Proof. (of Theorem 2.2) By Lemma 2.5 F is surjective, so by Theorem
2.7 below, using that F is injective, so #F −1(u) = 1 for almost all u in
Cn , we get that C(x1, . . . , xn ) = C(F1 , . . . , Fn ). So there exist polynomials
gi , hi ∈ C[y] such that
xi = gi (F1, . . . , Fn )/hi (F1, . . . , Fn ) and g.c.d.(gi , hi ) = 1 for all i.
UB(n). For every d ≥ 1 there exists a positive integer C(n, d) with the follow-
ing property: for every Q-algebra R and every R-automorphism F of R[x],
having detJF =1 and degF ≤ d, the degree of its inverse is bounded by C(n, d)
The point in the Uniform Boundness statement is that one only considers
R-automorphisms F of R[x] having detJF = 1. This restriction rules out our
“bad” example above, whose degree can be arbitrarily large, depending on
the ring R. Namely if F = x+εx2, then detJ F = 1+2εx 6= 1. Apparently, by
Theorem 2.9, the existence of a uniform bound C(n, d) for the degree of the
inverse of an R-automorphism with detJ F = 1, independent of the Q-algebra
R, is a necessary condition for the Jacobian Conjecture to be true.
Now the remarkable point, observed by Bass around 1983 (in [8]), is
that the uniform boundness statement is also sufficient for the Jacobian
Conjecture to be true. In fact a stronger converse, with a very elegant proof,
was obtained by Harm Derksen in 1994 (in [21]):
Theorem 2.10. (Derksen) Let n ≥ 1. Then U B(n) implies JC(n, C),
where U B(n) is the following (weaker) version of the uniform boundness
statement:
U B(n). For every d ≥ 1 there exists a positive integer C(n, d) with the fol-
lowing property: for every m ≥ 1 and every Cm -automorphism F of Cm [x]
with detJF = 1 and degF ≤ d, the degree of its inverse is bounded by C(n, d).
28 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
F T ◦ GT = x = GT ◦ F T
(compositition of formal power series in x). Now let l > C(n, d). Reducing
the previous equality mod T l we obtain that F T is a Cl -automorphism of
Cl [x], whose inverse is equal to
l−1
GT = x + G(2) T + . . . + G(l) T .
Also detJx F T = 1 and degF T ≤ d. So by U B(n) we get
(*) deg GT ≤ C(n, d).
l−1
However, GT = x + G(2) T + . . . + G(l) T . So if G(l) 6= 0 then, using that
l−1
T 6= 0, we get that degGT = l > C(n, d), contradicting (*). So G(l) = 0
for all l > C(n, d). In other words G is a polynomial map, as desired.
So Derksen’s theorem asserts that one has to consider as coefficient rings only
the special Q-algebras of the form Cm , with m ≥ 2 and all Cm -automorphisms
with detJF = 1.
Now one can push this result one step further: namely in [28] and [29]
the author obtained the following reformulation of the Jacobian Conjecture,
which asserts that one “only” needs to consider a very special class of Cm -
automorphisms F of Cm [x] with det J F = 1, namely so-called exponential
automorphisms coming from nilpotent derivations with divergence zero.
Before we can state this result we need to introduce some useful notions.
Let A and R be commutative rings.
3. THE DEGREE OF THE INVERSE 29
∂ 2f
JH = J (∇f ) = ( )1≤i,j≤n
∂xi∂xj
the so-called Hessian of f , which we denote by Hes(f ). So by Theorem 2.15
the Jacobian Conjecture can be reformulated as follows
Proposition 2.16. The Jacobian Conjecture is equivalent to the follow-
ing statement. For every n ≥ 1 and every homogeneous polynomial f of
degree 4 in C[x] we have: if Hes(f )(= J (∇f )) is nilpotent, then x − ∇f is
invertible.
In his paper [80] Zhao was able to reformulate both the hypothesis
“J (∇f ) is nilpotent” and the statement “x − ∇f is invertible”, of Propo-
sition 2.16 , in terms of the Laplace operator ∆ = ∂12 + . . . + ∂n2 .
First let’s take a look at the nilpotency of the matrix J (∇f ). In general,
if A is an n × n matrix with entries in a field, then A is nilpotent if and
only if T rAm = 0 for all 1 ≤ m ≤ n. Applying this result to the matrix
J (∇f ) = Hes(f ), whose entries belong to the field of rational functions, we
get
Hes(f ) is nilpotent if and only if T rHes(f )m = 0 for all 1 ≤ m ≤ n.
One easily verifies that T rHes(f ) = ∂12(f ) + . . . + ∂n2 (f ) = ∆f . On the
other hand studying the conditions T rHes(f )2 = 0, T rHes(f )3 = 0, etc. is
much more involved. Nevertheless, in [80], Zhao succeeds in expressing these
conditions in terms of the Laplace operator:
Theorem 2.17. (W. Zhao) Let f ∈ C[x]. Then the following statements
are equivalent.
i) Hes(f ) is nilpotent.
ii) ∆m (f m ) = 0 for all 1 ≤ m ≤ n.
iii) ∆m (f m ) = 0 for all m ≥ 1.
So the condition “J (∇f )(= Hes(f )) is nilpotent” is completely under-
stood in terms of the Laplace operator.
Then Zhao continues to study the condition “F = x − ∇f is invertible”.
First he observes that since degf = 4 ≥ 2 it follows that F has a unique
formal inverse G (using the well-known Formal Inverse Function Theorem).
Furthermore JG is symmetric: namely J (F ◦ G) = I, so J F (G)J G = I,
which implies that JG = (JF (G))−1 . Since the inverse matrix of a symmetric
matrix is also symmetric, it follows from the symmetry of J F that also JG
is symmetric.
32 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
and for each i ≥ 1 one has that ∇gi = 0 if and only if gi = 0. So we get
that x + ∇g is a polynomial map if and only if ∇g2m+4 = 0 for all large m,
if and only if g2m+4 = 0 for all large m. So x − ∇f is invertible if and only
if ∆m (f m+1 ) = 0 for all large m. So the condition “x − ∇f is invertible” is
equivalent to the statement
∆m (f m+1 ) = 0 for all large m.
Then, by Proposition 2.16 and Theorem 2.17 the Jacobian Conjecture is
equivalent to
Vanishing Conjecture: (Zhao) For every n ≥ 1 and every homogeneous
polynomial f ∈ C[x] of degree 4, the following statement holds:
if ∆m (f m ) = 0 for all m ≥ 1, then ∆m−1 (f m ) = 0 for all large m.
5. THE DIXMIER CONJECTURE 33
Final remark. The vanishing conjecture has been proved for all n ≤ 5 by
de Bondt and the author in [15], even in case the homogeneous polynomial
f has degree d ≥ 2.
It is not difficult to prove that An (C) is a simple ring, i.e. that it has no
two-sided ideals, except the zero-ideal and the whole ring (see [12]). Since
the kernel of any C-endomorphism ϕ of An (C) is a two-sided ideal in An (C)
and 1 does not belong to such a kernel, it follows that kerϕ = 0. In other
words any C-endomorphism of An (C) is injective!
Now one may wonder if any C-endomorphism of An (C) is also surjective.
The question was posed for the first time, in case n = 1, by Dixmier in his
paper [23]. This question, generalized to all dimensions, became known as
the Dixmier Conjecture.
Dixmier Conjecture. For any n ≥ 1 the following statement holds:
DC(n, C) Every C-endomorphism of An (C) is surjective (and hence it is a
C-automorphism).
What is the relation between the Dixmier Conjecture and the Jacobian Con-
jecture?
First of all, it was observed by Kac and Vaserstein in the eighties of the
last century that the Dixmier Conjecture implies the Jacobian Conjecture.
More precisely
Proposition 2.19. (Kac, Vaserstein) For each n ≥ 1 DC(n, C) implies
JC(n, C).
Proof. Let F := (F1, . . . , Fn ) : Cn → Cn be a polynomial map with
detJF ∈ C∗. So ((JF )−1)T ∈ Mn (C[x]). Hence, if we define
34 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
∂
∂F1 ∂1
.. .
= ((JF ) ) ..
−1 T
.
∂
∂Fn
∂n
∂
then we get derivations ∂F i
on the polynomial ring C[x]. These derivations
have the following properties (see [27], 2.2.3):
∂ ∂ ∂
[ , ] = 0 and [ , Fj ] = δij for all i, j.
∂Fi ∂Fj ∂Fi
So, by defining
∂
ϕ(∂i) = and ϕ(xj ) = Fj
∂Fi
we get a C-endomorphism ϕ of An (C). Since, by assumption DC(n, C) holds,
it follows that ϕ is surjective. In other words
∂ ∂
C[F1, . . . , Fn , ,..., ] = An (C).
∂F1 ∂Fn
P ∂ α
In particular, if g ∈ C[x], then g = aα(F )( ∂F ) , for some aα(F ) ∈ C[F ].
Applying the operator g to the element 1 we get that g = a0 (F ) ∈ C[F ].
So each polynomial of C[x] belongs to C[F ]. In other words C[x] ⊆ C[F ],
which implies that C[x] = C[F ]. So in particular each xi is of the form
Gi (F1, . . . , Fn ), whence F is invertible with inverse G = (G1 , . . . , Gn ).
So the Dixmier Conjecture implies the Jacobian Conjecture. Now the
surprising result, first obtained by Tsuchimoto in [74] and also discussed by
Belov and Kontsevich in [10], is that the converse holds! In other words the
Dixmier Conjecture is just one more, completely unrecognizable, description
of the Jacobian Conjecture! More precisely we have
Theorem 2.20. For each n ≥ 1 we have: JC(2n, C) implies DC(n, C)
and hence the Jacobian Conjecture and the Dixmier Conjecture are equiva-
lent.
The proof which we will give below is due to Adjamagbo and the author.
It was inspired by the paper [10] of Belov and Kontsevich. In our proof we
introduce one more conjecture, the Poisson Conjecture, which also turns out
to be equivalent to the Jacobian Conjecture.
In order to formulate this new conjecture we introduce some notations
and give some definitions. In particular we also define the n−th Weyl algebra
5. THE DIXMIER CONJECTURE 35
that ϕ(ψi) = yi for all i. If ψi has degree ≤ N , for some N , we say that ϕ is
N-surjective.
Now let’s explain how we are going to prove that P (n, C) implies D(n, C).
The idea is to reduce the proof of this implication to a similar statement
where C is replaced by a ring of the form R/pR, using a technique similar to
the one explained in section two. In order to perform this reduction, we will
rewrite both the hypothesis and the claim in terms of polynomial equations.
Let’s first consider the statement “D(n, C)”, in other words the statement
that every C-endomorphism of An (C) is surjective. In fact we will prove the
following more precise statement:
for every d ≥ 1, every C-endomorphism of An (C) of degree ≤ d is D-
surjective, where D = d2n−1 .
To prove this statement assume the contrary. So let ϕ be a C-endomorphism
of An (C), say of degree d, which is not D-surjective. This means that there
do not exist ψ1, . . . , ψ2n of degree ≤ D in An (C) such that
statement P (n, C, d), which asserts that the Poisson Conjecture holds for all
endomorphisms of Pn (C) of degree ≤ d. This means that if F = (F1, . . . , F2n )
satisfies degFi ≤ d for all i and {Fi, Fj } = {xi , xj }, then F has an inverse (of
degree ≤ d2n−1 , according to Proposition 2.8.
To rewrite this statement in terms of polynomial equations we consider
the universal polynomial map of degree d in 2n variables x = (x1, . . . , x2n),
i.e.
FiU := (F1U , . . . , F2n
U
),
P (i) (i)
where each FiU := Aα xα with |α| ≤ d and all Aα are different variables.
(i)
Let A := (. . . , Aα , . . .) and denote by C[A] the polynomial ring in A over
C. Consider the polynomials Pij := {FiU , FjU } − {xi , xj } for all 1 ≤ i <
j ≤ 2n and let J := (g1 (A), . . . , gr (A)) be the ideal in C[A] generated by
the coefficients of all monomials xα appearing in all Pij . Then by Theorem
2.21 the canonical image of F U in (C[A]/J )[x]2n has Jacobian determinant 1
and hence, by the formal inverse function theorem, it has a formal inverse in
(C[A]/J )[[x]]2n, represented by some G(A) in C[A][[x]]2n. Let I be the ideal
in C[A] generated by the coefficients in G(A) of all xα with |α| > D := d2n−1
and let h1 (A), . . . , ht (A) be a system of generators of I.
(i)
Proposition 2.24. If P (n, C, d) holds, there exist bj in C[A] and a
positive integer ρ such that
P (i)
(4) hi (A)ρ = j bj (A)gj (A) for all 1 ≤ i ≤ t.
Proof. Let a ∈ CA be a zero of J . Then, since P (n, C, d) holds, the
map F U (A = a) is invertible and its inverse is equal to G(A = a). So by
Proposition 2.8 degG(A = a) ≤ d2n−1 = D. Hence a is a zero of I. So every
zero of J is a zero of I. Then (4) follows from the Nullstellensatz.
As an immediate consequence of Proposition 2.24 we get
Corollary 2.25. Let R be a subring of C containing the coefficients of
(i)
the polynomials hi (A), bj (A) and gj (A). If a is a proper ideal of R and f
an endomorphism of Pn (R/a) of degree ≤ d, then f has an inverse of degree
≤ D(= d2n−1 ).
Proof. (of P (n, C, d) → D(n, C, d).) So assume P (n, C, d) and assume,
as above, that ϕ is a C-automorphism of An (C) of degree d which is not
surjective. This last assumption gives rise to the equations (3). Furthermore,
the assumption P (n, C, d) gives rise to the equations (4).
Let R be the Z-subalgebra of C, generated by n!1 , all coefficients of the
monomials y α appearing in the ϕ(yi ), all coefficients appearing in the hi , gj
38 2. PERHAPS THE JACOBIAN CONJECTURE IS SIMPLE
(i)
and the bj and all coefficients appearing in all Qj and Pj . Then ϕ is an
R-endomorphism of An (R) and R is a finitely generated Z-algebra without
Z-torsion. By Proposition 2.5 we can choose a prime number p which is not
invertible in R. By reduction modulo pR we get an R/pR-endomorphism ϕ̄
of An (R/pR). By Corollary 2.23 ϕ̄ is not D-surjective. On the other hand
we will show now that the hypothesis that P (n, C, d) holds, implies that ϕ̄
is D-surjective.
To obtain this contradiction we will use the following two theorems. The
first is proved in [5] and [4] and a proof of the second will be given at the
end of this section.
Theorem 2.26. The center of An (R/pR), denoted by Z, is a polynomial
ring in y1p, . . . , y2n
p
over R/pR. Furthermore:
i) ϕ̄(Z) ⊆ Z, so ϕ̄ induces a polynomial map, denoted by ϕ̄pol from the
polynomial ring R/pR[y1p , . . . , y2n
p
] to itself.
ii) If ϕ̄pol is surjective, then ϕ̄ is surjective. More precisely, if ϕ̄pol is N-
surjective for some N ≥ 1, then ϕ̄ is N-surjective.
Theorem 2.27. ϕ̄pol is an endomorphism of Pn (R/pR) (of degre ≤ d).
Now we are able to give
Proof of the surjectivity of ϕ̄. By Theorem 2.27 ϕ̄pol is an endo-
morphism of Pn (R/pR) of degree ≤ d. So it follows from Theorem 2.21, the
hypothesis P (n, C, d) and by reducing the equations (4) in Proposition 2.24
modulo pR, that ϕ̄pol has an inverse of degree ≤ D. It then follows from
Theorem 2.26 ii) that ϕ̄ is D-surjective, which completes the proof.
To prove Theorem 2.27 we give another description of the Poisson bracket
on Pn (R/pR). Put W := R[y1p, . . . , y2n p] and xi := yip for each i.
Lemma 2.28. Let A ∈ W and B ∈ An (R). Then [A, B] ∈ pAn (R).
Proof. Since [ , ] is R-bilinear we may assume that A = (y1p)α1 . . . (y2n p )α2n
and B = y1β1 . . . y2n β2n . Using Leibniz’ rule we may even assume that A = yi p
and B = yj , in which case the result is clear.
Proposition 2.29. Let a, b ∈ R/pR[x1 . . . , x2n ] and A, B ∈ W be such
that a = A(mod pAn(R)) and b = B(mod pAn (R)). Then 1p [A, B] is a well-
defined element of An (R) and
1
{a, b} = [A, B](mod pAn (R)).
p
5. THE DIXMIER CONJECTURE 39
Proof. Since R has no Z-torsion the first statement follows from Lemma
2.28. To prove the formula, observe that both { , } and [ , ] are bilinear,
antisymmetric and satisfy Leibniz’ rule. Therefore it suffices to show that
1
{xi , xj } = [yip, yj p ](mod pAn (R))
p
for all i < j. If j 6= i + n both sides are zero. So assume j = i + n. Then the
result follows from the following formula
p
1 p 1X (p!)2
(∗∗) [yi+n , yip] = y k y k = −1(mod pZ[yi , yi+n ]),
p p (k!)2(p − k)! i i+n
k=0
Hanspeter Kraft
1. Affine varieties
Regular functions. Our base field is the field C of complex numbers.
Every polynomial p ∈ C[x1, . . . , xn ] can be regarded as a C-valued function
on Cn in the usual way:
a = (a1, . . . , an ) 7→ p(a) = p(a1, . . . , an ).
These functions will be called regular. More generally, let V be a C-vector
space of dimension dim V = n < ∞.
Definition 3.1. A C-valued function f : V → C is called regular if f is
given by a polynomial p ∈ C[x1, . . . , xn ] with respect to one and hence all
bases of V . This means that for a given basis v1 , . . . , vn of V we have
f (a1 v1 + · · · + an vn ) = p(a1 , . . . , an )
for a suitable polynomial p. The algebra of regular functions on V will be
denoted by O(V ).
By our definition, every choice of a basis (v1, v2, . . . , vn ) of V defines an
∼
isomorphism C[x1, . . . , xn ] → O(V ) by identifying xi with the ith coordinate
function on V defined by the basis, i.e.,
xi(a1 v1 + a2v2 + · · · + an vn ) := ai .
Another way to express this is by remarking that the linear functions on V
are regular and thus the dual space V ∗ := Hom(V, C) is a subspace of O(V ).
So if (v1, v2, . . . , vn ) is a basis of V and (x1, x2 , . . . , xn ) the dual basis of V ∗
then O(V ) = C[x1, x2, . . . , xn ] and the linear functions xi are algebraically
independent.
Example 3.1. Denote by Mn = Mn (C) the complex n × n-matrices so
that O(Mn ) = C[xij | 1 ≤ i, j ≤ n]. Consider det(tEn − X) as a polynomial
43
44 3. BASICS FROM ALGEBRAIC GEOMETRY
Q
Define D̃ := i<j (yi −yj )2. Since D̃ is symmetric it can be (uniquely) written
as a polynomial in the elementary symmetric functions (see [6, Chap. 14,
Theorem 3.4]): D̃(y1, . . . , yn ) = D(σ1 , . . . , σn ) with a suitable polynomial D.
By construction, D has the required property.
Zero sets and Zariski topology. We now define the basic object of
algebraic geometry, namely the zero set of regular functions. Let V be a finite
dimensional vector space.
Definition 3.2. If f ∈ O(V ) then we define the zero set of f by
V(f ) := {v ∈ V | f (v) = 0} = f −1 (0).
More generally, the zero set of f1 , f2 , . . . , fs ∈ O(V ) or of a subset S ⊆ O(V )
is defined by
\
s
V(f1 , f2 , . . . , fs ) := V(fi ) = {v ∈ V | f1(v) = · · · = fs (v) = 0}
i=1
or
V(S) := {v ∈ V | f (v) = 0 for all f ∈ S}.
46 3. BASICS FROM ALGEBRAIC GEOMETRY
is linear in the variables which occur. But if xi0 j0 occurs in f1 then all the
variables xij0 and xi0 j have to occur in f1 , too, since det −1 does not contain
products of the form xij0 xi0 j0 and xi0 j xi0 j 0 . This implies that all variables
occur in f1 , hence f2 is a constant.)
Example 3.8. Consider the plane curve C := V(y 2 − x3 ) which is called
∼
Neil’s parabola. Then O(C) ' C[x, y]/(y 2 − x3 ) → C[t2, t3] ⊆ C[t] where the
second isomorphism is given by ρ : x 7→ t3, y 7→ t2.
Proof. Clearly, y 2 − x3 ∈ ker ρ. For any f ∈ C[x, y] we can write f =
f0 (x) + f1 (x)y + h(x, y)(y 2 − x3). If f ∈ ker ρ then 0 = ρ(f ) = f0 (t2) + f1 (t2 )t3
and so f0 = f1 = 0. This shows that ker ρ = (y 2 − x3 ), and the claim
follows.
Exercise 10. Let C ⊆ C2 be the plane curve defined by y − x2 = 0. Then
I(C) = (y − x2 ) and O(C) is a polynomial ring in one variable.
Exercise 11. Let D ⊆ C2 be the zero set of xy − 1. Then O(D) is not
∼
isomorphic to a polynomial ring, but there is an isomorphism O(D) → C[t, t−1 ].
Exercise 12. Consider the “parametric curve”
Y := {(t, t2, t3) ∈ C3 | t ∈ C}.
Then Y is closed in C3 . Find generators for the ideal I(Y ) and show that O(Y ) is
isomorphic to the polynomial ring in one variable.
Another important consequence of the “Nullstellensatz” is an one-to-one
correspondence between closed subsets of Cn and perfect ideals of the coor-
dinate ring C[x1, . . . , xn ].
Corollary 3.15. The map X 7→ I(X) defines an inclusion-reversing
bijection
∼
{X ⊆ V closed} → {a ⊆ O(V ) perfect ideal}
whose inverse map is given by a 7→ V(a). Moreover, for any finitely generated
reduced C-algebra R there is a closed subset X ⊆ Cn for some n such that
O(X) is isomorphic to R.
√
Proof. The first part is clear since V(I(X)) = X and I(V(a)) = a for
any closed subset X ⊆ V and any ideal a ⊆ O(V ).
If R is a reduced and finitely generated C-Algebra, R = C[f1, . . . , fn ],
then R ' C[x1, x2 , . . . , xn ]/a where a is the kernel
√ of the homomorphism
defined by xi 7→ fi . Since R is reduced we have a = a and so O(V(a)) '
C[x1, . . . , xn ]/a ' R.
1. AFFINE VARIETIES 51
Affine varieties. We have seen in the previous section that every closed
subset X ⊆ V (or X ⊆ Cn ) is equipped with an algebra of C-valued functions,
namely the coordinate ring O(X). We first remark that O(X) determines the
topology of X. In fact, define for every ideal a ⊆ O(X) the zero set in X by
VX (a) := {x ∈ X | f (x) = 0 for all f ∈ a}.
Clearly, we have VX (a) = V(ã) ∩ X where ã ⊆ O(V ) is an ideal which maps
surjectively onto a under the restriction map. This shows that the sets VX (a)
are the closed sets of the topology on X induced by the Zariski topology of
V . Moreover, the coordinate ring O(X) also determines the points of X since
they are in one-to-one correspondence with the maximal ideals of O(X):
x ∈ X 7→ mx := ker evx ⊆ O(X)
where evx : O(X) → C is the evaluation map f 7→ f (x). This leads to the
following definition of an “abstract” zero set, not embedded in a vector space.
Definition 3.16. A set Z together with a C-algebra O(Z) of C-valued
functions on Z is called an affine variety if there is a closed subset X ⊆ Cn
∼
for some n and a bijection ϕ : Z → X which identifies O(X) with O(Z), i.e.,
ϕ∗ : O(X) → O(Z) given by f 7→ f ◦ ϕ, is an isomorphism.
The functions from O(Z) are called regular, and the algebra O(Z) is called
coordinate ring of Z or algebra of regular functions on Z.
The affine variety Z has a natural topology, the Zariski topology, the
closed sets being the zero sets
VZ (a) := {z ∈ Z | f (z) = 0 for all f ∈ a}
where a runs through the ideals of O(Z). It follows from what we said above
∼
that the bijection ρ : Z → X is a homeomorphism with respect to the Zariski
topology.
Clearly, every closed subset X ⊆ V or X ⊆ Cn together with its co-
ordinate ring O(X) is an affine variety. More generally, if X is an affine
52 3. BASICS FROM ALGEBRAIC GEOMETRY
For the last example we start with a reduced and finitely generated C-
algebra R. Denote by Spec R the set of maximal ideals of R:
Spec R := {m | m ⊆ R a maximal ideal}.
We know from the “Nullstellensatz” (see Exercise 8) that R/m = C for all
maximal ideals m ∈ Spec R. This allows to identify the elements from R with
C-valued functions on Spec R: For f ∈ R and m ∈ Spec R we define
f (m) := f + m ∈ R/m = C.
Proposition 3.17. Let R be a reduced and finitely generated C-algebra.
Then the set of maximal ideals Spec R together with the algebra R considered
as functions on Spec R is an affine variety.
Proof. We have already seen earlier that every such algebra R is iso-
morphic to the coordinate ring of a closed subset X ⊆ Cn . The claim then
∼
follows by using the bijection X → Spec O(X), x 7→ mx = ker evx , and re-
marking that for f ∈ O(X) and x ∈ X we have f (x) = evx (f ) = f + mx , by
definition.
Exercise 18. Denote by Cn the n-tuples of complex numbers up to sign, i.e.,
Cn := Cn / ∼ where (a1, . . . , an) ∼ (b1, . . . , bn) if ai = ±bi for all i. Then every
polynomial in C[x21, x22, . . ., x2n ] is a well-defined function on Cn . Show that Cn
together with these functions is an affine variety.
(Hint: Consider the map Φ : Cn → Cn , (a1, . . . , an ) 7→ (a21, . . . , a2n ) and proceed
like in Example 3.10.)
Although every affine variety is “isomorphic” to a closed subset of Cn
for a suitable n, there are many advantages to look at these objects and
not only at closed subsets. In fact, an affine variety can be identified with
many different closed subsets of this form (see the following Exercise 19),
and depending on the questions we are asking, one of them might be more
useful than another. We will even see in the following section that certain
open subsets are affine varieties in a natural way.
On the other hand, whenever we want to prove some statements for an
affine variety X we can always assume that X = V(a) ⊆ Cn so that the
regular functions on X appear as restrictions of polynomial functions. This
will be helpful in the future and quite often simplify the arguments.
Exercise 19. Let X be an affine variety. Show that every choice of a generating
system f1 , f2, . . . , fn ∈ O(X) of the algebra O(X) consisting of n elements defines
an identification of X with a closed subset V(a) ⊆ Cn .
(Hint: Consider the map X → Cn given by x 7→ (f1(x), f2(x), . . ., fn (x)).)
54 3. BASICS FROM ALGEBRAIC GEOMETRY
This proves the first claim. For the second, we have to show that the canonical
homomorphism O(X)[t]/(f ·t−1) → O(X̃ ) is an isomorphism. In other words,
1. AFFINE VARIETIES 55
P
every function h = m i
i=0 hi t ∈ O(X)[t] which vanishes
P on X̃ is divisible by
m−i
f · t − 1. Since f |X̃ is invertible we first obtain i hi f = 0 which implies
X
m X
m−1
m m−i
h =h−t hi f = hi ti (1 − f m−i tm−i ),
i=0 i=0
is not defined in (0, 0). The interesting point here is that f has a continuous
extension to all of C with value 0 at (0, 0), even in the C-topology.
∼
Proof. There is an isomorphism O(C) → C[t2, t3 ] (see Example 3.8)
which maps x̄ to t2 and ȳ to t3, and so f = x̄ȳ is mapped to t. Since t ∈
/ C[t2, t3]
the first claim follows from Exercise 25(2) above. The second part is easy
because the map C → C : t 7→ (t2, t3) is a homeomorphism even in the C-
topology.
Assume that X is irreducible and let x ∈ X. Define
OX,x := {f ∈ C(X) | f is defined in x}.
It is easy to see that OX,x is the localization of O(X) at the maximal ideal mx .
(For the definition of the localization of a ring at a prime ideal and, more
generally, for the construction of rings of fractions we refer to [26, I.2.1].)
This example motivates the following definition.
Definition 3.26. Let X be an affine variety and x ∈ X an arbitrary
point. Then the localization O(X)mx of the coordinate ring O(X) at the
maximal ideal in x is called the local ring of X at x. It will be denoted by
OX,x , its unique maximal ideal by mX,x .
We will see later that the local ring of X at x completely determines X
in a neighborhood of x.
T
Exercise 26. If X is irreducible, then O(X) = x∈X OX,x .
Exercise 27. Let X be an affine variety, x ∈ X a point and X 0 ⊆ X the
union of irreducible components of X passing through x. Then the restriction map
∼
induces a natural isomorphism OX,x → OX 0 ,x .
Exercise 28. Let R be an algebra and µ : R → RS the canonical map r 7→ 1r
where RS is the localization at a multiplicatively closed subset 1 ∈ S ⊆ R (0 ∈
/ S).
60 3. BASICS FROM ALGEBRAIC GEOMETRY
2. Morphisms
Morphisms and comorphisms. In the previous sections we have de-
fined and discussed the main objects of algebraic geometry, the affine vari-
eties. Now we have to introduce the “regular maps” between affine varieties
which should be compatible with the concept of regular functions.
Definition 3.27. Let X, Y be affine varieties. A map ϕ : X → Y is called
regular or a morphism if the pull-back of a regular function on Y is regular
on X:
f ◦ ϕ ∈ O(X) for all f ∈ O(Y ).
Thus we obtain a homomorphism ϕ∗ : O(Y ) → O(X) of C-algebras given by
ϕ∗ (f ) := f ◦ ϕ, which is called comorphism of ϕ.
A morphism ϕ is called an isomorphism if ϕ is bijective and the inverse
map ϕ−1 is also a morphism. If, in addition, Y = X then ϕ is called an
automorphism.
Exercise 31. Let g ∈ GLn be an invertible matrix. Then left multiplication
A 7→ gA, right multiplication A 7→ Ag and conjugation A 7→ gAg −1 are automor-
phisms of Mn .
2. MORPHISMS 61
Exercise 33.
(1) Every morphism C → C∗ is constant.
(2) Describe all morphisms C∗ → C∗ .
(3) Every non-constant morphism C → C is surjective.
(4) An injective morphism C → C is an isomorphism, and the same holds for
morphisms C∗ → C∗.
Exercise 34. The graph of a morphism. Let ϕ : Cn → Cm be a morphism
and define
Γϕ := {(a, ϕ(a)) ∈ Cn+m }.
which is called the graph of the morphism ϕ. Show that Γϕ is closed in Cn+m , that
∼
the projection prCn : Cn+m → Cn induces an isomorphism p : Γϕ → Cn and that
ϕ = prCm ◦p .−1
∼
d. If d = 1, i.e. if ϕ∗ induces an isomorphism C(Y ) → C(X) then ϕ is called
a birational morphism.
Exercise 45. Let ϕ : C → C be a non-constant morphism. Then ϕ has finite
degree d, and there is a non-empty open set U ⊆ C such that #ϕ−1 (x) = d for all
x ∈ U.
There is a similar result as the second part of Proposition 3.33 saying
that affine varieties with isomorphic function fields are locally isomorphic.
The proof is similar as the proof above and will be left to the reader.
Proposition 3.35. Let X, Y be irreducible affine varieties and let
∼
ρ : C(Y ) → C(X) be an isomorphism. Then there exist special open sets
∼
X 0 ⊆ X and Y 0 ⊆ Y and an isomorphism ψ : X 0 → Y 0 such that ρ = ψ ∗ .
Products. If f is a function on X and h a function on Y then we denote
by f ·h the C-valued function on the product X ×Y defined by (f ·h)(x, y) :=
f (x) · h(y).
Proposition 3.36. The product X × Y of two affine varieties together
with the algebra
O(X × Y ) := C[f · h | f ∈ O(X), h ∈ O(Y )]
of C-valued functions is an affine variety. Moreover, the canonical homomor-
phism O(X) ⊗ O(Y ) → O(X × Y ), f ⊗ h 7→ f · h, is an isomorphism.
Proof. Let X ⊆ Cn and Y ⊆ Cm be closed subvarieties. Then X × Y ⊆
n+m
C is closed, namely equal to the zero set V(I(X) ∪ I(Y )). So it remains
to show that O(X × Y ) = C[x1, . . . , xn , y1, . . . , ym ]/I(X × Y ) is generated by
the products f · h and that f · h ∈ O(X × Y ) for f ∈ O(X) and h ∈ O(Y ).
But this is clear since x̄i = xi |X×Y = xi |X · 1 and ȳj = yj |X×Y = 1 · yj |Y , and
f |X · h|Y = (fh)|X×Y for f ∈ C[x1, . . . , xn ] and h ∈ C[y1, . . . , ym ].
For the last claim, we only have to show that the map O(X) ⊗ O(Y ) →
O(X × Y ), f ⊗ h 7→ f · h, is injective. For this, let (fi | i ∈ I) be a basis
of O(YP). Then every element s ∈ O(X) ⊗ O(Y ) can be uniquely P written as
s = finite si ⊗ fi . If s is in the kernel of the map, then P si (x)fi (y) = 0
for all (x, y) ∈ X × Y and so, for every fixed x ∈ X, si (x)fi is the zero
function on Y . This implies that si (x) = 0 for all x ∈ X and so si = 0 for all
i. Thus s = 0 proving the claim.
Example 3.20.
(1) The two projections prX : X × Y → X, (x, y) 7→ x, and prY : X ×
Y → Y , (x, y) 7→ y, are morphisms with comorphisms pr∗X (f ) = f · 1
and pr∗Y (h) = 1 · h.
68 3. BASICS FROM ALGEBRAIC GEOMETRY
3. Dimension
Definitions. If k is a field and A a k-algebra then a set a1, a2, . . . , an ∈ A
of elements from A are called algebraically independent over k if they do
not satisfy a non-trivial polynomial equation F (a1, a2, . . . , an ) = 0 where
F ∈ k[x1, . . . , xn ]. Equivalently, the canonical homomorphism of k-algebras
k[x1, . . . , xn ] → A defined by xi 7→ ai is injective.
In order to define the dimension of a variety we will need the concept of
transcendence degree tdegk K of a field extension K/k. It is defined to be the
maximal number of algebraically independent elements in K. We refer to [6,
Chap. 13, Sect. 8] for the basic properties of transcendental extensions.
Definition 3.40. Let X be an irreducible affine variety and C(X) its
field of rational functions. Then the dimension of X is defined by
dim X := tdegC C(X).
S
If X is reducible and X = Xi the irreducible decomposition (see 1) then
dim X := max dim Xi .
i
S
Finally, we define the local dimension of X in a point x ∈ X = Xi to be
dimx X := max dim Xi .
Xi 3x
Examples 3.21.
(1) We have dim Cn = n. More generally, if V is a complex vector space
of dimension n then dim X = n.
(2) If U ⊆ X is a special open subset which is dense in X, then dim U =
dim X.
(3) For affine varieties X, Y we have dim X × Y = dim X + dim Y .
Lemma 3.41. Let f ∈ C[x1, . . . , xn ] be a non-constant polynomial and
X := V(f ) ⊆ Cn its zero set. Then dim X = n − 1.
70 3. BASICS FROM ALGEBRAIC GEOMETRY
Proof. If y ∈ Y then ϕ−1 (y) = VX (ϕ∗ (my )) (see 2). If ϕ−1 (y) 6= ∅ then
the induced morphism ϕ−1 (y) → {y} is finite, too, and so O(ϕ−1 (y)) is a
finite dimensional C-algebra. Thus, the fiber ϕ−1 (y) is finite (Example 3.22)
proving the second claim.
For the first claim it suffices to show that ϕ(X) = ϕ(X). Hence we
can assume that ϕ(X) = Y , i.e. that ϕ∗ : O(Y ) → O(X) is injective. If
ϕ−1 (y) = ∅ then O(X)my = O(X). (We identify my with its image ϕ∗ (my ).)
The Lemma of Nakayama (see the following Lemma 3.46) now implies that
(1+a)O(X) = 0 for some a ∈ my which is a contradiction since 1+a 6= 0.
Lemma 3.46 (Lemma of Nakayama). Let R be a ring, a ⊆ R an ideal
and M a finitely generated R-module. If aM = M then there is an element
a ∈ a such that (1 + a)M = 0. In particular, if M is torsionfree and a 6= R
then M = 0.
P P
Proof. Let M = kj=1 Rmj . Then mi = j aij mj for all i where aij ∈ a.
If A denotes the k × k-matrix (aij )i,j and m the column vector (m1, . . . , mk )t
this means that m = A · m. Thus (E − A)m = 0, and so det(E − A)mj = 0
for all j. But
1 − a11 −a12 · · ·
det(E − A) = det −a21 1 − a22 · · · = 1 + a where a ∈ a.
.. ..
. .
and the claim follows.
Exercise 51. Let X be an affine variety and x ∈ X. Assume that f1 , . . . , fr ∈
mx generate the ideal mx modulo m2x, i.e., mx = (f1, . . . , fr ) + m2x . Then {x} is
an irreducible component of VX (f1 , . . . , fr ). (Hint: If C ⊆ VX (f1, . . . , fr ) is an
irreducible component containing x and m ⊆ O(C) the maximal ideal of x then
m2 = m. Hence m = 0 by the Lemma of Nakayama above.)
Exercise 52. Let ϕ : X → Y be a finite surjective morphism. Then dim X =
dim Y .
S
Exercise 53. Let X be an affine variety and X = i Xi the irreducible de-
composition. A morphism ϕ : X → Y is finite if and only if ϕ|Xi : Xi → Y is finite
for all i.
The following easy lemma will be very useful later on .
Lemma 3.47. Let A ⊆ B be rings and b ∈ B. Assume that b satisfies an
equation of the form
bm + a1bm−1 + a2bm−2 + · · · + am = 0
where a1, a2, . . . , am ∈ A. Then the subring A[b] ⊆ B is finite over A.
3. DIMENSION 73
where xm occurs in H with anP exponent < r. Since K is infinite we can find
r1 rm−1
γ1 , . . . , γm−1 ∈ K such that r1 +···+rm =r αr1 ,...,rm γ1 · · · γm−1 6= 0. Setting
b0j := bj − γj bm for j = 1, . . . , m − 1 we have A = K[b01, b02 , . . . , b0m−1 , bm ].
Now equation (3.4) implies that bm satisfies an equation of the form (3.3),
hence A is finite over K[b01, . . . , b0m−1 ] by Lemma 3.47, and the claim follows
by induction.
Remark 3.49. The proof above shows the following. If A = K[b P 1, . . . , bm]
then there are a number n ≤ m and n linear combinations ai := j γij bj ∈ A
such that a1, . . . , an are algebraically independent over K and that A is finite
over K[a1, . . . , am].
A first consequence is the following result.
Proposition 3.50. Let X be an affine variety of dimension n. Then
there is a finite surjective morphism ϕ : X → Cn .
74 3. BASICS FROM ALGEBRAIC GEOMETRY
Lemma 3.53. Let A be a C-algebra without zero divisors and K its field
of fractions. Let a1, . . . , an ∈ A be algebraically independent elements such
that A is finite over C[a1, . . . , an ] and denote by N : K → C(a1, . . . , an ) the
norm. Then
(1) N (A) ⊆ C[a1, . . . , an ]; p p
(2) For all a ∈ A we have Aa ∩ C[a1, . . . , an ] = C[a1, . . . , an ]N (a).
Proof. Let L/K be a finite field extension containing all the conjugates
a(1) := a, a(2), . . . , a(r) of a where r = [K : C(a1 , . . . , an )]. Since a belongs to
an algebra which is finite over C[a1, . . . , an ], namely A, the same holds for all
a(j). Thus, every a(j) satisfies an equation with coefficients in C[a1, . . . , an ]
and leading coefficient 1 (Lemma 3.52). This implies by Lemma 3.47 that the
subalgebra à := C[a1, . . . , an ][a(j)] ⊆ L is finite over C[a1, . . . , an ] and con-
tains all a(j). Therefore, N (a) = a(1)a(2) · · · a(r) belongs to à ∩ C(a1 , . . . , an )
which is equal to C[a1, . . . , an ] by the exercise above. This proves the first
claim.
Now we have
Y
(t − a(j) ) = tr + h1 tr−1 + · · · + hr−1 t + hr
j
Since dim Yj = dim X −1 for all j we see, by induction, that every irreducible
component of VYj (f2 , . . . , fr ) has dimension ≥ (dim X −1)−(r−1) = dim X −
r, and the claim follows.
Exercise 58. Let X be an affine variety and f ∈ O(X) a non-zero divisor.
For any x ∈ VX (f ) we have dimx VX (f ) = dimx X − 1. (Hint: If f is a non-zero
divisor, then f is non-zero on every irreducible component Xi of X and so VXi (f )
is either empty or every irreducible component has codimension 1. Now the claim
follows easily.)
3. DIMENSION 77
(1) ϕ : M2 → M2 , ϕ(A) 2
:= A .
a b
(2) ϕ : SL2 → C3 , ϕ( ) := (ab, ad, cd) (see Exercise 43).
c d
What is the degree of the finite morphism ρ in each case?
In particular,
∂fi
dim Tx0 X = n − rk (x0 ) .
∂xj i,j
∂fi
The matrix (x0) is called the Jacobian matrix at the point x0 and
∂xj i,j
will be denoted by Jac(f1, . . . , fs )(x0).
Example 3.26. Consider the plane curve C = V(y 2 − x3) ⊆ C2 . Then
I(C) = (y 2 − x3) and so the tangent space in an arbitrary point x0 = (a, b) ∈
C is given by T(a,b)C = {(u, v) ∈ C2 | −3a2 u + 2bv = 0}. Since (a, b) = (t2, t3)
for some t ∈ C we get
2
C " # for t = 0,
T(t2 ,t3 )C = 2
C 3t for t 6= 0.
Exercise 66. Calculate the tangent spaces of the plane curves C1 := V(y −x2 )
and C2 = V(y 2 − x2 − x3) in arbitrary points (a, b).
Remark 3.72. Consider the C-algebra C[ε] := C[t]/(t2) called the alge-
bra of dual numbers. By definition, we have C[ε] = C ⊕ Cε and ε2 = 0. If
X is an affine variety and ρ : O(X) → C[ε] an algebra homomorphism, then
ρ is of the form ρ = evx ⊕δx ε for some x ∈ X where evx is the evaluation
map at x and δx a derivation in x, i.e., ρ(f ) = f (x) + δx(f )ε. Conversely, if
δx is a derivation in x then ρ := evx ⊕δxε is an algebra homomorphism. If
X = V is a vector space, then the homomorphisms ρ : O(V ) → C[ε] are in
one-to-one correspondence with the elements of V ⊕ V ε. In fact, there are
canonical bijections
∼ ∼
AlgC (O(V ), C[ε]) → Hom(V, C[ε]) → V ⊕ V ε,
4. TANGENT SPACES AND DIFFERENTIALS 85
and so
∂f ∂f ∂f
Hsing = V(f, , ,··· , ) ⊆ H.
∂x1 ∂x2 ∂xn
It follows that Hsing is a proper closed subset whose complement is dense.
∂f
(This is clear for irreducible hypersurfaces since a non-zero derivative ∂x i
∂f ∂f
cannot be a multiple of f and so V(f, ∂x 1
, · · · , ∂xn
) is a proper closed sub-
set of V(f ). This implies that every irreducible component of H contains
a non-empty open set of nonsingular points which does not meet the other
components, and the claim follows.)
It is also interesting to remark that a common point of two or more
irreducible components of H is always singular. We will see that this is true
in general (Corollary 3.80).
Proposition 3.74. Let X be an irreducible affine variety. Then the set
Xsing of singular points of X is a proper closed subset of X whose complement
is dense.
Proof. We can assume that X is an irreducible closed subvariety of Cn
of dimension d. If I(X) = (f1 , . . . , fs ), then, by Proposition 3.71,
∂fj
Xsing = {x ∈ X | rk (x) < n − d}
∂xi
which is the closed subset defined by the vanishing of all (n − d) × (n − d)
minors of the Jacobian matrix Jac(f1 , . . . , fs ). In order to see that Xsing has
a dense complement, we use the fact, that every irreducible variety contains
a special open set which is isomorphic to a special open set of an irreducible
hypersurface H (see Lemma 3.43). Since H contains a dense open set of
non-singular points (see Example 3.28 above) the claim follows.
Exercise 68. If X is an affine variety such that all irreducible components
have the same dimension, then Xsing is closed and has a dense complement.
(We will see later in Corollary 3.80 that this holds for every affine variety.)
Exercise 69. The hypersurface H ⊆ C3 from Exercise 43 is nonsingular.
Exercise 70. Let q ∈ C[x1, . . . , xn ] be a quadratic form and Q := V(q) ⊆ Cn .
Then 0 is a singular point of Q. It is the only singular point if and only if q is
nondegenerate.
4. TANGENT SPACES AND DIFFERENTIALS 87
∼
(2) There is a canonical isomorphism R̃/R̃t → grm R.
∼ g
(3) If a ⊆ m is an ideal and ã := a[t, t−1] ∩ R̃ then R̃/ã → R/a.
(4) If n ⊆ R is the nilradical, then ñ := n[t, t−1] ∩ R̃ is the nilradical of
∼ g
R̃, and R̃/ñ → R/n.
(5) Assume that R is a finitely generated domain. Then R̃ is a domain,
and we have
dim R̃ = dim R + 1 and dim R̃/R̃t = dim R.
(6) Assume that R is finitely generated and that the minimal primes
p1 , . . . , pr are all contained in m. Then the p̃1, . . . , p̃r are the minimal
primes of R̃.
Proof. (1) If R = C[h1, · · · , hm ] and m = (f1 , . . . , fn ) then
R̃ = C[h1, . . . , hm , t, f1t−1 , . . . , fn t−1 ],
and so R̃ is finitely generated.
(2) By definition, we have
R̃t = · · · ⊕ m3 t−2 ⊕ m2 t−1 ⊕ m ⊕ Rt ⊕ Rt2 ⊕ · · · .
Hence
R̃/R̃t = · · · ⊕ (m2 /m3 )t−2 ⊕ (m/m2 )t−1 ⊕ R/m
and the claim follows.
(3) The canonical map π : R[t, t−1] → (R/a)[t, t−1 ] induces, by our con-
struction, a surjective homomorphism π̃ : R̃ → R/a g with kernel ker π ∩ R̃ =
−1
a[t, t ] ∩ R̃.
(4) Put Rred := R/n. Then Rred [t, t−1] is reduced, i.e. without nilpotent el-
g
ements 6= 0, and so is R −1 −1
red. Since the kernel of the map R[t, t ] → Rred [t, t ]
−1
is equal to n[t, t ] and consists of nilpotent elements the claim follows from
(3).
(5) The first part is clear since R[t, t−1] is a domain. Since R̃t = R[t, t−1]
we get dim R̃ = dim R[t, t−1 ] = dim R[t] = dim R + 1. Moreover, by the
Principal Ideal Theorem (Theorem 3.54) we have dim R̃/R̃t = dim R̃ − 1.
T
(6) It follows from (3) and (5) that the ideals p̃i are prime. Since i pi = n
we obtain from (2)
\ \
p̃i = pi [t, t−1] ∩ R̃ = n[t, t−1] ∩ R̃ = ñ.
i i
4. TANGENT SPACES AND DIFFERENTIALS 89
Since p̃i ∩ R[t] = pi [t] there are no inclusions p̃i ⊆ p̃j for i 6= j, and the
claim follows. (We use here the well-known fact that the minimal T primes in
a finitely generated C-algebra are characterized by the condition pi = n,
cf. Remark 3.25.)
In the next lemma we give some properties of the associated graded al-
gebra grm R where m is a maximal ideal of R.
Lemma 3.77. Let R be a C-algebra and m ⊆ R a maximal ideal.
T
(1) Assume that j mj = (0). If grm R is a domain, then so is R.
(2) Denote by mRm ⊆ Rm the maximal ideal of the localization Rm .
∼
There is a natural isomorphism grm R → grmRm Rm of graded C-
algebras.
Proof. (1) If ab = 0 for non-zero elements a, b ∈ R, we can find i, j ≥ 0
such that a ∈ mi \mi+1 and b ∈ mj \mj+1 . Thus ā := a+mi+1 and b̄ := b+mj+1
are non-zero elements in grm A, and āb̄ = ab + mi+j+1 = 0. This contradiction
proves the claim.
(2) Set M := mRm ⊆ Rm . The image of S := R \ m in R/mk consists of
invertible elements and so R/mk → Rm /Mk is surjective. It is also injective,
because Rm /Mk can be identified with the localization of R/mk at S. Thus
∼ ∼
R/mk → Rm /Mk and so mi /mi+1 → Mi /Mi+1 for all i ≥ 0.
Finally, we need the following result due to Krull. It implies
T that in a
local Noetherian C-algebra R with maximal ideal m we have j≥0 mj = (0).
Lemma T 3.78 (Krull). Let R be a Noetherian C-algebra, a ⊆ R an ideal
and b := j≥0 aj . Then ab = b. In particular, there is an a ∈ a such that
(1 + a)b = 0.
Proof. The second claim follows from the first and the lemma of
Nakayama (Lemma 3.46). Let a = (a1, . . . , as ) and put I := hf | f ∈
R[x1 , . . . , xs ] homogeneous and f (a1, . . . , as ) ∈ bi. Note that I ⊆ R[x1 , . . . , xs ].
It is easy to see that I is an ideal of R[x1, . . . , xs ] and so I = (f1, . . . , fk )
where the fj are homogeneous. Choose an n ∈ N, n > deg fj for all j. By
definition, b ⊆ an and so, for every b ∈ b, there is a homogeneous polyno-
mial f ∈ R[x P 1 , · · · , xs ] of degree n such that f (a1, . . . , as ) = b. It follows
that f = j hj fj Pwhere the hj are homogeneous of degree > 0, and so
b = f (a1 , . . . , as ) = j hj (a1 , . . . , as )fj (a1, . . . , as ) ∈ ab.
The next proposition is a reformulation of our main Theorem 3.75. For
later use we will prove it in this slightly more general form.
90 3. BASICS FROM ALGEBRAIC GEOMETRY
Let us denote by ÔX,x the mx -adic completion of the local ring OX,x . It
is defined to be the inverse limit
ÔX,x := lim O(X)/mkx .
←−
(We refer to [26, I.7.1 and I.7.2] for more details and some basic properties.)
T
Since mkx = {0} we have a natural embedding OX,x ⊆ ÔX,x.
If X = Cn and x = 0 then the completion coincides with the algebra of
formal power series in n variables:
ÔCn ,0 = C[[x1, . . . , xn ]].
The next result is an easy consequence of Theorem 3.75 above.
Corollary 3.81. The point x ∈ X is non-singular if and only if ÔX,x
is isomorphic to the algebra of formal power series in dimx X variables.
Remark 3.82. A famous result of Auslander-Buchsbaum states that the
local ring OX,x in a nonsingular point of a variety X is always a unique
factorization domain. For a proof we refer to [55, §20, Theorem 20.3].
Vector fields S and tangent bundle. Let X be an affine variety. De-
note by T X := x∈X TxX the disjoint union of the tangent spaces and by
p : T X → X the natural projection, δ ∈ TxX 7→ x. We call T X the tangent
bundle of X. We will see later that T X has a natural structure of an affine
variety and that p is a morphism.
A section ξ : X → T X of p, i.e. p ◦ ξ = IdX or ξx := ξ(x) ∈ Tx X for all
x ∈ X, is a collection (ξx )x∈X of tangent vectors and thus can be considered
as an operator on regular functions f ∈ O(X):
ξf(x) := ξx f for x ∈ X.
Definition 3.83. An (algebraic) vector field on X is a section ξ : X →
T X with the property that ξf ∈ O(X) for all f ∈ O(X). The space of
algebraic vector fields is denoted by Vec(X).
(In the following, we will mostly talk about “vector fields” and omit the term
“algebraic” whenever it is clear from the context.)
Thus a vector field ξ can be considered as a linear map ξ : O(X) → O(X),
and so Vec(X) is a vector subspace of End(O(X)). More generally, the vector
fields form a module over O(X) where the product fξ for f ∈ O(X) is defined
in the obvious way: (f ξ)x := f (x)ξx .
92 3. BASICS FROM ALGEBRAIC GEOMETRY
which implies that this vector field is indeed algebraic. We claim that every
algebraic vector field on V is of this form. In fact, if V = Cn then
M
n
∂
Vec(Cn ) = C[x1, . . . , xn ]
i=1
∂xi
which shows that this is a closed subspace of X × Cn . Now (2) follows easily.
(3) Using the identification of T X with the closed
P subvariety T 0X above,
∂
an arbitrary section ξ : X → T X has the form ξx = hi (x) ∂x i
with arbitrary
functions hi on X. The vector field ξ is algebraic if and only if hi = ξx̄i is
regular on X which is equivalent to the condition that ξ : X → T X is a
morphism.
Example 3.30. Consider the curve H := V(xy − 1) ⊆ C2 . Then I(H) =
(xy − 1). For a vector field ξ = a(x, y)∂x + b(x, y)∂y on C2 we get
ξ(xy − 1) = a(x, y)y + b(x, y)x.
Thus ξ(xy −1)|H = 0 if and only if ay +bx = 0 on H. It follows that x∂x −y∂y
defines a vector field ξ0 on H and that Vec(H) = O(C)ξ0 . (In fact, setting
h := ay|H = −bx|H we get a|H = h · x|H and b|H = −h · y|H .)
The tangent bundle T H ⊆ H × C2 has the following description (see
Proposition 3.85(1)):
TH
= {(t, t−1, α, β) | αt−1 + βt = 0}
= {(t, t−1, −βt2, β | t ∈ C∗ , β ∈ C}
∼
→ H × C.
Example 3.31. Now consider Neil’s parabola C := V(y 2 − x3) ⊆ C2 (see
Example 3.8). Then a vector field a∂x + b∂y defines a vector field on C if and
only if
−3ax2 + 2by = 0 on C.
94 3. BASICS FROM ALGEBRAIC GEOMETRY
∼
To find the solutions we use the isomorphism O(C) → C[t2, t3], x 7→ t2, y 7→
t3 (see Example 3.19). Thus we have to solve the equation 3āt = 2b̄ in
C[t2, t3]. This is easy: Every solution is a linear combination (with coefficients
in C[t2, t3]) of the two solutions (2t2 , 3t3) and (2t3, 3t4 ). This shows that
ξ0 := (2x∂x + 3y∂y )|D and ξ1 := (2y∂x + 3x2∂y )|D
are vector fields on C and that Vec(C) = O(C)ξ0 + O(C)ξ1 . Moreover, x̄2ξ0 =
ȳξ1 .
Our calculation also shows that every vector field on C vanishes in the
singular point 0 of the curve. For the tangent bundle we get
T C = {(t2, t3, α, β) | −3αt4 + 2βt3 = 0} ⊆ C × C2
which has two irreducible components, namely
T C = {(t2, t3, 2α, 3αt) | t, α ∈ C} ∪ {(0, 0)} × C2
Exercise 74. Determine the vector fields on the curve D := V(y 2 − x2 − x3 ) ⊆
C2 . Do they all vanish in the singular point of D?
Exercise 75. Determine the vector fields on the curves D1 := {(t, t2, t3) ∈
C | t ∈ C} and D2 := {(t3 , t4, t5) ∈ C3 | t ∈ C}.
3
L
(Hint: For D2 one can use that O(D2) ' C[t3 , t4, t5] = C ⊕ i≥3 Cti .)
2 dim X.
(Hint: If I(X) = (f1 , . . . fm ) then T X ⊆ Cn × Cn is defined by the equations
n
X ∂fj
fj = 0 and yi (x) = 0 for j = 1, . . ., m.
∂xi
i=1
O(Y )
If X := V and Y := W are vector spaces then a homomorphism ρ : O(V ) →
C[ε] corresponds to an element x ⊕ vε ∈ V ⊕ V ε where ρ(f ) = f (x + vε),
and so ρ ◦ ϕ∗ corresponds to the element ϕ(x + vε) ∈ W ⊕ W ε. Thus we
obtain the following result which is very useful for calculating differentials of
morphisms.
Lemma 3.90. Let ϕ : V → W be a morphism between vector spaces, and
let x ∈ V and v ∈ TxV = V . Then we have
ϕ(x + εv) = ϕ(x) + dϕx (v) ε
where both sides are considered as elements of W ⊕ W ε.
Example 3.33. The differential of the m-th power maps Mn → Mn ,
A 7→ Am , in E is m · Id. In fact, (E + Xε)m = E + mXε.
The differential of ϕ : M2 → M2 , ϕ(A) := A2, in an arbitrary matrix A is
given by dϕA (X) = AX + XA, because (A + Xε)2 = A2 + (AX + XA)ε.
4. TANGENT SPACES AND DIFFERENTIALS 97
' ⊆ )
ϕ−1 (U ) / VU ×C (f ) /U ×C
MM r
MMM
MMM p rrrrr
ϕ MM rrr prU
M& yrrr
U
Proof. We have to show that there is a non-zero s ∈ O(Y ) such that
O(X)s ' O(Y )s [t]/(f ) with a polynomial f ∈ O(Y )s [t]. Then the claim
follows by setting U := Ys .
By assumption, the field C(X) is a finite extension of C(Y ) of degree n,
say,
C(X) = C(Y )[h] ' C(Y )[t]/(f )
Pn i
where f = i=0 fi t , fi ∈ C(Y ) and fn = 1. There is a non-zero element
s ∈ O(Y ) such that
(a) fi ∈ O(Y )s for all i,
(b) h ∈ O(X)s and L
(c) O(X)s = O(Y )s [h] = n−1 i
i=0 O(Y )s h .
In fact, (a) and (b) are clear. For (c) we first remark that O(Y )s [h] =
Ln−1 i
i=0 O(Y )s h ⊆ O(X)s , because of (a) and (b). If h1 , . . . , hm is a set of gen-
erators of O(X) we can find a non-zero s ∈ O(Y ) such that hi ∈ O(Y )s [h],
proving (c).
Setting U := Ys we get ϕ−1 (U ) = Xs and O(Xs ) = O(Ys )[t]/(f ), by (c),
and the claim follows.
Proof of Theorem 3.92. By the Decomposition Theorem (Theorem
3.59) we can assume that ϕ is the composition of a finite surjective morphism
and a projection of the form Y × Cr → Y . Since the differential of the second
morphism is surjective in any point we are reduced to the case of a finite
morphism. Now the claim follows from Lemma 3.93 above and Example 3.35.
Lemma 3.94. Let ϕ : X → Y be a morphism, x ∈ X and y := ϕ(x) ∈ Y .
Assume that X is smooth in x and dϕx is surjective. Then,
(1) Y is smooth in y;
(2) The fiber ϕ−1 (y) is reduced and smooth in x, and dimx F = dimx X −
dimy Y .
Proof. By assumption,
dim TxF ≤ dim ker dϕx = dim TxX − dim Ty Y ≤ dim X − dim Y ≤ dimx F
100 3. BASICS FROM ALGEBRAIC GEOMETRY
that ϕ = η ◦ ϕ̃:
8 X̃
ϕ̃q qq
η
q qq
q ϕ
Y /X
Proof. Let O(X)0 ⊆ C(X) be the integral closure of O(X) and define
a := {f ∈ O(X) | f O(X)0 ⊆ O(X)}.
Then a is a non-zero ideal of O(X), because O(X)0 is finite over O(X), and
Xnorm = X \ VX (a). In fact, for S := O(X) \ mx we have
OX,x = O(X)S ⊆ O(X)0S
and the latter is the integral closure of OX,x . On the other hand, O(X)S =
O(X)0S if and only if S ∩ a 6= ∅ which is equivalent to x ∈
/ VX (a).
Exercise 89. Consider the morphism ϕ : C2 → C4 , (x, y) 7→ (x, xy, y 2, y 3).
(1) ϕ is finite and ϕ : C2 → Y := ϕ(C2) is the normalization.
(2) 0 ∈ Y is the only non-normal and the only singular point of Y .
(3) Find defining equations for Y ⊆ C4 and generators of the ideal I(Y ).
Exercise 90. If X is a normal variety then so is X × Cn .
C(Y ) C(X)
If H ⊆ Y is an irreducible hypersurface then, by assumption (a), H meets
the image ϕ(X) in a dense set and so ϕ(ϕ−1 (H)) = H. This implies that
there is an irreducible hypersurface H 0 ⊆ X such that ϕ(H 0 ) = H. If we
denote by p := I(H) ⊆ O(Y ) and p0 := I(H 0) ⊆ O(X), the corresponding
minimal prime ideals, we get p0 ∩ O(Y ) = p. Thus
O(Y )p ⊆ O(X)p0 $ C(Y ) = C(X).
110 3. BASICS FROM ALGEBRAIC GEOMETRY
Since O(Y )p is a discrete valuation ring this implies O(Y )p = O(X)p0 (see
Exercise 91). Thus, by Corollary 3.115,
\ \
O(X) ⊆ O(X)p0 = O(Y )p = O(Y ),
p0 p
Peter Russell
(ii) This is not true even for m = 1, n = 2 , if char(k) > 0 (see Example
4.1 part (ii)). The classification of lines in the plane in this case is one of the
truly challenging remaining problems in 2-dimensional geometry.
(iii) There are no counterexamples known if char(k) = 0 and k = k.
On the other hand it has been pointed out in [68] that there are closed
embeddings R ⊆ R3 that represent knots and so cannot be rectified over R.
(iv) The high codimension theorem: This is true for n > 2m + 1, in
particular m = 1, n = 4.
Several, in principle quite similar, proofs [19, 36, 45, 71] were given, the
first two with slightly weaker bounds.
(v) Here are the next two (very challenging) open cases in zero charac-
teristic.
(v.1) m = 1, n = 3 (lines in space). See [11, 18, 19, 68] for some partial
results.
(v.2) m = 2, n = 3 (planes in space). See [46, 62, 65, 66] for some partial
results.
4.5. One can, of course, study the embedding problem in other categories,
for instance in complex analytic geometry. In that case there exist non rec-
tifiable lines in the plane, i.e. proper holomorphic embeddings of C into C2
that are not equivalent to a linear embedding. See [22], for instance.
4.6. An important generalization [54]of the AMS-theorem is the
Lin-Zaidenberg theorem: Let C be a topologically contractible curve
in X = C2. Then there are coordinates x, y for X such that C = V (xp − y q )
for some p, q ∈ N, GCD(p, q) = 1.
Proof. This is a sketch of the proof of Craighero [19] for (ii), the first
for a result of this nature, for the case Y = Am and n > 2m + 1 replaced by
n > 3m. Note that this includes the case m = 1, n = 4.
Step I. We look at Y 1 ⊆ Pn = X. The variety Z of lines in Pn meeting
Y 1 in at least 2 points or tangent to Y 1 has dimension ≤ 2m + 1. Hence
Z ∞ = Z ∩ X∞ (X∞ = hyperplane at infinity of X) has dimension ≤ 2m and
dimZ ∞ + m − 1 ≤ 3m − 1 ≤ n − 2 = dimX∞ − 1. Hence for a general linear
subspace L of X∞ of dimension m − 1 we have
(∗) L ∩ Z ∞ = ∅.
Let π be the projection out of such an L. (∗) implies that
(∗∗) any of the m-dimensional fibers of π (by definition they intersect X∞ in
L) meets Y1 in at most one point and is not tangent to Y1 .
After a linear change of coordinates we may assume that
L = V+ (X0 , X1 , ..., Xn−m ),
with V+ (X0 ) = X∞ . Then
π|X : X −→ X
is given by
(x1, ..., xn) 7−→ (x1, ..., xn−m, 0, ...0).
Let π 0 be the restriction of π to Y1 . By (∗∗), π 0 is injective and so is the
induced map on the tangent space at any p ∈ Y1 . Since Y 1 ∩ L = ∅, π 0 is a
proper map. It follows that π 0 is a closed embedding.
Step II. We need
Lemma 4.8. Let σ : k[x1, ..., xn] → B be a surjective homomorphism
(corresponding to a closed embedding of Spec B into An ) and put σ(xi ) = bi .
Suppose b1, ..., bn−m generate B over k, where m > 0. Let vj ∈ B, j = 1, ..., m.
Define τ : k[x1, ..., xn] −→ B by
τ (xi ) = bi , i = 1, ..., n − m
and
τ (xn−m+j ) = vj , j = 1, ..., m.
Then σ and τ are equivalent by an elementary automorphism.
Proof. Since B = k[b1, ..., bn−m], we can find polynomials Gi , Hi ∈
k[x1, ..., xn−m], i = 1, ..., m such that Gi (b1, ..., bn−m) = bn−m+i and Hi (b1 , ...,
bn−m ) = vi . Then
η : k[x1, ..., xn] −→ k[x1, ..., xn]
η(xi ) = xi, i = 1, ...., n − m
4. GENERAL AND GENERIC FIBERS 117
corresponds to choosing b01 , ..., b0n ∈ B so that B = k[b01, ..., b0n]. By Step I
there is a linear automorphism λ of A = k[x1, ..., xn] such that b1 , ..., bn−m
generate B, where bi = λ(b0i ). By the lemma there are elementary auto-
morphisms η1 , η2 transforming (b1, ..., bn) into (b1, ..., bn−m, u1 , ..., um) and
(b1, ..., bn−m , u1, ..., um) into (0, ..., 0, u1, ..., um) respectively. The composite
of λ, η1 and η2. gives a tame automorphism transforming the given embed-
ding into a standard linear one.
Exercise 97. Deduce the following from Lemma 4.8: Let
ρ : Am −→ An
be a closed embedding. Then the embedding
Am −→ An × Am
y 7−→ (y, 0)
is rectifiable. Work this out explicitly for Example 4.1 (ii). Note that we are
claiming that (y 4 + x + x6, z)k[x, y, z] is generated by a subset of a set of
variables for k[x, y, z], note that y 4 + x + x6 is a variable.
For example
Definition 4.9. X is generically (resp. generally) affine m-space over Y
if S −1 A ' K [m] (resp. A/my A ' k [m] , y ∈ U , U dense open in Y ).
Theorem 4.10. (i) Suppose X is generically affine m-space over Y . Then
it is so generally.
(ii) The converse of (i) is true if k = C and m = 1 or m = 2.
Proof. (i) Since A is finitely generated over B, we can find s ∈ S, i.e.
[m]
0 6= s ∈ B, such that As = Bs . Then U = Spec Bs is dense in Y and
A/my A = As/my As = k [m] for y ∈ U .
(ii) Without restriction on m it is true, at least under a mild assumption
on k (it holds for k = C), that there is a finite algebraic extension of fields
L ⊇ K such that L ⊗K S −1 A ' L[m] . See Corollary 4.33. The next result
then finishes the proof.
Theorem 4.11. (Absence of non-trivial separable forms of the affine line
and the affine plane) Let K be a field. Let D be a finitely generated K-
algebra and suppose L ⊇ K is separably algebraic and m = 1 or m = 2. If
L ⊗K D ' L[m] , then D ' K [m] .
For m = 1 the proof of 4.11 is a nice exercise in Galois theory, requiring
an application of both the additive and multiplicative version of Hilbert’s
Theorem 90. For two quite different proofs in case m = 2 see [44] and [64].
The corresponding question for m ≥ 3 is very interesting and very open.
Exercise 98. Let char(k) = 2 and f = y 4 + x + x6 ∈ A = k[x, y] as
in 1.2 (ii). Take B = k[f ] and S = B \ {0}. Then S −1 B = k(f )[x, y] '
k(t)[x, y]/(f + t) = E, t transcendental over k[x, y]. Then E k(t)[1], but for
e
suitable e, L ⊗k(f ) E ' L[1], where L = k(t1/2 ) is purely inseparable over
k(t).
So Theorem 4.11 is not true without the separability assumption. In Ex-
ercise 98 the generic fiber of f is a purely inseparable form of A1 . It is a much
worked on conjecture that this is always the case for a line V (f ) ⊆ A2 .
carrying out the last step of the proof, the modification of t to a suitable t0,
if this is not the case. See [67] for the case m = 2.
Exercise 99. Consider the hypersurface X = V (x + x2 y + z 2 + t3 ) ⊆ C4
and the morphism
f : X −→ C
given by x. The generic fiber of f is Spec k(x)[z, t] ' A2k(x) , whereas the
fiber f0 is the singular surface C × C, C = V (z 2 + t3) ⊆ C2 . Note that
f0 is homeomorphic to C2. X is what is known as an exotic affine space, an
algebraic variety diffeomorphic but not isomorphic to Cm . In particular Γ(X)
is factorial and has trivial units. See [47] and [49] for more on this.
An important positive result is the following from [67].
Theorem 4.14. Suppose char(k) = 0. Let B be a discrete valuation ring
with field of quotients K and maximal ideal m. Let A be a finitely generated
[2]
B-algebra such that A/mA ' (B/mB) and K ⊗B A ' K [2]. Then A ' B [2].
This result is easily extended to the case when B is a PID, but it is not
clear whether, or in what form, it extends to B of dimension ≥ 2. For a
specific open question see [75], in particular Property 5 and Property 6.
The following result from [46] for m = 2 is not quite as strong as Theorem
4.12 is for m = 1, but the best one can expect in the light of examples as in
Exercise 99. It establishes an important property of A3 that distinguishes it
from general exotic affine spaces.
Theorem 4.15. Let k = C and f ∈ A = C[3]. If f generally defines a
plane, then f generically defines a plane, i.e. k(f ) ⊗k[f ] A ' k(f )[2] , and f is
a variable in A.
In the proof it is shown that after a choice of generic variables special
fibers not isomorphic to C2 are similar to the one appearing in Exercise 99
and that the choice of variables can be improved by a suitable modification
process. Once no such fibers appear, all fibers are C2 and Theorem 4.14 com-
bined with 4.10(ii) gives the result.
Moreover
R[G, H] = R[v, w].
A linear plane in A3 is an A2 defined by an equation that is linear in
a variable for A3. In [66] (for char(k) = 0) and [63] it is shown that such
planes are rectifiable. Here is a precise statement from [63]. The last part of
the proof is a direct consequence of 4.17.
Theorem 4.18. Let a, b ∈ A ' k [2], b 6= 0 and H = bw − a ∈ A[w] ' k [3].
If
(*) A[w]/HA[w] ' k [2],
then
(**) there exist u, v ∈ A such that A = k[u, v], b ∈ k[u] and a − v is
nilpotent mod bA.
Moreover, there exists G ∈ A[w] = k[u, v, w] such that
k[u, v, w] = k[u, G, H].
Conversely, if (∗∗) holds, then so does (∗).
This result
P hasi been considerably generalized in [65] to polynomials of the
form H = aiw such that the ai , i ≥ 1 have a non trivial common factor
b ∈ A. In another direction it has been generalized in [76] to polynomials
linear in variables w1, ..., ws and coefficients in k[x, y].
As mentioned before, the case of space lines (n = 3, m = 4) is not at all
settled. I do not even know of a counterexample to rectifiability in positive
characteristic.
Let
f : A1 = Spec k[t] −→ A3 = Spec k[x, y, z]
be given by
f (t) = (x(t), y(t), z(t)).
1
We put L = f (A ).
It had been proposed that f with the property that none of degt x, degt y,
degt z is in the semigroup generated by the other two are not rectifiable:
there is then no obvious elementary automorphism that will reduce one of
the degrees. We discuss an example [18] showing that this is not the case.
Note that f is a closed embedding precisely if
Rf = {F ∈ k[x, y, z]|F (x(t), y(t), z(t)) = t} =
6 ∅.
Moreover, Rf then is a coset of
I(L) = {H ∈ k[x, y, z]|H(x(t), y(t), z(t)) = 0}.
6. CURVES WITH ONE PLACE AT INFINITY 123
with 0 6= a0 ∈ k and
degη (F ) = n = [k(ξ, η) : k(ξ)] = −vF (ξ).
We will assume a0 = 1.
(ii) By Definition 4.20, F has one point at infinity, which we will assume is
on the line {η = 0}. This means that
the degree form of F is η n .
Then
deg(F ) = degη (F ) = n > degξ (F ) = m.
So either m = 0 and F = aη + b with a, b ∈ k and a 6= 0 or m > 0 and F
is monic in ξ, i.e. the degree coefficient in ξ is a non-zero constant. (We may
assume it is 1 if we like.)
4.25. We keep the notation of 4.24.
(i) We introduce local coordinates at p∞ = (0, 1, 0), the point at infinity
of F (see the appendix to this chapter),
X = ξ −1 , Y = ηξ −1 .
Then for G ∈ k[ξ, η],
G∞ = Gξ −deg(G)
is the local equation at p∞ of V (G). We have
G∞ ∈ k[X, Y ] ⊆ k[[X]][Y ] ⊆ k[[X, Y ]].
(ii) In view of Lemma 4,23, f = F∞ is monic in Y and of the form
f = Y n + a1(X)Y n−1 + ... + an (X)
with X|ai (X) for i = 1, ..., n.
An f ∈ k[[X]][Y ] is as in 4.25(ii) is called a monic distinguished polyno-
mial, or a polynomial in Weierstrass form.
4.26. If p∞ = (0, 1, 0) is a point at infinity of G ∈ k[ξ, η], then one can
show that the places at infinity of G at p∞ , or, in the notation of Remark
4.22, the p̃ ∈ C̃ lying above p∞ , are in one to one correspondence with the
irreducible factors of G∞ in k[[X]][Y ], or equivalently in k[[X, Y ]]. This gives
the following basic result:
Proposition 4.27. With notation as in 4.24, F ∈ k[ξ, η] has one place at
infinity if and only if (see 4.35 for details) f = F∞ is irreducible in k[[X, Y ]]
and residually rational over k.
126 4. EMBEDDING PROBLEMS IN AFFINE ALGEBRAIC GEOMETRY
β(xi) = bi
which defines a closed embedding of Y into X = An . (Consider this as the
definition of ”closed embedding”.)
We recall that a B-algebra A is said to be finite over B if A is a finite
B-module.
Pn denotes projective n-space. We write its homogeneous coordinate ring
as k[X0, X1 , ..., Xn ]. Note that in contrast to the affine situation, the homo-
geneous coordinates Xi are determined up to an invertible linear transfor-
mation. We have k(Pn ) = field of rational functions on Pn = {P/Q|P, Q ∈
k[X0 , X1 , ..., Xn] homogeneous of the same degree, Q 6= 0}.
If E ⊆ k[X0 , X1 , ..., Xn] is a set of homogeneous elements, V+ (E) ⊆ Pn
denotes the zero-locus of E.
V+ (X0 ) ' Pn−1 and Pn \ V+ (X0 ) ' X = An = Spec k[x1, ..., xn] with
xi = Xi /X0 . We call X∞ = V+ (X0 ) the hyperplane at infinity of X.
To study subvarieties Y of X = An , we often consider Y , the closure of
Y in X = Pn , and Y∞ = Y ∩ X∞ , the locus at infinity of Y . There is a
1 − 1-correspondence between closed irreducible subvarieties of X = An and
closed irreducible subvarieties of X = Pn not contained in X∞ , which I will
describe for hypersurfaces. Let
f = F0 + ... + Fd ∈ k[x1, ..., xn], Fi homogeneous of degree i, Fd 6= 0.
Put
f+ = X0d f (X1 /X0 , ..., Xd/X0 )
= X0d F0 + X0d−1 F1 (X1 , ..., Xn) + .... + Fd (X1 , ..., Xn ).
Note f+ ∈ k[X0 , X1 , ..., Xn] is homogeneous and that X0 - f+ . We have
Y = V+ (f+ ) and Y∞ = V+ (X0 , Fd).
For example, if n = 2, Y = V (f ) is an affine plane curve, Y a projective
plane curve and Y∞ consists of finitely many points, the points at infinity of
Y . In the other direction, let F (X0, ..., Xn ) ∈ k[X0, ..., Xn ] be homogeneous
of degree d with X0 - F and put
fa = F (1, x1, ..., xn) ∈ k[x1, ..., xn].
We can write
F = X0d F0 + X0d−1 F1 + .... + Fd
with Fi ∈ k[X1, ..., Xn ] homogeneous of degree i, Fd 6= 0. Then
fa = F0 + F1(x1, ..., xn) + ... + Fd (x1, ..., xn).
APPENDIX CHAPTER 4 PART 2: MORE ON FIBERS 129
transforms and multiplicities of f and g at these points. We can also see this
as inductively constructing rings
k[[x, y]] = k[[x1, y1]] ⊆ k[[x2, y2 ]] ⊆ ... ⊆ R̃ = k[[τ ]], (xi+1, yi+1 ) = (x∗i , yi∗).
We will see below that R̃ = k[[xi, yi ]] for i >> 0.
The strategy now is to handle questions about f , in particular about
intersection multiplicities, inductively. An example is the following direct
consequence of Noether’s formula
X
i(f, g) = µi (f )µi (g),
the sum extended over all infinitely near points of f . (Unless f |g, g will pass
through only finitely many of these, i.e. µi (g) = 0 from some point on, and the
sum will be finite.) We note that the sequence of the µj (f ) is non-increasing,
and so will eventually stabilize at some i with value δ, say. If we are in case
(1) or (2) of 4.40 and c - p or p - c, further quadratic transformations will
eventually reduce the multiplicity. We have therefore constructed (xi , yi ) =
(u, w) so that, say, δ = v(u) and δ|v(yj ) for j ≥ i. This says that for s > 0, w
is of the form ws us plus a polynomial in u, with v(ws ) > 0. Hence w ∈ k[[u]].
By 4.40, x and y are polynomials in xi , yi and hence x, y ∈ k[[u]]. If δ > 1,
we have k((x, y)) = k((u)) $ k((τ )) = L = k((x, y)). Hence δ = 1 and u = τ
up to a unit. We record this as:
4.43. By the process of successive quadratic transformations along f we
construct an element τ of value 1 in R̃.
P
4.44. Let g = aj (X)Y j ∈ k[[X, Y ]] and suppose X - g. Then the
Weierstrass degree of g is defined to be
w = W (g) = min{j ; X - aj (X)}.
Then by the Weierstrass preparation theorem there is a unit u ∈ k[[X, Y ]]
such that ug is a monic distinguished polynomial in Y of degree w, i.e.
ug = Y w + Xb1 (X)Y n−1 + ... + Xbw (X) with bj (X) ∈ k[[X]].
4.45. Let f ∈ k[[X, Y ]] be irreducible and residually rational. Let
µ1 ≥ µ2 ≥ ...
be the sequence of multiplicities of f at its successive infinitely near points.
We put
X
µ = min{µj |µj > 1} and λ = λ(f ) = µ + µ2j .
{j|µj >1}
APPENDIX CHAPTER 4 PART 3: PLANE ALGEBROID CURVES 135
Ha Huy Vui
2. Case n = 2
In the example f (x, y) = x2y − x, we have seen that the group H1 cannot
distinguish the special fiber from the general fiber, but the group H0 can do
this. In fact, we have the following simple characterization of critical values,
corresponding to the singularities at infinity of polynomials in two complex
variables.
2. CASE n = 2 139
∂f
(iv) There is a Puiseux root at infinity of ∂x
(x, y) = 0 such that
f (a(y), y)) − t0 → 0, as y → ∞.
and we have (iv). If it is not the case, let (1, h1 ) denote the lowest Newton
dot on X = 1. We must have h1 > 0 since otherwise ∂M ∂X
(0, Y ) = ∂f
∂x
(λ(x), y)
does not tent to zero when y tends to infinity.
Now the idea is to use the Newton dots on or bellow the X-axis to “swal-
low” (1, h1 ).
Let’s consider this idea on the example
M(X, Y ) = Y 3 − 2XY + X 3 Y −1 + X 4 .
The dot (1, 1) represent −2XY . We use the dot (3, −1) (which represents
X 3 Y −1 ) to swallow (1, 1).
p dH(z)
Choose a non-zero root c = 2/3 of = 0 with H(z) = z 3 − 2z
dz
and γ = (2/3)Y . We see that the lowest Newton dot on X = 1 of
p
Mf(X, Y ) = M(X + 2/3 Y, Y )
is higher than (1, 1). P
In the general case, let M(X, Y ) = cij X i Y j/N and let EH 0 be the
∂M
highest Newton edge of . The dot (0, h1 ) is lying on EH 0 . Let us collect
∂X
all the terms whose derivatives lie on EH 0
X
ϕH (X, Y ) := cij X i Y j/N , (i − 1, i/N ) ∈ EH 0 .
d
Let c be a non-zero root of ϕH 0 (z, 1) = 0. In the expansion of ϕH 0 (X +
dz
cY tan θH 0 , Y ), where θH 0 is the angle between EH 0 and the X−axis, the term
d
XY h1 has coefficient 0, since ϕH 0 (c) = 0. Denote by γ1 (Y ) := cY tan θH , we
dz
∂M
say that γ1 (Y ) is the result of the sliding of λ along .
∂X
We see that the lowest Newton dot on X = 1 of M1(X, Y ) = M(X +
γ1 (Y ), Y ) is higher than (1, h1 ).
On X = 0, all dots remain above the X−axis. A recursive sliding λ → γ1 →
γ2 → . . . , will then yields a series γ, for which
f
M(X, Y ) = M(X + γ(Y ), Y )
has no dots on X = 1, and dots on X = 0 all lie above the X−axis.
Thus
∂ M̃ ∂M
(0, Y ) = (γ(Y ), Y ) = 0.
∂X ∂X
The series γ̃(y) := γ(1/y) satisfies the condition (iv).
2. CASE n = 2 143
Rabier and Gaffney numbers. Denote the set of linear maps from X
to Y by L(X, Y ) and by Σ(X, Y ) ⊆ L(X, Y ) the set of non-surjective linear
maps.
Definition 5.10. Let A ∈ L(X, Y ). Set
ν(A) = inf kA∗(Φ)k,
kΦk=1
Take y = (y1, .., ym) ∈ U . Take the index I = (1, .., m) and we consider
the following (formal) system of equation
X
n
∂f1
(x)Vj (x) = y1
j=1
∂xj
...
X
n
∂fm
(x)Vj (x) = ym
j=1
∂xj
Vm+1 = 0
Vn = 0
Using the Cramer’s rule we can solve this system, let Mj (i) = Mk , for J =
I\k, we have
X
m
V1 (x) = (−1)1+k yk M1k /MI
k=1
...
X
m
Vm (x) = (−1)m+k yk Mmk /MI
k=1
Vm+1 = 0
...
Vn = 0
4. Jelonek set
We call Jf the Jelonek set of f. It turns out that if f is a polynomial
mapping from Rn to Rm , or from Cn to Cn , then Jf has a very useful property.
Namely, Jf is R-uniruled (for real polynomial maps) and C-uniruled (for the
complex case). In fact, we have
Theorem 5.14. ([39], [41]) Let f : Rn −→ Rn be a polynomial non-
constant mapping. Then the set Jf is closed, semi-algebraic and for every
non-empty connected component S ⊆ Sf , we have 1 ≤ dim S ≤ n − 1.
Moreover, the set Sf is R-uniruled. It means that for every point a ∈ Sf ,
there is a non-constant polynomial mapping ϕ : R −→ Sf such that ϕ(0) = a.
A curve Γ is called an affine parametric line if it is affine and there is a
dominant polynomial mapping f : C −→ Γ .
4. JELONEK SET 149
f cannot be finite. Thus by Theorem 5.15, we get that the set Jf has a
complex codimension 1, hence it has a real codimension 2. Consequently, if
the Jelonek Conjecture holds in dimension 2n, this gives a contradiction.
The second application to the Jacobian Problem of results about the
Jelonek set is a solution by Jelonek of the problem posed by Arno van den
Essen and Shpilrain.
Let p ∈ C[x1, . . . , xn ] be a polynomial. We say that p is a coordinate
polynomial, if it can be included in a generating set from n elements of the
algebra C[x1, . . . , xn ].
If p is a coordinate polynomial, then it is irreducible and its zero-set is
isomorphic to Cn−1 .
Let ϕ : C[x1, . . . , xn ] −→ C[x1, . . . , xn ] be a polynomial automorphism,
then if f is a coordinate polynomial, then so is ϕ(f ). Conversely, Arno van
den Essen and Vladimir Shpilrain have stated the following
Problem. Is it true that every endomorphism of C[x1, . . . , xn ] taking
any coordinate polynomial to a coordinate one is actually an automorphism?
Let us recall some fact about C-uniruled varieties. For an affine variety
X ⊆ Cn , we say that X has s non-uniruled components at infinity, if the
variety cl(X)\X (where cl denotes the projective closure in Pn (C)) has s
non-uniruled irreducible components.
It is proved by Jelonek that:
• A C-uniruled affine irreducible variety X ⊆ Cn has at most one
non-uniruled component at infinity.
• Every smooth hypersurface S ⊆ Pk (C) of degree r > k is not unir-
uled. A projective variety X is said to be uniruled if it is of positive
dimension and for a generic point in X there exists a rational curve
in X through this point.
Solution of the Problem. We prove something more general.
Theorem 5.21. ([37]) Let Φ be an endomorphism of C[x1, . . . , xn ] and D
be a natural number. Assume that Φ takes any coordinate polynomial p with
deg p ≥ D, to a coordinate one. Then, Φ is an automorphism.
Sketch of the proof. Let Φ be an endomorphism of C[x1, . . . , xn ] and Φ
takes any coordinate polynomial p with deg p ≥ D, to a coordinate one. Φ
induces a polynomial mapping f : Cn −→ Cn , Φ(p) = p ◦ f . It is enough to
show that f is an automorphism of Cn .
We do it in two steps. First we show that f has an invertible Jacobian.
Finally, we show that f is a finite mapping.
4. JELONEK SET 151
153
154 BIBLIOGRAPHY
[19] P.C. Craighero, A result on m-flats in An k , Rend. Sem. Mat. Univ. Padova 75
(1986), 39-46.
[20] S. Cynk and K. Rusek, Injective endomorphisms of algebraic and analytic sets,
Ann. Polon. Math. 56 (1991), No.1, 29-35.
[21] H. Derksen, Inverse degrees and the Jacobian Conjecture, Comm. Algebra, 22,
(1994),No.12, 4793-4794.
[22] H. Derksen and F. Kutzschebauch, Nonlinearizable holomorphic group actions,
Math. Ann. 311 (1998), 41-53.
[23] J. Dixmier, Sur les algèbres de Weyl, Bull. Soc. Math. France, 96 (1968), 209-242.
[24] W. Dicks, Automorphisms of the polynomial ring in two variables, Publ. Sec.
Mat. Univ. Autònoma Barcelona 27 (1983), no. 1, 155-162.
[25] A. Durfee, Five definitions of critical point at infinity, Progress in Mathematics,
162 (1998), Birkhäuser, Basel, 345-360.
[26] D. Eisenbud, Commutative Algebra with a View Towards Algebraic Geometry.
Graduate Texts in Math. vol. 150, Springer Verlag 1995.
[27] A. van den Essen, Polynomial automorphisms and the Jacobian conjecture,
Progress in Mathematics, vol. 190, Birkhäuser Verlag, Basel, 2000.
[28] A. van den Essen, The sixtieth anniversary of the Jacobian Conjecture: a new
approach, Ann. Polon. Math. 76, No.1-2 (2001), 77-87.
[29] A. van den Essen, On Bass’ inverse degree approach to the Jacobian Conjecture
and exponential automorphisms, Contemp. Math 264 (2000), 207-214.
[30] T. Gaffney, Fibers of polynomial mappings at infinity and a generalized Malgrange
condition, Compositio Math. 119 (1999), no. 2, 157-167.
[31] R. Ganong, On plane curves with one place at infinity, J. Reine Angew. Math.
307/308 (1979), 173-193.
[32] C. Godbillon, Géometrie différentielle et Mécanique Analytique, Hermann Paris,
1969.
[33] R. Gurjar, K. Masuda, M. Miyanishi and P. Russell, Affine lines on affine surfaces
and the Makar-Limanov in variant, to appear in Can. J. Math.
[34] Hà Huy Vui and D. T. Lê, Sur la topologie des polynôme complexes, Acta Math.
Vietnam. 9 (1984), 21-32.
[35] Hà Huy Vui, Nombres de Lojasiewicz et singularités à l’infini des polynômes de
deux variables complexes, C.R. Acad. Sci. Paris, t.311, Serie I (1990), 429-432.
[36] Z. Jelonek, The extension of regular and rational embeddings, Math. Ann. 277
(1987), 113-120.
[37] Z. Jelonek, Testing sets for properness of polynomial mapping, Math.Ann.315
(1991), 1-35 .
[38] Z. Jelonek, The Jacobian conjecture and and the extensions of polynomial em-
beddings, Math. Ann. 294 (1992), 289-293.
[39] Z. Jelonek, The set of points at which a polynomial map is not proper, Ann.Pobn.
Math. LVIII.3 (1993), 259-266 .
[40] Z. Jelonek, A solution of the problem of van den Essen and Shpilrain, J. Pure
Appl. Algebra, 137 (1999), 49-55 .
[41] Z. Jelonek, A geometry of real polynomial mappings, Math. Z. 239 (2002), 321-
333 .
BIBLIOGRAPHY 155