Professional Documents
Culture Documents
(Lecture Notes) Wai Kiu Chan - Arithmetic of Quaternion Algebra 2012 (2012)
(Lecture Notes) Wai Kiu Chan - Arithmetic of Quaternion Algebra 2012 (2012)
1 Quaternion Algebras
In this section, F is a field of characteristic 6= 2. Unless stated otherwise, all algebras
considered here are finite dimensional algebras over F . If 1A (or simply 1) is the identity
of an F -algebra A, then the map α 7→ α1A is a monomorphism of F -algebras. This map
identifies F as a subalgebra of A.
If R is a ring, then R× denotes the group of units in R.
i2 = a, j 2 = b, ij = k = −ji
for some a, b ∈ F × .
2
k = −ab. The basis {1, i, j, k} is called a standard basis
In this definition, notice that
for H and we write H = a,b F . Note that there are infinitely many standard bases for
H, and hence therecould be another
pair of nonzero
elements
2 c,2 b ∈ F , different from the
c,d a,b a,b ax ,by
pair a, b, such that F = F . For instance, F = F for any x, y ∈ F × , and
a,b
F = a,−ab
F .
The notation H = a,b F is functorial in F , that is, if K is a field extension of F , then
a, b a, b
⊗K ∼
= as K-algebras.
F K
1
a,b
Theorem 1.4 Let a, b ∈ F × . Then F exists.
Proof. Fix α, β in an algebraic closure E of F such that α2 = a and β 2 = −b. Consider the
two matrices
α 0 0 β
i= , j= .
0 −α −β 0
Direct computations show that
0 αβ
i2 = a, j 2 = b, ij = = −ji.
αβ 0
Since {I2 , i, j, ij} is clearly independent over E, it is also linearly independent overF .
Therefore the F -span of {I2 , i, j, ij} is a 4-dimensional algebra H over F , and H = a,b
F .2
Theorem 1.5 A quaternion algebra over F is central simple, that is, its center is F and
it does not have any nonzero proper two-sided ideal.
Proof. Let H be a quaternion algebra over F , and {1, i, j, k} be a standard basis of H over
F . Consider an element x = α + βi + γj + δk in the center of H, where α, β, γ, δ ∈ F . Then
0 = jx − xj = 2k(β + δj).
2
Theorem 1.8 Let H be a quaternion algebra over F .
Proof. Part (a) is a direct consequence of Wedderburn’s structure theorem. For part (b),
since the characteristic of F is not 2, we can write E = F (i) so that i2 ∈ F × . Let τ be the
nontrivial automorphism of E/F . By the Skolem-Noether Theorem, −i = τ (i) = jij −1 for
some invertible element j ∈ H. Clearly j 6∈ E and 1, i, j are linearly independent over F . If
ij = α + βi + γj with α, β, γ ∈ F , then (i − γ)j = α + βi. But i − γ 6= 0; thus j ∈ F (i) = E
which is impossible. Therefore, {1, i, j, ij} is a basis for H. Note that ij = −ji and so
j 2 ij −2 = i. Therefore, j 2 is in the center of H which is F , and this means that j 2 = b ∈ F .
Clearly, H = E + Ej.2
Definition 1.9 Let {1, i, j, k} be a standard basis for a quaternion algebra H. The elements
in the subspace H0 spanned by i, j and k are called the pure quaternions of H.
The next proposition shows that H0 does not depend on the choice of the standard basis
for H.
(i) x + y = x + y;
(ii) xy = y x;
(ii) x = x;
3
In particular, the conjugation is an involution on H (or, equivalently, an algebra isomor-
phism from H to its opposite algebra H ◦ ). In M2 (F ),
a b d −b
= .
c d −c a
Definition 1.11 For x ∈ H, the (reduced) norm and (reduced) trace of x are the elements
nr(x) = xx and tr(x) = x + x, respectively.
A direct computation shows that both nr(x) and tr(x) are elements of F . The norm
is multiplicative, that is, nr(xy) = nr(x)nr(y) for all x, y ∈ H. The invertible elements of
H are precisely those with nonzero norm. The trace, however, is linear as tr(ax + by) =
atr(x) + btr(y) for all a, b ∈ F . For M2 (F ), the norm of an element is just its determinant.
4
Check that nr(y) = 0. If y 6= 0, then we are done. If y = 0, then −aa21 + aba23 = 0 and thus
nr(a1 i + a3 k) = 0.
Since a1 6= 0, a1 i + a3 k is an isotropic element in H0 .
Suppose that H0 has an isotropic element a1 i + a2 j + a3 k. Then −aa21 − ba22 + aba23 = 0,
and so at least two of a1 , a2 and a3 are nonzero. If a3 6= 0, then
a2 2 a1 2
a +b = 1.
aa3 ba3
If a3 = 0, then
2 2
1+a a2 (1 − a)
a +b = 1.
2a 2aa1
This proves (d) ⇒ (e). √
For (e) ⇒ (f), we assume that ax20 +by02 = 1. If x0 = 0, then b ∈ F and E = √
F , in which
−1 −1
case (f) is certainly true. If x0 6= 0, then one can check that NE/F (x0 + x0 y0 b) = a.
√
Lastly, we suppose that a is a norm from F ( b). If b = c2 for some√c ∈ F , then
(c + j)(c − j) = 0 and H is not a division algebra. So, we may assume that b 6∈ F . Then
a = x21 − by12 for some x1 , y1 ∈ F . Since nr(x1 + i + y1 j) = 0, H again is not a division
algebra. This proves (f) ⇒ (b). 2
Definition 1.14 Let K/F be a field extension. A quaternion algebra H over F splits over
K if H ⊗F K ∼= M2 (K). We say that a quaternion algebra over F splits if it splits over F .
Corollary 1.15 If F is algebraically closed, then every quaternion algebra over F splits.
Proof. This is clear since every element of F is a square in F .2
Corollary 1.16 The quaternion algebras 1,a and a,−a
F F splits.
5
1.3 Quaternion Algebras over Finite Fields
Theorem 1.19 (Wedderburn’s Little Theorem) Let A be a finite division ring. Then A is
a field.
Proof. Let F be the center of A. Then F is a finite field of order q, a prime power ≥ 2. Let
n = dimF A. We shall show that n = 1. Assume the contrary that n > 1. The finite group
A× acts on itself by conjugation. It follows from the class equation that
X
|A× | = q n − 1 = q − 1 + [A× : CA (a)× ],
a
where CA (a) is the centralizer of a, and the a in the summation runs over a (nonempty) set
of representatives of non-singleton conjugacy classes of A× . Let r(a) = dimF CA (a). Then
1 ≤ r(a) < n, and the transitivity of dimensions shows that r(a) | n. Rewriting the class
equation, we have
X qn − 1
(∗) qn − 1 = q − 1 + .
a
q r(a) − 1
Let r be one of the r(a) in the summation. Since r | n, we have the following factorization
in Z[x]:
xn − 1 = Φn (x)(xr − 1)h(x), h(x) ∈ Z[x],
where Φn (x) is the n-th cyclotomic polynomial. This equation implies that (q n − 1)/(q r − 1)
is always an integer divisible by Φn (q). It follows from (∗) that Φn (q) divides q − 1 as well.
In particular, Y
q − 1 ≥ |Φn (q)| = |q − ζ|,
where ζ ranges over all the primitive n-th roots of unity. This is impossible since n > 1 and
q ≥ 2 clearly implies that |q − ζ| > q − 1 ≥ 1 for each ζ.2
Corollary 1.20 If A is a central simple algebra over a finite field F , then A ∼= Mn (F ) for
some n ≥ 1. In particular, every quaternion algebra over a finite field splits.
6
2.1 Quadratic Spaces
A quadratic space over a field F is a pair (V, Q), where V is a finite dimensional vector
space over F and Q : V → F satisfies:
Lemma 2.1 If B and B 0 are two bases for V , then there exists a matrix T in GLn (F ) such
that MB0 = T MB T t .
Let V ∗ be the dual space of V , the vector space of all linear maps V → F . If B =
{v1 , . . . , vn } is a basis for V , then B ∗ = {v1∗ , . . . , vn∗ } denotes its dual basis for V ∗ , where
Lemma 2.2 If B is a basis for V and B ∗ is its dual basis for V ∗ , then the matrix of the
linear transformation B̂ with respect to B and B ∗ is MB .
Proof. From B̂(vi )(vj ) = B(vi , vj ) follows
n
X
B̂(vi ) = B(vi , vj )vj∗
j=1
7
Corollary 2.4 A quadratic space (V, B) is nondegenerate if and only if B̂ is an isomor-
phism or, equivalently, when the matrix MB is invertible for one particular basis B for
V.
Proof. This follows from the previous two lemmas.2
P {e1 , . . . , en } be an orthogonal basis for V , and let Q(ei ) = ai for all i. For any
Let
v = xi ei ∈ V , we have
Q(v) = a1 x21 + · · · + an x2n .
In this case, we shall write V ∼
= ha1 , . . . , an i.
Let (V, Q) and (V, Q0 ) be quadratic spaces over F . A linear map σ : V → V 0 is an
isometry if
Two quadratic spaces are isometric if there is an isometry from one to the other. The set
of all isometries from V to V itself form a group which is called the orthogonal group of V ,
denoted O(V ). Suppose that x is an anisotropic vector in V . Then the function τx : V → V
defined by
2B(y, x)
τx (y) = y − x
Q(x)
is called the symmetry with respect to x, which is an element in O(V ).
8
Proof. Since V is the orthogonal sum of 1-dimensional subspaces, it suffices to consider the
case where dimF (V ) = 1; thus V = F x. Under an isometry F x ⊥ V1 → F x ⊥ V2 , x is sent
to a vector y ∈ F x ⊥ V2 . Let u = (x + y)/2 and v = (x − y)/2. Then B(u, v) = 0 and
Q(x) = Q(u) + Q(v). This implies that either Q(u) or Q(v) is nonzero. If Q(u) 6= 0, then
−τu (x) = y; otherwise τv (x) = y. Therefore, there is an isometry Σ : F x ⊥ V1 → F x ⊥ V2
such that Σ(x) = x. It is easy to see that Σ must send V1 to V2 .2
Theorem 2.8 Let H and H 0 be quaternion algebras over F . Then H and H 0 are isomorphic
if and only if the quadratic spaces H0 and H00 are isometric.
Proof. Let nr and nr0 be the reduced norms of H and H 0 , respectively. Suppose first that
there exists an algebra isomorphism φ : H → H 0 . Let x be a nonzero vector in H0 . Then
x 6∈ F but x2 ∈ F . So, φ(x) 6∈ F and φ(x)2 = φ(x2 ) ∈ F . Therefore, φ(x) ∈ H00 ,
which shows that φ is a vector space isomorphism from H0 into H00 . For any x ∈ H0 ,
nr0 (φ(x)) = −φ(x)2 = φ(−x2 ) = φ(nr(x)) = nr(x). Thus H0 and H00 are isometric.
0
that σ : H0 → H0 is an isometry. Let {1, i, j, ij} be a standard basis
Now suppose
a,b
for H = F . Since nr(i) = −i2 = −a, therefore σ(i)2 = −nr0 (σ(i)) = a. Similarly, we
have σ(j)2 = b. The elements i and j are orthogonal in H0 . Therefore, σ(i) and σ(j) are
also orthogonal. So, σ(i)σ(j) = −σ(j)σ(i). Then one can easily check that σ(i) does not
commute with σ(i)σ(j), which means that σ(i)σ(j) 6∈ F . Also, (σ(i)σ(j))2 = −ab ∈ F .
Thus σ(i)σ(j) ∈ H00 . If rσ(i) + sσ(j) + tσ(i)σ(j) = 0 for some r, s, t ∈ F , then left
multiplication by σ(i) to this equation forces r = 0. By a similar
token,
s = t = 0. Thus
0 0 a,b
{1, σ(i), σ(j), σ(i)σ(j)} is a standard basis for H so that H = F .2
9
3 Quaternion Algebras over Local Fields
In this section, we give a more thorough discussion of quaternion algebras over a local
field. A local field, by our definition, is the completion of a number field with respect to
a nontrivial valuation. The complex numbers C and the real numbers R are examples of
local fields. By Corollary 1.15,
every
quaternion
algebra
over C must split. Over R, the
1,1 1,−1 −1,−1
only quaternion algebras are R , R and R = H, since R× has only two square
classes represented by 1 and −1 respectively. The first two are isomorphic to M2 (R).
(b) it is integrally closed, which means that an element of F that is a root of a monic
polynomial over o is already in o;
10
for all x, y ∈ F with equality when v(x) 6= v(y). A place v is called a finite place if it
contains a nonarchimedean valuation. Otherwise it is called an infinite place. The set of all
finite places of F and the set of infinite places of F are denoted by Ωf and Ω∞ , respectively.
Let v be a place of F . The function d(x, y) = v(x − y) defines a metric on F . The
completion of F with respect to this metric is denoted by Fv , which is a locally compact
complete metric space. The valuation v extends uniquely to a valuation on Fv which we also
denote by v. It turns out that Fv is a field, and its addition, subtraction, multiplication and
taking inverse are all continuous operations with respect to the metric topology. Associated
to v is an embedding σv : F → Fv , and we can identify F as a subfield of Fv through σv .
We usually make no distinction between F and σv (F ). So, when we write F ⊆ Fv , it is
understood that F is embedded in Fv through σv . In this way, we identify each element
a ∈ F with its image σv (a) in Fv .
If v is a finite place, then Fv is called a p-adic field. The set {x ∈ Fv : v(x) ≤ 1} is a
subring of Fv , which is called the ring of integers in Fv . It is a local ring with maximal ideal
xo = pn a,
where p - a. Let N(p) := |o/p| be the norm of p and set v(x) = N(p)−n . Then v extends to an
nonarchimedean valuation on F . It turns out that this way of constructing nonarchimedean
valuations yields a bijection between Ωf and the nonzero prime ideals of o. The residue
field of Fv is the field ov /pv , which is isomorphic to o/p, a finite extension of the field Z/pZ
of p elements. A consequence of this is that ov is a compact subset in Fv , which makes Fv
locally compact. The ring ov is a PID, and its maximal ideal pv is generated by any element
π ∈ p \ p2 . In general, a generator of pv is called a uniformizer of Fv , which is an element
in pv of the largest valuation. The nonzero ideals of ov are of the form pnv , n ∈ N. Let π be
a uniformizer of Fv . Then every nonzero element in Fv× can be written as π m with m ∈ Z
and a unit in ov .
Suppose that [F : Q] = n. Then there are n different embeddings of F into C. An
embedding σ : F → C is called a real embedding if σ(F ) ⊆ R; otherwise it is called a
complex embedding. Since complex embeddings occur in pairs, n = r + 2s where r (resp.
s) is the number of real (resp. complex) embeddings of F . If σ is an embedding of F
into C, then v(a) = |σ(a)| is a valuation on F . Here | | is the complex modulus. Note
that a complex embedding and its complex conjugation yield the same valuation. So the
total number of inequivalent valuations obtained in this way is r + s, and they correspond
to all the infinite places of F . An infinite place v of F is real if it corresponds to a real
embedding; otherwise it is called a complex place. When v is a real (resp. complex) place,
Fv is isomorphic to R (resp. C).
11
Example 3.1 There is only one infinite place on Q which contains the usual absolute
value on R. This is clearly a real place. Let p be a prime number. For any nonzero rational
number x, we can write x = pn z, where z is a rational number for which p divides neither
the numerator nor the denominator. Then v(x) = p−n is called the p-adic valuation on Q.
The completion of Q with respect to this valuation is called the field of p-adic numbers,
denoted Qp . The ring of integers in Qp is Zp , the ring of p-adic integers.
In this lecture notes, a local field always means the completion of a number field with
respect to a valuation. Let K be a local field and E/K be a finite extension. It turns out
that E is also a local field. If v is a valuation on K, then v extends uniquely to a valuation
w on E by
1
w(x) = v(NE/K (x)) [E:K] ,
where NE/K is the norm from E to K.
Definition 3.2 A finite extension of p-adic fields E/K is called unramified if a uniformizer
in K is also a uniformizer in E. Otherwise E/K is ramified.
Let oK and oE be the ring of integers of K and E, respectively, and let π be a uniformizer
of K. Then E/K is unramified if and only if π generates the maximal ideal of oE . In general,
if pE is the maximal ideal of oE , then πoE = peE for some positive integer e. This e is called
the ramification index of E/K. Note that E/K is unramified exactly when e = 1. The
residue field oE /pE is a finite extension of the residue field oK /pK . Its degree of extension
[oE /pE : oK /pK ] is called the residue degree of E/K, usually denoted by f .
Theorem 3.3 (Local Fundamental Identity) If E/K is a finite extension of p-adic fields,
then ef = [E : K].
The next theorem is an important result form local class field theory.
Theorem 3.4 (Local Norm Index) Let E/K be a finite abelian extension of local fields.
Then
[K × : NE/K (E × )] = [E : K].
Corollary 3.5 Let E/K be a quadratic extension of p-adic fields. Then E/K is unramified
if and only if NE/K (E × ) contains all the units of the ring of integers in K.
Proof. Suppose that E/K is unramified. If π := NE/K (x) is a uniformizer of K, then
1 1
w(x) = v(NE/K (x)) 2 = v(π) 2 . But since π is also a uniformizer of E, we must have
1
1 > v(π) = w(π) ≥ w(x) = v(π) 2 ,
which is impossible. So, NE/K (E × ) does not contain any uniformizer of K. Notice that
NE/K (E × ) contains K ×2 , and that every coset in K × /K ×2 is represented by an element of
12
K of the form π δ , where δ ∈ {0, 1} and ∈ o× × × ×2
K . So, [K : oK K ] = 2. It follows form the
Local Norm Index Theorem that NE/K (E × ) is equal to o× K K ×2 in this case.
w:H→R
with equality if w(xy −1 ) 6= w(1). Note that w(1) = 1 and that w(xy −1 ) = w(x)w(y)−1 from
(b). Hence w satisfies (c).2
13
(c) Let z ∈ P be such that w(z) is maximal. Then P = zO = Oz.
(d) P 2 ⊆ πO ⊆ P.
(e) The map x + P 7−→ zx + P 2 is a F -vector space isomorphism from O/P to P/P 2 .
Proof. Parts (a) and (b) are straightforward. For (c), if y ∈ P, then w(y) ≤ w(z). Hence
w(z −1 y) = w(yz −1 ) ≤ 1, which means that y is in zO and Oz.
For (d), it is clear that πO ⊆ P. Note that w(π −1 z 2 ) = [v(π)−1 v(nr(z))]2 ≤ 1. So,
z 2 ∈ πO and hence P 2 ⊆ πO.
For (e), the map x + P 7−→ zx + P 2 is a F -linear map. By (c), it is surjective. Suppose
that zx ∈ P 2 . Then x ∈ P which shows that the map is injective. 2
14
extension of F . By Corollary 3.7(e), O/P and P/P 2 have the same dimension as F -vector
spaces. Therefore dimF (O/P) > 1, in particular, O/P = 6 F.
Now choose α ∈ O such that O/P = F (α + P). Then α 6∈ F and hence K = F (α) is
a quadratic extension over F . Since K/F is a nontrivial extension, therefore the residue
degree f of K/F is at least 2. It then follows from the Local Fundamental Identity that
f must be exactly 2 and the ramification index of K/F is 1. So, K/F is unramified, and
hence there exists i ∈ K such that i2 = u.
The two square roots ±i of u give two embeddings of K into H. By the Skolem-Noether
Theorem, there is a j ∈ H × such that −i = jij −1 . Thus {1, i, j, ij} is a basis of H (verify!).
Since j 2 commutes with i, j 2 is in center of H and hence j 2 ∈ F . This implies that {1, i, j, ij}
is a standard basis of H.
Let j 2 = π m , where ∈ o× . We may assume that m = 0 or 1. Since every in o×
u,
is a norm of an element in K, F splits. Thus m = 1, and there exists a, b ∈ F such
that a2 − ub2 = . It remains to show that H = u,π F . It suffices to show that H0 has an
orthogonal basis {e1 , e2 , e3 } such that nr(e1 ) = −u, nr(e2 ) = −π and nr(e3 ) = πu.
Since H = u,π F , H0 has an orthogonal basis {f1 , f2 , f3 } such that nr(f1 ) = −u,
nr(f2 ) = −π and nr(f3 ) = πu. Now, let e1 = f1 , e2 = −1 (af2 + bf3 ) and e3 = −1 (ubf2 +
af3 ). It is direct to check that {e1 , e2 , e3 } is the desired orthogonal basis of H.2
[M : F ] = [M : L][L : F ] = [M : L]
Theorem 3.11 (Hensel’s Lemma) Let f (x) be a monic polynomial in o[x]. Suppose that
f (x) mod p admits a factorization g(x)h(x), where g(x) and h(x) are relatively prime poly-
nomials in F [x]. Then f (x) admits a factorization g(x)h(x) in o[x], where g(x) mod p =
g(x) and h(x) mod p = h(x).
15
Corollary 3.12 Let F be a nondyadic p-adic field. Then F × /F ×2 is a group of order 4
whose elements are represented by 1, u, π and πu where u is a nonsquare unit in o.
Proof. By Hensel’s Lemma, an element c ∈ o× is a square if and only if c is a square
modulo p. Since the residue field o/p is a finite field of odd characteristic, it has exactly
two square classes. So, o× also has exactly two square classes. The Corollary now follows
immediately.2
Theorem 3.13 Let F be a nondyadic p-adic field. Let H = a,b F , where a, b ∈ o.
K ⊗F Fv ∼
Y
(]) = Kw .
w|v
and X
TK/F (a) = Tw/v (a).
w|v
16
Here Nw/v and Tw/v are the norm and the trace of the extension Kw /Fv . Recall that the
valuation on Kw that extends v on Fv is defined by w(x) = v(Nw/v (x))1/[Kw :Fv ] . It also
follows from (]) that X
[Kw : Fv ] = [K : F ].
w|v
If ew and fw are the ramification index and residue degree of Kw /Fv , then the Local
Fundamental Identity implies:
X
ew fw = [K : F ] (Fundamental Identity).
w|v
We now mention some results from class field theory. An element a ∈ F is called a
global norm (of the extension K/F ) if a ∈ NK/F (K). For any v ∈ ΩF , a is called a local
norm at v if a ∈ Nw/v (Kw ) for all w | v.
Theorem 4.1 (Hasse’s Norm Theorem) Let K/F be a cyclic extension of number fields,
and let a ∈ F . Then a is a global norm if and only if it is a local norm at every v ∈ ΩF .
We shall be
√ interested in the special case when K/F is a quadratic extension. In this
case, K = F ( δ) for some δ 6∈ F 2 . An element a ∈ F is a global norm if the diophantine
equation x2 − δy 2 = a has a solution over F . For every v ∈ ΩF , by (]) there are either one
or two places of w lying above v. The latter occurs exactly when δ is a square in Fv and
we say that v splits in K. If there is only one w lying above v, then Kw /Fv is a quadratic
extension and a is a local norm at v if and only if x2 − δy 2 = a has a solution over Fv . If
there are two places of K lying above v, then every a ∈ F is a local norm at v. At the
same time, the equation x2 − δy 2 = a always has a solution over Fv . Hence Hasse’s Norm
Theorem in the special case can be rephrased as: x2 − δy 2 = a has a solution over F if and
only if x2 − δy 2 = a has a solution over Fv for every v ∈ ΩF .
Theorem 4.2 (Global Square Theorem) Let δ be an element in a number field F . Then δ
is a square in F if and only if δ is a square in Fv for almost all v ∈ ΩF .
Recall that a quadratic space (V, Q) is called isotropic if there exists a nonzero vector
v in V such that Q(x) = 0; such a v is called an isotropic vector. The space V is called
isotropic if it has an isotropic vector, and is called nondegenerate if it does not have any
nonzero vector that is orthogonal to all vectors in V . If v is a place of F , then Vv denotes
the quadratic space Fv ⊗F V with quadratic form Qv (a ⊗ x) = a2 Q(x) for all a ∈ Fv and
x ∈ V . We often abuse the notation and use the same Q, instead of Qv , to denote the
quadratic form on Vv .
The next theorem is one of the most important theorem in the algebraic theory of
quadratic forms.
17
(a) Let V be a nondegenerate quadratic space over a number field F . Then V is isotropic
if and only if Vv is isotropic for all places v of F .
(b) Let V and W be nondegenerate quadratic spaces over a number field F . Then V and
W are isometric if and only if Vv and Wv are isometric for all places v of F .
Let us look at part (a) of the Hasse-Minkowski Theorem. We may assume, by scaling
the quadratic form on V suitably, that there exists v1 ∈ V with Q(v1 ) = 1. We can extend
v1 to an orthogonal basis {v1 , . . . , vn } of V . If n = 2, then there exists a δ ∈ F × such that
every Q(v) is of the form x2 + δy 2 with x, y ∈ F . So, V is isotropic if and only if −δ is a
square in F . Thus part (a) in this case is just the Global Square Theorem. If n = 3, then
the corresponding quadratic form on V is of the form x2 + δy 2 + γz 2 . We may assume that
−δ is not a square
√ in F . Then V is isotropic if and only if γ is a norm from the quadratic
extension F ( −δ), and part (a) in this case is just Hasse’s Norm Theorem. The rest of
the proof for part (a) is an (nontrivial) induction on the dimension of V ; see page 187 in
O’Meara’s book.
For part (b), we first observe that the case n = 1 the theorem is equivalent to the Global
Square Theorem. Suppose that n > 1. It suffices to look at the ”if” part of the statement.
We may assume that V has a vector x such that Q(x) = 1. Then V ∼ = h1i ⊥ V 0 for some
0
subspace V of dimension n − 1 . At each v, there is a vector wv ∈ Wv such that Q(wv ) = 1,
since Wv ∼= Vv . This means that the space h−1i ⊥ Wv is isotropic for every v and, by part
(a), the space h−1i ⊥ W is isotropic. Thus there exists a vector w ∈ W with Q(w) = 1.
So, W ∼= h1i ⊥ W 0 for some subspace W 0 of dimension n − 1. Now, by Witt’s Cancellation
Theorem, Vv0 ∼ = Wv0 for all places v. It follows from an induction on the dimension that
0 ∼
V = W , whence V ∼
0
= W.
4.2 Classification
Let v be a place of F . For any a, b ∈ Fv× , define the Hilbert Symbol
18
Let H be a quaternion algebra over F . For any place v of F , let Hv denote the quaternion
algebra Fv ⊗F H over Fv . If v is a complex place, then Hv necessarily splits. However, if v is
a real place or a finite place, then Hv either splits or is isomorphic to the unique quaternion
division algebra over Fv . Note that Hv splits for almost all places v.
Theorem 4.5 Let H be a quaternion algebra over a number field F . Then H splits over
F if and only if Hv splits over Fv for all places v of F .
Proof. By Theorem 1.13, H splits over F if and only if H0 is isotropic. The theorem now
follows immediately from the Hasse-Minkowski Theorem.2
Definition 4.6 Let H be a quaternion algebra over a number field F . Then H is said to
be ramified at a place v if Hv is a division algebra. Otherwise, H splits at v. The set of
places at which H is ramified is denoted by Ram(H).
Proposition 4.7 The set Ram(H) is a finite set containing even number of places.
Proof. This is a consequence of Hilbert’s Reciprocity Law.2
Theorem 4.8 Let H and H 0 be quaternion algebras over a number field F . Then H ∼
= H0
if and only if Ram(H) = Ram(H 0 ).
Proof. By Theorem 2.8, H and H 0 are isomorphic if and only if H0 and H00 are isometric as
quadratic spaces. By the Hasse-Minkowski Theorem, H0 and H00 are isometric if and only
if (H0 )v and (H00 )v are isometric for all places v, which is the same as saying that Hv and
Hv0 are isomorphic for all places v. But Hv and Hv0 are isomorphic if and only if either they
both split or they are both ramified. Thus H and H 0 are isomorphic if and only if they are
ramified at the same set of places.2
19
Dedekind domain is that IF is the free abelian group on the set of nonzero prime ideals of
o. In other words, every fractional ideal a has a unique prime ideal factorization
a = pa11 · · · pat t ,
Theorem 5.2 (Invariant Factor Theorem) Let L and M be two complete lattices in a vector
space V over F . Then there is a basis {x1 , . . . , xn } of V such that
L = a1 x1 + · · · + an xn
M = a1 r1 x1 + · · · + an rn xn
20
Henceforth, H is always a quaternion algebra over F . We first demonstrate the existence
of an order in H. Unless stated otherwise, an ideal in H is always an o-ideal. If I is an
ideal in H, then the left order of I and the right order of I are defined respectively by
Thus bi ∈ d−1 o and O ⊆ d−1 L. So, O is a finitely generated o-module which implies that
O is an order.2
21
The set of all elements
in H that are integral over o is not necessarily an order. For
−1,−1
example, let H = Q with standard basis {1, i, j, ij}. Then α = i and β = (3i + 4j)/5
are integral over Z, but neither αβ nor α + β is integral over Z.
If O is an order in H and α ∈ H × , then αOα−1 is also an order in H. So, the conjugate
of a maximal order in H is also a maximal order. However, there could be more than one
conjugacy classes of maximal orders in a quaternion algebra.
5.2 Localizations I
In this subsection, F is a number field and o is the ring of integers in F . The symbol p
always denote a finite place of F or its associated prime ideal of o. For any p ∈ Ωf , let o(p)
be the localization of o with respect to the multiplicative set o \ p. In other words,
p̃ = {a/b ∈ F : a ∈ p, b ∈ o \ p}.
and so
p̃ = {α ∈ F : ordp (α) > 0}.
It is clear that ordp (α) ≥ 0 for all α ∈ o(p). Conversely, suppose that ordp (α) ≥ 0 and we
can write α = a/b, where a, b ∈ o. Then ordp (a) ≥ ordp (b). Suppose that ordp (b) = n, or
equivalently, bo = pn b for some integral ideal b. Let a be an integral ideal belongs to the
ideal class containing p−1 . So, ap is a principal ideal to for some t ∈ o.
If p - a, then a 6⊆ p. Then there exists x ∈ a but x 6∈ p. Therefore, xo = ac, where p - c.
This implies
xn bo = (ap)n cb = tn cb.
Set b0 := xn b/tn , which is an element in o \ p. Using the same argument we can show that
the element a0 := xn a/tn is in o. Then α = a/b = a0 /b0 ∈ o(p).
If p | a, then a ⊆ p. Choose an integral ideal i such that ia is principal generated by
δ ∈ o. Then ia 6⊆ δp. Fix an ∈ i such that a 6⊆ δp and set γ = /δ. Then γa ⊆ o and
γa 6⊆ p. So, γa is an integral ideal in the ideal class containing p−1 and p - γa. We can then
replace a by γa in the last paragraph.
Let I be a nonzero ideal of o(p). Among all the elements in I, choose one, say x,
such that ordp (x) is the smallest. For any a ∈ I, ordp (a) ≥ ordp (x), which implies that
ordp (ax−1 ) ≥ 0, that is ax−1 ∈ o(p). Thus, a ∈ xo(p) and hence I = xo(p). This shows
22
that o(p) is a PID and its ideals are π n o(p) = p̃n , n ≥ 0. The rings o(p) are subrings of F
and o can be recovered from them as
\
o= o(p).
p∈Ωf
Lemma 5.8 Let V be a finite-dimensional vector space over F and let L be an o-lattice in
V . Then \
L= o(p)L.
p∈Ωf
Proof. It is clear that L is contained in the intersection. For the converse, let {x1 , . . . , xk }
be a generating set for L as an o-module; thus it is also a generating set for o(p)L as an
o(p)-module for each p. Suppose that x is in the intersection. Let
J = {y ∈ o : yx ∈ L}.
Lemma 5.9 Let I be an o-ideal in an quaternion algebra H over F . For each prime ideal
p ∈ Ωf , let I(p) be an o(p)-ideal in H such that I(p) = o(p)I for almost all p. Then
\
J= I(p)
p∈Ωf
Then \ \
J= I(p) ⊆ a−1 o(p)L = a−1 L
p∈Ωf p∈Ωf
23
by Lemma 5.8. Thus J is an o-lattice in H. By the same token, aL ⊆ J; thus J is an
o-ideal in H.
Now, for each p ∈ Ωf , o(p)J ⊆ o(p)I(p) = I(p). For the reverse inclusion,
P let j1 , . . . , jk
be a generating set for J as o-module. Let x ∈ I(p) so that x = i ai ji with ai ∈ F .
Choose a nonzero s1 ∈ o so that s1 a1 ∈ o. Suppose that s1 o has the following prime ideal
factorization
s1 o = pn0 qn1 1 · · · qnt t
where ni ≥ 1 for 0 ≤ i ≤ t. By the Chinese Remainder Theorem, there exists d1 ∈ o such
that
s1 a1 + s1 mod pn0 +1 ,
d1 ≡
s1 mod qini +1 for 1 ≤ i ≤ t.
Then b1 = d1 /s1 is such that b1 − a1 ∈ o(p) and b1 ∈Po(q) for all prime ideals q 6= p. Repeat
the same for each ai toPobtain a bi , and let y = bi ji . Then y ∈ o(q)J ⊆ I(q) for all
q 6= p. Also, y − x = (bi − ai )ji ∈ o(p)J ⊆ I(p). Thus y ∈ I(p) and so y ∈ J. Hence
x = y − (y − x) ∈ o(p)J.2
Note that if O is an o-order in H, then o(p)O is an o(p)-order in H and the above lemma
holds with “ideals” replaced by “orders”.
5.3 Localizations II
In this subsection, we shall interpret the local-global results obtained in the last subsection
in the context of ideals and orders over the p-adics. If H is a quaternion algebra over F ,
then Hp denotes the quaternion algebra Fp ⊗F H. For any lattice L in H, Lp denotes the
op -lattice op ⊗o L in Hp . If O is an order in H, then Op is an op -order in Hp . Note that
Op = op ⊗o O = op ⊗o(p) (o(p)O).
24
Lemma 5.11 There is a bijection between o(p)-ideals (resp. orders) in a quaternion algebra
H over F and the op -ideals (resp. orders) in the quaternion algebra Hp over Fp given by
the map
I 7−→ op ⊗o(p) I
which has the inverse J 7→ J ∩ H.
Proof. Since o(p) is a PID, I is free as an o(p)-module. Let {x1 , x2 , x3 , x4 } be a basis of
I over o(p). Then in Fp ⊗F H, (op ⊗o(p) I) ∩ H consists of the op ∩ F = o(p) linearly
combinations of the xi . Thus (op ⊗o(p) I) ∩ H = I.
Now, suppose that J is an op -ideal in Hp and that {yP 1 , y2 , y3 , y4 } is a basis of J over op .
Let {z1 , z2 , z3 , z4 } be a basis of H over F so that zi = j bij yj for all i. Then B = (bij )
is an invertible matrix in M4 (Fp ). Since F is dense in Fp , we can choose cij ∈ F such that
the entries of C = −1 to make CB to be a unit in M (o ).
P(cij ) is close
P enough to those of B 4 p
0
Now let zi = j cij zj = j,k cij bjk yk . Then {z1 , z20 , z30 , z40 } is a basis of J over op , which
0
5.4 Discriminants
In this subsection, F is the field of fractions of a Dedekind domain o, and H is a quaternion
algebra over F .
25
Proposition 5.15 If an o-order O in H is free with a basis {u1 , u2 , u3 , u4 } over o, then
d(O) is the principal ideal generated by det(tr(ui uj )).
P
Proof. Let x1 , x2 , x3 , x4 ∈ O so that xi = k aik uk with aik ∈ o for all i, k. Then
Example 5.16 Let O = M2 (o). It has a basis {Eij : 1 ≤ i, j ≤ 2}, where Eij is the matrix
with 1 in the (i, j)-entry and 0 elsewhere. Using this basis one can easily compute d(O) = o.
−1,−1
Example 5.17 Let H be the quaternion algebra Q and O be the Z-order Z + Zi +
Zj + Zij. Then d(O) = 16Z. Let O0 be the Z- order O + Zα, where α = (1 + i + j + ij)/2.
Then O ⊆ O0 , and d(O0 ) = 4Z.
Lemma 5.18 Suppose that o is a PID. If O1 and O2 are two o-orders in H with O1 ⊆ O2 ,
then d(O2 ) | d(O1 ), and O1 = O2 if and only if d(O1 ) = d(O2 ).
Proof. The first assertion is clear. For the second assertion, suppose that d(O1 ) = d(O2 ).
Let {u1 , u2 , u3 , u4 } be an o-basis of O1 , and let {v1 , v2 , v3 , v4 } be o-basis of O2 . Since
O1 ⊆ O2 , the matrix T that expresses the ui in terms of the vj has entries in o. But
Theorem 5.19 Suppose that F is a number field and o is its ring of integers. Let O1 and
O2 be o-orders in H with O1 ⊆ O2 . Then d(O2 ) | d(O1 ), and d(O1 ) = d(O2 ) if and only if
O1 = O2 . In particular, O is maximal if d(O) = o.
Proof. The first assertion is clear. Suppose that d(O1 ) = d(O2 ). Then d(o(p)O1 ) =
d(o(p)O2 ) for all p ∈ Ωf . It follows from Lemma 5.18 that o(p)O1 = o(p)O2 for all p,
and so O1 = O2 by Lemma 5.8.2
Let us continue to assume that F is a number field and o is its ring of integers. Suppose
that
Yr
d(O) = pni i
i=1
26
is the prime ideal factorization of d(O). If p is not one of those pi , then d(o(p)O) =
o(p)d(O) = o(p). For i = 1, . . . , t, d(o(pi )O) = pni i o(pi ). Thus
Y
d(O) = (d(o(p)O) ∩ o).
p∈Ωf
Over the p-adics, one can show that d(Op ) = d(O)p for all p. Since the unique prime
ideal in op is pop , one can, with an abuse of notation, re-write the above product as
Y
d(O) = d(Op ).
p∈Ωf
5.5 Orders in M2 (F )
In this subsection, we discuss the special case when H = M2 (F ). Here F is the field of
fractions of a Dedekind domain o. Let V be a 2-dimensional vector space over F . We fix
a basis {e1 , e2 } of V so that M2 (F ) is identified with End(V ). The o-lattice oe1 + oe2 is
denoted by L0 .
If L is a complete o-lattice in V , define
In particular, End(L0 ) is identified with the o-order M2 (o). It is clear that End(L) is a
subring of End(V ) for any L. Moreover, End(L) = End(aL) for all a ∈ F × .
For any complete o-lattice L in V , there exists nonzero a ∈ o such that aL0 ⊆ L ⊆ a−1 L0 .
It follows that
a2 End(L0 ) ⊆ End(L) ⊆ a−2 End(L0 ).
Thus End(L) is an o-order in H.
Lemma 5.21 Let O be an o-order in End(V ). Then there exists a complete o-lattice L in
V such that O ⊆ End(L).
Proof. Let L = {` ∈ L0 : O` ⊆ L0 }. Then L is an o-submodule of L0 . In particular, L is
finitely generated. Also, if 0 6= a in o such that aEnd(L0 ) ⊆ O ⊆ a−1 End(L0 ), then for all
` ∈ L0 , we have
Oa` ⊆ End(L0 )` ⊆ L0 .
Thus aL0 ⊆ L and L is a complete o-lattice in V .
Let α ∈ O. For any ` ∈ L, Oα` ⊆ O` ⊆ L0 . Therefore, α` ⊆ L and O ⊆ End(L).2
27
Corollary 5.22 Suppose that o is a PID. Then the maximal o-orders in M2 (F ) are precisely
the orders End(L). Every maximal o-order in M2 (F ) is conjugate to M2 (o).
Proof. Let O be a maximal o-order in M2 (F ). By Lemma 5.21, there exists a complete
o-lattice L in V such that O = End(L). Conversely, if L is a complete o-lattice in V , then
L = of1 + of2 for some basis {f1 , f2 } of V . Let σ ∈ End(V ) defined by σ(ei ) = fi with
i = 1, 2. Then L = σ(L0 ) and hence End(L) = σEnd(L0 )σ −1 = σM2 (o)σ −1 . This shows
that End(L) is a maximal o-order, and that every maximal o-order in M2 (F ) is conjugate
to M2 (o).2
When o is not a PID, not every L is free as an o-module. However, since o is a Dedekind
domain, there exists a basis {x, y} of V and a fractional ideal a such that L = ox + ay.
Thus End(L) is a conjugate of
a b −1
M2 (o, a) = : a, d ∈ o, b ∈ a , c ∈ a .
c d
Now suppose that F is a p-adic field with ring of integers o. Let π be a uniformizer for
F and let q be the size of the residue field o/p. Then for any integer n ≥ 0, [o : pn ] = q n .
O ∼ O0
=
O ∩ O0 O ∩ O0
as o-modules.
Proof. Suppose that O = End(L) and O0 = End(L0 ), where L and L0 are complete o-lattices
in V . By the Invariant Factor Theorem and scaling L0 by an element in F × if necessary,
we can find a basis {e, f } of L and n ≥ 0 such that {e, π n f } is a basis of L0 . Using {e, f },
0 −1 1 0
we can O with M2 (o) and O with xOx where x is the matrix ( 0 πn ). Then
identify
o pn
O0 = p−n o
. Thus
O ∼ o ∼ O0
= = .
O ∩ O0 pn O ∩ O0
2
Lemma 5.25 Suppose that the distance between two maximal o-orders O and O0 in M2 (F )
is n. Then O0 = xOx−1 for some x ∈ GL2 (F ) with ordp (det(x)) = n.
Proof. This is clear from the proof of Lemma 5.23.
28
5.6 Orders in the Local Case
In this subsection, we deal with the case when F is a p-adic field and o is its ring of
integers. So, o is a PID with prime ideal p = πo. Let H is the unique quaternion algebra
u,π √
F , where u is a nonsquare unit so that F ( u)/F is an unramified quadratic extension.
If v is a valuation on F and w = v ◦ nr, then w is a valuation on H. Let O be the associated
valuation ring {x ∈ H : w(x) ≤ 1}. Note that O is also equal to {x ∈ H : nr(x) ∈ o}.
Theorem 5.26 The valuation ring O is the unique o-maximal order in H and has discrim-
inant d(O) = π 2 o = p2 .
Proof. For any x ∈ H, there exists a nonzero r ∈ o such that rx ∈ O. So, F O = H. If
x ∈ O, then x ∈ O and so tr(x) = x + x ∈ O ∩ F = o. But nr(x) ∈ o since x ∈ O. Thus x
is integral over o. This shows that O is an o-order in H.
If x ∈ H is integral over o, then nr(x) ∈ o and hence w(x) ≤ 1. Thus x ∈ O, which
means that O is precisely the set of elements in H that are integral over o. Hence O is the
maximal o-order in H.
√
Let {1, i, j, ij} be a standard basis of H such that i2 = u and j 2 = π. Let K be F ( u).
Then H = K + Kj and nr|K is the norm NK/F . Since K/F is unramified, π is also a
uniformizer for K. Let oK be the ring of integers of K. Then
oK = {x ∈ K : nr(x) ∈ o}.
Now let α = x + yj ∈ H with x, y ∈ K. Then nr(α) = nr(x) − nr(y)π. Since nr(x) and
nr(y) are of the form π 2m z, where z ∈ o× , we see that nr(α) ∈ o if and only if nr(x) and
nr(y) are in o. Thus α ∈ O if and only if α ∈ oK + oK j. Hence O = oK + oK j.
Now, let y ∈ oK such that {1, y} is a basis of oK over o. Then {1, y, j, yj} is a basis of
O over o. Note that jy = yj and tr(αj) = 0 for all α ∈ K. From this it follows that
d(O) = (y − y)4 π 2 o.
Since K/F is unramified, the reside field of K is a quadratic extension of the reside field of
F . Thus, if y − y is not a unit, then the images of y and y in the residue field of K would
be the same, which means that the image of y is in the residue field of F . However, oK is
equal to o[y], and so the residue fields of K and F must be the same which is impossible.
Therefore, d(O) = π 2 o.2
Definition 5.27 The discriminant of H, denoted ∆(H), is the product of all the finite
places at which H is ramified.
29
Theorem 5.28 Let O be an o-order in a quaternion algebra H over a number field F .
Then O is a maximal o-order if and only if d(O) = ∆(H)2 . In particular, all maximal
o-orders in H have the same discriminant.
Proof. By Lemma 5.13, O is maximal if and only if Op is maximal for every finite place
p. By Example 5.16 and Theorem 5.26, the discriminant of a maximal op -order in Hp is
either op or p2 according to whether Hp splits or is ramified. Furthermore, orders with these
discriminants are necessarily maximal by Theorem 5.19. The result now follows from the
fact that d(O) is the product of all d(Op ).2
Example 5.29 Let H be the quaternion algebra −1,−1 Q . Then Hp splits for all odd
primes p; see Theorem 3.13. Since |Ram(H)| is even and H ⊗Q R = H, H is ramified at 2.
Thus ∆(H) = 2Z. The discriminant of the Z-order O0 in Example 5.17 is 4Z. Thus O0 is a
maximal Z-order in H.
The elements of HA× are called ideles of H. Clearly HA× is a subgroup of the direct product
Y
Hv× . For each v ∈ Ω, there is an embedding σv : H ,→ Hv = Fv ⊗F H, where σv (x) =
v∈Ω
1 ⊗ x for all x ∈ H. Using this embedding, we can identify H × as a subgroup of Hv× and
we will not make any distinction between x and σv (x). If x ∈ H × , there exists r 6= 0 in F
such that rx−1 ∈ O. Since r ∈ o× ×
v for almost all v, therefore x ∈ Ov for almost all v; hence
× ×
H can be identified as a subgroup of HA .
Let x = (xv ) be an idele of H. Define an order xOx−1 in H by specifying its local
completion at a finite place v as
(xOx−1 )v = xv Ov x−1
v .
This definition is meaningful by Lemma 5.12 since xv ∈ Ov× for almost all finite places v
so that (xOx−1 )v = Ov for almost all v. Since the conjugate of a maximal order in Hv is
again maximal, the order xOx−1 is a maximal order in H by Corollary 5.13.
30
Lemma 6.1 Let O0 be another maximal order in H. Then O0 = xOx−1 for some x ∈ HA× .
Proof. Let S be the set of finite places v for which Ov 6= Ov0 . Then S is a finite set and
S ∩ Ram(H) = ∅. If v ∈ S, then Hv ∼ = M2 (Fv ). Since ov is a PID, Ov0 is conjugate to Ov
×
and hence there exists hv ∈ Hv× such that hv Ov h−1 0
v = Ov . Now define an idele x ∈ HA by
1 if v 6∈ S;
xv =
hv if v ∈ S.
It is clear that xOx−1 = O0 .2
Up to this point, we see that HA× acts on the set of all maximal orders in H by conjugation
and the action is transitive. For every v ∈ Ωf , let N(Ov ) be the normalizer of Ov , that is
N(Ov ) = {xv ∈ Hv× : xv Ov x−1
v = Ov }.
It is easy to see that N(Ov ) is a subgroup of Hv× . Moreover, Ov× ⊆ N(Ov ). Let
N(O)A = {x = (xv ) ∈ HA× : xv ∈ N(Ov ) for all v ∈ Ωf }.
Clearly N(Ov ) is a subgroup of HA× . In fact, N(O)A is the stabilizer of O in HA× .
Proposition 6.2 The set of conjugacy classes of maximal orders in H is in bijection with
the double coset space H × \ HA× /N(O)A .
Proof. Let C be the set of conjugacy classes of maximal orders in H. We shall set up a
bijection Φ from the double coset space H × \ HA× /N(O)A to C.
Suppose that H × xN(O)A = H × yN(O)A . Then x = hyn, where h ∈ H × and n ∈
N(O)A . So xOx−1 = h(yOy −1 )h−1 and thus xOx−1 and yOy −1 are in the same conjugacy
class. This means that we can define a function Φ : H × \ HA× /N(O)A → C such that
Φ(H × xN(O)A ) = conjugacy class that contains xOx−1 .
Lemma 6.1 implies that Φ is surjective.
Now, let x, y ∈ HA× such that xOx−1 and yOy −1 are in the same conjugacy class. Then
xOx−1 = hyO(hy)−1 for some h ∈ H × . So, x−1 hy ∈ N(O)A and hence y ∈ H × xN(O)A .
This shows that Φ is injective.2
31
Lemma 6.3 If v is a finite place of F , then nr(Hv× ) = Fv× .
Proof. This
is clear if Hv = M2 (Fv ). Thus we assume that Hv is the unique quaternion
π,u
algebra Fv over Fv , where π is a uniformizer for Fv and u is a nonsquare unit of ov so
√ √
that Fv ( u)/Fv is unramified. The restriction of the reduced norm on Fv ( u) is the usual
√ √ √
norm N of the field extension Fv ( u)/Fv . Since [Fv× : N (Fv ( u)× )] = 2 and N (Fv ( u)× )
contains all the units of ov , it remains to show that nr(Hv× ) contains
a uniformizer of Fv .
But this is clear; nr(j) = −π if {1, i, j, ij} is a standard basis for π,u
Fv .2
Let Ram∞ (H) be the set of infinite places at which H is ramified. It is necessary that
Fv ∼
= R for all v ∈ Ram∞ (H). Let
Proposition 6.4 (Theorem on Norms) Let H be a quaternion algebra over a number field
F . Then nr(H × ) = FH× .
Proof. It is clear that nr(H × ) ⊆ FH× . Let a ∈ FH× . Then a ∈ nr(Hv× ) for all v ∈ Ω by
Lemma 6.3. Since H, when equipped with the reduced norm, is a nondegenerate quadratic
space over F , we can apply Hasse-Minkowski Theorem to the present situation and deduce
that a ∈ nr(H × ).2
The idele group of F is the set
Y
JF = {x = (xv ) ∈ Fv× : xv ∈ o×
v for almost all finite places v}.
v∈Ω
The elements of JF are called the ideles of F . It is clear that JF is indeed an abelian group
under the operation (xv )(yv ) = (xv yv ). For any a ∈ F × , a ∈ o×
v for almost all finite places
v. Therefore, a can be regarded as the idele whose v-component is a itself (of course, here
we identify a with σv (a) where σv : F → Fv is the embedding associated with the place v).
Thus we can identify F × as a subgroup of JF . Since nr(Ov× ) ⊆ o× v for all finite places v, we
can define the reduced norm nr : HA× → JF by nr((xv )) = (nr(xv )).
For each v ∈ Ω, let
nr(N(Ov )) if v ∈ Ωf ;
Nv =
nr(Hv× ) if v ∈ Ω∞ .
Since Z(Hv ) = Fv , Nv contains Fv×2 . Let
It is clear that JF (O) is the image of N(O)A under the reduced norm, and JF (O) ⊇ J2F .
32
Proof. Suppose that Hv is a division algebra. Then Ov is the unique maximal order in Hv .
Thus xOv x−1 = Ov for all x ∈ Hv× . So N(Ov ) = Hv× , and hence Nv = Fv× by Lemma 6.3.
Now suppose that Hv = M2 (Fv ) = End(V ) where V is a 2-dimensional vector space
over Fv . We may assume that Ov = M2 (ov ) = End(L), where L is a complete ov -lattice
in V . In this case, Ov× contains all invertible matrices in M2 (ov ). Therefore, o× ×2
v Fv ⊆ Nv .
For the other inclusion, let σ be an element in N(Ov ). Then
End(L) = σEnd(L)σ −1 = End(σ(L)).
Since End(σL) = End(a σ(L)) for all a ∈ Fv× , we may also assume that σ(L) ⊆ L and, by
choosing a suitably, that there exist a basis {e, f } of L and α ∈ ov such that {e, αf } is a
basis of σ(L). Let τ ∈ End(V ) be the map that sends e to e and f to αf . Then τ σ −1 is
an element in End(σ(L)). Thus nr(σ) = αu for some u ∈ o× v . We claim that α is also in
o×v and this will conclude the proof of this case. For, let β ∈ End(V ) be the element which
switches e and f . Then β ∈ End(L) and hence β is also in End(σ(L))× . But then {f, αe}
×
33
containing Ω∞ , let Y Y
HA× (S) = Hv× × Ov× ,
v∈S v6∈S
which is a subset of HA× . Then the restricted product topology on HA× (S) coincides with
the product topology. Since Ov× is compact for all v ∈ Ωf , HA× (S) is locally compact. Since
HA× is the union of all these HA× (S), HA× is a locally compact topological group.
Example 6.6 The restrictedQproduct topology on HA× is not the subspace topology induced
by the product topology on v∈Ω Hv× . The set
Y Y
U= Hv× × Ov×
v∈Ω∞ v∈Ωf
is
Q open in HA× . If the restricted product topology were the subspace topology induced from
×
v∈Ω Hv , then U must contain a set of the form
\ Y Y
WS = HA× Wv × Hv×
v∈S v6∈S
where S is a finite subset of Ω and Wv is an open subset of Hv× for each v ∈ S. But it is
clear that for U does not contain any such WS .
For each v ∈ Ω, let Hv1 = {xv ∈ Hv× : nr(xv ) = 1}. Define
It is clear that HA1 is the kernel of the reduced norm nr : HA× → JF . Thus HA1 is a normal
subgroup of HA× . Moreover, it contains the commutator subgroup of HA× . We give HA1 the
subspace topology induced by the restricted product topology on HA× . For any finite subset
S of Ω, let
HS1 = {x = (xv ) ∈ HA1 : xv = 1 for all v 6∈ S}.
and set
Y Y Y
U = HA1 ∩ Hv× × Uv × Ov× ,
v∈S v∈T v6∈T ∪S
34
which is an open neighborhood of x in HA1 , where x is idele such that its v-th component
is xv for all v ∈ T and 1 eleswhere. So, there exists h ∈ H 1 and y ∈ HS1 such that hy ∈ U.
This implies
(1) h ≡ xv mod pN
v Lv for all v ∈ T ;
We shall apply the Strong Approximation Theorem to the case where S = Ω∞ . Thus it
is useful to introduce the following standard notion to cover the circumstances under which
the Strong Approximation Theorem will be applied.
Definition 6.8 A quaternion algebra H over a number field F is said to satisfy the Eichler
condition if there is at least one infinite place of F at which H splits.
There is the restricted product topology on JF which is defined similarly to the one on
HA× . For each v ∈ Ω, Fv× is a locally compact; and for each v ∈ Ωf , o× v is compact. The
restricted topology on JF is defined by specifying a fundamental
Q system of neighborhoods
of the identity in JF consisting of the sets of the form v∈Ω Uv , where each Uv is an open
neighborhood of 1 in Fv× with Uv = o× v for almost all finite places v. With this topology,
JF is a locally compact topological group.
Lemma 6.9 Let v be a finite place of F , and Ov0 be a maximal order in Hv . If x ∈ Hv× ,
there exists an open neighborhood U of x such that yOv0 y −1 = xOv0 x−1 for all y ∈ U .
Proof. It suffices to prove the lemma for x = 1. For every y ∈ U = 1 + pv Ov0 ⊆ Ov0 , nr(y) is
a unit. So, y −1 ∈ Ov0 and yOv0 y −1 = Ov0 .2
35
This shows that (∗) is a well-defined function. Clearly it is surjective.
Now suppose that H × xHA1 N(O)A = H × yHA1 N(O)A . Then there exist h ∈ H × , α ∈ HA1
and n ∈ N(O)A such that
hx = αyn.
For the sake of convenience, we let [z]O be the order zOz −1 for z ∈ HA× , and [zv ]Ov be the
order zv Ov zv−1 for all zv ∈ Hv× with v ∈ Ωf . Let
and
J = {v ∈ Ωf \ T : [yv ]Ov 6= Ov }.
Both T and J are finite subsets of Ωf .
Let W be an open set of HA1 of the form
Y Y Y
Hv1 × Uv × Ov1
v∈Ω∞ v∈T ∪J v∈Ωf \(T ∪J)
If v ∈ T , then
[σyv ]Ov = [αv yv ]Ov = [hxv ]Ov ,
while if v ∈ J we have
[σyv ]Ov = [yv ]Ov = [hxv ]Ov .
Therefore, y −1 σ −1 hx ∈ N(O)A and thus H × xN(O)A = H × yN(O)A . This proves that (∗)
is injective.2
Recall that JF (O) is the image of N(O)A under the reduced norm. Let
θ : HA× → JF /F × JF (O)
36
Theorem 6.11 Let H be a quaternion algebra over a number field F . Suppose that H
satisfies the Eichler condition. Then the number of conjugacy classes of maximal orders in
H is equal to the group index [JF : F × JF (O)].
Proof. It suffices to show that the homomorphism θ is surjective with H × HA1 N(O)A as the
kernel.
Let a be an element in JF . By the Weak Approximation Theorem for F , there exists
α ∈ F × such that av α is positive in Fv for every v ∈ Ram∞ (H). So we can assume that
av > 0 for every v ∈ Ram(H). Thus for every v ∈ Ω∞ , av = nr(xv ) for some xv ∈ Hv× .
For almost all finite places v, Hv splits and av is a unit in ov . At any one of these v, the
maximal order Ov is isomorphic to M2 (ov ); hence nr(Ov× ) = o× v . Therefore, av = nr(xv ) for
×
some xv ∈ Ov . For the remaining finitely many places v, it follows from Lemma 6.3 that
av = nr(xv ) for some xv ∈ Hv× . Then x = (xv ) is an element in HA× and nr(x) = a. This
proves that θ is surjective.
It is clear that H × HA1 N(O)A is a part of the kernel of θ. Now, suppose that nr(x) ∈
F JF (O). Then there exists n ∈ N(O)A such that nr(xn) ∈ F × . Since nr(Hv× ) = Fv×2
×
whenever v ∈ Ram∞ (H), it follows that nr(xn) ∈ FH× . By Theorem 6.4, there exists
h ∈ H × with nr(hxn) = 1. As a result, hxn ∈ HA1 and hence the kernel of θ is precisely
H × HA1 N(O)A .2
Definition 6.12 The type number of a quaternion algebra H over a number field is the
number of conjugacy classes of maximal orders in H.
For our convenience, we let h be the set Ram∞ (H). Let
JhF = {x ∈ JF : xv > 0 for all v ∈ h and xv ∈ o×
v for all v ∈ Ωf }.
Let IF be the group of fractional ideals of F , and let PFh be the subgroup of principal
fractional ideals that are generated by a ∈ F × with a positive in Fv for all v ∈ h.
Let PF+ be the set of principal fractional ideals that are generated by totally positive
elements in F . Then PF2 ⊆ PF+ , and so PF /PF+ is an elementary 2-group whose order is less
than 2r , where r is the number of real places of F . Thus the quotient IF /PF+ is a finite
abelian group, called the narrow class group of F , and its order is the narrow class number
of F . Since PF+ ⊆ PFh ⊆ PF , the quotient IF /PFh is also a finite abelian group and its order
divides the narrow class number of F .
For any x ∈ JF , let (x) be the fractional ideal
Y
(x) = pordp (xp ) .
p∈Ωf
The class of (x) in the quotient IF /PFh is denoted by [x]. Given an x ∈ JF , there exists
a ∈ F × such that axv > 0 for all v ∈ h. If b is another element in F × such that bxv > 0 for
all v ∈ h, then [ax] = [bx] in IF /PFh . Hence we have a well-defined homomorphism
x ∈ JF /F × 7−→ [ax] ∈ IF /PFh .
It is clear that this homomorphism is surjective.
37
Lemma 6.13 [JF : F × JhF ] = |IF /PFh |.
Proof. It suffices to show that the kernel of the above homomorphism is F × JhF /F × . Take
an idele x ∈ JhF . Then xv ∈ o× v for all finite places v, whence (x) is trivial. Therefore,
F × JhF /F × is in the kernel. Conversely, suppose that x is in the kernel. Let a ∈ F × be
chosen so that axv > 0 for all v ∈ h. Then there exists b ∈ F × such that b > 0 in Fv for all
v ∈ h and (ax) = (b). Let β = axb−1 . Then βv ∈ o× v for all finite places v, and βv > 0 for
h × h
all v ∈ h. Hence β ∈ JF and x ∈ F JF . Therefore the kernel of the above homomorphism
is F × JhF and the lemma is proved.2
Corollary 6.14 Let H be a quaternion algebra over a number field F . If H satisfies the
Eichler condition, then its type number is finite; it divides the narrow class number of F .
Proof. Let O be a maximal order in a quaternion algebra H over F . Then JF (O) contains
JhF , hence [JF : F × JF (O)] divides [JF : F × JhF ] and the latter divides the narrow class
number of F .2
Corollary 6.15 Let H be a quaternion algebra over a number field F . If H satisfies the
Eichler condition, then its type number is a power of 2.
Proof. It is clear because JF (O) contains J2F .2
Corollary 6.16 Let H be a quaternion algebra over Q which splits at the infinite place.
Then the type number of H is 1.
Remark 6.17 The type number of a quaternion algebra which does not satisfy the Eichler
condition is also finite.
Theorem 7.1 Let m > 1 be a squarefree √ integer such that m ≡ 1, 2 mod 4, h(m) be the
class number of the quadratic field Q( −m), and ψ(m) be the number of integer solutions
to the equation x2 + y 2 + z 2 = m. Then ψ(m) = 12 h(m).
38
Lemma 7.2 Let u be a nonzero element in H. Then u, iu, ju, and ku are mutually orthog-
onal.
Proof. Let p and q be two different elements from {1, i, j, k}. Then
2
So, when u 6= 0, the set Z[i, j, k]u, which is a complete Z-lattice onpH, produces a grid
of 4-dimensional cubes in Q4 . The side of any one of these cubes is nr(u). Let x ∈ H.
Then x must be in one of these cubes. Let su, s ∈ Z[i, j, k], be a corner of this cube that is
closest to x. Then
1
x − su = (a0 + a2 i + a3 j + a4 k)u, |ai | ≤ for all i.
2
So, nr(x − su) < nr(u) unless x happens to be the midpoint of the cube, in which case all
the ai are equal to 12 and so nr(x − su) = nr(u). This shows that the order Z[i, j, k] does
not have any division algorithm (with respect to nr).
Now, let O be the order Z[i, j, k, δ], where δ = (1+i+j+k)
2 . We have seen in Example 5.29
that O is a maximal order in H. This order O is called the Hurwitz order of quaternions.
Note that as a set, O is obtained by adding all the midpoints of the cubes formed by Z[i, j, k].
Moreover, nr(δ) = nr(i) = nr(j) = nr(k) = 1. Therefore, the elements in O are vertices of
a grid of 4-dimensional rhombohedrons in H. If u is a nonzero Hurwitz quaternion, then
the principalpleft ideal Ou of O produces a grid of 4-dimensional rhombohedrons in H with
side length nr(u). Let x be an arbitrary element in H. Then x falls into one of these
rhombohedrons. Let su be one of the closest corner. It is easy to see that nr(x−su) < nr(u)
and so O has a division algorithm.
Proposition 7.3 Let x ∈ H and u be a nonzero element of the Hurwitz order O. Then
there exist s, r ∈ O such that x = su + r with nr(r) < nr(u).
It is clear that in the above discussion one can consider left multiplication of u and
obtains an analogous division algorithm.
A nonzero ideal I in H is said to be a fractional O-ideal if its left order is O. In other
words, I is a fractional O-ideal if
O = {x ∈ H : xI ⊆ I}.
Corollary 7.4 Every left fractional O-ideal in H is principal, that is, it is of the form Ou
for some u ∈ H.
Proof. Let A be a nonzero left fractional O-ideal in H. Since A is a finitely generated
Z-module, there exists a nonzero integer m such that all the elements in mA are integral
over Z. So, we may assume at the outset that all the elements in A are integral over Z.
In particular, the set of reduced norms of nonzero elements in A contains only positive
39
integers and hence it must have a minimum. Let 0 = 6 u ∈ A be chosen such that nr(u) is
this minimum.
For any x ∈ A, by Proposition 7.3 there exists s ∈ O such that nr(x − su) < nr(u).
Since x − su is in A, therefore x − su must be zero. This shows that A = Ou.2
Lemma 7.5 The unit group of O is generated by i, j, δ, and it contains exactly 24 elements.
Proof. This can be verified directly, using the fact that x ∈ O× if and only if nr(x) = 1.2
√
Let m > 1 be a positive integer, and K be the quadratic field Q( −m). If µ ∈ H0 such
2
√ to µ = −m, then we have an embedding φµ from K
that nr(µ) = m, which is equivalent
to Kµ := Q(µ) ⊆ H which maps −m to µ.
Now, suppose that m is a positive integer such that m ≡ 1, 2 mod 4. The set of integer
solutions of the diophantine equation x2 + y 2 + z 2 = m is in bijective correspondence with
the set
Rm (or simply R) = {µ ∈ O : µ2 = −m}
which we call the set of roots of m. So, |R| = ψ(m), which is always positive by a theorem
of Legendre.
Let µ ∈ R and let oµ = O ∩ Kµ . Then oµ is an order in Kµ , and all its elements are
integral over Z. Thus oµ is contained in the ring of integers of Kµ . However, since m ≡ 1, 2
mod 4, it is not hard to show that the ring of integers of Kµ is Z[µ], and µ is clearly in oµ .
Thus oµ is precisely the ring of integers of Kµ .
Oa ∩ Kµ = (Oa ∩ Kµ )a−1 a
⊆ (Oaa−1 ∩ Kµ a−1 )a
= (Ooµ ∩ Kµ )a
= (O ∩ Kµ )a
= oµ a = a.
40
Proof. Again, by the Noether-Skolem Theorem, the two embeddings φµ and φη are conju-
gate inside H, that is there exists x ∈ H × such that φµ (a) = x−1 φη (a)x for all a ∈ K. In
particular, xµ = ηx, and αµ = ηα if and only if x−1 α centralizes Kµ . By Proposition 7.7,
x−1 α ∈ Kµ , or equivalently, α ∈ xKµ .2
Oaµ = Oκ.
W = {B(µ) : µ ∈ R}
∆ : IK × W → W
defined by
∆(a, B(µ)) = B(κµκ−1 ),
where κ = κ(a, µ) is chosen such that Oaµ = Oκ.
Lemma 7.9 The map ∆ defines an action of the group IK on the set W.
Proof. Let a, b be two fractional ideals of K. Let λ = κ(b, µ) µ κ(b, µ)−1 ; that is,
Then φλ is φµ followed by the inner automorphism given by κ(b, µ). Therefore, it must be
that aλ = κ(b, µ)aµ κ(b, µ)−1 and
Oκ(a, λ) = Oaλ
= Oκ(b, µ)aµ κ(b, µ)−1
= Obµ aµ κ(b, µ)−1 ;
41
that is, Oκ(a, λ)κ(b, µ) = O(ba)µ = O(ab)µ . So, we may choose κ(ab, µ) to be the product
κ(a, µ)κ(b, µ), and find that
This shows that ∆(a, ∆(b, B(µ))) = ∆(ab, B(µ)), which proves the proposition.2
Let µ, ν ∈ R, and let
Tµ,ν = {λ ∈ O : λµ = νλ}.
By Corollary 7.8, Tµ,ν is the intersection of O with a two dimensional Q-vector space; so it
is a rank 2 Z-submodule of O. In particular, it is nonzero and hence OTµ,ν is a complete
Z-lattice in H. So, OTµ,ν is an ideal in H. It is easy to see that OTµ,ν is in fact a left
fractional ideal of O.
In (1), it is direct to check that nr(µ + ν) ≡ nr(iµ + νi) ≡ 0 mod 4, but nr(µ + ν) −
nr(iµ + νi) ≡ 4 mod 8. Hence either nr(µ + ν) or nr(iµ + νi) is not divisible by 8. Then we
can take ω to be either (µ + ν)/2 or (iµ + νi)/2. Note that ω will be in O.
For (2), we use γ = (1 + j)µ + ν(1 + j), which is
Notice that all the coefficients in this linear combination are odd. Therefore, ω := γ/2 is in
O, and hence ω ∈ Tµ,ν . It is also easy to see that 2 - nr(ω).
Now, suppose that nr(ρ) is divisible by an odd prime p. Since ρ is a right divisor of
ω0 := µ + ν and ω1 := iµ + νi, therefore it is also a right divisor of i(ω0 + i(ω1 )) = iν − νi
whose reduced norm is 4(y22 + y32 ). We may conclude that y22 + y32 ≡ 0 mod p. Using j,
k instead of i, we obtain y12 + y32 ≡ 0 ≡ y12 + y22 mod p. These three congruences have
only one common solution mod p, namely y1 ≡ y2 ≡ y3 ≡ 0 mod p. This shows that
m = y12 + y22 + y33 ≡ 0 mod p2 , which is impossible.
The case m ≡ 2 mod 4 can be done by a similar argument.2
42
Proposition 7.11 The action of IK on W is transitive. Moreover, the stabilizer of any
B(µ) ∈ W is the subgroup of principal fractional ideals of K.
Proof. Let µ, ν ∈ R. We have indicated earlier that Tν,µ is a Z-module of rank 2. Let {ξ, η}
be a Z-basis of Tν,µ ., and set
b := oµ ξη + oµ nr(η).
Note that α ∈ Tν,µ if and only if α ∈ Tµ,ν . Hence ξη centralizes Kµ , and so ξη ∈ Kµ by
Proposition 7.7. Thus ξη ∈ Kµ ∩ O = oµ . This shows that b is an ideal of oµ . By Lemma
7.10,
Ob = O(oµ ξη + oµ ηη) = Oξη + Oηη = (Oξ + Oη)η = OTν,µ η = Oη.
So, if we use a = φ−1
µ (b) and η = κ(a, µ), then aµ = b, Oaµ = Oη, and η ∈ Tµ,ν , that is
−1
ηµη . Hence
∆(a, B(µ)) = B(ν),
which means that the action of IK on W is transitive.
For the second assertion, let α ∈ K × and a be the principal fractional ideal of K
generated by α. Then φµ (a) = oµ β, where β = φµ (α). We may then choose β ∈ Kµ to be
κ(a, µ) and obtain
∆(a, B(µ)) = B(βµβ −1 ) = B(µ).
Conversely, suppose ∆(a, B(µ)) = B(µ). Then we may select κ(a, µ) such that κµκ−1 =
µ. It follows from Proposition 7.7 that κ ∈ Kµ . Hence
Oaµ = Oκ = Ooµ κ.
aµ = Oaµ ∩ Kµ = Ooµ κ ∩ Kµ = oµ κ,
and so a is principal.2
We have shown that the root bundle ∆(a, B(µ)) depends only the ideal class of a. So,
given an ideal class C ∈ IK , we can define a function ΠC : W → W by ΠC (B(µ)) =
B(a, B(µ)), where a is any fractional ideal in C. By Proposition 7.11, this function ΠC is a
permutation on W, and the map C 7→ ΠC is an isomorphism sending the ideal class group
of K onto a sharply transitive permutation group of W. As a result, |W| = h(m).
Finally, let us give a proof of Theorem 7.1. We need to count the number of roots
belonging to a bundle B(µ). Let the group O× act on the set of roots R by conjugation.
The stabilizer of a root µ is
{ ∈ O× : µ−1 } = O× ∩ Kµ = o×
µ = {±1}.
The last equality is from the hypothesis that m > 1 and m ≡ 1, 2 mod 4. Hence |B(µ)| =
|O× |/2 = 12, and thus
ψ(m) = |R| = 12|W| = 12h(m).
43