ABSTRACT ALGEBRA
Third Edition
I. N. Herstein
Late Professor of Mathematics
University of Chicago
II PRENTICEHALL, Upper Saddle River, New Jersey 07458 @
Library of Congress Cataloging in Publication Data
Herstein, I. N.
Abstract algebra / I.N. Herstein.  3rd ed.
p. em.
Includes index.
ISBN 0133745627 (alk. paper)
1. Algebra, Abstract. I. Title
QA162.H47 1995 9521470
512' .02dc20 CIP
Acquisition Editor: George Lobell
EditorinChief: Jerome Grant
Production Editor: Elaine Wetterau/University Graphics, Inc.
Art Director: Jayne Conte
Cover Designer: Bruce Kenselaar
Cover Photograph: Charnley Residence, entryway.
Photo by © Nick Merrick/HedrichBlessing
Manufacturing Buyer: Alan Fischer
Marketing Manager: Frank Nicolazzo
II© 1996 PrenticeHall, Inc.
= Simon & Schuster/A Viacom Company
 Upper Saddle River, New Jersey 07458
All rights reserved. No part of this book may be
reproduced, in any form or by any means,
without permission in writing from the publisher.
Earlier editions © 1986 and 1990.
Printed in the United States of America
10 9 8 7 6 5 4 3 2
ISBN 0133745627
PrenticeHall International (UK) Limited, London
PrenticeHall of Australia Pty. Limited, Sydney
PrenticeHall Canada Inc., Toronto
PrenticeHall Hispanoamericana, S.A., Mexico
PrenticeHall of India Private Limited, New Delhi
PrenticeHall of Japan, Inc., Tokyo
Simon & Schuster Asia Pte. Ltd., Singapore
Editora PrenticeHall do Brasil, Ltda., Rio de Janeiro
To Biska
CONTENTS
Preface ix
1 Things Familiar and Less Familiar 1
1 A Few Preliminary Remarks 1
2 Set Theory 3
3 Mappings 8
4 A(S) (The Set of 11 Mappings of S onto Itself) 16
5 The Integers 21
6 Mathematical Induction 29
7 Complex Numbers 32
2 Groups 40
1 Definitions and Examples of Groups 40
2 Some Simple Remarks 48
3 Subgroups 51
4 Lagrange's Theorem 56
5 Homomorphisms and Normal Subgroups 66
6 Factor Groups 77
7 The Homomorphism Theorems 84
8 Cauchy's Theorem 88
9 Direct Products 92
10 Finite Abelian Groups (Optional) 96
11 Conjugacy and Sylow's Theorem (Optional) 101
vii
viii Contents
3
The Symmetric Group 108
1 Preliminaries 108
2 Cycle Decomposition 111
3 Odd and Even Permutations 119
4
Ring Theory 125
1
2
3
4
5
6
7
5 Fields
1
2
3
4
5
6
Definitions and Examples 125
Some Simple Results 137
Ideals, Homomorphisms, and Quotient Rings 139
Maximal Ideals 148
Polynomial Rings 151
Polynomials over the Rationals 166
Field of Quotients of an Integral Domain 172
176
Examples of Fields 176
A Brief Excursion into Vector Spaces 180
Field Extensions 191
Finite Extensions 198
Constructibility 201
Roots of Polynomials 207
6 Special Topics (Optional) 215
1 The Simplicity of An 215
2 Finite Fields I 221
3 Finite Fields II: Existence 224
4 Finite Fields III: Uniqueness 227
5 Cyclotomic Polynomials 229
6 Liouville's Criterion 236
7 The Irrationality of 'IT 239
Index 243
PREFACE TO THE THIRD EDITION
When we were asked to prepare the third edition of this book, it was our con
sensus that it should not be altered in any significant way, and that Herstein's
informal s t ~ l e should be preserved. We feel that one of the book's virtues is
the fact that it covers a big chunk of abstract algebra in a condensed and in
teresting way. At the same time, without trivializing the subject, it remains ac
cessible to most undergraduates.
We have, however, corrected minor errors, straightened out inconsis
tencies, clarified and expanded some proofs, and added a few examples.
To resolve the many typographical problems of the second edition,
Prentice Hall has had the book completely retypesetmaking it easier and
more pleasurable to read.
It has been pointed out to us that some instructors would find it useful
to have the Symmetric Group Sn and the cycle notation available in Chapter
2, in order to provide more examples of groups. Rather than alter the
arrangement of the contents, thereby disturbing the original balance, we sug
gest an alternate route through the material, which addresses this concern.
After Section 2.5, one could spend an hour discussing permutations and their
cycle decomposition (Sections 3.1 and 3.2), leaving the proofs until later. The
students might then go over several past examples of finite groups and explic
itly set up isomorphisms with subgroups of Sn This exercise would be moti
vated by Cayley's theorem, quoted in Section 2.5. At the same time, it would
have the beneficial result of making the students more comfortable with the
concept of an isomorphism. The instructor could then weave in the various
subgroups of the Symmetric Groups Sn as examples throughout the remain
ix
x Preface to Third Edition Ch.6
der of Chapter 2. If desired, one could even introduce Sections 3.1 and 3.2
after Section 2.3 or 2.4.
Two changes in the format have been made since the first edition. First,
a Symbol List has been included to facilitate keeping track of terminology.
Second, a few problems have been marked with an asterisk (*). These serve
as a vehicle to introduce concepts and simple arguments that relate in some
important way to the discussion. As such, they should be read carefully.
Finally, we take this opportunity to thank the many individuals whose
collective efforts have helped to improve this edition. We thank the review
ers: Kwangil Koh from North Carolina State University, Donald Passman
from the University of Wisconsin, and Robert Zinc from Purdue University.
And, of course, we thank George Lobell and Elaine Wetterau, and others at
Prentice Hall who have been most helpful.
Barbara Cortzen
David J. Winter
PREFACE TO THE FIRST EDITION
In the last halfcentury or so abstract algebra has become increasingly impor
tant not only in mathematics itself, but also in a variety of other disciplines.
For instance, the importance of the results and concepts of abstract algebra
play an ev;r more important role in physics, chemistry, and computer science,
to cite a few such outside fields.
In mathematics itself abstract algebra plays a dual role: that of a unify
ing link between disparate parts of mathematics and that of a research subject
with a highly active life of its own. It has been a fertile and rewarding research
area both in the last 100 years and at the present moment. Some of the great
accomplishments of our twentiethcentury mathematics have been precisely
in this area. Exciting results have been proved in group theory, commutative
and noncommutative ring theory, Lie algebras, Jordan algebras, combina
tories, and a host of other parts of what is known as abstract algebra. A sub
ject that was once regarded as esoteric has become considered as fairly down
toearth for a large cross section of scholars.
The purpose of this book is twofold. For those readers who either want
to go on to do research in mathematics or in some allied fields that use alge
braic notions and methods, this book should serve as an introductionand,
we stress, only as an introductionto this fascinating subject. For interested
readers who want to learn what is going on in an engaging part of modern
mathematics, this book could serve that purpose, as well as provide them with
some highly usable tools to apply in the areas in which they are interested.
The choice of subject matter has been made with the objective of intro
ducing readers to some of the fundamental algebraic systems that are both in
xi
xii Preface to First Edition
teresting and of wide use. Moreover, in each of these systems the aim has
been to arrive at some significant results. There is little purpose served in
studying some abstract object without seeing some nontrivial consequences of
the study. We hope that we have achieved the goal of presenting interesting,
applicable, and significant results in each of the systems we have chosen to
discuss.
As the reader will soon see, there are many exercises in the book. They
are often divided into three categories: easier, middlelevel, and harder (with
an occasional very hard). The purpose of these problems is to allow students
to test their assimilation of the material, to challenge their mathematical inge
nuity, to prepare the ground for material that is yet to come, and to be a
means of developing mathematical insight, intuition, and techniques. Readers
should not become discouraged if they do not manage to solve all the prob
lems. The intent of many of the problems is that they be triedeven if not
solvedfor the pleasure (and frustration) of the reader. Some of the prob
lems appear several times in the book. Trying to do the problems is undoubt
edly the best way of going about learning the subject.
We have strived to present the material in the language and tone of a
classroom lecture. Thus the presentation is somewhat chatty; we hope that
this will put the readers at their ease. An attempt is made to give many and
revealing examples of the various concepts discussed. Some of these exam
ples are carried forward to be examples of other phenomena that _come up.
They are often referred to as the discussion progresses.
We feel that the book is selfcontained, except in one sectionthe sec
ond last one of the bookwhere we make implicit use of the fact that a poly
nomial over the complex field has complex roots (that is the celebrated Fun
damental Theorem ofAlgebra due to Gauss), and in the last section where we
make use of a little of the calculus.
We are grateful to many people for their comments and suggestions on
earlier drafts of the book. Many of the changes they suggested have been in
corporated and should improve the readability of the book. We should like to
express our special thanks to Professor Martin Isaacs for his highly useful
comments.
We are also grateful to Fred Flowers for his usual superb job of typing
the manuscript, and to Mr. Gary W. Ostedt of the Macmillan Company for
his enthusiasm for the project and for bringing it to publication.
With this we wish all the readers a happy voyage on the mathematical
journey they are about to undertake into this delightful and beautiful realm
of abstract algebra.
I.N.H.
aES
a$S
SeT, T:J S
S = T ~
o
AUB
AnB
{s E Sis satisfies P}
AB
A'
(a, b)
AXB
IR
f : S ~ T
f(s)
i: S ~ S, is
fl(t)
fl(A)
fo g, fg
A(S)
Sn
n!
7L
O(s)
SYMBOL LIST
a is an element of the set S, 3
a is not an element of the set S, 3
S is a subset of the set T, 3
The sets Sand T are equal (have the same elements), 4
The empty set, 4
The union of the sets A and B, 4
The intersection of the sets A and B, 4
The subset of elements of S satisfying P, 4
The difference of the sets A and B, 4
The complement of A, 5
Ordered pair consisting of a, b (see also below), 5
The Cartesian product of A and B, 5
The set of real numbers, 8
Function from the set S to the set T, 8
Image of the element s under the function f, 8
The identity function on S, 9
Inverse image of t under f, 10
Inverse image of a subset A of T under f: S ~ T, 10
Composition or product of functions f and g, 11, 18
Set of 11 mappings from a set S to S, 16
Symmetric group of degree n, 16,109
n factorial, 17
Set of integers, 21
Orbit of s relative to mapping f, 21
xiii
xiv Symbol List
N
mIn
m ~ n
(a, b)
C
i, i
z = a + bi
z = a  bi
lIz
Izl
r(cos 9 + i sin 9)
9
n
Q
En
IGI
C(a)
(a)
Z(G)
ab
a == bmodn
a == ben)
[a]
cl(a)
o(a)
iG(H)
7L
n
Un
'P(n)
Hb
aH
G=G'
'P(G)
Ker 'P
N<JG
GIN
AB
G
1
X G
2
X ... X G
n
(
a b c)
u v w
(a, b, ,c)
An
ao + ali + a2j + a3
k
detx
Set of positive integers, 21
m divides n, 22
m does not divide n, 22
Greatest common divisor of a, b (see also above), 23
Set of complex numbers, 32
Square roots of 1, 32
Complex number z with real part a and imaginary
part b, 32
Conjugate of complex number z = a + bi, 32
Inverse of the complex number z, 33
Absolute value of complex number z, 34
Polar form of a complex number, 35
Primitive nth root of unity, 36, 42
Set of rational numbers, 42
Group of nth roots of unity, 42
Order of the group G, 42
Centralizer of a in G, 53, 102
Cyclic group generated by a, 53
Center of group G, 53
a is equivalent to b in a specified sense, 57
a is congruent to b modulo n (long form), 57
a is congruent to b modulo n (short form), ;'7
Class of all b equivalent to a, 58
Conjugacy class of a, 58, 101
Order of element a of a group, 60
Index of H in G, 59
Set of integers mod n, 61
Group of invertible elements of 7L
n
, 62
Euler 'P function (phi function), 62
Right coset of subgroup H, 58
Left coset of subgroup H, 64
Group G is isomorphic to group G', 68
Image of homomorphism, 70
Kernel of the homomorphisms 'P, 70, 140
N is a normal subgroup of G, 72
Quotient of a group G by a subgroup N, 78
Product of subsets A, B of a group, 79
Direct product of G}, G
2
, ••• , G
n
, 93
Permutation sending a to u, b to v, ... ,c to w, 110
Cycle sending a to b, ... ,c to a, 111
Alternating group of degree n, 121, 215
Quaternion, 131
Determinate of the 2x2 matrix x, 136
H(F)
R(f)S
(a)
F[x]
deg p(x)
(g(x))
g(x) I f(x)
R[x]
F(x)
vE V
av
atvt + · .. + anV
n
(Vb V2' • • • , Vn)
V(f)W
dimF(V)
U+W
[K:F]
F[a]
F(a)
E(K)
f'(x)
<f>n(X)
Symbol List xv
Ring of quaternions over F, 136
Direct sum of rings R, S, 146
Ideal generated by a in a commutative ring, 145
Polynomial ring over the field F, 152
Degree of polynomial p(x), 153
Ideal generated by g(x) in a polynomial ring, 157
Polynomial g(x) divides f(x) , 157
Polynomial ring over ring R, 163
Field of rational functions in x over F, 177
Vector V in a vector space V, 180
Scalar a times vector v, 180
Linear combination of vectors Vt , ••• , V
n
, 181
SUbspace spanned by Vt, V2' ••• ,V
n
, 181
Direct sum of vector spaces V, W, 181
Dimension of V over F, 186
Sum of subspaces U, W of V, 190
Degree of Kover F, 191
Ring generated by a over F, 196
Field extension obtained by adjoining to a to F, 196
Field of algebraic elements of Kover F, 198
Formal derivative ofpolynomialf(x), 227
nth cyclotomic polynomial, 230
ABSTRACT ALGEBRA
1
THINGS FAMILIAR
AND LESS FAMILIAR
1. A FEW PRELIMINARY REMARKS
For many r9aders this book will be their first contact with abstract mathe
matics. l'he· subject to be discussed is usually called "abstract algebra," but
the difficulties that the reader may encounter are not so much due to the "al
gebra" part as they are to the "abstract" part.
On seeing some area of abstract mathematics for the first time, be it in
analysis, topology, or whatnot, there seems to be a common reaction for the
novice. This can best be described by a feeling of being adrift, of not having
something solid to hang on to. This is not too surprising, for while many of the
ideas are fundamentally quite simple, they are subtle and seem to elude one's
grasp the first time around. One way to mitigate this feeling of limbo, or asking
oneself "What is the point of all this?," is to take the concept at hand and see
what it says in particular cases. In other words, the best road to good under
standing of the notions introduced is to look at examples. This is true in all of
mathematics, but it is particularly true for the subject matter of abstract algebra.
Can one, with a few strokes, quickly describe the essence, purpose, and
background for the material we shall study? Let's give it a try.
We start with some collection of objects S and endow this collection
with an algebraic structure by assuming that we can combine, in one or sev
eral ways (usually two), elements of this set S to obtain, once more, elements
of this set S. These ways of combining elements of S we call operations on S.
2 Things Familiar and Less Familiar Ch. 1
Then we try to condition or regulate the nature of S by imposing certain
rules on how these operations behave on S. These rules are usually called the
axioms defining the particular structure on S. These axioms are for us to de
fine, but the choice made comes, historically in mathematics, from noticing
that there are many concrete mathematical systems that satisfy these rules or
axioms. We shall study some of the basic axiomatic algebraic systems in this
book, namely groups, rings, and fields.
Of course, one could try many sets of axioms to define new structures.
What would we require of such a structure? Certainly we would want that
the axioms be consistent, that is, that we should not be led to some nonsensi
cal contradiction computing within the framework of the allowable things the
axioms permit us to do. But that is not enough. We can easily set up such al
gebraic structures by imposing a set of rules on a set S that lead to a patho
logical or weird system. Furthermore, there may be very few examples of
something obeying the rules we have laid down.
Time has shown that certain structures defined by "axioms" play an im
portant role in mathematics (and other areas as well) and that certain others
are of no interest. The ones we mentioned earlier, namely groups, rings, and
fields, have stood the test of time.
A word about the use of "axioms." In everyday language "axiom"
means a selfevident truth. But we are not using everyday languige; we are
dealing with mathematics. An axiom is not a universal truthbut one of sev
eral rules spelling out a given mathematical structure. The axiom is true in
the system we are studying because we have forced it to be true by hypothe
sis. It is a license, in the particular structure, to do certain things.
We return to something we said earlier about the reaction that many
students have on their first encounter with this kind of algebra, namely a lack
of feeling that the material is something they can get their teeth into. Do not
be discouraged if the initial exposure leaves you in a bit of a fog. Stick with
it, try to understand what a given concept says, and most importantly, look at
particular, concrete examples of the concept under discussion.
PROBLEMS
1. Let S be a set having an operation * which assigns an element a * b of S
for any a, b E S. Let us assume that the following two rules hold:
1. If a, b are any objects in S, then a * b = a.
2. If a, b are any objects in S, then a * b = b * a.
Show that S can have at most one object.
Sec. 2 Set Theory 3
2. Let S be the set of all integers 0, ±1, ±2, ... , ±n, . ... For a, b in S define
* by a * b = ab. Verify the following:
(a) a * b =1= b * a unless a = b.
(b) (a * b) * c =1= a * (b * c) in general. Under what conditions on a, b, c is
(a * b) * c = a * (b * c)?
(c) The integer 0 has the property that a * 0 = a for every a in S.
(d) For a in S, a * a = O.
3. Let S consist of the two objects D and ~ . We define the operation * on S
by subjecting D and ~ to the following conditions:
1. D * ~ = ~ = ~ * D.
2. D * D = D.
3. ~ * ~ = D.
Verify by explicit calculation that if a, b, c are any elements of S (i.e., a, b
and c can be any of D or ~ ) , then:
(a) a * b is in S.
(b) (a * b) * c = a * (b * c).
(c) a * b = b * a.
(d) There is a particular a in S such that a * b = b * a = b for all b in S.
(e) Given b in S, then b * b = a, where a is the particular element in Part
(d).
2. SET THEORY
With the changes in the mathematics curriculum in the schools in the United
States, many college students have had some exposure to set theory. This in
troduction to set theory in the schools usually includes the elementary no
tions and operations with sets. Going on the assumption that many readers
will have some acquaintance with set theory, we shall give a rapid survey of
those parts of set theory that we shall need in what follows.
First, however, we need some notation. To avoid the endless repetition
of certain phrases, we introduce a shorthand for these phrases. Let S be a
collection of objects; the objects of S we call the elements of S. To denote
that a given element, a, is an element of S, we write a E Sthis is read "a is
an element of S." To denote the contrary, namely that an object a is not an
element of S, we write a fl. S. So, for instance, if S denotes the set of all posi
tive integers 1, 2, 3, ... , n, ... , then 165 E S, whereas 13 f£ S.
We often want to know or prove that given two sets Sand T, one of
these is a part of the other. We say that S is a subset of T, which we write
SeT (read "S is contained in T") if every element of S is an element of T.
4 Things Familiar and Less Familiar Ch. 1
In terms of the notation we now have: SeTif s E S implies that sET. We
can also denote this by writing T => S, read "T contains S." (This does not ex
clude the possibility that S = T, that is, that Sand T have exactly the same
elements.) Thus, if T is the set of all positive integers and S is the set of all
positive even integers, then SeT, and S is a subset of T. In the definition
given above, S => S for any set S; that is, S is always a subset of itself.
We shall frequently need to show that two sets Sand T, defined per
haps in distinct ways, are equal, that is, they consist of the same set of ele
ments. The usual strategy for proving this is to show that both SeT and
T C S. For instance, if S is the set of all positive integers having 6 as a factor
and T is the set of all positive integers having both 2 and 3 as factors, then
S = T. (Prove!)
The need also arises for a very peculiar set, namely one having no ele
ments. This set is called the null or empty set and is denoted by 0. It has the
property that it is a subset of any set S.
Let A, B be subsets of a given set S. We now introduce methods of con
structing other subsets of S from A and B. The first of these is the union of A
and B, written A U B, which is defined: A U B is that subset of S consisting
of those elements of S that are elements of A or are elements of B. The "or"
we have just used is somewhat different in meaning from the ordinary usage
of the word. Here we mean that an element c is in A U B if it is in A, or is in
B, or is in both. The "or" is not meant to exclude the possibility that both
things are true. Consequently, for instance, A U A = A.
If A = {I, 2, 3} and B = {2, 4, 6, IO}, then A U B = {I, 2, 3, 4, 6, IO}.
We now proceed to our second way of constructing new sets from old.
Again let A and B be subsets of a set S; by the intersection of A and B, writ
ten A n B, we shall mean the subset of S consisting of those elements that
are both in A and in B. Thus, in the example above, A n B = {2}. It should
be clear from the definitions involved that A nBC A and A nBc B.
Particular examples of intersections that hold universally are: A n A = A,
A n S = A, A n 0 = 0.
This is an opportune moment to introduce a notational device that will
be used time after time. Given a set S, we shall often be called on to de
scribe the subset A of S, whose elements satisfy a certain property P. We
shall write this as A = {s E Sis satisfies Pl. For instance, if A, B are subsets
of S, then A U B = {s E Sis E A or s E B} while A n B = {s E Sis E A
andsEB}.
Although the notions of union and intersection of subsets of Shave
been defined for two subsets, it is clear how one can define the union and in
tersection of any number of subsets.
We now introduce a third operation we can perform on sets, the differ
ence of two sets. If A, B are subsets of S, we define A  B = {a E A Ia f£ B}.
Sec. 2 Set Theory 5
So if A is the set of all positive integers and B is the set of all even integers,
then A  B is the set of all positive odd integers. In the particular case when
A is a subset of S, the difference S  A is called the complement of A in S
and is written A' .
We represent these three operations pictorially. If A is ®and B is ®,
then
1. A U B =.is the shaded area.
2. A n B =GBis the shaded area.
3. A  B =G1Vis the shaded area.
4. B  A = ~ is the shaded area.
Note the relation among the three operations, namely the equality
A U B = (A n B) U (A  B) U (B  A). As an illustration of how one goes
about proving the equality of sets constructed by such settheoretic construc
tions, we pfove this latter alleged equality. We first show that (A n B) U
(A  B) U (B  A) C A U B; this part is easy for, by definition, A nBC A,
A  B C A, and B  A C B, hence
(A n B) U (A  B) U (B  A) C A U A U B = A U B.
Now for the other direction, namely that A U B C (A n B) U (A  B) U
(B  A). Given II E A U B, if u E A and u E B, then u E A n B, so it is cer
tainly in (A n B) U (.11  B) U (B  A). On the other hand, if u E A but
u ft. B, then, by the v ~ r y definition of A  B, u E A  B, so again it is cer
tainly in (A n B) U (A  B) U (B  A). Finally, if u E B but u $ A, then
u E B  A, so again it is in (A n B) U (A  B) U (B  A). We have thus
covered all the possibilities and have shown that A U B C (A n B) U
(A  B) U (B  A). Having the two opposite containing relations of A U B
and (A n B) U (A  B) U (B  A), we obtain the desired equality of these
two sets.
We close this brief review of set theory with yet another construction
we can carry out on sets. This is the Cartesian product defined for the two
sets A, B by A X B = {(a, b) Ia E A, b E B}, where we declare the ordered
pair (a, b) to be equal to the ordered pair (ab b
t
) if and only if a = at and
b = b
I
. Here, too, we need not restrict ourselves to two sets; for instance, we
6 Things Familiar and Less Familiar Ch. 1
can define, for sets A, B, C, their Cartesian product as the set of ordered
triples (a, b, c), where a E A, b E B, C E C and where equality of two or
dered triples is defined componentwise.
PROBLEMS
Easier Problems
1. Describe the following sets verbally.
(a) S == {Mercury, Venus, Earth, ... ,Pluto}.
(b) S == {Alabama, Alaska, ... , Wyoming}.
2. Describe the following sets verbally.
(a) S == {2, 4,6,8, ...}.
(b) S == {2, 4, 8, 16,32, ...}.
(c) S == {I, 4, 9, 16,25, 36, ...}.
3. If A is the set of all residents of the United States, B the set of all Cana
dian citizens, and C the set of all women in the world, describe the sets
A n B n C, A  B, A  C, C  A verbally. ~
4. If A == {I, 4, 7, a} and B == {3, 4, 9, II} and you have been told that
A n B == {4, 9}, what must a be?
5. If A C Band B C C, prove that A C C.
6. If A C B, prove that A U C C B U C for any set C.
7. Show that A U B == B U A and A n B == B n A.
8. Prove that (A  B) U (B  A) == (A U B)  (A n B). What does this
look like pictorially?
9. Prove that A n (B U C) == (A n B) U (A n C).
10. Prove that A U (B n C) == (A U B) n (A U C).
11. Write down all the subsets of S == {I, 2, 3, 4}.
MiddleLevel Problems
*12. If C is a subset of S, let C' denote the complement of C in S. Prove the
De Morgan Rules for subsets A, B of S, namely:
(a) (A n B)' == A' U B'.
(b) (A U B)' == A' n B'.
* 13. Let S be a set. For any two subsets of S we define
A + B == (A  B) U (B  A) and A· B == A n B.
Sec. 2 Set Theory 7
Prove that:
(a) A + B = B + A.
(b) A + 0 = A.
(c) A· A = A.
(d) A + A = 0.
(e) A + (B + C) = (A + B) + C.
(f) If A + B = A + C, then B = C.
(g) A . (B + C) = A· B + A . C.
*14. If C is a finite set, let m(C) denote the number of elements in C. If A, B
are finite sets, prove that
meA U B) = meA) + m(B)  meA n B).
15. For three finite sets A, B, C find a formula for meA U B U C). (Hint:
First consider D = B U C and use the result of Problem 14.)
16. Take a shot at finding meAl U A
2
U ... U An) for n finite sets AI, A
2
, ••• ,
An·
17. Use the result of Problem 14 to show that if 800/0 of all Americans have
gone to high school and 70% of all Americans read a daily newspaper,
then at least 500/0 of Americans have both gone to high school and read a
daily newspaper.
18. A pubirc opinion poll shows that 930/0 of the population agreed with the
government on the first decision, 84% on the second, and 740/0 on the
third, for three decisions made by the government. At least what per
centage of the population agreed with the government on all three deci
sions? (Hint: Use the results of Problem 15.)
19. In his book A Tangled Tale, Lewis Carroll proposed the following riddle
about a group of disabled veterans: "Say that 700/0 have lost an eye, 750/0
an ear, 800/0 an arm, 85% a leg. What percentage, at least, must have lost
all four?" Solve Lewis Carroll's problem.
*20. Show, for finite sets A, B, that meA X B) = m(A)m(B).
21. If S is a set having five elements:
(a) How many subsets does Shave?
(b) How many subsets having four elements does Shave?
(c) How many subsets having two elements does Shave?
Harder Problems
22. (a) Show that a set having n elements has 2
n
subsets.
(b) If 0 < m < n, how many subsets are there that have exactly m ele
ments?
8 Things Familiar and Less Familiar Ch. 1
3. MAPPINGS
One of the truly universal concepts that runs through almost every phase of
mathematics is that of a function or mapping from one set to another. One
could safely say that there is no part of mathematics where the notion does
not arise or playa central role. The definition of a function from one set to
another can be given in a formal way in terms of a subset of the Cartesian
product of these sets. Instead, here, we shall give an informal and admittedly
nonrigorous definition of a mapping (function) from one set to another.
Let S, T be sets; a function or mapping f from S to T is a rule that as
signs to each element s E S a unique element t E T. Let's explain a little
more thoroughly what this means. If s is a given element of S, then there is
only one element t in T that is associated to s by the mapping. As s varies
over S, t varies over T (in a manner depending on s). Note that by the defini
tion given, the following is not a mapping. Let S be the set of all people in
the world and T the set of all countries in the world. Let f be the rule that as
signs to every person his or her country of citizenship. Then f is not a map
ping from S to T. Why not? Because there are people in the world that enjoy
a dual citizenship; for such people there would not be a unique country of cit
izenship. Thus, if Mary Jones is both an English and French fwould
not make sense, as a mapping, when applied to Mary Jones. On the other
hand, the rule f: IR IR, where IR is the set of real numbers, defined by
f(a) = a
2
for a E IR, is a perfectly good function from IR to IR. It should be
noted thatf(2) = (2)2 = 4 = f(2), andf(a) = f(a) for all a E IR.
We denote that f is a mapping from S to T by f: S T and for the
t E T mentioned above we write t = f(s); we call t the image of sunder f.
The concept is hardly a new one for any of us. Since grade school we
have constantly encountered mappings and functions, often in the form of
formulas. But mappings need not be restricted to sets of numbers. As we see
below, they can occur in any area.
Examples
1. llet S = {all men who have ever lived} and T = {all women who have ever
lived}. Define f: S T by f(s) = mother of s. Therefore, f(John F. Ken
nedy) = Rose Kennedy, and according to our definition, Rose Kennedy is
the image under f of John F. Kennedy.
2. Let S = {all legally employed citizens of the United States} and T = {posi
tive integers}. Define, for s E S, f(s) by f(s) = Social Security Number of s.
(For the purpose of this text, let us assume that all legally employed citizens
of the United States have a Social Security Then f defines a map
ping from S to T.
Sec. 3 Mappings 9
3. Let S be the set of all objects for sale in a grocery store and let T = {all
real numbers}. Define f: S ~ T by f(s) = price of s. This defines a mapping
from S to T.
4. Let S be the set of all integers and let T = S. Define f: S ~ T by f(m) =
2m for any integer m. Thus the image of 6 under this mapping, f(6), is given
by f(6) = 2 . 6 = 12, while that of 3, f( 3), is given by f( 3) = 2( 3) =
6. If SJ, S2 E S are in Sand f(Sl) = f(S2), what can you say about Sl and S2?
5. Let S = T be the set of all real numbers; define f: S ~ T by f(s) = S2.
Does every element of T come up as an image of some s E S? If not, how
would you describe the set of all images {f(s) Is E S}? When is f (s 1) =
f(S2)?
6. Let S = T be the set of all real numbers; define f: S ~ T by f (s) = S3. This
is a function from S to T. What can you say about {f(s) Is E S}? When is
f(s 1) = f(S2)?
7. Let T be any nonempty set and let S = TXT, the Cartesian product of T
with itself. Definef: T X T ~ Tby f(t
1
, t
2
) = t
1
• This mapping from TXT
to T is called the projection of TXT onto its first component.
8. Let S be the set of all positive integers and let T be the set of all positive
rational numbers. Define f: S X S ~ T by f(m, n) = min. This defines a
mapping from S X S to T. Note that f(l, 2) = ~ while f(3, 6) = ~ = ~ =
f(l, 2), although (1, 2) =1= (3, 6). Describe the subset of S X S consisting of
those (a, b) such that f(a, b) = ~ .
The mappings to be defined in Examples 9 and 10 are mappings that
occur for any nonempty sets and playa special role.
9. Let S, T be nonempty sets, and let to be a fixed element of T. Define
f: S ~ T by f (s) = t 0 for every s E S; f is called a constant function from
Sto T.
10. Let S be any nonempty set and define i: S ~ S by i(s) = s for every
s E S. We call this function of S to itself the identity function (or identity map
ping) on S. We may, at times, denote it by is (and later in the book, bye).
Now that we have the notion of a mapping we need some way of identi
fying when two mappings from one set to another are equal. This is not
God given; it is for us to decide how to declare f = g where f: S ~ T and
g : S ~ T. What is more natural than to define this equality via the actions of
f and g on the elements of S? More precisely, we declare that f = g if and
only if f(s) = g(s) for every s E S. If S is the set of all real numbers and f is
defined on S by f(s) = S2 + 2s + 1, while g is defined on S by g(s) =
(s + 1)2, our definition of the equality of f and g is merely a statement of the
familiar identity (s + 1)2 = S2 + 2s + 1.
10 Things Familiar and Less Familiar Ch. 1
Having made the definition of equality of two mappings, we now want
to single out certain types of mappings by the way they behave.
Definition. The mapping [: S ~ T is onto or surjective if every t E T
is the image under [ of some s E S; that is, if and only if, given t E T, there
exists an s E S such that t = [(s).
In the examples we gave earlier, in Example 1 the mapping is not onto,
since not every woman that ever lived was the mother of a male child. Simi
larly, in Example 2 the mapping is not onto, for not every positive integer is
the Social Security Number of some U.S. citizen. The mapping in Example 4
fails to be onto because not every integer is even; and in Example 5, again,
the mapping is not onto, for the number 1, for instance, is not the square of
any real number. However, the mapping in Example 6 is onto because every
real number has a unique real cube root. The reader can decide whether or
not the given mappings are onto in the other examples.
If we define [(S) = {[(s) E Tis E S}, another way of saying that the
mapping [: S ~ T is onto is by saying that [(S) = T.
Another specific type of mapping plays an important and particular
role in what follows.
Definition. A mapping [: S ~ T is said to be onetoone (written 11)
or injective if for SI =1= S2 in S, [(SI) =1= [(S2) in T. Equivalently, [is 11 if
[(SI) = [(S2) implies that SI = S2.
In other words, a mapping is 11 if it takes distinct objects into distinct
images. In the examples of mappings we gave earlier, the mapping of Example
1 is not 11, since two brothers would have the same mother. However in Ex
ample 2 the mapping is 11 because distinct U.S. citizens have distinct Social
Security numbers (provided that there is no goofup in Washington, which is
unlikely). The reader should check if the various other examples of mappings
are 11.
Given a mapping [: S ~ T and a subset ACT, we may want to look at
B = {s E S 1 [(s) E A}; \ve use the notation [l(A) for this set B, and call
Pl(A) the inverse image o[A under [. Of particular interest is [l(t), the in
verse image of the subset {t} of T consisting of the element t E T alone. If
the inverse image of {t} consists of only one element, say s E S, we could try
to define [I(t) by defining [I(t) = s. As we note below, this need not be a
mapping from Tto S, but is so if[is 11 and onto. We shall use the same no
tation [I in cases of both subsets and elements. This [I does not in general
define a mapping from T to S for several reasons. First, if [is not onto, then
Sec. 3 Mappings 11
there is some t in T which is not the image of any element s, so Ilet) = 0.
Second, if [is not 11, then for some t E T there are at least two distinct
Sl =1= S2 in S such that fest) = t = [(S2). SO Ilet) is not a unique element of
Ssomething we require in our definition of mapping. However, if [is both
11 and onto T, then [I indeed defines a mapping of Tonto S. (Verify!) This
brings us to a very important class of mappings.
Definition. The mapping [: S T is said to be a 11 correspondence
or bijection if [is both 11 and onto.
Now that we have the notion of a mapping and have singled out various
types of mappings, we might very well ask: "Good and well, but what can we
do with them?" As we shall see in a moment, we can introduce an operation
of combining mappings in certain circumstances.
Consider the situation g : S T and [: T u. Given an element s E S,
then g sends it into the element g(s) in T; so g(s) is ripe for being acted on
by f Thus we get an element leges») E U. We claim that this procedure pro
vides us with a mapping from S to U. (Verify!) We define this more formally
in the
If g : S T and [: T U, then the composition (or prod
uct), by [0 g, is the mapping [0 g: S U defined by ([ 0g)(s)
for every s E S.
Note that to compose the two mappings [ and gthat is, for [0 g to
have any sensethe terminal set, T, for the mapping g must be the initial set
for the mapping f. One special time when we can always compose any two
mappings is when S = = U, that is, when we map S into itself Although
special, this case is of the utmost importance.
We verify a few properties of this composition of mappings.
Lemma 1.3.1. If h : S T, g : T U, and [: U V, then [0 (g 0h) =
{fog)oh.
Proof How shall we go about proving this lemma? To verify that two
mappings are equal, we merely must check that they do the same thing to
every element. Note first of all that both [0 (g 0h) and ([0g) 0h define map
pings from S to V, so it makes sense to speak about their possible equality.
Our task, then, is to show that for every s E S, ([0 (g 0 h)(s)
(([0 g) 0 h)(s). We apply the definition of composition to see that
([0 (g 0 h) )(s) = [((g 0 h )(s) = [(g(h (s) ).
12 Things Familiar and Less Familiar
Unraveling
((f 0 g) 0 h)(s) = (f 0 g)(h(s» = f(g(h (s») ,
we do indeed see that
(f 0 (g 0 h»(s) = ((f 0 g) 0 h)(s)
for every s E S. Consequently, by definition, fo (g 0 h) = (fo g) 0 h. D
Ch. 1
(The symbol D will always indicate that the proof has been completed.)
This equality is described by saying that mappings, under composition,
satisfy the associative law. Because of the equality involved there is really no
need for parentheses, so we write fo (g 0 h) as fo go h.
Lemma 1.3.2. If g: S ~ Tandf: T ~ U are both 11, thenfo g: S ~ U
is also 11.
Proof Let us suppose that (fo g)(Sl) = (fo g)(S2); thus, by definition,
f(g(Sl» = f(g(S2»· Since f is 11, we get from this that g(Sl) = g(S2); how
ever, g is also 11, thus Sl = S2 follows. Since (fo g)(Sl) = (fo g)(S2) forces
Sl = S2, the mappingfo g is 11. D
We leave the proof of the next Remark to the reader.
Remark. If g: S ~ T and f: T ~ U are both onto, then fo g: S ~ U is
also onto.
An immediate consequence of combining the Remark and Lemma
1.3.2 is to obtain
Lemma 1.3.3. If g: S ~ T and f: T ~ U are both bijections, then
fo g : S ~ U is also a bijection.
If f is a 11 correspondence of S onto T, then the "object" f
l
: T ~ S
defined earlier can easily be shown to be a 11 mapping of Tonto S. In this
case it is called the inverse of f In this situation we have
Lemma 1.3.4. If f: S ~ T is a bijection, then fo f
l
= iT and f
l
0 f =
is, where is and iT are the identity mappings of Sand T, respectively.
Proof We verify one of these. If t E T, then (fo fl)(t) = f(fl(t».
But what is fl(t)? By definition, fl(t) is that element So E S such that
Sec. 3 Mappings 13
t = f(so). So f(fl(t» = f(so) = t. In other words, (fo fl)(t) = t for every
t E T; hence fo f
1
= iT, the identity mapping on T.D
We leave the last result of this section for the reader to prove.
Lemma 1.3.5. If f: S ~ T and iT is the identity mapping of T onto it
self and is is that of S onto itself, then iT 0 f =f and fo is =f.
PROBLEMS
Easier Problems
1. For the given sets S, T determine if a mapping f: S ~ T is clearly and un
ambiguously defined; if not, say why not.
(a) S = set of all women, T = set of all men, f(s) = husband of s.
(b) S = set of positive integers, T = S, f(s) = s  1.
(c) S = set of positive integers, T = set of nonnegative integers, f(s) =
s 1.
(d) S = set of nonnegative integers, T = S, f(s) = s  1.
(e) S ~ set of all integers, T = S, f(s) = s  1.
(f) S = ~ s e t of all real numbers, T = S, f(s) = ~ .
(g) S = set of all positive real numbers, T = S, f(s) = ~ .
2. In those parts of Problem 1 where f does define a function, determine if
it is 11, onto, or both.
*3. If f is a 11 mapping of S onto T, prove that f
1
is a 11 mapping of T
onto S.
*4. If f is a 11 mapping of S onto T, prove that f
1
0 f = is.
S. Give a proof of the Remark after Lemma 1.3.2.
*6. If f: S ~ T is onto and g : T ~ U and h : T ~ U are such that g 0 f = h 0 f,
prove that g = h.
*7. If g: S ~ T, h : S ~ T, and if f: T ~ U is 11, show that if fo g = fo h,
then g = h.
8. Let S be the set of all integers and T = {I, I}; f: S ~ T is defined by
f(s) = 1 if s is even, f(s) = 1 if s is odd.
(a) Does this define a function from S to T?
(b) Show that f(SI + sz) = f(st)f(sz). What does this say about the inte
gers?
(c) Is f(Stsz) = f(SI)f(sz) also true?
14 Things Familiar and Less Familiar Ch. 1
9. Let S be the set of all real numbers. Define f: S ~ S by f(s) = S2, and
g: S ~ S by g(s) = s + 1.
(a) Find fog.
(b) Find g 0 f.
(c) Isfog = gof?
10. Let S be the set of all real numbers and for a, b E S, where a =1= 0; define
fa,b(S) = as + b.
(a) Show that fa,b 0 fc,d = fu,u for some real u, v. Give explicit values for
u, v in terms of a, b, C, and d.
(b) Is fa,b 0 fc,d = fc,d 0 fa,b always?
(c) Find all fa,b such that fa,b 0 fl,1 = fl,1 0 fa,b'
(d) Show that f;,1 exists and find its form.
11. Let S be the set of all positive integers. Define f: S ~ S by f(l) = 2,
f(2) = 3, f(3) = 1, and f(s) = s for any other s E S. Show that fo fo [ =
is. What is [I in this case?
MiddleLevel Problems
12. Let S be the set of nonnegative rational numbers, that is, S = {min Im, n
nonnegative integers, n =1= O}, and let T be the set of all integers.
(a) Does f: S ~ T defined by f(mln) = 2
m
3
n
define a legitimate function
from S to T?
(b) If not, how could you modify the definition of [so as to get a legiti
mate function?
13. Let S be the set of all positive integers of the form 2
m
3
n
, where m > 0,
n > 0, and let T be the set of all rational numbers. Define [: S ~ T by
[(2
m
3
n
) = min. Prove that f defines a function from S to T. (On what
properties of the integers does this depend?)
14. Let [: S ~ S, where S is the set of all integers, be defined by [(s) =
as + b, where a, b are integers. Find the necessary and sufficient condi
tions on a, b in order that f 0 [= is.
15. Find all [of the form given in Problem 14 such that [0[0f = is.
16. If [is a 11 mapping of S onto itself, show that ([1)1 = f.
17. If S is a finite set having m > 0 elements, how many mappings are there
of S into itself?
18. In Problem 17, how many 11 mappings are there of S into itself?
19. Let S be the set of all real numbers, and define f: S ~ S by [(s)
S2 + as + b, where a, b are fixed real numbers. Prove that for no values
at a, b can f be onto or 11.
Sec. 3 Mappings 15
20. Let S be the set of all positive real numbers. For positive reals a, c and
nonnegative reals b, d, is it ever possible that the mapping f: S S de
fined by f(s) = (as + b)/(cs + d) satisfiesf
o
f= is? Find all such a, b, c, d
that do the trick.
21. Let S be the set of all rational numbers and let fa,b : S S be defined by
fa,b(S) = as + b, where a =1= 0, b are rational numbers. Find all fe,d of this
form fe, d0 fa, b = fa, b0 fe, dfor every fa, b·
22. Let S be the set of all integers and a, b, c rational numbers. Define
f: S S by f(s) = as
2
+ bs + c. Find necessary and sufficient conditions
on a, b, c, so that f defines a mapping on S [Note: a, b, c need not be inte
gers; for example, f(s) = + 1) = + does always give us an in
teger for integral s.]
Harder Problems
23. Let S be the set of all integers of the form 2
m
3
n
, m 2:: 0, n 2:: 0, and let T
be the set of all positive integers. Show that there is a 11 correspondence
of S onto T.
24. Prove that there is a 11 correspondence of the set of all positive integers
onto tITe: set of all positive rational numbers.
25. Let S be the set of all real numbers and T the set of all positive reals.
Find a 11 mapping f of S onto T such that f(S1 + S2) = f(S1) f(S2) for all
Sb S2 E S.
26. For the fin Problem 25, findf
1
explicitly.
27. If f, g are mappings of S into Sand fog is a constant function, then
(a) What can you say about f if g is onto?
(b) What can you say about g if f is II?
28. If S is a finite set and f is a mapping of S onto itself, show that f must be
11.
29. If S is a finite set and f is a 11 mapping of S into itself, show that f must
be surjective.
30. If S is a finite set and f is a 11 mapping of S, show that for some integer
n > 0,
f
ofofo . .. of = i
\ I s.
n times
31. If S has m elements in Problem 30, find an n > °(in terms of m) that
works simultaneously for all 11 mappings of S into itself.
16 Things Familiar and Less Familiar Ch. 1
4. A(S) (THE SET OF 11 MAPPINGS OF 5 ONTO ITSELF)
We focus our attention in this section on particularly nice mappings of a non
empty set, S, into itself. Namely, we shall consider the set, A(S), of all 11 map
pings of S onto itself. Although most of the concern in the book will be in the
case in which S is a finite set, we do not restrict ourselves to that situation here.
When S has a finite number of elements, say n, then A (S) has a special
name. It is called the symmetric group of degree n and is often denoted by Sn'
Its elements are called permutations of S. If we are interested in the structure
of Sn' it really does not matter much what our underlying set S is. So, you
can think of S as being the set {I, ... , n}. Chapter 3 will be devoted to a
study, in some depth, of Sn' In the investigation of finite groups, Sn plays a
central role.
There are many properties of the set A(S) on which we could concen
trate. We have chosen to develop those aspects here which will motivate the
notion of a group and which will give the reader some experience, and feel
ing for, working in a grouptheoretic framework. Groups will be discussed in
Chapter 2.
We begin with a result that is really a compendium of some of the re
sults obtained in Section 3.
Lemma 1.4.1. A (S) satisfies the following:
(a) f, g E A(S) implies thatfo g E A(S).
(b) f, g, h E A(S) implies that (fo g) 0 h = fo (g 0 h).
(c) There exists an elementthe identity mapping isuch that fo i
i
o
f= ffor every fE A(S).
(d) Given f E A(S), there exists agE A(S) (g = fl) such that fo g =
gof = i.
Proof All these things were done in Section 3, either in the text mate
rial or in the problems. We leave it to the reader to find the relevant part of
Section 3 that will verify each of the statements (a) through (d). D
We should now like to know how many elements there are in A(S)
when S is a finite set having n elements. To do so, we first make a slight di
gression.
Suppose that you can do a certain thing in r different ways and a sec
ond independent thing in s different ways. In how many distinct ways can
you do both things together? The best way of finding out is to picture this in
Sec. 4 A(S) (The Set of 11 Mappings of S Onto Itself) 17
a concrete context. Suppose that there are r highways running from Chicago
to Detroit and s highways running from Detroit to Ann Arbor. In how many
ways can you go first to Detroit, then to Ann Arbor? Clearly, for every road
you take from Chicago to Detroit you have s ways of continuing on to Ann
Arbor. You can start your trip from Chicago in r distinct ways, hence you
can complete it in
IS + S + S + ... + s, = rs
r times
different ways.
It is fairly clear that we can extend this from doing two independent
things to doing m independent ones, for an integer m > 2. If we can do the
first things in rI distinct ways, the second in r2 ways, , the mth in rm dis
tinct ways, then we can do all these together in rlr2 rm different ways.
Let's recall something many of us have already seen:
Definition. If n is a positive integer, then n! (read "n factorial") is de
fined by n! = 1 ·2 . 3 ... n.
Lemma 1.4.2. If S has n elements, then A(S) has n! elements.
Proof: Let f E A(S), where S = {Xb X2, ... , x
n
}. How many choices
does fhave as a place to send x I? Clearly n, for we can send x I under fto any
element of S. But now f is not free to send X2 anywhere, for since f is 11, we
must have f(XI) =1= f(X2). So we can send X2 anywhere except onto f(XI).
Hence f can send X2 into n  1 different images. Continuing this way, we see
that f can send Xi into n  (i  1) different images. Hence the number of
suchf's is n(n  1)(n  2) .. ·1 = n! D
Example
The number n! gets very large quickly. To be able to see the picture in its en
tirety, we look at the special case n = 3, where n! is still quite small.
Consider A(S) = S3' where Sconsists of the three elements Xl, X2, X3. We
list all the elements of S3' writing out each mapping explicitly by what it does
to each of Xl, X2, X3.
1.
2.
3. g: Xl X2, X2 Xb X3 X3·
4. gof:XI (Verify!)
18 Things Familiar and Less Familiar
5. fo g: Xl ~ X3, X2 ~ X2, X3 ~ Xl. (Verify!)
6. fo f : Xl ~ X3' X2 ~ Xl, X3 ~ X2. (Verify!)
Ch. 1
Since we have listed here six different elements of S3, and S3 has only
six elements, we have a complete list of all the elements of S3. What does
this list tell us? To begin with, we note that fo g =1= go f, so one familiar rule
of the kind of arithmetic we have been used to is violated. Since g E S3 and
g E S3 , we must have gog also in S3. What is it? If we calculate gog, we
easily get gog = i. Similarly, we get
(f 0 g) 0 (f 0 g) = i = (g 0 f) 0 (g 0 f).
Note also that fo (fo f) = i, hence f
l
= fo f. Finally, we leave it to the
reader to show that g 0 f = f
l
0 g.
It is a little cumbersome to write this product in A (S) using the o. From
now on we shall drop it and write fo g merely as fg. Also, we shall start using
the shorthand of exponents, to avoid expressions like fo fo fo ... 0 f. We de
fine, for f E A(S), fa = i, f2 = fo f = ff, and so on. For negative exponents
n we define fn by fn = (fl)n, where n is a positive integer. The usual
rules of exponents prevail, namely frfs = fr+s and (fr)s = frs. We leave
these as exercisessomewhat tedious ones at thatfor the reader.
Example
Do not jump to conclusions that all familiar properties of exponents go over.
For instance, in the example of the f, g E S3 defined above, we claim that
(fg)2 =1= f2g2. To see this, we note that
so that (fg)2: Xl ~ Xl, X2 ~ X2, X3 ~ X3, that is, (fg)2 = i. On the other
hand, f2 =1= i and g2 = i, hence f2
g
2 = f2 =1= i, whence (fg)2 =1= f2g2in this case.
However, some other familiar properties do go over. For instance, if
f, g, h are in A (S) and fg = fh, then g = h. Why? Because, from fg = fh we
have fl(fg) = fl(fh); therefore, g = ig = (fIf)g = fl(fg) = fl(fh) =
(fIf)h = ih = h. Similarly, gf = hf implies that g = h. So we can cancel an
element in such an equation provided that we do not change sides. In S3 our
f, g satisfy gf = flg, but since f =1= /1 we cannot cancel the g here.
PROBLEMS
Recall that fg stands for fo g and, also, what fm means. S, without subscripts,
will be a nonempty set.
Sec. 4
Easier Problems
A(S) (The Set of 11 Mappings of S Onto Itself) 19
1. If Sl =1= S2 are in S, show that there is anfE A(S) such thatf(sl) = S2.
2. If Sl E S, let H === {fE A(S) I f(Sl) = sd. Show that:
(a) i E H.
(b) If f, g E H, then fg E H.
(c) IffE H, thenf
1
E H.
3. Suppose that Sl =1= S2 are in Sand f(Sl) = S2, where f E A (S). Then if H is
as in Problem 2 and K = {g E A (S) Ig(S2) = S2}, show that:
(a) If g E K, then flgf E H.
(b) If h E H, then there is some g E K such that h = flgf.
4. Iff, g, h E A(S), show that (flgf)(f1hf) = fl(gh)f. What can you say
about (flgf)n?
5. If f, g E A (S) and fg = gf, show that:
(a) (fg)2 = f2
g
2.
(b) (fg)l = flgl.
6. Push the result of Problem 5, for the same f and g, to show that (fg)m =
fmgm for all integers m.
*7. Verify_ the rules of exponents, namely frfs = fr+s and (fr)s = frs for
f E A (5) and positive integers r, s.
8. If f, g E A (S) and (fg)2 = f2g2, prove that fg = gf.
9. If S = {x}, X2, X3' X4}, let f, g E S4 be defined by
and
Calculate:
(a) f2, f3, f4.
(b) g2, g3.
(c) fg.
(d) gf.
(e) (fg)3, (gf)3.
(f) f
1
, gl
10. IffE S3, show thatf6 = i.
11. Can you find a positive integer m such that fm = i for all f E S4?
20 Things Familiar and Less Familiar
MiddleLevel Problems
Ch. 1
*12. If f E Sn, show that there is some positive integer k, depending on f, such
that fk = i. (Hint: Consider the positive powers of f.)
*13. Show that there is a positive integer t such that ft = i for all f E Sn.
14. If m < n, show that there is a 11 mapping F: Sm ~ Sn such that F(fg) =
F(f)F(g) for alII, g E Sm.
15. If S has three or more elements, show that we can find f, g E A (S) such
that fg =1= gf.
16. Let S be an infinite set and let MeA(S) be the set of all elements
f E A(S) such that f(s) =1= s for at most a finite number of s E S. Prove
that:
(a) f, gEM implies that fg E M.
(b) f E M implies that f
l
E M.
17. For the situation in Problem 16, show, if f E A(S), that f
1
Mf =
{flgfl gEM} must equal M.
18. Let S ~ T and consider the subset VeT) = {f E A (S) If(t) E T for every
t E T}. Show that:
(a) i E VeT).
(b) f, g E VeT) implies thatfg E VeT).
19. If the S in Problem 18 has n elements and T has m elements, how
many elements are there in VeT)? Show that there is a mapping
F: VeT) ~ Sm such that F(fg) = F(f)F(g) for f, g E VeT) and F is onto
Sm·
20. If m < n, can F in Problem 19 ever be II? If so, when?
21. In Sn show that the mapping f defined by
[i.e., f(xJ = Xi+l if i < n, f(x
n
) = xtl can be written as f = gtg2 gnl
where each gi E Sn interchanges exactly two elements of S = {Xl, , x
n
},
leaving the other elements fixed in S.
Harder Problems
22. If f E Sn, show that f = h
l
h
2
••• h
m
for some h
j
E Sn such that h; = i.
*23. Call an element in Sn a transposition if it interchanges two elements, leav
ing the others fixed. Show that any element in Sn is a product of transpo
sitions. (This sharpens the result of Problem 22.)
24. If n is at least 3, show that for some fin Sn, f cannot be expressed in the
form f = g3 for any g in Sn.
Sec. 5 The Integers 21
25. If f E Sn is such that f =1= i but f3 = i, show that we can number the
elements of S in such a way that f(Xl) = X2, f(X2) = X3, f(X3) = Xl,
f(X4) = Xs, f(xs) = X6, f(X6) = X4, ... ,f(X3k+l) = X3k+2' f(X3k+2) = X3k+3,
f(X3k+3) = X3k+l for some k, and, for all the other X
t
E S, f(xt) = Xt.
26. View a fixed shuffle of a deck of 52 cards as a 11 mapping of the deck
onto itself. Show that repeating this fixed shuffle a finite (positive) num
ber of times will bring the deck back to its original order.
*27. If f E A(S), call, for s E S, the orbit of s (relative to f) the set O(s) =
{fj(s) I all integersj}. Show that if s, t E S, then either O(s) n OCt) = 0 or
O(s) = OCt).
28. If S = {Xl' X2, ... , X12} and f E S12 is defined by f(xt) = Xt+l if i = 1, 2, ... ,
11 and f(x 12) = Xl, find the orbits of all the elements of S (relative to f).
29. If f E A (S) satisfies f3 = i, show that the orbit of any element of S has
one or three elements.
*30. Recall that a prime number is an integer p>1 such that p cannot be fac
tored as a product of smaller positive integers. If f E A(S) satisfies fP = i,
what can you say about the size of the orbits of the elements of Srelative to
f? What property of the prime numbers are you using to get your answer?
31. Prove that if S has more than two elements, then the only elements fa in
A (S) ~ c h that fof = ffo for all f E A (S) must satisfy fa = i.
*32. We say that g E A(S) commutes with f E A(S) if fg = gf. Find all
the elements in A(S) that commute with f: S ~ S defined by
f (x 1) = X2 , f (X2) = Xl, and f (s) = s if s =1= Xb X2 .
33. In Sn show that the only elements commuting with f defined by f(Xi) =
Xi+1 i"f i < n, f(x
n
) = Xb are the powers of f, namely i = fa, f, f2, ... ,f
n

I
.
34. For f E A (S), let C(f) = {g E A (S) Ifg = gf}. Prove that:
(a) g, h E C(f) implies that gh E C(f).
(b) g E C(f) implies that gl E C(f).
(c) C(f) is not empty.
5. THE INTEGERS
The mathematical set most familiar to everybody is that of the positive inte
gers 1,2, ... , which we shall often call N. Equally familiar is the set, 7L, of all
integerspositive, negative, and zero. Because of this acquaintance with 7L,
we shall give here a rather sketchy survey of the properties of 7L that we shall
use often in the ensuing material. Most of these properties are well known to
all of us; a few are less well known.
The basic assumption we make about the set of integers is the
22 Things Familiar and Less Familiar Ch. 1
WellOrdering Principle. Any nonempty set of nonnegative integers
has a smallest member.
More formally, what this principle states is that given a nonempty set V
of nonnegative integers, there is an element Vo E V such that Vo ~ v for every
v E V. This principle will serve as the foundation for our ensuing discussion
of the integers.
The first application we make of it is to show something we all know
and have taken for granted, namely that we can divide one integer by an
other to get a remainder that is smaller. This is known as Euclid's Algorithm.
We give it a more formal statement and a proof based on wellordering.
Theorem 1.5.1 (Euclid's Algorithm). If m and n are integers with
n > 0, then there exist integers q and r, with 0 ::5 r < n, such that m = qn + r.
Proof. Let W be the set of m  tn, where t runs through all the
integers, i.e., W = {m  tn It E Z}. Note that W contains some nonnegative
integers, for if t is large enough and negative, then m  tn > O. Let
V = {v E W Iv 2:: O}; by the wellordering principle V has a smallest element,
r. Since rEV, r 2:: 0, and r = m  qn for some q (for that is the form of all
elements in W ~ V). We claim that r < n. If not, r = m  qn ~ n, hence
m  (q + l)n 2:: O. But this puts m  (q + l)n in V, yet m  (q + l)n < r,
contradicting the minimal nature of r in V. With this, Euclid's Algorithm is
proved. D
Euclid's Algorithm will have a host of consequences for us, especially
about the notion of divisibility. Since we are speaking about the integers,
be it understood that all letters used in this section will be integers. This will
save a lot of repetition of certain phrases.
Definition. Given integers m =1= 0 and n we say that m divides n, writ
ten as min, if n = cm for some integer c.
Thus, for instance, 2114, (7) 114, 41 (16). If min, we call m a divi
sor or factor of n, and n a multiple of m. To indicate that m is not a divisor of
n, we write m t n; so, for instance, 3 t 5.
The basic elementary properties of divisibility are laid out in
Lemma 1.5.2. The following are true:
(a) 11 n for all n.
(b) If m =1= 0, then m 10.
Sec. 5
(c) If min and n Iq, then m I q.
(d) If min and m Iq, then m I(un + vq) for all u, v.
(e) If mil, then m = 1 or m = 1.
(f) If min and n I m, then m = ±n.
The Integers 23
Proof The proofs of all these parts are easy, following immediately
from the definition of min. We leave all but Part (d) as exercises but prove
Part (d) here to give the flavor of how such proofs go.
So suppose that min and m Iq. Then n = cm and q = dm for some c
and d. Therefore, un + vq = u(cm) + v(dm) = (uc + vd)m. Thus, from the
definition, m I(un + vq). D
Having the concept of a divisor of an integer, we now want to introduce
that of the greatest common divisor of two (or more) integers. Simply
enough, this should be the largest possible integer that is a divisor of both in
tegers in question. However, we want to avoid using the size of an integer
for reasons that may become clear much later when we talk about rings. So
we make the definition in what may seem as a strange way.
Definition. Given a, b (not both 0), then their greatest common divi
sor c is defined by:
~
(a) c > o.
(b) c Ia and c lb.
(c) Ifdlaanddlb,thendlc.
We write this c as c = (a, b).
In other words, the greatest common divisor of a and b is the positive
number c which divides a and b and is divisible by every d which divides a
andb.
Defining something does not guarantee its existence. So it is incumbent
on us to prove that (a, b) exists, and is, in fact, unique. The proof actually
shows more, namely that (a, b) is a nice combination of a and b. This combi
nation is not unique; for instance,
(24, 9) = 3 = 3 . 9 + ( 1)24 = ( 5)9 + 2 . 24.
Theorem 1.5.3. If a, b are not both 0, then their greatest common divi
sor c = (a, b) exists, is unique, and, moreover, c = moa + nob for some suit
able mo and no.
24 Things Familiar and Less Familiar Ch. 1
Proof Since not both a and bare 0, the set A = {ma + nb Im, nEE}
has nonzero elements. If x E A and x < 0, then xis also in A and x> 0,
for if x = mta + ntb, then x = (mt)a + (nt)b, so is in A. Thus A has
positive elements; hence, by the wellordering principle there is a smallest
positive element, c, in A. Since C E A, by the form of the elements of A we
know that c = moa + nob for some mo, no.
We claim that c is our required greatest common divisor. First note that
if d Ia and d Ib, then d I (moa + nob) by Part (d) of Lemma 1.5.2, that is,
d Ic. So, to verify that c is our desired elenient, we need only show that c Ia
and c lb.
By Euclid's Algorithm, a = qc + r, where 0 ::5 r < c, that is, a =
q(moa + nob) + r. Therefore, r = qnob + (1  qmo)a. So r is in A. But
r < c and is in A, so by the choice of c, r cannot be positive. Hence r = 0; in
other words, a = qc and so cia. Similarly, c Ib.
For the uniqueness of c, if t > 0 also satisfied t Ia, t Iband d It for all d
such that d Ia and d Ib, we would have tic and cit. By Part (f) of Lemma
1.5.2 we get that t = c (since both are positive). D
Let's look at an explicit example, namely a = 24, b = 9. By direct exami
nation we know that (24, 9) = 3; note that 3 = 3· 9 + (1)24. What is
(24,9)?
How is this done for positive numbers a and b which may be quite large?
If b > a, interchange a and b so that a > b > O. Then we can find (a, b) by
1. observing that (a, b) = (b, r) where a = qb + r with a::5 r < b (Why?);
2. finding (b, r), which now is easier since one of the numbers is smaller
than before.
So, for example, we have
(100, 28) = (28, 16)
( 28, 16) = (16, 12)
( 16, 12) = (12, 4)
This gives us
(100, 28) = (12, 4) = 4.
since 100 = 3 (28) + 16
since 28 = 1 (16) + 12
since 16 = 1 (12) + 4
It is possible to find the actual values of mo and no such that
4 = mo 100 + no 28
Sec. 5 The Integers 25
by going backwards through the calculations made to find 4:
Since 16 = 1 (12) + 4,
Since 28 = 1 (16) + 12,
Since 100 = 3 (28) + 16,
But then
4= 16+(1)12
12= 28+(1)16
16 = 100 + (3) 28
4 = 16 + (1) 12 = 16 + (1)(28 + (1) 16)
= (1) 28 + (2) 16 = (1) 28 + (2)(100 + (3) 28)
= (2) 100 + (7) 28
so that mo = 2 and no = 7.
This shows how Euclid's Algorithm can be used to compute (a, b) for
any positive integers a and b.
We shall include some exercises at the end of this section on other
properties of (a, b).
We come to the very important
DefilJ!tion. We say that a and b are relatively prime if (a, b) = 1.
So the integers a and b are relatively prime if they have no nontrivial
common factor. An immediate corollary to Theorem 1.5.3 is
Theorem 1.5.4. The integers a and b are relatively prime if and only if
1 = rna + nb for suitable integers m and n.
Theorem 1.5.4 has an immediate consequence
Theorem 1.5.5. If a and b are relatively prime and a Ibc, then a Ic.
Proof By Theorem 1.5.4, ma + nb = 1 for some m and n, hence
(ma + nb)c = c, that is, mac + nbc = c. By assumption, a I bc and by obser
vation a I mac, hence a I (mac + nbc) and so a I c. D
Corollary. If band c are both relatively prime to a, then bc is also rel
,atively prime to a.
Proof We pick up the proof of Theorem 1.5.5 at mac + nbc = c. If d =
(a, bc), then d Ia and d Ibc, hence d I(mac + nbc) = c. Since d Ia and d Ic
26 Things Familiar and Less Familiar Ch. 1
and (a, c) = 1, we get that d = 1. Since 1 = d = (a, be), we have that be is
relatively prime to a. D
We now single out an ultraimportant class of positive integers, which
we met before in Problem 30, Section 4.
Definition. A prime number, or a prime, is an integer p > 1, such that
for any integer a either p Ia or p is relatively prime to a.
This definition coincides with the usual one, namely that we cannot fac
tor p nontrivial/yo For if p is a prime as defined above and p = ab where
1 ~ a < p, then (a, p) = a (Why?) and p does not divide a since p > a. It fol
lows that a = 1, so p = b. On the other hand, if p is a prime in the sense that
it cannot be factored nontrivially, and if a is an integer not relatively prime to
p, then (a, b) is not 1 and it divides a and p. But then (a, b) equals p, by our
hypothesis, so p divides a.
Another result coming out of Theorem 1.5.5 is
Theorem 1.5.6. If p is a prime and p I(ala2 ... an), then p Iaj for
some i with 1 $ i s; n.
Proof If p IaI, there is nothing to prove. Suppose that p {a1; then p
and al are relatively prime. But pial (a2 ... an), hence by Theorem 1.5.5,
p Ia2 ... an' Repeat the argument just given on az, and continue. 0
The primes playa very special role in the set of integers larger than 1 in
that every integer n > 1 is either a prime or is the product of primes. We
shall show this in the next theorem. In the theorem after the next we shall
show that there is a uniqueness about the way n > 1 factors into prime fac
tors. The proofs of both these results lean heavily on the wellordering prin
ciple.
Theorem 1.5.7. If n > 1, then either n is a prime or n is the product of
primes.
Proof. Suppose that the theorem is false. Then there must be an
intger m > 1 for which the theorem fails. Therefore, the set M for which the
theorem fails is nonempty, so, by the wellordering principle, M has a least
element m. Clearly, since m E M, m cannot be a prime, thus m = ab, where
1 < a < m and 1 < b < m. Because a < m and b < m and m is the least
element in M, we cannot have a E M or b E M. Since a $. M, b $. M, by the
definition of M the theorem must be true for both a and b. Thus a and bare
Sec. 5 The Integers 27
primes or the product of primes; from m = ab we get that m is a product
of primes. This puts m outside of M, contradicting that m E M. This proves
the theorem. D
We asserted above that there is a certain uniqueness about the decom
position of an integer into primes. We make this precise now. To avoid trivi
alities of the kind 6 = 2 . 3 = 3 . 2 (so, in a sense, 6 has two factorizations into
the primes 2 and 3), we shall state the theorem in a particular way.
Theorem 1.5.8. Given n > 1, then there is one and only one way to
write n in the form n = ... pC;/, where PI < P2 < ... < Pk are primes
and the exponents aI, a2, ... , ak are all positive.
Proof We start as we did above by assuming that the theorem is
false, so there is a least integer m > 1 for which it is false. This m must have
two distinct factorizations as m = ... p%k = ... where
PI < P2 < ... < Pk' ql < q2 < ... < qe are primes and where the exponents
ab ... , ak and bb ... , be are all positive. Since PI I .. · p%k = ...
by Theorem 1.5.6 PI Iq7
1
for some i; hence, again by Theorem 1.5.6, PI Iqi,
hence PI = qi' By the same token ql = Pj for some j; thus PI Pj =
ql qi = Pl' This gives us that PI = qi' Now since m/PI < m, m/PI
has the unique factorization property. But m/PI = ... p%k =
>:. and since m/PI can be factored in one and only one way in
this form, we easily get k = e, P2 = q2, ... ,Pk = qk, al  1 = b
i
 1,
a2 = b
2
, ... ,ak = b
k
. So we see that the primes and their exponents arising
in the factorization of m are unique. This contradicts the lack of such unique
ness for m, and so proves the theorem. D
What these last two theorems tell us is that we can build up the integers
from the primes in a very precise and welldefined manner. One would ex
pect from this that there should be manythat is, an infinityof primes.
This old result goes back to Euclid; in fact, the argument we shall give is due
to Euclid.
Theorem 1.5.9. There is an infinite number of primes.
Proof If the result were false, we could enumerate all the primes in
PI' P2' ... ,Pk' Consider the integer q = 1 + PIP2 ... Pk' Since q > Pi for
every i = 1,2, ... , k, q cannot be a prime. Since Pi q, for we get a remain
der of 1 on dividing q by Pi, q is not divisible by any of Pb' .. ,Pk' So q is
not a prime nor is it divisible by any prime. This violates Theorem 1.5.7,
thereby proving the theorem. D
28 Things Familiar and Less Familiar Ch. 1
Results much sharper than Theorem 1.5.9 exist about how many primes
there are up to a given point. The famous prime number theorem states that
for large n the number of primes less than or equal to n is "more or less"
n/log
e
n, where this "more or less" is precisely described. There are many
open questions about the prime numbers.
PROBLEMS
Easier Problems
1. Find (a, b) and express (a, b) as ma + nb for:
(a) (116, 84).
(b) (85, 65).
(c) (72, 26).
(d) (72, 25).
2. Prove all the parts of Lemma 1.5.2, except part (d).
3. Show that (ma, mb) = mea, b) if m > o.
4. Show that if a Im and b Im and (a, b) = 1, then (ab) 1m.
5. Factor the following into primes.
(a) 36.
(b) 120.
(c) 720.
(d) 5040.
6. If m = p ~ l ... p%k and n = pr
1
••• pZ\ where p t , ••. , Pk are distinct
primes and at, ... , ak are nonnegative and b
t
, ... , b
k
are nonnegative,
express (m, n) as p ~ l ... p ~ k by describing the c's in terms of the a's and
b's.
* 7. Define the least common multiple of positive integers m and n to be the
smallest positive integer v such that both m Iv and n Iv.
(a) Show that v = mn/(m, n).
(b) In terms of the factorization of m and n given in Problem 6, what is
v?
8. Find the least common multiple of the pairs given in Problem 1.
9. If m, n > 0 are two integers, show that we can find integers u, v with
n/2 ~ v ~ n/2 such that m = un + v.
10. To check that a given integer n > 1 is a prime, prove that it is enough to
show that n is not divisible by any prime p with p ~ yj,.
Sec. 6 Mathematical Induction 29
11. Check if the following are prime.
(a) 301.
(b) 1001.
(c) 473.
12. Starting with 2, 3, 5, 7, , construct the positive integers 1 + 2· 3,
1 + 2 ·3 ·5, 1 + 2 . 3 ·5 . 7, Do you always get a prime number this
way?
MiddleLevel Problems
13. If P is an odd prime, show that p is of the form:
(a) 4n + 1 or 4n + 3 for some n.
(b) 6n + 1 or 6n + 5 for some n.
14. Adapt the proof of Theorem 1.5.9 to prove:
(a) There is an infinite number of primes of the form 4n + 3.
(b) There is an infinite number of primes of the form 6n + 5.
15. Show that no integer u = 4n + 3 can be written as u = a
2
+ b
2
, where
a, b are integers.
16. If T is an infinite subset of N, the set of all positive integers, show that
there is a 11 mapping of Tonto N.
17. If p is_a prime, prove that one cannot find nonzero integers a and b such
that a 2 ~ pb
2
. (This shows that vIP is irrational.)
6. MATHEMATICAL INDUCTION
If we look back at Section 5, we see that at several placesfor instance, in
the proof of Theorem 1.5.6we say "argue as above and continue." This is
not very satisfactory as a means of nailing down an argument. What is clear
is that we need some technique of avoiding such phrases when we want to
prove a proposition about all the positive integers. This is provided for us by
the Principle of Mathematical Induction; in fact, this will be the usual method
that we shall use for proving theorems about all the positive integers.
Theorem 1.6.1. Let pen) be a statement about the positive integers
such that:
(a) P(l) is true.
(b) If P(k) happens to be true for some integer k 2:: 1, then P(k + 1) is also
true.
Then P(n) is true for all n 2:: 1.
30 Things Familiar and Less Familiar Ch. 1
Proof Actually, the arguments given in proving Theorems 1.5.7 and
1.5.8 are a prototype of the argument we give here.
Suppose that the theorem is false; then, by wellordering, there is a
least integer m 2:: 1 for which P(m) is not true. Since P(l) is true, m =I= 1,
hence m > 1. Now 1 ~ m  1 < m, so by the choice of m, P(m  1) must be
valid. But then by the inductive hypothesis [Part (b)] we must have that P(m)
is true. This contradicts that P(m) is not true. Thus there can be no integer
for which P is not true, and so the theorem is proved. D
We illustrate how to use induction with some rather diverse examples.
Examples
1. Suppose that n tennis balls are put in a straight line, touching each other.
Then we claim that these balls make n  1 contacts.
Proof If n = 2, the matter is clear. If for k balls we have k  1 con
tacts, then adding one ball (on aline) adds one contact. So k + 1 balls would
have k contacts. So if pen) is what is stated above about the tennis balls, we
see that if P(k) happens to be true, then so is P(k + 1). Thus, by the theo
rem, P(n) is true for all n 2:: 1. D
2. If p is a prime and p Iala2 ... an' then p Iai for some 1 ~ i ~ n.
Proof Let pen) be the statement in Example 2. Then P(l) is true, for
if pial, it certainly divides ai for some 1 ::::; i ~ 1.
Suppose we know that P(k) is true, and that p Iala2 ... ak ak+l . Thus,
by Theorem 1.5.6, since p I(ala2 ... ak)ak+l either p Iak+l (a desired con
clusion) or pial· .. ak. In this second possibility, since P(k) is true we have
that p Iai for some 1 :5 i ::::; k. Combining both possibilities, we get that p Iaj
for some 1 ~ j ~ k + 1. So Part (b) of Theorem 1.6.1 holds; hence pen) is
true for all n 2:: 1. 0
3. For n 2:: 1,1 + 2 + ... + n = ~ n ( n + 1).
Proof If pen) is the proposition that 1 + 2 + ... + n = ~ n ( n + 1), then
P(l) is certainly true, for 1 = ~ (1 + 1). If P(k) should be true, this means that
1 + 2 + ... + k = ~ k ( k + 1).
The question is: Is P(k + 1) then also true, that is, is 1 + 2 + ... + k +
(k + 1) = ~ (k + l)((k + 1) + I)? Now 1 + 2 + ... + k + (k + 1) =
(1 + 2 + ... k) + (k + 1) = ~ k ( k + 1) + (k + 1), since P(k) is valid. But
~ k ( k + 1) + (k + 1) = ~ ( k ( k + 1) + 2(k + 1» = ~ ( k + l)(k + 2), which
assures us that P(k + 1) is true. Thus the proposition 1 + 2 + ... + n =
~ n(n + 1) is true for all n 2:: 1.0
Sec. 6 Mathematical Induction 31
We must emphasize one point here: Mathematical induction is not a
method for finding results about integers; it is a means of verifying a result.
We could, by other means, find the formula given above for 1 + 2 + ... + n.
Part (b) of Theorem 1.6.1 is usually called the induction step.
In the problems we shall give some other versions of the principle of in
duction.
PROBLEMS
Easier Problems
1. Prove that 1
2
+ 2
2
+ 3
2
+ ... + n
2
= ~ n ( n + 1)(2n + 1) by induction.
2. Prove that 1
3
+ 2
3
+ ... + n
3
= ~ n 2 ( n + 1)2 by induction.
3. Prove that a set having n ~ 2 elements has ~ n ( n  1) subsets having ex
actly two elements.
4. Prove that a set having n ~ 3 elements has n(n  l)(n  2)/3! subsets
having exactly three elements.
5. If n ;::: 4 and S is a set having n elements, guess (from Problems 3 and 4)
how many subsets having exactly 4 elements there are in S. Then verify
your guess using mathematical induction.
*6. Complete the proof of Theorem 1.5.6, replacing the last sentence by an
induction argument.
7. If a =1= 1, prove that 1 + a + a
2
+ ... + an = (a
n
+
1
 l)/(a  1) by induc
tion.
8. By induction, show that
1 + 1 + ... + 1 n
N 2 . 3 n(n + 1) n + 1 .
* 9. Suppose that P(n) is a proposition about positive integers n such that
P(no) is valid, and if P(k) is true, so must P(k + 1) be. What can you say
about pen)? Prove your statement.
*10. Let pen) be a proposition about integers n such that P(l) is true and
such that if P(j) is true for all positive integers j < k, then P(k) is true.
Prove that P(n) is true for all positive integers n.
MiddleLevel Problems
11. Give an example of a proposition that is not true for any positive integer,
yet for which the induction step [Part (b) of Theorem 1.6.1] holds.
12. Prove by induction that a set having n elements has exactly 2n subsets.
32 Things Familiar and Less Familiar Ch. 1
13. Prove by induction on n that n
3
 n is always divisible by 3.
14. Using induction on n, generalize the result in Problem 13 to: If p is a prime
number, then n
P
 n is always divisible by p. (Hint: The binomial theorem.)
15. Prove by induction that for a set having n elements the number of 11
mappings of this set onto itself is n!.
7. COMPLEX NUMBERS
We all know something about the integers, rational numbers, and real num
bersindeed, this assumption has been made for some of the text material
and many of the problems have referred to these numbers. Unfortunately,
the complex numbers and their properties are much less known to present
day college students. At one time the complex numbers were a part of the
high school curriculum and the early college one. This is no longer the case.
So we shall do a rapid development of this very important mathematical set.
The set of complex numbers, C, is the set of all a + bi, where a, bare
real and where we declare:
1. a + bi = c + di, for a, b, c, d real, if and only if a = c and b = d.
2. (a + bi) ± (c + di) = (a ± c) + (b ± d)i.
3. (a + bi)(c + di) = (ac  bd) + (ad + bc)i.
This last propertymultiplicationcan best be remembered by using
i
2
= 1 and multiplying out formally with this relation in mind.
For the complex number z = a + bi, a is called the real part of z and b
the imaginary part of z. If a is 0, we call z purely imaginary.
We shall write 0 + Oi as 0 and a + Oi as a. Note that z + 0 = z, z1 = z
for any complex number z.
Given z = a + bi, there is a complex number related to z, which we
write as Z, defined by Z = a  bi. This complex number, Z, is called the
complex conjugate of z. Taking the complex conjugate gives us a mapping
of C onto itself. We claim
Lemma 1.7.1. If z, w E C, then:
(a) (Z) = z.
(b) (z + w) = Z + w.
(c) (zw) = zw
(d) zz is real and nonnegative and is, in fact, positive if z =1= O.
Sec. 7 Complex Numbers 33
(e) z + z is twice the real part of z.
(f) z  z is twice the imaginary part of z times i.
Proof Most of the parts of this lemma are straightforward and merely in
volve using the definition of complex conjugate. We do verify Parts (c) and (d).
Suppose that z = a + bi, W = e + di, where a, b, e, d are real. So zw =
(ae  bd) + (ad + be)i, hence
(zw) = (ae  bd) + (ad + be)i = (ae  bd)  (ad + be)i.
On the other hand, z = a  bi and W = e  di, hence, by the definition of the
product in C, z W = (ae  bd)  (ad + be) i. Comparing this with the result
that we obtained for (zw), we see that indeed (zw) = Zw. This verifies Part (c).
We go next to the proof of Part (d). Suppose that z = a + bi =1= 0; then
z = a  bi and zz = a
2
+ b
2
• Since a, b are real and not both 0, a
2
+ b
2
is
real and positive, as asserted in Part (d). D
The proof of Part (d) of Lemma 1.7.1 shows that if z = a + bi =1= 0, then
zz = a
2
+ b
2
¥Oandz(zl(a
2
+ b
2
» = 1,so
acts like the inverse liz of z. This allows us to carry out division in C, staying
in C while doing so.
We now list a few properties of C.
Lemma 1.7.2. C behaves under its sum and product according to the
following: If u, v, w E C, then
(a) u + v = v + u.
(b) (u + v) + w = u + (v + w).
(c) uv = VUe
(d) (uv)w = u(vw).
(e) u =1= 0 implies that u
1
= 11u exists in C such that uu
1
= 1.
Proof We leave the proofs of these various parts to the reader. 0
These properties of C make of C what we shall call a field, which we
shall study in much greater depth later in the book. What the lemma says is
that we are allowed to calculate in C more or less as we did with real num
bers. However, C has a much richer structure than the set of real numbers.
34 Things Familiar and Less Familiar
We now introduce a "size" function on C.
Ch. 1
Definition. If z = a + bi E C, then the absolute value of z, written as
Izl, is defined by Izi = v.a = Ya
2
+ b
2
•
We shall see, in a few moments, what this last definition means geomet
rically. In the meantime we prove
Lemma 1.7.3. If u, v E C, then luvl = lullvl.
Proof By definition, Iu I = ~ and Iv I = ~ . Now
luvl = Y(uv) (uv) = Y(uv) (uv) (by Part (c) of Lemma 1.7.1)
= Y(uu)(vv) (by Lemma 1.7.2)
= ~ ~ = lullvl·D
Another way of verifying this lemma is to write u = a + bi, v = e + di,
uv = (ae  bd) + (ad + bc)i and to note the identity
(ae  bd)2 + (ad + be)2 = (a
2
+ b
2
)(e
2
+ d
2
).
Note several small points about conjugates. If z E C, then z is real if
and only if z = z, and z is purely imaginary if and only if z =  z. If
z, w E C, then

(zW + zw) = zW + zW = zw + zW,
so zW + ZW is real. We want to get an upper bound for Izw + zwl; this will
come up in the proof of Theorem 1.7.5 below.
But first we must digress for a moment to obtain a statement about
quadratic expressions.
Lemma 1.7.4. Let a, b, e be real, with a > O. If aa
2
+ ba + e 2:: 0 for
every real a, then b
2
 4ae :5 O.
Proof Consider the quadratic expression for a = bI2a. We get
a( bI2a)2 + b( bI2a) + e 2:: O. Simplifying this, we obtain that (4ae  b
2
)/4a
~ 0, and since a > 0, we end up with 4ae  b
2
~ 0, and so b
2
 4ae :5 O. D
We use this result immediately to prove the important
Theorem 1.7.5 (Triangle Inequality). For z, w E C, Iz + wI:5 Iz I+ IwI·
Proof If z = 0, there is nothing to prove, so we may assume that z =I= 0;
thus zz > O. Now, for a real,
Sec. 7 Complex Numbers
0::::; laz + wl
2
= (az + w) (az + w) = (az + w)(az + W)
= a
2
zz + a(zW + ZW) + wW.
35
If a = ZZ > 0, b = zW + zw, c = wW, then Lemma 1.7.4 tells us that
b
2
 4ac = (zw + ZW)2  4(zz)(ww) ::::; 0, hence (zw + ZW)2 ::::; 4(zz)(ww) =
41z1
2
1w1
2
• Therefore, zW + zw ::::; 2lzllwl.
For a = 1 above,
Iz + wl
2
= zz + wW + zW + zw = Izl
2
+ Iwl
2
+ zW + zw
::::; Izl
2
+ Iwl
2
+ 21z1 Iwl
from the result above. In other words, Iz + W 1
2
::::; (I z I + IwI)2; taking square
roots we get the desired result, Iz + wi::::; Iz I + Iwi. 0
Why is this result called the triangle inequality? The reason will be clear
once we view the complex numbers geometrically. Represent the complex
number z = a + bi as the point having coordinates (a, b) in the xy plane.
The distance r of this point from the origin is Va
2
+ b
2
, in other words, Izl.
The angle () is called the argument of z and, as we see, tan () = b/a. Also,
a = r cos (), b = r sin (); therefore, z = a + bi = r (cos () + i sin (). This rep
resentation of z is called its polar form.
Given z = a + bi, w = c + di, then their sum is z + w = (a + c) +
(b + d)i. Geometrically, we have the picture:
(a +c, b+d)
36 Things Familiar and Less Familiar Ch. 1
where n ~ 1 is an integer.
The statement Iz + wi::::; Izi + Iw I merely reflects the fact that in a triangle
one side is of smaller length than the sum of the lengths of the other two
sides; thus the term triangle inequality.
The complex numbers that come up in the polar form cos () + i sin ()
are very interesting numbers indeed. Specifically,
Icos () + isin ()j = Vcos
2
() + sin
2
() = Vi = 1,
so they give us many complex numbers of absolute value 1. In truth they give
us all the complex numbers of absolute value 1; to see this just go back and
look at the polar form of such a number.
Let's recall two basic identities from trigonometry, cos( () + t/J) =
cos () cos t/J  sin () sin t/J and sine() + t/J) = sin () cos t/J + cos () sin t/J. There
fore, if z = r(cos () + i sin () and w = s(cos t/J + i sin t/J), then
zw = r(cos () + i sin () . s(cos t/J + i sin t/J)
= rs(cos () cos t/J  sin () sin t/J) + i rs(sin () cos t/J + cos () sin t/J)
= rs[cos( () + t/J) + i sine() + t/J)].
Thus, in multiplying two complex numbers, the argument of the product is
the sum of the arguments of the factors.
This has another very interesting consequence.
Theorem 1.7.6 (De Moivre's Theorem). For any integer n ;::: 1,
[r(cos () + isin ()]n = rn[cos(n() + isin(n()].
Proof We proceed by induction on n. If n = 1, the statement is
obviously true. Assume then that for some k, [r(cos () + i sin ()]k
r
k
[cos k() + i sin k()]. Thus
[r(cos () + i sin ()]k+l = [r(cos () + i sin ()]k . r(cos () + i sin ()
= r
k
( cos k() + i sin k() . r(cos () + i sin ()
= r k + 1[ cos (k + 1) () + i sin(k + 1) ()]
by the result of the paragraph above. This completes the induction step;
hence the result is true for all integers n ;::: 1. 0
In the problems we shall see that De Moivre's Theorem is true for all
integers m; in fact, it is true even if m is rational.
Consider the following special case:
()
21T. . 21T
n = cos  + l sIn ,
n n
Sec. 7
By De Moivre's Theorem,
Complex Numbers 37
( (
21T) .' (21T))n
cos n + l sIn n
= cos (n en
7T
)) + i sin (n (2
n
7T))
= cos 21T + i sin 21T = 1.
So = 1; you can verify that 0':: =1= 1 if 0 < m < n. This property of On makes
it one of the primitive nth roots of unity, which will be encountered in Prob
lem 26.
PROBLEMS
Easier Problems
1. Multiply.
(a) (6  7i)(8 + i).
(b) + 
(c) (6 + 7i)(8  i).
2. Express Z1 in the form Z1 = a + bi for:
(a) z = 6 + 8i.
(b) z = 6
1
8i. 1 .
(c) z = v2 + v2
l
.
*3. Show that (Z)l = (Zl).
4. Find (cos 0 + i sin 0)1.
5. Verify parts a, b, e, fof Lemma 1.7.1.
*6. show that z is real if and only if z = z, and is purely imaginary if and
only if z =  z.
7. VerIfy the commutative law of multiplication zw = wz in C.
8. Show that for z =1= 0, Iz
1
1= 1/1zl.
9. Find:
(a) 16  4il·
(b) +
(c) +
10. Show that Izi = Izi.
38 Things Familiar and Less Familiar Ch. 1
11. Find the polar form for
)
V2 1.
(a z == "2"  V2 l.
(b) z == 4i.
(
66.
c) z == V2 + V2 l.
(d) z ==  13 + 39_ i.
2 2V3
12. Prove that 0) + i 0»2 == cos 0 + i sin o.
13. By direct mUltiplication show that + i)3 == 1.
MiddleLevel Problems
14. Show that (cos 0 + isin o)'n == cos(mO) + isin(mO) for all integers m.
15. Show that (cos 0 + i sin o)r == cos (rO) + i sin(rO) for all rational num
bers r.
16. If z E C and n ;::: 1 is any positive integer, show that there are n distinct
complex numbers w such that z == w
n
•
17. Find the necessary and sufficient condition on k such that:
( (
27Tk) .. (27Tk))n 1
cos n + 1 SIn n ==
( (
27Tk) .. (27Tk))rn 1
cos n + 1 SIn n ¥
and
if 0 < m < n.
18. Viewing the xy plane as the set of all complex numbers x + iy, show that
multiplication by i induces a 90° rotation of the xy plane in a counter
clockwise direction.
19. In Problem 18, interpret geometrically what multiplication by the com
plex number a + bi does to the xy plane.
*20. Prove that Iz + w 1
2
+ Iz  wl
2
== 2(lz/
2
+ IwI
2
).
21. Consider the set A == {a + bi Ia, b E Z}. Prove that there is a 11 corre
spondence of A onto N. (A is called the set of Gaussian integers.)
22. If a is a (complex) root of the polynomial
where the (Xi are real, show that a must also be a root. [r is a root of a
polynomialp(x) ifp(r) == 0.]
Sec. 7
Harder Problems
Complex Numbers 39
23. Find the necessary and sufficient conditions on z and w in order that
Iz + wi = Izl + Iwl·
24. Find the necessary and sufficient conditions on Z1, ... , Zk in order that
IZI + ... + zkl = IZII + ... + IZkl·
*25. The complex number () is said to have order n ;::: 1 if ()n = 1 and ()m =1= 1
for 0 < m < n. Show that if () has order nand ()k = 1, where k > 0, then
n I k.
*26. Find all complex numbers () having order n. (These are the primitive nth
roots of unity.)
2
GROUPS
1. DEFINITIONS AND EXAMPLES OF GROUPS
We have seen in Section 4 of Chapter 1 that given any nonempty set, the set
A (S) of all 11 mappings of S onto itself is not just a set alone, but has a far
richer texture. The possibility of combining two elements of A(S) to get
yet another element of A(S) endows A (S) with an algebraic structure. We
recall how this was done: If f, g E A (S), then we combine them to form the
mapping fg defined by (fg)(s) = f(g(s» for every s E S. We called fg the
product of f and g, and verified that fg E A (S), and that this product obeyed
certain rules. From the myriad of possibilities we somehow selected four par
ticular rules that govern the behavior of A (S) relative to this product.
These four rules were
1. Closure, namely if f, g E A(S), then fg E A(S). We say that A(S) is
closed under this product.
2. Associativity, that is, givenf, g, h E A(S), thenf(gh) = (fg)h.
3. Existence of a unit element, namely, there exists a particular element
i E A (S) (the identity mapping) such that fi = if = f for all f E A (S).
4. Existence of inverses, that is, given f E A (S) there exists an element,
denoted by fI, in A(S) such that ff
1
= flf = i.
To justify or motivate why these four specific attributes of A (S) were
singled out, in contradistinction to some other set of properties, is not easy to
40
Sec. 1 Definitions and Examples of Groups 41
do. In fact, in the history of the subject it took quite some time to recognize
that these four properties played the key role. We have the advantage of his
torical hindsight, and with this hindsight we choose them not only to study
A (S), but also as the chief guidelines for abstracting to a much wider context.
Although we saw that the four properties above enabled us to calculate
concretely in A (S), there were some differences with the kind of calculations
we are used to. If S has three or more elements, we saw in Problem 15,
Chapter 1, Section 4 that it is possible for f, g E A (S) to have fg =1= gf. How
ever, this did not present us with insurmountable difficulties.
Without any further polemics we go to the
Definition. A nonempty set G is said to be a group if in G there is de
fined an operation * such that:
(a) a, bEG implies that a * bEG. (We describe this by saying that G is
closed under *.)
(b) Given a, b, c E G, then a * (b * c) = (a * b) * c. (This is described by
saying that the a$sociative law holds in G.)
(c) There exists a special element e E G such that a * e = e * a = a for all
a E G (e is called the identity or unit element of G).
(d) For every a E G there exists an element bEG such that a * b =
b * a = e. (We write this element b as aI and call it the inverse of
a in G.)
These four defining postulates (called the group axioms) for a group
were, after all, patterned after those that hold in A (S). So it is not surprising
that A(S) is a group relative to the operation "composition of mappings."
The operation * in G is usually called the product, but keep in mind
that this has nothing to do with product as we know it for the integers, ratio
nals, reals, or complexes. In fact, as we shall see below, in many familiar ex
amples of groups that come from numbers, what we call the product in these
groups is actually the addition of numbers. However, a general group need
have no relation whatsoever to a set of numbers. We reiterate: A group is no
more, no less, than a nonempty set with an operation * satisfying the four
group axioms.
Before starting to look into the nature of groups, we look at some ex
amples.
Examples of Groups
1. Let 7L be the set of all integers and let * be the ordinary addition, +, in 7L.
That 7L is closed and associative under * are basic properties of the integers.
What serves as the unit element, e, of 7L under *? Clearly, since a = a * e =
42 Groups Ch.2
a + e, we have e = 0, and 0 is the required identity element under addition.
What about aI? Here too, since e = 0 = a * aI = a + aI, the aI in this in
stance is a, and clearly a * (a) = a + (a) = o.
2. Let 0 be the set of all rational numbers and let the operation * on 0 be
the ordinary addition of rational numbers. As above, 0 is easily shown to be
a group under *. Note that 7L C 0 and both 7L and 0 are groups under the
same operation *.
3. Let 0' be the set of all nonzero rational numbers and let the opera
tion * on 0' be the ordinary multiplication of rational numbers. By the fa
miliar properties of the rational numbers we see that 0' forms a group t:ela
tive to *.
4. Let IR+ be the set of all positive real numbers and let the operation * on IR +
be the ordinary product of real numbers. Again it is easy to check that IR + is
a group under *.
5. Let En be the set of O ~ , i = 0, 1, 2, ... , n  1, where On is the complex num
ber On = Cos(27T/n) + i sin(27T/n). Let O ~ * 0h = O ~ + j , the ordinary product of
the powers of On as complex numbers. By De Moivre's Theorem we saw that
O ~ = 1. We leave it to the reader to verify that En is a group under *. The ele
ments of En are called the n'th roots of unity. The picture below illustrates the
group £6, whose elements are represented by the dots on the unit circle in the
complex plane.
Note one striking difference between the Examples 1 to 4 and Example
5; the first four have an infinite number of elements, whereas En has a finite
number, n, of elements.
Definition. A group G is said to be a finite group if it has a finite
number of elements. The number of elements in G is called the order of G and
is denoted by IGI.
Sec. 1 Definitions and Examples of Groups 43
Thus En above is a finite group, and IEn I = n.
All the examples presented above satisfy the additional property that
a * b = b * a for any pair of elements. This need not be true in a group. Just
witness the case of A (S), where S has three or more elements; there we saw
that we could find f, g E A (S) such that fg =1= gf.
This prompts us to single out as special those groups of G in which
a * b = b * a for all a, bEG.
Definition. A group G is said to be abelian if a * b = b * a for all
a,b E G.
The word abelian derives from the name of the great Norwegian mathematician
Niels Henrik Abel (18021829), one of the greatest scientists Norway has ever
produced.
A group that is not abelian is called nonabelian, a not too surprising
choice of name.
We now give examples of some nonabelian groups. Of course, the
A(S) afford us an infinite family of such. But we present a few other exam
ples in which we can compute quite readily.
6. Let IR be the set of all real numbers, and let G be the set of all mappings
Ta,b: IR ~ IR defined by Ta,b(r) = ar + b for any real number r, where a, b
are real numbers and a =1= o. Thus, for instance, T
S
,6 is such that T
S
,6(r) =
5r  6; T
s
,6(14) = 5 . 14  6 = 64, T
S
,6(1T) = 571"  6. The Ta,b are 11 map
pings of IR onto itself, and we let T
a
, b * T
e
, d be the product of two of these
mappings. So
(Ta,b * Te,d)(r) = Ta,b(Te,d(r» = aTe,d(r) + b = a(er + d) + b
= (ae)r + (ad + b) = Tae,ad+b(r).
So we have the formula
Ta,b * Te,d = Tae,ad+b·
(1)
This result shows us that Ta,b * Te,d is in Gfor it satisfies the membership
requirement for belonging to Gso G is closed under *. Since we are talking
about the product of mappings (i.e., the composition of mappings), * is
associative. The element T1,o = i is the identity mapping of IR onto itself.
Finally, what is T ~ , l b ? Can we find real numbers x =1= 0 and y, such that
44 Groups Ch.2
Go back to (1) above; we thus want Tax,ay+b = TI,o, that is, ax = 1, ay + b = 0.
Remember now that a =I= 0, so if we put x = aI and y = a1b, the required
relations are satisfied. One verifies immediately that
So G is indeed a group.
What is T
c
, d * T
a
, b? According to the formula given in (1), where we re
place a bye, e by a, b by d, d by b, we get
(2)
Thus Tc,d * Ta,b = if Ta,b * Tc,d and only if be + d = ad + b. This fails to be
true, for instance, if a = 1, b = 1, e = 2, d = 3. So G is nonabelian.
7. Let H C G, where G is the group in Example 6, and H is defined by
H = {T
a
, bEG I a is rational, b any real}. We leave it to the reader to verify
that H is a group under the operation * defined on G. H is nonabelian.
8. Let K C He G, where H, G are as above and K = {TI,b E G Ib any real}.
The reader should check that K is a group relative to the operation * of G,
and that K is, however, abelian.
9. Let S be the plane, that is, S = {(x, y) Ix, y real} and consider f, g E A(S)
defined by f (x, y) = (x, y) and g(x, y) = (y, x); f is the reflection about
the yaxis and g is the rotation through 90
0
in a counterclockwise direction
about the origin. We then define G = {fig j Ii = 0, 1; j = 0, 1, 2, 3}, and let *
in G be the product of elements in A(S). Clearly, f2 = g4 = identity mapping;
(f* g)(x, y) = (fg)(x, y) = f(g(x, y» = f(  y, x) = (y, x)
and
(g*f)(x,y) = g(f(x,y» = g(x,y) = (y, x).
So g * f =1= f * g. We leave it to the reader to verify that g * f = f * gl and G
is a nonabelian group of order 8. This group is called the dihedral group of
order 8. [Try to find a formula for (fig
j
) * (fSgt) = fagb that expresses a, b
in terms of i, j, s, and t.]
10. Let S be as in Example 9 and f the mapping in Example 9. Let n > 2 and
let h be the rotation of the plane about the origin through an angle of 27T/n
in the counterclockwise direction. We then define G = {fkh
j
Ik = 0, 1;
j = 0, 1,2, ... , n  I} and define the product * in G via the usual product of
mappings. One can verify that f2 = h
n
= identity mapping, and fh = hIr
Sec. 1 Definitions and Examples of Groups 45
These relations allow us to show (with some effort) that G is a nonabelian
group of order 2n. G is called the dihedral group of order 2n.
11. Let G = {f E A (S) If(s) =1= s for only a finite number of s E S}, where we
suppose that S is an infinite set. We claim that G is a group under the product
* in A (S). The associativity holds automatically in G, since it already holds in
A(S). Also, i E G, since i(s) = s for all s E S. So we must show that G is
closed under the product and if f E G, then f
1
E G.
We first dispose of the closure. Suppose that f, g E G; then f(s) = s
except, say, for SI, S2, ••• 'Sn and g (s) = s except for s{, s ~ , , s ~ . Then
(fg)(s) = f(g(s» = s for all s other than SI, S2, ••• , Sn, s{, ,s/n (and
possibly even for some of these). So fg moves only a finite number of ele
ments of S, so fg E G.
Finally, if f(s) = s for all s other than Sb S2' ••• , Sn, then fl(f(s» =
f
1
(s), but fl(S) = fl(f(s» = (flf)(s) = i(s) = s. So we obtain that
f
1
(s) = s for all s except SI, ••• 'Sn. Thus f
1
E G and G satisfies all the group
axioms, hence G is a group.
12. Let G be the set of all mappings T(j, where To is the rotation of a given cir
cle about its center through an angle () in the clockwise direction. In G define
* by the composition of mappings. Since, as is readily verified, To * T", = T0+'" '
G is closed under *. The other group axioms check out easily. Note that
121T = To = the identity mapping, and r;l = T
o
= 121To. G is an abelian group.
As we did for A (S) we introduce the shorthand notation an for
,a * a * a·· · * a
n times
and define a
n
= (a
1
)n, for n a positive integer, and aO = e. The usual rules
of exponents then hold, that is, (am)n = a
mn
and am * an = a
m
+
n
for any inte
gers m and n.
Note that with this notation, if G is the group of integers under +, then
an is really na.
Having seen the 12 examples of groups above, the reader might get the
impression that all, or almost all, sets with some operation * form groups.
This is far from true. We now give some examples of nongroups. In each case
we check the four group axioms and see which of these fail to hold.
Nonexamples
1. Let G be the set of all integers, and let * be the ordinary product of inte
gers in G. Since a * b = ab, for a, bEG we clearly have that G is closed and
associative relative to *. Furthermore, the number 1 serves as the unit ele
46 Groups Ch.2
ment, since a * 1 = a1 = a = 1a = 1 * a for every a E G. So we are three
fourths of the way to proving that G is a group. All we need is inverses for
the elements of G, relative to *, to lie in G. But this just isn't so. Clearly, we
cannot find an integer b such that 0 * b = Ob = 1, since Ob = 0 for all b. But
even other integers fail to have inverses in G. For instance, we cannot find an
integer b such that 3 * b = 1 (for this would require that b = ~ , and ~ is not
an integer).
2. Let G be the set of all nonzero real numbers and define, for a, bEG, a * b
= a
2
b; thus 4 * 5 = 4
2
(5) = 80. Which of the group axioms hold in Gunder
this operation * and which fail to hold? Certainly, G is closed under *. Is *
associative? If so, (a * b) * c = a * (b * c), that is, (a * b)2C= a
2
(b * c), and
so (a
2
b)2c = a
2
(b
2
c), which boils down to a
2
= 1, which holds only for
a = ±1. So, in general, the associative law does not hold in G relative to *.
We similarly can verify that G does not have a unit element. Thus even to
discuss inverses relative to * would not make sense.
3. Let G be the set of all positive integers, under * where a * b = ab, the
ordinary product of integers. Then one can easily verify that G fails to be a
group only because it fails to have inverses for some (in fact, most) of its ele
ments relative to *.
We shall find some other nonexamples of groups in the exercises.
PROBLEMS
Easier Problems
1. Determine if the following sets G with the operation indicated form a
group. If not, point out which of the group axioms fail.
(a) G = set of all integers, a * b = ab.
(b) G = set of all integers, a * b = a + b + abo
(c) G = set of nonnegative integers, a * b = a + b.
(d) G = set of all rational numbers =1= 1, a * b = a + b + abo
(e) G = set of all rational numbers with denominator divisible by 5 (writ
ten so that numerator and denominator are relatively prime), a * b =
a + b.
(f) G a set having more than one element, a * b = a for all a, bEG.
2. In the group G defined in Example 6, show that the set H = {Ta,b Ia = ±1,
b any real} forms a group under the * of G.
3. Verify that Example 7 is indeed an example of a group.
Sec. 1 Definitions and Examples of Groups 47
4. Prove that K defined in Example 8 is an abelian group.
5. In Example 9, prove that g * f = f * gt, and that G is a group, is non
abelian, and is of order 8.
6. Let G and H be as in Examples 6 and 7, respectively. Show that if
Ta,b E G, then Ta,b * V * T;;:l E H if V E H.
7. Do Problem 6 with H replaced by the group K of Example 8.
8. If G is an abelian group, prove that (a * b)n = an * b
n
for all integers n.
9. If G is a group in which a
2
= e for all a E G, show that G is abelian.
10. If G is the group in Example 6, find all Ta,b E G such that Ta,b * TI,x =
Tl,x * Ta,b for all real x.
11. In Example 10, for n = 3 find a formula that expresses (fi h
j
) * (fSh
t
) as
fah
b
• Show that G is a nonabelian group of order 6.
12. Do Problem 11 for n = 4.
13. Show that any group of order 4 or less is abelian.
14. If G is any group and a, b, c E G, show that if a * b = a * c, then b = c,
and if b * a = c * a, then b = c.
15. Express (a * b)I in terms of aI and b
1
.
16. Using the result of Problem 15, prove that a group G in which a = aI
for every a E G must be abelian.
17. In any group G, prove that (aI)I = a for all a E G.
*18. If G is a finite group of even order, show that there must be an element
a =1= e such that a = aI. (Hint: Try to use the result of Problem 17.)
19. In S3, show that there are four elements x satisfying x
2
= e and three ele
ments y satisfying y3 = e.
20. Find all the elements in S4 such that x
4
= e.
MiddleLevel Problems
21. Show that a group of order 5 must be abelian.
22. Show that the set defined in Example 10 is a group, is nonabelian, and
has order 2n. Do this by finding the formula for (fih
j
) * (fsh
t
) in the
form fah
b
•
23. In the group G of Example 6, find all elements U E G such that
U * Ta,b = Ta,b * U for every Ta,b E G.
24. If G is the dihedral group of order 2n as defined in Example 10, prove that:
(a) If n is odd and a EGis such that a * b = b * a for all bEG, then a = e.
(b) If n is even, show that there is an a E G, a =1= e, such that a * b = b * a
for all bEG.
48 Groups Ch.2
(c) If n is even, find all the elements a E G such that a * b = b * a
for all bEG.
25. If G is any group, show that:
(a) e is unique (i.e., if f E G also acts as a unit element for G, then f = e).
(b) Given a E G, then aI EGis unique.
*26. If G is a finite group, prove that, given a E G, there is a positive integer
n, depending on a, such that an = e.
*27. In Problem 26, show that there is an integer m > 0 such that am = e
for all a E G.
Harder Problems
28. Let G be a set with an operation * such that:
1. G is closed under *.
2. * is associative.
3. There exists an element e E G such that e * x = x for all x E G.
4. Given x E G, there exists ayE G such that y * x = e.
Prove that G is a group. (Thus you must show that x * e = x and x * y = e
for e, y as above.)
29. Let G be a finite nonempty set with an operation * such that:
1. G is closed under *.
2. * is associative.
3. Given a, b, c E G with a * b = a * c, then b = c.
4. Given a, b, c, E G with b * a = C * a, then b = c.
Prove that G must be a group under *.
30. Give an example to show that the result of Problem 29 can be false if G
is an infinite set.
31. Let G be the group of all nonzero real numbers under the operation *
which is the ordinary multiplication of real numbers, and let H be the
group of all real numbers under the operation #, which is the addition of
real numbers.
(a) Show that there is a mapping F: G ~ H of G onto H which satisfies
F(a * b) = F(a)#F(b) for all a, bEG [i.e., F(ab) = F(a) + F(b)].
(b) Show that no such mapping F can be 11.
2. SOME SIMPLE REMARKS
In this short section we show that certain formal properties which follow from
the group axioms hold in any group. As a matter of fact, most of these results
have already occurred as problems at the end of the preceding section.
Sec. 2 Some Simple Remarks 49
It is a little clumsy to keep writing the * for the product in G, and from
now on we shall write the product a * b simply as ab for all a, bEG.
The first such formal results we prove are contained in
Lemma 2.2.1. If G is a group, then:
(a) Its identity element is unique.
(b) Every a E G has a unique inverse aI E G.
(c) If a E G, (al)l == a.
(d) For a, bEG, (ab)l == bIa
l
.
Proof We start with Part (a). What is expected of us to carry out the
proof? We must show that if e, f E G and af == fa == a for all a E G and
ae == ea == a for all a E G, then e == f This is very easy, for then e == ef and
f == ef; hence e == ef == f, as required.
Instead of proving Part (b), we shall prove a stronger result (listed
below as Lemma 2.2.2), which will have Part (b) as an immediate conse
quence. We claim that in a group G if ab == ac, then b == c; that is, we can
cancel a given element from the same side of an equation. To see this, we
have, for a E G, an element u E G such that ua == e. Thus from ab == ac we
have
u(ab) == u(ac),
so, by the associative law, (ua)b == (ua)c, that is, eb == ec. Hence b == eb ==
ec == c, and our result is established. A similar argument shows that if
ba == ca, then b == c. However, we cannot conclude from ab == ca that b == c;
in any abelian group, yes, but in general, no.
Now to get Part (b) as an implication of the cancellation result. Suppose
that b, c E G act as inverses for a; then ab == e == ac, so by cancellation b == c
and we see that the inverse of a is unique. We shall always write it as aI.
To see Part (c), note that by definition aI (al)l == e; but aI a == e, so
by cancellation in al(al)l == e == aIa we get that (al)l == a.
Finally, for Part (d) we calculate
(ab)(b1a
1
) == ((ab)b1)a
1
(associative law)
== (a(bb
1
)a
1
(again the associative law)
== (ae )a
1
== aa
1
== e.
Similarly, (bIal)(ab) == e. Hence, by definition, (ab)l == bIa
l
. D
50 Groups Ch.2
We promised to list a piece of the argument given above as a separate
lemma. We keep this promise and write
Lemma 2.2.2. In any group G and a, b, c E G, we have:
(a) If ab = ac, then b = c.
(b) If ba = ca, then b =c.
Before leaving these results, note that if G is the group of real
numbers under +, then Part (c) of Lemma 2.2.1 translates into the familiar
(a) = a.
There is only a scant bit of mathematics in this section; accordingly,
we give only a few problems. No indication is given as to the difficulty of these.
PROBLEMS
1. Suppose that G is a set closed under an associative operation such that
1. given a, y E G, there is an x E G such that ax = y, and
2. given a, w E G, there is a u E G such that ua = w.
Show that G is a group.
*2. If G is a finite set closed under an associative operation such that ax = ay
forces x = y and ua = wa forces u = w, for every a, x, y, u, w E G, prove
that G is a group. (This is a repeat of a problem given earlier. It will be
used in the body of the text later.)
3. If G is a group in which (ab)i = aib
i
for three consecutive integers i, prove
that G is abelian.
4. Show that the result of Problem 3 would not always be true if the word
"three" were replaced by "two." In other words, show that there is a
group G and consecutive numbers i, i + 1 such that G is not abelian but
does have the property that (ab)i = aib
i
and (ab)i+l = ai+1b
i
+
1
for all
a,bin G.
5. Let G be a group in which (ab)3 = a
3
b
3
and (ab)5 = a
5
b
5
for all a, bEG.
Show that G is abelian.
6. Let G be a group in which (ab)n = anb
n
for some fixed integer n > 1 for
all a, bEG. For all a, bEG, prove that:
(a) (ab)nl = bn1a
n

1
.
(b) anb
n

1
= bn1a
n
.
(c) (aba1b1)n(nl) = e.
[Hint for Part (c): Note that (aba1)r = abra
1
for all integers r.]
Sec. 3
3. SUBGROUPS
Subgroups 51
In order for us to find out more about the makeup of a given group G, it may
be too much of a task to tackle all of G headon. It might be desirable to
focus our attention on appropriate pieces of G, which are smaller, over which
we have some control, and are such that the information gathered about
them can be used to get relevant information and insight about G itself. The
question then becomes: What should serve as suitable pieces for this kind of
dissection of G? Clearly, whatever we choose as such pieces, we want them
to reflect the fact that G is a group, not merely any old set.
A group is distinguished from an ordinary set by the fact that it is en
dowed with a wellbehaved operation. It is thus natural to demand that such
pieces above behave reasonably with respect to the operation of G. Once
this is granted, we are led almost immediately to the concept of a subgroup
of a group.
Definition. A nonempty subset, H, of a group G is called a subgroup
of G if, relative to the product in G, H itself forms a group.
We stress the phrase "relative to the product in G." Take, for instance,
the subset A = {I, I} in 7L, the set of integers. Under the multiplication of
integers, A is a group. But A is not a subgroup of 7L viewed as a group with
respect to +.
Every group G automatically has two obvious subgroups, namely G it
self and the subgroup consisting of the identity element, e, alone. These two
subgroups we call trivial subgroups. Our interest will be in the remaining
ones, the proper subgroups of G.
Before proceeding to a closer look at the general character of sub
groups, we want to look at some specific subgroups of some particular, ex
plicit Some of the groups we consider are those we introduced as ex
in Section 1; we maintain the numbering given there for them. In
some of these examples we shall verify that certain specified subsets are indeed
subgroups. We would strongly recommend that the reader carry out such a
verification in lots of the others and try to find other examples for themselves.
In trying to verify whether or not a given subset of a group is a sub
group, we are spared checking one of the axioms defining a group, namely
the associative law. Since the associative law holds universally in a group G,
given any subset A of G and any three elements of A, then the associative
law certainly holds for them. So we must check, for a given subset A of G,
whether A is closed under the operation of G, whether e is in A, and finally,
given a E A, whether aI is also in A.
52 Groups Ch.2
Note that we can save one more calculation. Suppose that A eGis
nonempty and that given a, b E A, then ab E A. Suppose further that given
a E A, then aI E A. Then we assert that e E A. For pick a E A; then aI E A
by supposition, hence aa
l
E A, again by supposition. Since aa
l
= e, we
have that e E A. Thus a is a subgroup of G. In other words,
Lemma 2.3.1. A nonempty subset A eGis a subgroup of G if and
only if A is closed with respect to the operation of G and, given a E A, then
aI EA.
We now consider some examples.
Examples
1. Let G be the group 7L of integers under + and let H be the set of even in
tegers. We claim that H is a subgroup of 7L. Why? Is H closed, that is, given
a, b E H, is a + b E H? In other words, if a, b are even integers, is a + b an
even integer? The answer is yes, so H is certainly closed under +. Now to the
inverse. Since the operation in 7L is +, the inverse of a E 7L relative to this op
eration is a. If a E H, that is, if a is even, then a is also even, hence
a E H. In short, H is a subgroup of 7L under +.
2. Let G once again be the group 7L of integers under +. In Example 1, H,
the set of even integers, can be described in another way: namely H consists
of all multiples of 2. There is nothing particular in Example 1 that makes use
of 2 itself. Let m > 1 be any integer and let Hm consist of all multiples of m
in 7L. We leave it to the reader to verify that Hm is a subgroup of 7L under +.
3. Let S be any nonempty set and let G = A(S). If a E S, let H(a) =
{f E A(S) If(a) = a}. We claim that H(a) is a subgroup of G. For if
f, g E H(a), then (fg)(a) = f(g(a» = f(a) = a, since f(a) = g(a) = a. Thus
fg E H(a). Also, iffE H(a), thenf(a) = a, so thatfl(f(a» = fl(a). But
fl(f(a» = fl(a) = i(a) = a. Thus, since a = fl(f(a» = fl(a), we have
that f
l
E H(a). Moreover, His nonempty. (Why?) Consequently, H(a) is a
subgroup of G.
4. Let G be as in Example 6 of Section 1, and H as in Example 7. Then H is a
subgroup of G (see Problem 3 in Section 1).
5. Let G be as in Example 6, H as in Example 7, and K as in Example 8 in
Section 1. Then K C He G and K is a subgroup of both H and of G.
6. Let C' be the nonzero complex numbers as a group under the multiplica
tion of complex numbers. Let V = {a E C' Ila Iis rational}. Then V is a sub
group of C'. For if lal and Ibl are rational, then labl = lallbl is rational, so
Sec. 3 Subgroups 53
ab E V; also, IaI I = lila I is rational, hence aI E V. Therefore, V is a sub
group of C'.
7. Let C' and V be as above and let
U = {a E C' I a = cos () + isin (), ()anyreal}.
If a = cos () + i sin () and b = cos l/J + i sin l/J, we saw in Chapter 1 that ab =
cos(() + l/J) + i sin(() + l/J), so that ab E U, and that aI = cos ()  i sin () =
cos(() + i sin(() E U. Also, lal = 1, since lal = Vcos
2
() + sin
2
() = 1.
Therefore, U eve C' and U is a subgroup both of V and of C'.
8. Let C', U, V be as above, and let n > 1 be an integer. Let ()n =
cos(27Tln) + i sin(27T/n), and let B = {I, ()n, ••• , Since = 1 (as
we saw by De Moivre's Theorem), it is easily checked that B is a subgroup of
U, V, and C', and is of order n.
9. Let G be any group and let a E G. The set A = {a
i
Ii any integer} is a sub
group of G. For, by the rules of exponents, if a
i
E A and a
j
E A, then aia
j
=
a
i
+
j
, so is in A. Also, (ai)I = a
i
, so (ai)I E A. This makes A into a sub
group of G.
A is the cyclic subgroup of G generated by a in the following sense.
Definition. The cyclic subgroup of G generated by a is a set {a
i
Ii any
integer}. It is denoted (a).
Note that if e is the identity element of G, then (e) = tel. In Example 8,
the group B is the cyclic group (()n) of C generated by () n.
10. Let G be any group; for a E G let C(a) = {g E G Iga = agIo We claim
that C(a) is a subgroup of G. First, the closure of C(a). If g, h E C(a), then
ga = ag and ha = ah, thus (gh)a = g (ha) = g (ah) = (ga)h = (ag)h =
the repeated use of the associative law), hence gh E C(a). Also, if
g E tea), then from ga = ag we have gI(ga)gl = gI(ag)gI, which
simplifies to ag
I
= gIa; whence gl E C(a). So, C(a) is thereby a sub
group of G.
These particular subgroups C(a) will come up later for us and they are
given a special name. We call C(a) the centralizer of a in G. If in a group
ab = ba, we say that a and b commute. Thus C(a) is the set of all elements in
G that commute with a.
11. Let G be any group and let Z(G) = {z E G Izx = xz for all x E G}. We
leave it to the reader to verify that Z(G) is a subgroup of G. It is called the
center of G.
54 Groups Ch.2
12. Let G be any group and H a subgroup of G. For a E G, let a1Ha =
{a1ha Ih E H}. We assert that a1Ha is a subgroup of G. If x = a1h1a and
y = a
1
h
2
a where hI, h
2
E H, then xy = (a
1
h
1
a)(a
1
h
2
a) = a
l
(h
1
h
2
)a
(associative law), and since H is a subgroup of G, h
1
h
2
E H. Therefore,
a
1
(h
1
h
2
)a E a1Ha, which says that xy E a1Ha. Thus a1Ha is closed.
Also, if x = a1ha E a1Ha, then, as is easily verified, XI = (a1ha)1 =
a1h1a E a1Ha. Therefore, a1Ha is a subgroup of G.
An even dozen seems to be about the right number of examples, so we
go on to other things. Lemma 2.3.1 points out for us what we need in order
that a given subset of a group be a subgroup. In an important special case we
can make a considerable saving in checking whether a given subset H is a
subgroup of G. This is the case in which H is finite.
Lemma 2.3.2. Suppose that G is a group and H a nonempty finite sub
set of G closed under the product in G. Then H is a subgroup of G.
Proof. By Lemma 2.3.1 we must show that a E H implies a I E H. If
a = e, then a 1 = e and we are done. Suppose then that a =1= e ; consider
the elements a, a
2
, ••• , a
n
+
1
, where n = IHI, the order of H. Here we
have written down n + 1 elements, all of them in H since H is closed, and H
has only n distinct elements. How can this be? Only if some two of the ele
ments listed are equal; put another way, only if a
i
= a
j
for some 1 ~ i <
j ~ n + 1. But then, by the cancellation property in groups, a
j
 i = e. Since
j  i 2:: 1, a
j
 i E H, hence e E H. However, j  iI 2:: 0, so a
j
 i 1 E H
and aa
j
 i 1 = a
j
 i = e, whence a 1 = a
j
 i 1 E H. This proves the
lemma.D
An immediate, but nevertheless important, corollary to Lemma 2.3.2 is the
Corollary. If G is a finite group and H a nonempty subset of G closed
under multiplication, then H is a subgroup of G.
PROBLEMS
Easier Problems
1. If A, B are subgroups of G, show that A n B is a subgroup of G.
2. What is the cyclic subgroup of 7L generated by 1 under +?
3. Let S3 be the symmetric group of degree 3. Find all the subgroups of S3'
4. Verify that Z(G), the center of G, is a subgroup of G. (See Example 11.)
Sec. 3 Subgroups 55
5. If C(a) is the centralizer of a in G (Example 10), prove that Z (G)
naEGC(a).
6. Show that a E Z(G) if and only if C(a) = G.
7. In S3, find C(a) for each a E S3.
8. If G is an abelian group and if H = {a E G Ia
2
= e}, show that H is a
subgroup of G.
9. Give an example of a nonabelian group for which the H in Problem 8 is
not a subgroup.
10. If G is an abelian group and n > 1 an integer, let An = {an Ia E G}.
Prove that An is a subgroup of G.
*11. If G is an abelian group and H = {a E G Ian(a) = e for some n(a) > 1 de
pending on a}, prove that H is a subgroup of G.
We say that a group G is cyclic if there exists an a E G such that
every x EGis a power of a, that is, x = a
j
for some j. In other words, G
is cyclic if G = (a) for some a E G, in which case we say that a is a gen
erator for G.
*12. Prove that a cyclic group is abelian.
13. If G is cyclic, show that every subgroup of G is cyclic.
14. If G has no proper subgroups, prove that G is cyclic.
15. If G is a group and H a nonempty subset of G such that, given a, b E H,
then ab1 E H, prove that H is a subgroup of G.
MiddleLevel Problems
*16. If G has no proper subgroups, prove that G is cyclic of order p, where p
is a prime number. (This sharpens the result of Problem 14.)
17. If G is a group and a, x E G, prove that C(x
1
ax) = x
1
C(a)x. [See Ex
amples 10 and 12 for the definitions of C (b) and of x 1C (a )x.]
18. If S is a nonempty set and xeS, show that T(X) = {fE A(S) If(X) c
X} is a subgroup of A (S) if X is finite.
19. If A, B are subgroups of an abelian group G, let AB = tab Ia E A, b E B}.
Prove that AB is a subgroup of G.
20. Give an example of a group G and two subgroups A, B of G such that
AB is not a subgroup of G.
21. If A, B are subgroups of G such that b
1
Ab C A for all b E B, show that
AB is a subgroup of G.
* 22. If A and B are finite subgroups, of orders m a ~ d n, respectively, of the
abelian group G, prove that AB is a subgroup of order mn if m and n are
relatively prime.
56 Groups Ch.2
23. What is the order of AB in Problem 22 if m and n are not relatively prime?
24. If H is a subgroup of G, let N = n
xEG
x
1
Hx. Prove that N is a subgroup
of G such that y
1
Ny = N for every y E G.
Harder Problems
25. Let S, X, T(X) be as in Problem 18 (but X no longer finite). Give an ex
ample of a set S and an infinite subset X such that T(X) is not a sub
group of A(S).
*26. Let G be a group, H a subgroup of G. Let Hx = {hx Ih E H}. Show that,
given a, bEG, then Ha = Hb or Ha n Hb = 0.
*27. If in Problem 26 H is a finite subgroup of G, prove that Ha and Hb have
the same number of elements. What is this number?
28. Let M, N be subgroups of G such that XI Mx C M and XI Nx C N for
all x E G. Prove that MN is a subgroup of G and that x
1
(MN)x C MN
for all x E G.
*29. If M is a subgroup of G such that x
1
Mx C M for all x E G, prove that
actually XI Mx = M.
30. If M, N are such that x
1
Mx = M and x
1
Nx = N for all x E G, and if
M n N = (e), prove that mn = nm for any m E M, n E N. (Hint: Con
sider the element m
1
n
1
mn.)
4. LAGRANGE'S THEOREM
We are about to derive the first real grouptheoretic result of importance.
Although its proof is relatively easy, this theorem is like the ABC's for fi
nite groups and has interesting implications in number theory.
As a matter of fact, those of you who solved Problems 26 and 27 of Sec
tion 3 have all the necessary ingredients to effect a proof of the result. The
theorem simply states that in a finite group the order of a subgroup divides
the order of the group.
To smooth the argument of this theoremwhich is due to Lagrange
and for use many times later, we make a short detour into the realm of set
theory.
Just as the concept of "function" runs throughout most phases of math
ematics, so also does the concept of "relation." A relation is a statement aRb
about the elements a, b E S. If S is the set of integers, a = b is a relation
on S. Similarly, a < b is a relation on S, as is a ~ b.
Sec. 4 Lagrange's Theorem 57
Definition. A relation '"'' on a set S is called an equivalence relation if,
for all a, b, c E S, it satisfies:
(a) a '"'' a (reflexivity).
(b) a '"'' b implies that b '"'' a (symmetry).
(c) a '"'' b, b '"'' c implies that a '"'' c (transitivity).
Of course, equality, =, is an equivalence relation, so the general notion
of equivalence relation is a generalization of that of equality. In a sense, an
equivalence relation measures equality with regard to some attribute. This
vague remark may become clearer after we see some examples.
Examples
1. Let S be all the items for sale in a grocery store; we declare a '"'' b, for
a, b E S, if the price of a equals that of b. Clearly, the defining rules of an
equivalence relation hold for this '"''. Note that in measuring this "generalized
equality" on S we ignore all properties of the elements of S other than their
price. So a '"'' b if they are equal as far as the attribute of price is concerned.
2. Let S be the integers and n > 1 a fixed integer. We define a '"'' b for a,
b E S if n I(a  b). We verify that this is an equivalence relation. Since n I0
and 0 = a  a, we have a '"'' a. Because n I(a  b) implies that n I(b  a), we
have that a '"'' b implies that b '"'' a. Finally, if a '"'' band b '"'' c, then
n I(a  b) and n I(b  c); hence n I((a  b) + (b  c», that is, n I(a  c).
Therefore, a '"'' c.
This relation on the integers is of great importance in number theory
and is called congruence modulo n; when a '"'' b, we write this as a == b mod n
[or, sometimes, as a == ben)], which is read "a congruent to b mod n." We'll
be running into it very often from now on. As we shall see, this is a special
case of a much wider phenomenon in groups.
3. We generalize Example 2. Let G be a group and H a subgroup of G. For
a, bEG, define a '"'' b if ab
1
E H. Since e E Hand e = aa
1
, we have that
a '"'' a. Also, if ab
1
E H, then since H is a subgroup of G, (ab1)1 E H. But
(ab
1
)1 = (b
1
)l
a
1 = bat, so ba
1
E H, hence b '"'' a. This tells us that
a '"'' b implies that b '"'' a. Finally, if a '"'' band b '"'' c, then ab1 E Hand
bc
1
E H. But (ab
1
)(bc
1
) = ac
1
, whence ac
1
E H and therefore
a '"'' c. We have shown the transitivity of '"'', thus  is an equivalence relation
on G.
Note that if G = 7L, the group of integers under +, and H is the sub
group consisting of all multiples of n, for n > 1 a fixed integer, then ab
1
E H
58 Groups Ch.2
translates into a == ben). So congruence mod n is a very special case of the
equivalence we have defined in Example 3.
It is this equivalence relation that we shall use in proving Lagrange's
theorem.
4. Let G be any group. For a, bEG we declare that a  b if there exists an
x E G such that b = xlax. We claim that this defines an equivalence rela
tion on G. First, a  a for a = elae. Second, if a  b, then b = xlax, hence
a = (Xl)lb(x
l
), so that b  a. Finally, if a  b, b  c, then b = xlax, c =
ylby for some x, y E G. Thus c = yl(xlax)y = (xy)la(xy), and so a  c.
We have established that this defines an equivalence relation on G.
This relation, too, plays an important role in group theory and is given
the special name conjugacy. When a  b we say that "a and b are conjugate
in G." Note that if G is abelian, then a  b if and only if a = b.
We could go on and on to give numerous interesting examples of equiva
lence relations, but this would sidetrack us from our main goal in this section.
There will be no lack of examples in the problems at the end of this section.
We go on with our discussion and make the
Definition. If  is an equivalence relation on S, then [a], the class of
a, is defined by [a] = {b E SIb  a}.
Let us see what the class of a is in the two examples, Examples 3 and 4,
just given.
In Example 3, a  b if ab
l
E H, that is, if ab
l
= h, for some h E H.
Thus a  b implies that a = hb. On the other hand, if a = kb where k E H,
then ab
l
= (kb)b
l
= k E H, so a  b if and only if a E Hb = {hb Ih E H}.
Therefore, [b] = Hb.
The set Hb is called a right coset of H in G. We ran into such in Prob
lem 26 of Section 3. Note that b E Hb, since b = eb and e E H (also because
b E [b] = Hb). Right cosets, and left handed counterparts of them called left
cosets, play important roles in what follows.
In Example 4, we defined a  b if b = xlax for some x E G. Thus [a] =
{xlax I x E G}. We shall denote [a] in this case as cl(a) and call it the conju
gacy class of a in G. If G is abelian, then cl(a) consists of a alone. In fact, if
a E Z(G), the center of G, then cl(a) consists merely of a.
The notion of conjugacy and its properties will crop up again often, es
pecially in Section 11.
We shall examine the class of an element a in Example 2 later in this
chapter.
The important influence that an equivalence relation has on a set is to
break it up and partition it into nice disjoint pieces.
Sec. 4 Lagrange's Theorem 59
Theorem 2.4.1. If '"'"' is an equivalence relation on S, then S = Ural,
where this union runs over one element from each class, and where [a] =1= [b]
implies that [a] n [b] = 0. That is, '"'"' partitions S into equivalence classes.
Proof Since a E [a], we have UaES[a] = S. The proof of the second as
sertion is also quite easy. We show that if [a] =1= [b], then [a] n [b] = 0, or,
what is equivalent to this, if [a] n [b] =1= 0, then [a] = [b].
Suppose, then, that [a] n [b] =1= 0; let c E [a] n [b]. By definition of
class, C '"'"' a since c E [a] and c ~ b since c E [b]. Therefore, a ~ c by sym
metry of '"'"', and so, since a '"'"' c and c '"'"' b, we have a ~ b. Thus a E [b]; if
x E [a], then x '"'"' a, a ~ b gives us that x '"'"' b, hence x E [b]. Thus [a] C [b].
Twargument is obviously symmetric in a and b, so we have [b] C [a],
whence [a] = [b], and our assertion above is proved.
The theorem is now completely proved. D
We now can prove a famous result of Lagrange.
Theorem 2.4.2 (Lagrange's Theorem). If G is a finite group and H is
a subgroup of G, then the order of H divides the order of G.
Proof Let us look back at Example 3, where we established that the
relation a '"'"' b if ab1 E H is an equivalence relation and that
[a] = Ha = {halh E H}.
Let k be the number of distinct classeseall them Ha 1, ... ,Hak. By Theo
rem 2.4.1, G = Hal U Ha2 U ... U Hak and we know that Ha
j
n Hai = 0 if
i =1= j.
We assert that any Hai has IHI = order of H number of elements. Map
H ~ Ha
i
by sending h ~ ha
i
. We claim that this map is 11, for if hai =
h'ai,then by cancellation in G we would have h = h'; thus the map is 11. It
is definitely onto by the very definition of Hai. So Hand Ha i have the same
number, IHI, of elements.
Since G = Hal U ... U Hak and the Hai are disjoint and each Hai has
IH I elements, we have that IGI k IH I. Thus IH I divides IG I and La
grange's Theorem is proved. D
Although Lagrange sounds like a French name, J. L. Lagrange (17361813) was
actually Italian, having been born and brought up in Turin. He spent most of
his life, however, in France. Lagrange was a great mathematician who made
fundamental contributions to all the areas of mathematics of his day.
If G is finite, the number of right cosets of H in G, namely IGI/ IHI, is
called the index of H in G and is written as i
G
(H).
60 Groups Ch.2
Recall that a group G is said to be cyclic if there is an element a E G
such that every element in G is a power of a.
Theorem 2.4.3. A group G of prime order is cyclic.
Proof If H is a subgroup of G then, by invoking Lagrange's Theorem,
IHI divides IGI = p, p a prime, so IHI = 1 or p. So if H =1= (e), then H = G. If
a E G, a =1= e, then the powers of a form a subgroup (a) of G different from
(e). So this subgroup is all of G. This says that any x EGis of the form x = a
i
•
Hence, G is cyclic by the definition of cyclic group. D
If G is finite and a E G, we saw earlier in the proof of Lemma 2.3.2 that
an(a) = e for some n(a) 2:: 1, depending on a. We make the
Definition. If G is finite, then the order of a, written 0 (a), is the least
positive integer m such that am = e.
Suppose that a E G has order m. Consider the set A = {e, a, a
2
, ••• , a
m

1
~ ;
we claim that A is a subgroup of G (since am = e) and that the m elements
listed in A are distinct. We leave the verification of these claims to the
reader. Thus IAI = m = o(a). Since IA IIIG/, we have
Theorem 2.4.4. If G is finite and a E G, then 0 (a) I IG I.
If a E G, where G is finite, we have, by Theorem 2.4.4, IG I
Thus
We have proved the
k . o(a).
Theorem 2.4.5. If G is a finite group of order n, then an = e for all
aE G.
When we apply this last result to certain special groups arising in num
ber theory, we shall obtain some classical numbertheoretic results due to
Fermat and Euler.
Let 7L be the integers and let n > 1 be a fixed integer. We go back to
Example 2 of equivalence relations, where we defined a == b mod n (a
congruent to b mod n) if n I(a  b). The class of a, raj, consists of all
a + nk, where k runs through all the integers. We call it the congruence class
of a.
Sec. 4 Lagrange's Theorem 61
By Euclid's Algorithm, given any integer b, b = qn + r, where
o~ r < n, thus [b] = [r]. So the n classes [0], [1], ... , [n  1] give us all
the congruence classes. We leave it to the reader to verify that they are
distinct.
Let En = {[OJ, [1], ... , [n  I]}. We shall introduce two operations, +
and· in En. Under + En will form an abelian group; under· En will not form
a group, but a certain piece of it will become a group.
How to define [a] + [b]? What is more natural than to define
[a] + [b] = [a + b].
But there is a fly in the ointment. Is this operation + in En welldefined?
What does that mean? We can represent [a] by many a'sfor instance, if
n = 3, [1] = [4] = [2] = ... , yet we are using a particular a to define the
addition. What we must show is that if [a] = [a'] and [b] = [b '], then [a + b]
= [a' + b'], for then we will have [a] + [b] = [a + b] = [a' + b'] =
[a'] + [b'].
Suppose that [a] = [a']; then n I(a  a'). Also from [b] = [b'],
n I(b  b'), hence n I«a  a') + (b  b'» = «a + b)  (a' + b'». There
fore, a + b == (a' + b') mod n, and so [a + b] = [a' + b'].
So we now have a welldefined addition in En. The element [0] acts as
the identity element and [a] acts as  [a], the inverse of [a]. We leave it to
the reader to check out that En is a group under +. It is a cyclic group of
order n generated by [1].
We summarize this all as
Theorem 2.4.6. En forms a cyclic group under the addition [a] + [b] =
[a + b].
Having disposed of the addition in En, we turn to the introduction of a
multiplication. Again, what is more natural than defining
[a] · [b] = [ab]?
So, for instance, if n = 9, [2][7] = [14] = [5], and [3][6] = [18] = [0]. Under
this multiplicationwe leave the fact that it is welldefined to the reader
En does not form a group. Since [0][a] = [0] for all a, and the unit element
under multiplication is [1], [0] cannot have a multiplicative inverse. Okay,
why not try the nonzero elements [a] =1= [0] as a candidate for a group under
this product? Here again it is no go if n is not a prime. For instance, if n = 6,
then [2] =1= [0], [3] =1= [0], yet [2][3] = [6] = [0], so the nonzero elements do
not, in general, give us a group.
So we ask: Can we find an appropriate piece of En that will form a
62 Groups Ch.2
group under multiplication? Yes! Let Un = {[a] E 7L
n
I (a, n) = I}, noting
that (a, n) = 1 if and only if (b, n) = 1 for [a] = [b]. By the Corollary to
Theorem 1.5.5, if (a, n) = 1 and (b, n) = 1, then (ab, n) = 1. So [a][b] = [ab]
yields that if [a], [b] E Un, then [ab] E Un and Un is closed. Associativity is
easily checked, following from the associativity of the integers under multi
plication. The identity element is easy to find, namely [1]. Multiplication is
commutative in Un.
Note that if [a][b] = [a][c] where [a] E Un' then we have [ab] = [ac],
and so [ab  ac] = [0]. This says that n Ia(b  c) = ab  ac; but a is rela
tively prime to n. By Theorem 1.5.5 one must have that n I (b  c), and so
[b] = [c]. In other words, we have the cancellation property in Un. By Prob
lem 2 of Section 2, Un is a group.
What is the order of Un? By the definition of Un, IUn I = number of in
tegers 1 ~ m < n such that (m, n) = 1. This number comes up often and we
give it a name.
Definition. The Euler cpfunction, cp (n), is defined by cp (1) = 1 and,
for n > 1, cp (n) = the number of positive integers m with 1 ::::; m < n such
that (m, n) = 1.
Thus IUnl = cp(n). If n = p, a prime, we have cp(p) = p  1. We see
that cp (8) = 4 for only 1, 3, 5, 7 are less than 8 and positive and relatively
prime to 8. We try another one, cp (15). The numbers 1 ~ m < 15 relatively
prime to 15 are 1,2,4,7,8,11,13,14, so cp(15) = 8.
Let us look at some examples of Un .
1. Us = {[I], [3], [5], [7]}. Note that [3][5] = [15] = [7], [5]2 = [25] = [1]. In
fact, Us is a group of order 4 in which a
2
= e for every a E Us.
2. V
15
= {[I], [2], [4], [7], [8], [11], [13], [14]}. Note that [11][13] = [143] =
[8], [2]4 = [1], and so on.
The reader should verify that a
4
= e = [1] for every a E U
15
•
3. Ug = {[I], [2], [4], [5], [7], [8]}. Note that [2]1 = [2], [2]2 = [4], [2]3 = [8],
[2
4
] = [16] = [7], [2]5 = [32] = [5]; also [2]6 = [2][2]5 = [2][5] = [10] = [1].
So the powers of [2] give us every element in Ug• Thus Ug is a cyclic group of
order 6. What other elements in Ug generate U
g
?
In parallel to Theorem 2.4.6 we have
Theorem 2.4.7. Un forms an abelian group, under the product
[a][ b] = [ab], of order cp (n), where cp (n) is the Euler cpfunction.
Sec. 4 Lagrange's Theorem 63
An immediate consequence of Theorems 2.4.7 and 2.4.5 is a famous re
sult in number theory.
Theorem 2.4.8 (Euler). If a is an integer relatively prime to n, then
acp(n) == 1 mod n.
Proof Un forms a group of order cp(n), so by Theorem 2.4.5, acp(n) = e
for all a E Un' This translates into [aCP(n)] = [a]cp(n) = [1], which in turn trans
lates into n I(aCP(n)  1) for every integer a relatively prime to p. In other
words, acp(n) == 1 mod n. D
A special case, where n = p is a prime, is due to Fermat.
Corollary (Fermat). If p is a prime and p ~ a, then
a
P

1
== 1 mod p.
For any integer b, b
P
== b mod p.
Proof Since cp(p) = p  1, if (a, p) = 1, we have, by Theorem 2.4.8,
that a
P
1 == l(p), hence a
1
• a
P

1
== a (p), so that a
P
== a (p). If p Ib, then
b == O(p) and b
P
== O(p), so that b
P
== b(p). D
Leonard Euler (17071785) was probably the greatest scientist that Switzerland
has produced. He was the most prolific of all mathematicians ever.
Pierre Fermat (16011665) was a great number theorist. Fermat's Last The
oremwhich was in fact first proved in 1994 by Andrew Wilesstates that the
equation an + b
n
= c
n
(a, b, c, n being integers) has only the trivial solution where
a = 0 or b = 0 or C = 0 if n > 2.
One final cautionary word about Lagrange's Theorem. Its converse in
general is not true. That is, if G is a finite group of order n, then it need not
be true that for every divisor m of n there is a subgroup of G of order m. A
group with this property is very special indeed, and its structure can be
spelled out quite well and precisely.
PROBLEMS
Easier Problems
1. Verify that the relation '"'' is an equivalence relation on the set S given.
(a) S = IR reals, a '"'' b if a  b is rational.
(b) S = C, the complex numbers, a'"'' b if la I = Ib \.
(c) S = straight lines in the plane, a '"'' b if a, b are parallel.
(d) S = set of all people, a '"'' b if they have the same color eyes.
64 Groups Ch.2
2. The relation '" on the real numbers IR defined by a '" b if both a > band
b > a is not an equivalence relation. Why not? What properties of an
equivalence relation does it satisfy?
3. Let '" be a relation on a set S that satisfies (1) a '" b implies that b '" a
and (2) a ~ band b  c implies that a ~ c. These seem to imply that
a '" a. For if a '" b, then by (1), b '" a, so a  b, b '" a, so by (2), a '" a. If
this argument is correct, then the relation '" must be an equivalence rela
tion. Problem 2 shows that this is not so. What is wrong with the argu
ment we have given?
4. Let S be a set, {Sa} nonempty subsets such that S = UaS
a
and Sa n Sf3 =
o if a =1= {3. Define an equivalence relation on S in such a way that the Sa
are precisely all the equivalence classes.
* 5. Let G be a group and H a subgroup of G. Define, for a, bEG, a  b if
aIb E H. Prove that this defines an equivalence relation on G, and show
that [a] = aH = {ah Ih E H}. The sets aH are called left cosets of H
in G.
6. If G is S3 and H = {i, f}, where f: S ~ S is defined by f(XI) =
X2, f(X2) = Xl, f(X3) = X3' list all the right cosets of H in G and list all
the left cosets of H in G.
7. In Problem 6, is every right coset of H in G also a left coset of H in G?
8. If every right coset of H in G is a left coset of H in G, prove that
aHa
1
= H for all a E G.
9. In E
16
, write down all the cosets of the subgroup H = {[OJ, [4], [8], [12]}.
(Since the operation in En is +, write your coset as [a] + H. We don't
need to distinguish between right cosets and left cosets, since En is
abelian under +.)
10. In Problem 9, what is the index of H in E
16
? (Recall that we defined the
index iG(H) as the number of right cosets in G.)
11. For any finite group G, show that there are as many distinct left cosets of
H in G as there are right cosets of H in G.
12. If aH and bH are distinct left cosets of H in G, are Ha and Hb distinct
right cosets of H in G? Prove that this is true or give a counterexample.
13. Find the orders of all the elements of U
l8
• Is U
l8
cyclic?
14. Find the orders of all the elements of U
20
• Is U
20
cyclic?
*15. If p is a prime, show that the only solutions of x
2
== 1 mod p are X ==
1 modp or x == 1 modp.
*16. If G is a finite abelian group and aI, ... , an are all its elements, show
that X = ala2 ... an must satisfy x
2
= e.
17. If G is of odd order, what can you say about the x in Problem 16?
Sec. 4 Lagrange's Theorem 65
18. Using the results of Problems 15 and 16, prove that if p is an odd prime
number, then (p  I)! == 1 mod p. (This is known as Wilson's Theo
rem.) It is, of course, also true if p = 2.
19. Find all the distinct conjugacy classes of S3 •
20. In the group G of Example 6 of Section 1, find the conjugacy class of the
element T
a
, b • Describe it in terms of a and b.
21. Let G be the dihedral group of order 8 (see Example 9, Section 1). Find
the conjugacy classes in G.
22. Verify Euler's Theorem for n = 14 and a = 3, and for n = 14 and a = 5.
23. In U
41
, show that there is an element a such that [a]2 = [1], that is, an
integer a such that a
2
== 1 mod 41.
24. If p is a prime number of the form 4n + 3, show that we cannot solve
x
2
== 1 modp
[Hint: Use F'ermat's Theorem that a
P

1
== 1 modp ifp {a.]
25. Show that the nonzero elements in 7L
n
form a group under the product
[a] [b] = [ab] if and only if n is a prime.
MiddleLevel Problems
26. Let G be a group, H a subgroup of G, and let S be the set of all distinct
right cosets of H in G, T the set of all left cosets of H in G. Prove that there
is a 11 mapping of S onto T. (Note: The obvious map that comes to mind,
which sends Ha into aH, is not the right one. See Problems 5 and 12.)
27. If aH = bH forces Ha = Hb in G, show that aHa
1
= H for every a E G.
28. If G is a cyclic group of order n, show that there are 'P (n) generators for
G. Give their form explicitly.
29. If in a group G, aba
1
= bi, show that arba
r
= b
ir
for all positive inte
gers r.
30. If in G a
5
= e and aba
1
= b
2
, find o(b) if b =1= e.
*31. If o(a) = m and as = e, prove that m Is.
32. Let G be a finite group, H a subgroup of G. Let f(a) be the least positive
m such that am E H. Prove thatf(a) lo(a).
33. If i =1= f E A (S) is such that f P = i, p a prime, and if for some
s E S, fj (s) = s for some 1 ~ j < p, show that f(s) = s.
34. If f E A (S) has order p, p a prime, show that for every s E S the orbit of
sunder f has one or p elements. [Recall: The orbit of sunder f is
{fj(s) Ij any integer}.]
66 Groups Ch.2
35. If1E A (S) has order p, p a prime, and S is a finite set having n elements,
where (n,p) == 1, show that for some s E S, I(s) == s.
Harder Problems
36. If a > 1 is an integer, show that n Iq;(a
n
 1), where q; is the Euler
q;function. [Hint: Consider the integers mod(a
n
 1).]
37. In a cyclic group of order n, show that for each positive integer m that di
vides n (including m == 1 and m == n) there are q;(m) elements of order m.
38. Using the result of Problem 37, show that n == ~ m l nq;(m).
39. Let G be a finite abelian group of order n for which the number of solu
tions of x
m
== e is at most m for any m dividing n. Prove that G must be
cyclic. [Hint: Let l/J(m) be the number of elements in G of order m. Show
that l/J(m) ::::; q;(m) and use Problem 38.]
40. Using the result of Problem 39, show that Up, if p is a prime, is cyclic.
(This is a famous result in number theory; it asserts the existence of a
primitive root mod p.)
41. Using the result of Problem 40, show that if p is a prime of the form
p == 4n + 1, then we can solve x
2
== 1 mod p (with x an integer).
42. Using Wilson's Theorem (see Problem 28), show that if p is a prime of
the form p == 4n + 1 and if
p 1 (p  1)
y == 1· 2·3··· 2 == 2 !,
then y2 == 1 mod p. (This gives another proof of the result in Problem
41.)
43. Let G be an abelian group of order n, and al, ... , an its elements. Let
x == ala2··· an. Show that:
(a) If G has exactly one element b =1= e such that b
2
== e, then x == b.
(b) If G has more than one element b =1= e such that b
2
== e, then x == e.
(c) If n is odd, then x == e (see Problem 16).
5. HOMOMORPHISMS AND NORMAL SUBGROUPS
In a certain sense the subject of group theory is built up out of three basic
concepts: that of a homomorphism, that of a normal subgroup, and that of
the factor or quotient group of a group by a normal subgroup. We discuss the
first two of these in this section, and the third in Section 6.
Without further ado we introduce the first of these.
Sec. 5 Homomorphisms and Normal Subgroups 67
Definition. Let G, G' be two groups; then the mapping q;: G ~ G' is
a homomorphism if q;(ab) == q;(a)q;(b) for all a, bEG.
(Note: This q; has nothing to do with the Euler q;function.)
In this definition the product on the left sidein q; (ab)is that of G,
while the product q; (a) q; (b) is that of G'. A short description of a homomor
phism is that it preserves the operation of G. We do not insist that q; be onto; if
it is, we'll say that it is. Before working out some facts about homomorphisms,
we present some examples.
Examples
1. Let G be the group of all positive reals under the multiplication of reals,
and G' the group of all reals under addition. Let q;: G ~ G' be defined by
q;(x) == IOgloX for x E G. Since IOglO(XY) == IOgloX + IOgloY, we have
q;(xy) == q;(x) + q;(y), so q; is a homomorphism. It also happens to be onto
and 11.
2. Let G be an abelian group and let q;: G ~ G be defined by q; (a) == a
2
.
Since q; (ab) == (ab)2 == a
2
b
2
== q; (a) q; (b), q; is a homomorphism of G into it
self. It need not be onto; the reader should check that in Us (see Section 4)
a
2
== e for all a E Us, so q;(G) == (e).
3. The example of Us above suggests the socalled trivial homomorphism. Let
G be any group and G' any other; define q; (x) == e', the unit element of G',
for all x E G. Trivially, q; is a homomorphism of G into G'. It certainly is not
a very interesting one.
Another homomorphism always present is the identity mapping, i, of any
group G into itself. Since i(x) == x for all x E G, clearly i(xy) == xy == i(x)i(y).
The map i is 11 and onto, but, again, is not too interesting as a homomor
phism.
4. Let G be the group of integers under + and G' == {I, I}, the subgroup of
the reals under multiplication. Define q; (m) == 1 if m is even, q; (m) == 1 if m
is odd. The statement that q; is a homomorphism is merely a restatement of:
even + even == even, even + odd == odd, and odd + odd == even.
5. Let G be the group of all nonzero complex numbers under multiplication
and let G' be the group of positive reals under multiplication. Let q; : G ~ G'
be defined by q; (a) == Ia I;then q; (ab) == Iab I == Ia I IbI == q; (a) q; (b), so q; is a
homomorphism of G into G'. In fact, q; is onto.
6. Let G be the group in Example 6 of Section 1, and G' the group of
nonzero reals under multiplication. Define q;: G ~ G' by q;(Ta,b) == a. That
68 Groups Ch.2
q; is a homomorphism follows from the product rule in G, namely, Ta,b Tc,d =
Tac,ad+b'
7. Let G = 7L be the group of integers under + and let G' = 7L
n
• Define
q;: G ~ 7L
n
by q;(m) = [m]. Since the addition in 7L
n
is defined by [m] + [r] =
[m + r], we see that q;(m + r) = q;(m) + q;(r), so q; is indeed a homomor
phism of 7L onto 7L
n
•
8. The following general construction gives rise to a wellknown theorem.
Let G be any group, and let A(G) be the set of all 11 mappings of G onto it
selfhere we are viewing G merely as a set, forgetting about its multiplication.
Define T
a
: G ~ G by Ta(x) = ax for every x E G. What is the product, T
a
T
b
, of
T
a
and T
b
as mappings on G? Well,
(we used the associative law). So we see that T
a
T
b
= Tab'
Define the mapping q;: G ~ A(G) by q;(a) = T
a
, for a E G. The prod
uct rule for the T's translates into q; (ab) = Tab = T
a
T
b
= q; (a) q; (b), so q; is a
homomorphism of G into A(G). We claim that q; is 11. Suppose that q;(a) =
q;(b), that is, T
a
= T
b
. Therefore, a = Ta(e) = Tb(e) = b, so q; is indeed 11.
It is not onto in generalfor instance, if G has order n > 2, then A(G) has
order n!, and since n! > n, q; doesn't have a ghost of a chance of being onto.
It is easy to verify that the image of 'fJ, q;( G) = {T
a
Ia E G}, is a subgroup
of A(G).
The fact that q; is 11 suggests that perhaps 11 homomorphisms should
playa special role. We single them out in the following definition.
Definition. The homomorphism q;: G ~ G' is called a monomor
phism if q; is 11. A monomorphism that is onto is called an isomorphism. An
isomorphism from G to G itself is called an automorphism.
One more definition.
Definition. Two groups G and G' are said to be isomorphic if there is
an isomorphism of G onto G'.
We shall denote that G and G' are isomorphic by writing G === G'.
This definition seems to be asymmetric, but, in point of fact, it is not.
..For if there is an isomorphism of G onto G', there is one of G' onto G (see
Problem 2).
We shall discuss more thoroughly later what it means for two groups to
be isomorphic. But now we summarize what we did in Example 8.
Sec. 5 Homomorphisms and Normal Subgroups 69
Theorem 2.5.1 (Cayley's Theorem). Every group G is isomorphic to
some subgroup of A (S), for an appropriate S.
The appropriate S we used was G itself. But there may be better
choices. We shall see some in the problems to follow.
When G is finite, we can take the set S in Theorem 2.5.1 to be finite, in
which case A (S) is Sn and its elements are permutations. In this case, Cay
ley's Theorem is usually stated as: A finite group can be represented as a
group ofpermutations.
(Arthur Cayley (18211895) was an English mathematician who worked in ma
trix theory, invariant theory, and many other parts of algebra.)
This is a good place to discuss the importance of "isomorphism." Let q;
be an isomorphism of G onto G'. We can view G' as a relabeling of G,
using the label q; (x) for the element x. Is this labeling consistent with the
structure of G as a group? That is, if x is labeled q;(x), y labeled q;(y), what
is xy labeled as? Since q; (x) q; (y) = q; (xy), we see that xy is labeled as
q; (x) q; (y), so this renaming of the elements is consistent with the product in
G. So two groups that are isomorphicalthough they need not be equalin
a certain sense, as described above, are equal. Often, it is desirable to be able
to identify a given group as isomorphic to some concrete group that we
know.
We go on with more examples.
9. Let G be any group, a E G fixed in the discussion. Define q;: G ~ G by
q;(x) = aIxa for all x E G. We claim that q; is an isomorphism of G onto
itself. First,
so q; is at least a homomorphism of G into itself. It is 11 for if q; (x) = q; (y),
then aI xa = a 1 ya, so by cancellation in G we get x = y. Finally, q; is onto,
for x = aI(axaI)a = q;(axa
I
) for any x E G.
Here q; is called the inner automorphism of G induced by a. The notion
of automorphism and some of its properties will come up in the problems.
One final example:
10. Let G be the group of reals under + and let G' be the group of all
nonzero complex numbers under multiplication. Define q; : G ~ G' by
q;(x) = cos x + i sin x.
We saw that (cos x + i sin x)(cos y + i sin y) = cos(x + y) + i sin(x + y),
70 Groups Ch. 2
hence 'P (x )'P(Y) = 'P (x + y) and 'P is a homomorphism of G into G'. 'P is not
11 because, for instance, 'P(D) = 'P(27T) = 1, nor is 'P onto.
Now that we have a few examples in hand, we start a little investigation
of homomorphisms. We begin with
Lemma 2.5.2. If 'P is a homomorphism of G into G', then:
(a) 'P(e) = e', the unit element of G'.
(b) 'P (a1) = 'P (a) 1 for all a E G.
Proof Since x = xe, 'P(x) = 'P(xe) = 'P(x)'P(e); by cancellation in G'
we get 'P(e) = e'. Also, 'P(aa
1
) = 'P(e) = e', hence e' = 'P(aa
1
) =
'P(a)'P(a
1
), which proves that 'P(a
I
) = 'P(a)I. D
Definition. The image of 'P, 'P(G), is 'P(G) = {'P(a) I a E G}.
We leave to the reader the proof of
Lemma 2.5.3. If 'P is a homomorphism of G into G', then the image
of 'P is a subgroup of G'.
We singled out certain homomorphisms and called "them monomor
phisms. Their property was that they were 11. We want t6 measure how far
a given homomorphism is from being a monomorphism. This prompts the
Definition. If 'P is a homomorphism of G into G', then the kernel of
'P, Ker 'P, is defined by Ker 'P = {a E G I'P(a) = e'}.
Ker'P measures the lack of 111 ness at' one point e'. We claim that
this lack is rather uniform. What is ,W = {x E G I'P (x) = w'} for a given
w' E G'? We show that if 'P(x) = w' for some x E G, then W =
{kx IkE Ker 'P} = (Ker 'P)x. Clearly, if k E Ker'P and 'P(x) = w', then
'P(kx) = 'P(k)'P(x) = e''P(x) = w
'
, so kx E W. Also, if 'P(x) = 'P(Y) = w',
then 'P(x) = 'P(Y), hence 'P(Y)'P(X)I = e'; but 'P(X)l = 'P(x
1
) by
Lemma 2.5.2, so e' = 'P(Y)jP(X)1 = 'P(Y)'P(x
l
) = 'P(yx
1
), whence
yx 1 E Ker 'P and so y E (Ker 'P)x. Thus the inverse image of any element
w ' in 'P (G) E G' is the set (Ker 'P)x, where x is any element in G such that
'P(x) = w'.
We state this as
Lemma 2.5.4. If w' E G' is of the form 'P (x)
{y E G I'P(Y) = w'} = (Ker 'P)x.
w', then
Sec. 5 Homomorphisms and Normal Subgroups 71
We now shall study some basic properties of the kernels of homomor
phisms.
Theorem 2.5.5. If 'P is a homomorphism of G into G', then
(a) Ker 'P is a subgroup of G.
(b) Given a E G, aI (Ker 'P)a C Ker 'P.
Proof Although this is so important, its proof is easy. If a, b E Ker 'P,
then 'P (a) == 'P (b) == e', hence 'P (ab) == 'P (a) 'P (b) == e', whence ab E Ker 'P,
so Ker 'P is closed under product. Also 'P (a) == e' implies that 'P (a  I) ==
'P(a)I == e', and so aI E Ker 'P. Therefore, Ker'P is a subgroup of G. If
k E Ker'P and a E G, then 'P (k) == e'. Consequently, 'P(aIka) ==
'P (a I) 'P (k)'P(a) == 'P (a I)e ' 'P (a) == 'P (a I)'P (a) == 'P (a Ia) == 'P (e) == e'. This
tells us that aIka E Ker 'P, hence aI (Ker 'P) a E Ker 'P. The theorem is
now completely proved. D
Corollary. If 'P is a homomorphism of G into G', then 'P is a
monomorphism if and only if Ker 'P == (e).
Proof This result is really a corollary to Lemma 2.5.4. We leave the
few details to the reader. D
Property (b) of Ker 'P in Theorem 2.5.5 is an interesting and basic one
for a subgroup to enjoy. We ran into this property in the text material and
problems earlier on several occasions. We use it to define the ultraimportant
class of subgroups of a group.
Definition. The subgroup N of G is said to be a normal subgroup of
G if aINa C N for every a E G.
Of course, Ker 'P, for any homomorphism, is a normal subgroup of G.
As we shall see in the next section, every normal subgroup of G is the kernel
of some appropriate homomorphism of G into an appropriate group G'. So in
a certain sense the notions of homomorphism and normal subgroups will be
shown to be equivalent.
Although we defined a normal subgroup via aINa C N, we actually
have aINa == N. For if aINa C N for all a E G, then N == a (aINa)a
1
C
aNa
1
== (a
I
) INa 1 C N. So N == aNa
1
for every a E G. Transposing,
we have Na == aN; that is, every left coset of N in G is a right coset of
Nin G.
On the other hand, if every left coset of N in G is a right coset, then the
left coset aN, which contains a, must be equal to the right coset containing a,
72 Groups Ch.2
namely Na. Thus, aN = Na and N = aINa for all a E G, which is to say that
N is normal in G.
We write "N is a normal subgroup of G" by the abbreviated symbol
N<JG.
Note that aINa = N does not mean that aIna = n for every n E N.
Nomerely that the set of all aIna is the same as the set of all n.
We have proved
Theorem 2.5.6. N <J G if and only if every left coset of N in G is a
right coset of N in G.
Before going any further, we pause to look at some examples of kernels
of homomorphisms and normal subgroups.
If G is abelian, then every subgroup of G is normal, for aIxa = x for
every a, x E G. The converse of this is not true. Nonabelian groups exist in
which every subgroup is normal. See if you can find such an example of order
8. Such nonabelian groups are called Hamiltonian, after the Irish mathemati
cian W. R. Hamilton (18051865). The desired group of order 8 can be found
in the quaternions of Hamilton, which we introduce in Chapter 4, Section 1.
In Example 1, cp(x) = 10gloX, and Ker cp = {x IlogloX = O} = {I}. In
Example 2, where G is abelian, and cp (x) = X
2
,
Ker cp = {x E G Ix
2
= e}
The kernel of the trivial homomorphism of Example 3 is all of G. In Exam
ple 4, Ker cp is the set of all even integers. In Example 5, Ker cp =
{a E C' Iia I = I}, which can be identified, from the polar form of a complex
number, as Ker cp = {cos x + i sin x Ix real}. In Example 6, Ker cp =
{Tt,b E G Ib real}. In Example 7, Ker cp is the set of all multiples of n. In Ex
amples 8 and 9, the kernels consists of e alone, for the maps are monomor
phisms. In Example 10, we see that Ker cp = {27Tm Im any
Of course, all the kernels above are normal subgroups of their respec
tive groups. We should look at some normal subgroups, intrinsically in G it
self, without recourse to the kernels of homomorphism. We go back to the
examples of Section 1.
1. In Example 7, H = {Ta,b E G Ia rational}. If Tx,y E G, we leave it to the
reader to check that HTx,y CHand so H <J G.
2. In Example 9 the subgroup {i, g, i, g3} <J G. Here too we leave the
checking to the reader.
3. In Example 10 the subgroup H = Ii, h, h
2
, ••• , h
n

I
} is normal in G. This
we also leave to the reader.
Sec. 5 Homomorphisms and Normal Subgroups 73
4. If G is any group, Z(G), the center of G, is a normal subgroup of G (see
,Example 11 of Section 3).
5. If G = S3' G has the elements i, f, g, i, fg, and gf, where f(XI) = X2,
f(X2) = Xt, f(X3) = X3 and g(Xt) = X2' g(X2) = X3, g(X3) = Xt. We claim
that the subgroup N = Ii, g, i} <l S3' As we saw earlier (or can compute
now), fgft = gt = i, f
g
2f
t
= g. (fg) g(fg)l = fgggIfl = fgft =
g2, and so on. So N <l S3 follows.
The material in this section has been a rather rich diet. It may not seem
so, but the ideas presented, although simple, are quite subtle. We recom
mend that the reader digest the concepts and results thoroughly before going
on. One way of seeing how complete this digestion is, is to take a stab at
many of the almost infinite list of problems that follow. The material of the
next section is even a richer diet, and even harder to digest. Avoid a mathe
matical stomachache later by assimilating this section well.
PROBLEMS
Easier Problems
1. Determine in each of the parts if the given mapping is a homomorphism.
If so, identify its kernel and whether or not the mapping is 11 or onto.
(a) G = Z under +, G' = Zn, cp(a) = [a] for a E Z.
(b) G group, cp: G ~ G defined by cp(a) = aI for a E G.
(c) G abelian group, cp: G ~ G defined by cp(a) = aI for a E G.
(d) G group of all nonzero real numbers under multiplication, G' =
{I, I}, cp(r) = 1 if r is positive, cp(r) = 1 if r is negative.
(e) G an abelian group, n > 1 a fixed integer, and cp: G ~ G defined by
cp(a) = an for a E G.
2. Recall that G ~ G' means that G is isomorphic to G'. Prove that for all
groups G
b
G
2
, G
3
:
(a) G
t
~ G
t
.
(b) G
I
~ G
2
implies that G
2
~ G
I
•
(c) G
t
~ G
2
, G
2
~ G
3
implies that G
I
~ G
3
•
3. Let G be any group and A (G) the set of all 11 mappings of G, as a set,
onto itself. Define La: G ~ G by La(x) = xa
t
. Prove that:
(a) La E A(G).
(b) LaL
b
= Lab'
(C) The mapping t/J: G ~ A(G) defined by t/J(a) = La is a monomor
phism of G into A (G).
74 Groups Ch.2
4. In Problem 3 prove that for all a, bEG, TaL
b
= LbT
a
, where T
a
is de
fined as in Example 8.
5. In Problem 4, show that if V E A (G) is such that T
a
V = VT
a
for all
a E G, then V = L
b
for some bEG. (Hint: Acting on e E G, find out
what b should be.)
6. Prove that if cp: G ~ G' is a homomorphism, then cp (G), the image of G,
is a subgroup of G' .
7. Show that cp : G ~ G', where cp is a homomorphism, is a monomorphism
if and only if Ker cp = (e).
8. Find an isomorphism of G, the group of all real numbers under +, onto
G', the group of all positive real numbers under multiplication.
9. Verify that if G is the group in Example 6 of Section 1, and H =
{T
a
, bEG I a rational}, then H <l G, the dihedral group of order 8.
10. Verify that in Example 9 of Section 1, the set H = {i, g, g2, g3} is a nor
mal subgroup of G, the dihedral group of order 8.
11. Verify that in Example 10 of Section 1, the subgroup
H = {i, h, h
2
, ••• ,h
n

I
}
is normal in G.
12. Prove that if Z (G) is the center of G, then Z (G) <l G.
13. If G is a finite abelian group of order nand cp: G ~ G is defined by
cp (a) = am for all a E G, find the necessary and sufficient condition that
cp be an isomorphism of G onto itself.
14. If G is abelian and cp : G ~ G' is a homomorphism of G onto G', prove
that G' is abelian.
15. If G is any group, N <J G, and cp: G ~ G' a homomorphism of G onto
G', prove that the image, cp(N), of N is a normal subgroup of G'.
16. If N <l G and M <l G and MN = {mn 1m E M, n EN}, prove that MNis
a subgroup of G and that MN <l G.
17. If M <l G, N <l G, prove that M n N <l G.
18. If H is any subgroup of G and N = n
aEG
aI Ha, prove that N <l G.
19. If H is a subgroup of G, let N(H) be defined by the relation N(H) =
{a E G Ia1Ha = H}. Prove that:
(a) N(H) is a subgroup of G and N(H) ~ H.
(b) H <l N(H).
(c) If K is a subgroup of G such that H <l K, then K C N(H). [So N(H)
is the largest subgroup of G in which H is normal.]
20. If M <l G, N <l G, and M n N = (e), show that for mE M, n EN, mn = nm.
Sec. 5 Homomorphisms and Normal Subgroups 75
21. Let S be any set having more than two elements and A(S) the set of
all 11 mappings of S onto itself. If s E S, we define H(s) =
{fE A(S) If(s) = s}. Prove that H(s) cannot be a normal subgroup ofA(S).
22. Let G = S3, the symmetric group of degree 3 and let H = {i, f}, where
f(Xl) = x2,f(X2) = xl,f(X3) = X3·
(a) Write down all the left cosets of H in G.
(b) Write down all the right cosets of H in G.
(c) Is every left coset of H a right coset of H?
23. Let G be a group such that all subgroups of G are normal in G. If
a, bEG, prove that ba = ajb for some j.
24. If G
b
G
2
are two groups, let G = G
l
X G
2
, the Cartesian product of G
b
G
2
[i.e., G is the set of all ordered pairs (a, b) where a E G
l
, b E G
2
].
Define a product in G by (al' b
l
)(a2' b
2
) = (ala2' b
l
b
2
).
(a) Prove that G is a group.
(b) Show that there is a monomorphism 'Pl of G
l
into G such that
'Pl (G
l
) <l G, given by 'Pl (al) = (ab e2), where e2 is the identity ele
ment of G
2
•
(c) Find the similar monomorphism 'P2 of G
2
into G.
(d) Using the mappings 'Pb 'P2 of Parts (b) and (c), prove that
'Pl(G
1
)'P2(G
2
) = G and 'Pl(G
l
) n 'P2(G
2
) is the identity element of G.
(e) Prove that G
l
X G
2
~ G
2
X G
l
•
25. Let G be a group and let W = G X G as defined in Problem 24. Prove that:
(a) The mapping 'P: G ~ W defined by 'P(a) = (a, a) is a monomorphism
of G into W.
(b) The image 'P(G) in W [i.e., {(a, a) Ia E G}] is normal in W if and only
if G is abelian.
MiddleLevel Problems
*26. If G is a group and a E G, define O"a: G ~ G by O"a(g) = aga
l
. We saw
in Example 9 of this section that O"a is an isomorphism of G onto itself, so
O"a E A (G), the group of all 11 mappings of G (as a set) onto itself. De
fine t/J: G ~ A(G) by t/J(a) = O"a for all a E G. Prove that:
(a) t/J is a homomorphism of G into A (G).
(b) Ker t/J = Z(G), the center of G.
27. If 0 is an automorphism of G and N <l G, prove that O(N) <l G.
28. Let 0, t/J be automorphisms of G, and let Ot/J be the product of 0 and t/J as
mappings on G. Prove that Ot/J is an automorphism of G, and that Ol is
an automorphism of G, so that the set of all automorphisms of G is itself
a group.
76 Groups Ch.2
* 29. A subgroup T of a group W is called characteristic if 'P (T) C T for all au
tomorphisms, 'P, of W. Prove that:
(a) M characteristic in G implies that M <l G.
(b) M, N characteristic in G implies that MN is characteristic in G.
(c) A normal subgroup of a group need not be characteristic. (This is
quite hard; you must find an example of a group G and a noncharac
teristic normal subgroup.)
30. Suppose that IG I = pm, where p (m and p is a prime. If H is a normal
subgroup of order p in G, prove that H is characteristic.
31. Suppose that G is an abelian group of order pnm where p (m is a prime.
If H is a subgroup of G of order pn, prove that H is a characteristic sub
group of G.
32. Do Problem 31 even if G is not abelian if you happen to know that for
some reason or other H <l G.
33. Suppose that N <l G and MeNis a characteristic subgroup of N. Prove
that M <J G. (It is not true that if M <l Nand N <l G, then M must be
normal in G. See Problem 50.)
34. Let G be a group, .:fI(G) the group of all automorphisms of G. (See Prob
lem 28.) Let I (G) = {Ua Ia E G}, where U
a
is as defined in Problem 26.
Prove that I( G) <l .:fI(G).
35. Show that Z (G), the center of G, is a characteristic subgroup of G.
36. If N <l G and H is a subgroup of G, show that H n N <l H.
Harder Problems
37. If G is a nonabelian group of order 6, prove that G ~ S3 .
38. Let G be a group and H a subgroup of G. Let S = {Ha I a E G} be the
set of all right cosets of H in G. Define, for bEG, T
b
: S ~ S by Tb(Ha)
= Hab
1
•
(a) Prove that TbT
e
= T
be
for all b, c E G [therefore the mapping
l/J: G ~ A(S) defined by l/J(b) = T
b
is a homomorphism].
(b) Describe Ker l/J, the kernel of l/J: G ~ A(S).
(c) Show that Ker l/J is the largest normal subgroup of G lying in H
[largest in the sense that if N <l G and N C H, then N C Ker l/J].
39. Use the result of Problem 38 to redo Problem 37.
Recall that if H is a subgroup of G, then the index of H in G, iG(H), is
the number of distinct right cosets of Hand G (if this number is finite).
40. If G is a finite group, H a subgroup of G such that n (iG(H)! where n =
IG I, prove that there is a normal subgroup N =I= (e) of G contained in H.
Sec. 6 Factor Groups 77
41. Suppose that you know that a group G of order 21 contains an element a
of order 7. Prove that A = (a), the subgroup generated by a, is normal in
G. (Hint: Use the result of Problem 40.)
42. Suppose that you know that a group G of order 36 has a subgroup H or
order 9. Prove that either H <J G or there exists a subgroup N <J G,
N c H, and INI = 3.
43. Prove that a group of order 9 must be abelian.
44. Prove that a group of order p2, P a prime, has a normal subgroup of
order p.
45. Using the result of Problem 44, prove that a group of order p2, P a
prime, must be abelian.
46. Let G be a group of order 15; show that there is an element a =1= e in G
such that a
3
= e and an element b =1= e such that b
5
= e.
47. In Problem 46, show that both subgroups A = {e, a, a
2
} and B =
fe, b, b
2
, b
3
, b
4
} are normal in G.
48. From the result of Problem 47, show that any group of order 15 is cyclic.
Very Hard Problems
49. Let G be a group, H a subgroup of G such that i
G
(H) is finite. Prove that
there is a subgroup N c H, N <J G such that i
G
(N) is finite.
50. Construct a group G such that G has a normal subgroup N, and N has a
normal subgroup M (i.e., N <J G, M <J N), yet M is not normal in G.
51. Let G be a finite group, q; an automorphism of G such that q;2 is the iden
tity automorphism of G. Suppose that 'P(x) = x implies that x = e. Prove
that G is abelian and q;(a) = aI for all a E G.
52. Let G be a finite group and q; an automorphism of G such that q; (x) =
XI for more than threefourths of the elements of G. Prove that 'P(Y) =
y
1
for all y E G, and so G is abelian.
6. FACTOR GROUPS
Let G be a group and N a normal subgroup of G. In proving Lagrange's The
orem we used, for an arbitrary subgroup H, the equivalence relation a '"'' b if
ab
1
E H. Let's try this out when N is normal and see if we can say a little
more than one could say for just any old subgroup.
So, let a '"'' b if ab1 E N and let [a] = {x E G I x '"'' a}. As we saw
earlier, [a] = Na, the right coset of N in G containing a. Recall that in
looking at 7L
n
we defined for it an operation + via [a] + [b] = [a + b]. Why
78 Groups Ch.2
not try something similar for an arbitrary group G and a normal subgroup
NofG?
So let M = {[a] Ia E G}, where [a] = {x E G Ixa
1
E N} = Na. We de
fine a product in M via [a] [b] = [ab]. We shall soon show that M is a group
under this product. But first and foremost we must show that this product in
M is welldefined. In other words, we must show that if [a] = [a'] and [b] =
[b'], then [ab] = [a'b'], for this would show that [a][b] = [ab] = [a'b'] =
[a '][ b']; equivalently, that this product of classes does not depend on the par
ticular representatives we use for the classes.
Therefore let us suppose that [a] = [a'] and [b] = [b ']. From the defi
nition of our equivalence we have that a' = na, where n E N. Similarly,
b' = mb, where mEN. Thus a'b' = namb = n(ama1)ab; since N <l G,
ama
1
is in N, so n(ama
1
) is also in N. So if we let nl = n(ama
1
), then
nl E Nand a'b' = nlab. But this tells us that a'b' E Nab, so that
a' b' ~ ab, from which we have that [a' b'] = [ab], the exact thing we re
quired to ensure that our product in M was welldefined.
Thus M is now endowed with a welldefined product [a][b] = [able
We now verify the group axioms for M. Closure we have from the very def
inition of this product. If [a], [b], and [c] are in M, then [a]([ b][c]) =
[a][bc] = [a (be)] = [(ab)c] (since the product in G is associative) =
[ab][ c] = ([a][ b])[c]. Therefore, the associative law has been established
for the product in M. What about a unit element? Why not try the obvious
choice, namely [e]? We immediately see that [a][e] = rae] = [a] and
[e][a] = rea] = [a], so [e] does act as the unit element for M. Finally, what
about inverses? Here, too, the obvious choice is the correct one. If a E G,
then [a][a
1
] = [aa
1
] = [e], hence [aI] acts as the inverse of [a] relative
to the product we have defined in M.
We want to give M a name, and better still, a symbol that indicates its
dependence on G and N. The symbol we use for M is GIN (read "G over Nor
G mod N") and GIN is called the factor group or quotient group of G by N.
What we have shown is the very important
Theorem 2.6.1. If N <l G and
GIN = {[a]/aEG} = {NalaEG},
then GIN is a group relative to the operation [a] [b] = [ab].
One observation must immediately be made, namely
Theorem 2.6.2. If N <l G, then there is a homomorphism f/J of G onto
GIN such that Ker f/J, the kernel of f/J, is N.
Sec. 6 Factor Groups 79
Proof The most natural mapping from G to GIN is the one that
does the trick. Define t/J: G ~ GIN by t/J (a) = [a]. Our product as defined
in GIN makes of t/J a homomorphism, for t/J(ab) = [ab] = [a][b] =
t/J(a)t/J(b). Since every element X E GIN is of the form X = [b] = t/J(b) for
some bEG, t/J is onto. Finally, what is the kernel, Ker t/J, of t/J? By defini
tion, Ker t/J = {a E G I t/J (a) = E}, where E is the unit element of GIN. But
what is E? Nothing other than E = [e] = Ne = N, and a E Ker t/J if and
only if E = N = t/J(a) = Na. But Na = Ntells us that a = ea E Na = N, so
we see that Ker t/J C N. That N C Ker t/Jwhich is easywe leave to the
reader. So Ker t/J = N. D
Theorem 2.6.2 substantiates the remark we made in the preceding sec
tion that every normal subgroup N of G is the kernel of some homomor
phism of G onto some group. The "some homomorphism" is the t/J defined
above and the "some group" is GIN.
This construction of the factor group G by N is possibly the single most
important construction in group theory. In other algebraic systems we shall
have analogous constructions, as we shall see later.
One might ask: Where in this whole affair did the normality of N in G
enter? Why not do the same thing for any subgroup H of G? So let's try and
see what happens. As before, we define
W= {[a]laEG} = {HalaEG}
where the equivalence a '"'' b is defined by ab
1
E H. We try to introduce a
product in W as we did for GIN by defining [a] [b] = [ab]. Is this product well
defined? If h E H, then [hb] = [b], so for the product to be well defined, we
would need that [a][b] = [a][hb], that is, [ab] = [ahb]. This gives us that Hab
= Hahb, and so Ha = Hah; this implies that H = Hahat, whence aha
1
E H.
That is, for all a E G and all h E H, aha
1
must be in H; in other words, H
must be normal in G. So we see that in order for the product defined in W to
be welldefined, H must be a normal subgroup of G.
We view this matter of the quotient group in a slightly different way. If
A, B are subsets of G, let AB = tab Ia E A, b E B}. If H is a subgroup of G,
then HH CHis another way of saying that H is closed under the product
ofG.
Let GIN = {Na Ia E G} be the set of all right cosets of the normal sub
group N in G. Using the product of subsets of G as defined above, what is
(Na)(Nb)? By definition, (Na)(Nb) consists of all elements of the form
(na)(mb), where n, mEN, and so
(na)(mb) = (nama1)(ab) = ntab,
80 Groups Ch.2
where nl = nama
1
is in N, since N is normal. Thus (Na)(Nb) C Nab. On the
other hand, if n E N, then
n(ab) = (na)(eb) E (Na)(Nb),
so that Nab C (Na)(Nb). In short, we have shown that the productas sub
sets of Gof Na and Nb is given by the formula (Na)(Nb) = Nab. All the
other group axioms for GIN, as defined here, are now readily verified from
this product formula.
Another way of seeing that (Na)(Nb) = Nab is to note that by the nor
mality of N, aN = Na, hence (Na)(Nb) = N(aN)b = N(Na)b = NNab =
Nab, since NN = N (because N is a subgroup of G).
However we view GINas equivalence classes or as a set of certain
subsets of Gwe do get a group whose structure is intimately tied to that of
G, via the natural homomorphism l/J of G onto GIN.
We shall see very soon how we combine induction and the structure of
GIN to get information about G.
When G is a finite group and N <J G, then the number of right cosets of
N in G, i
G
(N), is givenas the proof of Lagrange's Theorem showedby
iG(n) = IGI/INI. But this is the order of GIN, which is the set of all the right
cosets of N in G. Thus IGINI = IGII INI. We state this more formally as
Theorem 2.6.3. If G is a finite group and N <J G, then IGINI
IGI/INI·
As an application of what we have been talking about here, we shall
prove a special case of a theorem that we shall prove in its full generality
later. The proof we givefor the abelian caseis not a particularly good
one, but it illustrates quite clearly a general technique, that of pulling back
information about GIN to get information about G itself.
The theorem we are about to prove is due to the great French mathematician
A. L. Cauchy (17891857), whose most basic contributions were in complex
variable theory.
Theorem 2.6.4 (Cauchy). If G is a finite abelian group of order IGI
and p is a prime that divides IG I, then G has an element of order p.
Proof Before getting involved with the proof, we point out to the
reader that the theorem is true for any finite group. We shall prove it in the
general case later, with a proof that will be much more beautiful than the one
we are about to give for the special, abelian case.
We proceed by induction on IGI. What does this mean precisely? We shall
Sec. 6 Factor Groups 81
assume the theorem to be true for all abelian groups of order less than IGI and
show that this forces the theorem to be true for G. If IGI = 1, there is no such p
and the theorem is vacuously true. So we have a starting point for our induction.
Suppose that there is a subgroup (e) =1= N =1= G. Since IN I < IG I, if p di
vides IN I, by our induction hypothesis there would be an element of order p
in N, hence in G, and we would be done. So we may suppose that p (INI.
Since G is abelian, every subgroup is normal, so we can form GIN. Because p
divides IGI and p(INI, and because IGINI = IGI/INI, we have that p
divides IGINI. The group GIN is abelian, since G is (Prove!) and since
N =1= (e), INI > 1, so IGINI = IGI/INI < IGI. Thus, again by induction, there
exists an element in GIN of order p. In other words, there exists an a E G
such that [a]P = [e], but [a] =1= [e]. This translates to a
P
E N, a f!:. N. So if
m = INI, then (aP)m = e. So (am)P = e. If we could show that b = am =1= e,
then b would be the required element of order p in G. But if am = e, then
[a] m = [e], and since [a] has order p, p I m (see Problem 31 of Section 4). But,
by assumption, p (m = INI. SO we are done if G has a nontrivial subgroup.
But if G has no nontrivial subgroups, it must be cyclic of prime order.
(See Problem 16 of Section 3, which you should be able to handle more eas
ily now.) What is this "prime order"? Because p divides IGI, we must have
IG I = p. But then any element a =1= e E G satisfies a
P
= e and is of order p.
This completes the induction, and so proves the theorem. D
We shall have other applications of this kind of grouptheoretic argu
ment in the problems.
The notion of a factor group is a very subtle one, and of the greatest
importance in the subject. The formation of a new set from an old one by
using as elements of this new set subsets of the old one is strange to the neo
phyte seeing this kind of construction for the first time. So it is worthwhile
looking at this whole matter from a variety of points of view. We consider
GIN from another angle now.
What are we doing when we form GIN? Sure, we are looking at equiva
lences classes defined via N. Let's look at it another way. What we are doing
is identifying two elements in G if they satisfy the relation ab1 E N. In a
sense we are blotting out N. So although GIN is not a subgroup of G, we can
look at it as G, with N blotted out, and two elements as equal if they are
equal "up to N."
For instance, in forming TLI N, where 7L is the group of integers and N is
the set of all multiples of 5 in 7L, what we are doing is identifying 1 with 6, 11,
16, 4, 9, and so on, and we are identifying all multiples of 5 with O. The
nice thing about all this is that this identification jibes with addition in 7L
when we go over to 7LIN.
Let's look at a few examples from this point of view.
82 Groups Ch. 2
1. Let G = {T
a
, b Ia =1= 0, b real} (Example 6 of Section 1). Let N =
{ T1, bib real} C G; we saw that N <J G, so it makes sense to talk about GIN.
Now T
a
, band T
a
, °are in the same left coset of N in G, so in GIN we are get
ting an element by identifying T
a
, b with T
a
, 0. The latter element just depends
on a. Moreover, the Ta,b multiply according to Ta,bTc,d = Tac,ad+b and if we
identify Ta,b with Ta,o, Tc,d with Tc,o, then their product, which is Tac,ad+b' is
identified with T
ac
,°. So in GIN multiplication is like that of the group of
nonzero real numbers under multiplication, and in some sense (which will be
made more precise in the next section) GIN can be identified with this group
of real numbers.
2. Let G be the group of real numbers under + and let 7L be the group of in
tegers under +. Since G is abelian, 7L <J G, and so we can talk about GI7L.
What does GI7L really look like? In forming GI7L, we are identifying any two
real numbers that differ by an integer. So 0 is identified with 1, 2,  3, ...
and 1, 2, 3, ... ; ~ is identified with ~ , ~ ,  ~ ,  ~ , .... Every real number a
thus has a mate, ii, where 0 ::::; ii < 1. So, in GI7L, the whole real line has been
compressed into the unit interval [0, 1]. But a little more is true, for we have
also identified the end points of this unit interval. So we are bending the unit
interval around so that its two end points touch and become one. What do
we get this way? A circle, of course! So GI7L is like a circle, in a sense that
can be made precise, and this circle is a group with an appropriate product.
3. Let G be the group of nonzero complex numbers and let N =
{a E Gila I = I} which is the unit circle in the complex plane. Then N is a
subgroup of G and is normal since G is abelian. In going to GIN we are de
claring that any complex number of absolute value 1 will be identified with the
real number 1. Now any a E G, in its polar form, can be written as a =
r(cos () + i sin (), where r = 1a I, and Icos () + i sin ()I = 1. In identifying
cos () + i sin () with 1, we are identifying a with r. So in passing to GIN every
element is being identified with a positive real number, and this identification
jibes with the products in G and in the group of positive real numbers, since
lab 1 = 1a II bl· So GIN is in a very real sense (no pun intended) the group of
positive real numbers under multiplication.
PROBLEMS
1. If G is the group of all nonzero real numbers under multiplication and N
is the subgroup of all positive real numbers, write out GIN by exhibiting
the cosets of N in G, and construct the multiplication in GIN.
2. If G is the group of nonzero real numbers under multiplication and
Sec. 6 Factor Groups 83
N = {I, I}, show how you can "identify" GIN as the group of all
positive real numbers under multiplication. What are the cosets of
Nin G?
3. If G is a group and N <l G, show that if M is a subgroup of GIN and M =
{a E GINa E M}, then M is a subgroup of G, and M ~ N.
4. If M in Problem 3 is normal in GIN, show that the M defined is normal
in G.
5. In Problem 3, show that MIN must equal M.
6. Arguing as in the Example 2, where we identified GIlL as a circle, where
G is the group of reals under + and lL integers, consider the following:
let G = {(a, b) I a, b real}, where + in G is defined by (a, b) + (c, d) =
(a + c, b + d) (so G is the plane), and let N = {(a, b) E G Ia, b are inte
gers}. Show that GIN can be identified as a torus (donut), and so we can
define a product on the donut so that it becomes a group. Here, you may
think of a torus as the Cartesian product of two circles.
7. If G is a cyclic group and N is a subgroup of G, show that GIN is a cyclic
group.
8. If G is an abelian group and N is a subgroup of G, show that GIN is an
abelian group.
9. Do Problems 7 and 8 by observing that GI N is a homomorphic image
ofG.
10. Let G be an abelian group of order p ~ l P 2 2 ... p%k, where Ph P2, ,Pk
are distinct prime numbers. Show that G has subgroups S1, S2, , Sk of
orders p ~ l , P2
2
, ... , p%k, respectively. (Hint: Use Cauchy's Theorem and
pass to a factor group.) This result, which actually holds for all finite
groups, is a famous result in group theory known as Sylow's Theorem.
We prove it in Section 11.
11. If G is a group and Z(G) the center of G, show that if GIZ(G) is cyclic,
then G is abelian.
12. If G is a group and N <l G is such that GIN is abelian, prove that
aba1b
1
E N for all a, bEG.
13. If G is a group and N <l G is such that
for all a, bEG, prove that GIN is abelian.
14. If G is an abelian group of order PIP2 ... Pk, where Ph P2, ... , Pk are
distinct primes, prove that G is cyclic. (See Problem 15.)
84 Groups Ch.2
15. If G is an abelian group and if G has an element of order m and one of
order n, where m and n are relatively prime, prove that G has an element
of order mn.
16. Let G be an abelian group of order pnm, where p is a prime and p (m.
Let P = {a E G Ia
pk
= e for some k depending on a}. Prove that:
(a) P is a subgroup of G.
(b) GIP has no elements of order p.
(c) Ipi = pn.
17. Let G be an abelian group of order mn, where m and n are relatively
prime. Let M = {a E G Iam = e}. Prove that:
(a) M is a subgroup of G.
(b) GIM has no element, x, other than the identity element, such that
x
m
= unit element of GIM.
18. Let G be an abelian group (possibly infinite) and let the set T =
{a E G Iam = e, m > 1 depending on a}. Prove that:
(a) T is a subgroup of G.
(b) GIT has no elementother than its identity elementof finite order.
7. THE HOMOMORPHISM THEOREMS
Let G be a group and cp a homomorphism of G onto G'. If K is the kernel of
cp, then K is a normal subgroup of G, hence we can form GIK. It is fairly nat
ural to expect that there should be a very close relationship between G' and
GIK. The First Homomorphism Theorem, which we are about to prove,
spells out this relationship in exact detail.
But first let's look back at some of the examples of factor groups in
Section 6 to see explicitly what the relationship mentioned above might be.
1. Let G = {T
a
, b Ia =1= 0, b real} and let G' be the group of nonzero reals
under multiplication. From the product rule of these T's, namely
Ta,bTc,d = Tac,ad+b' we determined that the mapping cp: G ~ G' defined
by CP(Ta,b) = a is a homomorphism of G onto G' with kernel K =
{T1, bib real}. On the other hand, in Example 1 of Section 6 we saw that
GIK = {KT
a
,°I a =1= 0 real}. Since
(KTa,o)(KTx,o) = KTax,o
the mapping of GIK onto G', which sends each KTa,o onto a, is readily
seen to be an isomorphism of GIK onto G'. Therefore, GIK::::= G'.
2. In Example 3, G was the group of nonzero complex numbers under multipli
cation and G' the group of all positive real numbers under multiplication.
Sec. 7 The Homomorphism Theorems 85
Let cp: G ~ G' defined by cp (a) = la I for a E G. Then, since Iabl =
IaI Ib I, cp is a homomorphism of G onto G' (can you see why it is onto?).
Thus the kernel K of cp is precisely K = {a E Gila I = I}. But we have al
ready seen that if Ia I = 1, then a is of the form cos () + i sin (). So the set
K = {cos () + i sin () I0 ~ () < 27T}. If a is any complex number, then
a = r(cos () + i sin (), where r = Ia I, is the polar form of a. Thus Ka =
Kr(cos () + i sin () = K(cos () + i sin ()r = Kr, since K(cos () + i sin() = K
because cos () + i sin () E K. So G/K, whose elements are the cosets Ka,
from this discussion, has all its elements of the form Kr, where r > O. The
mapping of G/K onto G' defined by sending Kr onto r then defines an iso
morphism of G/K onto G'. So, here, too, G/K ~ G'.
With this little experience behind us we are ready to make the jump the
whole way, namely, to
Theorem 2.7.1 (First Homomorphism Theorem). Let cp be a homo
morphism of G onto G' with kernel K. Then G' ~ G/K, the isomorphism be
tween these being effected by the map
I/J: G / K ~ G'
defined by I/J(Ka) = cp(a).
Proof The best way to show that G/K and G' are isomorphic is to ex
hibit explicitly an isomorphism of G/K onto G'. The statement of the theo
rem suggests what such an isomorphism might be.
So define I/J: G/K ~ G' by I/J(Ka) = cp(a) for a E G. As usual, our first
task is to show that I/J is well defined, that is, to show that if Ka = Kb, then
I/J(Ka) = I/J(Kb). This boils down to showing that if Ka = Kb, then cp(a) =
cp(b). But if Ka = Kb, then a = kb for some k E K, hence cp(a) = cp(kb) =
cp(k)cp(b). Since k E K, the kernel of cp, then cp(k) = e', the identity element
of G', so we get cp(a) = cp(b). This shows that the mapping I/J is well defined.
Because cp is onto G', given x E G', then x = cp(a) for some a E G,
thus x = cp(a) = I/J(Ka). This shows that I/J maps G/K onto G'.
Is I/J II? Suppose that I/J(Ka) = I/J(Kb); then cp(a) = I/J(Ka) =
I/J(Kb) = cp(b). Therefore, e' = cp(a)cp(b)l = cp(a)cp(b
1
) = cp(ab
1
). Be
cause ab
1
is thus in the kernel of cpwhich is Kwe have ab
1
E K. This
implies that Ka = Kb. In this way I/J is seen to be 11.
Finally, is I/J a homomorphism? We check: 1/J((Ka)(Kb» = I/J(Kab) =
cp(ab) = cp(a)cp(b) = I/J(Ka)I/J(Kb), using that cp is a homomorphism and
that (Ka)(Kb) = Kab. Consequently, I/J is a homomorphism of G/K onto G',
and Theorem 2.7.1 is proved. D
86 Groups Ch.2
Having talked about the First Homomorphism Theorem suggests that
there are others. The next result, however, is an extension of the First
Homomorphism Theorem, and is traditionally called the Correspondence
Theorem. In the context of the theorem above, it exhibits a 11 correspon
dence between subgroups of G
I
and those subgroups of G that contain K.
Theorem 2.7.2 (Correspondence Theorem). Let the map q;: G ~ G' be
a homomorphism of G onto G' with kernel K. If H' is a subgroup of G' and if
H = {a E GI q;(a) E H'},
then H is a subgroup of G, H :> K, and HIK == H'. Finally, if H' <J G
'
, then
H<J G.
Proof We first verify that the H above is a subgroup of G. It is not
empty, since e E H. If a, b E H, then q;(a), q;(b) E H', hence q;(ab) =
q;(a)q;(b) E H', since H' is a subgroup of G'; this puts ab in H, so H is
closed. Further, if a E H, then q;(a) E H', hence q;(a
I
) = q;(a)I is in H',
again since H' is a subgroup of G', whence aI E H. Therefore, H is a sub
group of G.
Because q;(K) = {e'} C H', where e' is the unit element of G', we have
that K C H. Since K <J G and K C H, it follows that K <J H. The mapping q;
restricted to H defines a homomorphism of H onto H' with kernel K. By the
First Homomorphism Theorem we get HIK == H'.
Finally, if H' <J G' and if a E G, then q;(a)IH' q;(a) C H', so
q;(aI)H'q;(a) C H'. This tells us that q;(aIHa) C H', so aIHa C H. This
proves the normality of H in G. D
It is worth noting that if K is any normal subgroup of G, and q; is the
natural homomorphism of G onto G/K, then the theorem gives us a 11 cor..
respondence between all subgroups H' of G/K and those subgroups of G
that contain K. Moreover, this correspondence preserves normality in the
sense that H' is normal in G/K if and only if H is normal in G. (See Problem
7, as well as the last conclusion of the theorem.)
We now state the Second Homomorphism Theorem, leaving its proof
to the reader in Problem 5.
Theorem 2.7.3 (Second Homomorphism Theorem). Let H be a sub
group of a group G and N a normal subgroup of G. Then HN =
{hn Ih E H, n EN} is a subgroup of G, H n N is a normal subgroup of H,
and HI(H n N) =: (HN)/N.
Sec. 7 The Homomorphism Theorems 87
Finally, we go on to the Third Homomorphism Theorem, which tells us
a little more about the relationship between Nand N' when N' <J G'.
Theorem 2.7.4 (Third Homomorphism Theorem). If the map
cp: G ~ G' is a homomorphism of G onto G' with kernel K then, if N' <J G'
and N = {a E G Icp(a) EN'}, we conclude that GIN ~ G'IN'. Equivalently,
GIN ~ (GIK)/(NIK).
Proof Define the mapping l/J: G ~ G'IN' by l/J(a) = N'cp(a) for every
a E G. Since cp is onto G' and every element of G'IN' is a coset of the form
N'x', and x' = cp(x) for some x E G, we see that l/Jmaps G onto G'IN'.
Furthermore, l/J is a homomorphism of G onto G'IN', for l/J(ab) =
N' cp(ab) = N' cp(a)cp(b) = (N' cp(a»(N' cp(b» = l/J(a)l/J(b), since N' <J G'.
What is the kernel, M, of l/J? If a E M, then l/J(a) is the unit element of
G'IN', that is, l/J(a) = N'. On the other hand, by the definition of l/J, l/J(a) =
N' cp(a). Because N' cp(a) = N' we must have cp(a) EN'; but this puts a in N,
by the very definition of N. Thus MeN. That N C M is easy and is left to
the reader. Therefore, M = N, so l/J is a homomorphism of G onto G'IN'
with kernel N, whence, by the First Homomorphism Theorem, GIN ~ G'IN'.
Finally, again by Theorems 2.7.1 and 2.7.2, G' ~ GIK, N' ~ NIK, which
leads us to GIN ~ G'IN' ~ (GIK)/(NIK). D
This last equality is highly suggestive; we are sort of "canceling out" the
K in the numerator and denominator.
PROBLEMS
1. Show that M ~ N in the proof of Theorem 2.7.3.
2. Let G be the group of all realvalued functions on the unit interval [0, 1],
where we define, for I, g E G, addition by (I + g)(x) = I(x) + g(x) for
every x E [0,1]. If N = {I E G II ( ~ ) = O}, prove that GIN ~ real num
bers under +.
3. Let G be the group of nonzero real numbers under multiplication and let
N = {1, 1}. Prove that GIN ~ positive real numbers under multiplication.
4. If G
b
G
2
are two groups and G = G
l
X G
2
= {(a, b) Ia E G
l
, b E G
2
},
where we define (a, b)(c, d) = (ac, bd), show that:
(a) N = {(a, e2) Ia E G
l
}, where e2 is the unit element of G
2
, is a normal
subgroup of G.
(b) N ~ G
l
.
(c) GIN ~ G
2
•
88 Groups Ch.2
5. Let G be a group, H a subgroup of G, and N <J G. Let the set HN =
{hn I h E H, n EN}. Prove that:
(a) H n N <J H.
(b) HN is a subgroup of G.
(c) N C HN and N <J HN.
(d) (HN)IN HI(H n N).
*6. If G is a group and N <J G, show that if a E G has finite order o(a), then
Na in GIN has finite order m, where m lo(a). (Prove this by using the
homomorphism of G onto GIN.)
7. If q; is a homomorphism of G onto G' and N <J G, show that q;(N) <J G'.
8. CAUCHY'S THEOREM
In Theorem 2.6.4Cauchy's Theoremwe proved that if a prime p divides
the order of a finite abelian group G, then G contains an element of order p.
We did point out there that Cauchy's Theorem is true even if the group is not
abelian. We shall give a very neat proof of this here; this proof is due to
McKay.
We return for a moment to set theory, doing something that we men
tioned in the problems in Section 4.
Let S be a set, f E A(S), and define a relation on S as follows: s '"'' t if
t = fi(S) for some integer i (i can be positive, negative, or zero). We leave it
to the reader as a problem that this does indeed define an equivalence rela
tion on S. The equivalence class of s, [s], is called the orbit of sunder f So S
is the disjoint union of the orbits of its elements.
When fis of order p, p a prime, we can say something about the size of
the orbits under f; those of the readers who solved Problem 34 of Section 4
already know the result. We prove it here to put it on the record officially.
[Iffk(s) = s, of courseftk(s) = s for every integer t. (Prove!)]
Lemma 2.8.1. If f E A(S) is of order p, p a prime, then the orbit of
any element of Sunder fhas 1 or p elements.
Proof Let s E S; iff(s) = s, then the orbit of sunder fconsists merely
of s itself, so has one element. Suppose then that f(s) =1= s. Consider the ele
ments s,f(s),f
2
(s), ... ,f
p

1
(s); we claim that thesep elements are distinct
and constitute the orbit of sunder f If not, then fi(S) = fj(s) for some
o i < j p  1, which gives us that fji(S) = s. Let m = j  i; then
0< m  1 andfm(s) = s. ButfP(s) = s and ap + bm = 1 for
some integers a and b. Thus fl(S) = fap+bm(s) = fap(fbm(s» = fap(s) = S,
Sec. 8 Cauchy's Theorem 89
since fm(s) = fP(s) = s. This contradicts that f(s) =1= s. Thus the orbit of s
under f consists of s, f(s), f
2
(s), ... ,f
P
1 (s), so as p elements. D
We now give McKay's proof of Cauchy's Theorem.
Theorem 2.8.2 (Cauchy). If p is a prime and p divides the order of G,
then G contains an element of order p.
Proof If p = 2, the result amounts to Problem 18 in Section 1. Assume
that p =1= 2. Let S be the set of all ordered ptuples (ab a2, ... , apb a
p
), where
ab a2' ... ' a
p
are in G and where ala2 ... apla
p
= e. We claim that S has n
P

l
elements where n = IGI. Why? We can choose ab , apl arbitrarily in G,
and by putting a
p
= (ala2 ... apl)t, the ptuple (ab a2, , apb a
p
) then satisfies
so is in S. Thus S has n
P
1 elements.
Note that if ala2 ... ap1a
p
= e, then a
p
ala2 ... apl = e (for if xy = e in
a group, then yx = e). So the mapping f: S ~ S defined by f(ab ... , a
p
) =
(a
p
, aI, a2,.··, apl) is in A(S). Note thatf =1= i, the identity map on S, and
that fP = i, so f is of order p.
If the orbit of sunder fhas one element, then f(s) = s. On the other
hand, if f(s) =1= s, we know that the orbit of sunder f consists precisely of p
distinct elements; this we have by Lemma 2.8.1. Now when is f(s) =1= s? We
claim that f(s) =1= s if and only if when s = (aI' a2, ... , a
p
), then for some
i =1= j, ai =1= aj. (We leave this to the reader.) So f(s) = s if and only if s =
(a, a, ... , a) for some a E G.
Let m be the number of s E S such that f(s) = s; since for s =
(e, e, ... , e), f(s) = s, we know that m 2:: 1. On the other hand, if f(s) =1= s,
the orbit of s consists of p elements, and these orbits are disjoint, for they are
equivalence classes. If there are k such orbits where f(s) =1= s, we get that
n
P

1
= m + kp, for we have accounted this way for every element of S.
But pin by assumption and p I (kp). So we must have p Im, since m =
n
P

l
 kp. Because m =1= 0 and p Im, we get that m > 1. But this says that
there is an s = (a, a, ... , a) =1= (e, e, ... , e) in S; from the definition of S this
implies that a
P
= e. Since a =1= e, a is the required element of order p. D
Note that the proof tells us that the number of solutions in G of x
P
= e
is a positive multiple of p.
We strongly urge the reader who feels uncomfortable with the proof
just given to carry out its details for p = 3. In this case the action of f on S
becomes clear and our assertions about this action can be checked explicitly.
90 Groups Ch.2
Cauchy's Theorem has many consequences. We shall present one of
these, in which we determine completely the nature of certain groups of
order pq, where p and q are distinct primes. Other consequences will be
found in the problem set to follow, and in later material on groups.
Lemma 2.8.3. Let G be a group of order pq, where p, q are primes
and p > q. If a EGis of order p and A is the subgroup of G generated by a,
then A <J G.
Proof We claim that A is the only subgroup of G of order p. Suppose
that B is another subgroup of order p. Consider the set AB ==
{xy Ix E A, y E B}; we claim that AB has p2 distinct elements. For suppose
that xy == uv where x, u E A, y, v E B; then ulx == vyl. But ulx E A,
vyl E B, and since ulx == vyt, we have ulx E A n B. Since B =1= A and
A n B is a subgroup of A and A is of prime order, we are forced to conclude
that A n B == (e) and so ulx == e, that is, u == x. Similarly, v == y. Thus the
number of distinct elements in AB is p2. But all these elements are in G,
which has only pq < p2 elements (since p > q). With this contradiction we
see that B == A and A is the only subgroup of order p in G. But if x E G, B ==
xlAx is a subgroup of G of order p, in consequence of which we conclude
that xlAx == A; hence A <J G. D
Corollary. If G, a are as in Lemma 2.8.3 and if x E G, then
xlax == ai, where 0 < i < p, for some i (depending on x).
Proof Since e =1= a E A and xlAx == A, xlax E A. But every element
of A is of the form ai, 0 ::; i < p, and xlax =1= e. In consquence, xlax == ai,
where 0 < i < p. D
We now prove a result of a different flavor.
Lemma 2.8.4. If a EGis of order m and bEG is of order n, where
m and n are relatively prime and ab == ba, then c == ab is of order mn.
Proof Suppose that A is the subgroup generated by a and B that gener
ated by b. Because IAI == m and IBI == nand (m, n) == 1, we get A n B == (e),
which follows from Lagrange's Theorem, for IA n B I Inand IA n B II m.
Suppose that c
i
== e, where i > 0; thus (ab)i == e. Since ab == ba, e ==
(ab)i == aib
i
; this tells us that a
i
== b
i
E A n B == (e). So a
i
== e, whence m I i,
and b
i
== e, whence n I i. Because (m, n) == 1 and m and n both divide i, mn
divides i. So i 2:: mn. Since (ab )mn == a
mn
b
mn
== e, we see that mn is the small
est positive integer i such that (ab)i == e. This says that ab is of order mn, as
claimed in the lemma. D
Sec. 8 Cauchy's Theorem 91
Before considering the more general case of groups of order pq, let's
look at a special case, namely, a group G of order 15. By Cauchy's Theorem,
G has elements b of order 3 and a of order 5. By the Corollary to Lemma
2.8.3, b1ab = ai, where 0 < i < 5. Thus
b
2
ab
2
= b1(b1ab)b = b1aib = (b1ab)i = (ai)i = a
i2
and similarly, b
3
ab
3
= a
i3
. But b
3
= e, so we get a
i3
= a, whence a
i3

1
= e.
Since a is of order 5, 5 must divide i
3
 1, that is, i
3
== 1(5). However, by Fer
mat's Theorem (Corollary to Theorem 2.4.8), i
4
== 1(5). These two equations
for i tell us that i == 1(5), so, since 0 < i < 5, i = 1. In short, b1ab = a
i
= a,
which means that ab = ba. Since a is of order 5 and b of order 3, by Lemma
2.8.4, c = ab is of order 15. This means that the 15 powers e = co, c, c
2
, ••• ,
C
14
are distinct, so must sweep out all of G. In a word, G must be cyclic.
The argument given for 15 could have been made shorter, but the form
in which we did it is the exact prototype for the proof of the more general
Theorem 2.8.5. Let G be a group of order pq, where p, q are primes
and p > q. If q (p  1, then G must be cyclic.
Proof By Cauchy's Theorem, G has an element a of order p and an
element b of order q. By the Corollary to Lemma 2.8.3, b1ab = a
i
for some i
with 0 < i < p. Thus brab
r
= air for all r 2:: 0 (Prove!), and so bqab
q
= a
iq
.
But b
q
= e; therefore, a
iq
= a and so a
iq

1
= e. Because a is of order p, we
conclude that p Ii
q
 1, which is to say, i
q
== l(p). However, by Fermat's
Theorem, i
P

1
== l(p). Since q (p  1, we conclude that i == l(p), and since
o < i < p, i = 1 follows. Therefore, b1ab = a
i
= a, hence ab = ba. By
Lemma 2.8.4, c = ab has order pq, so the powers of c sweep out all of G.
Thus G is cyclic, and the theorem is proved. D
PROBLEMS
MiddleLevel Problems
1. In the proof of Theorem 2.8.2, show that if some two entries in s
(ab a2, ... , a
p
) are different, then I(s) =1= s, and the orbit of sunder 1
has p elements.
2. Prove that a group of order 35 is cyclic.
3. Using the result of Problem 40 of Section 5, give another proof of
Lemma 2.8.3. (Hint: Use for H a subgroup of order p.)
4. Construct a nonabelian group of order 21. (Hint: Assume that a
3
= e,
92 Groups Ch. 2
b
7
= e and find some i such that a1ba = a
i
=1= a, which is consistent with
the relations a
3
= b
7
= e.)
5. Let G be a group of order pnm, where p is prime and p ~ m. Suppose that
G has a normal subgroup P of order pn. Prove that O(P) = P for every
automorphism 0 of G.
6. Let G be a finite group with subgroups A, B such that IAI > VTGI and
IBI > VTGI. Prove that A n B =1= (e).
7. If G is a group with subgroups A, B of orders m, n, respectively, where
m and n are relatively prime, prove that the subset of G,
AB = {ab Ia E A, b E B}, has mn distinct elements.
8. Prove that a group of order 99 has a nontrivial normal subgroup.
9. Prove that a group of order 42 has a nontrivial normal subgroup.
10. From the result of Problem 9, prove that a group of order 42 has a nor
mal subgroup of order 21.
Harder Problems
11. If G is a group and A, B finite subgroups of G, prove that the set AB =
tab Ia E A, b E B} has (IA liB1)/ IA n B Idistinct elements.
12. Prove that any two nonabelian groups of order 21 are isomorphic. (See
Problem 4.)
Very Hard Problems
13. Using the fact that any group of order 9 is abelian, prove that any group
of order 99 is abelian.
14. Let p > q be two primes such that q I p  1. Prove that there exists a
nonabelian group of order pq. (Hint: Use the result of Problem 40 of
Section 4, namely that Up is cyclic if p is a prime, and the idea needed to
do Problem 4 above.)
15. Prove that if p > q are two primes such that q I p  1, then any two non
abelian groups of order pq are isomorphic.
9. DIRECT PRODUCTS
In several of the problems and examples that appeared earlier, we went
through the following construction: If G1, G2 are two groups, then G =
G
1
X G
2
is the set of all ordered pairs (a, b), where a E G
1
and b E G
2
and
Sec. 9 Direct Products 93
where the product was defined componentwise via (aI, b
1
)(a2' b
2
)
(a1a2' b
1
b
2
), the products in each component being carried out in the respec
tive groups G
1
and G
2
• We should like to formalize this procedure here.
Definition. If G
1
, G
2
, ... , G
n
are n groups, then their (external) di
rect product G
1
X G
2
X G
3
X ... X G
n
is the set of all ordered ntuples
(ab a2' ... ' an) where ai E G
i
, for i = 1,2, ... , n, and where the product in
G
1
X G
2
X ... X G
n
is defined componentwise, that is,
That G = G
1
X G
2
X ... X G
n
is a group is immediate, with
(e1' e2' ... , en) as its unit element, where ei is the unit element of Gi, and
h ( )
1  (1 1 1)
were aba2, ... ,an  a1 ,a2 , ... ,a
n
.
G is merely the Cartesian product of the groups G1, G2, ... , Gn with a
product defined in G by componentwise multiplication. We call it external,
since the groups G
1
, G
2
, ••• , G
n
are any groups, with no relatioh necessarily
holding among them.
Consider the subsets G
j
C G
1
X G
2
x·· . X G
n
= G, where
in other words, G
j
consists of all ntuples where in the ith component any el
ement of G
i
can occur and where every other component is the identity ele
ment. Clearly, G
i
is a group and is isomorphic to G
i
. by the isomorphism
1Ti: G
j
~ G
i
defined by 1Ti (e1' e2' ... , ai' ... , en) = a
i
. Furthermore, not
only is G
j
a subgroup of G but G
j
<J G. (Prove!)
Given any element a = (ab a2' ... , an) E G, then
that is, every a E G can be written as a = a
l
a
2
· .. an, where each a
i
E G
i
•
Moreover, a can be written in this way in a unique manner, that is, if a =
a
l
a2· . · an = b
l
b
2
· · · b
n
, where the a
i
E G
i
and b
i
E G
i
, then
a
l
= b
i
, ... ,an = b
n
. So G is built up from certain normal subgroups, the
G
j
, as G = G
I
G
2
• • • G
n
in such a way that every element a E G has a
unique representation in the form a = a
l
a2 . · · an with a
j
E G
j
•
This motivates the following
Definition. The group G is said to be the (internal) direct product of
its normal subgroups N
b
N
2
, . .. ,N
n
if every a E G has a unique representa
tion in the form a = ala2 ... an' where each ai E N
i
for i = 1,2, ... ,n.
From what we have discussed above we have the
94 Groups Ch.2
Lemma 2.9.1. If G = G
I
X G
2
X ... X G
n
is the external direct prod
uct of G
I
, G
2
, ••• , G
n
, then G is the internal direct product of the normal
subgroups G
b
G
2
, ••• , G
n
defined above.
We want to go in the other direction, namely to prove that if G is the in
ternal direct product of its normal subgroups N
b
N
2
, .•• , N
n
, then G is iso
morphic to N
I
X N
2
X ... X N
n
• To do so, we first get some preliminary results.
The result we are about to prove has already occurred as Problem 20,
Section 5. For the sake of completeness we prove it here.
Lemma 2.9.2. Let G be a group, M, N normal subgroups of G such
that M n N = (e). Then, given m E M and n E N, mn = nm.
Proof Consider the element a = mnmIn
I
. Viewing a as bracketed
one way, a = (mnmI)n
I
; then, since N <J G and n E N, mnm
I
E N, so
a = (mnmI)n
I
is also in N. Now bracket a in the other way, a = m(nmIn
I
).
Since M <J G and m
I
E M, we have nmIn
I
E M and so a =
m(nmIn
I
) E M. Thus a E M n N = (e), which is to say, mnmIn
I
= e.
This gives us that mn = nm, as required. D
If G is the internal direct product of the normal subgroups N I,
N
2
, ••• , N
n
, we claim that N
i
n N
j
= (e) for i =1= j. For suppose that
a E N
i
n N
j
; then a = e . e . · . eae ... e, where the a occurs in the ith place.
This gives us one representation of a in G = N
I
N
2
••• N
n
• On the other hand,
a = e . e ... e . a . e ... e, where the a occurs in the jth place, so a has the second
representation as an element of N
I
N
2
••• N
n
• By the uniqueness of the repre
sentation, we get a = e, and so N
i
n N
j
= (e).
Perhaps things would be clearer if we do it for n = 2. So suppose that
N
I
<J G, N
2
<J G, and every element a E G has a unique representation as
a = al . a2, where al E N
I
, a2 E N
2
. Suppose that a E N
I
n N
2
; then a =
a . e is a representation of a = al . a2 with al = a E N
l
, a2 = e E N
2
. How
ever a = e . a, so a = b
l
. b
2
, where b
l
= e E N
b
b
2
= a E N
2
. By the
uniqueness of the representation we must have aI = bl, that is, a = e.
So N
I
n N
2
= (e).
The argument given above for N
b
... , N
n
is the same argument as that
given for n = 2, but perhaps is less transparent. At any rate we have proved
Lemma 2.9.3. If G is the internal direct product of its normal sub
groups N
b
N
2
, .•. ,N
n
, then, for i =1= j, N
i
n N
j
= (e).
Corollary. If G is as in Lemma 2.9.3, then if i =1= j and ai E N
i
and
aj E N
j
, we have aiaj = ajai'
Sec. 9 Direct Products 95
Proof By Lemma 2.9.3, N
i
n N
j
= (e) for i =1= j. Since the N's are nor
mal in G, by Lemma 2.9.2 we have that any element in N
i
commutes with any
element in N
j
, that is, aiaj = aja
i
for ai E N
i
, aj E N
j
. 0
With these preliminaries out of the way we can now prove
Theorem 2.9.4. Let G be a group with normal subgroups N
I
, N
2
, ••• ,
N
n
. Then the mapping t/J(ab a2, ... , an) = aIa2 ... an is an isomorphism
from N
I
X N
2
X ... X N
n
(external direct product) onto G if and only if G is
the internal direct product of N I, N 2 , ••• , N n.
Proof Suppose G is an internal direct product of N b ... , N
n
• Since
every element a in G has a representation a = ala2 ... an, with the ai E N
i
,
we have that the mapping t/J is onto. We assert that it is also 11. For if
t/J((ab a2'···' an» = t/J((b
1
, b
2
,···, b
n
», then by the definition of t/J,
aIa2 ... an = b
I
b
2
... b
n
. By the uniqueness of the representation of an ele
ment in this form we deduce that al = b
h
a2 = b
2
, ... ,an = b
n
. Hence t/Jis 11.
All that remains is to show that t/J is a homomorphism. So, consider
t/J((a
b
a
2
, ... ,an)(b
1
, b
2
, ••• ,b
n
» = t/J((a1b
b
a
2
b
2
, ... ,anb
n
»
= (a
1
b
1
)(a
2
b
2
) • • • (anb
n
)
Since b
i
E N
b
it commutes with ai' b
i
for i > 1 by the Corollary to Lemma
2.9.3. So we can pull the b
i
across all the elements to the right of it to get
aIbIa2b2· .. anb
n
= aIa2b2a3b3 anbnb
l
. Now repeat this procedure with b
2
,
and so on, to get that a
I
b
I
a
2
b
2
anb
n
= (aIa2 .. · a
n
)(b
I
b
2
... b
n
). Thus
t/J((a
1
, a2,· · . ,a
n
)(b
1
, b
2
,· .. ,b
n
» = a
1
b
1
a
2
b
2. · · anb
n
= (a a ... a )(b b ... b )
1 2 n 1 2 n
= t/J((a
b
a
2
, ... ,a
n
»t/J((b
1
b
2
,· .. ,b
n
»·
In other words, t/J is a homomorphism.
On the other hand, suppose that t/J is an isomorphism. Then the conclu
sion that G is the internal direct product of N I, N 2, ... , N
n
easily follows
from the fact that t/Jis onto and 11.
With this the proof of Theorem 2.9.4 is complete. D
Corollary. Let G be a group with normal subgroups N
I
, N
2
• Then G
is the internal direct product of N I and N 2 if and only if G = N 1N 2 and
N
I
n N
2
= (e).
96 Groups Ch.2
Proof This follows easily from the fact that l/J: N
l
X N
2
~ G, which is
given by l/J(a}, a2) = aIa2, is an isomorphism if and only if N
I
N
2
= G and
N
I
n N
2
= (e). 0
In view of the result of Theorem 2.9.4 and its corollary, we drop the ad
jectives "internal" and "external" and merely speak about the "direct prod
uct." When notation G = N
I
X N
2
is used it should be clear from context
whether it stands for the internal or external direct product.
The objective is often to show that a given group is the direct product
of certain normal subgroups. If one can do this, the structure of the group
can be completely determined if we happen to know those of the normal
subgroups.
PROBLEMS
1. If G
I
and G
2
are groups, prove that G
I
X G
2
= G
2
X G
I
•
2. If G
l
and G
2
are cyclic groups of orders m and n, respectively, prove that
GI X G2 is cyclic if and only if m and n are relatively prime.
3. Let G be a group, A = G X G. In A let T = {(g, g) Ig E G}.
(a) Prove that T = G.
(b) Prove that T <J A if and only if G is abelian.
4. Let G be an abelian group of order p'{'lp'{'2 . . · p'!:k, where PI, P2'···' Pk
are distinct primes and mi > 0, m2 > 0, ... , mk > O. By Problem 10 of
Section 6, for each i, G has a subgroup Pi of order pr'. Show that
G = PI X P
2
X ... X P
k
•
5. Let G be a finite group, N
I
, N
2
, ••• , N
k
normal subgroups of G such that
G = N
I
N
2
••• N
k
and IGI = IN
I
IIN
2
1·· ·INkl. Prove that G is the direct
product of N}, N
2
, ••• , N
k
•
6. Let G be a group, N I, N2, ... , N
k
normal subgroups of G such that:
1. G = N
I
N
2
• •• N
k
.
2. For each i, N
i
n (N
I
N
2
••• NiIN
i
+
1
••• N
k
) = (e).
Prove that G is the direct product of N}, N
2
, ••• , N
k
.
10. FINITE ABELIAN GROUPS (OPTIONAL)
We have just finished discussing the idea of the direct product of groups. If
we were to leave that topic at the point where we ended, it might seem like
a nice little construction, but so what? To give some more substance to it,
Sec. 10 Finite Abelian Groups (Optional) 97
we should prove at least one theorem which says that a group satisfying a
certain condition is the direct product of some particularly easy groups. For
tunately, such a class of groups exists, the finite abelian groups. What we
shall prove is that any finite abelian group is the direct product of cyclic
groups. This reduces most questions about finite abelian groups to questions
about cyclic groups, a reduction that often allows us to get complete an
swers to these questions.
The results on the structure of finite abelian groups are really special
cases of some wider and deeper theorems. To consider these would be going
too far afield, especially since the story for finite abelian groups is so impor
tant in its own right. The theorem we shall prove is called the Fundamental
Theorem on Finite Abelian Groups, and rightfully so.
Before getting down to the actual details of the proof, we should like to
give a quick sketch of how we shall go about proving the theorem.
Our first step will be to reduce the problem from any finite abelian
group to one whose order is pn, where p is a prime. This step will be fairly
easy to carry out, and since the group will have order involving just one
prime, the details of the proof will not be cluttered with elements whose or
ders are somewhat complicated.
So we shall focus on groups of order pn. Let G be an abelian group of
order pn. We want to show that there exist cyclic subgroups of G, namely
A I, A 2, ... , A k, such that every element x E G can be written as x =
b
I
b
2
· .. b
k
, where each b
i
E Ai' in a unique way. Otherwise put, since
each Ai is cyclic and generated by ai, say, we want to show that x =
a';la'{'2 ... a7:\ where the elements a'r
i
are unique.
A difficulty appears right away, for there is not just one choice for these
elements aI, ... , ak. For instance, if G is the abelian group of order 4 with
elements e, a, b, ab, where a
2
= b
2
= e and ab = ba, then we can see that if
A, B, C are the cyclic subgroups generated by a, b, and ab, respectively, then
G = A X B = A X C = B X C. So there is a lack of uniqueness in the choice of
the ai. How to get around this?
What we need is a mechanism for picking al and which, when applied
after we have picked aI, will allow us to pick a2, and so on. What should this
mechanism be? Our control on the elements of G lies only in specifying their
orders. It is the order of the elementwhen properly usedthat will give us
the means to prove the theorem.
Suppose that G= Al X A
2
X ... X A
k
, where IGI = pn and the A's
have been numbered, so that IAil = pn
1
and nl ;::: n2 ;::: ... ;::: nk, and each Ai
is cyclic generated by ai. If this were so and x = a';l ... a7:\ then
98 Groups Ch.2
because nl 2:: ni, pnil pnt, so since every a'r
lpfl
, = e, thus x
pfl1
= e. In other
words, al should be an element of G whose order is as large as it can possibly
be. Fine, we can now pick al. What do we do for a2? If G = G/A
b
then to
get the first element needed to represent G as a direct product of cyclic
groups, we should pick an element in G whose order is maximal. What does
this translate into in G itself? We want an element a2 such that a2 requires as
high a power as possible to fall into A 1. So that will be the road to the selec
tion of the second element. However, if we pick an element a2 with this
property, it may not do the trick; we may have to adapt it so that it will. The
doing of all this is the technical part of the argument and does go through.
Then one repeats it appropriately to find an element a3 , and so on.
This is the procedure we shall be going through to prove the theorem.
But to smooth out these successive choices of aI, a2,.'" we shall use an in
duction argument and some subsidiary preliminary results.
With this sketch as guide we hope the proof of the theorem will make
sense to the reader. One should not confuse the basic idea in the proof
which is quite reasonablewith the technical details, which may cloud the
issue. So we now begin to fill in the details of the sketch of the proof that we
outlined above.
Lemma 2.10.1. Let G be a finite abelian group of order mn, where m
and n are relatively prime. If M = {x E G Ix
m
= e} and N = {x E G Ix
n
= e},
then G = M X N. Moreover, if neither m nor n is 1, then M =1= (e) and N =1= (e).
Proof The sets M and N defined in the assertion above are quickly
seen to be subgroups of G. Moreover, if m =1= 1, then by Cauchy's Theorem
(Theorem 2.6.4) we readily obtain M =1= (e), and similarly if n =1= 1, that
N =1= (e). Furthermore, since M n N is a subgroup of both M and N, by La
grange's Theorem, 1M n NI divides IMI = m and INI = n. Because m and n
are relatively prime, we obtain 1M n NI = 1, hence M n N= (e).
To finish the proof, we need to show that G = MN and G = M X N.
Since m and n are relatively prime, there exist integers rand s such that
rm + sn = 1. If a E G, then a = a
l
= a
sn
+
rm
= asna
rm
; since (asn)m =
a
snm
= e, we have that a
sn
E M. Similarly, arm E N. Thus a = asna
rm
is in
MN. In this way G = MN. It now follows from Corollary to Theorem 2.9.4
that G = M X N. D
An immediate consequence is the
Corollary. Let G be a finite abelian group and let p be a prime such
that p divides IGI. Then G = P X T for some subgroups P and T, where
/PI = pm, m > 0, and ITI is not divisible by p.
Sec. 10 Finite Abelian Groups (Optional) 99
Proof Let P = {x E G I x
Ps
= e for some s} and let the subset
T = {x E G I Xl = e for t relatively prime to p}. By Lemma 2.10.1, G = P X T
and P =1= (e). Since every element in P has order a power ofp, Ipi is not divis
ible by any other prime (by Cauchy's Theorem), so Ipi = pm for some m.
It is easy to see that p tITI by making use of Lagrange's Theorem. Thus
we really have that P is not merely some subgroup of G but is what is called
apSylow subgroup of G. (See Section 11).0
We now come to the key step in the proof of the theorem we seek.
The proof is a little difficult, but once we have this result the rest will be
easy.
Theorem 2.10.2. Let G be an abelian group of order pn, p a prime,
and let a E G have maximal order of all the elements in G. Then G =
A X Q, where A is the cyclic subgroup generated by a.
Proof We proceed by induction on n. If n = 1, then IG I = p and G is
already a cyclic group generated by any a =1= e in G.
We suppose the theorem to be true for all m < n. We first show that
the theorem is correct if there exists an element bEG such that b tt. A = (a)
and b
P
= e. Let B = (b), the subgroup of G generated by b; thus
A n B = (e) (see Problem 1).
Let G= G/B; by assumption B =1= (e), hence IGI < IGI. In G, what is
the order of a = Ba? We claim that o(a) = o(a). To begin with, we know
that o(a) Io(a) (see Problem 6 of Section 2.7). On the other hand, ao(a) = e,
so ao(a) E B. Since ao(a) E A, we see that ao(a) E A n B = (e), whence
ao(a) = e. This tells us that o(a) I o(a). Hence o(a) = o(a).
Since a is an element of maximal order in G, by the induction we know
that G = (a) X T for some subgroup T of G. By the Correspondence
Theorem we also know that T = Q/B for some subgroup Qof G. We claim
that G is the internal direct product A X Q. That G = AQ is left to
the reader. It remains to show that A n Q = (e). Let a
i
E A n Q. Then
a
i
E Q/B = T, and since (a) n T = (e), we have that a
i
=e. But since o(a) =
o(a), this implies a
i
= e. Therefore, A n Q = (e) and we obtain that
G = A X Q.
Suppose, then, that there is no element b in G, b not in A, such that
b
P
= e. We claim that this forces G = A = (a), in which case G is a cyclic
group. Suppose that G =1= A and let x E G, x f!:. A have smallest possible
order. Because o(x
P
) < o(x), we have, by our choice of x, that x
P
E A, hence
x
P
= a
i
for some i.
We claim that p I i. Let o(a) = pS, and note that the maximality
100 Groups Ch.2
of the order of a implies that x
Ps
= e. But x
Ps
= (XP)pSl = (ai)pSl = e. Since
o(a) = pS, we have p I i.
Thus x
P
= ai, where p Ii. Let y = auP. x. Then yP = aix
p
= aia
i
= e.
Moreover, y f£ (a) = A, because x f£ A. But this puts us back in the situation
discussed above, where there exists abE G, b f£ A such that b
P
= e; in that
case we saw that the theorem was correct. So we must have G = (a), and G
is a cyclic group. This finishes the induction and proves the theorem. D
We are now able to prove the very basic and important
Theorem 2.10.3 (Fundamental Theorem on Finite Abelian Groups).
A finite abelian group is the direct product of cyclic groups.
Proof Let G be a finite abelian group and p a prime that divides IG I.
By the Corollary to Lemma 2.10.1, G = P X T, where IPI = pn. By Theorem
2.10.2, P = Al X A
2
X ... X A
k
, where the Ai are cyclic subgroups of P. Ar
guing by induction on IGI, we may thus assume that T = T
I
X T
2
X ... X T
q
,
where the T
i
are cyclic subgroups of T. Thus
G = (AI X A
2
X X A
k
) X (T
1
X T
2
X X T
q
)
= Al X A
2
X X A
k
X T
I
X T
2
X X T
q
•
This very important theorem is now proved. D
We return to abelian groups G of order pn. We now have at hand that
G = Al X A
2
X ... X A
k
, where the Ai are cyclic groups of order pn,. We
can arrange the numbering so that n1 2:: n2 2:: ... 2:: nk· Also, IGI =
IA
I
X A
2
X ... X A
k
I = IA
I
IIA
2
/· • ·IA
k
I, which gives us that
hence n = ni + n2 + ... + nk. Thus the integers ni 2:: 0 give us a partition of
n. It can be shown that these integers n1, n2, ... , nkwhich are called the
invariants of Gare unique. In other words, two abelian groups of order pn
are isomorphic if and only if they have the same invariants. Granted this, it
follows that the number of nonisomorphic abelian groups of order pn is equal
to the number of partitions of n.
For example, if n = 3, it has the following three partitions: 3 = 3, 3 =
2 + 1, 3 = 1 + 1 + 1, so there are three nonisomorphic abelian groups of
order p3 (independent of p). The groups corresponding to these partitions
are a cyclic group of order p3, the direct product of a cyclic group of order p2
by one of order p, and the direct product of three cyclic groups of order p,
respectively.
Sec. 11 Conjugacy and Sylow's Theorem (Optional) 101
For n = 4 we see the partitions are 4 = 4, 4 = 3 + 1, 4 = 2 + 2, 4 =
2 + 1 + 1,4 = 1 + 1 + 1 + 1, which are five in number. Thus there are five
nonisomorphic groups of order p
4
• Can you describe them via the partitions
of 4?
Given an abelian group of order n = p ~ l p ~ 2 ... p%\ where the Pi are
distinct primes and the ai are all positive, then G is the direct product of its
socalled Pi Sylow subgroups (see, e.g., the Corollary to Lemma 2.10.1). For
each prime Pi there are as many groups of order pfi as there are partitions of
ai. So the number of nonisomorphic abelian groups of order n = p ~ l ... p%k
is f(al)f(a2) .. · f(ak), where fern) denotes the number of partitions of m.
Thus we know how many nonisomorphic finite abelian groups there are for
any given order.
For instance, how many nonisomorphic abelian groups are there of
order 144? Since 144 = 2
4
3
2
, and there are five partitions of 4, two partitions
of 2, there are 10 nonisomorphic abelian groups of order 144.
The material treated in this section has been hard, the path somewhat
tortuous, and the effort to understand quite intense. To spare the reader too
much further agony, we assign only three problems to this section.
PROBLEMS
1. Let A be a normal subgroup of a group G, and suppose that bEG is an
element of prime order p, and that b f£ A. Show that A n (b) = (e).
2. Let G be an abelian group of order pn, p a prime, and let a E G have max
imal order. Show that xo(a) = e for all x E G.
3. Let G be a finite group, with N <J G and a E G. Prove that:
(a) The order of aN in GIN divides the order of a in G, that is,
o(aN) lo(a).
(b) If (a) n N = (e), then o(aN) = o(a).
11. CONJUGACY AND SYLOW'S THEOREM (OPTIONAL)
In discussing equivalence relations in Section 4 we mentioned, as an example
of such a relation in a group G, the notion of conjugacy. Recall that the ele
ment b in G is said to be conjugate to a E G (or merely, a conjugate of a) if
there exists an x E G such that b = xlax. We showed in Section 4 that this
defines an equivalence relation on G. The equivalence class of a, which we
denote by cl(a), is called the conjugacy class of a.
102 Groups Ch.2
For a finite group an immediate question presents itself: How large is
cl(a)? Of course, this depends strongly on the element a. For instance, if
a E Z(G), the center of G, then ax == xa for all x E G, hence xlax = a; in
other words, the conjugacy class of a in this case consists merely of the ele
ment a itself. On the other hand, if cl(a) consists only of the element a, then
xlax = a for all x E G. This gives us that xa = ax for all x E G, hence
a E Z(G). So Z(G) is characterized as the set of those elements a in G
whose conjugacy class has only one element, a itself.
For an abelian group G, since G == Z(G), two elements are conjugate if
and only if they are equal. So conjugacy is not an interesting relation for
abelian groups; however, for nonabelian groups it is a highly interesting no
tion.
Given a E G, cl(a) consists of all xlax as x runs over G. So to deter
mine which are the distinct conjugates of a, we need to know when two con
jugates of a coincide, which is the same as asking: When is x lax = yIay? In
this case, transposing, we obtain a(xyl) = (xyl)a; in other words, xyl
must commute with a. This brings us to a concept introduced as Example 10
in Section 3, that of the centralizer of a in G. We repeat something we did
there.
Definition. If a E G, then C(a), the centralizer of a in G, is defined
by C(a) == {x E G I xa == ax}.
When C(a) arose in Section 3 we showed that it was a subgroup of G.
We record this now more officially as
Lemma 2.11.1. For a E G, C(a) is a subgroup of G.
As we saw above, the two conjugates x lax and y1ay of a are equal
only if xyl E C(a), that is, only if x and yare in the same right coset of C(a)
in G. On the other hand, if x and yare in the same right coset of C(a) in G,
then xyl E C(a), hence xyla == axyl. This yields that xlax = y1ay. So x
and y give rise to the same conjugate of a if and only if x and yare in the
same right coset of C(a) in G. Thus there are as many conjugates of a in G as
there are right cosets of C(a) in G. This is most interesting when G is a finite
group, for in that case the number of right cosets of C(a) in G is what we
called the index, ic(C(a)), of C(a) in G, and is equal to IGII IC(a)l.
We have proved
Theorem 2.11.2. Let G be a finite group and a E G; then the number
of distinct conjugates of a in G equals the index of C(a) in G.
Sec. 11 Conjugacy and Sylow's Theorem (Optional) 103
In other words, the number of elements in cl(a) equals iG(C(a» =
IGI/IC(a)l·
This theorem, although it was relatively easy to prove, is very impor
tant and has many consequences. We shall see a few of these here.
One such consequence is a kind of bookkeeping result. Since conjugacy
is an equivalence relation on G, G is the union of the disjoint conjugacy
classes. Moreover, by Theorem 2.11.2, we know how many elements there
are in each class. Putting all this information together, we get
Theorem 2.11.3 (The Class Equation). If G is a finite group, then
IGI = ic(C(a» =
where the sum runs over one a from each conjugacy class.
It is almost a sacred tradition among mathematicians to give, as the first
application of the class equation, a particular theorem about groups of order
pn, where p is a prime. Not wanting to be accused of heresy, we follow this
tradition and prove the pretty and important
Theorem 2.11.4. If G is a group of order pn, where p is a prime, then
Z(G), the center of G, is not trivial (i.e., there exists an element a =1= e in G
such that ax = xa for all x E G).
Proof We shall exploit the class equation to carry out the proof. Let
z = IZ(G)I; as we pointed out previously, z is then the number of elements
in G whose conjugacy class has only one element. Since e E Z (G), z 1.
For any element b outside Z(G), its conjugacy class contains more than one
element and IC(b)1 < IGI. Moreover, since IC(b)1 divides IGI by Lagrange's
theorem, IC(b)1 = pn(b), where 1 ::::; neb) < n. We divide the pieces of the
class equation into two parts: that coming from the center, and the rest. We
get, this way,
pn = IGI = Z + L lQL = Z + L = Z + L
IC(b)1 n(b)<nP n(b)<n
Clearly, p divides the lefthand side, pn, and divides Ln(b)<npnn(b). The net
result of this is that p Iz, and since z 1, we have that z is at least p. So since
z = IZ(G)I, there must be an element a =1= e in Z(G), which proves the theo
rem.D
This last theorem has an interesting application, which some readers
may have seen in solving Problem 45 of Section 5. This is
104 Groups Ch.2
Theorem 2.11.5. If G is a group of order p2, where p is a prime, then
G is abelian.
Proof By Theorem 2.11.4, Z(G) =1= (e), so that there is an element, a,
of order p in Z(G). If A = (a), the subgroup generated by a, then A C Z(G),
hence A C C(x) for all x E G. Given x E G, x $. A, then C(x) :J A and
x E C(x); so IC(x)1 > p, yet IC(x)1 must divide p2. The net result of this is
that IC(x)1 = p2, so C(x) = G, whence x E Z(G}. Since every element of G
is in the center of G, G must be abelian. D
In the problems to come we shall give many applications of the nature
of groups of order pn, where p is a prime. The l1atural attack on virtually all
these problems follows the lines of the argumellt we are about to give. We
choose one of a wide possible set of choices to illustrate this technique.
Theorem 2.11.6. If G is a group of order pn, p a prime, then G con
tains a normal subgroup of order pnl.
Proof We proceed by induction on n. If n = 1, then G is of order p
and (e) is the required normal subgroup of order pll = po = 1.
Suppose that we know that for some k every group of order pk has a
normal subgroup of order pkl. Let G be of order pk+l; by Theorem 2.11.4
there exists an element a of order p in Z(G), the center of G. Thus the sub
group A = (a) generated by a is of order p and is normal in G. Consider r =
G/A; r is a group of order IGI/IAI = pk+l/p = pk by Theorem 2.6.3. Since r
has order pk, we know that r has a normal subgroup M of order pkl. Since
r is a homomorphic image of G, by the Correspondence Theorem (Theorem
2.7.2) there is a normal subgroup N in G, N :J A, such that N/A = M. But
then we have
pkl = IMI = IN/AI = l ~ i '
that is, pkl = INI/p, leading us to INI = pk. Thus N is our required normal
subgroup in G of order pk. This completes the induction and so proves the
theorem. D
By far the most important application we make of the class equation is
the proof of a farreaching theorem due to Sylow, a Norwegian mathemati
cian, who proved it in 1871. We already showed this theorem to be true for
abelian groups. We shall now prove it for any finite group. It is impossible to
overstate the importance of Sylow's Theorem in the study of finite groups.
Without it the subject would not get off the ground.
Sec. 11 Conjugacy and Sylow's Theorem (Optional) 105
Theorem 2.11.7 (Sylow's Theorem). Suppose that G is a group of
order pnm, where p is a prime and p t m. Then G has a subgroup of order pn.
Proof If n = 0, this is trivial. We therefore assume that n 2:: 1. Here,
again, we proceed by induction on IGI, assuming the result to be true for all
groups H such that IHI < IGI.
Suppose that the result is false for G. Then, by our induction hypothe
sis, pn cannot divide IHI for any subgroup H of Gif H =1= G. In particular, if
a $. Z(G), then C(a) =1= G, hence pn t IC(a)l. Thus p divides IGI/IC(a)1 =
ic(C(a» for a $. Z(G).
Write down the class equation for G following the lines of the argu
ment in Theorem 2.11.4. If z = IZ(G)I, then z 2:: 1 and
But P I ic(C(a» if a $. Z(G), so p I LaEtZ(C) ic(C(a». Since p Ipnm, we get
p I z. By Cauchy's Theorem there is an element a of order p in Z(G). If A is
the subgroup generated by a, then IAI = p and A<l G, since a E Z(G). Con
sider f = GIA; If1 = IGI/IAI = pnmlp = pnlm. Since If1 < IGI, by our in
duction hypothesis r has a subgroup M of order pnl. However, by the Cor
respondence Theorem there is a subgroup P of G such that P :J A and
PIA = M. Therefore, Ipi = IMIIAI = pnlp = pn and P is the soughtafter
subgroup of G of order pn, contradicting our assumption that G had no such
subgroup. This completes the induction, and Sylow's Theorem is established. D
Actually, Sylow's Theorem consists of three parts, of which we only
proved the first. The other two are (assuming pnm = IG I, where p tm):
1. Any two subgroups of order pn in G are conjugate; that is, if Ipi
IQI = pn for subgroups P, Qof G, then for some x E G, Q = x1px.
2. The number of subgroups of order pn in G is of the form 1 + kp and di
vides IGI.
Since these subgroups of order pn pop up all over the place, they are
called pSylow subgroups of G. An abelian group has one pSylow subgroup
for every prime p dividing its order. This is far from true in the general case.
For instance, if G = S3' the symmetric group of degree 3, which has order 6
= 2 . 3, there are three 2Sylow subgroups (of order 2) and one 3Sylow sub
group (or order 3).
For those who want to see several proofs of that part of Sylow's Theo
rem which we proved above, and of the other two parts, they might look at
the appropriate section of our book Topics in Algebra.
106 Groups
PROBLEMS
Ch.2
Easier Problems
1. In S3' the symmetric group of degree 3, find all the conjugacy classes, and
check the validity of the class equation by determining the orders of the
centralizers of the elements of S3 .
2. Do Problem 1 for G the dihedral group of order 8.
3. If a E G, show that C(x1ax) = x1C(a)x.
4. If cp is an automorphism of G, show that C(cp(a» = cp(C(a» for a E G.
5. If IGI = p3 and IZ(G)I ;::: p2, prove that G is abelian.
6. If P is a pSylow subgroup of G and P <l G, prove that P is the only
pSylow subgroup of G.
7. If P <l G, PapSylow subgroup of G, prove that cp(P) = P for every
automorphism cp of G.
8. Use the class equation to give a proof of Cauchy's Theorem.
If H is a subgroup of G, let N(H) = {x E G Ix1Hx = H}. This does
not mean that xa = ax whenever x E N(H), a E H. For instance, if
H <l G, then N(H) = G, yet H need not be in the center of G.
9. Prove that N(H) is a subgroup of G, H C N(H) and in fact H <l N(H).
10. Prove that N(x1Hx) = x1N(H)x.
11. If P is a pSylow subgroup of G, prove that P is a pSylow subgroup of
N(P) and is the only pSylow subgroup of N(P).
12. If P is a pSylow subgroup and a EGis of order pm for some m, show
that if a1pa = P then a E P.
13. Prove that if G is a finite group and H is a subgroup of G, then the num
ber of distinct subgroups x1Hx of G equals iG(N(H».
14. If P is a pSylow subgroup of G, show that the number of distinct XI Px
cannot be a multiple of p.
15. If N <l G, let B(N) = {x E G I xa = ax for all a EN}. Prove that
B(N) <l G.
MiddleLevel Problems
16. Show that a group of order 36 has a normal subgroup of order 3 or 9.
(Hint: See Problem 40 of Section 5.)
17. Show that a group of order 108 has a normal subgroup of order 9 or 27.
18. If Pis apSylow subgroup of G, show that N(N(P» = N(P).
19. If IGI = pn, show that G has a subgroup of order pm for alII::::; m ::::; n.
Sec. 11 Conjugacy and Sylow's Theorem (Optional) 107
20. Ifpm divides IGI, show that Ghas a subgroup of order pm.
21. If IGI = pn and H =1= G is a subgroup of G, show that N(H) ~ H.
22. Show that any subgroup of order pnl in a group G of order pn is normal
in G.
Harder Problems
23. Let G be a group, H a subgroup of G. Define for a, bEG, a  b if b =
h1ah for some h E H. Prove that
(a) this defines an equivalence relation on G.
(b) If [a] is the equivalence class of a, show that if G is a finite group,
then [a] has m elements where m is the index of H n C(a) in H.
24. If G is a group, H a subgroup of G, define a relation B  A for sub
groups A, B of G by the condition that B = h1Ah for some h E H.
(a) Prove that this defines an equivalence relation on the set of sub
groups of G.
(b) If G is finite, show that the number of distinct subgroups equivalent
to A equals the index of N(A) n H in H.
25. If P is a pSylow subgroup of G, let S be the set of all pSylow subgroups
of G. For Qb Q2 E S define Ql  Q2 if Q2 = a1Qla with a E P. Prove,
using this relation, that if Q =1= P, then the number of distinct a1Qa, with
a E P, is a multiple of p.
26. Using the result of Problem 25, show that the number of pSylow sub
groups of G is of the form 1 + kp. (This is the third part of Sylow's Theo
rem.)
27. Let P be a pSylow subgroup of G, and Q another one. Suppose that
Q =1= x1px for any x E G. Let S be the set of all ylQy, as y runs over G.
For Qb Q2 E S define Ql  Q2 if Q2 = a1Qla, where a E P.
(a) Show that this implies that the number of distinct ylQy is a multiple
ofp.
(b) Using the result of Problem 14, show that the result of Part (a) can
not hold.
(c) Prove from this that given any two pSylow subgroups P and Qof G,
then Q = x1px for some x E G.
(This is the second part of Sylow's Theorem.)
28. If H is a subgroup of G of order pm show that H is contained in some
pSylow subgroup of G.
29. If P is a pSylow subgroup of G and a, b E Z(P) are conjugate in G,
prove that they are already conjugate in N(P).
3
THE SYMMETRIC GROUP
1. PRELIMINARIES
Let us recall a theorem proved in Chapter 2 for abstract groups. This result,
known as Cayley's Theorem (Theorem 2.5.1), asserts that any group G is iso
morphic to a subgroup of A (S), the set of 11 mappings of the set S onto it
self, for some suitable S. In fact, in the proof we gave we used for S the
group G itself viewed merely as a set.
Historically, groups arose this way first, long before the notion of an
abstract group was defined. We find in the work of Lagrange, Abel, Galois,
and others, results on groups of permutations proved in the late eighteenth
and early nineteenth centuries. Yet it was not until the midnineteenth cen
tury that Cayley more or less introduced the abstract concept of a group.
Since the structure of isomorphic groups is the same, Cayley's Theorem
points out a certain universal character for the groups A (S). If we knew the
structure of all subgroups of A(S) for any set S, we would know the structure
of all groups. This is much too much to expect. Nevertheless, one could try to
exploit this embedding of an arbitrary group G isomorphically into some
A (S). This has the advantage of transforming G as an abstract system into
something more concrete, namely a set of nice mappings of some set onto
itself.
We shall not be concerned with the subgroups of A (S) for an arbitrary
set S. If S is infinite, A (S) is a very wild and complicated object. Even if S is
finite, the complete nature of A (S) is virtually impossible to determine.
108
Sec. 1 Preliminaries 109
In this chapter we consider only A (S) for S a finite set. Recall that if S
has n elements, then we call A (S) the symmetric group of degree n, and de
note it by Sn The elements of Sn are called permutations; we shall denote
tI:tem by lowercase Greek letters.
Since we multiplied two elements u, r E A(S) by the rule (ur)(s) =
u(r e s » ~ this will have the effect that when we introduce the appropri
ate symbols to represent the elements of Sn, these symbols, or permutations,
will multiply from right to left. If the readers look at some other book on al
gebra, they should make sure which way the permutations are being multi
plied: right to left or left to right. Very often, algebraists multiply permuta
tions from left to right. To be consistent with our definition of the
composition of elements in Sn, we do it from right to left.
By Cayley's Theorem we know that if G is a finite group of order n,
then G is isomorphic to a subgroup of Sn and Sn has n! elements. Speaking
loosely, we usually say that G is a subgroup of Sn' Since n is so much smaller
than n! for n even modestly large, our group occupies only a tiny little corner
in Sn. It would be desirable to embed G in an Sn for n as small as possible. For
certain classes of finite groups this is achievable in a particularly nice way.
Let S be a finite set having n elements; we might as well suppose that
S = {x}, X2, ... , x
n
}. Given the permutation u E Sn = A(S), then
U(Xk) E S for k = 1, 2, ... , n, so U(Xk) = Xi
k
for some i
k
, 1 ~ i
k
~ n. Because
U is 11, if j =1= k, then Xi) = u(Xj) =1= U(Xk) = Xi
k
• Therefore, the numbers
iI, i
2
, ... , in are merely the numbers 1,2, ... , n shuffled about in some order.
Clearly, the action of U on S is determined by what U does to the sub
script j of Xj' so the symbol "x" is really excess baggage and, as such, can be
discarded. In short, we may assume that S = {I, 2, ... , n}.
Let's recall what is meant by the product of two elements of A (S). If
u, r E A(S), then we defined ur by (ur)(s) = u(r(s» for every s E S. We
showed in Section 4 of Chapter 1 that A(S) satisfied four properties that we
used later as the model to define the notion of an abstract group. Thus Sn, in
particular, is a group relative to the product of mappings.
Our first need is some handy way of denoting a permutation, that is, an
element u in Sn. One clear way is to make a table of what u does to each ele
ment of S. This might be called the graph of u. We did this earlier, writing
out u, say u E S3' in the fashion: u: Xl ~ X2, X2 ~ X3' X3 ~ Xl' But this is
cumbersome and space consuming. We certainly can make it more compact
by dropping the x's and writing u = ( ~ ; ~ ) . In this symbol the number in
the second row is the image under (J' of the number in the first row directly
above it. There is nothing holy about 3 in all this; it works equally well for
any n.
110 The Symmetric Group Ch.3
If 0' E Sn and 0'(1) = i}, 0'(2) = i
2
, ••• , u(n) = in, we use the symbol
(
1 ~ ... ~ ) to represent 0' and we write 0' = ( ~ ~ ~ ) o Note
it l2 ••• In It l2 In
that it is not necessary to write the first row in the usual order 1 2 ... n;
any way we write the first row, as long as we carry the i/s along accord
ingly, we still have u. For instance, in the example in S3 cited,
U = ( ~ ~ ~ ) = ( ~ ~ ~ ) = ( ~ ~ ~ ) .
If we know 0' = ( ~ ~ ~ ) , what is u
1
? It is easy, just flip the
It l2 ••• In
symbol for u over to get u
1
= (i ~ ~ ) . (Prove!) In our example
u = G~ ~ } u
1
= (i ~ ~ ) = Gi ~ ) . The identity element
which we shall write as eis merely e = ( ~ ; ::: : ).
How does the product in Sn translate in terms of these symbols? Since
UT means: "First apply T and to the result of this apply 0'," in forming the
product of the symbols for 0' and T we look at the number k in the first row
of T and see what number i
k
is directly below k in the second row of T. We
then look at the spot i
k
in the first row of 0' and see what is directly below it
in the second row of u. This is the image of k under UTe We then run through
k = 1,2, ... , n and get the symbol for UTe We just do this visually.
We illustrate this with two permutations
u = ( ~ ~ i ~ ~ ) and T= ( ~ ~ ; 1;)
in S5' Then UT = G; ~ ~ ;).
Even the economy achieved this way is not enough. After all, the first
row is always 1 2 ... n, so we could dispense with it, and write u =
(
1 2 ... n) (" 0 0) Tho "fi b 0 h "
o 0 "as ll, l2, ••• , In' IS IS ne, ut In t e next sectIon we
It l2 ••• In
shall find a better and briefer way of representing permutations.
PROBLEMS
1. Find the products:
(
1 2 3 4 5 63)(21 2 3 4 5 6)
(a) 6 4 5 2 1 3 4 5 6 1"
Sec. 2 Cycle Decomposition 111
(b) G
2 3 4
5)C 2
3 4
;) .
1 3 4 5 3 2 1 4
(c) ( ~
2 3 4
5r
1
C2 3 4 5) C2 3 4 5)
1 3252134541325·
2. Evaluate all the powers of each permutation a (i.e., find a
k
for all k).
(a) G
2 3 4 5
~ } 3 4 5 6
(b) G
2 3 4 5
67)
1 3 4 6
5 7·
(c) ( ~
2 3 4 5
~ } 4 5 2 1
3. Prove that ( ~
2
Zr
1
= (1
i
2
~ n ) .
II
i
2
2
4. Find the order of each element in Problem 2.
5. Find the order of the products you obtained in Problem 1.
2. CYCLE DECOMPOSITION
We continue the process of simplifying the notation used to represent a given
permutation. In doing so, we get something more than just a new symbol; we
get a device to decompose any permutation as a product of particularly nice
permutations.
Definition. Let i
b
i
2
, ••• , i
k
be k distinct integers in S = {I, 2, ... , n}.
The symbol (i
l
i
2
••• i
k
) will represent the permutation a E Sn, where
a(i
l
) = i
2
, a(i
2
) = i
3
, ••• , a(i
j
) = i
j
+
l
for j < k, u(i
k
) = i
b
and
a(s) = s for any s E S if s is different from i
l
, i
2
, ••• , i
k
•
Thus, in S7, the permutation (1 3 5 4) is the permutation
(
1 2 3 4 5 6 7)
3 2 5 1 4 6 7· We call a permutation of the form (i
l
i
2
••• i
k
)
a kcycle. For the special case k = 2, the permutation (i
l
i
2
) is called a trans
position. Note that if a = (i
l
i
2
••• i
k
), then a is also (i
k
i
l
i
2
••• i
k

l
),
(i
k

1
i
k
i
1
i
2
••• i
k

2
), and so on. (Prove!) For example,
(1 3 5 4) = (4 1 3 5) = (5 4 1 3) = (3 5 4 1).
Two cycles, say a kcycle and an mcycle, are said to be disjoint cycles if
112 The Symmetric Group Ch.3
they have no integer in common. Whence (1 3 5) and (4 2 6 7) in S7
are disjoint cycles.
Given two disjoint cycles in Sn, we claim that they commute. We leave
the proof of this to the reader, with the suggestion that if u, T are disjoint
cycles, the reader should verify that (uT)(i) = (Tu)(i) for every i E S =
{I, 2, ... , n }. We state this result as
Lemma 3.2.1. If u, T E Sn are disjoint cycles, then UT = TU.
Let's consider a particular kcycle U = (1 2 ... k) in Sn. Clearly,
u(l) = 2 by the definition given above; how is 3 related to I? Since u(2) = 3,
we have al(l) = u(2) = 3. Continuing, we see that u
j
(l) = j + 1 for
j ~ k  1, while Uk (1) = 1. In fact, we see that Uk = e, where e is the identity
element in Sn.
There are two things to be concluded from the paragraph above.
1. The order of a kcycle, as an element of Sn, is k. (Prove!)
2. If U = (i
l
i
2
••• i
k
) is a kcycle, then the orbit of i
l
under U (see
Problem 27 in Section 4 of Chapter 1) is {i
l
, i
2
, ••• , i
k
}. So we can see
that the kcycle U = (i 1 i
2
••• i
k
) is
U = (i
l
u(i
l
) u
2
(i
l
) ... Ukl(i
l
».
Given any permutation Tin Sn, for i E {I, 2, ... , n}, consider the orbit
of i under T. We have that this orbit is {i, T(i), T
2
(i) , ... , T
s

l
(i)}, where
TS(i) = i and s is the smallest positive integer with this property. Consider the
scycle (i T(i) T
2
(i) ... TSl(i»; we call it the cycle of Tdetermined by i.
We take a specific example and find all its cycles. Let
(
1 2 3 4 5 6 7 8 9)
T = 3 941 5 627 8;
what is the cycle of T determined by I? We claim that it is (1 3 4). Why?
T takes 1 into 3, 3 into 4 and 4 into 1, and since T(l) = 3, T
2
(1) = T(3) = 4,
T
3
(1) = T(4) = 1. We can get this visually by weaving through
(
~ 5 6 7 8 9)
394 1 562 7 8
with the thin path. What is the cycle of T determined by 2? Weaving
through
(
1 2 ~ )
394 1 5 627 8
Sec. 2 Cycle Decomposition 113
with the thin path, we see that the cycle of T determined by 2 is
(2 9 8 7). The cycles of T determined by 5 and 6 are (5) and (6), respec
tively, since 5 and 6 are left fixed by T. SO the cycles of Tare (1 3 4),
(2 9 8 7), (5), and (6). Therefore we have that T = (1 3 4)(2 9 8 7)
(5)(6), where we view these cyclesas defined aboveas permutations in Sg
because every integer in S = {I, 2, ... , 9} appears in one and only one cycle,
and the image of any i under T is read off from the cycle in which it appears.
There is nothing special about the permutation T above that made the
argument we gave go through. The same argument would hold for any per
mutation in Sn for any n. We leave the formal writing down of the proof to
the reader.
Theorem 3.2.2. Every permutation in Sn is the product of disjoint
cycles.
In writing a permutation u as a product of disjoint cycles, we omit all
Icycles; that is, we ignore the i's such that u(i) = i. Thus we write
u = (1 2 3)(4 5)(6)(7) simply as u = (1 2 3)(4 5). In other words, writing
u as a product of kcycles, with k > 1, we assume that u leaves fixed any in
teger not present in any of the cycles. Thus in the group S 11 the permuta
tion T = (1 5 6)(2 3 9 8 7) leaves fixed 4, 10, and 11.
Lemma 3.2.3. If T in Sn is a kcycle, then the order of T is k; that is,
T
k
= e and T
j
=I= e for 0 < j < k.
Consider the permutation T = (1 2)(3 4 5 6)(7 8 9) in Sg.
What is its order? Since the disjoint cycles (1 2), (3 4 5 6), (7 8 9)
commute, T
m
= (1 2)m(3 4 5 6)m(7 8 9)m; in order that T
m
= e we
need (1 2)m = e, (3 4 5 6)m = e, (7 8 9)m = e. (Prove!) To have
(7 8 9)m = e, we must have 3 Im, since (7 8 9) is of order 3; to have
(3 4 5 6)m = e, we must have 4 1 m, because (3 4 5 6) is of order 4,
and to have (1 2)m = e, we must have 21 m, because (1 2) is of order 2.
This tells us that m must be divisible by 12.
On the other hand,
So T is of order 12.
Here, again, the special properties of T do not enter the picture. What
we did for T works for any permutation. To formulate this properly, recall
that the least common multiple of m and n is the smallest positive integer v
which is divisible by m and by n. (See Problem 7, Chapter 1, Section 5.) Then
we have
114 The Symmetric Group Ch.3
Theorem 3.2.4. Let U E Sn have its cycle decomposition into disjoint
cycles of length mb m2, .•. , mk' Then the order of U is the least common
multiple of ml, m2, ••• , mk'
Proof Let U = Tl T2 .•• Tk, where the Ti are disjoint cycles of length mi'
Since the Ti are disjoint cycles, Ti Tj = Tj Ti; therefore if M is the least common
multiple of ml, m2,"" mk' then u
M
= (TIT2 .•. Tk)M = T ~ T ~ ... Tk
M
= e
(since Tt;t= e because Ti is of order mi and mi IM). Therefore, the order of
u is at most M. On the other hand, if UN = e, then TfT!j •• • TZ = e. This
forces each Tf = e, (prove!) because Ti are disjoint permutations, so m
i
IN,
since Ti is of order mi' Thus N is divisible by the least common multiple of
mb m2' ... , mk' so MIN. Consequently, we see that u is of order M as
claimed in the theorem. D
Note that the disjointness of the cycles in the theorem is imperative.
For instance, (1 2) and (1 3), which are not disjoint, are each of order 2,
but their product (1 2)(1 3) = (1 3 2) is of order 3.
Let's consider Theorem 3.2.4 in the context of a card shuffle. Suppose
that we shuffle a deck of 13 cards in such a way that the top card is put into
the position of the 3rd card, the second in that of the 4th, ... , the ith into the
i + 2 position, working mod 13. As a permutation, u, of 1, 2, ... , 13, the
shuffle becomes
(
1 2 3 4 5 6 7 8 9 10 11 12 13)
u = 3 4 5 6 7 8 9 10 11 12 13 1 2'
and u is merely the 13cycle (1 3 5 7 9 11 13 2 4 6 8 10 12),
so u is of order 13. How many times must we repeat this shuffle to get the
cards back to their original order? The answer is merely the order of u, that
is, 13. So it takes 13 repeats of the shuffle to get the cards back to their origi
nalorder.
Let's give a twist to the shuffle above. Suppose that we shuffle the
cards as follows. First take the top card and put it into the secondtoIast
place, and then follow it by the shuffle given above. How many repeats are
now needed to get the cards back to their original order? The first operation
is the shuffle given by the permutation T = (1 12 11 10 9 8 7 6 5
4 3 2) followed by u above. So we must compute UT and find its order.
But
UT = (1 3 5 7 9 11 13 2 4 6 8 10 12)
X (1 12 11 10 9 8 7 6 5 4 3 2)
= (1)(2 3 4 5 6 7 8 9 10 11 12 13),
Sec. 2 Cycle Decomposition 115
so is of order 12. So it would take 12 repeats of the shuffle to get back to the
original order.
Can you find a shuffle of the 13 cards that would require 42 repeats? Or
20 repeats? What shuffle would require the greatest number of repeats, and
what would this number be?
We return to the general discussion. Consider the permutation
(1 2 3); we see that (1 2 3) = (1 3)(1 2). We can also see that
(1 2 3) = (2 3)(1 3). So two things are evident. First, we can write
(1 2 3) as the product of two transpositions, and in at least two distinct
ways. Given the kcycle (i
1
i
2
... i
k
), then (i
1
i
2
... i
k
)
(i
1
ik)(i
1
i
k

1
)··· (i
1
i
2
), so every kcycle is a product of k  1 transposi
tions (if k > 1) and this can be done in several ways, so not in a unique way.
Because every permutation is the product of disjoint cycles and every cycle is
a product of transpositions we have
Theorem 3.2.5. Every permutation in Sn is the product of transposi
tions.
This theorem is really not surprising for it says, after all, nothing more
or less than that any permutation can be effected by carrying out a series of
interchanges of two objects at a time.
We saw that there is a lack of uniqueness in representing a given per
mutation as a product of transpositions. But, as we shall see in Section 3,
some aspects of this decomposition are indeed unique.
As a final word of this section, we would like to point out the conve
nience of cycle notation. When we represent elements of a permutation
group as products of disjoint cycles, many things become transparentfor
example, the order of the permutation is visible at a glance. To illustrate this
point, we now give a few examples of certain geometric groups, which are in
fact permutation groups that have already appeared in Chapter 2 under dif
ferent guises.
Examples
1. Informally, a motion of a geometric figure is a permutation of its vertices
that can be realized by a rigid motion in space. For example, there are eight
motions of a square, whose vertices are numbered 1, 2, 3, 4 as below:
4 3
2
116 The Symmetric Group Ch.3
a = (13) is the reflection about the axis of symmetry joining vertices 2 and
4 in the original position,
~ = (1234) is the counterclockwise rotation by 90°,
~ 2 = (13)(24) is the counterclockwise rotation by 180°,
~ 3 = (1432) is the counterclockwise rotation by 270°,
a ~ =(12)(34) is the reflection in the vertical axis of symmetry,
a ~ 2 = (24) is the reflection in the other diagonal axis,
a ~ 3 = (14)(23) is the reflection in the horizontal axis, and, of course
a
2
= ~ 4 = (1) is the "motion" that leaves the vertices unchanged.
We also have the relation ~ a = a ~ 3 .
These motions, or symmetries of a square, form a subgroup of S4 which is
called the oetie group, or the dihedral group of order 8. This group (or,
strictly speaking, a group isomorphic to it) was introduced in Example 9 of
Section 2.1 without mention of permutations.
2. There are only four symmetries of a nonsquare rectangle:
4
1
 u _ i _ 1
3
2
the reflections in the two axes of symmetry, rotation by 180° and the identity.
These motions can be identified with permutations (1), (14)(23), (12)(34),
(13)(24), and form a subgroup of the group obtained in Example 1. This sub
group is often called Klein's 4group.
3. We leave it to the reader to verify that the group of all motions of an
equilateral triangle is the full symmetric group S3.
3
2
4. The motions of a regular hexagon form the dihedral group of order 12,
generated by the permutations a = (15)(24), corresponding to a reflection
Sec. 2 Cycle Decomposition 117
about one of the axes of symmetry, and J3 = (123456), corresponding to the
counterclockwise rotation by 60°.
5 4
6 3
2
In general, the dihedral group of order 2n, which was first introduced in Ex
ample 10 of Section 2.1, can be interpreted as the group of symmetries of a
regular ngon (a polygon with n edges of equal length).
PROBLEMS
Easier Problems
2 3 4 5 6 7)
3 1 567 4'
3. Express as the product of disjoint cycles and find the order.
(a) (1 2 3 5 7)(2 4 7 6).
(b) (1 2)(1 3)(1 4).
(c) (1 2 3 4 5)(1 2 3 4 6)(1 2 3 4 7).
(d) (1 2 3)(1 3 2).
(e) (1 2 3)(3 5 7 9)(1 2 3)1.
(f) (1 2 3 4 5)3.
4. Give a complete proof of Theorem 3.2.2.
5. Show that a kcycle has order k.
6. Find a shuffle of a deck of 13 cards that requires 42 repeats to return the
cards to their original order.
1. Show that if u, T are two disjoint cycles, then UT = TU.
2. Find the cycle decomposition and order.
(
1 2 3 4 5 6 7 8 9)
(a) 3 1 4 2 7 6 9 8 5'
(
1 2 3 4 5 6 7)
(b) 7 6 5 4 3 2 l'
(c) G~ ; ~ ~ ~ i) G
118 The Symmetric Group Ch.3
7. Do Problem 6 for a shuffle requiring 20 repeats.
8. Express the permutations in Problem 3 as the product of transpositions.
9. Given the two transpositions (1 2) and (1 3), find a permutation 0
such that u(1 2)u
1
= (1 3).
10. Prove that there is no permutation U such that u(1 2)u
1
= (1 2 3).
11. Prove that there is a permutation u such that u(1 2 3)u
1
(4 5 6).
12. Prove that there is no permutation u such that u(1 2 3)u
1
(1 2 4)(5 6 7).
MiddleLevel Problems
13. Prove that (1 2) cannot be written as the product of disjoint 3
cycles.
14. Prove that for any permutation u, UTU
I
is a transposition if T is a trans
position.
15. Show that if T is a kcycle, then UTU
I
is also a kcycle, for any permuta
tion u.
16. Let <I> be an automorphism of S3' Show that there is an element U E S3
such that <1>( T) = UITU for every T E S3.
17. Let (1 2) and (1 2 3 ... n) be in Sn' Show that any subgroup of Sn
that contains both of these must be all of Sn (so these two permutations
generate Sn)'
18. If TI and T2 are two transpositions, show that TIT2 can be expressed as the
product of 3cycles (not necessarily disjoint).
19. Prove that if TI' T2, and T3 are transpositions, then TI T2T3 =1= e, the identity
element of Sn.
20. If T}, T2 are distinct transpositions, show that TI T2 is of order 2 or 3.
21. If u, T are two permutations that disturb no common element and UT = e,
prove that U = T = e.
22. Find an algorithm for finding UTU
I
for any permutations u, Tof Sn.
23. Let u, T be two permutations such that they both have decompositions
into disjoint cycles of cycles of lengths ml, m2, ... , mk' (We say that
they have similar decompositions into disjoint cycles.) Prove that for
some permutation p, T = pUpI.
24. Find the conjugacy class in Sn of (1 2 ... n). What is the order of the
centralizer of (1 2 ... n) in Sn?
25. Do Problem 24 for U = (1 2)(3 4).
Sec. 3
3. ODD AND EVEN PERMUTATIONS
Odd and Even Permutations 119
We noticed in Section 2 that although every permutation is the product of trans
positions, this decomposition is not unique. We did comment, however, that cer
tain aspects of this kind of decomposition are unique. We go into this now.
Let's consider the special case of S3' for here we can see everything ex
plicitly. Let f(Xb X2' X3) = (Xl  X2)(XI  X3)(X2  X3) be an expression in
the three variables Xb X2, X3. We let S3 act on f(x) = f(Xb X2, X3) as follows.
If 0' E S3' then
We consider what 0'* does to f(x) for a few of the o"S in S3.
Consider 0' = (1 2). Then 0'(1) = 2,0'(2) = 1, and 0'(3) = 3, so that
O'*(f(x» = (Xu(l)  Xu(2» (Xu(l)  Xu(3»(Xu(2)  Xu(3»
= (x
2
 Xl) (X
2
 X
3
) (Xl  X
3
)
 (Xl  X
2
) (Xl  X
3
) (X2  X
3
)
 f(x).
So u* coming from 0' = (1 2) changes the sign of f(x). Let's look at the ac
tion of another element, ,., = (1 2 3), of S3 on f(x). Then
T*(f(x» = (XT(I)  X
T
(2» (XT(I)  X
T
(3»(X7{2)  X7{3»
= (x
2
 X
3
)(X
2
 X
I
)(X
3
 Xl)
= (Xl  X
2
) (Xl  X3) (X
2
 X
3
)
= f(x),
so T* coming from,., = (1 2 3) leaves f(x) unchanged. What about the
other permutations in S3; how do they affectf(x)? Of course, the identity el
ement e induces a map e* on f(x) which does not change f(x) at all. What
does ,.,2, for,., above, do to f(x)? Since ,.,*f(x) = f(x), we immediately see that
(,.,2) * (f (x» = (X
T
2(1)  X
T
2(2» (X
T
2
(1)  X
T
2
(3» (X
T
2(2)  X
T
2(3»
=f(x). (Prove!)
Now consider U'T = (1 2)(1 2 3) = (2 3); since ,., leaves f(x) alone and 0'
changes the sign off(x), U7'must change the sign off(x). Similarly, (1 3) changes
the sign off(x). We have accounted for the action of every element of S3 onf(x).
120 The Symmetric Group Ch.3
Suppose that p E S3 is a product p = T(T2··· Tk of transpositions
Tb ... , Tk; then p acting on f(x) will change the sign of f(x) k times, since
each Ti changes the sign off(x). So p*(f(x» = (1)kf(x). If p = UI U2··· U
t
,
where Ub ... , U
t
are transpositions, by the same reasoning, p*(f(x» =
(1)tf(x). Therefore, (1)kf(x) = (1)tf(x), whence (1)t = (1)k. This
tells us that t and k have the same parity; that is, if t is odd, then k must be
odd, and if t is even, then k must be even.
This suggests that although the decomposition of a given permutation U
as a product of transposition is not unique, the parity of the number of trans
positions in such a decomposition of U might be unique.
We strive for this goal now, suggesting to readers that they carry out
the argument that we do for arbitrary n for the special case n = 4.
As we did above, define f(x) = f(x}, ... , x
n
) to be
where in this product i takes on all values from 1 to n  1 inclusive, and j all
those from 2 to n inclusive. If U E Sn, define u* on f(x) by
u*(f (x» = f1 (xu(i)  xU(j»·
1<]
If u, T E Sn' then
= u* (T* (11 (Xi  X
j
»)) = U*(T*(f(X))) = (U*T*)(f(X»
SO (UT)* = U*T* when applied to f(x).
What does a transposition T do to f(x)? We claim that T*(f(x» =
f(x). To prove this, assuming that T = (i j) where i < j, we count up the
number of (xu  xv), with u < v, which get transformed into an (x
a
 Xb) with
a > b. This happens for (xu  Xj) if i < u < j, for (Xi  Xv) if i < v < j, and fi
nally, for (Xi  Xj). Each of these leads to a change of sign on f(x) and since
there are 2(j  i  1) + 1 such, that is, an odd number of them, we get an
odd number of changes of sign on f(x) when acted on by T*. Thus
T*(f(x» = f(x). Therefore, our claim that T*(f(x» = f(x) for every
transposition T is substantiated.
Sec. 3 Odd and Even Permutations 121
If 0' is any permutation in Sn and 0' = T1 T2 ... Tb where Tb T2' ... , Tk
are transpositions, then 0'* = (T1T2 ... Tk)* = T i T ~ ... Tk as acting on f(x),
and since each T ~ ( f ( x » = f(x), we see that O'*(f(x» = (l)kf(x). Simi
larly, if 0' = '1'2··· 't, where 'b '2' ... ' 't are transpositions, then
O'*(f(x» = (l)tf(x). Comparing these two evaluations of O'*(f(x», we
conclude that (l)k = (l)t. So these two decompositions of O'as the prod
uct of transpositions are of the same parity. Thus any permutation is either
the product of an odd number of transpositions or the product of an even
number of transpositions, and no product of an even number of transpositions
can equal a product of an odd number of transpositions.
This suggests the following
Definition. The permutation 0' E Sn is an odd permutation if 0' is the
product of an odd number of transpositions, and is an even permutation if 0'
is the product of an even number of transpositions.
What we have proved above is
Theorem 3.3.1. A permutation in Sn is either an odd or an even per
mutation, but cannot be both.
With Theorem 3.3.1 behind us we can deduce a number of its conse
quences.
Let An be the set of all even permutations; if 0', T E An, then we imme
diately have that O'T E An. Since An is thus a finite closed subset of the (fi
nite) group Sn, An is a subgroup of Sn, by Lemma 2.3.2. An is called the alter
nating group of degree n.
We can show that An is a subgroup of Sn in another way. We already
saw that An is closed under the product of Sn, so to know that An is a sub
group of Sn we merely need show that 0' E Sn implies that 0'1 E Sn. For any
permutation 0' we claim that 0' and 0'1 are of the same parity. Why? Well, if
0' = Tt'T2· .. Tk' where the Ti are transpositions, then
since Til = T
i
. Therefore, we see that the parity of 0' and 0'1 is (l)k, so
they are of equal parity. This certainly shows that 0' E An forces 0"1 E An,
whence An is a subgroup of Sn.
But it shows a little more, namely that An is a normal subgroup of Sn.
For suppose that 0" E An and p E Sn. What is the parity of p
1
0"p? By the
122 The Symmetric Group Ch.3
above, p and pl are of the same parity and IT is an even permutation so p1lTp
is an even permutation, hence is in An. Thus An is a normal subgroup of Sn.
We summarize what we have done in
Theorem 3.3.2. An' the alternating group of degree n, is a normal
subgroup of Sn.
We look at this in yet another way. From the very definitions involved
we have the following simple rules for the product of permutations:
1. The product of two even permutations is even.
2. The product of two odd permutations is even.
3. The product of an even permutation by an odd one (or of an odd one
by an even one) is odd.
If IT is an even permutation, let () (IT) = 1, and if IT is an odd permuta
tion, let ()(IT) = 1. The foregoing rules about products translate into
()(lTr) = ()(IT)()(r), so () is a homomorphism of Sn onto the group E = {I, I}
of order 2 under multiplication. What is the kernel, N, of ()? By the very defi
nition of An we see that N = An. So by the First Homomorphism Theorem,
E ~ Sn/An. Thus 2 = lEI = ISn/Anl = ISnl/IAnl, if n > 1. This gives us that
IAnl = ! ISnl = !n!.
Therefore,
Theorem 3.3.3. For n > 1, An is a normal subgroup of Sn of order !n!.
Corollary. For n > 1, Sn contains !n! even permutations and !n! odd
permutations.
A final few words about the proof of Theorem 3.3.1 before we close
this section. Many different proofs of Theorem 3.3.1 are known. Quite
frankly, we do not particularly like any of them. Some involve what might be
called a "collection process," where one tries to show that e cannot be writ
ten as the product of an odd number of transpositions by assuming that it is
such a shortest product, and by the appropriate finagling with this product,
shortening it to get a contradiction. Other proofs use other devices. The
proof we gave exploits the gimmick of the function [(x), which, in some
sense, is extraneous to the whole affair. However, the proof given is probably
the most transparent of them all, which is why we used it.
Finally, the group An' for n 2:: 5, is an extremely interesting group. We
Sec. 3 Odd and Even Permutations 123
shall show in Chapter 6 that the only normal subgroups of An' for n 2:: 5, are
(e) and An itself. A group with this property is called a simple group (not to
be confused with an easy group). The abelian finite simple groups are merely
the groups of prime order. The An for n 2:: 5 provide us with an infinite fam
ily of nonabelian finite simple groups. There are other infinite families of finite
simple groups. In the last 20 years or so the heroic efforts of algebraists have
determined all finite simple groups. The determination of these simple groups
runs about 10,000 printed pages. Interestingly enough, any nonabelian finite
simple group must have even order.
PROBLEMS
Easier Problems
1. Find the parity of each permutation.
(a) (1 2 3 4 5 6 7 8 9)
245 1 3 7 8 96·
(b) (1 2 3 4 5 6)(7 8 9).
(c) (1 2 3 4 5 6)(1 2 3 4 5 7).
(d) (1 2)(1 2 3)(4 5)(5 6 8)(1 7 9).
2. If (]" is a kcycle, show that (]" is an odd permutation if k is even, and is an
even permutation if k is odd.
3. Prove that (]" and T
1
(]"T, for any u, T E Sn, are of the same parity.
4. If m < n, we can consider Sm C Sn by viewing u E Sn as acting on
1, 2, ... , m, ... , n as it did on 1, 2, ... , m and u leaves j > m fixed.
Prove that the parity of a permutation in Sm, when viewed this way as an
element of Sn, does not change.
5. Suppose you are told that the permutation
(
1 2 3 4 5 6 7 8 9)
312 7 896
in Sg, where the images of 5 and 4 have been lost, is an even permuta
tion. What must the images of 5 and 4 be?
MiddleLevel Problems
6. If n 2:: 3, show that every element in An is a product of 3cycles.
7. Show that every element in An is a product of ncycles.
8. Find a normal subgroup in A
4
of order 4.
124 The Symmetric Group Ch.3
Harder Problems (In fact, very hard)
9. If n 2:: 5 and (e) =1= N C An is a normal subgroup of An' show that N must
contain a 3cycle.
10. Using the result of Problem 9, show that if n 2:: 5, the only normal sub
groups of An are (e) and An itself. (Thus the groups An for n 2:: 5 give us
an infinite family of nonabelian finite simple groups.)
4
RING THEORY
1. DEFINITIONS AND EXAMPLES
So far in our study of abstract algebra, we have been introduced to one kind
of abstract system, which plays a central role in the algebra of today. That
was the notion of a group. Because a group is an algebraic system with only
one operation, and because a group need not satisfy the rule ab = ba, it ran
somewhat counter to our prior experience in algebra. We were used to sys
tems where you could both add and multiply elements and where the ele
ments did satisfy the commutative law of multiplication ab = ba. Further
more, these systems of our acquaintance usually came from sets of
numbersintegers, rational, real, and for some, complex.
The next algebraic object we shall consider is a ring. In many ways
this system will be more reminiscent of what we had previously known
than were groups. For one thing rings will be endowed with addition and
multiplication, and these will be subjected to many of the familiar rules
we all know from arithmetic. On the other hand, rings need not come
from our usuall1umber systems, and, in fact, usually have little to do with
these familiar ones. Although many of the formal rules of arithmetic
hold, many strangeor what may seem as strangephenomena do take
place. As we proceed and see examples of rings, we shall see some of
these things occur.
With this preamble over we are ready to begin. Naturally enough, the
first thing we should do is to define that which we'll be talking about.
125
126 Ring Theory Ch.4
Definition. A nonempty set R is said to be a ring if in R there are two
operations + and· such that:
(a) a, b E R implies that a + b E R.
(b) a + b = b + a for a, b E R.
(c) (a + b) + c = a + (b + c) for a, b, c E R.
(d) There exists an element 0 E R such that a + 0 = a for every a E R.
(e) Given a E R, there exists abE R such that a + b = O. (We shall write
b as a.)
Note that so far all we have said is that R is an abelian group under +. We
now spell out the rules for the multiplication in R.
(f) a, b E R implies that a . b E R.
(g) a· (b . c) = (a· b) . c for a, b, c E R.
This is all that we insist on as far as the multiplication by itself is concerned.
But the + and· are not allowed to live in solitary splendor. We interweave
them by the two distributive laws
(h) a·(b+c)=a·b+a·c and
(b + c) . a = b . a + c . a, for a, b, c E R.
These axioms for a ring look familiar. They should be, for the concept
of ring was introduced as a generalization of what happens in the integers.
Because of Axiom (g), the associative law of multiplication, the rings we de
fined are usually called associative rings. Nonassociative rings do exist, and
some of these play an important role in mathematics. But they shall not be
our concern here. So whenever we use the word "ring" we shall always mean
"associative ring."
Although Axioms (a) to (h) are familiar, there are certain things they
do not say. We look at some of the familiar rules that are not insisted upon
for a general ring.
First, we do not postulate the existence of an element 1 E R such that
a . 1 = 1 . a = a for every a E R. Many of the examples we shall encounter
will have such an element, and in that case we say that R is a ring with unit.
In all fairness we should point out that many algebraists do demand that a
ring have a unit element. We do insist that 1 =1= 0; that is, the ring consisting
of 0 alone is not a ring with unit.
Sec. 1 Definitions and Examples 127
Second, in our previous experience with things of this sort, whenever
a . b = 0 we concluded that a = 0 or b = O. This need not be true, in general,
in a ring. When it does hold, the ring is kind of nice and is given a special
name; it is called a domain.
Third, nothing is said in the axioms for a ring that will imply the com
mutative law of multiplication a . b = b . a. There are noncommutative rings
where this law does not hold; we shall see some soon. Our main concern in
this chapter will be with commutative rings, but for many of the early results
the commutativity of the ring studied will not be assumed.
As we mentioned above, some things make certain rings nicer than oth
ers, and so become worthy of having a special name. We quickly give a list of
definitions for some of these nicer rings.
Definition. A commutative ring R is an integral domain if a . b = 0 in
R implies that a = 0 or b = O.
It should be pointed out that some algebra books insist that an integral
domain contain a unit element. In reading another book, the reader should
check if this is the case there. The integers, 7L, give us an obvious example of
an integral domain. We shall see other, somewhat less obvious ones.
Definition. A ring R with unit is said to be a division ring if for every
a * 0 in R there is an element b E R (usually written as aI) such that
a . aI = aI. a = 1.
The reason for calling such a ring a division ring is quite clear, for we
can divide (at least keeping left and right sides in mind). Although noncom
mutative division rings exist with fair frequency and do play an important
role in noncommutative algebra, they are fairly complicated and we shall
give only one example of these. This division ring is the great classic one in
troduced by Hamilton in 1843 and is known as the ring of quaternions. (See
Example 13 below.)
Finally, we come to perhaps the nicest example of a class of rings, the field.
Definition. A ring R is said to be a field if R is a commutative division
ring.
In other words, a field is a commutative ring in which we can divide
freely by nonzero elements. Otherwise put, R is a field if the nonzero ele
ments of R form an abelian group under· , the product in R.
128 Ring Theory Ch.4
For fields we do have some ready examples: the rational numbers, the
real numbers, the complex numbers. But we shall see many more, perhaps
less familiar, examples. Chapter 5 will be devoted to the study of fields.
We spend the rest of the time in this section looking at some examples
of rings. We shall drop the· for the product and shall write a · b simply as abo
Examples
1. It is obvious which ring we should pick as our first example, namely 7L, the
ring of integers under the usual addition and multiplication of integers. Natu
rally enough, 7L is an example of an integral domain.
2. The second example is equally obvious as a choice. Let Q be the set of all
rational numbers. As we all know, Q satisfies all the rules needed for a field,
so Q is a field.
3. The real numbers, IR, also give us an example of a field.
4. The complex numbers, C, form a field.
Note that Q eIRe C; we describe this by saying that Q is a subfield of
IR (and of C) and IR is a subfield of C.
5. Let R = 7L
6
, the integers mod 6, with the addition and the multiplication
defined by [a] + [b] = [a + b] and [a] [b] = [ab].
Note that [0] is the 0 required by our axioms for a ring, and [1] is the unit
element of R. Note, however, that 7L
6
is not an integral domain, for
[2][3] = [6] = [0], yet [2] =1= [0] and [3] =1= [0]. R is a commutative ring with unit.
This example suggests the
Definition. An element a =1= 0 in a ring R is a zerodivisor in R if ab = 0
for some b =1= 0 in R.
We should really call what we defined a left zerodivisor; however,
since we shall mainly talk about commutative rings, we shall not need any
leftright distinction for zerodivisors.
Note that both [2] and [3] in 7L
6
are zerodivisors. An integral domain
is, of course, a commutative ring without zerodivisors.
6. Let R = 7L
s
, the ring of integers mod 5. R is, of course, a commutative ring
with unit. But it is more; in fact, it is a field. Its nonzero elements are [1], [2],
[3], [4] and we note that [2][3] = [6] = [1], and [1] and [4] are their own in
verses. So every nonzero element in 7L
s
has an inverse in 7L
s
.
We generalize this to any prime p.
Sec. 1 Definitions and Examples 129
7. Let 7L
p
be the integers mod p, where p is a prime. Again 7L
p
is clearly a
commutative ring with 1. We claim that 7L
p
is a field. To see this, note that if
[a] =1= [0], then p (a. Therefore, by Fermat's Theorem (Corollary to Theorem
2.4.8), a
P

1
== l(p). For the classes [.] this says that [a
P

1
] = [1]. But [a
P

1
] =
[alPI, so [a]pl = [1]; therefore, [a]p2 is the required inverse for [a] in 7L
p
,
hence 7Lp is a field.
Because 7L
p
has only a finite number of elements, it is called a finite
field. Later we shall construct finite fields different from the 7L
p
's.
8. Let Q be the rational numbers; if a E Q, we can write a = mIn, where m
and n are relatively prime integers. Call this the reduced form for a. Let R be
the set of all a E Q in whose reduced form the denominator is odd. Under
the usual addition and multiplication in Q the set R forms a ring. It is an inte
gral domain with unit but is not a field, for ~ , the needed inverse of 2, is not
in R. Exactly which elements in R do have their inverses in R?
9. Let R be the set of all a E Q in whose reduced form the denominator is
not divisible by a fixed prime p. As in (8), R is a ring under the usual addi
tion and multiplication in Q, is an integral domain but is not a field. What
elements of R have their inverses in R?
Both Examples 8 and 9 are subrings of Q in the following sense.
Definition. If R is a ring, then a subring of R is a subset S of R which
is a ring if the operations ab and a + b are just the operations of R applied to
the elements a, b E S.
For S to be a subring, it is necessary and sufficient that S be nonempty
and that ab, a ± b E S for all a, b E S. (Prove!)
We give one further commutative example. This one comes from the
calculus.
10. Let R be the set of all realvalued continuous functions on the closed unit
interval [0, 1]. For f, g E R and x E [0,1] define (f + g)(x) = f(x) + g(x),
and (f· g)(x) = f (x)g (x). From the results in the calculus, f + g and f· g are
again continuous functions on [0, 1]. With these operations R is a commuta
tive ring. It is not an integral domain. For instance, if f(x) = x + ~ for
o::::; x ::::; ~ and f(x) = 0 for ~ < x ::::; 1, and if g(x) = 0 for 0 ::::; x ::::; ~ and
g(x) = 2x  1 for ~ < x::::; 1, thenf, g E R and, as is easy to verifY,f' g = O.
It does have a unit element, namely the function e defined by e(x) = 1 for all
x E [0, 1]. What elements of R have their inverses in R?
130 Ring Theory Ch.4
We should now like to see some noncommutative examples. These are
not so easy to come by, although noncommutative rings exist in abundance,
because we are not assuming any knowledge of linear algebra on the reader's
part. The easiest and most natural first source of such examples is the set of
matrices over a field. So, in our first noncommutative example, we shall
really create the 2 X 2 matrices with real entries.
11. Let F be the field of real numbers and let R be the set of all formal square
arrays
where a, b, c, d are any real numbers. For such square arrays we define addi
tion in a natural way by defining
It is easy to see that R forms an abelian group under this + with ( ~ ~ ) act
(
a b) (a b)
ing as the zero element and _ C _ d the negative of cd. To make of
R a ring, we need a multiplication. We define one in what may seem a highly
unnatural way via
(
a b)(r s) = (ar + bt as + bU).
c d t U cr + dt cs + du
It may be a little laborious, but one can check that with these operations R is a
noncommutative ring with ( ~ ~ ) acting as its multiplicative unit element.
Note that
( ~ ~ ) ( ~ ~ ) = ( ~ ~ )
while
so
Sec. 1 Definitions and Examples 131
Note that ( ~ ~ ) and ( ~ ~ ) are zerodivisors; in fact,
( ~ ~ r = ( ~ ~ )
so
is a nonzero element whose square is the 0 element of R. This R is known as
the ring of all 2 X 2 matrices over F, the real field.
For those unfamiliar with these matrices, and who see no sense in the
product defined for them, let's look at how we do compute the product. To
get the top left entry in the product AB, we "multiply" the first row of A by
the first column of B, where A, B E R. For the top right entry, it is the first
row of A versus the second column of B. The bottom left entry comes from
the second row of A versus the first column of B, and finally, the bottom
right entry is the second column of A versus the second column of B.
We illustrate with an example: Let
A = ( 1 ~ )
3 2
and
Then the first row of A is 1, ~ and the first column of B is ~ , 71"; we "multiply"
these via 1 . ~ + ~ . 71" = 71"/2 + ~ , and so on. So we see that
AB = ( ~ + 71"/2
1 + 271"
~  71"/2)
 ~  271" .
In the problems we shall have many matrix multiplications, so that the
reader can acquire some familiarity with this strange but important example.
12. Let R be any ring and let
s = {(: ~ ) I a, b, C, dE R}
with + and· as defined in Example 11. One can verify that S is a ring, also,
under these operations. It is called the ring of2 X 2 matrices over R.
Our final example is one of the great classical examples, the real qua
ternions, introduced by Hamilton (as a noncommutative parallel to the com
plex numbers).
13. The quaternions. Let F be the field of real numbers and consider the set
of all formal symbols ao + ali + a2j + a3k , where ao, ab a2' a3 E F.
Equality and addition of these symbols are easy, via the obvious route
132 Ring Theory Ch.4
ao + ali + a2j + a3
k
= f30 + f31
i
+ f32j + f33
k
if and only if ao = f3o, a 1 = f3l' a2 = f32 and a 3 = f3 3 , and
(ao + ali + a2j + a3
k
) + (f30 + f31
i
+ f32j + f33
k
)
= (a
o
+ f30) + (a
1
+ f31) i + (a2 + f32) j + (a3 + f33)k.
We now come to the tricky part, the multiplication. When Hamilton discov
ered it on October 6, 1843, he cut the basic rules of this product out with his
penknife on Brougham Bridge in Dublin. The product is based on i
2
= j2 =
k
2
= 1, ij = k, jk = i, ki = j and ji =  k, kj =  i, ik =  j. If we go around
the circle clockwise
the product of any two successive ones is the next one, and going around
counterclockwise we get the negatives.
We can write out the product now of any two quaternions, according to
the rules above, declaring by definition that
(ao + ali + a2j + a3
k
)(f30 + f31
i
+ f32j + f33
k
)
= 1'0 + 1'1i + 1'2j + 1'3
k
,
where
1'1 = aof31 + a1 f30 + a2f33  a3f32
1'2 = aof32  a1f33 + a2f30 + a3f31
1'3 = aof33 + a1f32  a2f31 + a3f30
(I)
It looks horrendous, doesn't it? But it's not as bad as all that. We are
multiplying out formally using the distributive laws and using the product
rules for the i, j, k above.
If some ai is 0 in x = ao + ali + a2j + a3k, we shall omit it in express
ing x; thus 0 + Oi + OJ + Ok will be written simply as 0,1 + Oi + OJ + Ok as 1,
o+ 3i + 4j + Ok as 3i + 4j, and so on.
A calculation reveals that
(II)
= a6 + ai + a ~ + a ~ .
Sec. 1 Definitions and Examples 133
This has a very important consequence; for suppose that x = ao + ali +
a2j + a3 k =1= 0 (so some ai =1= 0). Then, since the a's are real, f3 = + ai +
+ =1= o. Then from (II) we easily get
(
. . k)(a
o
al· a2· a
3
k
) 1
ao + all + a2] + a3 Ii  li
l
 Ii]  Ii =.
So, if x =1= 0, then x has an inverse in the quaternions. Thus the quaternions
form a noncommutative division ring.
Although, as we mentioned earlier, there is no lack of noncommutative
division rings, the quaternions above (or some piece of them) are often the
only noncommutative division rings that even many professional mathemati
cians have eVer seen.
We shall have many problemssome easy and some quite a bit
harderabout the two examples: the 2 X 2 matrices and the quaternions.
This way the reader will be able to acquire some skill with playing with
noncommutative rings.
One final comment in this section: If 1'0, 1'1, 1'2, 1'3 are as in (I), then
+ ai + + + f3i + +
= + I'i + +
(III)
This is known as Lagrange's Identity; it expresses the product of two sums of
four squares again as a sum of four squares. Its verification will be one of the
exercises.
PROBLEMS
Easier Problems
*1. Find all the elements in E
24
that are invertible (i.e., have a multiplicative
inverse) in E
24
•
2. Show that any field is an integral domain.
3. Show that En is a field if and only if n is a prime.
4. Verify that Example 8 is a ring. Find all its invertible elements.
5. Do Problem 4 for Example 9.
6. In Example 11, the 2 X 2 matrices over the reals, check the associative
law of multiplication.
134 Ring Theory Ch.4
7. Work out the following:
(
1 2 ) ( 1 ~ )
(a) 4 7 0 1·
(
1 1)2
(b) 1 1 .
(
1 1)3
(c) g g.
(d) (: ~ ) ( ~ ~ )  ( ~ ~ ) ( : ~ ) .
8. Find all matrices (: ~ ) such that (: ~ ) ( ~ ~ ) = ( ~ ~ ) ( : ~ ) 
9. Find all 2 X 2 matrices (: ~ ) that commute with all 2 X 2 matrices.
10. Let R be any ring with unit, S the ring of 2 X 2 matrices over R. (See Ex
ample 12.)
(a) Check the associative law of multiplication in S. (Remember: R need
not be commutative.)
(b) Show that {(~ :) I a, b, C, E R} is a subring of S.
(c) Show that ( ~ :) has an inverse in S if and only if a and C have
inverses in R. In that case write down ( ~ ~ r
l
explicitly.
11. Let F: C ~ C be defined by F(a + bi) = a  bi. Show that:
(a) F(xy) = F(x)F(y) for x, y E C.
(b) F(xx) = Ix1
2
•
(c) Using Parts (a) and (b), show that
(a
2
+ b
2
)(e
2
+ d
2
) = (ae  bd)2 + (ad + be)2.
[Note: F(x) is merely x.]
12. Verify the identity in Part (e) of Problem 11 directly.
13. Find the following products of quaternions .
(a) (i + j)(i  j).
(b) (1  i + 2j  2k)(1 + 2i  4j + 6k).
(c) (2i  3j + 4k)2.
(d) i(ao + ali + a2j + a3k )  (aD + ali + a2j + a3k )i.
Sec. 1 Definitions and Examples 135
14. Show that the only quaternions commuting with i are of the form a + f3i.
15. Find the quaternions that commute with both i and j.
16. Verify that
(ao + ali + a
2
j + a3k )(aO  ali  a2j  a3k)
== a ~ + ai + a ~ + a ~ .
17. Verify Lagrange's Identity by a direct calculation.
MiddleLevel Problems
18. In the quaternions, define
Show that Ixy I == Ix II y Ifor any two quaternions x and y.
19. Show that there is an infinite number of solutions to x
2
== 1 in the
quaternions.
20. In the quaternions, consider the following set G having eight elements:
G
{+l +. +. +k}
  ,l, j,  .
(a) Prove that G is a group (under multiplication).
(b) List all subgroups of G.
(c) What is the center of G?
(d) Show that G is a nonabelian group all of whose subgroups are
normal.
21. Show that a division ring is a domain.
22. Give an example, in the quaternions, of a noncommutative domain that
is not a division ring.
23. Define the map * in the quaternions by
Show that:
(a) x** == (x*)* == x.
(b) (x + y)* == x* + y*.
(c) xx* == x*x is real and nonnegative.
(d) (xy)* == y*x*.
[Note the reversal of order in Part (d).]
24. Using *, define Ixl == Vxx*. Show that Ixyl == Ixllyl for any two qua
ternions x and y, by using Parts (c) and (d) of Problem 23.
136 Ring Theory Ch.4
25. Use the result of Problem 24 to prove Lagrange's Identity.
In Problems 26 to 30, let R be the 2 X 2 matrices over the reals.
26. If ( ~ ~ ) E R, show that ( ~ ~ ) is invertible in R if and only if ad  be * O.
In that case find ( ~ ~ ) 1.
27. Define det(: ~ ) = ad  be. For x, y E R show that det(xy)
(det x)(det y).
28. Show that {x E R Idet x =1= o} forms a group, G, under matrix multiplica
tion and that N = {x E R Idet x = I} is a normal subgroup of G.
29. If x E R is a zerodivisor, show that det x = 0, and, conversely, if x =1= °is
such that det x = 0, then x is a zerodivisor in R.
30. In R, show that {( _ ~ : ) I a, b real} is a field.
Harder Problems
31. Let R be the ring of all 2 X 2 matrices over 7L
p
, p a prime. Show that if
det(: ~ ) = ad  be * 0, then ( ~ ~ ) is invertible in R.
32. Let R be as in Problem 31. Show that for x, y E R, det(xy) = det(x) det(y).
33. Let G be the set of elements x in the ring R of Problem 31 such that
det(x) =1= 0.
(a) Prove that G is a group.
(b) Find the order of G. (Quite hard)
(c) Find the center of G.
(d) Find a pSylow subgroup of G.
34. Let T be the group of matrices A with entries in the field 7L
2
such that det A
is not equal to 0. Prove that T is isomorphic to S3' the symmetric group
of degree 3.
35. For R as in Example 10, show that S = {f E R Ifis differentiable on (0, I)}
is a subring of R which is not an integral domain.
If F is a field, let H(F) be the ring of quaternions over F, that is, the
set of all aa + ali + a2j + a3k, where aa, ab a2, a3 E F and where equal
ity, addition, and multiplication are defined as for the real quaternions.
Sec. 2 Some Simple Results 137
36. If F = C, the complex numbers, show that H(C) is not a division ring.
37. In H(C), find an element x =1= 0 such that x
2
= O.
38. Show that H(F) is a division ring if and only if a6 + ai + a ~ + a ~ = 0
for ab a2, a3' a4 in F forces ao = al = a2 = a3 = O.
39. If Q is the field of rational numbers, show that H(Q) is a division ring.
40. Prove that a finite domain is a division ring.
41. Use Problem 40 to show that 7L
p
is a field if p is a prime.
2. SOME SIMPLE RESULTS
Now that we have seen some examples of rings and have had some experi
ence playing around with them, it would seem wise to develop some compu
tational rules. These will allow us to avoid annoying trivialities that could
beset a calculation we might be making.
The results we shall prove in this section are not very surprising, not
too interesting, and certainly not at all exciting. Neither was learning the al
phabet, but it was something we had to do before going on to bigger and bet
ter things. The same holds for the results we are about to prove.
Since a ring R is at least an abelian group under +, there are
certain things we know from our group theory background, for instance,
(a) = a, (a + b) = (a) + (b); if a + b = a + c, then b = c, and
so on.
We begin with
Lemma 4.2.1. Let R be any ring and let a, b E R. Then
(a) aO = Oa = o.
(b) a(b) = (a)b = (ab).
(c) (a)( b) = abo
(d) If 1 E R, then (l)a = a.
Proof We do these in turn.
(a) Since 0 = 0 + 0, aO = a(O + 0) = aO + aO, hence aO = O. We have
used the left distributive law in this proof. The right distributive law gives
Oa = O.
(b) ab + a( b) = a(b + (b» = aO = 0 from Part (a). Therefore,
a(b) = (ab). Similarly, (a)b = (ab).
(c) By Part (b), (a)(b) = «a)b) = ((ab» = ab, since we are
in an abelian group.
138 Ring Theory Ch.4
(d) If 1 E R, then (l)a + a == (l)a + (l)a == (1 + l)a == Oa == O. So
( l)a ==  a by the definition of  a. D
Another computational result.
Lemma 4.2.2. In any ring R, (a + b)2 == a
2
+ b
2
+ ab + ba for a, b E R.
Proof This is clearly the analog of (a + f3)2 == Q? + 2af3 + f32 in the in
tegers, say, but keeping in mind that R may be noncommutative. So, to it. By
the right distributive law (a + b)2 == (a + b)(a + b) == (a + b)a + (a + b)b ==
a
2
+ ba + ab + b
2
, exactly what was claimed. D
Can you see the noncommutative version of the binomial theorem? Try
it for (a + b)3.
One curiosity follows from the two distributive laws when R has a unit
element. The commutative law of addition follows from the rest.
Lemma 4.2.3. If R is a system with 1 satisfying all the axioms of a
ring, except possibly a + b == b + a for a, b E R, then R is a ring.
Proof We must show that a + b == b + a for a, b E R. By the right dis
tributive law (a + b)(l + 1) == (a + b)l + (a + b)l == a + b + a + b. On the
other hand, by the left distributive law (a + b)(l + 1) == a(l + 1) + b(l + 1)
== a + a + b + b. But then a + b + a + b == a + a + b + b; since we are in a
group under +, we can cancel a on the left and b on the right to obtain b + a
== a + b, as required. R is therefore a ring. D
We close this brief section with a result that is a little nicer. We say that
a ring R is a Boolean ring [after the English mathematician George Boole
(18151864)] if x
2
== x for every x E R.
We prove a nice result on Boolean rings.
Lemma 4.2.4. A Boolean ring is commutative.
Proof Let x, y E R, a Boolean ring. Thus x
2
== x, y2 == y, (x + y)2 ==
X + y. But (x + y)2 == x
2
+ xy + yx + y2 == X + xy + yx + y, by Lemma 4.2.2,
so we have (x + y) == (x + y)2 == X + xy + yx + y, from which we have
xy + yx == O. Thus 0 == x(xy + yx) == x
2
y + xyx == xy + xyx, while 0 ==
(xy + yx)x == xyx + yx
2
== xyx + yx. This gives us xy + xyx == xyx + yx, and
so xy == yx. Therefore, R is commutative. D
Sec. 3 Ideals, Homomorphisms, and Quotient Rings
PROBLEMS
139
1. Let R be a ring: since R is an abelian group under +, na has meaning for
us for n E 7L, a E R. Show that (na)(mb) = (nm)(ab) if n, m are integers
and a, b E R.
2. If R is an integral domain and ab = ac for a =1= 0, b, c E R, show that b = c.
3. If R is a finite integral domain, show that R is a field.
4. If R is a ring and e E R is such that e
2
= e, show that (xe  exe)2 =
(ex  exe)2 = 0 for every x E R.
5. Let R be a ring in which x
3
= x for every x E R. Prove that R is commu
tative.
6. If a
2
= 0 in R, show that ax + xa commutes with a.
7. Let R be a ring in which x
4
= x for every x E R. Prove that R is commu
tative.
8. If F is a finite field, show that:
(a) There exists a prime p such that pa = 0 for all a E F.
(b) If F has q elements, then q = pn for some integer n. (Hint: Cauchy's
Theorem)
9. Let p be an odd prime and let 1 + ~ + ... + 1/(p  1) = alb, where a, b
are integers. Show thatp Ia. (Hint: As a runs through Up, so does aI.)
10. Ifp is a prime andp > 3, show that if 1 + ~ + ... + 1/(p  1) = alb, where
a, b are integers, then p2 Ia. (Hint: Consider 1Ia
2
as a runs through Up.)
3. IDEALS, HOMOMORPHISMS, AND QUOTIENT RINGS
In studying groups, it turned out that homomorphisms, and their kernels
the normal subgroupsplayed a central role. There is no reason to expect
that the same thing should not be true for rings. As a matter of fact, the
analogs, in the setting of rings, of homomorphism and normal subgroup do
playa key role.
With the background we have acquired about such things in group the
ory, the parallel development for rings should be easy and quick. And it will
be! Without any further fuss we make the
Definition. The mapping 'P : R ~ R' of the ring R into the ring R' is a
homomorphism if
(a) 'P(a + b) = 'P(a) + 'P(b) and
(b) 'P(ab) = 'P(a)'P(b) for all a, b E R.
140 Ring Theory Ch.4
Since a ring has two operations, it is only natural and just that we de
mand that both these operations be preserved under what we would call a
ring homomorphism. Furthermore, Property (a) in the Definition tells us that
q; is a homomorphism of R viewed merely as an abelian group under + into
R' (also viewed as a group under its addition). So we can call on, and expect,
certain results from this fact alone.
Just as we saw in Chapter 2, Section 5 for groups, the image of R under
a homomorphism from R to R', is a subring of R', as defined in Chapter 4,
Section 1 (Prove!).
Let q;: R ~ R' be a ring homomorphism and let Ker q; =
{x E R Iq;(x) = O}, the 0 being that of R'. What properties does Ker q; enjoy?
Clearly, from group theory Ker q; is an additive subgroup of R. But much
more is true. If k E Ker q; and r E R, then q;(k) = 0, so q;(kr) = q;(k)q;(r) =
Oq;(r) = 0, and similarly, q;(rk) = O. So Ker q; swallows up multiplication
from the left and the right by arbitrary ring elements.
This property of Ker q; is now abstracted to define the important analog
in ring theory of the notion of normal subgroup in group theory.
Definition. Let R be a ring. A nonempty subset I of R is called an
ideal of R if:
(a) I is an additive subgroup of R.
(b) Given r E R, a E I, then ra E I and ar E I.
We shall soon see some examples of homomorphisms and ideals. But
first we note that Part (b) in the definition of ideal really has a left and a right
part. We could split it and define a set L of R to be a left ideal of R if L is an
additive subgroup of R and given r E R, a E L, then ra E L. So we require
only left swallowingup for a left ideal. We can similarly define right ideals.
An ideal as we defined it is both a left and a right ideal of R. By all rights we
should then call what we called an ideal a twosided ideal of R. Indeed, in
working in noncommutative ring theory one uses this name; here, by "ideal"
we shall always mean a twosided ideal. Except for some of the problems, we
shall not use the notion of onesided ideals in this chapter.
Before going on, we record what was done above for Ker q; as
Lemma 4.3.1. If q;: R ~ R' is a homomorphism, then Ker q; is an
ideal of R.
We shall soon see that every ideal can be made the kernel of a homo
morphism. Shades of what happens for normal subgroups of groups!
Sec. 3 Ideals, Homomorphisms, and Quotient Rings 141
Finally, let K be an ideal of R. Since K is an additive subgroup of R, the
quotient group R/K exists; it is merely the set of all cosets a + K as a runs
over R. But R is not just a group; it is, after all, a ring. Nor is K merely an ad
ditive subgroup of R; it is more than that, namely an ideal of R. We should
be able to put all this together to make of R/K a ring.
How should we define a product in R/K in a natural way? What do we
want to declare (a + K)(b + K) to be? The reasonable thing is to define
(a + K)(b + K) = ab + K, which we do. As always, the first thing that
comes up is to show that this product is welldefined. Is it? We must show
that if a + K = a' + K and b + K = b' + K, then (a + K)(b + K) = ab + K =
a'b' + K = (a' + K)(b' + K). However, if a + K = a' + K, then a  a' E K,
so (a  a')b E K, since K is an ideal of R (in fact, so far, since K is a right
ideal of R). Because b + K = b' + K, we have b  b' E K, so a'(b  b') E K,
since K is an ideal of R (in fact, since K is a left ideal of R). So both (a  a')b =
ab  a'b and a'(b  b') = a'b  a'b' are in K. Thus (ab  a'b) + (a'b  a'b') =
ab  a'b' E K.
But this tells us (just from group theory) that ab + K = a'b' + K, ex
actly what we needed to have the product welldefined.
So R/K is now endowed with a sum and a product. Furthermore, the
mapping q;: R ~ R/K defined by q;(a) = a + K for a E R is a homomor
phism of R onto R/K with kernel K. (Prove!) This tells us right away that
R/K is a ring, being the homomorphic image of the ring R.
We summarize all this in
Theorem 4.3.2. Let K be an ideal of R. Then the quotient group R/K
as an additive group is a ring under the multiplication (a + K)(b + K) =
ab + K. Furthermore, the map q;: R ~ R/K defined by q;(a) = a + K for a E R
is a homomorphism of R onto R/K having K as its kernel. So R/K is a homo
morphic image of R.
Just from grouptheoretic consideration of R as an additive group, we
have that if q; is a homomorphism of R into R', then it is 11 if and only if
Ker q; = (0). As in groups, we define a homomorphism to be a monomor
phism if it is 11. A monomorphism which is also onto is called an isomor
phism. We define Rand R' to be isomorphic if there is an isomorphism of R
onto R'.
An isomorphism from a ring R onto itself is called an automorphism of
R. For example, suppose R is the field C of complex numbers. Then the map
ping from C to C sending each element of C to its complex conjugate is an
automorphism of C. (Prove!)
142 Ring Theory Ch.4
One would have to be an awful pessimist to expect that the homomor
phism theorems proved in Section 7 of Chapter 2 fail in this setting. In fact
they do hold, with the slightly obvious adaptation needed to make the proofs
go through. We state the homomorphism theorems without any further ado,
leaving the few details needed to complete the proofs to the reader.
Theorem 4.3.3 (First Homomorphism Theorem). Let the mapping
cp: R ~ R
'
be a homomorphism of R onto R' with kernel K. Then R' === RIK;
in fact, the mapping l/J: RIK ~ R' defined by l/J(a + K) = cp (a) defines an
isomorphism of RIK onto R'.
Theorem 4.3.4 (Correspondence Theorem). Let the mapping cp: R ~ R'
be a homomorphism of R onto R' with kernel K. If I' is an ideal of R', let
I = {a E R I cp(a) E I'}. Then I is an ideal of R, I:> K and 11K === I'. This sets
up a 11 correspondence between all the ideals of RI and those ideals of R
that contain K.
Theorem 4.3.5 (Second Homomorphism Theorem). Let A be a sub
ring of a ring R and I an ideal of R. Then A + I = {a + i Ia E A, i E I} is a
subring of R, I is an ideal of A + I, and (A + 1)11 === A I(A n I).
Theorem 4.3.6 (Third Homomorphism Theorem). Let the mapping
cp : R ~ R I be a homomorphism of R onto R I with kernel K. If I' is an ideal
of R' and I = {a E R I cp(a) E I'}, then RII = R'II'. Equivalently, if K is an
ideal of R and I:> K is an ideal of R, then RII = (RIK)/(IIK).
We close this section with an inspection of some of the things we have
discussed in some examples.
Examples
1. As usual we use 7L, the ring of integers, for our first example. Let n > 1 be
a fixed integer and let In be the set of all multiples of n; then In is an ideal of
I. If 7L
n
is the integers mod n, define cp: 7L ~ 7L
n
by cp(a) = [aJ. As is easily
seen, cp is a homomorphism of 7L onto 7L
n
with kernel In. So by Theorem
4.3.3, 7L
n
= 7Lll
n
• (This should come as no surprise, for that is how we origi
nally introduced 7L
n
.)
2. Let F be a field; what can the ideals of F be? Suppose that I =1= (0) is an
ideal of F; let a =1= 0 E I. Then, since I is an ideal of F, 1 = a1a E I; but
now, since 1 E I, rl = rEI for every rEF. In short, I = F. So F has only
the trivial ideals (0) and F itself.
3. Let R be the ring of all rational numbers having odd denominators in their
reduced form. Let I be those elements of R which in reduced form have an
Sec. 3 Ideals, Homomorphisms, and Quotient Rings 143
even numerator; it is easy to see that I is an ideal of R. Define q; : R ~ 7L
2
, the
integers mod 2, by q;(alb) = 0 if a is even (a, b have no common factor) and
q; (alb) = 1 if a is odd. We leave it to the reader to verify that q; is a homo
morphism of R onto 7L
2
with kernel I. Thus 7L
2
::::::: RII. Give the explicit iso
morphism of RII onto 7L
2
•
4. Let R be the ring of all rational numbers whose denominators (when in re
duced form) are not divisible by p, p a fixed prime. Let I be those elements in
R whose numerator is divisible by p; I is an ideal of Rand RII::::::: 7L
p
, the inte
gers modp. (Prove!)
5. Let R be the ring of all realvalued continuous functions on the closed unit
interval where (f + g)(x) = f(x) + g(x) and (fg)(x) = f(x)g(x) for f, g E R,
x E [0, 1]. Let I = {f E R If ( ~ ) = OJ. We claim that I is an ideal of R.
Clearly, it is an additive subgroup. Furthermore if f E I and g E R, then
f ( ~ ) = 0, so ( f g ) ( ~ ) = f ( ~ ) g ( ~ ) = Og ( ~ ) = O. Thus fg E I; since I is commuta
tive, gfis also in I. So I is an ideal of R.
What is RII? Given any fER, then
f(x) = (f(x)  f ( ~ » + f ( ~ ) = g(x) + f ( ~ ) ,
where g(x) = f(x)  f ( ~ ) . Because g ( ~ ) = f ( ~ )  f ( ~ ) = 0, g is in I. So
g + I = I. Thusf + I = ( f ( ~ ) + g) + I = f ( ~ ) + I. Because f ( ~ ) is just a real
number, RII consists of the cosets a + I for a real. We claim that every real
a comes up. For if f ( ~ ) = f3 =1= 0, then
af31f + I = (af3
1
+ I) (f + I) = (af31 + I) ( f ( ~ ) + I)
= (af31 + 1)(/3 + I) = af31f3 + I = a + I.
So RII consists of all a + I, a real. Thus we have shown directly that RII is
isomorphic to the real field.
We now use Theorem 4.3.3 to show that RII ::::::: real field. Let q;: R ~ ~
be defined by q;(f) = f ( ~ ) . Then it is easy to verify that q; is a homomor
phism, q; is onto and Ker q; = {f E R If ( ~ ) = O}; in other words, Ker q; = I.
So RII ::::::: image of q; = IR.
6. Let R be the integral quaternions, in other words, R = {aa + ali + a2j +
a3k Iaa, al, a2, a3 E 7L}. For a fixed prime p, let
The reader should verify that I
p
is an ideal of R and that RlI
p
::::::: H(7L
p
) (see
Problem 36 of Section 1 and the paragraph before it).
144 Ring Theory Ch.4
7. Let R = {(~ ~ ) I a, b E ~ } ; R is a subring of the 2 X 2 matrices over the
reals.
Let I = {( ~ ~ ) I b E ~ } ; I is easily seen to be an additive subgroup of R. Is it
an ideal of R? Consider
(
X Y)(O b) = (0 Xb).
OxOO 00'
so it is in I. Similarly,
(
0 b)(X Y) = (0 bX)
OOOx 00'
so it, too, is in I. So I is an ideal of R. What is R/I? We approach it from two
points of view.
Given ( ~ :) E R ,
then ( ~ :) = ( ~ ~ ) + ( ~ ~ ) so that
( ~ :) + I = ( ( ~ ~ ) + ( ~ ~ ) ) + I = ( ~ ~ ) + L
since ( ~ ~ ) is in 1. Thus all the cosets of I in R look like ( ~ ~ ) + 1. If we map
this onto a, that is, 1\1 ( ( ~ ~ ) + I) = a, we can check that '" is an isomorphism
onto the real field. So R/I :::::: IR.
We see that R/I "'" ~ another way. Define cp: R ~ ~ b
YCP
( ~ ~ ) = a. We
claim that cP is a homomorphism. For, given ( ~ ~ ). ( ~ ~ ) . then
cP ( ~ ~ ) = a, cP ( ~ ~ ) = c,
(
a b) + (c d) = (a + C b+ d)
o a 0 C 0 a+c'
Sec. 3 Ideals, Homomorphisms, and Quotient Rings 145
and = ad be} hence
= a + e = ( :) + (
and : )( = ad be)
= ae =
So is indeed a homomorphism of R onto R What is Ker If ( ) E Ker
(
a b) (a b) . (a b)
then'P 0 a = a, but also 'P 0 a = 0, SInce 0 a E Ker 'P.
Thus a = O. From this we see that I = Ker 'P. So R/I image of 'P = IR by
Theorem 4.3.3.
8. Let R = {( _ : ) I a, b E } and let C be the field of complex numbers.
Define "': R C by '"( _ ) = a + bi. We leave it to the reader to verify that
l/J is an isomorphism of R onto C. So R is isomorphic to the field of complex
numbers.
9. Let R be any commutative ring with 1. If a E R, let (a) = {xa Ix E R}. We
claim that (a) is an ideal of R. To see this, suppose that u, v E (a); thus
u = xa, v = ya for some x, y E R, whence
u ± v = xa ± ya = (x ± y) a E (a).
Also, if u E (a) and r E R, then u = xa, hence ru = r(xa) = (rx)a, so is in (a).
Thus (a) is an ideal of R.
Note that if R is not commutative, then (a) need not be an ideal; but it
is certainly a left ideal of R.
146 Ring Theory
PROBLEMS
Ch.4
Easier Problems
1. If R is a commutative ring and a E R, let L(a) = {x E R Ixa = OJ. Prove
that L(a) is an ideal of R.
2. If R is a commutative ring with 1 and R has no ideals other than (0) and
itself, prove that R is a field. (Hint: Look at Example 9.)
*3. If q; : R ~ R' is a homomorphism of R onto R' and R has a unit element,
1, show that q;(1) is the unit element of R'.
4. If I, I are ideals of R, define I + I by I + I = {i + j liE I, j E I}. Prove
that I + I is an ideal of R.
5. If I is an ideal of R and A is a subring of R, show that I n A is an ideal of A.
6. If I, I are ideals of R, show that I n I is an ideal of R.
7. Give a complete proof of Theorem 4.3.2.
8. Give a complete proof of Theorem 4.3.4.
9. Let q; : R ~ R' be a homomorphism of R onto R' with kernel K. If A' is
a subring of R', let A = {a E R I q;(a) E A '}. Show that:
(a) A is a subring of R, A :> K.
(b) A/K ~ A'.
(c) If A' is a left ideal of R', then A is a left ideal of R.
10. Prove Theorem 4.3.6.
11. In Example 3, give the explicit isomorphism of R/I onto 7L
2
•
12. In Example 4, show that R/I ~ 7L
p
•
13. In Example 6, show that R/I
p
~ H(7L
p
)'
14. In Example 8, verify that the mapping t/J given is an isomorphism of R onto C.
15. If I, I are ideals of R, let II be the set of all sums of elements of the form
ij, where i E I, j E I. Prove that II is an ideal of R.
16. Show that the ring of 2 X 2 matrices over the reals has nontrivial left
ideals (and also nontrivial right ideals).
17. Prove Theorem 4.3.5.
If R, S are rings, define the direct sum of Rand S, R EB S, by
R EB S = {(r, s) IrE R, s E S}
where (r, s) = (rb Sl) if and only if r = rb s = Sb and where
(r, s) + (t, u) = (r + t, s + u), (r, s)(t, u) = (rt, su).
Sec. 3 Ideals, Homomorphisms, and Quotient Rings 147
18. Show that R EB S is a ring and that the subrings {(r, 0) IrE R} and
{CO, s) Is E S} are ideals of R EB S isomorphic to Rand S, respectively.
:) I a,b,creal} and I =
(a) R is a ring.
(b) I is an ideal of R.
(c) R/I FEB F, where F is the field of real numbers.
20. If I, f are ideals of R, let R
1
= RII and R
2
= Rlf. Show that 'P: R R
1
EB R
2
defined by 'P(r) = (r + I, r + f) is a homomorphism of R into R
1
EB R
2
such that Ker 'P = I n f.
21. Let Z15 be the ring of integers mod 15. Show that Z15 Z3 EB Z5.
MiddleLevel Problems
22. Let Z be the ring of integers and m, n two relatively prime integers, 1m
the multiples of m in Z, and In the multiples of n in Z.
(a) What is 1m n In?
(b) Use the result of Problem 20 to show that there is a onetoone
homomorphism from ZII
mn
to ZII
m
EB ZII
n
•
23. If m, n are relatively prime, prove that Zmn Zm EB Zn. (Hint: Use a count
ing argument to show that the homomorphism of Problem 22(b) is onto.)
*24. Use the result of Problem 23 to prove the Chinese Remainder Theorem,
which asserts that if m and n are relatively prime integers and a, b any in
tegers, we can find an integer x such that x == a mod m and x == b mod n
simultaneously.
25. Let R be the ring of 2 X 2 matrices over the real numbers; suppose that I
is an ideal of R. Show that I = (0) or I = R. (Contrast this with the result of
Problem 16.)
Harder Problems
26. Let R be a ring with 1 and let S be the ring of 2 X 2 matrices over R. If I
is an ideal of Sshow that there is an ideal f of R such that I consists of all
the 2 X 2 matrices over f.
27. If Pb P2' ... , Pn are distinct odd primes, show that there are exactly 2
n
solutions of x
2
== x mod(p1 ••• Pn), where 0 ::; x < Pl· .. Pn·
28. Suppose that R is a ring whose only left ideals are (0) and R itself. Prove
that either R is a division ring or R has P elements, p a prime, and ab = 0
for every a, b E R.
148 Ring Theory Ch.4
29. Let R be a ring with 1. An element a E R is said to have a left inverse if
ba = 1 for some b E R. Show that if the left inverse b of a is unique, then
ab = 1 (so b is also a right inverse of a).
4. MAXIMAL IDEALS
This will be a section with one major theorem. The importance of this result
will only become fully transparent when we discuss fields in Chapter 5. How
ever, it is a result that stands on its own two feet. It isn't difficult to prove,
but in mathematics the correlation between difficult and important isn't al
ways that high. There are many difficult results that are of very little interest
and of even less importance, and some easy results that are crucial. Of
course, there are some resultsmany, manywhich are of incredible diffi
culty and importance.
Lemma 4.4.1. Let R be a commutative ring with unit whose only
ideals are (0) and itself. Then R is a field.
Proof Let a =1= 0 be in R. Then (a) = {xa Ix E R} is an ideal of R, as
we verified in Example 9 in the preceding section. Since a = la E (a), (a) =1= (0).
Thus, by our hypothesis on R, (a) = R. But then, by the definition of (a),
every element i E R is a multiple xa of a for some x E R. In particular, be
cause 1 E R, 1 = ba for some b E R. This shows that a has the inverse b in R.
So R is a field. 0
In Theorem 4.3.4the Correspondence Theoremwe saw that if
'P : R ~ R' is a homomorphism of R onto R' with kernel K, then there is a
11 correspondence between ideals of R' and ideals of R that contain K. Sup
pose that there are no ideals other than K itself and R which contain K.
What does this imply about R'? Since (0) in R' corresponds to K in R, and
R' corresponds to all of R, we must conclude that in this case R' has no
ideals other than (0) and itself. So if R' is commutative and has a unit ele
ment, then, by Lemma 4.4.1, R' must be a field.
This prompts the following definition.
Definition. A proper ideal M of R is a maximal ideal of R if the only
ideals of R that contain Mare M itself and R.
The discussion preceding this definition has already almost proved
for us
Sec. 4 Maximal Ideals 149
Theorem 4.4.2. Let R be a commutative ring with 1, and let M be a
maximal ideal of R. Then RIM is a field.
Proof There is a homomorphism of R onto R' = RIM, and since 1 E R
we have that R' has 1 + M as its unit element. (See Problem 3, Section 3).
Because M is a maximal ideal of R, we saw in the discussion above that R'
has no nontrivial ideals. Thus, by Lemma 4.4.1, R' = RIM is a field. 0
This theorem will be our entry into the discussion of fields, for it will
enable us to construct particularly desirable fields whenever we shall need
them.
Theorem 4.4.2 has a converse. This is
Theorem 4.4.3. If R is a commutative ring with 1 and M an ideal of R
such that RIM is a field, then M is a maximal ideal of R.
Proof We saw in Example 2 of Section 3 that the only ideals in a field
Fare (0) and F itself. Since RIM is a field, it has only (0) and itself as ideals.
But then, by the correspondence given us by Theorem 4.3.4, there can be no
ideal of R property between M and R. Thus M is a maximal ideal of R. D
We give a few examples of maximal ideals in commutative rings.
Examples
1. Let 7L be the integers and M an ideal of 7L. As an ideal of 7L we certainly
have that M is an additive subgroup of 7L, so must consist of all multiples of
some fixed integer n. Thus since RIM == 7L
n
and since 7L
n
is a field if and only
if n is a prime, we see that M is a maximal ideal of 7L if and only if M consists
of all the multiples of some prime p. Thus the set of maximal ideals in 7L cor
responds to the set of prime numbers.
2. Let 7L be the integers, and let R = {a + bi Ia, b E 7L}, a subring of
C (i
2
= 1). Let M be the set of all a + bi in R, where 31 a and 31 b. We
leave it to the reader to verify that M is an ideal of R.
We claim that M is a maximal ideal of R. For suppose that N :J M and
N =1= M is an ideal of R. So there is an element r + si E N, where 3 does not
divide r or 3 does not divide s. Therefore, 3 (r
2
+ S2). (Prove using congru
ences mod 3 !) But t = r
2
+ S2 = (r + si)(r  si), so is in N, since r + si E N
and N is an ideal of R. So N has an integer t = r
2
+ S2 not divisible by 3. Thus
ut + 3v = 1 for some integers u, v; but tEN, hence ut E Nand 3 E MeN,
so 3v E N. Therefore, 1 = ut + 3v E N. Therefore, (a + bi)1 E N, since N is
an ideal of R, for all a + bi E R. This tells us that N = R. So the only ideal of
R above M is R itself. Consequently, M is a maximal ideal of R.
150 Ring Theory Ch.4
By Theorem 4.4.2 we know that RIM is a field. It can be shown (see
Problem 2) that RIM is a field having nine elements.
3. Let R be as in Example 2 and let I = {a + bi I Sla and SIb}. We assert that
I is not a maximal ideal of R.
In R we can factor S = (2 + i)(2  i). Let M = {x(2 + i) Ix E R}. Mis
an ideal of R, and since S = (2 + i)(2  i) is in M, we see that I C M. Clearly,
I =1= M for 2 + i E M and is not in I because S{2. So I =1= M. Can M = R?
If so, then (2 + i)(a + bi) = 1 for some a, b. This gives 2a  b = 1 and
2b + a = 0; these two equations imply that Sa = 2, so a = ~ , b = t. But
~ fl. 7L, t $. 7L; the element a + bi = ~  ti is not in R. So M =1= R.
One can show, however, that M is a maximal ideal of R. (See Problem 3.)
4. Let R = {a + bv'21 a, b integers}, which is a subring of the real field under
the sum and product of real numbers. That R is a ring follows from
(a + bv'2) + (c + dv'2) = (a + c) + (b + d)v'2
and
(a + bv'2)(c + dv'2) = (ac + 2bd) + (ad + bc)v'2.
Let M = {a + bv'2 E R I Sla and 5Ib}. M is easily seen to be an ideal of R.
We leave it to the reader to show that M is a maximal ideal of R and that
RIM is a field having 2S elements.
5. Let R be the ring of all realvalued continuous functions on the closed
unit interval [0, 1]. We showed in Example S of Section 3 that if M =
{fER If ( ~ ) = O}, then M is an ideal of Rand RIM is isomorphic to the real
field. Thus, by Theorem 4.4.3, M is a maximal ideal of R.
Of course, if we let My = {f E R If( y) = O}, where y E [0, 1], then My is
also a maximal ideal. It can be shown that every maximal ideal in R is of the
form My for some y E [0,1], but to prove it we would require some results from
real variable theory.
What this example shows is that the maximal ideals in R correspond to
the points of [0, 1].
PROBLEMS
1. If a, b are integers and 3 {a or 3 { b, show that 3 { (a
2
+ b
2
).
2. Show that in Example 2, RIM is a field having nine elements.
3. In Example 3, show that M = {x(2 + i) Ix E R} is a maximal ideal of R.
Sec. 5 Polynomial Rings 151
4. In Example 3, show that RIM ~ 7L
s
.
5. In Example 3, show that RII ~ 7L
s
EB 7L
s
.
6. In Example 4, show that M is a maximal ideal of R.
7. In Example 4, show that RIM is a field having 25 elements.
8. Using Example 2 as a model, construct a field having 49 elements.
We make a short excursion back to congruences mod p, where p is an odd
prime. If a is an integer such that p ~ a and r == a mod p has a solution x in 7L,
we say that a is a quadratic residue mod p. Otherwise, a is said to be a quadratic
nonresidue mod p.
9. Show that (p  1)/2 of the numbers 1, 2, ... , p  1 are quadratic
residues and (p  1)/2 are quadratic nonresidues modp. [Hint: Show
that {x
2
1 x =1= 0 E 7L
p
} forms a group of order (p  1)/2.]
10. Let m > 0 be in 7L, and suppose that m is not a square in 7L. Let R =
{a + Y;;; b I a, b E 7L }. Prove that under the operations of sum and
product of real numbers R is a ring.
11. If p is an odd prime, let us set I
p
= {a +Y;;; b I pia and pi b}, where
a + Y;;; b E R, the ring in Problem 10. Show that I
p
is an ideal of R.
12. If m is a quadratic nonresidue mod p, show that the ideal I
p
in Problem
11 is a maximal ideal of R.
13. In Problem 12 show that Rll
p
is a field having p 2 elements.
5. POLYNOMIAL RINGS
The material that we consider in this section involves the notion of polyno
mial and the set of all polynomials over a given field. We hope that most
readers will have some familiarity with the notion of polynomial from their
high school days and will have seen some of the things one does with polyno
mials: factoring them, looking for their roots, dividing one by another to get
a remainder, and so on. The emphasis we shall give to the concept and alge
braic object known as a polynomial ring will be in a quite different direction
from that given in high school.
Be that as it may, what we shall strive to do here is to introduce the
ring of polynomials over a field and show that this ring is amenable to a care
ful dissection that reveals its innermost structure. As we shall see, this ring is
very wellbehaved. The development should remind us of what was done for
the ring of integers in Section 5 of Chapter 1. Thus we shall run into the ana
log of Euclid's algorithm, greatest common divisor, divisibility, and possibly
most important, the appropriate analog of prime number. This will lead to
152 Ring Theory Ch.4
unique factorization of a general polynomial into these "prime polyno
mials," and to the nature of the ideals and the maximal ideals in this new
setting.
But the polynomial ring enjoys one feature that the ring of integers did
not: the notion of a root of a polynomial. The study of the nature of such
rootswhich will be done, for the most part, in the next chapterconsti
tutes a large and important part of the algebraic history of the past. It goes
under the title Theory of Equations, and in its honorable past, a large variety
of magnificent results have been obtained in this area. Hopefully, we shall
see some of these as our development progresses.
With this sketchy outline of what we intend to do out of the way, we
now get down to the nittygritty of doing it.
Let F be a field. By the ring of polynomials in x over F, which we shall
always write as F[x], we mean the set of all formal expressions p(x) =
ao + alx + ... + an_lX
n

1
+ anx
n
, n 2:: 0, where the ai' the coefficients of
the polynomial p (x), are in F. We sometimes employ the alternative nota
tion: p(x) = aox
n
+ alx
n

1
+ ... + an. In F[x] we define equality, sum, and
product of two polynomials so as to make of F[x] a commutative ring as fol
lows:
1. Equality. We declare p(x) = ao + alx + ... + anx
n
and q(x) =
b
o
+ b1x + ... + bnx
n
to be equal if and only if their corresponding coeffi
cients are equal, that is, if and only if ai = b
i
for all i 2:: o.
We combine this definition of equality of polynomials p(x) and q(x)
with the convention that if
q(x) = b
o
+ b1x + · · . + bnx
n
and if b
m
+
1
= ... = b
n
= 0, then we can drop the last n  m terms and write
q(x) as
q(x) = b
o
+ b1x + ... + bmX
m
•
This convention is observed in the definition of addition that follows,
where s is the larger of m and n and we add coefficients an+l = ... = as = 0
if n < s or b
m
+
1
= ... = b
s
= 0 if m < s.
2. Addition. If p(x) = ao + alx + ... + anx
n
and q(x) = b
o
+ b1x
+ ... + bmx
m
, we declare p(x) + q(x) = Co + CIX + ... + csx
s
, where for
each i, Ci = ai + bi.
So we add polynomials by adding their corresponding coefficients.
The definition of multiplication is a little more complicated. We define
it loosely at first and then more precisely.
Sec. 5 Polynomial Rings 153
3. Multiplication. If p(x) = ao + alx + ... + anx
n
and q(x) =
b
o
+ b1x + ... + bmx
m
, we declare p(x)q(x) = Co + CIX + ... + ctX
t
, where
the Ci are determined by multiplying the expression out formally, using the
distributive laws and the rules of exponents XU XV = x
u
+
v
, and collecting
terms. More formally,
We illustrate these operations with a simple example, but first a nota
tional device: If some coefficient is 0, we just omit that term. Thus we write
9 + Ox + 7x
2
+ Ox
3
 14x
4
as 9 + 7x
2
 14x
4
•
Let p(x) = 1 + 3x
2
, q(x) = 4  5x + 7x
2
 x
3
• Then p(x) + q(x) =
5  5x + 10x
2
 x
3
while
p(x)q(x) = (1 + 3x
2
)(4  5x + 7x
2
 x
3
)
= 4  5x + 7x
2
 x
3
+ 3x
2
(4  5x + 7x
2
 x
3
)
= 4  5x + 7x
2
 x
3
+ 12x
2
 15x
3
+ 21x
4
 3x
5
= 4  5x + 19x
2
 16x
3
+ 21x
4
 3x
5
•
Try this product using the Ci as given above.
In some sense this definition of F[x] is not a definition at all. We have
indulged in some hand waving in it. But it will do. We could employ se
quences to formally define F[x] more precisely, but it would merely cloud
what to most readers is well known.
The first remark that we makeand do not verifyis that F[x] is a
commutative ring. To go through the details of checking the axioms for a
commutative ring is a straightforward but laborious task. However, it is im
portant to note
Lemma 4.5.1. F[x] is a commutative ring with unit.
Definition. If p(x) = ao + alx + ... + anx
n
and an =1= 0, then the
degree ofp(x), denoted by degp(x), is n.
So the degree of a polynomial p(x) is the highest power of x that occurs in
the expression for p(x) with a nonzero coefficient. Thus deg(x  x
2
+ x
4
) = 4,
deg(7x) = 1, deg 7 = O. (Note that this definition does not assign a degree to
O. It is, however, sometimes convenient to adopt the convention that the de
gree of 0 be  00, in which case many degree related results will hold in this
extended context.) The polynomials of degree 0 and the polynomial 0 are
called the constants; thus the set of constants can be identified with F.
The degree function on F[x] will playa similar role to that played by
154 Ring Theory Ch.4
the size of the integers in 7L, in that it will provide us with a version of Eu
clid's Algorithm for F[x].
One immediate and important property of the degree function is that it
behaves well for products.
Lemma 4.5.2. If p(x), q(x) are nonzero elements of F[x], then
deg(p(x)q(x» = degp(x) + deg q(x).
Proof Let m = degp(x) and n = deg q(x); thus the polynomialp(x) =
ao + alx + ... + amx
m
, where am =1= 0, and the polynomial q(x) = b
o
+
b1x + ... + bnx
n
, where b
n
=1= 0. The highest power of x that can occur in
p(x)q(x) is x
m
+
n
, from our definition of the product. What is the coefficient
of x
m
+
n
? The only way that x
m
+
n
can occur is from (amxm)(bnx
n
) =
ambnx
m
+
n
. So the coefficient of x
m
+
n
in p(x)q(x) is amb
n
, which is not 0, since
am =1= 0, b
n
=1= 0. Thus deg(p(x)q(x» = m + n = degp(x) + deg q(x), as
claimed in the lemma. D
One also has some information about deg(p(x) + q(x». This is
Lemma 4.5.3. If p(x), q(x) E F[x] and p(x) + q(x) =1= 0, then
deg(p(x) + q(x» ::; max(degp(x), deg q(x».
We leave the proof of Lemma 4.5.3 to the reader. It will play no role in
what is to come, whereas Lemma 4.5.2 will be important. We put it in so that
the "+" should not feel slighted visavis the product.
An immediate consequence of Lemma 4.5.2 is
Lemma 4.5.4. F[x] is an integral domain.
Proof If p(x) =1= °and q(x) =1= 0, then degp(x) ;::: 0, deg q(x) ;::: 0, so
deg(p(x)q(x» = degp(x) + deg q(x) ;::: 0. Therefore, p(x)q(x) has a degree,
so cannot be °(which has no degree assigned to it). Thus F[x] is an integral
domain. D
One of the things that we were once forced to learn was to divide
one polynomial by another. How did we do this? The process was
called long division. We illustrate with an example how this was done,
for what we do in the example is the model of what we shall do in the gen
eral case.
We want to divide 2x
2
+ 1 into x
4
 7x + 1. We do it schematically as
follows:
Sec. 5
and we interpret this as:
Polynomial Rings
7x + 1
7x + 1
7x + 1 ~
155
x
4
 7x + 1 = (2x
2
+ 1 ) ( ~ x 2  ~ ) + (7x + ~ )
and 7x + ~ is called the remainder in this division.
What exactly did we do? First, where did the ~ X 2 come from? It came
from the fact that when we multiply 2x
2
+ 1 by ~ x
2
we get x
4
, the highest
power occurring in x
4
 7x + 1. So subtracting ~ x 2 ( 2 x 2 + 1) from x
4
 7x + 1
gets rid of the x
4
term and we go on to what is left and repeat the procedure.
This "repeat the procedure" suggests induction, and that is how we
shall carry out the proof. But keep in mind that all we shall be doing is what
we did in the example above.
What this gives us is something like Euclid's Algorithm, in the integers.
However, here we call it the Division Algorithm.
Theorem 4.5.5 (Division Algorithm). Given the polynomials f(x),
g(x) E F[x], where g(x) =1= 0, then
f(x) = q(x)g(x) + rex),
where q(x), rex) E F[x] and rex) = 0 or deg rex) < deg g(x).
Proof We go by induction on degf(x). If either f(x) = 0 or degf(x) <
deg g(x), then f(x) = Og(x) + f(x), which satisfies the conclusion of the
theorem.
So suppose that degf(x) 2:: deg g(x); thus the polynomial f(x) =
ao + at X + ... + am
xm
, where am =1= 0 and the polynomial g(x) = b
o
+ btx
+ ... + bnx
n
, where b
n
=1= 0 and where m 2:: n.
Consider
:m xmng(x) = ~ m xmn(b
o
+ b1x + · · · + bnx
n
)
n n
156 Ring Theory Ch.4
Thus (amlbn)xmng(x) has the same degree and same leading coefficient as does
f(x), so f(x)  (ami bn)xmng(x) = h(x) is such that the relation deg h(x) <
degf(x) holds. Thus, by induction,
h (x) = qt (x)g(x) + r(x), where qt(x), r(x) E F [x]
and r(x) = 0 or deg r(x) < deg g(x). Remembering what h(x) is, we get
h(x) = f(x)  ~ m xmng(x) = qJ(x)g(x) + r(x)
n
so
If q(x) = (amlbn)x
m

n
+ ql(X), we have achieved the form claimed in the
theorem.D
The Division Algorithm has one immediate application: It allows us to
determine the nature of all the ideals of F[x]. As we see in the next theorem,
an ideal of F[x] must merely consist of all multiples, by elements of F[x], of
some fixed polynomial.
Theorem 4.5.6. If I =1= (0) is an ideal of F[x], then I = {f(x)g(x) If(x) E
F[x]}; that is, I consists of all multiples of the fixed polynomial g(x) by the el
ements of F[x].
Proof To prove the theorem, we need to produce that fixed polyno
mial g(x). Where are we going to dig it up? The one control we have numeri
cally on a given polynomial is its degree. So why not use the degree function
as the mechanism for finding g(x)?
Since I =1= (0) there are elements in I having nonnegative degree. So
there is a polynomial g(x) =1= 0 in I of minimal degree; that is, g(x) =1= 0 is in I
and if 0 =1= t(x) E I, then deg t(x) 2:: deg g(x). Thus, by the division algorithm,
t(x) = q(x)g(x) + r(x), where r(x) = 0 or deg r(x) < deg g(x). But since
g(x) E I and I is an ideal of F[x], we have that q(x)g(x) E I. By assumption,
t(x) E I, thus t(x)  q(x)g(x) is in I, so r(x) = t(x)  q(x)g(x) is in I. Since
g(x) has minimal degree for the elements of I and r(x) E I, deg r(x) cannot
be less than deg g(x). So we are left with r(x) = O. But this says that t(x) =
q(x)g(x). So every element in I is a multiple of g(x). On the other hand,
Sec. 5 Polynomial Rings 157
since g(x) E I and I is an ideal of F[x], f(x)g(x) E I for all f(x) E F[x]. The
net result of all this is that I = {f(x)g(x) If(x) E F[x]}. D
Definition. An integral domain R is called a principal ideal domain if
every ideal I in R is of the form I = {xa Ix E R} for some a E I.
Theorem 4.5.6 can be stated as: F[x] is a principal ideal domain.
We shall write the ideal generated by a given polynomial, g(x), namely
{f(x)g(x) If(x) E F[x]}, as (g(x».
The proof showed that if I is an ideal of F[x], then I = (g(x», where
g(x) is a polynomial of lowest degree contained in I. But g(x) is not unique, for
if a =1= 0 E F, then ag(x) is in I and has the same degree as g(x), so I = (ag(x».
To get some sort of uniqueness in all this, we single out a class of poly
nomials.
Definition. f(x) E F[x] is a monic polynomial if the coefficient of its
highest power is 1.
Thus f(x) is monic means that
We leave to the reader to show that if I is an ideal of F[x], then there is
only one monic polynomial of lowest degree in I. Singling this out as the gen
erator of I does give us a "monic" uniqueness for the generation of I.
Our next step in this parallel development with what happens in the in
tegers is to have the notion of one polynomial dividing another.
Definition. Suppose f(x), g(x) E F[x], with g(x) =1= O. We say that
g(x) divides f(x), written as g(x) If(x), if f(x) = a(x)g(x) for some a(x) E
F[x].
Note that if f(x) =1= 0 and g(x) If(x), then deg g(x) ~ degf(x) by Lemma
4.5.2. Moreover, the ideals (f(x» and (g(x» of F[x], generated by f(x) and
g(x), respectively, satisfy the containing relation (f(x» c (g(x». (Prove!)
We again emphasize the parallelism between 7L, the integers, and F[x]
by turning to the notion of greatest common divisor. In order to get some sort
of uniqueness, we shall insist that the greatest common divisor always be a
monic polynomial.
158 Ring Theory Ch.4
Definition. For any two polynomials f(x) and g(x) E F[x] (not both 0),
the polynomial d(x) E F[x] is the greatest common divisor of f(x), g(x) if
d (x) is a monic polynomial such that:
(a) d(x) If(x) and d(x) Ig(x).
(b) If hex) If(x) and hex) I g(x), then hex) I d(x).
Although we defined the greatest common divisor of two polynomials,
we neither know, as yet, that it exists, nor what its form may be. We could
define it in another, and equivalent, way as the monic polynomial of highest
degree that divides both f(x) and g(x). If we did that, its existence would be
automatic, but we would not know its form.
Theorem 4.5.7. Given f(x) and g(x) =1= 0 in F[x], then their greatest
common divisor d(x) E F[x] exists; moreover, d(x) = a(x)f(x) + b(x)g(x)
for some a(x), b(x) E F[x].
Proof Let I be the set of all r(x)f(x) + s(x)g(x) as rex), sex) run
freely over F[x]. We claim that I is an ideal of R. For,
(rl(x)f(x) + Sl(X)g(X» + (r2(x)f(x) + S2(X)g(X»
= (rl(x) + r
2
(x»f(x) + (Sl(X) + S2(X»g(x),
so is again in I, and for t(x) E F[x],
t(x)(r(x)f(x) + s(x)g(x» = (t(x)r(x»f(x) + (t(x)s(x»g(x),
so it, too, is again in I. Thus I is an ideal of F[x]. Since g(x) =1= 0, we know
that I =1= 0, since both f(x) and g(x) are in [.
Since [ =1= 0 is an ideal of F[x], it is generated by a unique monic polyno
mial d(x) (Theorem 4.5.6). Since f(x), g(x) are in I, they must then be multi
ples of d(x) by elements of F[x]. This assures us that d(x) I f(x) and
d(x) Ig(x).
Because d(x) E I and [ is the set of all r(x)f(x) + s ( ~ ) g ( x ) , we have
that d(x) = a(x)f(x) + b(x)g(x) for some appropriate a(x), b(x) E F[x].
Thus if hex) If(x) and hex) Ig(x), then hex) I(a(x)f(x) + b(x)g(x» = d(x).
So d(x) is the greatest common divisor off(x) and g(x).
This proves the theorem; the uniqueness of d(x) is guaranteed by the
demand that we have made that the greatest common divisor be monic. 0
Another way to see the uniqueness of d(x) is from
Lemma 4.5.8. If f(x) =1= 0, g(x) =1= 0 are in F[x] and f(x) Ig(x) and
g(x) If(x), then f(x) = ag(x), where a E F.
Sec. 5 Polynomial Rings 159
Proof By the mutual divisibility condition on f(x) and g(x) we have,
by Lemma 4.5.2, degf(x) :::; deg g(x) :::; degf(x), so degf(x) = deg g(x). But
f(x) = a(x)g(x), so
degf(x) = deg a(x) + deg g(x) = deg a(x) + degf(x),
in consequence of which deg a(x) = 0, so a(x) = a, an element of F. D
We leave the proof of the uniqueness of the greatest common divisor
via Lemma 4.5.8 to the reader.
Definition. The polynomialsf(x), g(x) in F[x] are said to be relatively
prime if their greatest common divisor is 1.
Although it is merely a very special case of Theorem 4.5.7, to empha
size it and to have it to refer to, we state:
Theorem 4.5.9. If f(x), g(x) E F[x] are relatively prime, then
a(x)f(x) + b(x)g(x) = 1 for some a(x), b(x) E F[x]. Conversely, if
a(x)f(x) + b(x)g(x) = 1 for some a(x), b(x) E F[x], then f(x) and g(x) are
relatively prime.
Proof We leave this "conversely" part to the reader as exercise. D
As with the integers, we have
Theorem 4.5.10. If q(x) and f(x) are relatively prime and if
q(x) If(x)g(x), then q(x) Ig(x).
Proof By Theorem 4.5.9 a(x)f(x) + b(x)q(x) = 1 for some a(x),
b(x) E F[x]. Therefore,
a(x)f(x)g(x) + b(x)q(x)g(x) = g(x). (1)
Since q(x) Ib(x)g(x)q(x) and q(x) If(x)g(x) by hypothesis, q(x) divides the
lefthand side of the relation in (1). Thus q(x) divides the righthand side
of (1), that is, q(x) Ig(x), the desired conclusion. 0
We are now ready to single out the important class of polynomials that
will play the same role as prime objects in F[x] as did the prime numbers in 7L.
Definition. The polynomial p (x) E F[x] is irreducible if p (x) is of pos
itive degree and given any polynomialf(x) in F[x], then either p(x) If(x) or
p(x) is relatively prime to f(x).
160 Ring Theory Ch.4
We should note here that the definition implies that a polynomial p(x)
of positive degree is irreducible in F[x] if and only if p(x) cannot be factored
as a product of two polynomials of positive degree. In other words, if p(x) =
a(x)b(x), where a(x) and b(x) are in F[x], then either a(x) is a constant (that
is, an element of F), or b(x) is constant. The proof of this fact is very similar
to the proof of an analogous observation concerning two equivalent defini
tions of a prime number.
First suppose that p(x) (of positive degree) cannot be factored as a
product of two noneonstant polynomials. Then, given any f(x) in F[x], we
have only two possibilities for (p(x),f(x», namely 1 or a monic polynomial of
the form c·p(x), where c is an element of F. Thus (p(x), f(x» = 1 or p(x) If(x),
which shows that p(x) is irreducible.
Now let p(x) be irreducible in F[x] and suppose p(x) = a(x)b(x) for
some a(x), b(x) in F[x]. According to the definition, we must have
p(x) Ia(x) or (P(x), a(x» = 1. If p(x) Ia(x), then b(x) must be a constant. If,
on the other hand, p(x) and a(x) are relatively prime, then by Theorem
4.5.10, p(x) Ib(x), and in this case a(x) must be a constant. This shows that an
irreducible polynomial cannot be factored as a product of two nonconstant
polynomials.
Note that the irreducibility of a polynomial depends on the field F. For
instance, the polynomial x
2
 2 is irreducible in Q[x], where Q is the field of
rational numbers, but x
2
 2 is not irreducible in lR[x], where IR is the field of
real numbers, for in lR[x]
x
2
 2 = (x  v2)(x + v2).
Corollary to Theorem 4.5.10. If p(x) is irreducible in F[x] and
p(x) Iat (x)a2(x) ... ak(x), where at (x), ... , ak(x) are in F[x], then p(x) Iai(x)
for some i.
Proof We leave the proof to the reader. (See Theorem 1.5.6.) D
Aside from its other properties, an irreducible polynomial p(x) in F[x]
enjoys the property that (p(x», the ideal generated by p(x) in F[x], is a max
imal ideal of F[x]. We prove this now.
Theorem 4.5.11. If p(x) E F[x] , then the ideal (p(x» generated by
p(x) in F[x] is a maximal ideal of F[x] if and only if p(x) is irreducible in F[x].
Proof We first prove that if p(x) is irreducible in F[x], then the ideal
M = (p(x» is a maximal ideal of F[x]. For, suppose that N is an ideal of
F[x], and N ~ M. By Theorem 4.5.6,
Sec. 5 Polynomial Rings 161
N = (f(x» for some f(x) E F[x].
Because p(x) E MeN, p(x) == a(x)f(x), since every element in N is of this
form. Butp(x) is irreducible in F[x], hence a(x) is a constant or f(x) is a con
stant. If a(x) == a E F, then p(x) == af(x), so f(x) = aIp(x). Thus f(x) E M,
which says that N C M, hence N == M. On the other hand, if f(x) == b E F,
then 1 == b1b E N, since N is an ideal of F[x], thus g(x)1 E N for all
g(x) E F[x]. This says that N == F[x]. Therefore, we have shown M to be a
maximal ideal of F[x].
In the other direction, suppose that M == (p(x» is a maximal ideal of
F[x]. If p(x) is not irreducible, then p(x) == a(x)b(x), where deg a(x) ;::: 1,
deg b(x) ;::: 1. Let N == (a(x»; then, since p(x) == a(x)b(x), p(x) E N. There
fore, MeN. Since deg a(x) ;::: 1, N == (a(x» =1= F[x], since every element in
(a(x» has degree at least that of a(x). By the maximality of M we conclude
that M == N. But then a(x) E N == M, which tells us that a(x) == f(x)p(x);
combined with p(x) == a(x)b(x) == b(x)f(x)p(x), we get that b(x)f(x) == 1.
Since deg 1 == °< deg b(x) ~ deg(b(x)f(x» == deg 1 == 0, we have reached a
contradiction. Thus p(x) is irreducible. D
This theorem is important because it tells us exactly what the maximal
ideals of F[x] are, namely the ideals generated by the irreducible polynomi
als. If M is a maximal ideal of F[x], F[x]/M is a field, and this field contains
F (or more precisely, the field {a + M Ia E F}, which is isomorphic to F).
This allows us to construct decent fields K :) F, the decency of which lies in
that p(x) has a root in K. The exact statement and explanation of this we
postpone until Chapter 5.
The last topic in this direction that we want to discuss is the factoriza
tion of a given polynomial as a product of irreducible ones. Note that if
p(x) == aox
n
+ atXn1 + ... + anIX + an, ao =1= 0, is irreducible in F[x], then
so is ao1p(x) irreducible in F[x]; however, ao1p(x) has the advantage of
being monic. So we have this monic irreducible polynomial trivially obtain
able from p(x) itself. This will allow us to make more precise the uniqueness
part of the next theorem.
Theorem 4.5.12. Let f(x) E F[x] be of positive degree. Then either
f(x) is irreducible in F[x] or f(x) is the product of irreducible polynomials in
F[x]. In fact, then,
f(x) == apl(X)m
1
p2
(X)m
2
••• Pk(X)mk,
where a is the leading coefficient off(x),Pt(x), ... ,Pk(X) are monic and ir
reducible in F[x], mi > 0, ... ,mk > °and this factorization in this form is
unique up to the order of the p;(x).
162 Ring Theory Ch.4
P r ~ o f We first show the first half of the theorem, namely that f(x) is
irreducible or the product of irreducibles. The proof is exactly the same as
that of Theorem 1.5.7, with a slight, obvious adjustment.
We go by induction on degf(x). If degf(x) = 1, then f(x) = ax + b
with a =1= 0 and is clearly irreducible in F[x]. So the result is true in this case.
Suppose, then, that the theorem is correct for all a(x) E F[x] such that
deg a(x) < degf(x). If f(x) is irreducible, then we have nothing to prove.
Otherwise, f(x) = a(x)b(x), a(x) and b(x) E F[x] and deg a(x) < degf(x)
and deg b(x) < degf(x). By the induction, a(x) [and b(x)] is irreducible or is
the product of irreducibles. But then f(x) is the product of irreducible poly
nomials in F[x]. This completes the induction, and so proves the opening half
of the theorem.
Now to the uniqueness half. Again we go by induction on degf(x). If
degf(x) = 1, thenf(x) is irreducible and the uniqueness is clear.
Suppose the result true for polynomials of degree less than degf(x).
Suppose that
f(x) = apl(X)m
1
p2
(x)m
2
••• Pk(X)m
k
= aql(X)n
1
••• qr(x)n
r
,
where the Pi(X), qi(X) are monic irreducible polynomials and the mi' ni are
all positive and a is the leading coefficient of f(x), that is, the coefficient
of the highest power term of f(x). Since Pl(X) If(x), we have that
Pl(X) Iql(X)n
1
••• qr(x)n
r
, so by the corollary to Theorem 4.5.10,Pl(X) Iqi(X)
for some i. Since qi(X) is monic and irreducible, as is Pl(X), we get Pl(X) =
qi(X). We can suppose (on renumbering) thatpl(x) = ql(X). Thus
;1«:)) = ap1(X)m,1
p2
(x)m
z
· · · Pk(X)m
k
= apl(X)n
1
l
q2
(x)n
2
• •• qr(x)n
r
•
By induction we have unique factorization in the required form for
f(X)/Pl(X), whose degree is less than degf(x). Hence we obtain that
ml  1 = nl  1 (so ml = nl), m2 = n2, ... ,mk = nk, r = k and P2(X) =
q2(X), ... ,Pk(X) = qk(X), on renumbering the q's appropriately. This com
pletes the induction and proves the theorem. 0
We have pointed out how similar the situation is for the integers 7L and
the polynomial ring F [x]. This suggests that there should be a wider class of
rings, of which the two examples 7L and F [x] are special cases, for which
much of the argumentation works. It worked for 7L and F [x] because we had
a measure of size in them, either by the size of an integer or the degree of a
polynomial. This measure of size was such that it allowed a Euclidtype algo
rithm to hold.
Sec. 5 Polynomials Rings 163
This leads us to define a class of rings, the Euclidean rings.
Definition. An integral domain R is a Euclidean ring if there is a
function d from the nonzero elements of R to the nonnegative integers that
satisfies:
(a) For a =1= 0, b =1= 0 E R, d(a) ::; d(ab).
(b) Given a =1= 0, b =1= 0, there exist q and r E R such that b == qa + r, where
r == 0 or d(r) < d(a).
The interested student should try to see which of the results proved for
polynomial rings (and the integers) hold in a general Euclidean ring. Aside
from a few problems involving Euclidean rings, we shall not go any further
with this interesting class of rings.
The final comment we make here is that what we did for polynomials
over a field we could try to do for polynomials over an arbitrary ring. That is,
given any ring R (commutative or noncommutative), we could define the
polynomial ring R[x] in x over R by defining equality, addition, and multipli
cation exactly as we did in F[x], for F a field. The ring so constructed, R[x], is
a very interesting ring, whose structure is tightly interwoven with that of R it
self. It would be too much to expect that all, or even any, of the theorems
proved in this section would carryover to R[x] for a general ring R.
PROBLEMS
In the following problems, F will always denote a field.
Easier Problems
1. If F is a field, show that the only invertible elements in F[x] are the
nonzero elements of F.
2. If R is a ring, we introduce the ring R[x] of polynomials in x over R, just as
we did F[x]. Defining degf(x) for f(x) E R[x] as we did in F[x], show that:
(a) deg(f(x)g(x» ::; degf(x) + deg g(x) if f(x)g(x) =1= O.
(b) There is a commutative ring R such that we can find f(x), g(x) in
R[x] with deg(f(x)g(x» < degf(x) + deg g(x).
3. Find the greatest common divisor of the following polynomials over Q,
the field of rational numbers.
(a) x
3
 6x + 7 and x + 4.
(b) x
2
1 and2x
7
 4x
5
+ 2.
164 Ring Theory Ch.4
(c) 3x
2
+ 1 and x
6
+ x
4
+ X + 1.
(d) x
3
 1 and x
7
 x
4
+ x
3
 1.
4. Prove Lemma 4.5.3.
5. In Problem 3, let I = {f(x)a(x) + g(x)b(x)}, where f(x), g(x) run over
Q[x] and a(x) is the first polynomial and b(x) the second one in each part
of the problem. Find d(x), so that I = (d(x» for Parts (a), (b), (c),
and (d).
6. If g(x), f(x) E F[x] and g(x) If(x), show that (f(x» c (g(x».
7. Prove the uniqueness of the greatest common divisor of two polynomials
in F[x] by using Lemma 4.5.8.
8. If f(x), g(x) E F[x] are relatively prime and f(x) I h(x) and g(x) Ih(x),
show thatf(x)g(x) Ih(x).
9. Prove the Corollary to Theorem 4.5.10.
10. Show that the following polynomials are irreducible over the field F indi
cated.
(a) x
2
+ 7 over F = real field = IR.
(b) x
3
 3x + 3 over F = rational field = Q.
(c) x
2
+ X + lover F = 7L
2
•
(d) x
2
+ lover F = 7L
19
•
(e) x
3
90verF=7L
13
•
(f) x
4
+ 2x
2
+ 2 over F = Q.
11. Ifp(x) E F[x] is of degree 3 andp(x) = aox
3
+ aIx
2
+ a2x + a3' show
that p(x) if irreducible over F if there is no element rEF such that
p(r) = aor
3
+ aIr
2
+ a2r + a3 = O.
12. If Fe K are two fields andf(x), g(x) E F[x] are relatively prime in F[x],
show that they are relatively prime in K[x].
MiddleLevel Problems
13. Let IR be the field of real numbers and C that of complex numbers. Show
that lR[x]/(x
2
+ 1) ~ C. [Hint: If A = lR[x]/(x
2
+ 1), let u be the image of
x in A; show that every element in A is of the form a + bu, where
a,bElRandu
2
= 1.]
14. Let F = 7L
II
, the integers mod 11.
(a) Let p(x) = x
2
+ 1; show that p(x) is irreducible in F[x] and that
F[x]/(p (x» is a field having 121 elements.
(b) Let p(x) = x
3
+ X + 4 E F[x]; show that p(x) is irreducible in F[x]
and that F[x]/(p(x» is a field having 11
3
elements.
Sec. 5 Polynomials Rings 165
15. Let F = 7L
p
be the field of integers mod P, where P is a prime, and let
q(x) E F[x] be irreducible of degree n. Show that F[x]/(q(x» is a field
having at most pn elements. (See Problem 16 for a more exact state
ment.)
16. Let F, q(x) be as in Problem 15; show that F[x]/(q(x» has exactly pn ele
ments.
17. Let Pl(X), P2(X), ... , Pk(X) E F[x] be distinct irreducible polynomials
and let q(x) = PI (X)P2(X) ... Pk(X). Show that
~ =.fuLEB F[x] EB'" EB F[x]
(q(x» (PI (X» (P2(X» (Pk(X»"
18. Let F be a finite field. Show that F[x] contains irreducible polynomials of
arbitrarily high degree. (Hint: Try to imitate Euclid's proof that there is
an infinity of prime numbers.)
19. Construct a field having P2 elements, for P an odd prime.
20. If R is a Euclidean ring, show that every ideal of R is principal.
21. If R is a Euclidean ring, show that R has a unit element.
22. If R is the ring of even integers, show that Euclid's algorithm is false in R
by exhibiting two even integers for which the algorithm does not hold.
Harder Problems
23. Let F = 7L
7
and let p(x) = x
3
 2 and q(x) = x
3
+ 2 be in F[x]. Show
that p(x) and q(x) are irreducible in F[x] and that the fields F[x]/(p(x»
and F[x]/(q(x» are isomorphic.
24. Let Q be the field of rational numbers, and let q(x) = x
2
+ X + 1 be in
Q(x). If a is a complex number such that a
2
+ a + 1 = 0, show that the
set {a + ba Ia, b E Q} is a field in two ways; the first by showing it to be
isomorphic to something you know is a field, the second by showing that
if a + ba =1= 0, then its inverse is of the same form.
25. If P is a prime, show that q(x) = 1 + x + x
2
+ ... x
p

l
is irreducible in
Q[x].
26. Let R be a commutative ring in which a
2
= 0 only if a = O. Show that if
q(x) E R[x] is a zerodivisor in R[x], then, if
q(
x) = a x
n
+ a x
n

I
+ ... + a
o 1 n'
there is an element b =1= 0 in R such that baa = bal = ... = ban = O.
166 Ring Theory Ch.4
27. Let R be a ring and I an ideal of R. If R[x] and I[x] are the polynomial
rings in x over R and I, respectively, show that:
(a) I [x] is an ideal of R[x].
(b) R[x]/I[x] ~ (R/I)[x].
Very Hard Problems
*28. Do Problem 26 even if the condition "a
2
= 0 only if a = 0" does not hold
in R.
29. Let R = {a + bi Ia, b integers} C C. Let d (a + bi) = a
2
+ b
2
• Show that
R is a Euclidean ring where d is its required Euclidean function. (R is
known as the ring of Gaussian integers and plays an important role in
number theory.)
6. POLYNOMIALS OVER THE RATIONALS
In our consideration of the polynomial ring F[x] over a field F, the particular
nature of F never entered the picture. All the results hold for arbitrary fields.
However, there are results that exploit the explicit character of certain fields.
One such field is that of the rational numbers.
We shall present two important theorems for Q[x], the polynomial ring
over the rational field Q. These results depend heavily on the fact that we are
dealing with rational numbers. The first of these, Gauss' Lemma, relates the
factorization over the rationals with factorization over the integers. The sec
ond one, known as the Eisenstein Criterion, gives us a method of constructing
irreducible polynomials of arbitrary degree, at will, in Q[x]. In this the field Q
is highly particular. For instance, there is no easy algorithm for obtaining irre
ducible polynomials of arbitrary degree n over the field 7L
p
of the integers
mod p, p a prime. Even over 7L
2
such an algorithm is nonexistent; it would be
highly useful to have, especially for coding theory. But it just doesn't exist
so far.
We begin our consideration with two easy results.
Lemma 4.6.1. Letf(x) E Q[x]; then
where u, m, ao, ... , an are integers and the ao, ab ... , an have no common
factor greater than 1 (i.e., are relatively prime) and (u, m) = 1.
Sec. 6 Polynomials Over the Rationals 167
Proof Since f(x) E Q[x], f(x) = qox
n
+ qlX
n

l
+ ... + qn, where the
qi are rational numbers. So for i = 0, 1,2, ... , n, qi = bilci' where bi' c
i
are
integers. Thus
clearing of denominators gives us
f(x) = COC
l
.1.. C
n
(uox
n
+ UlX
n

l
+ · · · + un),
where the Ui are integers. If w is the greatest common divisor of un, Ul, ... , Un'
then each Ui = wa
i
, where ao, ab ... , an are relatively prime integers. Then
canceling out the greatest common factor of wand COCl ••• C
n
gives us
where U, m are relatively prime integers, as is claimed in the lemma. D
The next result is a result about a particular homomorphic image of
R[x] for any ring R.
Lemma 4.6.2. If R is any ring and I an ideal of R, then I[x], the poly
nomial ring in x over I, is an ideal of R[x]. Furthermore, R[x]II[x] == (RII)[x],
the polynomial ring in x over RII.
Proof Let R = RII; then there is a homomorphism q;: R ~ R, defined
by q;(a) = a + I, whose kernel is I. Define <I> : R[x] ~ R[x] by: If
f(x) = aox
n
+ a1x
n

1
+ ... + an,
then
<I>(f(x» = q;(ao)x
n
+ q;(a1)x
n

1
+ ... + q;(a
n
).
We leave it to the reader to prove that <I> is a homomorphism of R[x] onto
R[x]. What is the kernel, Ker <1>, of <I>? If f(x) = aox
n
+ ... + an is in Ker <1>,
then <I>(f(x» = 0, the °element of R[x]. Since
<I>(f(x» = q;(ao)x
n
+ q;(a1)x
n

1
+ · · . + q;(a
n
) = 0,
we conclude q;(ao) = 0, q;(al) = 0, ... , q;(a
n
) = 0, by the very definition of
what we mean by the Opolynomial in a polynomial ring. Thus each ai is in
168 Ring Theory Ch.4
the kernel of 'P, which happens to be I. Because ao, al, ... , an are in I,[(x) =
aox
n
+ alX
n

l
+ ... + an is in I[x]. So Ker <1> C I[x]. That I[x] C Ker <1> is im
mediate from the definition of the mapping <1>. Hence I[x] = Ker <1>. By the
First Homomorphism Theorem (Theorem 4.3.3), the ring I[x] is then an
ideal of R[x] and R[x] ~ R[x]/Ker <1> = R[x]ll[x]. This proves the lemma, re
membering that R= RII. 0
As a very special case of the lemma we have the
Corollary. Let 7L be the ring of integers, p a prime number in 7L, and
I = (p), the ideal of 7L generated by p. Then 7L[x]ll[x] ~ 7L
p
[x].
Proof Since 7L
p
~ TLII, the corollary follows by applying the lemma to
R = 7L. 0
We are ready to prove the first of the two major results we seek in this
section.
Theorem 4.6.3 (Gauss' Lemma). If [(x) E 7L[x] is a monic polynomial
and [(x) = a(x)b(x), where a(x) and b(x) are in Q[x]. Then [(x) =
al (x)b
l
(x), where al (x), b
l
(x) are monic polynomials in 7L[x] and deg al (x) =
deg a(x), deg b
l
(x) = deg b(x).
Proof Suppose that [(x) E 7L[x] and that [(x) = a(x)b(x), where a(x),
b(x) E Q[x], and deg a(x) = S, deg b(x) = r. By Lemma 4.6.1, we can ex
press each of a(x), b(x) as a product of a rational number and a polynomial
with integer coefficients. More precisely,
a(x) = .!!l (a'x
S
+ a'x
s

1
+ ... + a') = .!!la (x)
m
1
0 1 s mIl'
where a ~ , aI, ... , a; are relatively prime integers and
b(x) = !!1 (b ,x
r
+ b'x
r

1
+ ... + b') = !!1 b (x)
m
2
0 1 r m
2
1 ,
where b ~ , b
l
, ... , b; are relatively prime. Thus
where v and ware relatively prime, by canceling out the common factor of
UlU2 and mlm2. Therefore, w[(x) = val (x)b
l
(x), and [(x), al (x), b
l
(x) are
all in 7L[x]. Of course, we may assume with no loss of generality that the lead
ing coefficients of al (x) and b
l
(x) are positive.
If w = 1, then, since [(x) is monic, we get that v a b b ~ = 1 and this leads
Sec. 6 Polynomials Over the Rationals 169
easily to v = 1, ab = bb = 1 and so f(x) = al (x)b
l
(x), where both al (x) and
b
l
(x) are monic polynomials with integer coefficients. This is precisely the
claim of the theorem, since deg al (x) = deg a(x) and deg b
l
(x) = deg b(x).
Suppose then that w =1= 1; thus there is a prime p such that p Iwand,
since (v, w) = 1, p ~ v. Also, since the coefficients ab, at, ... , a; of al (x)
are relatively prime, there is an i such that p ~ a i; similarly, there is a j
such that p ~ bj. Let I = (p) be the ideal generated by p in Z; then
Z/I:::= 7L
p
and, by the Corollary to Lemma 4.6.2, Z[x]/I[x] :::= 7L
p
[x], so is an in
tegral domain. However, since p I w, lV, the image of w in Z[x]/I[x], is 0,
and since p ~ v , v the image of v in Z[x]/I[x] is not O. Thus 01 (x) =
v al(x)bl(x), where v ¥ 0 and aleX) ¥ 0, bl(x) ¥ 0 because p ~ a; and p ~ bj for
the given i, j above. This contradicts that Z[x]/I [x] is an integral domain. So
w =1= 1 is not possible, and the theorem is proved. D
It might be instructive for the reader to try to show directly that if
x
3
+ 6x  7 is the product of two polynomials having rational coefficients,
then it is already the product of two monic polynomials with integer coeffi
cients.
One should say something about C. F. Gauss (17771855), considered by many
to be the greatest mathematician ever. His contributions in number theory, al
gebra, geometry, and so on, are of gigantic proportions. His contributions in
physics and astronomy are also of such a great proportion that he is considered
by physicists as one of their greats, and by the astronomers as one of the impor
tant astronomers of the past.
As we indicated at the beginning of this section, irreducible polynomi
als of degree n over a given field F may be very hard to come by. However,
over the rationals, due to the next theorem, these exist in abundance and are
very easy to construct.
Theorem 4.6.4 (The Eisenstein Criterion). Let f(x) = x
n
+ alX
n

1
+ ... + an be a nonconstant polynomial with integer coefficients. Suppose
that there is some prime p such that pial, p I a2, ... ,p I an, but p2 ~ an. Then
f(x) is irreducible in Q[x].
Proof Suppose that f(x) = u(x)v(x), where u(x), vex) are of positive
degree and are polynomials in Q[x]. By Gauss' Lemma we may assume that
both u(x) and vex) are monic polynomials with integer coefficients. Let
I = (p) be the ideal generated by p in Z, and consider Z[x]/ I[x], which is an
integral domain, since we know by the Corollary to Lemma 4.6.2 that
7L[x]/I[x] :::= (Z/I)[x] :::= Zp[x]. The image of f(x) = x
n
+ alX
n

1
+ ... + an in
Z[x]/I[x] is x
n
, since p lab . .. ,p Ian. So if u(x) is the image of u(x) and vex)
170 Ring Theory Ch.4
that of vex) in Z[x]/I[x], then x
n
= u(x)v(x). Since u(x)lx
n
, v(x)lx
n
in Z[x]/I[x],
we must have that u(x) = x
r
, vex) = x
n

r
for some 1 ::; r < n. But then
u(x) = x
r
+ pg(x) and v(x) = x
n

r
+ ph(x), where g(x) and hex) are polyno
mials with integer coefficients. Since u(x)v(x) = x
n
+ pxrh(x) + pxnrg(x) +
p2
g
(x)h(x), and since 1 ::; r < n, the constant term of u(x)v(x) is p2st, where
s is the constant term of g(x) and t the constant term of hex). Because f(x) =
u(x)v(x), their constant terms are equal, hence an = p2st. Since sand t are in
tegers, we get that p
2
1 an, a contradiction. In this way we see that f(x) is irre
ducible. D
We give some examples of the use to which the Eisenstein Criterion
can be put.
1. Let f(x) = x
n
 p, p any prime. Then one sees at a glance that f(x) is
irreducible in Q[x], for the Eisenstein Criterion applies.
2. Let f(x) = x
5
 4x + 22. Since 2122, 2
2
t 22 and 2 divides the other rel
evant coefficients of f(x), the Eisenstein Criterion tells us that f(x) is ir
reducible in Q[x].
3. Let f(x) = XlI  6x
4
+ 12x
3
+ 36x  6. We see that f(x) is irreducible
in Q[x] by using either 2 or 3 to check the conditions of the Eisenstein
Criterion.
4. Let f(x) = 5x
4
 7x + 7; f(x) is not monic, but we can modify f(x)
slightly to be in a position where we can apply the Eisenstein Criterion.
Let
g(x) = 5
3
f(x) = 5
4
x
4
 7 ·5
3
x + 7 . 53 = (5X)4  175(5x) + 875.
If we let y = 5x, then g(x) = hey) = y4  175y + 875. The polynomial
hey) is irreducible in Z[y] by using the prime 7 and applying the Eisen
stein Criterion. The irreducibility of h (y) implies that of g(x), and so
that off(x), in Q[x].
This suggests a slight generalization of the Eisenstein Criterion to
nonmonic polynomials. (See Problem 4.)
5. Let f(x) = x
4
+ x
3
+ x
2
+ X + 1; as it stands we cannot, of course,
apply the Eisenstein Criterion to f(x). We pass to a polynomial g(x)
closely related to f(x) whose irreducibility in Q[x] will ensure that of
f(x). Let g(x) = f(x + 1) = (x + 1)4 + (x + 1)3 + (x + 1)2 + (x + 1) +
1 = x
4
+ 5x
3
+ 10x
2
+ lOx + 5. The Eisenstein Criterion applies to
g(x), using the prime 5; thus g(x) is irreducible in Q[x]. This implies
that f(x) is irreducible in Q[x]. (See Problem 1.)
Sec. 6 Polynomials Over the Rationals 171
Gotthold Eisenstein (18231852) in his short life made fundamental contribu
tions in algebra and analysis.
PROBLEMS
1. In Example 5, show that because g(x) is irreducible in Q[x], then so is
f(x).
2. Prove thatf(x) = x
3
+ 3x + 2 is irreducible in Q[x].
3. Show that there is an infinite number of integers a such that f(x)
x
7
+ 15x
2
 30x + a is irreducible in Q[x]. What a's do you suggest?
4. Prove the following generalization of the Eisenstein Criterion. Let
f(x) = aox
n
+ alX
n

1
+ ... + an have integer coefficients and suppose
that there is a prime p such that
p (a
o
, pial' pi a
2
, ••• ,p IanI, P I an ,
but p2 ( an; then f(x) is irreducible in Q[x].
5. If p is a prime, show that f(x) = x
p

1
+ x
p

2
+ ... + x + 1 is irreducible
in Q[x].
6. Let Fbe the field and 'P an automorphism of F[x] such that 'P(a) = a for
every a E F. If f(x) E F[x], prove that f(x) is irreducible in F[x] if and
only if g(x) = 'P(f(x» is.
7. Let F be a field. Define the mapping
cp: F[x] ~ F[x] by 'P(f(x» = f(x + 1)
for every f(x) E F[x]. Prove that 'P is an automorphism of F[x] such that
'P(a) = a for every a E F.
8. Let F be a field and b =1= 0 an element of F. Define the mapping
'P: F[x] ~ F[x] by 'P(f(x» = f(bx) for every f(x) E F[x]. Prove that 'P is
an automorphism of F[x] such that 'P(a) = a for every a E F.
9. Let F be a field, b =1= 0, c elements of F. Define the mapping
'P: F[x] ~ F[x] by 'P(f(x» = f(bx + c) for every f(x) E F[x]. Prove that
'P is an automorphism of F[x] such that 'P(a) = a for every a E F.
10. Let 'P be an automorphism of F[x], where F is a field, such that 'P(a) = a
for every a E F. Prove that iff(x) E F[x], then deg 'P(f(x» = degf(x).
11. Let 'P be an automorphism of F[x], where F is a field, such that 'P(a) = a
for every a E F. Prove there exist b =1= 0, c in F such that 'P(f(x» =
f(bx + c) for every f(x) E F[x].
172 Ring Theory Ch.4
12. Find a nonidentity automorphism 'P of Q[x] such that 'P
2
is the identity
automorphism of Q[x].
13. Show that in Problem 12 you do not need the assumption 'P(a) = a for
every a E Q because any automorphism of Q[x] automatically satisfies
'P(a) = a for every a E Q.
14. Let C be the field of complex numbers. Given an integer n > 0, exhibit
an automorphism 'P of C[x] of order n.
7. FIELD OF QUOTIENTS OF AN INTEGRAL DOMAIN
Given the integral domain 7L, the ring of integers, then intimately related to 7L
is the field Q of rational numbers that consists of all fractions of integers; that
is, all quotients min, where m, n =1= 0 are in 7L. Note that there is no unique
way of representing ~ , say, in Q because ~ = ~ = (7)/(14) = ....
In other words, we are identifying ~ with ~ , (7)/( 14), and so on. This sug
gests that what is really going on in constructing the rationals from the inte
gers is some equivalence relation on some set based on the integers.
The relation of Q to 7L carries over to any integral domain D. Given an
integral domain D, we shall construct a field F ~ D whose elements will be
quotients alb with a, bED, b =1= O. We go through this construction formally.
Let D be an integral domain, and let S = {(a, b) Ia, bED, b =1= O}; Sis
thus the subset of D X Dthe Cartesian product of D with itselfin which
the second component is not allowed to be o. Think of (a, b) as alb for a mo
ment; if so, when would we want to declare that (a, b) = (e, d)? Clearly, we
would want this if alb = eld, which in D itself would become ad = be. With
this as our motivating guide we define a relation  on S by declaring:
(a, b)  (e, d) for (a, b), (e, d) in S, if and only if ad = be.
We first assert that this defines an equivalence relation on S. We go
through the three requirements for an equivalence relation term by term.
1. (a, b)  (a, b), for clearly ab = ba (since D is commutative). So  is
reflexive.
2. (a, b)  (e, d) implies that (e, d)  (a, b), for (a, b)  (e, d) means
ad = be; for (e, d)  (a, b) we need eb = da, but this is true, since
eb = be = ad = da. So  is symmetric.
3. (a, b)  (e, d), (e, d)  (e, f) implies that ad = be, ef = de, so adf =
bef = bde; but d =1= 0 and we are in an integral domain, hence af =
be follows. This says that (a, b)  (e, f). So the relation is transitive.
Sec. 7 Field of Quotients of an Integral Domain 173
We have shown that '"'' defines an equivalence relation on S. Let F be
the set of all the equivalence classes [a, b] of the elements (a, b) E S. F is our
required field.
To show that F is a field, we must endow it with an addition and multi
plication. First the addition; what should it be? Forgetting all the fancy talk
about equivalence relation and the like, we really want [a, b] to be alb. If so,
what should [a, b] + [e, d] be other than the formally calculated
~ + £. = ad + be?
b d bd .
This motivates us to define
[a, b] + [e, d] = [ad + be, bdl. (1)
Note that since b =1= 0, d =1= °and D is a domain, then bd =1= 0, hence
[ad + be, bd] is a legitimate element of F.
As usual we are plagued with having to show that the addition so de
fined in F is welldefined. In other words, we must show that if [a, b] =
[a', b'] and [e, d] = [e', d'], then [a, b] + [e, d] = [a', b'] + [e', d']. From (1)
we must thus show that [ad + be, bd] = [a'd' + b'e', b'd'], which is to say,
(ad + be)b'd' = bd(a'd' + b'e'). Since [a, b] = [a', b'] and [e, d] = [e', d'],
ab' = ba' and cd' = de'. Therefore, (ad + be)b'd' = ab'dd' + bb'ed' =
ba'dd' + bb' dc' = (a'd' + b'e')bd, as required. Thus "+" is welldefined in F.
The class [0, b], b =1= 0, acts as the °under" +," we denote it simply as 0,
and the class [a, b] is the negative of [a, b]. To see that this makes of F an
abelian group is easy, but laborious, for all that really needs verification is
the associative law.
Now to the multiplication in F. Again motivated by thinking of [a, b] as
alb, we define
[a, b][e, d] = [ae, bdl. (2)
Again since b =1= 0, d =1= 0, we have bd =1= 0, so the element [ae, bd] is
also a legitimate element of F.
As for the "+" we must show that the product so introduced is wellde
fined;thatis,if[a,b] = [a',b'], [e,d] = [e',d'],then
[ ~ , b][e, d] = [ae, bd] = [a'e', b'd
'
] = [a', b/][e
'
, d
'
].
We know that ab' = ba' and cd' = de', so aeb'd' = ab' cd' = ba'de' = bda' e',
which is exactly what we need for [ae, bd] = [a' e' , b' d']. Thus the product is
welldefined in F.
What acts as 1 in F? We claim that for any a =1= 0, b =1= °in D, [a, a] =
[b, b] (since ab = ba) and [e, d][a, a] = [ea, da] = [e, d], since (ea)d = (da)e..
174 Ring Theory Ch.4
So [a, a] acts as 1, and we write it simply as 1 = [a, a] (for all a =1= °in D).
Given [a, b] =1= 0, then a =1= 0, so [b, a] is in F; hence, because [a, b][b, a] =
[ab, ba] = [ab, ab] = 1, [a, b] has an inverse in F. All that remains to show
that the nonzero elements of F form an abelian group under this product is
the associative law and commutative law. We leave these to the reader.
To clinch that F is a field, we need only now show the distributive law.
But [ad + be, bd][e,f] = [(ad + be)e, bdf], so
([a, b] + [e, d])[e,f] = [ade + bee, bdf],
while
[a, b][e,f] + [e, d][e,f]
= rae, bf] + [ee, df] = [aedf + bfee, bdf2]
= [(ade + bee) f, bdf2] = [ade + bee, bdf][f:f]
= [ade + bee, bdf],
which we have seen is ([a, b] + [e, d])[e, fl. The distributive law is now es
tablished, so F is a field.
Let a =1= °be a fixed element in D and consider [da, a] for any d E D.
The map q;: d ~ [da, a] is a monomorphism of D into F. It is certainly 11,
for if q;(d) = [da, a] = 0, then da = 0, so d = 0, since D is an integral do
main. Also, q;(d
l
d
2
) = [d
l
d
2
a, a] while q;(d
l
)q;(d
2
) = [dla, a][d
2
a, a]
[d
l
d
2
a
2
, a
2
] = [d
l
d
2
a, a][a, a] = [d
l
d
2
a, a] = q;(d
l
d
2
). Furthermore,
[d
1
a, a] + [d
2
a, a] = [d
t
a
2
+ a
2
d
2
, a
2
]
= [dta + d
2
a, a][a, a]
= [edt + d
2
) a, a]
so q;(d
l
+ d
2
) = [(dl + dl)a, a] = [dla, a] + [d
2
a, a] = q;(d
l
) + q;(d
2
). Thus
q; maps D monomorphically into F. So, D is isomorphic to a subring of F, and
we can thus consider D as "embedded" in F. We consider every element
[a, b] of F as the fraction alb.
Theorem 4.7.1. Let D be an integral domain. Then there exists a field
F :J D which consists of all fractions alb, as defined above, of elements in D.
The field F is called the field of quotients of D. When D = 7L, then F is
isomorphic to the field Q of rational numbers. Also, if D is the domain of
even integers, then F is also the entire field Q.
What we did above in constructing the field of quotients of D was a
long, formal, wordy, and probably dull way of doing something that is by its
Sec. 7 Field of Quotients of an Integral Domain 175
nature something very simple. We really are doing nothing more than form
ing all formal fractions alb, a, b =1= 0 in D, where we add and multiply frac
tions as usual. However, it is sometimes necessary to see something done to
its last detail, painful though it may be. Most of us had never seen a really
formal and precise construction of the rationals from the integers. Now
that we have constructed F from D in this formal manner, forget the de
tails and think of F as the set of all fractions of elements of D.
PROBLEMS
1. Prove the associative law of addition in F.
2. Prove the commutative law of addition in F.
3. Prove that the product in F is commutative and associative.
4. If K is any field that contains D, show that K ~ F. (So F is the smallest
field that contains D.)
5
FIELDS
The notion of a ring was unfamiliar territory for most of us; that of a field
touches more closely on our experience. While the only ring, other than a
field, that we might have seen in our early training was the ring of integers,
we had a bit more experience working with rational numbers, real numbers,
and, some of us, complex numbers, in solving linear and quadratic equations.
The ability to divide by nonzero elements gave us a bit of leeway, which we
might not have had with the integers, in solving a variety of problems.
So at first glance, when we start working with fields we feel that we are
on home ground. As we penetrate deeper into the subject, we start running
across new ideas and new areas of results. Once again we'll be in unfamiliar
territory, but hopefully, after some exposure to the topic, the notions will be
come natural to us.
Fields play an important role in geometry, in the theory of equations,
and in certain very important parts of number theory. We shall touch upon
each of these aspects as we progress. Unfortunately, because of the technical
machinery we would need to develop, we do not go into Galois theory, a very
beautiful part of the subject. We hope that many of the readers will make con
tact with Galois theory, and beyond, in their subsequent mathematical training.
1. EXAMPLES OF FIELDS
Let's recall that a field F is a commutative ring with unit element 1 such that for
every nonzero a E F there is an element aI E F such that aa
1
= 1. In other
words, fields are "something like" the rationals Q. But are they really? The in
176
Sec. 1 Examples of Fields 177
(verify!),
tegers mod p, 7L
p
, where p is a prime, form a field; in 7L
p
we have the relation
pI = ,I + 1 + ... + I, = 0
(p times)
Nothing akin to this happens in Q. There are even sharper differences
among fieldshow polynomials factor over them, special properties of which
we'll see in some examples, and so on.
We begin with some familiar examples.
Examples
1. Q, the field of rational numbers.
2. IR, the field of real numbers.
3. C, the field of complex numbers.
4. Let F = {a + bi Ia, b E Q} c C. To see that F is a field is relatively easy.
We only verify that if a + bi =1= 0 is in F, then (a + bi)l is also in F. But
what is (a + bi)l? It is merely
a bi
(a
2
+ b
2
) (a
2
+ b
2
)
and since a
2
+ b
2
=1= 0 and is rational, therefore al(a
2
+ b
2
) and also
bl(a
2
+ b
2
) are rational, hence (a + bi)l is indeed in F.
5. Let F = {a + bV21 a, b E Q} C IR. Again the verification that F is a field
is not too hard. Here, too, we only show the existence of inverses in F for
the nonzero elements of F. Suppose that a + bVi =1= 0 is in F; then, since
V2 is irrational, a
2
 2b
2
=1= O. Because
(a + bV2)(a  bVi) = a
2
 2b
2
,
we see that (a + bV2)(ale  Vi ble) = 1, where e = a
2
 2b
2
• The re
quired inverse for a + bV2 is ale  Vi ble, which is certainly an ele
ment of F, since ale and ble are rational.
6. Let F be any field and let F[x] be the ring of polynomials in x over F.
Since F[x] is an integral domain, it has a field of quotients according to
Theorem 4.7.1, which consists of all quotients f(x)lg(x), where f(x) and
g(x) are in F[x] and g(x) =1= O. This field of quotients of F[x] is denoted by
F(x) and is called the field of rational functions in x over F.
7. 7L
p
, the integers modulo the prime p, is a (finite) field.
8. In Example 2 in Section 4 of Chapter 4 we saw how to construct a field
having nine elements.
These eight examples are specific ones. Using the theorems we
have proved earlier, we have some general constructions of fields.
178 Fields
Ch.5
9. If D is any integeral domain, then it has a field of quotients, by Theorem
4.7.1, which consists of all the fractions alb, where a and b are in D and
b =1= O.
10. If R is a commutative ring with unit element 1 and M is a maximal ideal
of R, then Theorem 4.4.2 tells us that RIM is a field.
This last example, for particular R's, will play an important role in what
is to follow in this chapter.
We could go on, especially with special cases of Examples 9 and 10, to
see more examples. But the 10 that we did see above show us a certain vari
ety of fields, and we see that it is not too hard to run across fields.
In Examples 7 and 8 the fields are finite. If F is a finite field having q el
ements, viewing F merely as an abelian group under its addition, "+," we
have, by Theorem 2.4.5, that qx = 0 for every x E F. This is a behavior quite
distinct from that which happens in the fields that we are used to, such as the
rationals and reals.
We single out this kind of behavior in the
Definition. A field F is said to have (or, to be of) characteristic p =1= 0
if for some positive integer p, px = 0 for all x E F, and no positive integer
smaller than p enjoys this property.
If a field F is not of characteristic p =1= 0 for any positive integer p, we
call it a field of characteristic O. So Q, IR, C are fields of characteristic 0, while
7L
3
is of characteristic 3.
In the definition given above the use of the letter p for the characteris
tic is highly suggestive, for we have always used p to denote a prime. In fact,
as we see in the next theorem, this usage of p is consistent.
Theorem 5.1.1. The characteristic of a field is either 0 or a prime
number.
Proof If the field F has characteristic 0, there is nothing more to say.
Suppose then that mx = 0 for all x E F, where m is a positive integer. Let p
be the smallest positive integer such that px = 0 for all x E F. We claim that
p is a prime. For if p = UV, where U > 1 and v > 1 are integers, then in F,
(u1)(v1) = (uv)l = 0, where 1 is the unit element of F. But F, being a field, is
an integral domain (Problem 1); therefore, u1 = 0 or vI = O. In either case
we get that 0 = (u1)(x) = ux [or, similarly, 0 = (v1)x = vx] for any x in F.
But this contradicts our choice of p as the smallest integer with this property.
Hence p is a prime. D
Sec. 1 Examples of Fields 179
Note that we did not use the full force of the assumption that F was a
field. We only needed that F was an integral domain (with 1). So if we define
the characteristic of an integral domain to be 0 or the smallest positive inte
ger p such that px = 0 for all x E F, we obtain the same result. Thus the
Corollary. If D is an integral domain, then its characteristic is either 0
or a prime number.
PROBLEMS
1. Show that a field is an integral domain.
2. Prove the Corollary even if D does not have a unit element.
3. Given a ring R, let S = R[x] be the ring of polynomials in x over R, and
let T = S[y] be the ring of polynomials in y over S. Show that:
(a) Any element f(x, y) in T has the form 'L'LaijXiyj, where the aij are
in R.
(b) In terms of the form of f(x, y) in T given in Part (a), give the condi
tion for the equality of two elements f(x, y) and g(x, y) in T.
(c) In terms of the form for f(x, y) in Part (a), give the formula for
f(x, y) + g(x, y), for f(x, y), g(x, y) in T.
(d) Give the form for the product of f(x, y) and g(x, y) if f(x, y) and
g(x, y) are in T. (T is called the ring of polynomials in two variables
over R, and is denoted by R[x, y].)
4. If D is an integral domain, show that D[x, y] is an integral domain.
5. If F is a field and D = F[x, y], the field of quotients of D is called the
field of rational functions in two variables over F, and is usually denoted
by F(x, y). Give the form of the typical element of F(x, y).
6. Prove that F(x, y) is isomorphic to F(y, x).
7. If F is a field of characteristic p =1= 0, show that (a + b)P = a
P
+ b
P
for all
a, b E F. (Hint: Use the binomial theorem and the fact that p is a prime.)
8. If F is a field of characteristic p =1= 0, show that (a + b)m = am + b
m
,
where m = pn, for all a, b in F and any positive integer n.
9. Let F be a field of characteristic p =1= 0 and let cp: F ~ F be defined by
cp(a) = a
P
for all a E F.
(a) Show that cp defines a monomorphism of F into itself.
(b) Give an example of a field F where cp is not onto. (Very hard.)
10. If F is a finite field of characteristic p, show that the mapping cp defined
above is onto, hence is an automorphism of F.
180 Fields Ch.5
2. A BRIEF EXCURSION INTO VECTOR SPACES
To get into the things we should like to do in field theory, we need some
technical equipment that we do not have as yet. This concerns the relation of
two fields K ~ F and what we would like to consider as some measure of the
size of K compared to that of F. This size is what we shall call the dimension
or degree of Kover F.
However, in these considerations, much less is needed of K than that it
be a field. We would be remiss if we proved these results only for the special
context of two fields K ~ F because the same ideas, proofs, and spirit hold in
a far wider situation. We need the notion of a vector space over a field F.
Aside from the fact that what we do in vector spaces will be important in our
situation of fields, the ideas developed appear in all parts of mathematics.
Students of algebra must see these things at some stage of their training. An
appropriate place is right here.
Definition. A vector space V over a field F is an abelian group under
"+" such that for every a E F and every v E V there is an element av E V,
and such that:
1. a(vl + V2) = aVl + aV2, for a E F, Vb V2 E V.
2. (a + (3)v = av + f3v, for a, f3 E F, v E V.
3. a(f3v) = (af3)v, for a, f3 E F, v E V.
4. Iv = v for all v E V, where 1 is the unit element of F.
In discussing vector spaceswhich we will do very brieflywe shall
use lowercase Latin letters for elements of V and lowercase Greek letters for
elements of F.
Our basic concern here will be with only one aspect of the theory of vec
tor spaces: the notion of the dimension of V over F. We shall develop this no
tion as expeditiously as possible, not necessarily in the best or most elegant
way. We would strongly advise the readers to see the other sides of what is
done in vector spaces in other books on algebra or linear algebra (for instance,
our books A Primer on Linear Algebra and Matrix Theory and Linear Algebra).
Before getting down to some results, we look at some examples. We
leave to the reader the details of verifying, in each case, that the example
really is an example of a vector space.
Examples
1. Let F be any field and let V = {( al, a2, ... , an) Iai E F for all i} be the set
of ntuples over F, with equality and addition defined componentwise. For
Sec. 2 A Brief Excursion into Vector Spaces 181
v = (ab a2' ... ' an) and f3 E F, define f3v by f3v = (f3
a
b f3
a
2'···' f3a
n
).
V is a vector space over F.
2. Let F be any field and let V = F[x] be the ring of polynomials in x over F.
Forgetting the product of any arbitrary elements of F[x] but using only that
of a polynomial by a constant, for example, we find that
f3(ao + alx + ... + anx
n
) = f3 ao + f3a
1
x + . · . + f3a
n
x
n
.
In this way V becomes a vector space over F.
3. Let Vbe as in Example 2 and let W = {f(x) E V Ideg(f(x» ::; n}. Then W
is a vector space over F, and W C V is a subspace of V in the following sense.
Definition. A subspace of a vector space V is a nonempty subset W of
V such that aw E Wand WI + W2 E W for all a in F and W, Wb W2 E W.
Note that the definition of subspace W of V implies that W is a vector
space whose operations are just those of V restricted to the elements of W.
4. Let V be the set of all realvalued differentiable functions on [0, 1], the
closed unit interval, with the usual addition and multiplication of a function
by a real number. Then V is a vector space over IR.
5. Let W be all the realvalued continuous functions on [0, 1], again with the
usual addition and multiplication of a function by a real number. W, too, is a
vector space over IR, and the V in Example 4 is a subspace of W.
6. Let F be any field, F[x] the ring of polynomials in x over F. Let [(x)
be in F[x] and J = (f(x» the ideal of F[x] generated by f(x). Let V =
F[x]/J, where we define a(g(x) + J) = ag(x) + J. V is then a vector space
over F.
7. Let IR be the real field and let V be the set of all solutions to the differen
tial equation d
2
y/dx
2
+ y = 0. V is a vector space over IR.
8. Let Vbe any vector space over a field F, VI, V2, ... ' V
n
elements of V. Let
(Vb V2'···, Vn) = {aIVI + a2v2 + ... + anV
n
IaI, a2,··., an E F}. Then
(Vb V2, , Vn) is a vector space over F and is a subspace of V. This subspace
(Vb V2' , Vn) is called the subspace of V generated or spanned by Vb ... , V
n
over F; its elements are called linear combinations of VI, ... , Vn. We shall
soon have a great deal to say about (VI, V2' ... , Vn).
9. Let V and W be vector spaces over the field F and let V EB W =
{(v, w) IV E V, w E W}, with equality and addition defined componentwise,
182 Fields Ch.5
and where a(v, w) = (av, aw). Then V Ef) W is easily seen to be a vector
space over F; it is called the direct sum of V and W.
10. Let K ~ F be two fields, where the addition "+" is that of K and where
av, for a E F and v E K is the product, as elements of the field K. Then Con
ditions 1 and 2 defining a vector space are merely special cases of the distrib
utive laws that hold in K, and Condition 3 is merely a consequence of the
associativity of the product in K. Finally, Condition 4 is just the restate
ment of the fact that 1 is the unit element of K. So K is a vector space
over F.
In one respect there is a sharp difference among these examples. We
specify this difference by examining some of these examples in turn.
1. In Example 1, if
VI = (1, 0, ... , 0), V2 = (0, 1, 0, ... ,0), ... 'V
n
= (0, 0, ... , 1),
then every element v in V has a unique representation in the form v =
aivi + · .. + anv
n
, where ab ... , an are in F.
2. In Example 3, if VI = 1, V2 = X, ••• ,Vi = xit, ... ,Vn+I = x
n
, then every
v E V has a unique representation as v = aivi + ... + anv
n
, with the (ti
in F.
3. In Example 7, every solution of d
2
y/dx
2
+ y = °is of the unique form
y = a cos x + {3 sin x, with a and {3 real.
4. In Example 8, every v E (VI' ... , vn) has a representationalbeit not nec
essarily uniqueas v = aIVt + ... + anV
n
from the very definition of
(Vt, ... , v
n
). Uniqueness of this representation depends heavily on the ele
ments VI, ... , Vn.
5. In the special case of Example 10, where K = C, the field of complex
numbers, and F = IR that of the real numbers, then every V E C is of the
unique form v = a + f3i, a, (3 E IR.
6. Consider K = F(x) ~ F, the field of rational functions in x over F. We
claimand leave to the readerthat we cannot find any finite set of ele
ments in K which spans Kover F. This phenomenon was also true in some
of the other examples we gave of vector spaces.
The whole focus of our attention here will be on this notion of a vector
space having some finite subset that spans it over the base field.
Before starting this discussion, we must first dispose of a list of formal
properties that hold in a vector space. You, dear reader, are by now so
Sec. 2 A Brief Excursion into Vector Spaces 183
sophisticated in dealing with these formal, abstract things that we leave the
proof of the next lemma to you.
Lemma 5.2.1. If V is a vector space over the field F, then, for every
a E F and every u E V:
(a) aO = 0, where 0 is the zeroelement of V.
(b) Ou = 0, where the first 0 is the zero in F.
(c) au = 0 implies that a = 0 or u = o.
(d) (a)u = (au).
In view of this lemma we shall not run into any confusion if we use the
symbol 0 both for the zero of F and that of V.
We forget vector spaces for a moment and look at solutions of certain
systems of linear equations in fields. Take, for example, the two linear homo
geneous equations with real coefficients, x I + X 2 + X 3 = 0 and 3x I  X 2 + X 3
= O. We easily see that for any Xl, X3 such that 4Xl + 2X3 = 0 and X2 =
 (x 1 + x 3)' we get a solution to this system. In fact, there exists an infinity of
solutions to this system other than the trivial one Xl = 0, X 2 = 0, X 3 = O. If we
look at this example and ask ourselves: "Why is there an infinity of solutions
to this system of linear equations?", we quickly come to the conclusion that,
because there are more variables than equations, we have room to maneuver
to produce solutions. This is exactly the situation that holds more generally,
as we see below.
Definition. Let F be a field; then the ntuple (f3b ••• , f3n), where the
f3i are in F, and not all of them are 0, is said to be a nontrivial solution in F to
the system of homogeneous linear equations
al1x
1
+ a12x2 + ... + al
n
x
n
= 0
a21
x
l + a22
x
2 + · . · + a
2n
x
n
= 0
(*) = 0
where the aij are all in F, if substituting Xl = f31, ••• ,xn = f3n satisfies all
the equations of (*).
For such a system (*) we have the following
184 Fields Ch.5
Theorem 5.2.2. If n > r, that is, if the number of variables (un
knowns) exceeds the number of equations in (*), then (*) has a nontrivial so
lution in F.
Proof The method is that, which some of us learned in high school, of
solving simultaneous equations by eliminating one of the unknowns and at
the same time cutting the number of equations down by one.
We proceed by induction on r, the number of equations. If r = 1, the
system (*) reduces to allxl + ... + alnX
n
= 0, and n > 1. If all the ali = 0,
then Xl = X2 = ... = X
n
= 1 is a nontrivial solution to (*). So, on renumber
ing, we may assume that all =1= 0; we then have the solution to (*), which is
nontrivial: X2 = ... = X
n
= 1 and Xl = (I/all)(a12 + .. · + al
n
).
Suppose that the result is correct for r = k for some k and suppose that
(*) is a system of k + 1 linear homogeneous equations in n > k + 1 vari
ables. As above, we may assume that some aij =1= 0, and, without loss of gen
erality, that all =1= 0.
We construct a related system, (**), of k linear homogeneous equations
in n  1 variables; since n > k + 1, we have that n  1 > k, so we can apply
induction to this new system (**). How do we get this new system? We want
to eliminate Xl among the equations. We do so by subtracting ail/all times
the first equation from the ith one for each of i = 2, 3, ... , k + 1. In doing
so, we end up with the new system of k linear homogeneous equations in
n  1 variables:
(**)
where f3ij = aij  ail/all for i = 2,3, ... , k + 1 andj = 2,3, ... , n.
Since (**) is a system of k linear homogeneous equations in n  1 vari
ables and n  1 > k, by our induction (**) has a nontrivial solution (1'2, ... , I'n)
in F. Let 1'1 = (a121'2 + ... + alnl'n)/all; we leave it to the reader to verify
that the (1'1, 1'2' ... , I'n) so obtained is a required nontrivial solution to (*).
This completes the induction and so proves the theorem. D
With this result established, we are free to use it in our study of vector
spaces. We now return to these spaces. We repeat, for emphasis, something
we defined earlier in Example 8.
Sec. 2 A Brief Excursion into Vector Spaces 185
Definition. Let V be a vector space over F and let Vb VZ, ... , V
n
be in
V. The element V E V is said to be a linear combination of VI, Vz, ... , V
n
if
V = aivi + ... + anV
n
for some aI, ... an in F.
As we indicated in Example 8, the set (VI, Vz, ... , V
n
) of all linear com
binations of Vb vz, ... , V
n
is a vector space over F, and being contained in V,
is a subspace of V. Why is it a vector space? If aivi + ... + anV
n
and
f3IVI + ... + f3n v n are two linear combinations of Vb ... , Vn' then
(aiV
I
+ ... + anv
n
) + (f3IVl + . · . + f3n v n)
= (a
l
+ f31)V
I
+ ... + (an + f3n)v
n
by the axioms defining a vector space, and so is in (vI, ... , Vn ). If y E F and
aIvl + ... + anV
n
E (Vb ... ' Vn), then
and is also in (Vb . .. , v
n
). Thus (Vb . .. , v
n
) is a vector space. As we called it
earlier, it is the subspace of V spanned over F by Vb ... , Vn.
This leads us to the ultraimportant definition.
Definition. The vector space V over F is finite dimensional over F if
V = (Vb . .. , v
n
) for some Vb ... , V
n
in V, that is, if V is spanned over F by a
finite set of elements.
Otherwise, we say that V is infinite dimensional over F if it is not finite
dimensional over F. Note that although we have defined what is meant by a
finitedimensional vector space, we still have not defined what is meant by its
dimension. That will come in due course.
Suppose that V is a vector space over F and VI, ... , V
n
in V are such
that every element V in (Vb . .. , v
n
) has a unique representation in the form
V = aIvl + ... + anv
n
, where ab ... , an E F. Since
oE (VI, . .. ,v
n
) and 0 = OV
I
+ ... + OVn'
by the uniqueness we have assumed we obtain that if aivi + ... + anV
n
= 0,
then al = az = .. · = an = O. This prompts a second ultraimportant defini
tion.
Definition. Let Vbe a vector space over F; then the elements Vb . •• , V
n
in V are said to be linearly independent over F if aivi + ... + anV
n
= 0,
where ab ... , an are in F, implies that al = az = ... = an = O.
186 Fields Ch.5
If the elements Vb ... , V
n
in V are not linearly independent over F,
then we say that they are linearly dependent over F. For example, if ~ is the
field of real numbers and V is the set of 3tuples over ~ as defined in Exam
ple 1, then (0, 0, 1), (0, 1, 0), and (1, 0, 0) are linearly independent over ~
(Prove!) while (1, 2,7), (0,1,0), and (1, 3,7) are linearly dependent over
IR, since 1(1, 2, 7) + (1)(0, 1, 0) + (1)(1,  3, 7) = (0, 0, 0) is a nontrivial
linear combination of these elements over IR, which is the Ovector.
Note that linear independence depends on the field F. If C ~ ~ are the
complex and real fields, respectively, then C is a vector space over ~ , but it is
also a vector space over C itself. The elements 1, i in C are linearly indepen
dent over ~ but are not so over C, since i 1 + (l)i = °is a nontrivial linear
combination of 1, i over C.
We prove
Lemma 5.2.3. If V is a vector space over F and VI, ... , Un in V are lin
early independent over F, then every element V E (Vb' .. , vn) has a unique
representation as
with ab ... , an in F.
Proof Suppose that V E (VI,"" vn) has the two representations as
U = alv + ... + anv
n
= f3IUI + ... + f3nvn with the a's and f3's in F. This
gives us that (al  f31)VI + ... + (an  f3n)v
n
= 0; since Vb'.', V
n
are lin
early independent over F, we conclude that al  f31 = 0, ... , an  f3n = 0,
yielding for us the uniqueness of the representation. D
How finite is a finitedimensional vector space? To measure this, call a
subset VI, ... , V
n
of Va minimal generating set for V over F if V = (VI, ... , vn)
and no set of fewer than n elements spans V over F.
We now come to the third vitally important definition.
Definition. If V is a finitedimensional vector space over F, then the
dimension of V over F, written dimp(V), is n, the number of elements in a
minimal generating set for V over F.
In the examples given, d i m ~ ( C ) = 2, since 1, i is a minimal generating
set for Cover IR. However, dimc(C) = 1. In Example 1, dimp(V) = n and in
Example 3, dimp(V) = n + 1. In Example 7 the dimension of V over F is 2.
Finally, if (VI, ... , Vn) C V, then dimp(vI, ... , Vn) is at most n.
We now prove
Sec. 2 A Brief Excursion into Vector Spaces 187
Lemma 5.2.4. If V is finite dimensional over F of dimension n and if
the elements Vb ... , V
n
of V generate V over F, then VI, ... , V
n
are linearly
independent over F.
Proof Suppose that VI, ••. , V
n
are linearly dependent over F; thus there
is a linear combination aivi + ... + anV
n
= 0, where not all the ai are O. We
may suppose, without loss of generality, that al =1= 0; then VI =
(1IaI)(a2v2 + ... + anv
n
). Given V E V, because Vb"', v
n
is a generating
set for V over F,
V = f31 Vl + . · · + f3nvn
= (  ~ ) (azu
z
+ · · · + anu
n
) + f3zuz + · · · + f3n
u
n;
thus V2' ..• , V
n
span V over F, contradicting that the subset Vb V2, .•• , V
n
is a
minimal generating set of V over F. D
We now come to yet another important definition.
Definition. Let V be a finitedimensional vector space over F; then
Vb ..• , V
n
is a basis of V over F if the elements Vb ... , V
n
span V over F and
are linearly independent over F.
By Lemma 5.2.4 any minimal generating set of V over F is a basis of V
over F. Thus, finitedimensional vector spaces have bases.
Theorem 5.2.5. Suppose that V is finite dimensional over F. Then any two
bases of V over F have the same number of elements, and this number is ex
actly dimp (V).
Proof Let Vb' •• , V
n
and Wb ••• , W
m
be two bases of V over F. We
want to show that m = n. Suppose that m > n. Because VI, ••. , V
n
is a basis
of V over F, we know that every element in V is a linear combination of the
Vi over F. In particular, WI, ... , W
m
are each a linear combination of Vb ..• , V
n
over F. Thus we have
188 Fields Ch.5
where the aij are in F.
Consider
f3
1
Wl +
The system of linear homogeneous equations
al
i
f31 + a
2i
f32 + ... + a
mi
f3m = 0, i = 1,2, ... , n,
has a nontrivial solution in F by Theorem 5.2.2, since the number of vari
ables, m, exceeds the number of equations, n. If f3l' ... , 13m is such a solution
in F, then, by the above, f3lWl + . · . + f3mwm = 0, yet not all the f3i are 0. This
contradicts the linear independence of Wb ••• , W
m
over F. Therefore, m ::; n.
Similarly, n ::; m; hence m = n. The theorem is now proved, since a minimal
generating set of V over F is a basis of V over F and the number of elements
in this minimal generating set is dimp(V), by definition. Therefore, by the
above, n = dimp(V), completing the proof. D
A further result, which we shall use in field theory, of a similar nature
to the things we have been doing is
Theorem 5.2.6. Let V be a vector space over F such that dimp (V) =
n. If m > n, then any m elements of V are linearly dependent over F.
Proof Let Wb •.• , W
m
E V and let Vb ... , V
n
be a basis of V over F;
here n = dimp(V) by Theorem 5.2.5. Therefore,
The proof given in Theorem 5.2.5, that if m > n we can find f3l' ... , 13m in F,
and not all 0, such that f3lWl + ... + f3mwm = 0, goes over word for word.
But this establishes that Wl, ... , W
m
are linearly dependent over F. D
We close this section with a final theorem of the same flavor as the pre
ceding ones.
Theorem 5.2.7. Let V be a vector space over F with dimp(V) = n.
Then any n linearly independent elements of V form a basis of V over F.
Sec. 2 A Brief Excursion into Vector Spaces 189
Proof. We want to show that if Vb ... , V
n
E V are linearly independent
over F, then they span V over F. Let V E V; then v, VI, . .. , V
n
are n + 1 ele
ments, hence, by Theorem 5.2.6, they are linearly dependent over F. Thus
there exist elements a, ab ... , an in F, not all 0, such that av + aivi + · .. +
anV
n
= 0. The element a cannot be 0, otherwise aivi + ... + anV
n
= 0, and
not all the ai are 0. This would contradict the linear independence of the ele
ments Vb , V
n
over F. Thus a =1= 0, and so V = (I/a)(alvl + ... + anv
n
) =
f3IVI + + f3nvn, where f3i = ai/ale Therefore, VI, . .. , V
n
span V over F,
and thus must form a basis of V over F. D
PROBLEMS
Easier Problems
1 Determine if the following elements in V, the vector space of 3tuples
over IR, are linearly independent over IR.
(a) (1,2,3), (4,5,6), (7,8,9).
(b) (1, 0, 1), (0, 1, 2), (0, 0, 1).
(c) (1, 2, 3), (0, 4, 5), ( ~ , 3, 2
4
1).
2. Find a nontrivial solution in 7L
s
of the system of linear homogeneous
equations:
Xl + 2x2 + 3x3 = °
3xI + 4x2 + 2x3 = °
3. If V is a vector space of dimension n over 7L
p
, p a prime, show that V has
pn elements.
4. Prove all of Lemma 5.2.1.
5. Let F be a field and V = F[x], the polynomial ring in X over F. Considering
V as a vector space over F, prove that V is not finite dimensional over F.
6. If V is a finitedimensional vector space over F and if W is a subspace of
V, prove that:
(a) W is finite dimensional over F and dimp(W) :s dimp(V).
(b) If dimp(W) = dimF(V), then V = W.
* 7. Define what you feel should be a vector space homomorphism fjJ of V
into W, where Vand Ware vector spaces over F. What can you say about
the kernel, K, of fjJ where K = {v E V I fjJ( v) = o}? What should a vector
space isomorphism be?
190 Fields Ch.5
8. If V is a vector space over F and W is a subspace of V, define the requi
site operations in V/W so that V/W becomes a vector space over F.
9. If V is a finitedimensional vector space over F and VI, ••• , V
rn
in V are
linearly independent over F, show we can find WI, ••• , W
r
in V, where
m + r = dimF(V), such that VI,' •• , V
rn
, Wb •.. , W
r
form a basis of V
over F.
10. If t/J : V ~ V' is a homomorphism of V onto V' with kernel K, show that
V' = V/K (as vector spaces over F). (See Problem 7).
11. Show that if dimF(V) = nand W is a subspace of V with dimF(W) = m,
then dimp(V/W) = n  m.
12. If V is a vector space over F of dimension n, prove that V is isomorphic
to the vector space of ntuples over F (Example 1). (See Problem 7).
MiddleLevel Problems
13. Let K :J F be two fields; suppose that K, as a vector space over F, has fi
nite dimension n. Show that if a E K, then there exist aa, ab ... , an in F,
not all 0, such that
14. Let F be a field, F[x] the polynomial ring in x over F, and f(x) =1= 0 in
F[x]. Consider V = F[x]/J as a vector space over F, where J is the ideal
of F[x] generated by f(x). Prove that
dim
p
(V) = degf(x).
15. If V and Ware two finitedimensional vector spaces over F, prove that
V EB W is finite dimensional over F and that dimp(V EB W) =
dimF(V) + dimp(W).
16. Let V be a vector space over F and suppose that U and Ware subspaces
of V. Define U + W = {u + W Iu E U, W E W}. Prove that:
(a) U + W is a subspace of V.
(b) U + W is finite dimensional over F if both U and Ware.
(c) U n W is a subspace of V.
(d) U + W is a homomorphic image of U EB W.
(e) If U and Ware finite dimensional over F, then
dimp(U + W) = dimp(U) + dimp(W)  dimp(U n W).
Sec. 3
Harder Problems
Field Extensions 191
17. Let K ~ F be two fields such that dimp(K) = m. Suppose that V is a vec
tor space over K. Prove that:
(a) V is a vector space over F.
(b) If V is finite dimensional over K, then it is finite dimensional over F.
(c) If dimK (V) = n, then dimp(V) = mn [i.e., dimp(V) =
dimK(V) dimp(K)].
18. Let K ~ F be fields and suppose that V is a vector space over K such that
dimp (V) is finite. If dimp (K) is finite, show that dimK (V) if finite and de
termine its value in terms of dimp(V) and dimp(K).
19. Let D be an integral domain with 1, which happens to be a finitedimen
sional vector space over a field F. Prove that D is a field. (Note: Since Fl,
which we can identify with F, is in D, the ring structure of D and the vec
tor space structure of Dover F are in harmony with each other.)
20. Let V be a vector space over an infinite field F. Show that V cannot be the
settheoretic union of a finite number of proper subspaces of V. (Very hard)
3. FIELD EXTENSIONS
Our attention now turns to a relationship between two fields K and F, where
K ~ F. We call K an extension (or extension field) of F, and call F a subfield
of K. The operations in F are just those of K restricted to the elements of F.
In all that follows in this section it will be understood that F C K.
We say that K is a finite extension of F if, viewed as a vector space over
F, dimp(K) is finite. We shall write dimp(K) as [K: F] and call it the degree
ofK over F.
We begin our discussion with what is usually the first result one proves
in talking about finite extensions.
Theorem 5.3.1. Let L ~ K ~ F be three fields such that both [L : K]
and [K: F] are finite. Then L is a finite extension of F and [L: F] =
[L : K][K: F].
Proof. We shall prove that L is a finite extension of F by explicitly ex
hibiting a finite basis of Lover F. In doing so, we shall obtain the stronger re
sult asserted in the theorem, namely that [L : F] = [L : K][K: F].
Suppose that [L : K] = m and [K: F] = n; then L has a basis VI, V2'" • 'V
m
over K, and K has a basis Wb W2, •.• , W
n
over F. We shall prove that the mn
192 Fields Ch.5
elements ViWj, where i = 1, 2, ... , m and j = 1, 2, ... , n, constitute a basis of
Lover F.
We begin by showing that, at least, these elements span Lover F; this
will, of course, show that L is a finite extension of F. Let a E L; since the ele
ments Vb ••. , V
m
form a basis of Lover K, we have a = klVI + . · . + kmv
m
,
where k
l
, k
2
, ••• , k
m
are in K. Since WI, .•• , w
n
is a basis of Kover F, we
can express each k
i
as
where the hj are in F. Substituting these expressions for the k
i
in the forego
ing expression of a, we obtain
a = (!IIWI + !12
W
2 + . · . + !lnWn) VI
+ .. · + (!mIWI + !m2
W
2 + ... + !mnWn) V
m
·
Therefore, on unscrambling this sum explicitly, we obtain
Thus the mn elements ViWj in L span Lover F; therefore, [L : F] is finite and,
in fact, [L : F] ::::; mn.
To show that [L : F] = mn, we need only show that the mn elements
ViWj above are linearly independent over F, for thentogether with the fact
that they span Lover Fwe would have that they form a basis of Lover F.
By Theorem 5.2.5 we would have the desired result [L: F] = mn =
[L : K][K: F].
Suppose then that for some b
ij
in F we have the relation
Reassembling this sum, we obtain CIVI + C2V2 + ... + CmV
m
= 0, where Cl =
bI1WI + ... + bInw
n
, •.. ,C
m
= bmlWI + ... + bmnw
n
• Since the Ci are ele
ments of K and the elements Vb •.• , V
n
in L are linearly independent over K,
we obtain CI = C2 = ... = C
m
= O.
Recalling that Ci = bnWI + ... + binw
n
, where the b
ij
are in F and
where WI, •.• , W n in K are linearly independent over F, we deduce from the
fact that CI = C2 = ... = C
m
= 0 that every b
ij
= o. Thus only the trivial linear
combination, with each coefficient 0, of the elements ViWj over F can be o.
Hence the ViWj are linearly independent over F. We saw above that this was
enough to prove the theorem. D
Sec. 3 Field Extensions 193
The reader should compare Theorem 5.3.1 with the slightly more gen
eral result in Problem 17 of Section 2. The reader should now be able to
solve Problem 17.
As a consequence of the theorem we have the
Corollary. If L ~ K ~ F are three fields such that [L : F] is finite,
then [K : F] is finite and divides [L : F].
Proof Since L ~ K, K cannot have more linearly independent ele
ments over F than does L. Because, by Theorem 5.2.6, [L : F] is the size of
the largest set of linearly independent elements in Lover F, we therefore get
that [K: F] ::::; [L : F], so must be finite. Since L is finite dimensional over F
and since K contains F, L must be finite dimensional over K. Thus all the
conditions of Theorem 5.3.1 are fulfilled, whence [L: F] = [L: K][K: F].
Consequently, [K: F] divides [L : F], as is asserted in the Corollary. 0
If K is a finite extension of F, we can say quite a bit about the behavior
of the elements of K visavis F.
Theorem 5.3.2. Suppose that K is a finite extension of F of degree n.
Then, given any element u in K there exist elements ao, at, ... , an in F, not
all zero, such that
Proof Since [K: F] = dimF(K) = n and the elements 1, u, u
2
, ... , un
are n + 1 in number, by Theorem 5.2.6 they must be linearly dependent over
F. Thus we can find ao, at, ... , an in F, not all 0, such that ao + atU +
a2u2 + ... + anu
n
= 0, proving the theorem. D
The conclusion of the theorem suggests that we single out elements in
an extension field that satisfy a nontrivial polynomial.
Definition. If K ~ F are fields, then a E K is said to be algebraic over
F if there exists a polynomial p(x) =1= 0 in F[x] such that pea) = o ~
By pea) we shall mean the element aoa
n
+ atant + ... + an in K,
where p(x) = aox
n
+ at
Xn

t
+ ... + an.
If K is an extension of F such that every element of K is algebraic over
F, we call K an algebraic extension of F. In these terms Theorem 5.3.2 can be
restated as: If K is a finite extension of F, then K is an algebraic extension of F.
The converse of this is not true; an algebraic extension of F need not be
of finite degree over F. Can you come up with an example of this situation?
194 Fields Ch.5
An element of K that is not algebraic over F is said to be transcendental
over F.
Let's see some examples of algebraic elements in a concrete context.
Consider C :J Q, the complex field as an extension of the rational one. The com
plex number a = 1 + i is algebraic over Q, since it satisfies a
2
 2a + 2 = 0
over Q. Similarly, the real number b= J1 +\0/1 + v'2 is algebraic over Q,
since b
2
= 1 + ~ 1 + 0, so (b
2
 1)3 = 1 + \/2, and therefore
((b
2
 1)3  1)2 = 2. Expanding this out, we get a nontrivial polynomial ex
pression in b with rational coefficients, which is O. Thus b is algebraic over Q.
It is possible to construct real numbers that are transcendental over Q
fairly easily (see Section 6 of Chapter 6). However, it takes some real effort
to establish the transcendence of certain familiar numbers. The two famil
iar numbers e and 1T can be shown to be transcendental over Q. That e is
such was proved by Hermite in 1873; the proof that 1T is transcendental
over Q is much harder and was first carried out by Lindemann in 1882. We
shall not go into the proof here that any particular number is transcenden
tal over Q. However, in Section 7 of Chapter 6 we shall at least show that
1T is irrational. This makes it a possible candidate for a transcendental num
ber of Q, for clearly any rational number b is algebraic over Q because it
satisfies the polynomial p (x) = x  b, which has rational coefficients.
Definition. A complex number is said to be an algebraic number if it
is algebraic over Q.
As we shall soon see, the algebraic numbers form a field, which is a
subfield of C.
We return to the general development of the theory of fields. We have
seen in Theorem 5.3.2 that if K is a finite extension of F, then every element
of K is algebraic over F. We turn this matter around by asking: If K is an ex
tension of F and a E K is algebraic over F, can we somehow produce a finite
extension of F using a? The answer is yes. This will come as a consequence of
the next theoremwhich we prove in a context a little more general than
what we really need.
Theorem 5.3.3. :Let D be an integral domain with 1 which is a finite
dimensional vector space over a field F. Then D is a field.
Proof To prove the theorem, we must produce for a =1= 0 in D an in
verse, aI, in D such that aa
I
= 1.
Sec. 3 Field Extensions 195
As in the proof of Theorem 5.3.2, if dimp(D) = n, then 1, a, a
2
, , an
in D are linearly dependent over F. Thus for some appropriate ao, aI, , an
in F, not all of which are 0,
Let p(x) = f3ox' + f3IX,1 + ... + f3, =1= °be a polynomial in F[x] of lowest
degree such that pea) = 0. We assert that f3, =1= 0. For if f3, = 0, then
°= f3
0
a
r
+ f3
1
a
r

1
+ . · · + f3rl
a
= (f3
o
a
r

1
+ f31a
r

2
+ · .. + f3rl) a.
Since D is an integral domain and a =1= 0, we conclude that f3oa,1 +
f3la,2 + + f3,1 = 0, hence q(a) = 0, where q(x) = f3ox,1 +
(31X,2 + + f3,l in F[x] is of lower degree than p(x), a contradiction.
Thus f3, =1= 0, hence f3; 1 is in F and
a(f3
o
a
r

1
+ ... + f3rl) = 1
f3r '
giving us that (f3
o
a,l + ... + f3,I)If3" which is in D, is the aI in D that
we required. This proves the theorem. D
Having Theorem 5.3.3 in hand, we want to make use of it. So, how do
we produce subrings of a field K that contain F and are finite dimensional
over F? Such subrings, as subrings of a field, are automatically integral do
mains, and would satisfy the hypothesis of Theorem 5.3.3. The means to this
end will be the elements in K that are algebraic over F.
But first a definition.
Definition. The element a in the extension K of F is said to be alge
braic of degree n if there is a polynomial p(x) in F[x] of degree n such that
pea) = 0, and no nonzero polynomial of lower degree in F[x] has this prop
erty.
We may assume that the polynomial p (x) in this definition is monic, for
we could divide this polynomial by its leading coefficient to obtain a monic
polynomial q(x) in F[x], of the same degree as p(x), and such that q(a) = 0.
We henceforth assume that this polynomial p (x) is monic; we call it the min
imal polynomial for a over F.
Lemma 5.3.4. Let a E K be algebraic over F with minimal polynomial
p(x) in F[x]. Then p(x) is irreducible in F[x].
196 Fields Ch.5
Proof Suppose that p(x) is not irreducible in F[x]; then p(x) =
f(x)g(x) where f(x) and g(x) are in F[x] and each has positive degree. Since
o= pea) = f(a)g(a), and since f(a) and g(a) are in the field K, we conclude
that f(a) = 0 or g(a) = 0, both of which are impossible, since both f(x) and
g(x) are of lower degree than f(x). Therefore, p(x) is irreducible in F[x]. D
Let a E K be algebraic of degree n over F and let p(x) E F[x] be its
minimal polynomial over F. Given f(x) E F[x], then f(x) = q(x)p(x) + rex),
where q(x) and rex) are in F[x] and rex) = 0 or deg rex) < degp(x) follows
from the division algorithm. Therefore, f(a) = q(a)p(a) + rea) = rea), since
p (a) = O. In short, any polynomial expression in a over F can be expressed as
a polynomial expression in a of degree at most n  1.
Let F[a] = {f(a) If(x) E F[x]}. We claim that F[a] is a subfield of K that
contains both F and a, and that [F[a] : F] = n. By the remark made above,
F[a] is spanned over F by 1, a, a
2
, • •• , anI, SO is finite dimensional over F.
Moreover, as is easily verified, F[a] is a subring of K; as a subring of K, F[a] is
an integral domain. Thus, by Theorem 5.3.3, F[a] is a field. Since it is spanned
over F by 1, a, a
2
, ••• , anI, we have that [F[a]: F] ~ n. To show that
[F[a] : F] = n we must merely show that 1, a, a
2
, ••• , a
n

l
are linearly inde
pendent over F. But if aa + ala + + an_Ia
n

1
= 0, with the ai in F, then
q(a) = 0, where q(x) = aa + alx + + an_Ix
n

1
is in F[x]. Since q(x) is of
lower degree than p(x), which is the minimal polynomial for a in F[x], we are
forced to conclude that q(x) = O. This implies that aa = al = ... = anl = O.
Therefore, 1, a, a
2
, ••• , a
n

l
are linearly independent over F and form a basis
of F[a] over F. Thus [F[a] : F] = n. Since F[a] is a field, not merely just a set
of polynomial expressions in a, we shall denote F[a] by F(a). Note also that if
M is any field that contains both F and a, then M contains all polynomial ex
pressions in a over F, hence M :> F(a). So F(a) is the smallest subfield of K
containing both F and a.
Definition. F(a) is called the field or extension obtained by adjoining
a to F.
We now summarize.
Theorem 5.3.5. Let K :> F and suppose that a in K is algebraic over F
of degree n. Then F(a), the field obtained by adjoining a to F, is a finite ex
tension of F, and
[F(a) : F] = n.
Before leaving Theorem 5.3.5, let's look at it in a slightly different way.
Let F[x] be the polynomial ring in x over F, and let M = (p(x» be the ideal
Sec. 3 Field Extensions 197
of F[x] generated by p(x), the minimal polynomial for a in Kover F.
By Lemma 5.3.4, p(x) is irreducible in F[x]; hence, by Theorem 4.5.11,
M is a maximal ideal of F[x]. Therefore, F[x]/(p(x)) is a field by Theorem
4.4.2.
Define the mapping f/!: F[x] ~ K by f/!(f(x)) = f(a). The mapping f/! is
a homomorphism of F[x] into K, and the image of F[x] in K is merely F(a)
by the definition of F(a). What is the kernel of f/!? It is by definition J =
{f(x) E F[x] If/!(f(x)) = O}, and since we know f/!(f(x)) = f(a), J = {f(x) E
F[x] If(a) = Ol. Since p(x) is in J and p(x) is the minimal polynomial for a
over F, p(x) is of the lowest possible degree among the elements of J. Thus
J = (p(x)) by the proof of Theorem 4.5.6, and so J = M. By the First Homo
morphism Theorem for rings, F[x]/M == image of F[x] under f/! = F(a), and
since F[x]/M is a field, we have that F(a) is a field. We leave the proof, from
this point of view, of [F(a) : F] = degp(x) to the reader.
PROBLEMS
1. Show that the following numbers in C are algebraic numbers.
(a) V2 + V3.
(b) v7 + VUe
(c) 2 + iV3.
(d) cos(21T/ k) + i sin(21T/k), k a positive integer.
2. Determine the degrees over Q of the numbers given in Parts (a) and (c)
of Problem 1.
3. What is the degree of COS(21T/3) + i sin(21T/3) over Q?
4. What is the degree of COS(21T/8) + i sin(21T/8) over Q?
5. If p is a prime number, prove that the degree of COS(21T/p) + i sin(21T/p)
over Q is p  1 and that
f(x) = 1 + x + x
2
+ ... + x
p

1
is its minimal polynomial over Q.
6. (For those who have had calculus) Show that
1 1 1
e=I+++···++···
I! 2! n!
is irrational.
7. If a in K is such that a
2
is algebraic over the subfield F of K, show that a
is algebraic over F.
198 Fields Ch.5
8. If F c K and f(a) is algebraic over F, where f(x) is of positive degree in
F[x] and a E K, prove that a is algebraic over F.
9. In the discussion following Theorem 5.3.5, show that F[x]/M is of degree
n = degp(x) over F, and so [F(a) : F] = n = degp(x).
10. Prove that cos 1
0
is algebraic over Q. (1
0
= one degree.)
11. If a E K is transcendental over F, let F(a) = {f(a)/g(a) If(x), g(x) =1= 0 E
F[x]}. Show that F(a) is a field and is the smallest subfield of K contain
ing both F and a.
12. If a is as in Problem 11, show that F(a) = F(x), where F(x) is the field of
rational functions in x over F.
13. Let K be a finite field and F a subfield of K. If [K : F] = nand F has q
elements, show that K has qn elements.
14. Using the result of Problem 13, show that a finite field has pn elements
for some prime p and some positive integer n.
15. Construct two fields K and F such that K is an algebraic extension of F
but is not a finite extension of F.
4. FINITE EXTENSIONS
We continue in the vein of the preceding section. Again K :J F will always
denote two fields.
Let E(K) be the set of all elements in K that are algebraic over F. Cer
tainly, F C E(K). Our objective is to prove that E(K) is a field. Once this is
done, we'll see a little of how E(K) sits in K.
Without further ado we proceed to
Theorem 5.4.1. E(K) is a subfield of K.
Proof What we must show is that if a, b E K are algebraic over F, then
a ± b, ab, and alb (if b =1= 0) are all algebraic over F. This will assure us that
E(K) is a subfield of K. We'll do all of a ± b, ab, and alb in one shot.
Let K
o
= F(a) be the subfield of K obtained by adjoining a to F. Since
a is algebraic over F, say of degree m, then, by Theorem 5.3.5, [K
o
: F] = m.
Since b is algebraic over F and since K
o
contains F, we certainly have that b
is algebraic over Ko. If b is algebraic over F of degree n, then it is algebraic
over K
o
of degree at most n. Thus K
1
= Ko(b), the subfield of K obtained by
adjoining b to K
o
, is a finite extension of K
o
and [K
1
: K
o
] ~ n.
Thus, by Theorem 5.3.1, [K
1
: F] = [K
1
: Ko][K
o
: F] :s mn; that is, K
1
is
a finite extension of F. As such, by Theorem 5.3.2, K 1 is an algebraic exten
Sec. 4 Finite Extensions 199
sion of F, so all its elements are algebraic over F. Since a E K 0 c K 1 and b E
K1, then all of the elements a ± b, ab, alb are in K1, hence are algebraic over
F. This is exactly what we wanted. The theorem is proved. D
If we look at the proof a little more carefully, we see that we have actu
ally proved a little more, namely the
Corollary. If a and b in K are algebraic over F of degrees m and n,
respectively, then a ± b, ab, and alb (if b =1= 0) are algebraic over F of degree
at most mn.
A special case, but one worth noting and recording, is the case K = C
and F = Q. In that case we called the algebraic elements in Cover Q the al
gebraic numbers. So Theorem 5.4.1 in this case becomes
Theorem 5.4.2. The algebraic numbers form a subfield of C.
For all we know at the moment, the set of algebraic numbers may very
well be all of C. This is not the case, for transcendental numbers do exist; we
show this to be true in Section 6 of Chapter 6.
We return to a general field K. Its subfield E(K) has a very particular
quality, which we prove next. This property is that any element in K which is
algebraic over E(K) must already be in E(K).
In order not to digress in the course of the proof we are about to give,
we introduce the following notation. If ab a2,"" an are in K, then
F(ab" ., an) will be the field obtained as follows: K
1
= F(al), K
2
=
K
1
(a2) = F(ab a2), K
3
= K
2
(a3) = F(al' a2, a3), ... , K
n
= Kn1(a
n
) =
F(ab a2, · · . , an)'
We now prove
Theorem 5.4.3. If u in K is algebraic over E(K), then u is in E(K).
Proof To prove the theorem, all we must do is show that u is algebraic
over F; this will put u in E(K), and we will be done.
Since u is algebraic over E(K), there is a nontrivial polynomial f(x) =
x
n
+ alx
n

1
+ a2Xn2 + + an, where ab a2,"" an are in E(K), such that
f(u) = O. Since ab a
2
, , an are in E(K), they are algebraic over Flof de
grees, say, mb m2"'" m
n
, respectively. We claim that [F(al"'" an): F] is
at most mlm2 ... m
n
. To see this, merely carry out n successive applications
of Theorem 5.3.1 to the sequence K b K
2
, ... , K
n
of fields defined above. We
leave its proof to the reader. Thus, since u is algebraic over the field K
n
=
F(ab a2,"" an) [after all, the polynomial satisfied by u is f(x) =
200 Fields Ch.5
x
n
+ alx
n

1
+ ... + an, which has all its coefficients in F(a}, a2, ... , an)], the
field K
n
(u) is a finite extension of K
n
, and since K
n
is a finite extension of F,
we have, again by Theorem 5.3.1, that Kn(u) is a finite extension of F. Be
cause u E Kn(u), we obtain from Theorem 5.3.2 that u is algebraic over F.
This puts u in E(K) by the very definition of E(K), thereby proving the theo
rem.D
There is a famous theorem due to Gauss, often referred to as the Fun
damental Theorem of Algebra, which asserts (in terms of extension) that the
only finite extension of C,' the field of complex numbers, is C itself. In reality
this result is not a purely algebraic one, its validity depending heavily on
topological properties of the field of real numbers. Be that as it may, it is an
extremely important theorem in algebra and in many other parts of mathe
matics.
The formulation of the Fundamental Theorem of Algebra in terms of
the nonexistence of finite extensions of C is a little different from that which
is usually given. The most frequent form in which this famous result is stated
involves the concept of a root of a polynomial, a concept we shall discuss at
some length later. In these terms the Fundamental Theorem of Algebra be
comes: A polynomial of positive degree having coefficients in C has at least
one root in C. The exact meaning of this statement and its equivalence with
the other form of the theorem stated above will become clearer later, after
the development of the material on roots.
A field L with the property of C described in the paragraphs above is
said to be algebraically closed. If we grant that C is algebraically closed
(Gauss' Theorem), then, by Theorem 5.4.3, we also have
The field of algebraic numbers is algebraically closed.
PROBLEMS
1. Show that a = v2  V3 is algebraic over Q of degree at most 4 by ex
hibiting a polynomialf(x) of degree 4 over Q.such thatf(a) = O.
2. If a and b in K are algebraic over F of degrees m and n, respectively, and
if m and n are relatively prime, show that [F(a, b) : F] = mn.
3. If a E C is such thatp(a) = 0, where
p(x) = x
5
+ v2x
3
+ V5x
2
+ v7X + VU,
show that a is algebraic over Q of degree at most 80.
Sec. 5 Constructi bi Iity 201
4. If K ~ F is such that [K: F] = p, p a prime, show that K = F(a) for every
a in K that is not in F.
5. If [K: F] = 2
n
and T is a subfield of K containing F, show that [T: F] = 2
m
for some m ~ n.
6. Give an example of two algebraic numbers a and b of degrees 2 and 3, re
spectively, such that ab is of degree less than 6 over Q.
7. If K ~ F are fields and ab ... , an are in K, show that F(ab ... , an) equals
F(a
u
(1)' ••• , au(n» for any permutation (T of 1, 2, ... , n.
5. CONSTRUCTIBILITY
In ancient Greece, unlike in the other cultures of the time, the Greek mathe
maticians were interested in mathematics as an abstract discipline rather
than as a pragmatic bag of tricks to do accounts or to carry out measure
ments. They developed strong interests and results in number theory and,
most especially, in geometry. In these areas they posed penetrating ques
tions. The questions they asked in geometrytwo of which will make up the
topic treated hereare still of interest and substance. The English mathe
matician G. H. Hardy, in his sad but charming little book A Mathematician's
Apology, describes the ancient Greek mathematicians as "colleagues from
another college."
Two of these Greek questions will be our concern in this section. But,
as a matter of fact, the answer to both will emerge as a consequence of the
criterion for constructibility, which we will obtain. We state these questions
now and will explain a little later what is entailed in them.
QUESTIONl
Can one duplicate a cube using just straightedge and compass? (By duplicat
ing a cube, we mean doubling its volume.)
QUESTION 2
Can one trisect an arbitrary angle using just straightedge and compass?
Despite the seemingly infinite number of angletrisectors that crop up
every year, the answer to both questions is "no." As we shall see, it is impos
sible to trisect 60° using just straightedge and compass. Of course, some
angles are trisectable, for instance, 0°, 90°, 145°, 180°, ... , but most angles
(in a very precise meaning of "most") are not.
202 Fields Ch.5
Before getting to the exact meaning of the questions themselves, we
want to spell out in explicit terms exactly what the rules of the game are.
By a straightedge we do not mean a rulerthat is, an instrument for mea
suring arbitrary lengths. No! A straightedge is merely a straight line,
with no quantitative or metric properties attributed to it. We are given a
line segmentto which we assign length Iand all other lengths that we
get from this must be obtainable merely employing a straightedge and
compass.
Let us call a nonnegative real number, b, a constructible length if, by
a finite number of applications of the straightedge and compass and the
points of intersection obtained between lines and circles so constructed,
we can construct a line segment of length b, starting out from the line seg
ment we have assigned length 1.
From our high school geometry we recall some things we can do in this
framework.
1. Whatever length we construct on one line can be constructed on any
other line by use of the compass acting as a transfer agent.
2. We can draw a line parallel to a given line that goes through a given
point.
3. We can construct a length n for any nonnegative integer n.
From these and by using results about the similarity of triangles, we can
construct any nonnegative rational length. We don't do that at this moment
for it will come out as a special case of what we are about to do.
We claim the following properties:
1. If a and b are constructible lengths, then so is a + b. For if AB is a
length segment of length a and CD is one of length b, we can transfer this
line segment CD, by means of a compass, to obtain the line ABE, where AB
is of length a and BE is of length b. Thus the line segment AE is of length
a + b. If b > a, how would you construct b  a?
2. If a and b are constructible lengths, then so is abo We Inay assume
that a =1= 0 and b =1= 0, otherwise, the statement is trivial. Consider the follow
ing diagram
P L)
J B
Sec. 5 Constructibility 203
Where L
1
and L
2
are two distinct lines intersecting at P, and such that PA
has length a, PB has length b, and PI has length 1. Let L
3
be the straight line
through I and A and L 4 the line parallel to L3 passing through B. If C is the
point of intersection of L
1
and L
4
, we have the diagram
p+#L
2
All of these constructions can be carried out by straightedge and compass.
From elementary geometry the length of PC is abo Therefore, ab is con
structible.
3. If a and b are constructible and b =1= 0, then alb is constructible. Con
sider the diagram
+....L
2
where P, A, B, I, L
b
and L
2
are as in Property 2 above. Let L
s
be the line
through A and B and let L
6
be the line through I parallel to L
s
. If D is the
point of intersection of L
1
and L
6
, then, again by elementary geometry, the
length of PD is alb. We stress again that all the constructions made can be
carried out by straightedge and compass.
Of course, this shows that the nonnegative rational numbers are con
structible lengths, since they are quotients of nonnegative integers, which we
know to be constructible lengths. But there are other constructible lengths,
for instance, the irrational number v2. Because we can construct by straight
edge and compass the rightangle triangle
204 Fields
Ch.5
with sides AB and BC of length 1, we know, by the Pythagorean Theorem,
that AC is of length v2. So v2 is a constructible length.
In Properties 1 to 3 we showed that the constructible lengths almost
form a field. What is lacking is the negatives. To get around this, we make
the
Definition. The real number a is said to be a constructible number if
Ia I, the absolute value of a, is a constructible length.
As far as we can say at the moment, any real number might be a con
structible one. We shall soon have a criterion which will tell us that certain
real numbers are not constructible. For instance, we shall be able to deduce
from this criterion that both ~ and cos 20
0
are not constructible. This in
turn will allow us to show that the answer to both Questions 1 and 2 is "no."
But first we state
Theorem 5.5.1. The constructible numbers form a subfield of the field
of real numbers.
Proof Properties 1 to 3 almost do the trick; we must adapt Property 1
slightly to allow for negatives. We leave the few details to the reader. D
Our next goal is to show that a constructible number must be an alge
braic numbernot any old algebraic number, but one satisfying a rather
stringent condition.
Note, first, that if a 2:: 0 is a constructible number, then so is ~ . Con
sider the diagram
B
It is of a semicircle of radius (a + 1)/2, center at C, AD is of length a, DB is
of length 1, and DE is the perpendicular to AB at D, intersecting the circle at
E. All this is constructible by straightedge and compass. From elementary
geometry we have that DE is of length ~ , hence ~ is constructible.
We now head for the necessary condition that a real number be con
structible. Let K be the field of constructible numbers, and let K
o
be a sub
field of K. By the plane of K
o
we shall mean the set of all points (a, b) in the
real Euclidean plane whose coordinates a and b are in K o. If (a, b) and (c, d)
Sec. 5 Constructi bi Iity 205
are in the plane of K
o
, then the straight line joining them has the equation
(y  b)/(x  a) = (b  d)/(a  c), so is of the form ux + vy + w = 0, where
u, v, and ware in K
o
. Given two such lines U1X + V1Y + W1 = 0 and
U2 X + V2Y + W2 = 0, where Ub Vb W1 and U2, V2, W2 are all in K
o
, either they
are parallel or their point of intersection is a point in K o. (Prove!)
Given a circle whose radius r is in K0 and whose center (a, b) is in the
plane of K
o
, then its equation is (x  a)2 + (y  b)2 =r
2
, which we see, on
expanding, is of the form XL + y2 + dx + ey + f = 0, where d, e, and f are in Ko.
To see where this circle intersects a line in the plane of K
o
, ux + vy + w = 0,
we solve simultaneously the equations of the line and of the circle. For in
stance, if V =1= 0, then y = (ux + w)/v; substituting this for y in the equation
of the circle x
2
+ y2 + dx + ey + f = 0 leads us to a quadratic equation for
the xcoordinate, c, of this intersection point, of the form c
2
+ S1C + S2 = 0,
with S1 and S2 in K
o
. By the quadratic formula, c = (S1 ± \lsi  4s
2
)/2,
and if the line and circle intersect in the real plane, then si  4s2 2: O. If
s = si  4s
2
2: 0 and if K
1
= K o ( ~ ) , then we see that the xcoordinate, C,
lies in K
1
• If ~ E K
o
, then K
1
= K
o
; otherwise, [K
1
: K
o
] = 2. Since the
ycoordinate d = (uc + w)/v, we have that d is also in K
1
• Thus the inter
section point (c, d) lies in the plane of K1 where [K1 : K0] = 1 or 2. The story
is similar if v = 0 and U =1= o.
Finally, to get the intersection of two circles x
2
+ y2 + dx + ey + f = 0
and x
2
+ y2 + gx + hy + k = 0 in the plane of K
o
, subtracting one of these
equations from the other gives us the equation of the line in the plane of K
o
,
(d  g)x + (e  h)y + (f  k) = O. So to find the points of intersection of
two circles in the plane of K
o
is the same as finding the points of intersection
of a line in the plane of K
o
with a circle in that plane. This is precisely the situa
tion we disposed of above. So if the two circles intersect in the real plane, their
points of intersection lie in the plane of an extension of K
o
of degree 1 or 2.
To construct a constructible length, a, we start in the plane of Q, the ra
tionals; the straightedge gives us lines in the plane of Q, and the compass
circles in the plane of Q. By the above, these intersect at a point in the plane
of an extension of degree 1 or 2 of Q. To get to a, we go by this procedure
from the plane of Q to that of L
1
, say, where [L
1
: Q] = 1 or 2, then to that
of L
2
, where [L
2
: Ltl = 1 or 2, and continue a finite number of times. We
get, this way, a finite sequence Q = L
o
eLl c ... c L
n
of fields, where each
[L
i
: L
i

1
] = 1 or 2 and where a is in Ln.
By Theorem 5.3.1, [L
n
: Q] = [L
n
: L
n

1
][L
n

1
: L
n

2
] .. · [L
1
: Q] and
since each of [L
i
: L
i

1
] = 1 or 2, we see that [L
n
: Q] is a power of 2. Since
a E L
n
, we have that Q(a) is a subfield of L
n
, hence by the Corollary to Theo
rem 5.3.1, [Q(a) : Q] must divide a power of 2, hence [Q(a) : Q] = 2
m
for
some nonnegative integer m. Equivalently, by Theorem 5.3.5, the minimal
206 Fields Ch.5
polynomial for a over 0 must have degree a power of 2. This is a necessary
condition that a be constructible. We have proved the important criterion for
constructibility, namely
Theorem 5.5.2. In order that the real number a be constructible, it is
necessary that [O(a) : 0] be a power of 2. Equivalently, the minimal polyno
mial of a over 0 must have degree a power of 2.
To duplicate a cube of sides 1, so of volume 1, by straightedge and
compass would require us to construct a cube of sides of length b whose vol
ume would be 2. But the volume of this cube would be b
3
, so we would have
to be able to find a constructible number b such that b
3
= 2.
Given a real number b such that b
3
= 2, then its minimal polynomial
over 0 is p(x) = x
3
 2, for this polynomial is monic and irreducible over 0 (if
you want, by the Eisenstein Criterion), and p(b) = O. Also, as is clear to the
eye, p(x) is of degree 3. Since 3 is not a power of 2, by Theorem 5.5.2, there is
no such constructible b. Therefore, the question of the duplication of the cube
by straightedge and compass has a negative answer. We summarize this in
Theorem 5.5.3. It is impossible to duplicate a cube of volume 1 by
straightedge and compass.
We now have disposed of the classical Question 1, so we turn our atten
tion to Question 2, the trisection of an arbitrary angle by straightedge and
compass.
If we could trisect the particular angle 60°, we would be able to con
struct the triangle ABC in the diagram
where () = 20° and AC is of length 1, by straightedge and compass. Since AB
is of length cos 20°, we would have that b = cos 20° is a constructible number.
We want to show that b = cos 20° is not a constructible number by pro
ducing its minimal polynomial over 0, and showing that this polynomial is of
degree 3. To this end we recall the tripleangle formula from trigonometry,
namely that cos 3cP = 4 cos
3
cP  3 cos cPo If b = cos 20°, then, since
cos(3 . 20°) = cos 60° = ~ , this trigonometric formula becomes 4b
3
 3b = ~ ,
and so 8b
3
 6b  1 = O. If c = 2b, this becomes c
3
 3c  1 = O. If b is con
structible, then so is c. But p(c) = 0, where p(x) = x
3
 3x  1, and this
Sec. 6 Roots of Polynomials 207
polynomial is irreducible over Q. (Prove!) So p(x) is the minimal polynomial
for cover Q. Because p(x) is of degree 3, and 3 is not a power of 2, by Theo
rem 5.5.2 we have that c is not constructible. So we cannot trisect 60
0
by
straightedge and compass. This answers Question 2 in the negative.
Theorem 5.5.4. It is impossible to trisect 60
0
by straightedge and
compass.
We hope that this theorem will dissuade any reader from joining the
hordes of angletrisectors. There are more profitable and pleasanter ways of
wasting one's time.
There is yet another classical problem of this kind to which the answer
is "no." This is the question of squaring the circle. This question asks: Can
we construct a square whose area is that of a circle of radius 1 by straight
edge and compass? This is equivalent to asking whether V; is a con
structible number. If this were the case, then since 1T = (V;)2, the number 1T
would be constructible. But Lindemann proved in 1882 that 1T is in fact tran
scendental, so certainly is not algebraic, and so cannot be constructible. There
fore, the circle of radius 1 cannot be squared by straightedge and compass.
Of course, what we did above does not constitute a proof of the impos
sibility of squaring the circle, since we have presupposed Lindemann's result
without proving it. To prove that 1T is transcendental would take us too far
afield. One might expect that it would be easier to prove that 1T is not con
structible than to prove that it is not algebraic. This does not seem to be the
case. Until now all proofs that 1T is not constructible go via the route of ex
ploiting the transcendence of 1T.
PROBLEMS
1. Complete the proof of Theorem 5.5.1.
2. Prove that x
3
 3x  1 is irreducible over Q.
3. Show that the construction given f o r ~ , a 2:: 0 does indeed give us ~ .
4. Prove that the regular heptagon (sevensided polygon with sides of equal
length) is not constructible by straightedge and compass.
6. ROOTS OF POLYNOMIALS
Let F[x], as usual, be the polynomial ring in x over the field F and let K be an
extension field of F. If a E K and
208 Fields Ch.5
then by f(a) we understand the element
f(a) = aa + ala + ... + ana
n
in K. This is the usage we have made of this notation throughout this chap
ter. We will now be interested in those a's in K such thatf(a) = O.
Definition. The element a E K is a root of the polynomial f(x) E F[x]
if f(a) = O.
In what we have done up until now we have always had an extension
field K of F given to us and we considered the elements in K algebraic over
F, that is, those elements of K that are roots of nonzero polynomials in F[x].
We saw that if a E K is algebraic over F of degree nthat is, if the minimal
polynomial for a over F is of degree nthen [F(a) : F] = n, where F(a) is
the subfield of K obtained by adjoining a to F.
What we do now is turn the problem around. We no longer will have
the extension K of F at our disposal. In fact, our principal task will be to pro
duce it almost from scratch. We start with some polynomial f(x) of positive
degree in F[x] as our only bit of data; our goal is to construct an extension
field K of F in which f(x) will have a root. Once we have this construction of
K under control, we shall elaborate on the general theme, thereby obtaining
a series of interesting consequences.
Before setting off on this search for the appropriate K, we must get
some information about the relation between the roots of a given polynomial
and the factorization of that polynomial.
Lemma 5.6.1. If a E L is a root of the polynomial f(x) E F[x] of de
gree n, where L is an extension field of F, then f(x) factors in L[x] as f(x) =
(x  a)q(x), where q(x) is of degree n  1 in L[x]. Conversely, if f(x) =
(x  a)q(x), withf(x), q(x), and a as above, then a is a root off(x) in L.
Proof Since F C L, F[x] is contained in L[x]. Because a E L, x  a is
in L[x]; by the Division Algorithm for polynomials, we have f(x) =
(x  a)q(x) + r(x), where q(x) and r(x) are in L[x] and where r(x) = 0 or
deg r(x) < deg(x  a) = 1. This yields that r(x) = b, some element of L.
Substituting a for x in the relation above, and using the fact that f(a) = 0, we
obtain 0 = (a  a)q(a) + b = 0 + b = b; thus b = o. Since r(x) = b = 0, we
have what we wanted, namely f(x) = (x  a)q(x).
For the statement that deg q(x) = n  1 we note that since f(x) =
(x  a)q(x), then, by Lemma 4.5.2, n = degf(x) = deg(x  a) + deg q(x) =
1 + deg q(x). This gives us the required result, deg q(x) = n  1.
The converse is completely trivial. D
Sec. 6 Roots of Polynomials 209
One immediate consequence of Lemma 5.6.1 is
Theorem 5.6.2. Let f(x) in F[x] have degree n; then f(x) can have at
most n roots in any extension, K, of F.
Proof We go by induction on n. If n == 1, then f(x) == ax + b, where a
and b are in F and where a =1= O. Thus the only root of f(x) is bfa, an ele
ment of F.
Suppose that the theorem is correct for all polynomials of degree k  1
over any field. Suppose that f(x) in F[x] is of degree k. If f(x) has no roots in
K, then the theorem is certainly correct. Suppose, then, that a E K is a root of
f(x). By Lemma 5.6.1, f(x) == (x  a)q(x), where q(x) is of degree k  1 in
K[x]. Any root b in K of f(x) is either a or is a root of q(x), since 0 == f(b) ==
(b  a)q(b). By induction, q(x) has at most k  1 roots in K, hence f(x) has at
most k roots in K. This completes the induction and proves the theorem. D
Actually, the proof yields a little more. To explain this "little more," we
need the notion of the multiplicity of a root.
Definition. If K is an extension of F, then the element a in K is a root
of multiplicity k > 0 of f(x), where f(x) is in F[x], if f(x) == (x  a)kq(x) for
some q(x) in K[x] and x  a does not divide q(x) (or, equivalently, where
q(a) =1= 0).
The same proof as that given for Theorem 5.6.2 yields the sharpened
version:
Let f(x) be a polynomial of degree n in F[x]; then f(x) can have at most n
roots in any extension field K of F, counting a root of multiplicity k as k roots.
Theorem 5.6.3. Let f(x) in F[x] be monic of degree n and suppose
that K is an extension of F in which f(x) has n roots, counting a root of multi
plicity k as k roots. If these roots in K are ab a2, . .. , am' each having
multiplicity k
b
k
2
, ... , k
m
respectively, then f(x) factors in K[x] as f(x) ==
(x  al)k
1
(x  a2)k
2
• •• (x  am)k
m
•
Proof The proof is easy by making use of Lemma 5.6.1 and of induc
tion on n. We leave the carrying out of the proof to the reader. D
Definition. We say that f(x) in F[x] splits into linear factors over (or
in) K iff(x) has the factorization in K[x] given in Theorem 5.6.3.
There is a nice application of Theorem 5.6.3 to finite fields. Let F be a
finite field having q elements, and let ab a2' ... , a
q
l be the nonzero ele
210 Fields Ch.5
ments of F. Since these form a group of order q  1 under the multiplication
in F, by Theorem 2.4.5 (proved ever so long ago), a
q

1
== 1 for any a *0 in
F. Thus the polynomial x
q

1
 1 in F[x] has q  1 distinct roots in F. By
Theorem 5.6.3, the polynomial x
q

1
 1 == (x  al)(x  a2) ... (x  aql)' If
we also consider 0, then every element a in F satisfies a
q
== a, so that the
polynomial x
q
 x has the q elements of F as its distinct roots. By Theorem
5.6.3 we have
Theorem 5.6.4. Let F be a finite field having q elements. Then x
q
 x
factors in F[x] as
x
q
 x == x(x  a1)(x  a
2
) ••• (x  a
q

1
),
where ab a2,"" aql are the nonzero elements of F, and
x
q

1
 1 == (x  a1)(x  a
2
) ••• (x  a
q

1
).
A very special case of this theorem is that in which F == 7L
p
, the integers
modulo the prime p. Here q == p and ab a2, ... ,apl are just 1,2, ... ,p  1
in some order.
Corollary. In 7L
p
[x], the polynomial x
p

1
 1 factors as
x
p

1
 1 == (x  l)(x  2) ... (x  (p  1».
Try this out for p == 5,7, and 11.
As a corollary to the corollary, we have a result in number theory,
known as Wilson's Theorem, which we assigned as Problem 18 in Section 4 of
Chapter 2.
Corollary. lfp is a prime, then (p  I)! == 1 modp.
Proof By the Corollary above,
X p ~ l  1 == (x  l)(x  2) ... (x  (p  1»;
substituting x == 0 in this gives us
1 == (1)(2)'" ((p  1» == (1)P
1
1' 2··· (p  1)
== (l)Pl(p  I)!
in 7L
p
• In the integers this translates into "congruent mod p." Thus
(l)Pl(p  I)! == 1 mod p,
and so (p  I)! == (l)P modp. But (l)P == 1 modp; hence we have
proved Wilson's Theorem. D
Sec. 6 Roots of Polynomials 211
We change direction to consider the problem mentioned at the begin
ning of this section: given f(x) E F[x], to construct a finite extension K of F
in which f(x) has a root. As we shall see in a moment, this construction of K
will be quite easy when we bring the results about polynomial rings proved in
Chapter 4 into play. However, to verify that this construction works will take
a bit of work.
Theorem 5.6.5. Let F be a field and f(x) a polynomial of positive de
gree n in F[x]. Then there exists a finite extension K of F, with [K: F] ~ n, in
which [(x) has a root.
Proof By Theorem 4.5.12, [(x) is divisible in F[x] by some irreducible
polynomial p(x) in F[x]. Since p(x) divides f(x), degp(x) ~ degf(x) = n,
and f(x) = p(x)q(x) for some polynomial q(x) in F[x]. If b is a root of p(x)
in some extension field, then b is automatically a root of [(x), since f(b) =
p(b)q(b) = Oq(b) = O. So to prove the theorem it is enough to find an exten
sion of F in which p (x) has a root.
Because p(x) is irreducible in F[x], the ideal M = (p(x» of F[x] gener
ated by p(x) is a maximal ideal of F[x] by Theorem 4.5.11. Thus by Theorem
4.4.2, K = F[x]/ M is a field. We claim that this is the field that we are seeking.
Strictly speaking, K does not contain F; as we now show, however, K
does contain a field isomorphic to F. Since every element in M is a multiple in
F[x] of p(x), every such nonzero element must have degree at least that of
p(x). Therefore, M n F = (0). Thus the homomorphism t/J: F[x] ~ K defined
by t/J(g(x» = g(x) + M for every g(x) in F[x], when restricted to F, is 1  1 on
F. Therefore, the image Fof F in K is a field isomorphic to F. We can identify
F, via t/J, with F and so, in this way, we can consider K an extension of F.
Denote x + M E K by a, so that t/J(x) = a, a E K. We leave it to the
reader to show, from the fact that t/J is a homomorphism of F[x] onto K with
kernel M, that t/J(g(x» = g(a) for every g(x) in F[x]. What is t/J(p(x»? On
the one hand, since p(x) is in F[x], t/J(p(x» = pea). On the other hand, since
p(x) is in M, the kernel of t/J, t/J(p(x» = O. Equating these two evaluations of
t/J(p(x», we get that pea) = O. In other words, the element a = t/J(x) in K is a
root o[p(x).
To finish the proof, all we need is to show that [K: F] = degp(x) ~ n.
This came up earlier, in the alternative proof we gave of Theorem 5.3.5.
There we left this point to be proved by the reader. We shall be a little more
generous here and carry out the proof in detail.
Given hex) in F[x], then, by the Division Algorithm, hex) = p(x)q(x) +
rex) where q(x) and rex) are in F[x], and rex) = 0 or deg rex) < degp(x).
Going modulo M, we obtain that
212 Fields
tfJ(h(x» == tfJ(p(x)q(x) + r e x » ~ == tfJ(p(x)q(x» + tfJ(r(x»
== tfJ(p(x»tfJ(q(x» + tfJ(r(x»
== tfJ(r(x» == rea)
Ch.5
[since tfJ(p(x» == pea) == 0].
So, since every element in K == F[x]/M is tfJ(h(x» for some hex) in F[x]
and tfJ(h(x» == rea), we see that every element of K is of the form rea), where
rex) is in F[x] and deg rex) < degp(x). If degp(x) == m, the discussion just
made tells us that 1, a, a
2
, ••• ,a
m

1
span Kover F. Moreover, these elements
are linearly independent over F, since a relation of the type aa + ala + ...
+ am_la
m

1
== 0 would imply that g(a) == 0 where g(x) == aa + alx + ... +
am_lX
m

1
is in F[x]. This puts g(x) in M, which is impossible since g(x) is of
lower degree than p(x), unless g(x) == O. In other words, we get a contradic
tion unless aa == al == ... = exml == o. So the elements 1, a, a
2
, ••• ,a
m

l
are
linearly independent over F. Since they also span Kover F, they form a basis
of Kover F. Consequently,
dimFK == [K: F] == m == degp (x) ::; n == deg[(x).
The theorem is proved. D
We carry out an iteration of the argument used in the last proof to
prove the important
Theorem 5.6.6. Let [(x) E F[x] be of degree n. Then there exists an
extension K of F of degree at most n! over F such that [(x) has n roots, count
ing multiplicities, in K. Equivalently, [(x) splits into linear factors over K.
Proof We go by induction on n. If n == 1, then [(x) = a + {3x, where a,
{3 E F and where {3 =1= O. The only root of [(x) is exl{3, which is in F. Thus
K == F and [K: F] == 1.
Suppose that the result is true for all fields for polynomials of degree k,
and suppose that [(x) E F[x] is of degree k + 1. By Theorem 5.6.5 there ex
ists an extension K
1
of F with [K
1
: F] ::; k + 1 in which [(x) has a root al.
Thus in K1[x], [(x) factors as [(x) = (x  al)q(x), where q(x) E K
1
[x] is of
degree k. By induction there exists an extension K of K 1 of degree at most k!
over Klover which q(x) splits into linear factors. But then [(x) splits into lin
ear factors over K. Since [K: F] == [K: Ktl[K
l
: F] ::; (k + l)k! == (k + I)!,
the induction is completed and the theorem is proved. D
We leave the subject of field extensions at this point. We are exactly at
what might be described as the beginning of Galois theory. Having an exten
Sec. 6 Roots of Polynomials 213
sion K of F of finite degree over which a given polynomiall(x) splits into lin
ear factors, there exists an extension of least degree enjoying this property.
Such an extension is called a splitting field of I(x) over F. One then proceeds
to prove that such a splitting field is unique up to isomorphism. Once this is
in hand the Galois theory goes into full swing, studying the relationship be
tween the group of automorphisms of this splitting field and its subfield struc
ture. Eventually, it leads to showing, among many other things, that there
exist polynomials over the rationals of all degrees 5 or higher whose roots
cannot be expressed nicely in terms of the coefficients of these polynomials.
This is a brief and very sketchy description of where we can go from
here in field theory. But there is no hurry. The readers should assimilate the
material we have presented; this will put them in a good position to learn
Galois theory if they are so inclined.
PROBLEMS
1. Prove Theorem 5.6.3.
2. If F is a finite field having the q  1 nonzero elements aI, a2' ... , aqI,
prove that aIa2 ... aqI = (l)q.
3. Let Q be the rational field and let p (x) = x
4
+ x
3
+ x
2
+ X + 1. Show
that there is an extension K of Q with [K: 0] = 4 over which p(x) splits
into linear factors. [Hint: Find the roots of p (x).]
4. If q (x) = x
n
+ aI x
n

I
+ ... + an, an =1= 0, is a polynomial with integer
coefficients and if the rational number r is a root of q(x), prove that r is
an integer and r Ian.
5. Show that q(x) = x
3
 7x + 11 is irreducible over Q.
6. If F is a field of characteristic p =1= 0, show that (a + b)P = a
P
+ b
P
for all
a and b in F.
7. Extend the result of Problem 6 by showing that (a + b)m = am + b
m
,
where m = pn.
8. Let F = 7L
p
, p a prime, and consider the polynomial x
m
 x in 7L
p
[x] ,
where m = pn. Let K be a finite extension of 7L
p
over which x
m
 x splits
into linear factors. In K let K
o
be the set of all roots of x
m
 x. Show that
K
o
is a field having at most pn elements.
9. In Problem 8 show that K
o
has exactly pn elements. (Hint: See Problem
14.)
10. Construct an extension field K
n
of Q such that [K
n
: 0] = n, for any
n 2:: 1.
214 Fields
11. Define the mapping B: F[x] ~ F[x] by
B(a
o
+ atx + a2x2 + . .. + anx
n
)
= at + 2a
2
x + ... + iaix
i

t
+ ... + nanx
n

1
•
Ch.5
Prove that:
(a) B(f(x) + g(x» = B(f(x») + B(g(x».
(b) B(f(x)g(x» = f(x)B(g(x» + B(f(x»g(x) for allf(x) and g(x) in F[x].
12. If F is of characteristic p =1= 0, characterize all f(x) in F[x] such that
B(f(x» = 0.
13. Show that if f(x) in F[x] has a root of multiplicity greater than 1 in some
extension field of F, then f(x) and B(f(x» are not relatively prime in
F[x].
14. If F is of characteristic p =1= 0, show that all the roots of x
m
 x, where
m = p n, are distinct.
15. If f(x) in F[x] is irreducible and has a root of multiplicity greater than 1
in some extension of F, show that:
(a) F must be of characteristic p for some prime p.
(b) f(x) = g(x
P
) for some polynomial g(x) in F[x].
6
SPECIAL TOPICS (OPTIONAL)
In this final chapter we treat several unrelated topics. One of these comes
from group theory, and all the rest from the theory of fields. In handling
these special topics, we draw from many of the results and ideas developed
earlier in the book. Although these topics are somewhat special, each of
them has results that are truly important in their respective areas.
The readers who have managed to survive so far should have picked up
a certain set of techniques, experience, and algebraic knowhow to be able to
follow the material with a certain degree of ease. We now feel free to treat
the various matters at hand in a somewhat sketchier fashion than we have
heretofore, leaving a few more details to the reader to fill in.
The material we shall handle does not lend itself readily to problems, at
least not to problems of a reasonable degree of difficulty. Accordingly, we
will assign relatively few exercises. This should come as a relief to those
wanting to assimilate the material in this chapter.
1. THE SIMPLICITY OF An
In Chapter 3, where we discussed Sn, the symmetric group of degree n, we
showed that if n ;::: 2, then Sn has a normal subgroup An, which we called the
alternating group of degree n, which is a group of order n!/2. In fact, An was
merely the set of all even permutations in Sn.
215
216 Special Topics (Optional) Ch.6
In discussing An, we said that An, for n 2:: 5, was a simple group, that is,
that An has no normal subgroups other than (e) and itself. We promised
there that we would prove this fact in Chapter 6. We now make good on this
promise.
To make clear what it is that we are about to prove, we should perhaps
repeat what we said above and formally define what is meant by a simple
group.
Definition. A nonabelian group is said to be simple if its only normal
subgroups are (e) and itself.
We impose the proviso that G be nonabelian to exclude the trivial ex
amples of cyclic groups of prime order from the designation "simple." These
cyclic groups of prime order have no nontrivial subgroups at all, so, perforce,
they have no proper normal subgroups. An abelian group with no proper
subgroups is easily seen to be cyclic of prime order.
We begin with the very easy
Lemma 6.1.1. If n 2:: 3 and Tb T2 are two transpositions in Sn' then T1 T2
is either a 3cycle or the product of two 3cycles.
Proof If T1 == T2' then T1 T2 == Ti == e and e is certainly the product of
two 3cycles, for instance as e == (123)(132).
If T1 =1= T2' then they either have one letter in common or none. If they
have one letter in common, we may suppose, on a suitable renumbering, that
T1 == (12) and T2 == (13). But then T1 T2 == (12)(13) == (132), which is already a
3cycle.
Finally, if T1 and T2 have no letter in common, we may suppose, without
loss of generality, that T1 == (12) and T2 == (34), in which case Tl T2 == (12)(34) ==
(142)(143), which is indeed the product of two 3cycles. The lemma is now
proved. D
An immediate consequence of Lemma 6.1.1 is that for n 2:: 3 the
3cycles generate An, the alternating group of degree n.
Theorem 6.1.2. If (J is an even permutation in Sn, where n 2:: 3, then (J
is a product of 3cycles. In other words, the 3cycles in Sn generate An.
Proof Let (J E Sn be an even permutation. By the definition of the par
ity of a permutation, (J is a product of an even number of transpositions.
Thus (J == T1 T2 •.• T2i 1T2i ... T2m 1T2m is a product of 2m transpositions
Tb T2, ... , T2m· By Lemma 6.1.1, each T2i 1T2i is either a 3cycle or a product
Sec. 1 The Simplicity of An 217
of two 3cycles. So we get that (J" is either a 3cycle or the product of at most
2m 3cycles. This proves the theorem. D
We now give an algorithm for computing the conjugate of any permu
tation in Sn. Let (J" E Sn' and suppose that (J"(i) == j. What does T(J"T1 look
like if T E Sn? Suppose that T(i) == sand T(j) == t; then T(J"T
1
(S) ==
T(J"(T1(S» == T(J"(i) == T(j) == t. In other words, to compute T(J"T
1
replace
every symbol in (J" by its image under T.
For instance, if (J" == (123) and T == (143), then, since T(l) == 4, T(2) == 2,
T(3) == 1, and T(4) == 3, we see that T(J"T
1
== (421) == (142).
Given two kcycles, say (12 ... k) and (i
1
i
2
••• i
k
), then they are conju
gate in Sn because if T is a permutation that sends 1 into i
b
2 into i
2
, ••• , k
into i
k
, then T(12 ... k)T
1
== (i
1
i
2
, • •• , i
k
). Since every permutation is the
product of disjoint cycles and conjugation is an automorphism, we get, from
the result for kcycles, that to compute T(J"T1 for any permutation (J", replace
every symbol in (J" by its image under T. In this way we see that it is extremely
easy to compute the conjugate of any permutation.
Given two permutations (J"1 and (J"2 in Sn, then they are conjugate in Sn,
using the observation above, if in their decompositions into products of dis
joint cycles they have the same cycle lengths and each cycle length with the
same multiplicity. Thus, for instance, (12)(34)(567) and (37)(24)(568) are
conjugate in Sg, but (12)(34)(567) and (37)(568) are not.
Recall that by a partition of the positive integer n, we mean a decompo
sition of n as n == n 1 + n2 + ... + nk, where 0 ::; n1 ::; n2 ::; ... ::; nk. If (J" in
Sn is the disjoint product of an n1cycle, an n2cycle, ... , an nkcycle, then
n 1 + n2 + ... + nk == n, and a permutation T is conjugate to (J" if and only if T
is the disjoint product of cycles in the same way. Therefore, the number of
conjugacy classes in Sn is equal to the number of partitions of n.
For instance, if n = 4, then the partitions of 4 are 4 = 4, 4 = 1 + 3,
4 == 1 + 1 + 2, 4 = 1 + 1 + 1 + 1, and 4 == 2 + 2, which are five in num
ber. Thus S4 has five conjugacy classes, namely the classes of (1234), (123),
(12), e, and (12)(34), respectively.
We summarize everything we said above in three distinct statements.
Lemma 6.1.3. To find T(J"T1 in Sn, replace every symbol in the cycle
structure of (J" by its image under T.
Lemma 6.1.4. Two elements in Sn are conjugate if they have similar
decompositions as the product of disjoint cycles.
Lemma 6.1.5. The number of conjugacy classes in Sn is equal to the
number of partitions of n.
218 Special Topics (Optional) Ch.6
Clearly, from the results above, any two 3cycles in Sn are conjugate in
Sn. A 3cycle is an even permutation, so is in An. One might wonder if any
two 3cycles are actually conjugate in the smaller group An. For n ;::: 5 the
answer is "yes," and is quite easy to prove.
Lemma 6.1.6. If n ;::: 5, then any two 3cycles in Sn are already conju
gate in An.
Proof Let 0"1 and 0"2 be two 3cycles in Sn; by Lemma 6.1.4 they are
conjugate in Sn. By renumbering, we may assume that 0"1 == (123) and 0"2 ==
T(123)T
1
for some T E Sn. If T is even, then we are done. If T is odd, then
p == T(45) is even and p(123)pl == T(45)(123)(45)l T 1 == T(123)T
1
== 0"2.
Therefore, 0"1 and 0"2 are conjugate in An" We thus see that the lemma is cor
rect. D
In S3 the two 3cycles (123) and (132) are conjugate in S3 but are not
conjugate in A
3
, which is a cyclic group of order 3.
We now prove a result that is not only important in group theory, but
also plays a key role in field theory and the theory of equations.
Theorem 6.1.7. If n ;::: 5, then the only nontrivial proper normal sub
group of Sn is An.
Proof Suppose that N is a normal subgroup of Sn and N is neither (e)
nor Sn. Let 0" =1= e be in N. Since the center of Sn is just (e) (See Problem 1)
and the transpositions generate Sn, there is a transposition T such that
O"T =1= TO". By Lemma 6.1.4, T1 == O"TO"l is a transposition, so TTl == TO"TO"l =I= e
is in N, since 0" E Nand TO"T == TO"T
1
E N because N is normal in Sn. So N
contains an element that is the product of two transpositions, namely TTl.
If T and T1 have a letter in common, then, as we saw in the proof of
Lemma 6.1.1, TTl is a 3cycle, hence N contains a 3cycle. By Lemma 6.1.4 all
3cycles in Sn are conjugate to TTl so must fall in N, by the normality of N in
Sn' Thus the subgroup of Sn generated by the 3cycles, which, according to
Theorem 6.1.2, is all of An, lies in N. Note that up to this point we have not
used that n ;::: 5.
We may thus assume that T and T1 have no letter in common. Without
loss of generality we may assume that T == (12) and T1 == (34); therefore,
(12)(34) is in N. Since n ~ 5, (15) is in Sn hence (15)(12)(34)(15)1 ==
(25)(34) is also in N; thus (12)(34)(25)(34) == (125) is in N. Thus in this case
also, N must contain a 3cycle. The argument above then shows that N ~ An.
We have shown that in both cases N must contain An" Since there are
no subgroups strictly between An and Sn and N =1= Sn, we obtain the desired
result that N == An" D
Sec. 1
The result is false for n == 4; the subgroup
The Simplicity of An 219
N == fe, (12)(34), (13)(24), (14)(23)}
is a proper normal subgroup of S4 and is not A
4
•
We now know all the normal subgroups of Sn when n 2:: 5. Can we de
termine from this all the normal subgroups of An for n 2:: 5? The answer is
"yes"; as we shall soon see, An is a simple group if n 2:: 5. The proof we give
may strike many as strange, for it hinges on the fact that 60, the order of As,
is not a perfect square.
Theorem 6.1.8. The group As is a simple group of order 60.
Proof Suppose that As is not simple; then it has a proper normal
subgroup N whose order is as small as possible. Let the subset T ==
{o' E Ssl NO'
1
eN}, the normalizer of N in Ss. Since N is normal in As, we
know that T :J As. T is a subgroup of Ss, so if T =1= As, we would have that
T == Ss. But this would tell us that N is normal in Ss, which, by Theorem
6.1.7, would imply that N :J As, giving us that N == As, contrary to our suppo
sition that N is a proper subgroup of As. So we must have T == As. Since (12)
is odd, it is not in As, hence is not in T. Therefore, M == (12)N(12)1 =1= N.
Since N <l As, we also have that M <l As (Prove!), thus both M n N
and MN == {mn 1m E M, n E N} are normal in As. (See Problem 9.) Because
M =1= N we have that M n N =1= N, and since N is a minimal proper normal
subgroup of As, it follows that M n N == (e). On the other hand,
(12)MN(12)1 == (12)M(12)1(12)N(12)1 == NM (since (12)N(12)1 == M
and (12)M(12)1 == N) == MN by the normality of M and N in As. There
fore, the element (12) is in the normalizer of MN in Ss , and since MN is nor
mal in As, we get, as we did above, the MN is normal in Ss, and so MN == As
by Theorem 6.1.7.
Consider what we now have. Both M and N are normal subgroups of
As, each of order INI, and MN == As and M n N == (e). We claim, and leave
to the reader, that MN must then have order INI
2
• Since MN == As, we ob
tain that 60 == IAsl = IMNI == INI
2
• But this is sheer nonsense, since 60 is not
the square of any integer. This establishes Theorem 6.1.8. D
To go from the simplicity of As to that of An for n 2:: 5 is not too hard.
Note that the argument we gave for As did not depend on 5 until the punch
line "60 is not the square of any integer." In fact, the reasoning is valid as
long as we know that n!/2 is not a perfect square. Thus, for example, if n == 6,
then 6!/2 == 360 is not a square, hence A
6
is a simple group. Since we shall
need this fact in the subsequent discussion, we record it before going on.
220 Special Topics (Optional)
Corollary to the Proof of Theorem 6.1.8. A
6
is a simple group.
Ch.6
We return to the question of whether or not n!/2 is a square. As a mat
ter of fact, it is not if n > 2. This can be shown as a consequence of the beau
tiful theorem in number theory (the socalled Bertrand Postulate), which as
serts that for m > 1 there is always a prime between m and 2m. Since we do
not have this result at our disposal, we follow another road to show the sim
plicity of An for all n ~ 5.
We now prove this important theorem.
Theorem 6.1.9. For all n ~ 5 the group An is simple.
Proof By Theorem 6.1.8 we may assume that n ~ 6. The center of An
for n > 3 is merely (e). (Prove!) Since An is generated by the 3cycles, if
U =1= e is in An, then, for some 3cycle T, UT =1= TU.
Suppose that N =1= (e) is a normal subgroup of An and that U =1= eis in N.
Thus, for some 3cycle T, UT =1= TU, which is to say, UTU1T
1
=1= e. Because
N is normal in An, the element TU
1
T
1
is in N, hence UTU
1
T
1
is also in N.
Since T is a 3cycle, so must UTU1 also be a 3cycle. Thus N contains the
product of two 3cycles, and this product is not e. These two 3cycles involve
at most six letters, so can be considered as sitting in A
6
which, since n ~ 6,
can be considered embedded isomorphically in An. (Prove!) But then
N n A 6 =1= (e) is a normal subgroup of A 6, so by the Corollary above,
N n A
6
= A
6
• Therefore, N must contain a 3cycle, and since all 3cycles are
conjugate in An (Lemma 6.1.6), N must contain all the 3cycles in Sn' Since
these 3cycles generate An, we obtain that N is all of An, thereby proving the
theorem. D
There are many different proofs of Theorem 6.1.9they usually in
volve showing that a normal subgroup of An must contain a 3cyclewhich
are shorter and possibly easier than the one we gave. However, we like the
bizarre twist in the proof given in that the whole affair boils down to the fact
that 60 is not a square. We recommend to the reader to look at some other
proofs of this very important theorem, especially in a book on group theory.
The An provide us with an infinite family of finite simple groups. There
are several other infinite families of finite simple groups and 26 particular
ones that do not belong to any infinite family. This determination of all
finite simple groups, carried out in the 1960s and 1970s by a large number of
group theorists, is one of the major achievements of twentiethcentury math
ematics.
Sec. 2
PROBLEMS
Finite Fields I 221
*1. Prove that if n > 2, the center of Sn is (e).
*2. Prove that if n > 3, the center of An is (e).
3. What can you say about the cycle structure of the product of two
3cycles?
4. If m < n, show that there is a subgroup of Sn isomorphic to Sm.
5. Show that an abelian group having no proper subgroups is cyclic of
prime order.
6. How many conjugacy classes are there in S6?
7. If the elements aI, a2 , ... , an generate the group G and b is a noncentral
element of G, prove that bai =1= aib for some i.
8. If M <l Nand N <J G, show that aMa
1
is normal in N for every a E G.
9. If M <l G and N <l G, show that MN is a normal subgroup of G.
10. If n 2:: 5 is odd, show that the ncycles generate An.
11. Show that the centralizer of (12 ... k) in Sn has order k (n  k)! and that
(12··· k) has n!/(k(n  k)!) conjugates in Sn.
12. In the proof of Theorem 6.1.8, show that IMNI = INI
2
•
2. FINITE FIELDS I
Our goal in this section and the next two is to get a complete description of
all finite fields. What we shall show is that the multiplicative group of
nonzero elements of a finite field is a cyclic group. This we do in this section.
In the next two, the objectives will be to establish the existence and unique
ness of finite fields having pn elements for any prime p and any positive inte
ger n.
Some of the things we are about to do already came up in the problem
sets in group theory and field theory as hard problems. The techniques that
we use come from group theory and field theory, with a little number theory
thrown in.
We recall what the Euler q;function is. We define the Euler q;function
by: q;(1) = 1 and, for n > 1, q;(n) is the number of positive integers less than
n and relatively prime to n.
We begin with a result in number theory whose proof, however, will ex
ploit group theory. Before doing the general case, we do an example.
Let n = 12; then q;(12) = 4, for only 1,5,7, and 11 are less than 12 and
relatively prime to 12. We compute q;(d) for all the divisors of 12. We have:
222 Special Topics (Optional) Ch.6
cp(1) = 1, cp(2) = 1, cp(3) = 2, cp(4) = 2, cp(6) = 2, and cp(12) = 4. Note that
the sum of all cp(d) over all the divisors of 12 is 12. This is no fluke but is a
special case of
Theorem 6.2.1. If n 2:: 1, then 'Lcp(d) = n, where this sum runs over all
divisors d of n.
Proof Let G be a cyclic group of order n generated by the element a.
If din, how many elements of G have order d? If b = a
n
/
d
, then all the solu
tions in G of x
d
= e are the powers e, b, b
2
, ••• , b
d

1
of b. How many of
these have order d? We claim, and leave to the reader, that b' has order d if
and only if r is relatively prime to d. So the number of elements of order d in
G, for every divisor d of n, is cp(d). Every element in G has order some divi
sor of n, so if we sum up the number of elements of order dnamely cp(d)
over all d dividing n, we account for each element of G once and only once.
Hence 'Lcp(d) = n if we run over all the divisors d of n. The theorem is now
proved. D
In a finite cyclic group of order n the number of solutions of x
d
= e, the
unit element of G, is exactly d for every d that divides n. We used this fact in
the proof of Theorem 6.2.1. We now prove a converse to this, getting thereby
a criterion for cyclicity of a finite group.
Theorem 6.2.2. Let G be a finite group of order n with the property
that for every d that divides n there are at most d solutions of x
d
= e in G.
Then G is a cyclic group.
Proof Let \fJ(d) be the number of elements of G of order d. By hy
pothesis, if a EGis of order d, then all the solutions of x
d
= e are the dis
tinct powers e, a, a
2
, ••• , a
d
 1, of which number, cp(d) are of order d. So if
there is an element of order d in G, then tfJ(d) = cp(d). On the other hand, if
there. is no element in G of order d, then tfJ(d) = O. So for all din we have
that tfJ(d) ::::; cp(d). However, since every element of G has some order d that
divides n we have that 'LtfJ(d) = n, where this sum runs over all divisors d of
n. But
n = 'LtfJ(d) ::::; 'Lcp(d) = n
since each tfJ(d) ::::; cp(d). This gives us that 'LtfJ(d) = "Lcp(d), which, together
with tfJ(d) ::::; cp(d), forces tfJ(d) = cp(d) for every d that divides n. Thus, in
particular, tfJ(n) = cp(n) 2:: 1. What does this tell us? After all, tfJ(n) is the
number of elements in G of order n, and since tfJ(n) 2:: 1 there must be an ele
ment a in G of order n. Therefore, the elements e, a, a
2
, ••• , a
n

1
are all dis
Sec. 2 Finite Fields I 223
tinct and are n in number, so they must give all of G. Thus G is cyclic with a
as generator, proving the theorem. D
Is there any situation where we can be sure that the equation x
d
= e
has at most d solutions in a given group? Certainly. If K* is the group of
nonzero elements of a field under multiplication, then the polynomial x
n
 1
has at most n roots in K* by Theorem 5.6.2. So, if G C K* is a finite multi
plicative subgroup of K*, then the number of solutions of x
d
= 1 in G is at
most d for any positive integer d, so certainly for all d that divide the order
of G. By Theorem 6.2.2 G must be a cyclic group. We have proved
Theorem 6.2.3. If K is a field and K* is the group of nonzero ele
ments of K under multiplication, then any finite subgroup of K* is cyclic.
A very special case of Theorem 6.2.3, but at the moment the most im
portant case for us, is
Theorem 6.2.4. If K is a finite field, then K* is a cyclic group.
Proof. K* is a finite subgroup of itself, so, by Theorem 6.2.3, K* is
cyclic. D
A particular instance of Theorem 6.2.4 is of great importance in num
ber theory, where it is known as the existence ofprimitive roots mod p for p a
prime.
Theorem 6.2.5. If p is a prime, then Z; is a cyclic group.
PROBLEMS
1. If a E G has order d, prove that ar also has order d if and only if rand d
are relatively prime.
2. Find a cyclic generator (primitive root) for Z ~ l .
3. Do Problem 2 for Zi7.
4. Construct a field K having nine elements and find a cyclic generator for
the group K*.
5. If p is a prime and m = p2, then Zm is not a field but the elements
{[a] I(a, p) = I} form a group under the multiplication in Zm. Prove that
this group is cyclic of order p (p  1).
224 Special Topics (Optional) Ch.6
6. Determine all the finite subgroups of C*, where C is the field of complex
numbers.
In the rest of the problems here q; will be the Euler q;function.
7. Ifp is a prime, show that q;(pn) = pnI(p  1).
8. If m and n are relatively prime positive integers, prove that
q;(mn) = q;(m)q;(n).
9. Using the result of Problems 7 and 8, find q;(n) in terms of the factoriza
tion of n into prime power factors.
10. Prove that lim q;(n) = 00.
n.. oo
3. FINITE FIELDS II: EXISTENCE
Let K be a finite field. Then K must be of characteristic p, p a prime, and K con
tains 0, 1, 2, ... ,p  1, the p multiples of the unit element 1 of K. So K :J 7L
p
,
or, more precisely, K contains a field isomorphic to 7L
p
• Since K is a vector
space over 7L
p
and clearly is of finite dimension over 7L
p
, if [K: 7L
p
] = n, then
K has pn elements. To see this, let Vb V2, ... , V
n
be a basis of Kover 7L
p
• Then
for every distinct choice of (a}, a2,' .. , an), where the ai are in 7L
p
, the elements
are distinct. Thus, since we can pick (aI, a2, ... , an) in pn ways, K has pn
elements.
The multiplicative group K* of nonzero elements of K is a group of
order pn  1. So, we have that a
m

I
= 1, where m = pn, for every a in K,
hence am = a. Since this is also obviously true for a = 0, we have that am = a
for every a in K. Therefore, the polynomial x
m
 x in 7L
p
[x] has m = pn dis
tinct roots in K, namely all the elements of K. Thus x
m
 x factors in K[x] as
where ab a2,' .. ' am are the elements of K.
Everything we just said we already said, in more or less the same way,
in Section 6 of Chapter 5. Since we wanted these results to be fresh in the
reader's mind, we repeated this material here.
We summarize what we just did in
Theorem 6.3.1. Let K be a finite field of characteristic p, p a prime.
Then K contains m = pn elements where n = [K: 7L
p
], and the polynomial
x
m
 x in 7L
p
[x] splits into linear factors in K[x] as
Sec. 3 Finite Fields II: Existence 225
where aI, a2, ... , am are the elements of K.
Two natural questions present themselves:
1. For what primes p and what integers n does there exist a field having pn
elements?
2. How many nonisomorphic fields are there having pn elements?
We shall answer both questions in this section and the next. The an
swers will be
1. For any prime p and any positive integer n there exists a finite field
having pn elements.
2. Two finite fields having the same number of elements are isomorphic.
It is to these two results that we now address ourselves. First, we settle
the question of the existence of finite fields. We begin with a general remark
about irreducible polynomials.
Lemma 6.3.2. Let F be any field and suppose that p(x) is an irre
ducible polynomial in F[x]. Suppose that q(x) in F[x] is such that in some ex
tension field of F,p(x) and q(x) have a common root. Thenp(x) divides q(x)
in F[x].
Proof. Suppose that p(x) does not divide q(x); since p(x) is irreducible
in F[x], p(x) and q(x) must therefore be relatively prime in F[x]. Thus there
are polynomials u(x) and v(x) in F[x] such that
u(x)p(x) + v(x)q(x) = 1.
Suppose that the element a in some extension K of F is a root of both p(x)
and q(x); thus pea) = q(a) = O. But then 1 = u(a)p(a) + v(a)q(a) = 0, a
contradiction. So we get thatp(x) divides q(x) in F[x]. 0
Note that we can actually prove a little more, namely
Corollary. If f(x) and g(x) in F(x) are not relatively prime in K[x] ,
where K is an extension of F, then they are not relatively prime in F[x].
Let F be a field of characteristic p =1= O. We claim that the polynomial
f(x) = x
m
 x, where m = pn, cannot have a multiple root in any extension
226 Special Topics (Optional)
Ch.6
field K of F. Do you remember what a multiple root of a polynomial is? We
refresh your memory. If g(x) is in F[x] and if K is an extension field of F, then
a in K is a multiple root of g(x) if g(x) = (x  a)2q(x) for some q(x) in K[x].
We return to the polynomial f(x) = x
m
 x above. Since f(x) =
x(x
m

t
 1) and 0 is not a root of x
m

t
 1, it is clearly true that 0 is a simple
(Le., not multiple) root of f(x). Suppose that a E K, K :::> F, is a root of f(x);
thus am = a. If y = x  a, then
f(y) == ym  y = (x  a)m  (x  a) = x
m
 ~  (x  a)
(since we are in characteristic p =1= 0 and m =pn)
= x
m
 x (because am = a) = f(x).
So
f(x) = f(y) = ym  y = (x  a)m  (x  a)
= (x  a)«x  a)mt  1),
and clearly this is divisible by x  a only to the first power, since x  a does
not divide (x  a)ml  1. So a is not a multiple root off(x).
We have proved
Theorem 6.3.3. If n > 0, then f(x) = x
m
 x, where m = pn, has no
multiple roots in any field of characteristic p.
We should add a word to the proof above to nail down the statement of
Theorem 6.3.3 as we gave it. Any field of characteristic p =1= 0 is an extension
of 7L.
p
, and the polynomial f(x) is in 7L.
p
[x]. So the argument above, with K
any field of characteristic p and F = 7L.
p
, proves the theorem in its given form.
We have exactly what we need to prove the important
Theorem 6.3.4. For any prime p and any positive integer n there ex
ists a finite field having p n elements.
Proof Consider the polynomial x
m
 x in 7L.
p
[x] , where m = pn. By
Theorem 5.6.6 there exists a finite extension K of 7L
p
such that in K[x] the
polynomial x m  x factors as
where at, a2' ... , am are in K. By Theorem 6.3.3, x
m
 x does not have any
multiple roots in K, hence the elements at, a2' ... , am are m = pn distinct el
ements. We also know that ab a2,.'.' am are all the roots of x
m
 x in K,
since x
m
 x is of degree m.
Sec. 4 Finite Fields III: Uniqueness 227
Let A = {a E K Iam = a}; as we just saw, A has m distinct elements. We
claim that A is a field. If a, b E A, then am = a and b
m
= b, hence (ab)m =
amb
m
= ab, thus ab E A. Because we are in characteristic p =1= 0 and m = pn,
(a + b)m = am + b
m
= a + b, hence a + b is in A.
Since A is a finite subset of a field and is closed with respect to sum and
product, A must be a subfield of K. Since A has m = pn elements, A is thus
the field whose existence was asserted in the statement of the theorem. With
this the theorem is proved. 0
PROBLEMS
*1. Give the details of the proof of the Corollary to Lemma 6.3.2.
The next two problems are a repeat of ones given earlier in the book.
2. If f(x) = aox
n
+ atXnt + ... + an is in F[x], let f'(x) be the formal
derivative of f(x) defined by the following equation: f'(x) = naoxnt
+ (n  l)atxn2 + ... + (n  i)aixnit + · . · + ant. Prove that:
(a) ([(x) + g(x»' =['(x) + g'(x)
(b) ([(x)g(x»' = f'(x)g(x) + [(x)g'(x) for allf(x) and g(x) in F[x].
*3. Prove that [(x) in F[x] has a multiple root in some extension of F if and
only if [(x) and [' (x) are not relatively prime.
4. If f(x) = x
n
 x is in F[x], prove that f(x) does not have a multiple root in
any extension of F if F is either of characteristic 0 or of characteristic
p =1= 0, where p does not divide n  1.
s. Use the result of Problem 4 to give another proof of Theorem 6.3.3.
6. If F is a field of characteristic p =1= 0, construct a polynomial with multiple
roots of the form x
n
 x, where p I(n  1).
7. If K is a field having pn elements, show that for every m that divides n
there is a subfield of K having pm elements.
4. FINITE FIELDS III: UNIQUENESS
Now that we know that finite fields exist having pn elements, for any prime p
and any positive integer n, we might ask: How many finite fields are there
with pn elements? For this to make any sense at all, what we are really asking
is: How many distinct nonisomorphic fields are there with pn elements? The
answer to this is short and sweet: one. We shall show here that any two finite
fields having the same number of elements are isomorphic.
228 Special Topics (Optional) Ch.6
Let K and L be two finite fields having pn elements. Thus K and L are
both vector spaces of dimension n over 7L
p
• As such, K and L are isomorphic
as vector spaces. On the other hand, K* and L* are both cyclic groups of
order pn  1 by Theorem 6.2.4; hence K* and L* are isomorphic as multi
plicative groups. One would imagine that one could put these two isomor
phisms together to prove that K and L are isomorphic as fields. But it just
isn't so. The proof does not take this direction at all. But the finiteness of K
and L together with these two isomorphisms (of two structures carried by K
and L) do suggest that, perhaps, K and L are isomorphic as fields. This is in
deed the case, as we now proceed to show.
We begin with
Lemma 6.4.1. If q(x) in 7L
p
[x] is irreducible of degree n, then
q(x) I(x
m
 x), where m = pn.
Proof. By Theorem 4.5.11 the ideal (q(x» of Zp[x] generated by q(x) is
a maximal ideal of 7L
p
[x] since q(x) is irreducible in 7L
p
[x]. Let A =
7L
p
[x]/(q(x»; by Theorem 4.4.3, A is a field of degree n over 7L
p
, hence has pn
elements. Therefore, u
m
= u for every element u in A.
Let a = x + (q(x» be the coset of x in A = 7L
p
[x]/(q(x»; thus q(a) = 0
and q(x) is the minimal polynomial for a over 7L
p
• Since a is in A, am = a, so a
is seen as a root of the polynomial x
m
 x, where m = pn. Thus x
m
 x and
q(x) have a common root in A. By Lemma 6.3.2 we have that q(x) I(x
m
 x). D
We are now in a position to prove the main result of this section.
Theorem 6.4.2. If K and L are finite fields having the same number of
elements, then K and L are isomorphic fields.
Proof Suppose that K and L have pn elements. By Theorem 6.2.4, L*
is a cyclic group generated, say, by the element b in L. Then, certainly,
7L
p
(b)the field obtained by adjoining b to 7L
p
is all of L. Since
[L : 7L
p
] = n, by Theorem 5.3.2 b is algebraic over 7L
p
of degree n, with
n = deg(q(x», where q(x) is the minimal polynomial in 7L
p
[x] for b, and is ir
reducible in 7L p[x].
The mapping t/J: 7L
p
[x] ~ L = 7L
p
(b) defined by t/J(f(x» = f(b) is a
homomorphism of 7L
p
[x] onto L with kernel (q(x», the ideal of 7L
p
[x] gener
ated by q(x). So L = 7L
p
[x]/(q(x».
Because q(x) is irreducible in 7L
p
[x] of degree n, by Lemma 6.4.1 q(x)
must divide x
m
 x, where m = pn. However, by Lemma 6.3.1, the polyno
mial x
m
 x factors in K[x] as
x
m
 X = (x  at)(x  a
2
) ••• (x  am),
Sec. 5 Cyclotomic Polynomials 229
where ab a2,"" am are all the elements of K. Therefore, q(x) divides
(x  al)(x  a2) ... (x  am)' By the Corollary to Theorem 4.5.10, q(x) can
not be relatively prime to all the x  ai in K[x], hence for some j, q(x) and
x  aj have a common factor of degree at least 1. In short, x  aj must
divide q(x) in K[x], so q(x) = (x  aj)h(x) for some hex) in K[x]. There
fore, q(aj) = O.
Since q(x) is irreducible in Zp[x] and aj is a root of q(x), q(x) must be
the minimal polynomial for aj in Zp[x]. Thus Zp(aj) == Zp[x]/(q(x» == L. This
tells us, among other things, that we have [Zp(aj): Zp] = n, and since
Zp(aj) C K and [K: Zp] = n we conclude that Zp(aj) = K. Therefore,
K = Zp(aj) == L. Thus we get the result that we are after, namely, that K and
L are isomorphic fields. This proves the theorem. 0
Combining Theorems 6.3.4 and 6.4.2, we have
Theorem 6.4.3. For any prime p and any positive integer n there ex
ists, up to isomorphism, one and only one field having pn elements.
5. CYCLOTOMIC POLYNOMIALS
Let C be the field of complex numbers. As a consequence of De Moivre's
Theorem the complex number On = cos 27T/n + i sin 27T/n satisfies O ~ = 1
and 0':: =1= 1 if 0 < m < n. We called On a primitive nth root of unity. The other
primitive nth roots of unity are
O
k (27Tk) + . . (27Tk)
= cos  lSIn 
n n n '
where (k, n) = 1 and 1 :s k < n.
Clearly, On satisfies the polynomial x
n
 1 in Q[x], where Q is the field
of rational numbers. We want to find the minimal (monic) polynomial for On
over Q.
We define a sequence of polynomials inductively. At first glance they
might not seem relevant to the question of finding the minimal polynomial
for On over Q. It will turn out that these polynomials are highly relevant to
that question for, as we shall prove later, the polynomial cPn(x) that we are
about to introduce is a monic polynomial with integer coefficients, is irre
ducible over Q, and, moreover, cPn(8
n
) = o. This will tell us that cPn(x) is the
desired monic minimal polynomial for On over Q.
We now go about the business of defining these polynomials.
230 Special Topics (Optional) Ch.6
Definition. The polynomials 4>n (x) are defined inductively by:
(a) 4>1 (x) = xI.
(b) If n > 1, then 4>n(x) = (x
n
 1)/TI4>d(x), where in the product in the
denominator d runs over all the divisors of n except for n itself.
These polynomials are called the cyclotomic polynomials and cPn(x) is
called the nth cyclotomic polynomial.
At the moment it is not obvious that the 4>n(x) so defined are even
polynomials, nor do we, as yet, have a clue as to the nature of the coefficients
of these polynomials 4>n(x). All this will come in due time. But first we want
to look at some early examples.
Examples
1. cP2(X) = (x
2
 1)/cP1(x) = (x
2
 1)/(x  1) = x + 1.
2. cP3(X) = (x
3
 1)/4>1 (x) = (x
3
 1)/(x  1) = x
2
+ X + 1.
3. 4>4(X) = (x
4
 1)/(cPt (X)cP2(X» = (x
4
 1)/«x  1)(x + 1»
(x
4
 1)/(x
2
 1) = x
2
+ 1.
4. cPs(x) = (X
s
 1)/4>l(X) = (X
s
 1)/(x  1) = x
4
+ x
3
+ x
2
+ X + 1.
x
6
 1
5. 4>6(X) = 4>1 (X)cP2(X)4>3(X)
X
6
 1
 (x  1)(x + 1)(x
2
+ X + 1)
x
3
+ 1
= X+1 = x
2
 X + 1.
We notice a few things about the polynomials above:
1. They are all monic polynomials with integer coefficients.
2. The degree of cPn (x) is cp (n), where cp is the Euler cpfunction, for
1 ::5 n ::5 6. (Check this out.)
3. Each 4>n(x), for 1 ::5 n ::5 6, is irreducible in Q(x). (Verify!)
4. For 1 ::5 n ::5 6, On is a root of 4>n(x). (Verify!)
These few cases give us a hint as to what the general story might be for
all 4>n (x). A hint, yes, but only a hint. To establish these desired properties
for cPn (x) will take some work.
To gain some further insight into these polynomials, we consider a par
Sec. 5 Cyclotomic Polynomials 231
ticular case, one in which n = pm, where p is a prime. To avoid cumbersome
subscripts, we shall denote 4>n(x) by t/J(m)(x), where n = pm. The prime p will
be kept fixed in the discussion. We shall obtain explicit formulas for the
t/J(m)(x)'s and determine their basic properties. However, the method we use
will not be applicable to the general case of 4>n(x). To study the general sit
uation will require a wider and deeper set of techniques than those needed
for t/J(m)(x).
We note one simple example. If p is a prime, the only divisor of p that
is not p itself is 1. From the definition of 4>p(x) = t/J(I)(X) we have that
x
p
 1
t/J(l)(X) = 4>p (x) = xI = x
p

1
+ · · · + x + 1.
Note that in studying the Eisenstein Criterion we showed that this polyno
mial is irreducible in Q(x).
What can we say for the higher f/J(m)(x)'s?
Lemma 6.5.1. For all m 2: 1,
Proof. We go by induction on m.
If m = 1, we showed above that t/J(I)(X) = (x
P
 1)/(x  1) = 1 + x +
x
2
+ . · · + x
p

I
, so the lemma is true in this case.
Suppose that t/J(r) = (x
pr
 1)/(x
pr

1
 1) for all r < m. Consider
t/J<m)(x). Since the only proper divisors of pm are 1, p, p2, . .. ,pmI, from the
definition of t/J(m)(x) we have that
(m) _ x
pm
 1 .
l/J (x)  (x  l)l/J(l)(x) · · . l/J(ml)(x)
By induction, t/J(r)(x) = (x
pr
 1)/(x
pr

1
 1) for r < m, hence
(x  1)t/J(1) (x) . · . t/J(ml)(x)
p 1 p2 1 pml 1
= (x  1) x  x  ... X m2  = x
pm

1
 1.
xI x
P
 1 x
P
 1
But then
pm 1
l/J(m)(x) = X 
x
pm

1
 1
completing the induction and proving the lemma. D
Note here that
pm _ 1
1/,(m)(x) = X = 1 + x
pm

1
+ ... + x(pl)pml
0/ x
pm

1
 1
232 Special Topics (Optional) Ch.6
is a monic polynomial with integer coefficients. Its degree is clearly pml(p  1),
which is indeed q;(pm). Finally, if f) is a primitive pmth root of unity, then
f)pm = 1, but ()pml =1= 1, hence I/J(m) (() = 0; so () is a root of I/J(m)(x). The final
thing we want to know is: Is I/J(m)(x) irreducible over to?
Note that
I/J(m)(x) = 1 + x
pm

1
+ ... + x(pl}pml = 1/J(1)(X
pm

1
)
and we know that l/J(l)(X) is irreducible in Q[x]. We shall use the Eisenstein
Criterion to prove that I/J(m)(x) is irreducible in to[x].
We digress for a moment. If f(x) and g(x) are two polynomials with in
teger coefficients, we define f(x) == g(x) mod p if f(x) = g(x) + pr(x), where
r(x) is a polynomial with integer coefficients. This is equivalent to saying that
the corresponding coefficients of f(x) and g(x) are congruent mod p. Ex
panding (f(x) + g(x»P by the binomial theorem, and using that all the bino
mial coefficients are divisible by p, since p is a prime, we arrive at
(f(x) + g(x»P == f(x)P + g(x)P modp.
Givenf(x) = aox
n
+ alx
n

1
+ ... + an, where the ai are integers, then,
by the above,
f(
x\P = (a x
n
+ a x
n

1
+ . · · +a \P == aPx
np
+ aPx(nl)p + ... + a
P
) 0 1 n) 0 1 n
the latter congruence being a consequence of Fermat's Theorem (the Corol
lary to Theorem 2.4.8). Since f(x
P
) = aox
np
+ al(nl)p + ... + an, we obain
that
f(x
P
) == f(x)P modp.
Iterating what we just did, we arrive at
for all nonnegative k.
We return to our I/J(m)(x). Since I/J(m)(x) = 1/J(1)(X
pm

1
) we have, from the
discussion above, that IfJm)(x) == IfJl)(X
pm

1
) modp. Therefore,
(l)(X + 1)pml = (X + 1)P  1)P
m

1
= (X + 1)P  1)P
m

1
I/J (x + 1)  1 x
= (xr1 + pxp2 + p(p; 1) x
p

3
+ ... + P(P2
1
) x + pyml
== 1/J(1)(X)pm1(pl) modp == I/J(m)(x + 1) modp.
Sec. 5
This tells us that
Cyclotomic Polynomials 233
I/J(m)(x + 1) = X
pm

1
(P1) + pr(x),
where r(x) is a polynomial with integer coefficients. So all the coefficients of
I/J(m)(x + 1), with the exception of its leading coefficient 1, are divisible by p.
If we knew for some reason that the constant term of hex) = I/J(m)(x + 1) was
not divisible by p2, we could apply the Eisenstein Criterion to show that hex)
is irreducible. But what is the constant term of hex) = I/J(m)(x + I)? It is
merely h(O) = l/J(m)(l), which, from the explicit form of I/J(m)(x + 1) that we
found four paragraphs earlier, is exactly p. Thus hex) is irreducible in Q[x],
that is, I/J(m)(x + 1) is irreducible in Q[x]. But this immediately implies that
I/J(m)(x) is irreducible in Q[x].
Summarizing, we have proved
Theorem 6.5.2. For n = pm, where p is any prime and m any nonneg
ative integer, the polynomial 4Jn(x) is irreducible in Q[x].
As we pointed out earlier, this is a very special case of the theorem we
shall soon prove; namely, that 4Jn (x) is irreducible for all positive integers n.
Moreover, the result and proof of Theorem 6.5.2 play no role in the proof of
the general proposition that 4Jn(x) is irreducible in Q[x]. But because of the
result in Theorem 6.5.2 and the explicit form of 4Jn(x) when n = pm, we do
get a pretty good idea of what should be true in general. We now proceed to
the discussion of the irreducibility of 4Jn (x) for general n.
Theorem 6.5.3. For every integer n 2:: 1,
4Jn (x) = (x  0(1» .. · (x  o(cp(n»),
where 0(1), 0(2), ••• , o( t/J(n» are the cp (n) distinct primitive nth roots of unity.
Proof We proceed by induction on n.
If n = 1, then 4Jl (x) = xI, and since 1 is the only first root of unity,
the result is certainly correct in this case.
Suppose that result is true for all m < n. Thus, if din and d =1= n, then,
by the induction, 4Jd(X) = (x  ... (x  where the 0<:) are the
primitive dth roots of unity. Now
x
n
 1 = (x   .. · (x 
where the run over all nth roots of unity. Separating out the primitive nth
roots of unity in this product, we obtain
x
n
 1 = (x  0(1» ... (x  o(cp(n»)v(x),
234 Special Topics (Optional) Ch.6
where v(x) is the product of all the other x  li; thus by our induction hy
pothesis v(x) is the product of the <Pd(X) over all divisors d of n with the ex
ception of d = n. Thus, since
x  (x
n
 1) _ (x  0(1}) · · · (x  o(q>(n»)v(x)
c/>n( )  v(x)  v(x)
= (x  O(l})(X  ff2}) • • · (x  ffcp(n») ,
we have proved the result claimed in the theorem. D
From the form of <Pn(x) in Theorem 6.5.3 we immediately see that
<Pn(x) is a monic polynomial in C[x] of degree q;(n). Knowing this, we prove
that, in fact, the coefficients of cPn (x) are integers. Why is this true? Proceed
ing by induction on n, we may assume this to be the case if din and d =1= n.
Therefore, if v(x) denotes the polynomial used in the proof of Theorem
6.5.3, then (x
n
 l)/v(x) = <Pn(x) E C[x], hence v(x) Ix
n
 1 in C[x]. But, by
the longdivision process, dividing the monic polynomial v(x) with integer
coefficients into x
n
 1 leads us to a monic polynomial with integer coeffi
cients (and no remainder, since v(x) I(x
n
 1) in C[x]). Thus (x
n
 l)/v(x) =
<Pn(x) is a monic polynomial with integer coefficients. As we saw, its degree is
q;(n). Thus
Theorem 6.5.4. For every positive integer n the polynomial <Pn(x) is a
monic polynomial with integer coefficients of degree q;(n), where cp is the
Euler q;function.
Knowing that cPn(x) is a polynomial, we can see that its degree is cp(n)
in yet another way. From <Pn(x) = (x
n
 l)/v(x), using induction on n,
deg(<pn(x)) = n  deg(v(x)) = n  Lq;(d), the sum over all divisors d of n
other than d = n, from the form of v(x). Invoking the result of Theorem
6.2.1, n  Lcp(d) = cp(n), where again this sum is over all din, d =1= n. We
thus obtain that deg(<Pn(x)) = q;(n).
The result we are about to prove is without question one of the most
basic ones about cyclotomic polynomials.
Theorem 6.5.5. For every positive integer n the polynomial <Pn(x) is
irreducible in Q[x].
Proof. Let f(x) in Q[x] be an irreducible polynomial such that
f(x) I<Pn(x). Thus <Pn(x) = f(x)g(x) for some g(x) in Q[x]. By Gauss' Lemma
we may assume that both f(x) and g(x) are monic polynomials with integer
coefficients, thus are in Z[x]. Our objective is to show that <Pn(x) = f(x); if
Sec. 5 Cyclotomic Polynomials 235
this were the case, then, since f(x) is irreducible in Q[x], we would have that
cPn(x) is irreducible in Q[x].
Since cPn(x) has no multiple roots, f(x) and g(x) must be relatively
prime. Let p be a prime number such that p does not divide n. If 0 is a root
of f(x), it is then a root of cPn(x), hence by Theorem 6.5.3 0 is a primitive nth
root of unity. Because p is relatively prime to n, OP is also a primitive nth
root of unity, thus, by Theorem 6.5.3, (}P is a root of cPn(x). We therefore
have that 0 = cPn( OP) = f( OP)g( OP), from which we deduce that either f( OP) =
Oar g(OP) = o.
Our aim is to show that f( OP) = O. Suppose not; then g( OP) = 0, hence
ois a root of g(x
P
). Because 0 is also a root of the irreducible polynomial
f(x), by Lemma 6.3.2 we obtain that f(x) Ig(x
P
). As we saw in the course of
the proof of Theorem 6.5.2, g(x
P
) == g(x)P mod p.
Let J be the ideal in 7L generated by p; by the Corollary to Theorem
4.6.2, 7L[x]/J[x] == 7L
p
[x] , which means that reducing the coefficients of any
polynomial modp is a homomorphism of 7L[x] onto 7L
p
[x].
Since all the polynomials cPn(x), v(x), f(x), and g(x) are in il[x], if
~ n ( x ) , v (x), !(x), and g(x) are their images in 7L
p
[X] , all the relations among
them are preserved going mod p. Thus we have the relations x
n
 1 =
~ n ( x ) v ( x ) , ~ n ( x ) =j(x)g (x) and l(x) I g(x
P
) = g(x)P.
Therefore, f(x) and g(x) have a common root, a, in some extension K
of 7L
p
• Now x
n
 1 = (f>n (x)v(x) =l(x)g (x) , hence a, as a root of both f(x)
and g(x), is a multiple root of x
n
 1. But the formal derivative (x
n
 1)' of
x
n
 1 is nx
n

1
=1= 0, since p does not divide n; therefore, (x
n
 1)' is rela
tively prime to x
n
 1. By the result of Problem 3 of Section 3 the polyno
mial x
n
 1 cannot have a multiple root. With this contradiction arrived at
from the assumption that OP was not a root of f(x), we conclude that when
ever 0 is a root of f(x), then so must OP be one, for any prime p that does not
divide n.
Repeating this argument, we arrive at: 0' is a root of f(x) for every in
teger r that is relatively prime to n. But 0, as a root of f(x), is a root of cPn(x),
so is a primitive nth root of unity. Thus 0' is also a primitive nth root of unity
for every r relatively prime to n. By running over all r that are relatively
prime to n, we pick up every primitive nth root of unity as some such 0'.
Thus all the primitive nth roots of unity are roots of f(x). By Theorem 6.5.3
we see that cPn(x) =f(x), hence cPn(x) is irreducible in Q[x]. D
It may strike the reader as artificial and unnatural to have resorted to
the passage mod p to carry out the proof of the irreducibility of a polynomial
with rational coefficients. In fact, it may very well be artificial and unnatural.
As far as we know, no proof of the irreducibility of cPn(x) has ever been given
236 Special Topics (Optional) Ch.6
staying completely in Q[x] and not going mod p. It would be esthetically sat
isfying to have such a proof. On the other hand, this is not the only instance
where a result is proved by passing to a related subsidiary system. Many the
orems in number theoryabout the ordinary integershave proofs that ex
ploit the integers mod p.
Because 4>n(x) is a monic polynomial with integer coefficients which is
irreducible in Q[x], and since On, the primitive nth root of unity, is a root of
4>n(x), we have
Theorem 6.5.6. 4>n(x) is the minimal polynomial in Q[x] for the primi
tive nth roots of unity.
PROBLEMS
1. Verify that the first six cyclotomic polynomials are irreducible in Q[x] by a
direct frontal attack.
2. Write down the explicit forms of:
(a) 4>10(X).
(b) 4>15 (x).
(c) 4>20(X).
3. If (x
m
 1) I(x
n
 1), prove that min.
4. If a > 1 is an integer and (am  1) I(an  1), prove that min.
5. If K is a finite extension of Q, the field of rational numbers, prove that
there is only a finite number of roots of unity in K. (Hint: Use the result of
Problem 10 of Section 2, together with Theorem 6.5.6.)
6. LIOUVILLE'S CRITERION
Recall that a complex number is said to be algebraic of degree n if it is the
root of a polynomial of degree n over Q, the field of rational numbers, and is
not the root of any such polynomial of degree lower than n. In the terms used
in Chapter 5, an algebraic number is a complex number algebraic over Q.
A complex number that is not algebraic is called transcendental. Some
familiar numbers, such as e, 1T, e'Tr, and many others, are known to be tran
scendental. Others, equally familiar, such as e + 1T, e1T, and 1T
e
, are suspected
of being transcendental but, to date, this aspect of their nature is still open.
The French mathematician Joseph Liouville (18091882) gave a crite
rion that any algebraic number of degree n must satisfy. This criterion gives
us a condition that limits the extent to which a real algebraic number of de
gree n can be approximated by rational numbers. This criterion is of such a
Sec. 6 Liouville's Criterion 237
nature that we can easily construct real numbers that violate it for every
n > 1. Any such number will then have to be transcendental. In this way we
shall be able to produce transcendental numbers at will. However, none of
the familiar numbers is such that its transcendence can be proved using Liou
ville's Criterion.
In this section of the book we present this result of Liouville. It is a sur
prisingly simple and elementary result to prove. This takes nothing away
from the result; in our opinion it greatly enhances it.
Theorem 6.6.1 (Lionville). Let a be an algebraic number of degree
n 2:: 2 (i.e., a IS algebraic but not rational). Then there exists a positive con
stant c (which depends only on a) such that for all integers u, v with
v > 0, la  u/vl > c/v
n
.
P,oof. Let a be a root of the polynomial f(x) of degree n in Q[x],
where a is the field of rational numbers. By clearing of denominators in the
coefficients of f(x), we may assume that f(x) = 'oX
n
+ 'lX
n

l
+ ... + 'n'
where all the 'i are integers and where '0 > o.
Since the polynomial f(x) is irreducible of degree n it has n distinct
roots a = al, a2, ... , an in C, the field of complex numbers. Therefore, f(x)
factors over C as f(x) = ro(x  a)(x  a2) · . · (x  an). Let u, v be integers
with v > 0; then
( )
n, u
n

I
f
!!:. = 'oU + ~ + ... + 'nI
U
+ ,
V v
n
un I un'
hence
v
nf
(!!:') = r un + runIv + . · . + r UV
n

1
+ r Un
V 0 I nI n
is an integer. Moreover, since f(x) is irreducible in Q[x] of degree n 2:: 2, f(x)
has no rational roots, so vnf(u/v) is a nonzero integer, whence Iunf(u/u) 12:: 1.
Using the factored form of f(x), we have that
I f(u/v)1
'ol(u/v)  azl · · · I(u/v)  a" I
unlf(u/u)\
hence
t(;) = ro( ( ;)  a)(( ;)  az) · · · ( ( ;)  an)'
1(;) al
roifl(u/v)  azl · . · I(u/v)  ~ I
::> 1
 rovnl(u/u)  azl · · · I(u/u)  ani·
238 Special Topics (Optional) Ch.6
Let s be the largest of lal, la21, ... , lanl. We divide the argument accord
ing as Iu/ul > 15 or IUlul ::5 15. If lu/ul > 15, then, by the triangle inequality,
la  (u/u)1 ;::: lu/ul  lal > 15  s = s, and, since u ;::: 1, la  (ulu)1 > slu
n
.
On the other hand, if Iu/uI ::5 2s, then, again by the triangle inequality,
lai  (ulu)1 ::::; lail + lu/ul ::::; s + 2s = 3s. Therefore,
so that l/t ;::: 1/(3s)nl = 1/(3
n

l
s
n

l
). Going back to the inequality above
that la  (ulu)1 ;::: 1/[rov
n
la2  (u/v)1 ... Ian  (u/u)/], we have that
la  (u/u)1 ;::: 1/(r03nlsnlun). These numbers ro, 3
n

1
, snl are determined
once and for all by a and its minimal polynomialf(x) and do not depend on u
or u. If we let b = 1/(r03nlsnl), then b > 0 and la  (u/u)1 > blv
n
. This
covers the second case, where IUlul ::5 2s.
If c is a positive number smaller than both band s, we have from the
discussion that la  u/ul > c/u
n
for all integers u, u, where u > 0, thereby
proving the theorem. D
Let's see the particulars of the proof for the particular case a = v2.
The minimal polynomial for a in Q[x] isf(x) = (x  a)(x + a), so a = al and
a = a2' So if u and u are integers, and v > 0, then
an integer. So lu
2
f(u/u)1 ;::: 1 ;::: 1/u
2
. The s above is the larger of 1v2/
and 1 v21; that is, s = v2. Also, the b above is 1/(3
2

1
(v2)21) = 1/(3v2),
so if c is any positive number less than 1/(3v2), then 1v2  ulu I > clv
2
•
What the theorem says is the following: Any algebraic real number has
rational numbers as close as we like to it (this is true for all numbers), but if
this algebraic real number a is of degree n ;::: 2, there are restrictions on how
finely we can approximate a by rational numbers. These restrictions are the
ones imposed by Liouville's Theorem.
How do we use this result to produce transcendental numbers? All we
need do is to produce a real number T, say, such that whatever positive inte
ger n may be, and whatever positive c we choose, we can find a pair of inte
gers u, u, with v > 0 such that IT  u/u I < clu
n
. We can find such a T easily by
writing down an infinite decimal involving O's and 1's, where we make the O's
spread out between the l's very rapidly. For instance, if T =
0.10100100000010 ... 010 ... , where the O's between successive 1's go like
m!, then T is a number that violates Liouville's Criterion for every n > O.
Thus this number T is transcendental.
Sec. 7 The Irrationality of 'IT 239
We could, of course, use other wide spreads of O's between the l's
mm, (m!)2, and so onto produce hordes of transcendental numbers. Also,
instead of using just 1's, we could use any of the nine nonzero digits to obtain
more transcendental numbers. We leave to the reader the verification that
the numbers of the sort we described do not satisfy Liouville's Criterion for
any positive integer n and any positive c.
We can use the transcendental number 'T and the variants of it we de
scribed to prove a famous result due to Cantor. This result says that there is
a 11 correspondence between all the real numbers and its subset of real
transcendental numbers. In other words, in some sense, there are as many
transcendental reals as there are reals. We give a brief sketch of how we
carry it out, leaving the details to the reader.
First, it is easy to construct a 11 mapping of the reals onto those reals
strictly between 0 and 1 (try to find such a mapping). This is also true for the
real transcendental numbers and those of them strictly between 0 and 1. Let
the first set be A and the second one B, so B C A. Then, by a theorem in set
theory, it suffices to construct a 11 mapping of A into B.
Given any number in A, we can represent it as an infinite decimal
0.ala2 ... an ... , where the ai fall between 0 and 9. (We now wave our hands
a little, being a little bit inaccurate. The reader should try to tighten up
the argument.) Define the mapping [from A to B by [(0.ala2 ... an ...) =
0.alOaZOOa3000000a4 ... ; by the Liouville Criterion, except for a small set of
ab a2'· .. ' an'··.' the numbers 0.alOaZOOa3000000a4 ... are transcendental.
The fwe wrote down then provides us with the required mapping.
One final word about the kind of approximation of algebraic numbers
by rationals expressed in Theorem 6.6.1. There we have that if a is real alge
braic of degree n 2:: 2, then la  u/u I > c/u
n
for some appropriate positive c.
If we could decrease the n to la  u/u I > c/u
m
for m < n and some suitable c
(depending on a and m), we would get an even sharper result. In 1955 the
(then) young English mathematician K. F. Roth proved the powerful result
that effectively we could cut the n down to 2. His exact result is: If a is alge
braic of degree n 2:: 2, then for every real number r > 2 there exists a positive
constant c, depending on a and r, such that la  u/ul > c/u' for all but a finite
number of fractions u/u.
7. THE IRRATIONALITY OF 'IT'
As we indicated earlier, Lindemann in 1882 proved that 1T is a transcendental
number. In particular, from this result of Lindemann it follows that 1T is irra
tional. We shall not prove the transcendence of 1T hereit would require a
240 Special Topics (Optional) Ch.6
rather long detourbut we will, at least, prove that 'TT' is irrational. The very
nice proof that we give of this fact is due to I. Niven; it appeared in his paper
"A Simple Proof That 'TT' Is Irrational," which was published in the Bulletin
of the American Mathematical Society, vol. 53 (1947), p. 509. To follow
Niven's proof only requires some material from a standard firstyear calculus
course.
We begin with
Lemma 6.7.1. If u is a real number, then lim unln! = O.
n +00
Proof. If u is any real number, then e
U
is a welldefined real number
and e
U
= 1 + u + u
2
/2! + u
3
/3! + . · . + unln! + · · .. The series 1 + u + u
2
/2! +
· .. + unln! + ... converges to e
U
; since this series converges, its nth term
must go to O. Thus lim unln! = O. D
n +00
We now present Niven's proof of the irrationality of 7T.
Theorem 6.7.2. 'TT'is an irrational number.
Proof. Suppose that 7T is rational; then 7T = alb, where a and b are pos
itive integers.
For every integer n > 0, we introduce a polynomial, whose properties
will lead us to the desired conclusion. The basic properties of this polynomial
will hold for all positive n. The strategy of the proof is to make a judicious
choice of n at the appropriate stage of the proof.
Let f(x) = xn(a  bx)nln!, where 7T = alb. This is a polynomial of de
gree 2n with rational coefficients. Expanding it out, we obtain
a x
n
+ a x
n
+
l
+ · .. + a
n
x
2n
f(x) = Q 1 n! '
where
_ nlb _ (1)'n! nibi  (_ )nbn
a
Q
= an, a
l
= na , . . . ,a
i
 0' ( _ 0)' a , . . . ,an  1
l. n l.
are integers.
We denote the ith derivative of f(x) with respect to x by the usual no
tation f(i)(x), understanding f(O) (x) to mean f(x) itself.
We first note a symmetry property of f(x), namely, that f(x) =
f( 7T  x). To see this, note that f(x) = (bnln!)x
n
('TT'  x)n, from whose form it
is clear that f(x) = f( 'TT'  x). Since this holds for f(x), it is easy to see, from
the chain rule for differentiation, that f(i)(x) = (1Yf(i)( 7T  x).
Sec. 7 The Irrationality of 1t' 241
This statement about f(x) and all its derivatives allows us to conclude
that for the statements that we make about the nature of all the f(i)(O), there are
appropriate statements about all the f(i)( 11').
We shall be interested in the value off(i)(O), andf(i)(11'), for all nonneg
ative i. Note that from the expanded form of f(x) given above we easily ob
tain that f(i)(O) is merely i! times the coefficient of Xi of the polynomial f(x).
This immediately implies, since the lowest power of x appearing in f(x) is
the nth, that f(i)(O) = 0 if i < n. For i 2:: n we obtain that f(i) (0) = i!ain/n!;
since i 2:: n, i!/n! is an integer, and as we pointed out above, ain is also an
integer; therefore f(i) (0) is an integer for all nonnegative integers i. Since
f(i)(11') = (lYf(O), we have that f(i)(11') is an integer for all nonnegative
integers i.
We introduce an auxiliary function
Since f(m) (x) = 0 if m > 2n, we see that
=  F(x) + f(x).
Therefore,
:x (F'(x) sin x  F(x) cos x) = F"(x) sin x + F'(x) cos x
 F'(x) cos x + F(x) sin x
= (FI/(x) + F(x» sin x = f(x) sin x.
From this we conclude that
i
1T
f(x) sin x dx = [F'(x) sin x  F(x) cos x ] ~
= (F'( 11') sin 11'  F( 11') cos 11')  (F'(O) sin 0  F(O) cos 0)
= F( 11') + F(O).
But from" the form of F(x) above and the fact that all f(i)(O) and f(i)(17) are
integers, we conclude that F( 11') + F(O) is an integer. Thus fof(x) sin x dx is
an integer. This statement about fa f(x) sin x dx is true for any integer n > 0
whatsoever. We now want to choose n cleverly enough to make sure that the
statement "fa f(x) sin x dx is an integer" cannot possibly be true.
242 Special Topics (Optional) Ch.6
We carry out an estimate on fof(x) sin x dx. For 0 < x < 11" the func
tion f(x) = xn(a  bx)nln! :s; ~ a n l n ! (since a > 0), and also 0 < sin x :s; 1.
Thus 0 < fo f(x) sin x dx < fo 1I"na
n
ln! dx = 1I"n+la
n
lnL
Let u = 1I"a; then, by Lemma 6.7.1, lim unln! = 0, so if we pick n large
enough, we can make sure that unln! < ii;,oohence 1I"n+la
n
ln! = 'lTUnln! < 1.
But then fof(x) sin x dx is trapped strictly between 0 and 1. But, by what we
have shown, fKf(x) sin x dx is ~ n integer. Since there is no integer strictly be
tween 0 and 1, we have reached a contradiction. Thus the premise that 11" is
rational was false. Therefore, 11" is irrational. This completes the proof of
the theorem. 0
INDEX
A(S),16
Abel, 43
Abelian group, 43
Algebraic
element, 193
extension, 193
number, 194
Algebraically closed field, 200
Alternating group, 121,215
Associative law, 12,41
Associative ring, 126
Automorphism of a group, 68
inner, 69
Automorphism of a ring, 141
Basis, 187
Bertrand Postulate, 220
Bijection, 11
Boole, 138
Boolean ring, 138
Cantor, 239
Carroll, 7
Cartesian product, 5
Cauchy, 80
Cauchy's Theorem, 80, 89
Cayley, 69
Cayley's Theorem, 69
Center, 53
Centralizer, 53, 102
Characteristic subgroup, 76
Characteristic of a field, 178
Chinese Remainder Theorem, 147
Class Equation, 103
Commutative ring, 127
Commuting elements, 53
Commuting mappings, 21
Complement of a set, 5
Complex number, 32
absolute value of, 34
argument of, 35
complex conjugate of, 32
imaginary part of, 32
polar form of, 35
purely imaginary, 32
real part of, 32
Index 245
Composition of mappings, 11
Congruence, 57
class, 60
Conjugacy, 58
class, 58
Conjugate
elements, 58
of a complex number, 32
Constant
function, 9
polynomial, 153
Constructible
length, 202
number, 204
Correspondence Theorem
for groups, 86
for rings, 142
Coset
left, 64
right, 58
Cycle of a permutation, 111
Cyclic group, 53, 55,60
generator of, 53
Cyclotomic polynomial, 230
Degree
of an algebraic element, 195
of a field extension, 191
of a polynomial, 153
De Moivre's Theorem, 36
De Morgan Rules, 6
Dihedral group, 45, 116,117
Dimension of a vector space, 186
Direct product of groups, 93
Direct sum
of rings, 146
of vector spaces, 181, 182
Divides, 22, 157
Division Algorithm, 155
Division ring, 127
Divisor, 22
Domain
integral, 127
principal ideal, 157
Duplication of a cube, 201
246 Index
Eisenstein Criterion, 169
Element, 3
algebraic, 193
identity, 41
invertible, 133
orbit of, 21, 65
transcendental, 194
unit, 41
Empty set, 4
Equality
of mappings, 9
of sets, 4
Equivalence class, 58
Equivalence relation, 57
Euclid, 27
Euclidean ring, 163
Euclid's Algorithm, 22
Euler, 63
Euler 'Pfunction, 62
Euler's Theorem, 63
Extension field, 191
Factor, 22
Factor group, 78
Factorial, 17
Fermat, 63
Fermat's Theorem, 63
Field, 127
algebraically closed, 200
characteristic of, 178
extension, 191
of algebraic numbers, 199,200
of quotients, 172175
of rational functions, 177, 179
splitting, 213
Field extension, 191
algebraic, 193
degree of, 191
finite, 191
Finite abelian groups, 96
Finite dimensional vector space, 185
First Homomorphism Theorem
for groups, 85
for rings, 142
Formal derivative, 227
Function, 8
constant, 9
Euler, 62
identity, 9
injective, 10
onetoone, 10
onto, 10
surjective, 10
Fundamental Theorem
of Algebra, 200
on Finite Abelian Groups, 100
Galois theory, 212
Gauss, 169
Gauss' Lemma, 168
Gaussian integers, 38, 166
Greatest common divisor
of integers, 23
of polynomials, 158
Group, 41
abelian, 43
alternating, 121, 215
axioms, 41
center of, 53
cyclic, 53, 55, 60
dihedral, 45, 116, 117
factor, 78
finite, 42
Hamiltonian, 72
homomorphism, 67
Klein's, 116
nonabelian, 43
octic, 116
order of, 42
quotient, 78
simple, 123,216
Hamilton, 72, 134
Hamiltonian group, 72
Hardy, 201
Hermite, 194
Homomorphism of groups, 67
image of, 70
kernel of, 70
trivial,67
Homomorphism of rings, 139
kernel of, 140
Homomorphism Theorems
for groups, 8487
for rings, 142
Ideal, 140
left, 140
maximal, 148
right, 140
trivial, 142
twosided, 140
Identity element, 41
Identity function, 9
Image, 8
inverse, 10
Index of a subgroup, 59
Induction, 29
Induction step, 31
Inductive hypothesis, 30
Injective mapping, 10
Integers, 21
Gaussian, 38, 166
relatively prime, 25
Integral domain, 127
Intersection of sets, 4
Invariants of abelian groups, 100
Inverse
of a group element, 41
of a mapping, 12
Invertible element, 133
Irreducible polynomial, 159
Isomorphic groups, 68
Isomorphic rings, 141
Isomorphism
of groups, 68
of rings, 141
Kernel of a homomorphism
for groups, 70
for rings, 140
Klein's 4group, 116
Lagrange, 59
Lagrange's Identity, 133
Lagrange's Theorem, 59
Leading coefficient, 162
Least common multiple, 28
Lindemann, 194
Linear combination, 185
Linear dependence, 186
Linear independence, 185
Liouville, 236
Liouville's Criterion, 236
Index 247
Mappings, 8
commuting, 21
composition of, 11
identity, 9
injective, 10
onetoone, 10
onto, 10
product of, 11
surjective, 10
Mathematical induction, 29
Matrices
real 2 x 2,130,131
2 X 2 over a ring, 131
Maximal ideal, 148
McKay, 89
Minimal generating set, 186
Minimal polynomial, 195
Monomorphism
of groups, 68
of rings, 141
Motion of a figure, 115
Multiple, 22
Multiplicity, 209
Niven, 240
Normal subgroup, 71
index of, 59
Null set, 4
Number
algebraic, 194
complex, 32
prime, 21, 26
purely imaginary, 32
transcendental, 194, 236
Octic group, 116
Onetoone
correspondence, 11
mapping, 10
Orbit, 21, 65
Order
of an element, 60
of a group, 42
Partition
of a set, 58
of a positive integer, 100
248 Index
Permutation, 16, 109
even, 121
odd, 121
Polynomial, 152, 179
coefficients of, 152
constant, 153
cyclotomic, 230
degree of, 153
irreducible, 159
leading coefficient of, 162
minimal, 195
monic, 157
relatively prime, 159
root of, 208
Polynomial ring, 152
Prime number, 21, 26
Primitive root of unity, 37, 39, 229
Primitive root mod p, 66
Principal ideal domain, 157
Principle of Mathematical Induction, 29
Product of mappings, 11
Projection, 9
Quadratic nonresidue, 151
Quadratic residue, 151
Quaternions, 131, 136
Quotient group, 78
Rational functions, 177, 179
Reflexivity, 57
Relation, 56
congruence relation, 57
conjugacy relation, 58
equivalence relation, 57
Relatively prime
integers, 25
polynomials, 159
Ring, 126
associative, 126
Boolean, 138
commutative, 127
division, 127
Euclidean, 166
homomorphism, 139
noncommutative, 127
of polynomials, 152, 179
with unit, 126
Roots of unity, 42
primitive, 37, 39, 229
Root of a polynomial, 208
multiplicity of, 209
Roth, 239
Second Homomorphism Theorem
for groups, 86
for rings, 142
Sets, 3
Cartesian product of, 5
difference of, 4
equality of, 4
intersection of, 4
union of, 4
Simple group, 123, 216
Splitting field, 213
Squaring of a circle, 207
Subfield, 128, 191
Subgroup, 51
characteristic, 76
cyclic, 53
index of, 59
normal, 71
proper, 51
Sylow, 101
trivial, 51
Subring, 129
Subset, 3
Subspace of a vector space, 181
spanned by elements, 181
Surjective mapping, 10
Sylow, 104
Sylow subgroup, 101, 105
Sylow's Theorem, 105
Symmetric group, 16, 109
Symmetry, 57
Third Homomorphism Theorem
for groups, 87
for rings, 142
Transcendental
element, 194
number, 194, 236
Transitivity, 57
Transposition, 20, 111
Triangle Inequality, 34
Trisection of an angle, 201
Union of sets, 4
Unit element
of a group, 41
in a ring, 126
Vector space, 180
basis of, 187
dimension of, 186
Index 249
finite dimensional, 185
infinite dimensional, 185
minimal generating set for, 186
WellOrdering Principle, 22
Wiles, 63
Wilson's Theorem, 65, 210
Zero divisor, 128
PRENTltE BAll
IJpper Saddle River, NI 07548
http://www.prenhall.tOm
ABSTRACT ALGEBRA
ABSTRACT ALGEBRA
Third Edition
I. N. Herstein
Late Professor of Mathematics University of Chicago
II
PRENTICEHALL, Upper Saddle River, New Jersey 07458
@
Library of Congress Cataloging in Publication Data
Herstein, I. N. Abstract algebra / I.N. Herstein.  3rd ed. p. em. Includes index. ISBN 0133745627 (alk. paper) 1. Algebra, Abstract. I. Title QA162.H47 1995 9521470 512' .02dc20 CIP
Acquisition Editor: George Lobell EditorinChief: Jerome Grant Production Editor: Elaine Wetterau/University Graphics, Inc. Art Director: Jayne Conte Cover Designer: Bruce Kenselaar Cover Photograph: Charnley Residence, entryway. Photo by © Nick Merrick/HedrichBlessing Manufacturing Buyer: Alan Fischer Marketing Manager: Frank Nicolazzo
II =
© 1996 PrenticeHall, Inc. Simon & Schuster/A Viacom Company Upper Saddle River, New Jersey 07458
All rights reserved. No part of this book may be reproduced, in any form or by any means, without permission in writing from the publisher. Earlier editions © 1986 and 1990. Printed in the United States of America 10 9 8 7 6 5 4 3 2
ISBN 0133745627
PrenticeHall International (UK) Limited, London PrenticeHall of Australia Pty. Limited, Sydney PrenticeHall Canada Inc., Toronto PrenticeHall Hispanoamericana, S.A., Mexico PrenticeHall of India Private Limited, New Delhi PrenticeHall of Japan, Inc., Tokyo Simon & Schuster Asia Pte. Ltd., Singapore Editora PrenticeHall do Brasil, Ltda., Rio de Janeiro
To Biska
CONTENTS Preface ix 1 Things Familiar and Less Familiar 1 2 3 4 1 5 6 7 A Few Preliminary Remarks 1 Set Theory 3 Mappings 8 A(S) (The Set of 11 Mappings of S onto Itself) The Integers 21 Mathematical Induction 29 Complex Numbers 32 16 2 Groups 1 2 3 4 5 6 7 8 9 10 11 40 Definitions and Examples of Groups 40 Some Simple Remarks 48 Subgroups 51 Lagrange's Theorem 56 Homomorphisms and Normal Subgroups 66 Factor Groups 77 The Homomorphism Theorems 84 Cauchy's Theorem 88 Direct Products 92 Finite Abelian Groups (Optional) 96 Conjugacy and Sylow's Theorem (Optional) 101 vii .
and Quotient Rings 139 Maximal Ideals 148 Polynomial Rings 151 Polynomials over the Rationals 166 172 Field of Quotients of an Integral Domain 5 Fields 1 176 Examples of Fields 176 A Brief Excursion into Vector Spaces Field Extensions 191 Finite Extensions 198 Constructibility 201 Roots of Polynomials 207 180 2 3 4 5 6 6 Special Topics (Optional) 1 2 3 4 5 6 7 215 The Simplicity of An 215 Finite Fields I 221 Finite Fields II: Existence 224 Finite Fields III: Uniqueness 227 Cyclotomic Polynomials 229 Liouville's Criterion 236 239 The Irrationality of 'IT 243 Index . Homomorphisms.viii Contents 3 4 The Symmetric Group 1 2 3 108 Preliminaries 108 Cycle Decomposition 111 Odd and Even Permutations 119 Ring Theory 1 125 2 3 4 5 6 7 125 Definitions and Examples Some Simple Results 137 Ideals.
The students might then go over several past examples of finite groups and explicitly set up isomorphisms with subgroups of Sn.1 and 3. clarified and expanded some proofs.5. corrected minor errors. After Section 2. straightened out inconsistencies. At the same time. We feel that one of the book's virtues is the fact that it covers a big chunk of abstract algebra in a condensed and interesting way. in order to provide more examples of groups.5. thereby disturbing the original balance. one could spend an hour discussing permutations and their cycle decomposition (Sections 3. without trivializing the subject. and added a few examples.2). We have.PREFACE TO THE THIRD EDITION When we were asked to prepare the third edition of this book. To resolve the many typographical problems of the second edition. Rather than alter the arrangement of the contents. it would have the beneficial result of making the students more comfortable with the concept of an isomorphism. which addresses this concern. we suggest an alternate route through the material. it was our consensus that it should not be altered in any significant way. and that Herstein's informal st~le should be preserved. however. At the same time. leaving the proofs until later. It has been pointed out to us that some instructors would find it useful to have the Symmetric Group Sn and the cycle notation available in Chapter 2. The instructor could then weave in the various subgroups of the Symmetric Groups Sn as examples throughout the remainix . Prentice Hall has had the book completely retypesetmaking it easier and more pleasurable to read.This exercise would be motivated by Cayley's theorem. it remains accessible to most undergraduates. quoted in Section 2.
a Symbol List has been included to facilitate keeping track of terminology. we take this opportunity to thank the many individuals whose collective efforts have helped to improve this edition. Two changes in the format have been made since the first edition. and Robert Zinc from Purdue University. Winter .2 after Section 2.4.x Preface to Third Edition Ch. we thank George Lobell and Elaine Wetterau. As such. Finally. Second. And. they should be read carefully. First. Donald Passman from the University of Wisconsin. If desired. Barbara Cortzen David J.6 der of Chapter 2. and others at Prentice Hall who have been most helpful. a few problems have been marked with an asterisk (*). of course.1 and 3. These serve as a vehicle to introduce concepts and simple arguments that relate in some important way to the discussion. We thank the reviewers: Kwangil Koh from North Carolina State University. one could even introduce Sections 3.3 or 2.
this book should serve as an introductionand. only as an introductionto this fascinating subject. and computer science. the importance of the results and concepts of abstract algebra play an ev. this book could serve that purpose. It has been a fertile and rewarding research area both in the last 100 years and at the present moment. Exciting results have been proved in group theory. A subject that was once regarded as esoteric has become considered as fairly downtoearth for a large cross section of scholars. we stress.r more important role in physics. commutative and noncommutative ring theory. Some of the great accomplishments of our twentiethcentury mathematics have been precisely in this area. Lie algebras. Jordan algebras. In mathematics itself abstract algebra plays a dual role: that of a unifying link between disparate parts of mathematics and that of a research subject with a highly active life of its own. The purpose of this book is twofold. and a host of other parts of what is known as abstract algebra. For interested readers who want to learn what is going on in an engaging part of modern mathematics. but also in a variety of other disciplines. combinatories. as well as provide them with some highly usable tools to apply in the areas in which they are interested. chemistry. The choice of subject matter has been made with the objective of introducing readers to some of the fundamental algebraic systems that are both inxi . For instance. For those readers who either want to go on to do research in mathematics or in some allied fields that use algebraic notions and methods.PREFACE TO THE FIRST EDITION In the last halfcentury or so abstract algebra has become increasingly important not only in mathematics itself. to cite a few such outside fields.
except in one sectionthe second last one of the bookwhere we make implicit use of the fact that a polynomial over the complex field has complex roots (that is the celebrated Fundamental Theorem ofAlgebra due to Gauss). Readers should not become discouraged if they do not manage to solve all the problems. Ostedt of the Macmillan Company for his enthusiasm for the project and for bringing it to publication. We should like to express our special thanks to Professor Martin Isaacs for his highly useful comments. Gary W. As the reader will soon see. Thus the presentation is somewhat chatty. and to be a means of developing mathematical insight. and to Mr. Many of the changes they suggested have been incorporated and should improve the readability of the book. and significant results in each of the systems we have chosen to discuss. to prepare the ground for material that is yet to come. With this we wish all the readers a happy voyage on the mathematical journey they are about to undertake into this delightful and beautiful realm of abstract algebra. They are often referred to as the discussion progresses. and techniques. to challenge their mathematical ingenuity. in each of these systems the aim has been to arrive at some significant results. applicable.H. intuition. We hope that we have achieved the goal of presenting interesting.N. Trying to do the problems is undoubtedly the best way of going about learning the subject. and harder (with an occasional very hard). We are grateful to many people for their comments and suggestions on earlier drafts of the book.xii Preface to First Edition teresting and of wide use. We are also grateful to Fred Flowers for his usual superb job of typing the manuscript. we hope that this will put the readers at their ease. Some of these examples are carried forward to be examples of other phenomena that _come up. Moreover. there are many exercises in the book. They are often divided into three categories: easier. We have strived to present the material in the language and tone of a classroom lecture. We feel that the book is selfcontained. An attempt is made to give many and revealing examples of the various concepts discussed. Some of the problems appear several times in the book. The purpose of these problems is to allow students to test their assimilation of the material. The intent of many of the problems is that they be triedeven if not solvedfor the pleasure (and frustration) of the reader. . I. middlelevel. and in the last section where we make use of a little of the calculus. There is little purpose served in studying some abstract object without seeing some nontrivial consequences of the study.
SYMBOL LIST aES a$S a is an element of the set S. T:J S S= T ~ o AUB AnB {s E Sis satisfies P} AB A' (a. 3 S is a subset of the set T. 5 The set of real numbers. 21 Orbit of s relative to mapping f. 18 Set of 11 mappings from a set S to S. 16. 10 Inverse image of a subset A of T under f: S ~ T. 10 Composition or product of functions f and g. 4 The difference of the sets A and B. 4 The union of the sets A and B. b) AXB IR f:S~T f(s) i: S ~ S. 5 Ordered pair consisting of a. b (see also below). fg A(S) Sn n! 7L O(s) a is not an element of the set S. 3 The sets Sand T are equal (have the same elements). 4 The complement of A. 4 The subset of elements of S satisfying P. is fl(t) fl(A) fo g. 3 SeT. 17 Set of integers. 9 Inverse image of t under f. 4 The empty set. 8 Function from the set S to the set T. 8 Image of the element s under the function f. 21 xiii .109 n factorial. 11. 16 Symmetric group of degree n. 5 The Cartesian product of A and B. 8 The identity function on S. 4 The intersection of the sets A and B.
93 Permutation sending a to u.xiv Symbol List N mIn m~n (a. 78 Product of subsets A.'7 Class of all b equivalent to a. 42 Set of rational numbers. 53 Center of group G. 22 m does not divide n. 32 Square roots of 1. 131 Determinate of the 2x2 matrix x. . . 32 Conjugate of complex number z = a + bi. 34 Polar form of a complex number. 136 . 68 Image of homomorphism. G 2. 42 Order of the group G. ••• . 59 Set of integers mod n. 102 Cyclic group generated by a. 121.c) An ao + ali + a2j + a3 k detx c) Set of positive integers. 79 Direct product of G}.. b. . 35 Primitive nth root of unity.bi r(cos 9 + i sin 9) 9n Q En Izl IGI C(a) (a) Z(G) ab a == bmodn a == ben) [a] cl(a) o(a) iG(H) 7L n Un 'P(n) Hb aH G=G' 'P(G) Ker 'P N<JG GIN AB G 1 X G 2 X . 101 Order of element a of a group. X G n a b (u v w (a. 70. 53 a is equivalent to b in a specified sense. 61 Group of invertible elements of 7L n . 36.. 22 Greatest common divisor of a. 215 Quaternion. 33 Absolute value of complex number z. 21 m divides n. b) C i. 70 Kernel of the homomorphisms 'P.. 42 Centralizer of a in G. G n . 111 Alternating group of degree n.. 58. 42 Group of nth roots of unity. 140 N is a normal subgroup of G. 57 a is congruent to b modulo n (short form). 64 Group G is isomorphic to group G'. . 62 Euler 'P function (phi function). B of a group. 32 Inverse of the complex number z.. 62 Right coset of subgroup H. . 32 Complex number z with real part a and imaginary part b. 110 Cycle sending a to b..c to a. 72 Quotient of a group G by a subgroup N. 58 Conjugacy class of a. 53. b (see also above). 57 a is congruent to b modulo n (long form). . i z = a + bi z= lIz a . 23 Set of complex numbers.c to w. 58 Left coset of subgroup H. b to v. 60 Index of H in G.
196 Field extension obtained by adjoining to a to F. + anV n (Vb V2' • • • . 180 Linear combination of vectors Vt . 180 Scalar a times vector v. 227 nth cyclotomic polynomial. 196 Field of algebraic elements of Kover F. 181 Dimension of V over F.Symbol List xv H(F) R(f)S (a) F[x] deg p(x) (g(x)) g(x) I f(x) R[x] F(x) vE V av atvt + · . W.. S. 191 Ring generated by a over F.Vn . 136 Direct sum of rings R. 181 Direct sum of vector spaces V. 152 Degree of polynomial p(x). V n . 157 Polynomial g (x) divides f(x) . Vn) V(f)W dimF(V) U+W [K:F] F[a] F(a) E(K) f'(x) <f>n(X) Ring of quaternions over F. 146 Ideal generated by a in a commutative ring. V2' ••• . ••• . 157 Polynomial ring over ring R. 230 . 163 Field of rational functions in x over F. W of V. 186 Sum of subspaces U. 190 Degree of Kover F. 145 Polynomial ring over the field F. 153 Ideal generated by g(x) in a polynomial ring. 198 Formal derivative ofpolynomialf(x). 181 SUbspace spanned by Vt. 177 Vector V in a vector space V.
ABSTRACT ALGEBRA .
with a few strokes." but the difficulties that the reader may encounter are not so much due to the "algebra" part as they are to the "abstract" part. . We start with some collection of objects S and endow this collection with an algebraic structure by assuming that we can combine. In other words. l'he· subject to be discussed is usually called "abstract algebra. or asking oneself "What is the point of all this?. there seems to be a common reaction for the novice. These ways of combining elements of S we call operations on S. On seeing some area of abstract mathematics for the first time. elements of this set S. This is true in all of mathematics. the best road to good understanding of the notions introduced is to look at examples. and background for the material we shall study? Let's give it a try. or whatnot. purpose. Can one. This is not too surprising. but it is particularly true for the subject matter of abstract algebra." is to take the concept at hand and see what it says in particular cases. topology. elements of this set S to obtain. quickly describe the essence. once more. for while many of the ideas are fundamentally quite simple. This can best be described by a feeling of being adrift. be it in analysis. of not having something solid to hang on to. they are subtle and seem to elude one's grasp the first time around. One way to mitigate this feeling of limbo. in one or several ways (usually two).1 THINGS FAMILIAR AND LESS FAMILIAR 1. A FEW PRELIMINARY REMARKS For many r9aders this book will be their first contact with abstract mathematics.
that we should not be led to some nonsensical contradiction computing within the framework of the allowable things the axioms permit us to do. b are any objects in S. What would we require of such a structure? Certainly we would want that the axioms be consistent. We shall study some of the basic axiomatic algebraic systems in this book. These axioms are for us to define. Let S be a set having an operation * which assigns an element a for any a. But that is not enough. then a * b = b * a." In everyday language "axiom" means a selfevident truth. An axiom is not a universal truthbut one of several rules spelling out a given mathematical structure. 2. and most importantly. Stick with it. The axiom is true in the system we are studying because we have forced it to be true by hypothesis.2 Things Familiar and Less Familiar Ch. The ones we mentioned earlier. If a. b are any objects in S. Let us assume that the following two rules hold: 1. Do not be discouraged if the initial exposure leaves you in a bit of a fog. 1 Then we try to condition or regulate the nature of S by imposing certain rules on how these operations behave on S. namely groups. These rules are usually called the axioms defining the particular structure on S. We can easily set up such algebraic structures by imposing a set of rules on a set S that lead to a pathological or weird system. there may be very few examples of something obeying the rules we have laid down. namely a lack of feeling that the material is something they can get their teeth into. concrete examples of the concept under discussion. one could try many sets of axioms to define new structures. and fields. from noticing that there are many concrete mathematical systems that satisfy these rules or axioms. to do certain things. PROBLEMS 1. A word about the use of "axioms. b E S. rings. try to understand what a given concept says. that is. in the particular structure. We return to something we said earlier about the reaction that many students have on their first encounter with this kind of algebra. look at particular. and fields. rings. But we are not using everyday languige. Furthermore. Time has shown that certain structures defined by "axioms" play an important role in mathematics (and other areas as well) and that certain others are of no interest. namely groups. have stood the test of time. we are dealing with mathematics. It is a license. Of course. then a * b = a. Show that S can have at most one object. If a. * b of S . historically in mathematics. but the choice made comes.
. b. . D * ~ = ~ = ~ 2. 3. many college students have had some exposure to set theory. (b) (a * b) * c =1= a * (b * c) in general.Sec. . . (d) There is a particular a in S such that a * b = b * a = b for all b in S. c are any elements of S (i. b in S define * by a * b = ab. b. we write a E Sthis is read "a is an element of S. one of these is a part of the other. n. a * a = O. then b * b = a. . Going on the assumption that many readers will have some acquaintance with set theory. 2.. we need some notation. ±n. ±2. Under what conditions on a. we write a fl. (b) (a * b) * c = a * (b * c). is an element of S. . however. * on S * D. b and c can be any of D or ~). we introduce a shorthand for these phrases. the objects of S we call the elements of S.. Verify by explicit calculation that if a.. 2 Set Theory 3 2.. So. 3. . then 165 E S. (d) For a in S." To denote the contrary. . c is (a * b) * c = a * (b * c)? (c) The integer 0 has the property that a * 0 = a for every a in S. SET THEORY With the changes in the mathematics curriculum in the schools in the United States. Let S consist of the two objects D and ~. Let S be the set of all integers 0. This introduction to set theory in the schools usually includes the elementary notions and operations with sets. namely that an object a is not an element of S. D * D = D. S. We define the operation by subjecting D and ~ to the following conditions: 1. To avoid the endless repetition of certain phrases.. ~ * ~ = D. First. To denote that a given element. whereas 13 f£ S.. (e) Given b in S. (c) a * b = b * a. a. 2. then: (a) a * b is in S..e. We say that S is a subset of T. for instance. For a. if S denotes the set of all positive integers 1.. which we write SeT (read "S is contained in T") if every element of S is an element of T. we shall give a rapid survey of those parts of set theory that we shall need in what follows. Let S be a collection of objects. a. 3. ±1. We often want to know or prove that given two sets Sand T.. where a is the particular element in Part (d). . Verify the following: (a) a * b =1= b * a unless a = b.
3} and B = {2. This is an opportune moment to introduce a notational device that will be used time after time. (Prove!) The need also arises for a very peculiar set. . that Sand T have exactly the same elements. For instance. If A. IO}. we define A . or is in both. they consist of the same set of elements. Thus. whose elements satisfy a certain property P. written A n B. Let A. 3. which is defined: A U B is that subset of S consisting of those elements of S that are elements of A or are elements of B. if S is the set of all positive integers having 6 as a factor and T is the set of all positive integers having both 2 and 3 as factors. IO}. In the definition given above. We can also denote this by writing T => S. are equal.B = {a E A I a f£ B}. read "T contains S. B be subsets of a given set S. by the intersection of A and B. 2. it is clear how one can define the union and intersection of any number of subsets. 1 In terms of the notation we now have: SeT if s E S implies that sET. Particular examples of intersections that hold universally are: A n A = A. For instance. Again let A and B be subsets of a set S. that is. 4. namely one having no elements. that is. We shall frequently need to show that two sets Sand T. defined perhaps in distinct ways. If A = {I. or is in B. S is always a subset of itself. We now proceed to our second way of constructing new sets from old. We now introduce methods of constructing other subsets of S from A and B. 6.) Thus. then A U B = {I. A n 0 = 0. 2. Consequently. we shall mean the subset of S consisting of those elements that are both in A and in B. written A U B. then A U B = {s E Sis E A or s E B} while A n B = {s E Sis E A andsEB}. for instance. The first of these is the union of A and B. Although the notions of union and intersection of subsets of Shave been defined for two subsets. A n S = A. 4. then SeT. We shall write this as A = {s E Sis satisfies Pl.4 Things Familiar and Less Familiar Ch. if T is the set of all positive integers and S is the set of all positive even integers. The "or" is not meant to exclude the possibility that both things are true. the difference of two sets. we shall often be called on to describe the subset A of S. that is. Given a set S. 6. It has the property that it is a subset of any set S. Here we mean that an element c is in A U B if it is in A. S => S for any set S. This set is called the null or empty set and is denoted by 0. B are subsets of S. in the example above. The usual strategy for proving this is to show that both SeT and T C S. if A. A U A = A. then S = T. It should be clear from the definitions involved that A nBC A and A nBc B. We now introduce a third operation we can perform on sets." (This does not exclude the possibility that S = T. and S is a subset of T. The "or" we have just used is somewhat different in meaning from the ordinary usage of the word. B are subsets of S. A n B = {2}.
namely that A U B C (A n B) U (A . A U B =. we .B C A.B) U (B . hence A U B = (A (A n B) U (A . Note the relation among the three operations. 4.A is called the complement of A in S and is written A' .A = ~ is the shaded area. A n B 3.A C B. b) to be equal to the ordered pair (ab b t ) if and only if a = at and b = b I . then 1. then.B) U (B .A) C A U A U B = A U B. A .B is the set of all positive odd integers. so again it is certainly in (A n B) U (A .A). We have thus covered all the possibilities and have shown that A U B C (A n B) U (A . B .A). We represent these three operations pictorially. B. so again it is in (A n B) U (A .B) U (B . b) I a E A. Now for the other direction.B) U (B .B) U (B .B) U (B . we need not restrict ourselves to two sets.Sec. A .A).A). = = is the shaded area. this part is easy for.B. Here. if u E A and u E B. Having the two opposite containing relations of A U B and (A n B) U (A . by the v~ry definition of A . As an illustration of how one goes about proving the equality of sets constructed by such settheoretic constructions. where we declare the ordered pair (a. If A is ® and B is ®. we pfove this latter alleged equality. if u E A but u ft.A). B by A X B = {(a.B) U (B . then u E B . u E A .B GB G1V is the shaded area.11 .A. then u E A n B. A nBC A.A) C A U B. 2 Set Theory 5 So if A is the set of all positive integers and B is the set of all even integers.B. Given II E A U B. We close this brief review of set theory with yet another construction we can carry out on sets. we obtain the desired equality of these two sets. so it is certainly in (A n B) U (. then A . if u E B but u $ A. for instance. b E B}. On the other hand. In the particular case when A is a subset of S. Finally. by definition. 2.A).B) U (B . This is the Cartesian product defined for the two sets A. is the shaded area.A). We first show that (A n B) U (A .B) U (B . too. the difference S . namely the equality n B) U (A . and B .
B) U (B .. namely: (a) (A n B)' == A' U B'. B.B. 3. If A is the set of all residents of the United States. 2. (a) S == {Mercury. let C' denote the complement of C in S. . 16. * 13. 4.. Wyoming}. 4. Let S be a set. 4}. . U C). 10. prove that A U C C B U C for any set C. Prove the De Morgan Rules for subsets A. and C the set of all women in the world. If A C Band B C C.25. B the set of all Canadian citizens.32.. ~ 4. What does this look like pictorially? 9. 8.C. If A C B. II} and you have been told that A n B == {4.. (b) S == {2.}. Describe the following sets verbally.. If C is a subset of S. . (a) S == {2. prove that A C C. for sets A. describe the sets A n B n C. 4. (b) S == {Alabama. 9. C. If A == {I. Prove that A n (B U C) == (A n B) == U (A n C). . C E C and where equality of two ordered triples is defined componentwise. *12.. A . 2. 7. 4. 9. B of S. where a E A.. their Cartesian product as the set of ordered triples (a. c). what must a be? 5.B) U (B . Earth. 7. . Alaska. (c) S == {I. For any two subsets of S we define A +B == (A . .. Venus. Prove that (A ..6 Things Familiar and Less Familiar Ch. 1 can define. 16. 36. b E B. Write down all the subsets of S MiddleLevel Problems {I.A verbally.}. Show that A U B == B U A and A n B == B n A. b. Prove that A U (B n C) == (A U B) n (A 11. (b) (A U B)' == A' n B'.Pluto}.8. PROBLEMS Easier Problems 1.6. a} and B == {3. 3. 8. 9}. . . 6.}.(A n B). Describe the following sets verbally. C .A) == (A U B) . A . 4.A) and A· B == A n B..
at least. B. If S is a set having five (a) How many subsets (b) How many subsets (c) How many subsets Harder Problems 22. Show. A pubirc opinion poll shows that 930/0 of the population agreed with the government on the first decision. B. C find a formula for meA U B U C). (f) If A + B = A + C.) 19. (d) A + A = 0. Take a shot at finding meAl U A 2 U . (a) Show that a set having n elements has 2 n subsets.. (b) A + 0 = A. 15. What percentage. let m(C) denote the number of elements in C. A 2 . and 740/0 on the third. 750/0 an ear. how many subsets are there that have exactly m elements? elements: does Shave? having four elements does Shave? having two elements does Shave? . for finite sets A. that meA X B) = m(A)m(B). For three finite sets A.. In his book A Tangled Tale. then B = C. U An) for n finite sets AI. (Hint: First consider D = B U C and use the result of Problem 14. must have lost all four?" Solve Lewis Carroll's problem. then at least 500/0 of Americans have both gone to high school and read a daily newspaper. (B + C) = A· B + A . If A. (e) A + (B + C) = (A + B) + C. (c) A· A = A. C. 21. 85% a leg.meA n B). An· 17. B are finite sets. ••• . Lewis Carroll proposed the following riddle about a group of disabled veterans: "Say that 700/0 have lost an eye. At least what percentage of the population agreed with the government on all three decisions? (Hint: Use the results of Problem 15. 2 Set Theory 7 Prove that: (a) A + B = B + A. 800/0 an arm.Sec. (g) A . (b) If 0 < m < n. *20. 84% on the second. prove that meA U B) = meA) + m(B) . 18. Use the result of Problem 14 to show that if 800/0 of all Americans have gone to high school and 70% of all Americans read a daily newspaper. *14. If C is a finite set. for three decisions made by the government.) 16.
Let S = {all legally employed citizens of the United States} and T = {positive integers}. Rose Kennedy is the image under f of John F. Therefore. We denote that f is a mapping from S to T by f: S ~ T and for the t E T mentioned above we write t = f(s). andf(a) = f(a) for all a E IR. as a mapping. Thus. we shall give an informal and admittedly nonrigorous definition of a mapping (function) from one set to another. often in the form of formulas. then there is only one element t in T that is associated to s by the mapping. As we see below. The concept is hardly a new one for any of us. f(John F. The definition of a function from one set to another can be given in a formal way in terms of a subset of the Cartesian product of these sets. Let f be the rule that assigns to every person his or her country of citizenship. we call t the image of sunder f. Kennedy) = Rose Kennedy. a function or mapping f from S to T is a rule that assigns to each element s E S a unique element t E T. Kennedy. Let S be the set of all people in the world and T the set of all countries in the world. . the following is not a mapping. f(s) by f(s) = Social Security Number of s. On the other hand. Then f is not a mapping from S to T. Define. (For the purpose of this text. is a perfectly good function from IR to IR. they can occur in any area. Let's explain a little more thoroughly what this means. 1 3. It should be noted thatf(2) = (2)2 = 4 = f(2). But mappings need not be restricted to sets of numbers. Why not? Because there are people in the world that enjoy a dual citizenship. when applied to Mary Jones. and according to our definition.) Then f defines a mapping from S to T. let us assume that all legally employed citizens of the United States have a Social Security Num~er. for such people there would not be a unique country of citizenship. MAPPINGS One of the truly universal concepts that runs through almost every phase of mathematics is that of a function or mapping from one set to another. the rule f: IR ~ IR. t varies over T (in a manner depending on s). defined by f(a) = a 2 for a E IR. fwould not make sense. llet S = {all men who have ever lived} and T = {all women who have ever lived}. for s E S. where IR is the set of real numbers. 2. if Mary Jones is both an English and French citiz~. Let S. Note that by the definition given. Instead. Examples 1. As s varies over S. here. If s is a given element of S. One could safely say that there is no part of mathematics where the notion does not arise or playa central role. Since grade school we have constantly encountered mappings and functions.8 Things Familiar and Less Familiar Ch. T be sets. Define f: S ~ T by f(s) = mother of s.
S2 E S are in Sand f(Sl) = f(S2). This is a function from S to T. bye). 6). it is for us to decide how to declare f = g where f: S ~ T and g : S ~ T. 2) =1= (3. We call this function of S to itself the identity function (or identity mapping) on S. Let S be the set of all objects for sale in a grocery store and let T = {all real numbers}.Sec. 2) = ~ while f(3. denote it by is (and later in the book. If S is the set of all real numbers and f is defined on S by f(s) = S2 + 2s + 1. while g is defined on S by g(s) = (s + 1)2. 6 = 12. define f: S ~ T by f(s) = S2. The mappings to be defined in Examples 9 and 10 are mappings that occur for any nonempty sets and playa special role. what can you say about Sl and S2? 5. This defines a mapping from S to T. 9. 10. Let S be the set of all positive integers and let T be the set of all positive rational numbers. although (1. T be nonempty sets. Define f: S ~ T by f(s) = price of s. Does every element of T come up as an image of some s E S? If not. Let S. . Note that f(l. how would you describe the set of all images {f(s) I s E S}? When is f (s 1) = f(S2)? 6. Define f: S ~ T by f (s) = t 0 for every s E S. 3 Mappings 9 3. What can you say about {f(s) Is E S}? When is f(s 1) = f(S2)? 7. Define f: S X S ~ T by f(m. at times. Describe the subset of S X S consisting of those (a. the Cartesian product of T with itself. Now that we have the notion of a mapping we need some way of identifying when two mappings from one set to another are equal. while that of 3. f is called a constant function from Sto T. This is not God given. Let S = T be the set of all real numbers. we declare that f = g if and only if f(s) = g(s) for every s E S. and let to be a fixed element of T. 4. What is more natural than to define this equality via the actions of f and g on the elements of S? More precisely. Thus the image of 6 under this mapping. Definef: T X T~ Tby f(t 1 . Let S be any nonempty set and define i: S ~ S by i(s) = s for every s E S. b) such that f(a. our definition of the equality of f and g is merely a statement of the familiar identity (s + 1)2 = S2 + 2s + 1. 2). Let S be the set of all integers and let T = S. t 2) = t 1 • This mapping from TXT to T is called the projection of TXT onto its first component. 8. If SJ. is given by f( 3) = 2( 3) = 6. Let S = T be the set of all real numbers. define f: S ~ T by f (s) = S3. f( 3). is given by f(6) = 2 . This defines a mapping from S X S to T. f(6). b) = ~. 6) = ~ = ~ = f(l. Let T be any nonempty set and let S = TXT. n) = min. Define f: S ~ T by f(m) = 2m for any integer m. We may.
Definition. If the inverse image of {t} consists of only one element. say s E S. there exists an s E S such that t = [(s). The mapping in Example 4 fails to be onto because not every integer is even. but is so if[is 11 and onto. [is 11 if [(SI) = [(S2) implies that SI = S2. In the examples we gave earlier. citizens have distinct Social Security numbers (provided that there is no goofup in Washington. a mapping is 11 if it takes distinct objects into distinct images. for the number 1. since two brothers would have the same mother. If we define [(S) = {[(s) E Tis E S}. given t E T. First. in Example 2 the mapping is not onto. A mapping [: S ~ T is said to be onetoone (written 11) or injective if for SI =1= S2 in S. Definition. 1 Having made the definition of equality of two mappings. for instance. As we note below. in Example 1 the mapping is not onto. Given a mapping [: S ~ T and a subset ACT. [(SI) =1= [(S2) in T. This [I does not in general define a mapping from T to S for several reasons. The reader can decide whether or not the given mappings are onto in the other examples. the mapping is not onto. Another specific type of mapping plays an important and particular role in what follows. However. then 1 . is not the square of any real number. and call Pl(A) the inverse image o[ A under [. we could try to define [I(t) by defining [I(t) = s. We shall use the same notation [I in cases of both subsets and elements. However in Example 2 the mapping is 11 because distinct U. the mapping in Example 6 is onto because every real number has a unique real cube root. for not every positive integer is the Social Security Number of some U. we may want to look at B = {s E S [(s) E A}. this need not be a mapping from Tto S. citizen. which is unlikely). the inverse image of the subset {t} of T consisting of the element t E T alone. The mapping [: S ~ T is onto or surjective if every t E T is the image under [ of some s E S. another way of saying that the mapping [: S ~ T is onto is by saying that [(S) = T. \ve use the notation [l(A) for this set B.S. again. that is. if and only if. the mapping of Example 1 is not 11. if [is not onto. Similarly. The reader should check if the various other examples of mappings are 11. In the examples of mappings we gave earlier. we now want to single out certain types of mappings by the way they behave. since not every woman that ever lived was the mother of a male child. Equivalently. In other words.10 Things Familiar and Less Familiar Ch. and in Example 5.S. Of particular interest is [l(t).
Definition. ([0 (g 0 h)(s) (([0 g) 0 h)(s). Consider the situation g : S ~ T and [: T ~ u. so g(s) is ripe for being acted on by f Thus we get an element leges») E U. when we map S into itself Although special. then g sends it into the element g(s) in T. then for some t E T there are at least two distinct Sl =1= S2 in S such that fest) = t = [(S2). denot~d by [0 g. (Verify!) This brings us to a very important class of mappings. is the mapping [0 g: S ~ U defined by ([ 0 g)(s) leges)~ for every s E S. if [is both 11 and onto T. for [0 g to have any sensethe terminal set. Given an element s E S. One special time when we can always compose any two mappings is when S = l~ = U. However. We claim that this procedure provides us with a mapping from S to U. Note first of all that both [0 (g 0 h) and ([ 0g) 0 h define mappings from S to V. Second. so it makes sense to speak about their possible equality. 3 Mappings 11 there is some t in T which is not the image of any element s. so Ilet) = 0. ~ T is said to be a 11 correspondence Now that we have the notion of a mapping and have singled out various types of mappings. g : T ~ U. we might very well ask: "Good and well.3. T. (Verify!) We define this more formally in the Defi~ition. for the mapping g must be the initial set for the mapping f. We verify a few properties of this composition of mappings. then. and [: U ~ V. then the composition (or product). The mapping [: S or bijection if [is both 11 and onto.1.Sec. We apply the definition of composition to see that ([0 (g h) )(s) = [( (g h )(s) = [(g(h (s) ). Note that to compose the two mappings [ and gthat is. then [0 (g 0 h) = Proof How shall we go about proving this lemma? To verify that two mappings are equal. but what can we do with them?" As we shall see in a moment. this case is of the utmost importance. Our task. If h : S ~ T. {fog)oh. is to show that for every s E S. we can introduce an operation of combining mappings in certain circumstances. that is. If g : S ~ T and [: T ~ U. if [is not 11. SO Ilet) is not a unique element of Ssomething we require in our definition of mapping. 0 0 . we merely must check that they do the same thing to every element. then [I indeed defines a mapping of Tonto S. Lemma 1.
fl(t) is that element So E S such that . however. If g: S ~ T and f: T fo g : S ~ U is also a bijection. ~ U are both onto. thus.2 is to obtain Lemma 1. by definition. satisfy the associative law. Remark. is also 11. the mappingfo g is 11. where is and iT are the identity mappings of Sand T.12 Things Familiar and Less Familiar Ch. 1 Unraveling ((f g) h)(s) = (f g)(h(s» = f(g(h (s») . then If f is a 11 correspondence of S onto T.3. f(g(Sl» = f(g(S2»· Since f is 11. If g: S ~ Tandf: T~ U are both 11. fo (g h) = (fo g) h. we get from this that g(Sl) = g(S2). then (fo fl)(t) = f(fl(t».3. thus Sl = S2 follows. 0 = Proof We verify one of these. by definition. 0 0 0 we do indeed see that (f (g h»(s) 0 0 = ((f g) h)(s) 0 0 0 for every s E S. If g: S ~ T and f: T also onto. then the "object" f.l = iT and f. thenfo g: S ~ U Proof Let us suppose that (fo g)(Sl) = (fo g)(S2). In this case it is called the inverse of f In this situation we have Lemma 1. Consequently.4.2.l : T ~ S defined earlier can easily be shown to be a 11 mapping of Tonto S.3. then fo g: S ~ U is An immediate consequence of combining the Remark and Lemma 1.3. Because of the equality involved there is really no need for parentheses. respectively. D We leave the proof of the next Remark to the reader. D 0 (The symbol D will always indicate that the proof has been completed.3.l f is. ~ U are both bijections. under composition. so we write fo (g h) as fo go h. g is also 11.) This equality is described by saying that mappings. 0 Lemma 1. If t E T. then fo f. Since (fo g)(Sl) = (fo g)(S2) forces Sl = S2. If f: S ~ T is a bijection. But what is fl(t)? By definition.
then g = h. and if f: T ~ U is 11.3. T = S. (g) S = set of all positive real numbers. 8. If f is a 11 mapping of S onto T. (f) S =~set of all real numbers.3. T = set of all men. (a) Does this define a function from S to T? (b) Show that f(SI + sz) = f(st)f(sz). or both. T = S. (b) S = set of positive integers. If f: S ~ T is onto and g : T ~ U and h : T ~ U are such that g f = h f. T = set of nonnegative integers. f: S ~ T is defined by f(s) = 1 if s is even.2. prove that f. f(s) = 1 if s is odd. 2. For the given sets S. f(s) = s . prove that g = h. In other words. prove that f. T = S. (fo fl)(t) t E T. 3 Mappings 13 t = f(so). if not. What does this say about the integers? (c) Is f(Stsz) = f(SI)f(sz) also true? 0 0 0 . T = S. (c) S = set of positive integers.1. onto. (a) S = set of all women. Lemma 1. PROBLEMS Easier Problems 1.Sec.5. T determine if a mapping f: S ~ T is clearly and unambiguously defined. I}. f(s) = ~. T = S. *3. So f(fl(t» = f(so) = t. Give a proof of the Remark after Lemma 1. f(s) = s 1. * 4. S. Let S be the set of all integers and T = {I.1 is a 11 mapping of T onto S. If g: S ~ T. f(s) = ~. If f: S ~ T and iT is the identity mapping of T onto itself and is is that of S onto itself. f(s) = s . the identity mapping on T. In those parts of Problem 1 where f does define a function. If f is a 11 mapping of S onto T.D = t for every We leave the last result of this section for the reader to prove. f(s) = husband of s. determine if it is 11. (e) S ~ set of all integers. then iT 0 f = f and fo is = f.1 = iT. show that if fo g = fo h. *7. f(s) = s .1. hence fo f. h : S ~ T. (d) S = set of nonnegative integers.1.1 f = is. say why not. *6.
b always? (c) Find all fa. Give explicit values for u.b such that fa. where a.14 Things Familiar and Less Familiar Ch. 15. v. 11.. Let S be the set of all positive integers of the form 2 m 3n . how could you modify the definition of [so as to get a legitimate function? 13.1 exists and find its form.b fl. b can f be onto or 11. Define [: S ~ T by [(2 m 3n ) = min. b E S. and let T be the set of all integers.b fc. be defined by [(s) = as + b. define fa. 9. Define f: S ~ S by f(l) = 2. and let T be the set of all rational numbers. (a) Find fog. f(2) = 3. Let S be the set of all real numbers and for a. and f(s) = s for any other s E S. . where a =1= 0. Let S be the set of nonnegative rational numbers. If [is a 11 mapping of S onto itself. n > 0. (On what properties of the integers does this depend?) 14. 17. n nonnegative integers. C. where S is the set of all integers. how many mappings are there of S into itself? 18.b fc. S = {min I m. (a) Show that fa. (b) Is fa. Show that fo fo [ = is. (b) Find g f. b in order that f 0 [= is. and define f: S ~ S by [(s) S2 + as + b. b are fixed real numbers. v in terms of a. If S is a finite set having m > 0 elements. show that ([1)1 = f. Let [: S ~ S. Define f: S ~ S by f(s) = g: S ~ S by g(s) = s + 1. where a. b. Prove that for no values at a. Find all [of the form given in Problem 14 such that [0 [0 f = is. n =1= O}.d = fu. Prove that f defines a function from S to T.b(S) = as + b.1 = fl. 1 S2.u for some real u. b are integers. and d. where m > 0. f(3) = 1. Let S be the set of all real numbers. Let S be the set of all real numbers.b' (d) Show that f. (a) Does f: S ~ T defined by f(mln) = 2m 3n define a legitimate function from S to T? (b) If not. Find the necessary and sufficient conditions on a. In Problem 17. how many 11 mappings are there of S into itself? 19. 16. What is [I in this case? 0 0 0 0 0 MiddleLevel Problems 12. Let S be the set of all positive integers.d fa.1 fa.d = fc. (c) Isfog = gof? 0 and 10. that is.
\ f ofofo .1 explicitly. findf. Let S be the set of all positive real numbers. d 0 fa. m 2:: 0. n 2:: 0. Show that there is a 11 correspondence of S onto T. For the fin Problem 25. If S is a finite set and f is a 11 mapping of S. Define f: S ~ S by f(s) = as 2 + bs + c. 3 Mappings 15 20. If S is a finite set and f is a mapping of S onto itself. Find all fe. show that f must be surjective.b(S) = as + b. show that f must be 11. c need not be integers. then (a) What can you say about f if g is onto? (b) What can you say about g if f is II? 28. b.Sec. c and nonnegative reals b. b are rational numbers. find an n > (in terms of m) that works simultaneously for all 11 mappings of S into itself. Prove that there is a 11 correspondence of the set of all positive integers onto tITe: set of all positive rational numbers. b. b. c. where a =1= 0.] Harder Problems 23. b. d for every fa. 25. of = is. Let S be the set of all integers of the form 2m 3n . d. If S is a finite set and f is a 11 mapping of S into itself. Let S be the set of all integers and a. If f. Find a 11 mapping f of S onto T such that f(S1 + S2) = f(S1) f(S2) for all Sb S2 E S. f(s) = ~s(s + 1) = ~S2 + ~s does always give us an integer for integral s. 26. Let S be the set of all real numbers and T the set of all positive reals.b : S ~ S be defined by fa. 29. b· 22. If S has m elements in Problem 30. Find necessary and sufficient conditions on a. Let S be the set of all rational numbers and let fa.d of this form s~tisfying fe. 30. b = fa. for example. so that f defines a mapping on S [Note: a. 27. is it ever possible that the mapping f: S ~ S defined by f(s) = (as + b)/(cs + d) satisfiesf o f= is? Find all such a. g are mappings of S into Sand fog is a constant function. d that do the trick. For positive reals a. . n times I 31. c rational numbers. and let T be the set of all positive integers. 24. 21. show that for some integer n > 0. c. ° .. b 0 fe.
Suppose that you can do a certain thing in r different ways and a second independent thing in s different ways. 1 4. A (S) satisfies the following: (a) f. Chapter 3 will be devoted to a study. . A(S) (THE SET OF 11 MAPPINGS OF 5 ONTO ITSELF) We focus our attention in this section on particularly nice mappings of a nonempty set. If we are interested in the structure of Sn' it really does not matter much what our underlying set S is. We have chosen to develop those aspects here which will motivate the notion of a group and which will give the reader some experience. g E A(S) implies thatfo g E A(S). (c) There exists an elementthe identity mapping isuch that fo i i o f= ffor every fE A(S). of all 11 mappings of S onto itself. To do so. So. say n. When S has a finite number of elements. n}. Although most of the concern in the book will be in the case in which S is a finite set. Sn plays a central role. h E A(S) implies that (fo g) 0 h = fo (g 0 h). there exists agE A(S) (g go f = i. Lemma 1. Groups will be discussed in Chapter 2. . in some depth. = fl) such that fo g = Proof All these things were done in Section 3. either in the text material or in the problems. We begin with a result that is really a compendium of some of the results obtained in Section 3. working in a grouptheoretic framework. then A (S) has a special name. (b) f. Namely. In how many distinct ways can you do both things together? The best way of finding out is to picture this in . you can think of S as being the set {I. into itself. g. It is called the symmetric group of degree n and is often denoted by Sn' Its elements are called permutations of S.16 Things Familiar and Less Familiar Ch. D We should now like to know how many elements there are in A(S) when S is a finite set having n elements.1. S. There are many properties of the set A(S) on which we could concentrate. A(S). of Sn' In the investigation of finite groups. (d) Given f E A(S).. We leave it to the reader to find the relevant part of Section 3 that will verify each of the statements (a) through (d). we first make a slight digression. we do not restrict ourselves to that situation here. we shall consider the set.4. and feeling for..
How many choices does fhave as a place to send x I? Clearly n. To be able to see the picture in its entirety. . Continuing this way. Hence f can send X2 into n .X3~XI. Hence the number of suchf's is n(n . x n }. for we can send x I under fto any element of S. for every road you take from Chicago to Detroit you have s ways of continuing on to Ann Arbor. 1. g: Xl ~ X2. If we can do the first things in r I distinct ways. X3. Let's recall something many of us have already seen: Definition. 4 A(S) (The Set of 11 Mappings of S Onto Itself) 17 a concrete context. + s. we see that f can send Xi into n . gof:XI ~XbX2~X3. we must have f(XI) =1= f(X2). then we can do all these together in rlr2 rm different ways. X3. Suppose that there are r highways running from Chicago to Detroit and s highways running from Detroit to Ann Arbor. the mth in rm distinct ways. then to Ann Arbor? Clearly. Proof: Let f E A(S).. where S = {Xb X2. for an integer m > 2.1 different images.. i:XI~XbX2~X2. Lemma 1. where n! is still quite small. 3 . If S has n elements. = rs r times different ways.X3~X3.. X2. ·1 = n! D Example The number n! gets very large quickly. But now f is not free to send X2 anywhere..X3~X2. X2. . If n is a positive integer. for since f is 11.Sec.. Consider A(S) = S3' where S consists of the three elements Xl.(i . We list all the elements of S3' writing out each mapping explicitly by what it does to each of Xl. . f:XI~X2. then A(S) has n! elements.. In how many ways can you go first to Detroit. then n! (read "n factorial") is defined by n! = 1 ·2 .(Verify!) 3. the second in r 2 ways. 2. n. we look at the special case n = 3. It is fairly clear that we can extend this from doing two independent things to doing m independent ones. hence you can complete it in IS + S + S + . You can start your trip from Chicago in r distinct ways.2.1)(n . X2 ~ Xb X3 ~ X3· 4.1) different images.2) . .X2~X3.4. So we can send X2 anywhere except onto f(XI)..
Finally.l = fo f. Similarly. To see this. g.. some other familiar properties do go over. so one familiar rule of the kind of arithmetic we have been used to is violated. hence f. On the other hand. we claim that (fg)2 =1= f2g2.l g. f2 = fo f = ff. then g = h. will be a nonempty set. therefore. What does this list tell us? To begin with. Similarly. g satisfy gf = flg. From now on we shall drop it and write fo g merely as fg. gf = hf implies that g = h. if f. Since g E S3 and g E S 3 . where n is a positive integer. PROBLEMS Recall that fg stands for fo g and. S. X2 ~ X2. In S3 our f. . and S3 has only six elements. fo f : Xl ~ X3' X2 ~ Xl. that is.. For instance. what fm means. 0 0 It is a little cumbersome to write this product in A (S) using the o. we easily get gog = i. X2 ~ X2. X3 ~ Xl. in the example of the f. also. 0 0 0 0 0 0 Note also that fo (fo f) = i. However. hence f2 g 2 = f2 =1= i. 0 Example Do not jump to conclusions that all familiar properties of exponents go over. we have a complete list of all the elements of S3. For negative exponents n we define fn by fn = (fl)n. X3 ~ X2. (Verify!) 6. (Verify!) Since we have listed here six different elements of S3. we note that fo g =1= go f. The usual rules of exponents prevail. to avoid expressions like fo fo fo . (fg)2 = i. 1 5. fa = i. from fg = fh we have fl(fg) = fl(fh). Why? Because. f. What is it? If we calculate gog. namely frfs = fr+s and (fr)s = frs. without subscripts. h are in A (S) and fg = fh. g = ig = (fIf)g = fl(fg) = fl(fh) = (fIf)h = ih = h. whence (fg)2 =1= f2 g 2in this case. we note that so that (fg)2: Xl ~ Xl. g E S3 defined above. but since f =1= /1 we cannot cancel the g here. f2 =1= i and g2 = i. Also. For instance. We leave these as exercisessomewhat tedious ones at thatfor the reader. fo g: Xl ~ X3. and so on. We define. So we can cancel an element in such an equation provided that we do not change sides. we leave it to the reader to show that g f = f. we shall start using the shorthand of exponents. X3 ~ X3. we get (f g) (f g) = i = (g f) (g f). for f E A(S). we must have gog also in S 3.18 Things Familiar and Less Familiar Ch.
Sec. then there is some g E K such that h = flgf. Iff. (b) If h E H. h E A(S). show thatf6 i. show that there is anfE A(S) such thatf(sl) = = S2. prove that fg = gf. then flgf E H. namely frfs f E A (5) and positive integers r. (b) If f. f4. Show that: 3. show that (flgf)(f1hf) about (flgf)n? = fl(gh)f. (b) g2. If Sl =1= S2 are in S. Then if H is as in Problem 2 and K = {g E A (S) I g(S2) = S2}. (d) gf. 9. (e) (fg)3. 2. Suppose that Sl =1= S2 are in Sand f(Sl) = S2. = fr+s and (fr)s = frs for 8. X2. *7. IffE S3. If Sl E S. If f. (gf)3. g E A (S) and fg (a) (fg)2 = f2 g 2. thenf. X3' X4}. (b) (fg)l = flgl. If f. for the same f and g. to show that (fg)m = fmgm for all integers m. If S = {x}. Verify_ the rules of exponents. f3.1 . gl = 10. let f. show that: 6. g. g E H. show that: (a) If g E K. where f E A (S). 4 A(S) (The Set of 11 Mappings of S Onto Itself) 19 Easier Problems 1. g E S4 be defined by and Calculate: (a) f2. (c) fg. 4. let H === {fE A(S) I f(Sl) (a) i E H. Push the result of Problem 5. s. What can you say 5. then fg E H. g E A (S) and (fg)2 = f2g2. Can you find a positive integer m such that fm i for all f E S4? . = gf. g3.1 E H. sd. (c) IffE H. (f) f. = 11.
f(x n) = xtl can be written as f = gtg2 where each gi E Sn interchanges exactly two elements of S = {Xl. If f E Sn. such that fk = i. g E VeT) and F is onto Sm· 20. Sn such that F(fg) = 15. If m < n. how many elements are there in VeT)? Show that there is a mapping F: VeT) ~ Sm such that F(fg) = F(f)F(g) for f. = i. (b) f. if f E A(S). show that for some fin Sn. If S has three or more elements.e. (Hint: Consider the positive powers of f. . depending on f.20 Things Familiar and Less Familiar Ch. (b) f E M implies that f. Call an element in Sn a transposition if it interchanges two elements. 19. 1 MiddleLevel Problems *12. show that we can find f.. For the situation in Problem 16. g E VeT) implies thatfg E VeT). show that f = h lh 2 ••• h m for some h j E Sn such that h. can F in Problem 19 ever be II? If so. f(xJ = Xi+l if i < n. In Sn show that the mapping f defined by [i. If the S in Problem 18 has n elements and T has m elements.) * 13. Show that any element in Sn is a product of transpositions. leaving the others fixed. g E Sm.) 24.1Mf {flgfl gEM} must equal M. (This sharpens the result of Problem 22. when? 21. If n is at least 3. Harder Problems 22. show that there is some positive integer k. gnl . If m < n. x n}. gEM implies that fg E M. *23. 16. = i for all f E ~ Sn. If f E Sn. show. leaving the other elements fixed in S. = 18. Let S ~ T and consider the subset VeT) = {f E A (S) I f(t) E T for every t E T}. Prove that: (a) f. f cannot be expressed in the form f = g3 for any g in Sn. g E A (S) such that fg =1= gf. show that there is a 11 mapping F: Sm F(f)F(g) for alII. Show that there is a positive integer t such that ft 14. that f.l E M. Let S be an infinite set and let MeA (S) be the set of all elements f E A(S) such that f(s) =1= s for at most a finite number of s E S. Show that: (a) i E VeT). 17.
Show that repeating this fixed shuffle a finite (positive) number of times will bring the deck back to its original order.. If f E Sn is such that f =1= i but f3 = i. .. let C(f) = {g E A (S) I fg (a) g. We say that g E A(S) commutes with f E A(S) if fg = gf. . Prove that: 5. If f E A (S) satisfies f3 = i. what can you say about the size of the orbits of the elements of S relative to f? What property of the prime numbers are you using to get your answer? 31. . f. t E S. we shall give here a rather sketchy survey of the properties of 7L that we shall use often in the ensuing material. and zero. f(X2) = X3. 5 The Integers 21 25. of all integerspositive. then either O(s) n OCt) = 0 or O(s) = OCt). (c) C(f) is not empty. show that we can number the elements of S in such a way that f(Xl) = X2.I . and.. for s E S. f(x n ) = Xb are the powers of f.f(X3k+l) = X3k+2' f(X3k+2) = X3k+3. 2. For f E A (S). Because of this acquaintance with 7L. In Sn show that the only elements commuting with f defined by f(Xi) = Xi+1 i"f i < n.. . *32. show that the orbit of any element of S has one or three elements. 28. f(X3k+3) = X3k+l for some k. namely i = fa. 11 and f(x 12) = Xl.2. . a few are less well known. X12} and f E S12 is defined by f(xt) = Xt+l if i = 1. Find all the elements in A(S) that commute with f: S ~ S defined by f (x 1) = X2 .. . f(xs) = X6. find the orbits of all the elements of S (relative to f). f(X3) = Xl. f(X4) = Xs. Prove that if S has more than two elements. Show that if s. 26. negative. The basic assumption we make about the set of integers is the . for all the other Xt E S.. call. h E C(f) implies that gh E C(f). = gf}. View a fixed shuffle of a deck of 52 cards as a 11 mapping of the deck onto itself. f2.. 34. .. then the only elements fa in A (S) ~ch that fof = ffo for all f E A (S) must satisfy fa = i. *27. If f E A(S) satisfies fP = i. and f (s) = s if s =1= Xb X2 . . (b) g E C(f) implies that gl E C(f). If f E A(S).. the orbit of s (relative to f) the set O(s) = {fj(s) I all integersj}. which we shall often call N. f(xt) = Xt. f (X 2) = Xl. 29. . *30.. If S = {Xl' X2. Most of these properties are well known to all of us. Equally familiar is the set. THE INTEGERS The mathematical set most familiar to everybody is that of the positive integers 1. 7L. f(X6) = X4.fn . Recall that a prime number is an integer p>1 such that p cannot be factored as a product of smaller positive integers. 33. .Sec.
r 2:: 0. (7) 114. such that m = qn + r. especially about the notion of divisibility.(q + l)n < r. i.e. for instance. Note that W contains some nonnegative integers. there is an element Vo E V such that Vo ~ v for every v E V. D Euclid's Algorithm will have a host of consequences for us. contradicting the minimal nature of r in V.qn ~ n. r.2. then m . We claim that r < n. We give it a more formal statement and a proof based on wellordering. namely that we can divide one integer by another to get a remainder that is smaller. 41 (16). If not. To indicate that m is not a divisor of n. yet m . where t runs through all the integers.. 3 t 5. The basic elementary properties of divisibility are laid out in Lemma 1. If min. if n = cm for some integer c.qn for some q (for that is the form of all elements in W ~ V).1 (Euclid's Algorithm). But this puts m . Given integers m =1= 0 and n we say that m divides n. for if t is large enough and negative. written as min. Theorem 1. for instance. (b) If m =1= The following are true: 0.(q + l)n 2:: O. Since we are speaking about the integers. 2114.tn. Euclid's Algorithm is proved. and r = m . This principle will serve as the foundation for our ensuing discussion of the integers.5. Definition. The first application we make of it is to show something we all know and have taken for granted. .(q + l)n in V. (a) 11 n for all n. Since rEV. Let W be the set of m . what this principle states is that given a nonempty set V of nonnegative integers. Any nonempty set of nonnegative integers More formally. Proof. then m 10. This is known as Euclid's Algorithm. Let V = {v E W I v 2:: O}. so. we call m a divisor or factor of n. has a smallest member.22 Things Familiar and Less Familiar Ch.tn I t E Z}. 1 WellOrdering Principle. with 0 ::5 r < n. and n a multiple of m. This will save a lot of repetition of certain phrases. by the wellordering principle V has a smallest element. then there exist integers q and r. hence m . r = m . Thus. be it understood that all letters used in this section will be integers. W = {m . With this. If m and n are integers with n > 0.5.tn > O. we write m t n.
1)24 = ( . However. we now want to introduce that of the greatest common divisor of two (or more) integers. Then n = cm and q = dm for some c and d. So suppose that min and m I q. un + vq = u(cm) + v(dm) = (uc + vd)m. b are not both 0. moreover. 9 + (. is unique. then m = ±n. v. (24. (b) c I a and c lb. following immediately from the definition of min. then m I (un + vq) for all u. Therefore. unique.5)9 + 2 . b) exists. c = moa + nob for some suitable mo and no. Theorem 1. Simply enough. for instance. in fact. then m I q. (e) If mil. this should be the largest possible integer that is a divisor of both integers in question. In other words. from the definition. b) exists. .3. Given a. then their greatest common divisor c is defined by: ~ (a) c > o. This combination is not unique. We leave all but Part (d) as exercises but prove Part (d) here to give the flavor of how such proofs go. b (not both 0). (f) If min and n I m. (c) Ifdlaanddlb. Definition. Defining something does not guarantee its existence. 24. Proof The proofs of all these parts are easy. We write this c as c = (a. m I (un + vq). then their greatest common divisor c = (a. the greatest common divisor of a and b is the positive number c which divides a and b and is divisible by every d which divides a andb. and is. (d) If min and m I q. 9) = 3 = 3 . Thus. namely that (a. So it is incumbent on us to prove that (a. D Having the concept of a divisor of an integer. 5 The Integers 23 (c) If min and n I q. we want to avoid using the size of an integerfor reasons that may become clear much later when we talk about rings. then m = 1 or m = 1. The proof actually shows more. and. So we make the definition in what may seem as a strange way.Sec. b) is a nice combination of a and b. b).5. If a.thendlc.
What is (24. Therefore. First note that if d I a and d I b. 2. in other words. r cannot be positive. which now is easier since one of the numbers is smaller than before.5. a = qc and so cia.2 we get that t = c (since both are positive). namely a = 24. b) by 1.5. 9) = 3. 4) This gives us (100. We claim that c is our required greatest common divisor. 1 Proof Since not both a and bare 0. if t > 0 also satisfied t I a. r = qnob + (1 . b = 9. d I c. 12) ( 16. that is. note that 3 = 3· 9 + (1 )24. For the uniqueness of c. But r < c and is in A. So r is in A. for example. So. interchange a and b so that a > b > O. c. By Euclid's Algorithm. then x = (mt)a + (nt)b. we would have tic and cit. to verify that c is our desired elenient. 16) = (16. It is possible to find the actual values of mo and no such that 4 = since 100 = 3 (28) + 16 since 28 = 1 (16) + 12 since 16 = 1 (12) + 4 mo 100 + no 28 .qmo)a. so by the choice of c. then xis also in A and x> 0. Thus A has positive elements. b) = (b.9)? How is this done for positive numbers a and b which may be quite large? If b > a. r) where a = qb + r with a ::5 r < b (Why?). a = q(moa + nob) + r. t I band d I t for all d such that d I a and d I b.2. So. in A. Similarly. 4) = 4. where 0 ::5 r < c. a = qc + r.24 Things Familiar and Less Familiar Ch. then d I (moa + nob) by Part (d) of Lemma 1. Then we can find (a. By direct examination we know that (24. finding (b. we have (100. observing that (a. 28) = (28. the set A = {ma + nb I m. for if x = mta + ntb. 28) = (12. D Let's look at an explicit example. so is in A. that is. By Part (f) of Lemma 1. r). If x E A and x < 0. 16) ( 28. we need only show that c I a and c lb. Since C E A. no. hence. nEE} has nonzero elements. by the wellordering principle there is a smallest positive element. c I b. 12) = (12. Hence r = 0. by the form of the elements of A we know that c = moa + nob for some mo.
But then 4= 16+(1)12 12= 28+(1)16 16 = 100 + (3) 28 4 = 16 + (1) 12 = 16 + (1)(28 + (1) 16) = (1) 28 + (2) 16 = (1) 28 + (2)(100 + (3) 28) = (2) 100 + (7) 28 so that mo = 2 and no = 7.5.5. This shows how Euclid's Algorithm can be used to compute (a. The integers a and b are relatively prime if and only if 1 = rna + nb for suitable integers m and n. We say that a and b are relatively prime if (a. Proof We pick up the proof of Theorem 1.5 at mac + nbc = c. 5 The Integers 25 by going backwards through the calculations made to find 4: Since 16 = 1 (12) + 4. Since 100 = 3 (28) + 16. If band c are both relatively prime to a. that is.Sec. bc).4. An immediate corollary to Theorem 1. b).atively prime to a. hence (ma + nb)c = c. Since 28 = 1 (16) + 12. Since d I a and d I c . mac + nbc = c. hence d I (mac + nbc) = c.3 is Theorem 1.5. b) = 1. We shall include some exercises at the end of this section on other properties of (a.5. then bc is also rel. D Corollary. If a and b are relatively prime and a I bc. We come to the very important DefilJ!tion.5. then a I c. b) for any positive integers a and b.5.4. ma + nb = 1 for some m and n. If d = (a. a I bc and by observation a I mac.4 has an immediate consequence Theorem 1. Theorem 1. By assumption. hence a I (mac + nbc) and so a I c.5. Proof By Theorem 1. then d I a and d I bc. So the integers a and b are relatively prime if they have no nontrivial common factor.
and if a is an integer not relatively prime to p. Suppose that p { a 1. since m E M. If p is a prime and p I (ala2 . then p and al are relatively prime. so p divides a. A prime number. Then there must be an intger m > 1 for which the theorem fails. 1 and (a. It follows that a = 1. so.5 is Theorem 1.7. such that for any integer a either p I a or p is relatively prime to a. Another result coming out of Theorem 1. an). n. we cannot have a E M or b E M.5. namely that we cannot factor p nontrivial/yo For if p is a prime as defined above and p = ab where 1 ~ a < p. be). an' Repeat the argument just given on az.5. c) = 1. Section 4. by our hypothesis..6. or a prime. p) = a (Why?) and p does not divide a since p > a. hence by Theorem 1. p I a2 . we get that d = 1. then (a. This definition coincides with the usual one.5. But pial (a2 . Thus a and bare . by the definition of M the theorem must be true for both a and b. D We now single out an ultraimportant class of positive integers... then (a. Theorem 1. Clearly. Suppose that the theorem is false. by the wellordering principle. b) equals p. then either n is a prime or n is the product of Proof. if p is a prime in the sense that it cannot be factored nontrivially. Since 1 = d = (a. If n > 1. we have that be is relatively prime to a. The proofs of both these results lean heavily on the wellordering principle.5. M. primes.. the set M for which the theorem fails is nonempty. Because a < m and b < m and m is the least element in M.26 Things Familiar and Less Familiar Ch. b) is not 1 and it divides a and p. is an integer p > 1. M. Therefore. which we met before in Problem 30. where 1 < a < m and 1 < b < m. In the theorem after the next we shall show that there is a uniqueness about the way n > 1 factors into prime factors. an).5. b $.. But then (a. Since a $. there is nothing to prove.. and continue. M has a least element m. so p = b. thus m = ab. Proof If p I aI. We shall show this in the next theorem. then p I aj for some i with 1 $ i s. 0 The primes playa very special role in the set of integers larger than 1 in that every integer n > 1 is either a prime or is the product of primes. m cannot be a prime. Definition. On the other hand.
we easily get k = a2 = b 2 . from m = ab we get that m is a product of primes. So we see that the primes and their exponents arising in the factorization of m are unique.. we shall state the theorem in a particular way.. .. q~e. so there is a least integer m > 1 for which it is false. this form. contradicting that m E M.5. q is not divisible by any of Pb' . < Pk are primes and the exponents aI..... < Pk' ql < q2 < . the argument we shall give is due to Euclid. a2. . we could enumerate all the primes in PI' P2' .. Since PI Ip~l .Pk' Consider the integer q = 1 + PIP2 . .5. ak and bb ..5. Since Pi ~ q. . p%k = q~lq~2 .5. where PI < P2 < ..5.6. This puts m outside of M.1. < qe are primes and where the exponents ab .9. 3 = 3 . p%k = p~11 q~2 >:. q~e where PI < P2 < . Proof If the result were false.8. . thus PI ~ Pj = ql ~ qi = Pl' This gives us that PI = qi' Now since m/PI < m. m/PI has the unique factorization property.7... al . One would expect from this that there should be manythat is.. · p%k = q~l . There is an infinite number of primes.Pk' So q is not a prime nor is it divisible by any prime. hence.... This old result goes back to Euclid. . in a sense. ak are all positive. D . thereby proving the theorem. . hence PI = qi' By the same token ql = Pj for some j. for we get a remainder of 1 on dividing q by Pi. Pk' Since q > Pi for every i = 1. To avoid trivialities of the kind 6 = 2 . then there is one and only one way to write n in the form n = p~lp~2 . and so proves the theorem. This m must have two distinct factorizations as m = p~lp~2 . k.. ness for m. . PI I qi.Sec. This violates Theorem 1... 2 (so. q cannot be a prime. . 5 The Integers 27 primes or the product of primes. pC. be are all positive.Pk = qk... D What these last two theorems tell us is that we can build up the integers from the primes in a very precise and welldefined manner.. Theorem 1.. Proof We start as we did above by assuming that the theorem is false. .1 = b i .. .. . by Theorem 1. q~e and since m/PI can be factored in one and only one way in P2 = q2.6 PI I q7 for some i... But m/PI = p~11 p~2 .. Given n > 1. Theorem 1.. D We asserted above that there is a certain uniqueness about the decomposition of an integer into primes. We make this precise now. in fact.ak = b k . 6 has two factorizations into the primes 2 and 3)./. again by Theorem 1.. This proves the theorem.2.. This contradicts the lack of such unique1 e.. an infinityof primes...
2. 5040. (a) Show that v = mn/(m. 3. The famous prime number theorem states that for large n the number of primes less than or equal to n is "more or less" n/log e n. prove that it is enough to show that n is not divisible by any prime p with p ~ yj.. Pk are distinct primes and at..28 Things Familiar and Less Familiar Ch. 4.. 84). . p~k by describing the c's in terms of the a's and b's. express (m. * 7. b) if m > o. 6.. 9. (b) (85. 120. b) as ma + nb for: (a) (116. 720. (c) (72.. If m = p~l . 5. except part (d). n) as p~l . where this "more or less" is precisely described. Define the least common multiple of positive integers m and n to be the smallest positive integer v such that both m I v and n I v. ••. Find the least common multiple of the pairs given in Problem 1. (b) In terms of the factorization of m and n given in Problem 6. p%k and n = pr 1 • • • pZ\ where p t ..5.5. n).9 exist about how many primes there are up to a given point. 25). . then (ab) 1m. Factor the following into primes. what is v? 8. 1 Results much sharper than Theorem 1. Find (a. b k are nonnegative. If m.. b) and express (a. 26). v with n/2 ~ v ~ n/2 such that m = un + v. (a) (b) (c) (d) 36. . Show that (ma. To check that a given integer n > 1 is a prime. PROBLEMS Easier Problems 1. . n > 0 are two integers. ak are nonnegative and b t .2.. Prove all the parts of Lemma 1. b) = 1. Show that if a I m and b I m and (a. 10. . There are many open questions about the prime numbers. 65). mb) = mea. show that we can find integers u. (d) (72.. .
14.9 to prove: (a) There is an infinite number of primes of the form 4n + 3. 6 Mathematical Induction 29 11. If p is_a prime. Theorem 1. Then P(n) is true for all n 2:: 2:: Let pen) be a statement about the positive integers 1. .6we say "argue as above and continue. the set of all positive integers. (b) 1001. b are integers.1. (b) If P(k) happens to be true for some integer k true. 5. Adapt the proof of Theorem 1.Sec. prove that one cannot find nonzero integers a and b such that a2~ pb 2. Show that no integer u a. . 12. (b) There is an infinite number of primes of the form 6n + 5. Check if the following are prime. this will be the usual method that we shall use for proving theorems about all the positive integers. 1 + 2 . such that: (a) P(l) is true. where 16. This is provided for us by the Principle of Mathematical Induction. 15. then P(k + 1) is also 1. What is clear is that we need some technique of avoiding such phrases when we want to prove a proposition about all the positive integers. Starting with 2. MATHEMATICAL INDUCTION If we look back at Section 5. construct the positive integers 1 + 2· 3. we see that at several placesfor instance. show that there is a 11 mapping of Tonto N.) vIP 6. (c) 473.5. If T is an infinite subset of N." This is not very satisfactory as a means of nailing down an argument. (This shows that is irrational. 7.6. = 4n + 3 can be written as u = a2 + b 2 . in the proof of Theorem 1. (a) 301. 1 + 2 ·3 ·5. 17. 7. 3 ·5 .5. 3. Do you always get a prime number this way? MiddleLevel Problems 13. (b) 6n + 1 or 6n + 5 for some n. in fact. If P is an odd prime. show that p is of the form: (a) 4n + 1 or 4n + 3 for some n.
In this second possibility. then so is P(k + 1).1 contacts. hence pen) is true for all n 2:: 1. + k = ~k(k + 1). + k + (k + 1) = ~ (k + l)((k + 1) + I)? Now 1 + 2 + . Proof If pen) is the proposition that 1 + 2 + . the arguments given in proving Theorems 1. since P(k) is valid. So if pen) is what is stated above about the tennis balls. the matter is clear. and that p I ala2 .5. there is a least integer m 2:: 1 for which P(m) is not true.1. touching each other. for if pial. it certainly divides ai for some 1 ::::.6. m =I=. D We illustrate how to use induction with some rather diverse examples. for 1 = ~ (1 + 1). If for k balls we have k . Thus. ak)ak+l either p I ak+l (a desired conclusion) or pial· . k) + (k + 1) = ~k(k + 1) + (k + 1). 1 Proof Actually.1 contacts.5.5. Examples 1. This contradicts that P(m) is not true. If p is a prime and p I ala2 . then adding one ball (on aline) adds one contact... then P(l) is certainly true. Then P(l) is true.1 < m. i ~ 1... So Part (b) of Theorem 1. But ~k(k + 1) + (k + 1) = ~(k(k + 1) + 2(k + 1» = ~(k + l)(k + 2).. So k + 1 balls would have k contacts.. hence m > 1. Thus there can be no integer for which P is not true.1 holds.. so by the choice of m. and so the theorem is proved.. Thus the proposition 1 + 2 + . The question is: Is P(k + 1) then also true.. Suppose that n tennis balls are put in a straight line. by wellordering. +n = ~n(n + 1). that is.. since p I (ala2 . Proof If n = 2. + n = ~n(n + 1). For n 2:: 1. Now 1 ~ m . since P(k) is true we have that p I ai for some 1 :5 i ::::. D 2. Since P(l) is true. by Theorem 1. this means that 1 + 2 + . If P(k) should be true. But then by the inductive hypothesis [Part (b)] we must have that P(m) is true.. Proof Let pen) be the statement in Example 2.1 + 2 + . ak ak+l .. Combining both possibilities.. ak.. by the theorem.. then. 0 3. Suppose that the theorem is false. + n = ~ n(n + 1) is true for all n 2:: 1...0 . we see that if P(k) happens to be true.7 and 1.. Thus. k.. which assures us that P(k + 1) is true..1) must be valid..30 Things Familiar and Less Familiar Ch. Then we claim that these balls make n . Suppose we know that P(k) is true. + k + (k + 1) = (1 + 2 + . we get that p I aj for some 1 ~ j ~ k + 1. P(m .8 are a prototype of the argument we give here. P(n) is true for all n 2:: 1. an' then p I ai for some 1 ~ i ~ n.6. is 1 + 2 + .
7.. Suppose that P(n) is a proposition about positive integers n such that P(no) is valid. prove that 1 + a + a2 + ..1] holds. then P (k) is true. Give an example of a proposition that is not true for any positive integer. show that N 1 + 1 + . Prove that a set having n ~ 2 elements has ~n(n .l)/(a .. 4.1 is usually called the induction step. Prove that a set having n ~ 3 elements has n(n . guess (from Problems 3 and 4) how many subsets having exactly 4 elements there are in S. By induction. MiddleLevel Problems 11. Part (b) of Theorem 1. Prove that 12 + 22 + 32 + .... Complete the proof of Theorem 1.. We could.6. 6 Mathematical Induction 31 We must emphasize one point here: Mathematical induction is not a method for finding results about integers.5.3 n(n + 1) 1 n n + 1. In the problems we shall give some other versions of the principle of induction. Prove by induction that a set having n elements has exactly 2 n subsets.6.1) by induction. Then verify your guess using mathematical induction. by other means. 3.l)(n . 2.6. + an = (a n+ 1 . If n .1) subsets having exactly two elements. Prove that P(n) is true for all positive integers n. 8.. and if P(k) is true. it is a means of verifying a result. + n.2)/3! subsets having exactly three elements. find the formula given above for 1 + 2 + . . + n 3 = ~n2(n + 1)2 by induction. replacing the last sentence by an induction argument. yet for which the induction step [Part (b) of Theorem 1. PROBLEMS Easier Problems 1. *10. Let pen) be a proposition about integers n such that P(l) is true and such that if P (j) is true for all positive integers j < k. If a =1= 1.::: 4 and S is a set having n elements. so must P(k + 1) be. Prove that 13 + 23 + . 5. * 9.. What can you say about pen)? Prove your statement.. *6.Sec. 12. + 2. + n 2 = ~n(n + 1)(2n + 1) by induction.
COMPLEX NUMBERS We all know something about the integers. this assumption has been made for some of the text material and many of the problems have referred to these numbers. This is no longer the case. If a is 0. generalize the result in Problem 13 to: If p is a prime number. At one time the complex numbers were a part of the high school curriculum and the early college one. (c) (zw) = zw (d) zz is real and nonnegative and is. there is a complex number related to z. which we write as Z. w E C. then: (a) (Z) = z. Using induction on n. bare real and where we declare: c + di. defined by Z = a . (a + bi)(c + di) = (ac . the complex numbers and their properties are much less known to presentday college students. Prove by induction on n that n 3 n is always divisible by 3.bd) + (ad + bc)i.1. if and only if a = c and b = d. for a. (b) (z + w) = Z + w.7. rational numbers. We shall write 0 + Oi as 0 and a + Oi as a. (a + bi) ± (c + di) = (a ± c) + (b ± d)i. and real numbersindeed.32 Things Familiar and Less Familiar  Ch. So we shall do a rapid development of this very important mathematical set. . Taking the complex conjugate gives us a mapping of C onto itself. For the complex number z = a + bi. a = + bi 3. where a. c. then n P . (Hint: The binomial theorem. Note that z + 0 = z. b. Unfortunately. positive if z =1= O. Z. we call z purely imaginary. 1. This complex number.bi.) 15. This last propertymultiplicationcan best be remembered by using i 2 = 1 and multiplying out formally with this relation in mind. 2. The set of complex numbers.n is always divisible by p. Given z = a + bi. 7. Prove by induction that for a set having n elements the number of 11 mappings of this set onto itself is n!. is the set of all a + bi. z1 = z for any complex number z. d real. C. is called the complex conjugate of z. We claim Lemma 1. 1 13. in fact. a is called the real part of z and b the imaginary part of z. If z. 14.
so = a + bi =1= 0. We do verify Parts (c) and (d).(ad + be)i. v. . b are real and not both 0. Lemma 1. Proof We leave the proofs of these various parts to the reader. where a.1 = 1. This verifies Part (c). b. staying in C while doing so. a2 + b 2 is real and positive.di. we see that indeed (zw) = Z w. z = a . Suppose that z = a + bi. On the other hand. However. 0 These properties of C make of C what we shall call a field. d are real.bd) + (ad + be)i = (ae .bd) . 0 implies that u.bd) + (ad + be)i.7. e. then acts like the inverse liz of z. hence.1 = = 11u exists in C such that uu. So zw = (ae .bi and W = e . We go next to the proof of Part (d). What the lemma says is that we are allowed to calculate in C more or less as we did with real numbers. W = e + di.1 shows that if z zz = a2 + b2 ¥Oandz(zl(a 2 + b2 » = 1.7. Suppose that z = a + bi =1= 0. (f) z  z is twice the imaginary part of z times i. (b) (u + v) + w = u + (v + w). w E C.Sec.(ad + be) i. Proof Most of the parts of this lemma are straightforward and merely involve using the definition of complex conjugate. as asserted in Part (d). VUe (d) (uv)w u(vw). C has a much richer structure than the set of real numbers. then (a) u + v (c) uv = (e) u =1= = v + u. hence (zw) = (ae . z W = (ae . Comparing this with the result that we obtained for (zw). This allows us to carry out division in C. D The proof of Part (d) of Lemma 1. which we shall study in much greater depth later in the book.bd) . 7 (e) Complex Numbers 33 z + z is twice the real part of z. then z = a .bi and zz = a2 + b2 • Since a. by the definition of the product in C. We now list a few properties of C.2. C behaves under its sum and product according to the following: If u.
For z.7. and since a > 0.bd)2 + (ad + be)2 = (a 2 + b 2)(e2 + d 2). Iz + w I :5 Iz I + IwI· =I= Proof If z = 0.z.a = Ya = a + bi E C.1) (by Lemma 1.5 below. 0. If z. this will come up in the proof of Theorem 1. Now luvl = Y(uv) (uv) = Y(uv) (uv) = = (by Part (c) of Lemma 1. Lemma 1. Proof By definition. is defined by Izi = v.b 2)/4a ~ 0. with a > O. In the meantime we prove Lemma 1. and so b 2 .3. If aa 2 + ba + e every real a. there is nothing to prove. written as 2 + b2• We shall see.4ae :5 O. Note several small points about conjugates. w E C. v E C. then (zW + zw) = zW + zW = zw + zW.7. If z E C. so zW + ZW is real. we obtain that (4ae .5 (Triangle Inequality). then the absolute value of z. Definition.7.bd) + (ad + bc)i and to note the identity (ae . 1 We now introduce a "size" function on C. . But first we must digress for a moment to obtain a statement about quadratic expressions.2) Y(uu)(vv) ~~ = lullvl·D uv = Another way of verifying this lemma is to write u = a + bi. then luvl = lullvl. 2::  0 for Proof Consider the quadratic expression for a = bI2a. We want to get an upper bound for Izw + zwl. Iu I = ~ and Iv I = ~.b 2 ~ 0. in a few moments.4ae :5 O. b.34 Things Familiar and Less Familiar Ch.4. then z is real if and only if z = z. Now. If u. e be real.7.7. Simplifying this.7. so we may assume that z thus zz > O. w E C. v (ae . Let a. for a real. and z is purely imaginary if and only if z = . what this last definition means geometrically. we end up with 4ae . D We use this result immediately to prove the important Theorem 1. We get a( bI2a)2 + b( bI2a) + e 2:: O. If z Izl. = e + di. then b 2 .
a = r cos (). zW + zw ::::.4ac = (zw + ZW)2 . we have the picture: (a +c. Izl 2 + Iwl 2 + 21z1 Iwl 2 from the result above. b+d) .4(zz)(ww) ::::. as we see. Also. 4(zz)(ww) = 41z1 2 1w1 2 • Therefore. 0 Why is this result called the triangle inequality? The reason will be clear once we view the complex numbers geometrically. Iz + W 1 ::::. then Lemma 1. The angle () is called the argument of z and. tan () = b/a. Represent the complex number z = a + bi as the point having coordinates (a. Izl. z = a + bi = r (cos () + i sin (). 0. In other words. For a = 1 above. b2 Iz + wl 2 = zz + wW + zW + zw = Izl 2 + Iwl 2 + zW + zw ::::. 2lzllwl. This representation of z is called its polar form.4 tells us that . b = zW + zw. therefore. c = wW.Sec. Given z = a + bi. hence (zw + ZW)2 ::::. taking square roots we get the desired result. 7 0::::.7. (I z I + Iw I)2. Geometrically. then their sum is z + w = (a + c) + (b + d)i. The distance r of this point from the origin is Va 2 + b 2 . Iz + wi::::. b = r sin (). w = c + di. b) in the xy plane. If a = ZZ > 0. Iz I + Iwi. laz + Complex Numbers 35 wl 2 = = (az + w) (az + w) 2 = (az + w)(az + W) a zz + a(zW + ZW) + wW. in other words.
::: 1. This has another very interesting consequence. in fact.sin () sin t/J) + i rs( sin () cos t/J + cos () sin t/J) rs[cos( () + t/J) + i sine () + t/J)]. 21T ()n = cos . Icos () + isin ()j = Vcos 2 () + sin2 () = Vi = 1. it is true even if m is rational. Specifically. thus the term triangle inequality. r(cos () + i sin () = r k ( cos k() + i sin k() .::: 1. in multiplying two complex numbers. For any integer n . cos( () + t/J) = cos () cos t/J . the statement is obviously true. . then zw = r( cos () + i sin () . The complex numbers that come up in the polar form cos () + i sin () are very interesting numbers indeed. hence the result is true for all integers n . 1 The statement Iz + wi::::. the argument of the product is the sum of the arguments of the factors. so they give us many complex numbers of absolute value 1. .sin () sin t/J and sine () + t/J) = sin () cos t/J + cos () sin t/J. to see this just go back and look at the polar form of such a number. [r( cos () + i sin ()]k r k [cos k() + i sin k()]. Assume then that for some k.7. n n where n ~ 1 is an integer. = = Thus. [r(cos () + isin ()]n = rn[cos(n() + isin(n()]. Izi + Iw I merely reflects the fact that in a triangle one side is of smaller length than the sum of the lengths of the other two sides.36 Things Familiar and Less Familiar Ch. Let's recall two basic identities from trigonometry.. 0 In the problems we shall see that De Moivre's Theorem is true for all integers m. r( cos () + i sin () = r k + 1 [ cos (k + 1) () + i sin (k + 1) ()] by the result of the paragraph above. s( cos t/J + i sin t/J) rs( cos () cos t/J . Proof We proceed by induction on n. if z = r( cos () + i sin () and w = s( cos t/J + i sin t/J). Therefore. If n = 1. Thus [r(cos () + i sin ()]k+l = [r(cos () + i sin ()]k .6 (De Moivre's Theorem). Consider the following special case: 21T. Theorem 1.+ l sIn . In truth they give us all the complex numbers of absolute value 1. This completes the induction step.
Sec. 7
Complex Numbers
37
By De Moivre's Theorem,
21T) sIn n ( cos (n + l.' (21T))n 7T = cos )) + i sin
(n en
(n (2n 7T))
= cos
21T +
i sin
21T =
1.
So O~ = 1; you can verify that 0':: =1= 1 if 0 < m < n. This property of On makes it one of the primitive nth roots of unity, which will be encountered in Problem 26.
PROBLEMS
Easier Problems
1. Multiply.
(a) (6  7i)(8 + i). (b) (~ + ~i)(~  ~i). (c) (6 + 7i)(8  i).
2. Express Z1 in the form (a) z = 6 + 8i. (b) z = 6 8i. 1 . 1 (c) z = v2 + v2 l .
*3. Show that (Z)l
Z1 =
a + bi for:
=
(Zl).
4. Find (cos 0 + i sin 0)1. 5. Verify parts a, b, e, fof Lemma 1.7.1.
* 6. show that z is real if and only if
only if
z=
z=
z,
and is purely imaginary if and

z.
zw
=
7. VerIfy the commutative law of multiplication
wz
in C.
8. Show that for
9. Find: (a) 16  4il· (b) I~ + ~il·
(c)
z =1=
0,
Iz 1= 1/1zl.
1
l~ + ~il·
Izi
=
10. Show that
Izi.
38
Things Familiar and Less Familiar
Ch. 1
11. Find the polar form for V2 1 .
(a
)z
z
==
"2"  V2 l.
(b)
== ==
(c) (d)
z
z
4i. 66.
V2
+
V2 l.
cos 0 + i sin
== 
13 + 39_ i. 2 2V3
==
12. Prove that (cos(~ 0) + i sin(~ 0»2
o.
==
13. By direct mUltiplication show that (~ + ~V3 i)3 MiddleLevel Problems
1.
14. Show that (cos 0 + isin o)'n == cos(mO) + isin(mO) for all integers m. 15. Show that (cos 0 + i sin o)r == cos (rO) + i sin(rO) for all rational numbers r. 16. If z E C and n ;::: 1 is any positive integer, show that there are n distinct complex numbers w such that z == w n • 17. Find the necessary and sufficient condition on k such that:
27Tk) (cos (n 27Tk) (cos (n
+ 1.. (27Tk))n == SIn
n
1
and
.. n + 1 SIn (27Tk))rn
¥ 1
if 0 < m < n.
18. Viewing the xy plane as the set of all complex numbers x + iy, show that multiplication by i induces a 90° rotation of the xy plane in a counterclockwise direction. 19. In Problem 18, interpret geometrically what multiplication by the complex number a + bi does to the xy plane.
2 *20. Prove that Iz + w 1 + Iz  wl 2 == 2(lz/ 2 + IwI 2). 21. Consider the set A == {a + bi I a, b E Z}. Prove that there is a 11 correspondence of A onto N. (A is called the set of Gaussian integers.)
22. If a is a (complex) root of the polynomial
where the (Xi are real, show that polynomialp(x) ifp(r) == 0.]
a must
also be a root. [r is a root of a
Sec. 7
Complex Numbers
39
Harder Problems
23. Find the necessary and sufficient conditions on
z and
... ,
w in order that
Iz + wi = Izl + Iwl·
24. Find the necessary and sufficient conditions on
Z1,
Zk in order that
IZI + ... + zkl
=
IZII + ... + IZkl·
*25. The complex number () is said to have order n ;::: 1 if ()n = 1 and ()m =1= 1 for 0 < m < n. Show that if () has order nand ()k = 1, where k > 0, then n I k. *26. Find all complex numbers () having order n. (These are the primitive nth
roots of unity.)
2
GROUPS
1. DEFINITIONS AND EXAMPLES OF GROUPS
We have seen in Section 4 of Chapter 1 that given any nonempty set, the set A (S) of all 11 mappings of S onto itself is not just a set alone, but has a far richer texture. The possibility of combining two elements of A(S) to get yet another element of A(S) endows A (S) with an algebraic structure. We recall how this was done: If f, g E A (S), then we combine them to form the mapping fg defined by (fg)(s) = f(g(s» for every s E S. We called fg the product of f and g, and verified that fg E A (S), and that this product obeyed certain rules. From the myriad of possibilities we somehow selected four particular rules that govern the behavior of A (S) relative to this product. These four rules were
1. Closure, namely if f, g E A(S), then fg E A(S). We say that A(S) is closed under this product. 2. Associativity, that is, givenf, g, h E A(S), thenf(gh) = (fg)h. 3. Existence of a unit element, namely, there exists a particular element i E A (S) (the identity mapping) such that fi = if = f for all f E A (S). 4. Existence of inverses, that is, given f E A (S) there exists an element, denoted by fI, in A(S) such that ff 1 = flf = i.
To justify or motivate why these four specific attributes of A (S) were singled out, in contradistinction to some other set of properties, is not easy to
40
Sec. 1
Definitions and Examples of Groups
41
do. In fact, in the history of the subject it took quite some time to recognize that these four properties played the key role. We have the advantage of historical hindsight, and with this hindsight we choose them not only to study A (S), but also as the chief guidelines for abstracting to a much wider context. Although we saw that the four properties above enabled us to calculate concretely in A (S), there were some differences with the kind of calculations we are used to. If S has three or more elements, we saw in Problem 15, Chapter 1, Section 4 that it is possible for f, g E A (S) to have fg =1= gf. However, this did not present us with insurmountable difficulties. Without any further polemics we go to the Definition. A nonempty set G is said to be a group if in G there is defined an operation * such that: (a) a, bEG implies that a closed under *.)
* bEG.
(We describe this by saying that G is
(b) Given a, b, c E G, then a * (b * c) = (a * b) saying that the a$sociative law holds in G.)
* c.
(This is described by
(c) There exists a special element e E G such that a * e a E G (e is called the identity or unit element of G).
= e * a = a for all
(d) For every a E G there exists an element bEG such that a * b = b * a = e. (We write this element b as aI and call it the inverse of a in G.) These four defining postulates (called the group axioms) for a group were, after all, patterned after those that hold in A (S). So it is not surprising that A(S) is a group relative to the operation "composition of mappings." The operation * in G is usually called the product, but keep in mind that this has nothing to do with product as we know it for the integers, rationals, reals, or complexes. In fact, as we shall see below, in many familiar examples of groups that come from numbers, what we call the product in these groups is actually the addition of numbers. However, a general group need have no relation whatsoever to a set of numbers. We reiterate: A group is no more, no less, than a nonempty set with an operation * satisfying the four group axioms. Before starting to look into the nature of groups, we look at some examples.
Examples of Groups
1. Let 7L be the set of all integers and let * be the ordinary addition, +, in 7L. That 7L is closed and associative under * are basic properties of the integers. What serves as the unit element, e, of 7L under *? Clearly, since a = a * e =
2 a + e. The picture below illustrates the group £6. whereas En has a finite number. Again it is easy to check that IR + is a group under *. The number of elements in G is called the order of G and is denoted by IG I. . A group G is said to be a finite group if it has a finite number of elements. n. where On is the complex number On = Cos(27T/n) + i sin(27T/n). Note that 7L C 0 and both 7L and 0 are groups under the same operation *. Definition. Let O~ * 0h = O~+j. 4. the aI in this instance is a. 5. 0 is easily shown to be a group under *. and 0 is the required identity element under addition.. What about aI? Here too. and clearly a * (a) = a + (a) = o. Let 0' be the set of all nonzero rational numbers and let the operation * on 0' be the ordinary multiplication of rational numbers. Let 0 be the set of all rational numbers and let the operation * on 0 be the ordinary addition of rational numbers. . of elements. By the familiar properties of the rational numbers we see that 0' forms a group t:elative to *. we have e = 0. Note one striking difference between the Examples 1 to 4 and Example 5. since e = 0 = a * aI = a + aI. 2. By De Moivre's Theorem we saw that O~ = 1. i = 0. 2. As above. the first four have an infinite number of elements.42 Groups Ch. the ordinary product of the powers of On as complex numbers. whose elements are represented by the dots on the unit circle in the complex plane. 1. 3. n . Let IR+ be the set of all positive real numbers and let the operation * on IR + be the ordinary product of real numbers. The elements of En are called the n'th roots of unity. . Let En be the set of O~.1.. We leave it to the reader to verify that En is a group under *.
6. Since we are talking about the product of mappings (i. So (Ta.6. Definition.d(r) + b = a(er + d) + b (ae)r + (ad + b) = Tae. So we have the formula Ta. * is associative.6(r) = 5r . T s.Sec. such that .6 = 64. All the examples presented above satisfy the additional property that a * b = b * a for any pair of elements.ad+b· (1) This result shows us that Ta. Let IR be the set of all real numbers. bEG. what is T~.d(r» = aTe. where S has three or more elements. Of course.b * Te. and I En I = n. But we present a few other examples in which we can compute quite readily.b * Te.6. Just witness the case of A (S).6 is such that T S . The Ta.b(r) = ar + b for any real number r. The element T1.o = i is the identity mapping of IR onto itself.d = Tae. b * Te .. one of the greatest scientists Norway has ever produced. Finally.ad+b(r).b E G. and we let Ta .b are 11 mappings of IR onto itself. This prompts us to single out as special those groups of G in which a * b = b * a for all a. a not too surprising choice of name. d be the product of two of these mappings. A group that is not abelian is called nonabelian.6( 1T) = 571" . and let G be the set of all mappings Ta. T S. 1 Definitions and Examples of Groups 43 Thus En above is a finite group.b(Te.e. A group G is said to be abelian if a *b = b * a for all The word abelian derives from the name of the great Norwegian mathematician Niels Henrik Abel (18021829). the A(S) afford us an infinite family of such. for instance. T S .lb? Can we find real numbers x =1= 0 and y. the composition of mappings). We now give examples of some nonabelian groups.d)(r) = = Ta. a.b: IR ~ IR defined by Ta.d is in Gfor it satisfies the membership requirement for belonging to Gso G is closed under *. b are real numbers and a =1= o. there we saw that we could find f. Thus. This need not be true in a group.b * Te.6(14) = 5 . g E A (S) such that fg =1= gf. 14 . where a.
44
Groups
Ch.2
Go back to (1) above; we thus want Tax,ay+b = TI,o, that is, ax = 1, ay + b = 0. Remember now that a =I= 0, so if we put x = aI and y = a1b, the required relations are satisfied. One verifies immediately that
So G is indeed a group. What is Tc , d * Ta , b? According to the formula given in (1), where we replace a bye, e by a, b by d, d by b, we get
(2)
Thus Tc,d * Ta,b = if Ta,b * Tc,d and only if be + d = ad + b. This fails to be true, for instance, if a = 1, b = 1, e = 2, d = 3. So G is nonabelian. 7. Let H C G, where G is the group in Example 6, and H is defined by H = {Ta , bEG I a is rational, b any real}. We leave it to the reader to verify that H is a group under the operation * defined on G. H is nonabelian. 8. Let K C He G, where H, G are as above and K = {TI,b E G I b any real}. The reader should check that K is a group relative to the operation * of G, and that K is, however, abelian. 9. Let S be the plane, that is, S = {(x, y) Ix, y real} and consider f, g E A(S) defined by f (x, y) = (x, y) and g(x, y) = (y, x); f is the reflection about the yaxis and g is the rotation through 900 in a counterclockwise direction about the origin. We then define G = {fig j Ii = 0, 1; j = 0, 1, 2, 3}, and let * in G be the product of elements in A(S). Clearly, f2 = g4 = identity mapping; (f* g)(x, y) = (fg)(x, y) = f(g(x, y» = f(  y, x) = (y, x) and (g*f)(x,y) = g(f(x,y» = g(x,y) = (y, x). So g * f =1= f * g. We leave it to the reader to verify that g * f = f * gl and G is a nonabelian group of order 8. This group is called the dihedral group of order 8. [Try to find a formula for (fig j ) * (fSgt) = fagb that expresses a, b in terms of i, j, s, and t.] 10. Let S be as in Example 9 and f the mapping in Example 9. Let n > 2 and let h be the rotation of the plane about the origin through an angle of 27T/n in the counterclockwise direction. We then define G = {fkh j I k = 0, 1; j = 0, 1,2, ... , n  I} and define the product * in G via the usual product of mappings. One can verify that f2 = h n = identity mapping, and fh = hIr
Sec. 1
Definitions and Examples of Groups
45
These relations allow us to show (with some effort) that G is a nonabelian group of order 2n. G is called the dihedral group of order 2n. 11. Let G = {f E A (S) If(s) =1= s for only a finite number of s E S}, where we suppose that S is an infinite set. We claim that G is a group under the product * in A (S). The associativity holds automatically in G, since it already holds in A(S). Also, i E G, since i(s) = s for all s E S. So we must show that G is closed under the product and if f E G, then f 1 E G. We first dispose of the closure. Suppose that f, g E G; then f(s) = s , s~. Then except, say, for SI, S2, ••• 'Sn and g (s) = s except for s{, s~, (fg)(s) = f(g(s» = s for all s other than SI, S2, ••• , Sn, s{, ,s/n (and possibly even for some of these). So fg moves only a finite number of elements of S, so fg E G. Finally, if f(s) = s for all s other than Sb S2' ••• , Sn, then fl(f(s» = f 1(s), but fl(S) = fl(f(s» = (flf)(s) = i(s) = s. So we obtain that f 1(s) = s for all s except SI, ••• 'Sn. Thus f 1 E G and G satisfies all the group axioms, hence G is a group. 12. Let G be the set of all mappings T(j, where To is the rotation of a given circle about its center through an angle () in the clockwise direction. In G define * by the composition of mappings. Since, as is readily verified, To * T", = T 0+ '" ' G is closed under *. The other group axioms check out easily. Note that 121T = To = the identity mapping, and r;l = T o= 121To. G is an abelian group. As we did for A (S) we introduce the shorthand notation an for
,a * a * a·· · * a n times
and define a n = (a 1)n, for n a positive integer, and aO = e. The usual rules of exponents then hold, that is, (am)n = a mn and am * an = a m+ n for any integers m and n. Note that with this notation, if G is the group of integers under +, then an is really na. Having seen the 12 examples of groups above, the reader might get the impression that all, or almost all, sets with some operation * form groups. This is far from true. We now give some examples of nongroups. In each case we check the four group axioms and see which of these fail to hold.
Nonexamples
1. Let G be the set of all integers, and let * be the ordinary product of integers in G. Since a * b = ab, for a, bEG we clearly have that G is closed and associative relative to *. Furthermore, the number 1 serves as the unit ele
46
Groups
Ch.2
ment, since a * 1 = a1 = a = 1a = 1 * a for every a E G. So we are threefourths of the way to proving that G is a group. All we need is inverses for the elements of G, relative to *, to lie in G. But this just isn't so. Clearly, we cannot find an integer b such that 0 * b = Ob = 1, since Ob = 0 for all b. But even other integers fail to have inverses in G. For instance, we cannot find an integer b such that 3 * b = 1 (for this would require that b = ~, and ~ is not an integer). 2. Let G be the set of all nonzero real numbers and define, for a, bEG, a * b = a2 b; thus 4 * 5 = 42 (5) = 80. Which of the group axioms hold in Gunder this operation * and which fail to hold? Certainly, G is closed under *. Is * associative? If so, (a * b) * c = a * (b * c), that is, (a * b )2C = a 2(b * c), and so (a 2b)2c = a 2(b 2c), which boils down to a2 = 1, which holds only for a = ±1. So, in general, the associative law does not hold in G relative to *. We similarly can verify that G does not have a unit element. Thus even to discuss inverses relative to * would not make sense. 3. Let G be the set of all positive integers, under * where a * b = ab, the ordinary product of integers. Then one can easily verify that G fails to be a group only because it fails to have inverses for some (in fact, most) of its elements relative to *. We shall find some other nonexamples of groups in the exercises.
PROBLEMS Easier Problems
1. Determine if the following sets G with the operation indicated form a group. If not, point out which of the group axioms fail. (a) G = set of all integers, a * b = ab. (b) G = set of all integers, a * b = a + b + abo (c) G = set of nonnegative integers, a * b = a + b. (d) G = set of all rational numbers =1= 1, a * b = a + b + abo (e) G = set of all rational numbers with denominator divisible by 5 (written so that numerator and denominator are relatively prime), a * b =
a + b. (f) G a set having more than one element, a
* b = a for all a, bEG.
=
2. In the group G defined in Example 6, show that the set H b any real} forms a group under the * of G. 3. Verify that Example 7 is indeed an example of a group.
{Ta,b Ia
=
±1,
Sec. 1
Definitions and Examples of Groups
47
4. Prove that K defined in Example 8 is an abelian group. 5. In Example 9, prove that g * f = f * gt, and that G is a group, is nonabelian, and is of order 8. 6. Let G and H be as in Examples 6 and 7, respectively. Show that if Ta,b E G, then Ta,b * V * T;;:l E H if V E H. 7. Do Problem 6 with H replaced by the group K of Example 8. 8. If G is an abelian group, prove that (a * b)n = an * b n for all integers n. 9. If G is a group in which a2 = e for all a E G, show that G is abelian.
10. If G is the group in Example 6, find all Ta,b E G such that Ta,b * TI,x = Tl,x * Ta,b for all real x. 11. In Example 10, for n = 3 find a formula that expresses (fi h j ) * (fS h t ) as fah b • Show that G is a nonabelian group of order 6.
12. Do Problem 11 for n = 4. 13. Show that any group of order 4 or less is abelian. 14. If G is any group and a, b, c E G, show that if a and if b * a = c * a, then b = c. 15. Express (a * b)I in terms of aI and b 1.
* b = a * c, then b = c,
16. Using the result of Problem 15, prove that a group G in which a = aI for every a E G must be abelian. 17. In any group G, prove that (aI)I = a for all a E G. *18. If G is a finite group of even order, show that there must be an element a =1= e such that a = aI. (Hint: Try to use the result of Problem 17.)
19. In S 3, show that there are four elements x satisfying x 2 = e and three elements y satisfying y3 = e. 20. Find all the elements in S4 such that x 4 = e.
MiddleLevel Problems 21. Show that a group of order 5 must be abelian. 22. Show that the set defined in Example 10 is a group, is nonabelian, and has order 2n. Do this by finding the formula for (fih j ) * (fsh t ) in the form fah b • 23. In the group G of Example 6, find all elements U E G such that U * Ta,b = Ta,b * U for every Ta,b E G. 24. If G is the dihedral group of order 2n as defined in Example 10, prove that: (a) If n is odd and a EGis such that a * b = b * a for all bEG, then a = e. (b) If n is even, show that there is an a E G, a =1= e, such that a * b = b * a for all bEG.
48
Groups
Ch.2
(c) If n is even, find all the elements a E G such that a for all bEG.
*b
=
b
*a
25. If G is any group, show that: (a) e is unique (i.e., if f E G also acts as a unit element for G, then f = e). (b) Given a E G, then aI EGis unique.
*26. If G is a finite group, prove that, given a E G, there is a positive integer n, depending on a, such that an = e. *27. In Problem 26, show that there is an integer m > 0 such that am = e for all a E G. Harder Problems
28. Let G be a set with an operation * such that: 1. G is closed under *. 2. * is associative. 3. There exists an element e E G such that e * x = x for all x E G. 4. Given x E G, there exists ayE G such that y * x = e. Prove that G is a group. (Thus you must show that x * e = x and x * y = e for e, y as above.)
29. Let G be a finite nonempty set with an operation * such that: 1. G is closed under *. 2. * is associative. 3. Given a, b, c E G with a * b = a * c, then b = c. 4. Given a, b, c, E G with b * a = C * a, then b = c. Prove that G must be a group under *. 30. Give an example to show that the result of Problem 29 can be false if G is an infinite set. 31. Let G be the group of all nonzero real numbers under the operation * which is the ordinary multiplication of real numbers, and let H be the group of all real numbers under the operation #, which is the addition of real numbers. (a) Show that there is a mapping F: G ~ H of G onto H which satisfies F(a * b) = F(a)#F(b) for all a, bEG [i.e., F(ab) = F(a) + F(b)]. (b) Show that no such mapping F can be 11.
2. SOME SIMPLE REMARKS
In this short section we show that certain formal properties which follow from the group axioms hold in any group. As a matter of fact, most of these results have already occurred as problems at the end of the preceding section.
However. (ab)l == bIa. Similarly. Suppose that b. (d) For a. We shall always write it as aI. c E G act as inverses for a. then b == c.1 == e. no. for a E G.l . for then e == ef and f == ef. To see this.l . and our result is established. in any abelian group. hence e == ef == f. then e == f This is very easy. we cannot conclude from ab == ca that b == c. by the associative law. then: (a) Its identity element is unique. Hence.1)a. we shall prove a stronger result (listed below as Lemma 2.2.2. 2 Some Simple Remarks 49 now on we shall write the product a It is a little clumsy to keep writing the * for the product in G. that is. (al)l == a. bEG.1 (associative law) (again the associative law) == (a(bb. (bIal)(ab) == e. f E G and af == fa == a for all a E G and ae == ea == a for all a E G. Thus from ab == ac we have u(ab) == u(ac).1) == ((ab)b1)a. Instead of proving Part (b).2). To see Part (c). for Part (d) we calculate (ab)(b1a. as required. Finally.Sec. We claim that in a group G if ab == ac. D . that is.1 == aa. (ua)b == (ua)c. so. so by cancellation b == c and we see that the inverse of a is unique. What is expected of us to carry out the proof? We must show that if e. If G is a group. so by cancellation in al(al)l == e == aIa we get that (al)l == a. Hence b == eb == ec == c. bEG. note that by definition aI (al)l == e. Now to get Part (b) as an implication of the cancellation result.1. The first such formal results we prove are contained in Lemma 2. (ab)l == bIa. A similar argument shows that if ba == ca. yes. but in general. we have. then ab == e == ac. an element u E G such that ua == e. and from * b simply as ab for all a. (c) If a E G.1 == (ae )a. by definition. Proof We start with Part (a). which will have Part (b) as an immediate consequence. eb == ec. (b) Every a E G has a unique inverse aI E G. we can cancel a given element from the same side of an equation. then b == c. but aI a == e.
and 2. There is only a scant bit of mathematics in this section. (c) (aba1b1)n(nl) = e.1 for all integers r. bEG. i + 1 such that G is not abelian but does have the property that (ab)i = aib i and (ab)i+l = ai+1b i+1 for all a. bEG. bEG. [Hint for Part (c): Note that (aba1)r = abra. For all a. we give only a few problems. prove that G is a group. u. 4. we have: (a) If ab = ac. 5. then b = c. 6.50 Groups Ch. In any group G and a. there is a u E G such that ua = w. prove that: (a) (ab)nl = bn1a n. show that there is a group G and consecutive numbers i. then b =c. PROBLEMS 1.1 translates into the familiar (a) = a.] . y. w E G. y E G.1. Show that G is abelian. Let G be a group in which (ab)n = anb n for some fixed integer n > 1 for all a. prove that G is abelian. there is an x E G such that ax = y. given a." In other words.2. (This is a repeat of a problem given earlier. note that if G is the group of real numbers under +. If G is a group in which (ab)i = aib i for three consecutive integers i. We keep this promise and write Lemma 2. (b) If ba = ca. Show that G is a group.1 = bn1a n.bin G.) 3. w E G. *2. Before leaving these results. Suppose that G is a set closed under an associative operation such that 1. for every a.2. accordingly. b. It will be used in the body of the text later. If G is a finite set closed under an associative operation such that ax = ay forces x = y and ua = wa forces u = w. given a. then Part (c) of Lemma 2.2 We promised to list a piece of the argument given above as a separate lemma. Let G be a group in which (ab)3 = a 3b 3 and (ab)5 = a 5b5 for all a. c E G. (b) anb n. x. No indication is given as to the difficulty of these.2. Show that the result of Problem 3 would not always be true if the word "three" were replaced by "two.
. Some of the groups we consider are those we introduced as example~ in Section 1. and are such that the information gathered about them can be used to get relevant information and insight about G itself. We would strongly recommend that the reader carry out such a verification in lots of the others and try to find other examples for themselves. A group is distinguished from an ordinary set by the fact that it is endowed with a wellbehaved operation. the proper subgroups of G. whatever we choose as such pieces. SUBGROUPS In order for us to find out more about the makeup of a given group G. In some of these examples we shall verify that certain specified subsets are indeed subgroups. Since the associative law holds universally in a group G. for a given subset A of G. 3 Subgroups 51 3. we want to look at some specific subgroups of some particular. of a group G is called a subgroup of G if. whether A is closed under the operation of G. and finally. given a E A. We stress the phrase "relative to the product in G. A is a group. then the associative law certainly holds for them. we are led almost immediately to the concept of a subgroup of a group. we want them to reflect the fact that G is a group. H itself forms a group. whether e is in A. It might be desirable to focus our attention on appropriate pieces of G. Once this is granted. which are smaller. over which we have some control. In trying to verify whether or not a given subset of a group is a subgroup. for instance. It is thus natural to demand that such pieces above behave reasonably with respect to the operation of G. So we must check. But A is not a subgroup of 7L viewed as a group with respect to +. we maintain the numbering given there for them. These two subgroups we call trivial subgroups. alone. whether aI is also in A. Our interest will be in the remaining ones. the set of integers. not merely any old set.Sec. namely G itself and the subgroup consisting of the identity element. we are spared checking one of the axioms defining a group. Under the multiplication of integers." Take. explicit ~roups. The question then becomes: What should serve as suitable pieces for this kind of dissection of G? Clearly. relative to the product in G. it may be too much of a task to tackle all of G headon. H. given any subset A of G and any three elements of A. namely the associative law. the subset A = {I. e. I} in 7L. Definition. Before proceeding to a closer look at the general character of subgroups. Every group G automatically has two obvious subgroups. A nonempty subset.
if a. so H is certainly closed under +. H as in Example 7. Then K C He G and K is a subgroup of both H and of G. But fl(f(a» = fl(a) = i(a) = a.l E H(a). In Example 1. that is. Suppose that A eGis nonempty and that given a. 3. that is. Let S be any nonempty set and let G = A(S). A nonempty subset A eGis a subgroup of G if and only if A is closed with respect to the operation of G and. iffE H(a). Why? Is H closed. Let G once again be the group 7L of integers under +.l E A. We claim that H(a) is a subgroup of G. Moreover. H(a) is a subgroup of G. we have that f. Lemma 2. We leave it to the reader to verify that H m is a subgroup of 7L under +. We claim that H is a subgroup of 7L.l = e. then aI EA. then labl = lallbl is rational. Since aa. In other words. b E H. H is a subgroup of 7L under +. so thatfl(f(a» = fl(a).3. since a = fl(f(a» = fl(a). Let G be as in Example 6. since f(a) = g(a) = a. Let m > 1 be any integer and let H m consist of all multiples of m in 7L. then ab E A. Then H is a subgroup of G (see Problem 3 in Section 1). then a is also even. again by supposition. Now to the inverse. g E H(a). if a is even. b are even integers. If a E S. Then V is a subgroup of C'. the inverse of a E 7L relative to this operation is a. Thus a is a subgroup of G. Thus. In short. Thus fg E H(a). we have that e E A. For pick a E A. For if lal and Ibl are rational. b E A. Suppose further that given a E A. Let V = {a E C' Ila I is rational}. 5. hence a E H. H. thenf(a) = a. Then we assert that e E A. We now consider some examples. 6. is a + b an even integer? The answer is yes. and H as in Example 7.1. so .52 Groups Ch.2 Note that we can save one more calculation. Since the operation in 7L is +. given a E A. is a + b E H? In other words. For if f. His nonempty. can be described in another way: namely H consists of all multiples of 2. then aI E A. the set of even integers. 2. Let G be the group 7L of integers under + and let H be the set of even integers. If a E H. 4. There is nothing particular in Example 1 that makes use of 2 itself. let H(a) = {f E A(S) I f(a) = a}. hence aa. Also. Let C' be the nonzero complex numbers as a group under the multiplication of complex numbers. then (fg)(a) = f(g(a» = f(a) = a. Examples 1. (Why?) Consequently. Let G be as in Example 6 of Section 1. then aI E A by supposition. given a. and K as in Example 8 in Section 1.
So. It is denoted (a). so is in A. If in a group ab = ba. V. For. so (ai)I E A. Definition. and let B = {I. Therefore. and is of order n. since lal = Vcos 2 () + sin 2 () = 1. Also. also. then (e) = tel. whence gl E C(a). Also. . V is a sub 7. Let G be any group and let a E G. IaI group of C'. (J~l}. (ai)I = a.i sin () = cos(() + i sin(() E U.Sec. hence aI E V. V be as above. the group B is the cyclic group (() n) of C generated by () n. h E C(a). We call C(a) the centralizer of a in G. Let G be any group. If a = cos () + i sin () and b = cos l/J + i sin l/J. we saw in Chapter 1 that ab = cos(() + l/J) + i sin(() + l/J). then aia j = a i+ j . A is the cyclic subgroup of G generated by a in the following sense. We leave it to the reader to verify that Z(G) is a subgroup of G. These particular subgroups C(a) will come up later for us and they are given a special name. This makes A into a subgroup of G. The cyclic subgroup of G generated by a is a set {a i I i any integer}. thus (gh)a = g (ha) = g (ah) = (ga)h = (ag)h = a(gh)~by the repeated use of the associative law). lal = 1. Note that if e is the identity element of G. ••• . U. we say that a and b commute. 3 Subgroups 53 ab E V. Let C'. and C'. 10. ()n. then ga = ag and ha = ah. U eve C' and U is a subgroup both of V and of C'. I = lila I is rational. Since (J~ = 1 (as we saw by De Moivre's Theorem). Let C' and V be as above and let U = {a E C' I a = cos () + isin (). which simplifies to ag. it is easily checked that B is a subgroup of U. and let n > 1 be an integer. 9. If g. the closure of C(a). ()anyreal}. First. In Example 8.i. then from ga = ag we have gI(ga)gl = gI(ag)gI. (J~. by the rules of exponents. C(a) is thereby a subgroup of G. Therefore. It is called the center of G. The set A = {a i I i any integer} is a subgroup of G. so that ab E U. 8. 11. Let ()n = cos(27Tln) + i sin(27T/n). Thus C (a) is the set of all elements in G that commute with a. Let G be any group and let Z(G) = {z E G I zx = xz for all x E G}. if g E tea).I = gIa. and that aI = cos () . hence gh E C(a). Also. if a i E A and a j E A. for a E G let C(a) = {g E G I ga = agIo We claim that C(a) is a subgroup of G.
1(h 1h 2 )a E a1Ha. B are subgroups of G.1 E H. 4.) . If G is a finite group and H a nonempty subset of G closed under multiplication.D An immediate.I 2:: 0. where n = IHI. This proves the lemma.1 = a j . corollary to Lemma 2. consider the elements a. whence a. This is the case in which H is finite. But then.1 = a j .i E H.i = e. let a1Ha = {a1ha I h E H}.1 = e and we are done. For a E G. j .1 we must show that a E H implies a. Therefore. Here we have written down n + 1 elements. then H is a subgroup of G. If A. then j ~ n + 1.2 is the Corollary. (See Example 11. by the cancellation property in groups. then. 3. In an important special case we can make a considerable saving in checking whether a given subset H is a subgroup of G.i. put another way.2. What is the cyclic subgroup of 7L generated by 1 under +? Let S3 be the symmetric group of degree 3. a j .i 2:: 1. PROBLEMS Easier Problems 1. Let G be any group and H a subgroup of G. 2.1 points out for us what we need in order that a given subset of a group be a subgroup. If x = a1h1a and h y = a. Therefore.l (h 1h 2 )a (associative law). How can this be? Only if some two of the elements listed are equal. a j . XI = (a1ha)1 = a1h1a E a1Ha. Since j . Lemma 2. If a. However. only if a i = a j for some 1 ~ i < a = e. Thus a1Ha is closed. ••• . the center of G.1h 1a)(a.54 Groups Ch. so we go on to other things.3. as is easily verified. which says that xy E a1Ha. An even dozen seems to be about the right number of examples. hence e E H. the order of H.3. then xy = (a. and H has only n distinct elements. a1Ha is a subgroup of G.1 E H and aa j . a. and since H is a subgroup of G. Also. We assert that a1Ha is a subgroup of G. Then H is a subgroup of G. Suppose then that a =1= e . Find all the subgroups of S3' Verify that Z(G).1 2 a where hI. if x = a1ha E a1Ha.3.i.2 12. so a j .3.i.i = e. a 2 . all of them in H since H is closed. but nevertheless important. Proof. show that A n B is a subgroup of G. h 1 h 2 E H.i . h 2 E H. By Lemma 2. Lemma 2. is a subgroup of G. Suppose that G is a group and H a nonempty finite subset of G closed under the product in G.I E H. a n + 1.1h 2 a) = a.
8. If G is cyclic. Show that a E Z(G) if and only if C(a) = G. where p is a prime number.1ax) = x. B of G such that AB is not a subgroup of G. show that every subgroup of G is cyclic. If G is an abelian group and n > 1 an integer. show that AB is a subgroup of G. prove that H is a subgroup of G. prove that C(x. b E H.] 18. B are subgroups of an abelian group G. let AB = tab Ia E A. let An = {an I a E G}. 21. (This sharpens the result of Problem 14. If G is an abelian group and if H = {a E G I a 2 = e}. * 22. If S is a nonempty set and xeS. In other words. Prove that An is a subgroup of G. [See Examples 10 and 12 for the definitions of C (b) and of x 1 C (a )x. b E B}. that is. given a. 14. 13. *11.1C(a)x. If A. MiddleLevel Problems *16. of orders m a~d n. then ab 1 E H. x E G. Prove that a cyclic group is abelian. If G is a group and a. If A. *12. 3 Subgroups 55 5. 6. If G has no proper subgroups. prove that H is a subgroup of G. x = a j for some j. If G has no proper subgroups. prove that G is cyclic. . show that T(X) = {fE A(S) I f(X) c X} is a subgroup of A (S) if X is finite. Give an example of a group G and two subgroups A. Give an example of a nonabelian group for which the H in Problem 8 is not a subgroup. 20. B are subgroups of G such that b. G is cyclic if G = (a) for some a E G. In S 3.Sec. Prove that AB is a subgroup of G.) 17. find C(a) for each a E S 3. If G is a group and H a nonempty subset of G such that. 15. If A and B are finite subgroups. We say that a group G is cyclic if there exists an a E G such that every x EGis a power of a. If G is an abelian group and H = {a E G I an(a) = e for some n(a) > 1 depending on a}. prove that AB is a subgroup of order mn if m and n are relatively prime. 9. 7. If C(a) is the centralizer of a in G (Example 10). show that H is a subgroup of G. 19.1Ab C A for all b E B. respectively. prove that G is cyclic of order p. 10. prove that Z (G) naEGC(a). of the abelian group G. in which case we say that a is a generator for G.
N be subgroups of G such that XI Mx C M and XI Nx C N for all x E G.) 4. Harder Problems 25. Similarly.2 23. this theorem is like the ABC's for finite groups and has interesting implications in number theory. given a. so also does the concept of "relation. T(X) be as in Problem 18 (but X no longer finite). If in Problem 26 H is a finite subgroup of G. a < b is a relation on S. Although its proof is relatively easy. H a subgroup of G. If H is a subgroup of G.1mn. *29. X.1Ny = N for every y E G. To smooth the argument of this theoremwhich is due to Lagrangeand for use many times later. we make a short detour into the realm of set theory. . If M is a subgroup of G such that x. bEG.1n. as is a ~ b.56 Groups Ch. Prove that MN is a subgroup of G and that x. *27. prove that actually XI Mx = M.1Mx = M and x." A relation is a statement aRb about the elements a. (Hint: Consider the element m.1Mx C M for all x E G. prove that Ha and Hb have the same number of elements. b E S. N are such that x.1Nx = N for all x E G. let N = n xEG x. Let Hx = {hx I h E H}. Let S. Let G be a group. n E N. Give an example of a set S and an infinite subset X such that T(X) is not a subgroup of A(S). 30. then Ha = Hb or Ha n Hb = 0. and if M n N = (e). Show that. What is the order of AB in Problem 22 if m and n are not relatively prime? 24. What is this number? 28. LAGRANGE'S THEOREM We are about to derive the first real grouptheoretic result of importance. Prove that N is a subgroup of G such that y.1 (MN)x C MN for all x E G. If M. As a matter of fact. *26. a = b is a relation on S.1Hx. those of you who solved Problems 26 and 27 of Section 3 have all the necessary ingredients to effect a proof of the result. The theorem simply states that in a finite group the order of a subgroup divides the order of the group. prove that mn = nm for any m E M. Just as the concept of "function" runs throughout most phases of mathematics. If S is the set of integers. Let M.
b) + (b .1 E H. the defining rules of an equivalence relation hold for this '"''.is an equivalence relation on G. But (ab.b). which is read "a congruent to b mod n.1. (b) a '"'' b implies that b '"'' a (symmetry). c E S.c). So a '"'' b if they are equal as far as the attribute of price is concerned. We verify that this is an equivalence relation.b) implies that n I (b . (c) a '"'' b. We generalize Example 2. Also. for all a. Note that if G = 7L. if a '"'' band b '"'' c.a). Examples 1. if a '"'' band b '"'' c. Note that in measuring this "generalized equality" on S we ignore all properties of the elements of S other than their price.1 )1 = (b. we write this as a == b mod n [or. Let S be the integers and n > 1 a fixed integer. for n > 1 a fixed integer. This relation on the integers is of great importance in number theory and is called congruence modulo n. hence n I ((a . This vague remark may become clearer after we see some examples. sometimes. Let G be a group and H a subgroup of G. equality. so ba. (ab1)1 E H. when a '"'' b.c). Therefore. define a '"'' b if ab. we have a '"'' a. it satisfies: (a) a '"'' a (reflexivity). b '"'' c implies that a '"'' c (transitivity). A relation '"'' on a set S is called an equivalence relation if. for a. b E S if n I (a . then n I (a . This tells us that a '"'' b implies that b '"'' a. so the general notion of equivalence relation is a generalization of that of equality. 3. and H is the subgroup consisting of all multiples of n. Since e E Hand e = aa. Finally. that is. For a. then ab 1 E Hand bc. But (ab. In a sense. Of course. if ab. b E S. b.1 E H.1) = ac.b) and n I (b . as a == ben)]. 2. Because n I (a . hence b '"'' a.c». a '"'' c. Finally. bEG. We define a '"'' b for a.1 . this is a special case of a much wider phenomenon in groups. whence ac.Sec. Let S be all the items for sale in a grocery store. n I (a .1 E H. Clearly. thus . the group of integers under +. we have that a '"'' b implies that b '"'' a. if the price of a equals that of b. we have that a '"'' a.1 )l a 1 = bat. As we shall see. We have shown the transitivity of '"''.a.1 E H and therefore a '"'' c.1 E H. then since H is a subgroup of G. is an equivalence relation." We'll be running into it very often from now on. Since n I 0 and 0 = a . then ab. =. 4 Lagrange's Theorem 57 Definition.1)(bc. we declare a '"'' b. an equivalence relation measures equality with regard to some attribute.1 E H .
There will be no lack of examples in the problems at the end of this section.b if ab." Note that if G is abelian. for some h E H.l ). Let G be any group. The important influence that an equivalence relation has on a set is to break it up and partition it into nice disjoint pieces. too. then ab. Examples 3 and 4.b. the center of G.b.b if b = xlax for some x E G. In Example 4. First. if a . If . Therefore. hence a = (Xl)lb(x. Thus a . In Example 3.b if there exists an x E G such that b = xlax. y E G. Thus c = yl(xlax)y = (xy)la(xy).b we say that "a and b are conjugate in G. It is this equivalence relation that we shall use in proving Lagrange's theorem. then a . then b = xlax.a}. Finally. if a = kb where k E H. We could go on and on to give numerous interesting examples of equivalence relations. a . Second. The set Hb is called a right coset of H in G. then [a]. If G is abelian. so a .58 Groups Ch. 4.l = h. but this would sidetrack us from our main goal in this section.a.b implies that a = hb. if ab.2 translates into a == ben). c = ylby for some x. Thus [a] = {xlax I x E G}. that is.is an equivalence relation on S. Let us see what the class of a is in the two examples.c. Right cosets. This relation.l E H. just given. So congruence mod n is a very special case of the equivalence we have defined in Example 3. we defined a . since b = eb and e E H (also because b E [b] = Hb). if a E Z(G). and so a . then cl(a) consists merely of a. the class of a.c. We go on with our discussion and make the Definition. We claim that this defines an equivalence relation on G.l = k E H. then b = xlax. is defined by [a] = {b E SIb . so that b . a . In fact. then cl(a) consists of a alone. . The notion of conjugacy and its properties will crop up again often. We have established that this defines an equivalence relation on G. For a.b if and only if a E Hb = {hb I h E H}. We shall examine the class of an element a in Example 2 later in this chapter. On the other hand. We ran into such in Problem 26 of Section 3. [b] = Hb. b . Note that b E Hb. plays an important role in group theory and is given the special name conjugacy.a for a = elae. especially in Section 11.l = (kb)b. if a . We shall denote [a] in this case as cl(a) and call it the conjugacy class of a in G. bEG we declare that a . play important roles in what follows.b if and only if a = b. and left handed counterparts of them called left cosets. When a .
where this union runs over one element from each class. having been born and brought up in Turin. then [a] n [b] = 0. J. we have that IG I k IH I.. So Hand Ha i have the same number. the number of right cosets of H in G. a ~ c by symmetry of '"'"'. we have a ~ b. whence [a] = [b].2 (Lagrange's Theorem). if [a] n [b] =1= 0. then the order of H divides the order of G. We show that if [a] =1= [b]. We claim that this map is 11. '"'"' partitions S into equivalence classes. He spent most of his life. D Although Lagrange sounds like a French name.then by cancellation in G we would have h = h'. Thus [a] C [b]. Suppose. Thus a E [b]. thus the map is 11. and where [a] =1= [b] implies that [a] n [b] = 0. Let k be the number of distinct classeseall them Ha 1. By definition of class. for if hai = h'ai. then. We assert that any Hai has IHI = order of H number of elements. If '"'"' is an equivalence relation on S. Theorem 2. . in France. then [a] = [b]. let c E [a] n [b].1. . 4 Lagrange's Theorem 59 Theorem 2. L. U Hak and the Hai are disjoint and each Hai has IH I elements. .4.. namely IG I/IHI. or. however. Twargument is obviously symmetric in a and b.Sec. where we established that the relation a '"'"' b if ab 1 E H is an equivalence relation and that [a] = Ha = {halh E H}. Lagrange was a great mathematician who made fundamental contributions to all the areas of mathematics of his day. if x E [a]. If G is finite. hence x E [b]. Therefore. The proof of the second assertion is also quite easy. a ~ b gives us that x '"'"' b. then S = Ural. Map H ~ Ha i by sending h ~ ha i . IHI. . what is equivalent to this. that [a] n [b] =1= 0. and our assertion above is proved. If G is a finite group and H is a subgroup of G. . Proof Let us look back at Example 3. and so. Lagrange (17361813) was actually Italian.4. Proof Since a E [a]. It is definitely onto by the very definition of Hai.4. Thus IH I divides IG I and Lagrange's Theorem is proved.1. U Hak and we know that Ha j n Hai = 0 if i =1= j. is called the index of H in G and is written as iG (H). of elements.Hak. so we have [b] C [a]. G = Hal U Ha2 U ... Since G = Hal U . D We now can prove a famous result of Lagrange. we have UaES[a] = S. C '"'"' a since c E [a] and c ~ b since c E [b]. That is. By Theorem 2. since a '"'"' c and c '"'"' b. then x '"'"' a. The theorem is now completely proved.
This says that any x EGis of the form x = a i • Hence. we shall obtain some classical numbertheoretic results due to Fermat and Euler. Theorem 2. If G is finite and a E G. p a prime. depending on a. IG I Thus We have proved the Theorem 2. then the order of a. by Theorem 2. Suppose that a E G has order m. then an = e for all When we apply this last result to certain special groups arising in number theory. Consider the set A = {e. ••• . Since IA IIIG/. Thus IAI = m = o(a). we claim that A is a subgroup of G (since am = e) and that the m elements listed in A are distinct.5.4. where we defined a == b mod n (a congruent to b mod n) if n I (a . So if H =1= (e). is the least positive integer m such that am = e. If G is finite. aE G.2 that e for some n(a) 2:: 1. then H = G.4. We call it the congruence class of a. consists of all a + nk. then 0 (a) I IG I.4. So this subgroup is all of G.3. a m. If a E G. we have Theorem 2. o(a). If a E G. A group G of prime order is cyclic. Let 7L be the integers and let n > 1 be a fixed integer.2 Recall that a group G is said to be cyclic if there is an element a E G such that every element in G is a power of a. we have. D IHI divides IG I = p. raj.3.4. . a.1 ~. so IHI an(a) = If G is finite and a E G. where k runs through all the integers. where G is finite. we saw earlier in the proof of Lemma 2. written 0 (a). The class of a. We leave the verification of these claims to the reader. then the powers of a form a subgroup (a) of G different from (e). a 2. G is cyclic by the definition of cyclic group. If G is a finite group of order n.b). k . Proof If H is a subgroup of G then. a =1= e.60 Groups Ch. We go back to Example 2 of equivalence relations. = 1 or p. by invoking Lagrange's Theorem. We make the Definition.4.4.
Sec.b'). The element [0] acts as the identity element and [a] acts as . How to define [a] + [b]? What is more natural than to define [a] + [b] = [a + b]. thus [b] = [r]. but a certain piece of it will become a group. and the unit element under multiplication is [1]. It is a cyclic group of order n generated by [1]. .4. [0] cannot have a multiplicative inverse. under· En will not form a group. [3] =1= [0]. [n . [1] = [4] = [. Under + En will form an abelian group.. So we now have a welldefined addition in En. Since [0][ a] = [0] for all a. [1].2] = . Therefore. We leave it to the reader to verify that they are distinct.. For instance. for then we will have [a] + [b] = [a + b] = [a' + b'] = [a'] + [b']. But there is a fly in the ointment.[a]. We shall introduce two operations. What we must show is that if [a] = [a'] and [b] = [b ']. b = qn + r.(a' + b'». 4 Lagrange's Theorem 61 o~ By Euclid's Algorithm. the inverse of [a].. Also from [b] = [b']. if n = 9. a + b == (a' + b') mod n. Okay.. We summarize this all as Theorem 2. we turn to the introduction of a multiplication.. Suppose that [a] = [a'].b'» = «a + b) . why not try the nonzero elements [a] =1= [0] as a candidate for a group under this product? Here again it is no go if n is not a prime. En forms a cyclic group under the addition [a] + [b] = Having disposed of the addition in En.1] give us all the congruence classes. and [3][6] = [18] = [0]. so the nonzero elements do not. [a + b].. Is this operation + in En welldefined? What does that mean? We can represent [a] by many a'sfor instance. given any integer b. hence n I «a . give us a group. yet we are using a particular a to define the addition. Under this multiplicationwe leave the fact that it is welldefined to the readerEn does not form a group. Let En = {[OJ. and so [a + b] = [a' + b']. So the n classes [0].a'). .I]}. n I (b . for instance. We leave it to the reader to check out that En is a group under +. then [a + b] = [a' + b']. then n I (a . what is more natural than defining [a] · [b] = [ab]? So. yet [2][3] = [6] = [0]. [n . then [2] =1= [0]. if n = 6.6. . if n = 3. [2][7] = [14] = [5]. in general. [1]. . where r < n.a') + (b . Again. + and· in En. So we ask: Can we find an appropriate piece of En that will form a . .
[2]. n) = 1. Us is a group of order 4 in which a 2 = e for every a E Us. following from the associativity of the integers under multiplication. [8]. This says that n I a(b . U g = {[I]. also [2]6 = [2][2]5 = [2][5] = [10] = [1]. [5]2 = [25] = [1]. namely [1].4. 5. So [a][b] = [ab] yields that if [a]. 7 are less than 8 and positive and relatively prime to 8.62 Groups Ch. By Problem 2 of Section 2. n) = 1. [2]5 = [32] = [5].6 we have Theorem 2. n) = 1 for [a] = [b]. n) = I}. then (ab.ac] = [0]. and so [ab .c) = ab . .c). [8]}.2. What other elements in U g generate U g? In parallel to Theorem 2. [4].14. Note that [2]1 = [2]. We try another one. n) = 1 and (b. By Theorem 1. cp (15). we have the cancellation property in Un. [14]}. [13]. n) = 1.5 one must have that n I (b . 2. [7]. I Un I = number of integers 1 ~ m < n such that (m. n) = 1 if and only if (b. Multiplication is commutative in Un. we have cp(p) = p . [2]2 = [4]. Un is a group. [5]. In fact.13. This number comes up often and we give it a name. [7]. for n > 1.5. Definition. Note that [3][5] = [15] = [7]. and so [b] = [c]. and so on.11.ac.5. Us = {[I]. What is the order of Un? By the definition of Un. m < n such = the number of positive integers Thus IUnl = cp(n).2 group under multiplication? Yes! Let Un = {[a] E 7L n I (a. 3.8. [5]. Un forms an abelian group. under the product [a][ b] = [ab].4. [7]}. [3]. n) = 1. noting that (a. The reader should verify that a 4 = e = [1] for every a E U 15 • 3. The Euler cpfunction. Note that [11][13] = [143] = [8]. The numbers 1 ~ m < 15 relatively prime to 15 are 1. [b] E Un. We see that cp (8) = 4 for only 1. [2]. If n = p. [2]3 = [8]. Note that if [a][b] = [a][c] where [a] E Un' then we have [ab] = [ac].4. so cp(15) = 8. [4]. but a is relatively prime to n. [11].7. V 15 = {[I]. of order cp (n).1. Associativity is easily checked. where cp (n) is the Euler cpfunction. cp (n). if (a. then [ab] E Un and Un is closed. a prime. [2 4 ] = [16] = [7]. In other words.5. By the Corollary to Theorem 1. So the powers of [2] give us every element in U g • Thus U g is a cyclic group of order 6.7. [2]4 = [1]. The identity element is easy to find. Let us look at some examples of Un . cp (n) that (m. is defined by cp (1) = 1 and. m with 1 ::::. 1.
c. acp(n) == 1 mod n. . acp(n) = e for all a E Un' This translates into [aCP(n)] = [a]cp(n) = [1]. a'"'' b if la I = Ib \. a '"'' b if a. then b == O(p) and b P == O(p).Sec. so that b P == b(p). D A special case.4. acp(n) If a is an integer relatively prime to n. then aP 1 == 1 mod p.4. the complex numbers. PROBLEMS Easier Problems 1. (a) S = IR reals. In other words. One final cautionary word about Lagrange's Theorem. we have. 4 Lagrange's Theorem 63 An immediate consequence of Theorems 2. then == 1 mod n. Theorem 2. a '"'' b if they have the same color eyes. b are parallel.4.1) for every integer a relatively prime to p. is due to Fermat. so that a P == a (p).8. He was the most prolific of all mathematicians ever. by Theorem 2.1 == a (p). Proof Un forms a group of order cp(n). A group with this property is very special indeed. For any integer b. so by Theorem 2. Pierre Fermat (16011665) was a great number theorist. (c) S = straight lines in the plane.7 and 2.5 is a famous result in number theory.1. and its structure can be spelled out quite well and precisely. Fermat's Last Theoremwhich was in fact first proved in 1994 by Andrew Wilesstates that the equation an + b n = cn (a. That is. Corollary (Fermat).4. if G is a finite group of order n. then it need not be true that for every divisor m of n there is a subgroup of G of order m.4. (b) S = C. p) = 1. a '"'' b if a . b P == b mod p. if (a. If p is a prime and p ~ a. D Leonard Euler (17071785) was probably the greatest scientist that Switzerland has produced. n being integers) has only the trivial solution where a = 0 or b = 0 or C = 0 if n > 2.b is rational.5. b. (d) S = set of all people. Verify that the relation '"'' is an equivalence relation on the set S given.8 (Euler). If p I b. which in turn translates into n I (aCP(n) . Proof Since cp(p) = p . Its converse in general is not true. that a P 1 == l(p). hence a 1 • a P . where n = p is a prime.
b '".) 10. f(X2) = Xl. If aH and bH are distinct left cosets of H in G. If G is S3 and H = {i. {Sa} nonempty subsets such that S = UaS a and Sa n Sf3 = o if a =1= {3. write down all the cosets of the subgroup H = {[OJ. what can you say about the x in Problem 16? . where f: S ~ S is defined by f(XI) = X2. 12. These seem to imply that a '". * 16. show that there are as many distinct left cosets of H in G as there are right cosets of H in G. so by (2). In E16 . [4].64 Groups Ch. Let '". We don't need to distinguish between right cosets and left cosets..a. .a and (2) a ~ band b . since En is abelian under +.b if aIb E H.1 = H for all a E G. for a. so a . an must satisfy x 2 = e. write your coset as [a] + H. Define. 9. . If every right coset of H in G is a left coset of H in G. Problem 2 shows that this is not so.c implies that a ~ c. If G is a finite abelian group and a I.a. The relation '". are Ha and Hb distinct right cosets of H in G? Prove that this is true or give a counterexample. * 5.a. then by (1). In Problem 9. show that X = ala2 .2 2. a . If p is a prime. is every right coset of H in G also a left coset of H in G? 8. prove that aHa.must be an equivalence relation. If this argument is correct. Prove that this defines an equivalence relation on G. For if a '". what is the index of H in E 16 ? (Recall that we defined the index iG(H) as the number of right cosets in G. Why not? What properties of an equivalence relation does it satisfy? 3.b. Let S be a set. Find the orders of all the elements of U l8 • Is U l8 cyclic? 14. 6. bEG. and show that [a] = aH = {ah I h E H}. [8]. . In Problem 6.be a relation on a set S that satisfies (1) a '".b. f}. 17.b implies that b '". f(X3) = X3' list all the right cosets of H in G and list all the left cosets of H in G.b if both a > band b > a is not an equivalence relation. What is wrong with the argument we have given? 4. b '".. Let G be a group and H a subgroup of G. . then the relation '". If G is of odd order. For any finite group G.on the real numbers IR defined by a '".) 11. [12]}. Define an equivalence relation on S in such a way that the Sa are precisely all the equivalence classes.a. (Since the operation in En is +. 13. a '". The sets aH are called left cosets of H in G. a n are all its elements. Find the orders of all the elements of U20 • Is U20 cyclic? *15. show that the only solutions of x 2 == 1 mod p are X == 1 modp or x == 1 modp. 7.
19. Let G be a finite group. show that there is an element a such that [a]2 = [1].] 25. 30. and let S be the set of all distinct right cosets of H in G. an integer a such that a 2 == 1 mod 41. 24. In the group G of Example 6 of Section 1.1 32. Let G be the dihedral group of order 8 (see Example 9.1 == 1 modp ifp {a. If p is a prime number of the form 4n + 3. Give their form explicitly. If i =1= f E A (S) is such that f P = i. Prove that there is a 11 mapping of S onto T. Find the conjugacy classes in G. fj (s) = s for some 1 ~ j < p. 4 Lagrange's Theorem 65 18. that is. Find all the distinct conjugacy classes of S 3 • 20. H a subgroup of G. In U41 . show that f(s) = s. 21. [Recall: The orbit of sunder f is {fj(s) I j any integer}.r = b ir for all positive integers r. 28. b • Describe it in terms of a and b. show that aHa. find the conjugacy class of the element Ta .Sec.) 27. 34. Verify Euler's Theorem for n x2 == 1 modp [Hint: Use F'ermat's Theorem that a P . H a subgroup of G. Let G be a group. and if for some s E S. prove that m Is. If G is a cyclic group of order n.) It is.] . 23. See Problems 5 and 12.1 = H for every a E G. T the set of all left cosets of H in G. is not the right one. 33. If o(a) = m and as = e. and for n = 14 and a = 5. also true if p = 2. show that arba. (Note: The obvious map that comes to mind. p a prime. (This is known as Wilson's Theorem. then (p . show that we cannot solve 22. If in a group G.1 = b2 . Using the results of Problems 15 and 16. Section 1). of course. If aH = bH forces Ha = Hb in G. If f E A (S) has order p. which sends Ha into aH. prove that if p is an odd prime number. find o(b) if b =1= e. Show that the nonzero elements in 7L n form a group under the product [a] [b] = [ab] if and only if n is a prime. = 14 and a = 3. p a prime. show that for every s E S the orbit of sunder f has one or p elements. MiddleLevel Problems 26. Let f(a) be the least positive m such that am E H.I)! == 1 mod p. If in G a 5 = e and aba. aba. *31. Prove thatf(a) lo(a). 29. show that there are 'P (n) generators for G. = bi.
that of a normal subgroup.] 40.(m).1).1). p a prime.66 Groups Ch. Show that: (a) If G has exactly one element b =1= e such that b2 == e.. 39. is the Euler q. and that of the factor or quotient group of a group by a normal subgroup. and al.!. then we can solve x 2 == 1 mod p (with x an integer). 42.(a n .1)  then y2 41. show that if p is a prime of the form p == 4n + 1 and if y == 1· 2·3··· .== p. an its elements. q. We discuss the first two of these in this section. .p) == 1. and the third in Section 6.(m) elements of order m. Let G be an abelian group of order n.] 37. I(s) == s.1 (p 2 . . show that for each positive integer m that divides n (including m == 1 and m == n) there are q.2 . Show that l/J(m) ::::. show that Up. (b) If G has more than one element b =1= e such that b 2 == e. (c) If n is odd. .2 35. In a cyclic group of order n. If 1 E A (S) has order p. . Let x == ala2··· an. where q. Using Wilson's Theorem (see Problem 28).function. Using the result of Problem 40. show that n I q.) 41. where (n. [Hint: Consider the integers mod(a n . if p is a prime. 38. then x == e (see Problem 16). then x == b. show that for some s E S. Using the result of Problem 37. Harder Problems 36.) == 1 mod p.. show that if p is a prime of the form p == 4n + 1. Prove that G must be cyclic.(m) and use Problem 38. Without further ado we introduce the first of these. is cyclic. (This is a famous result in number theory. (This gives another proof of the result in Problem 43. then x == e. it asserts the existence of a primitive root mod p. If a > 1 is an integer. HOMOMORPHISMS AND NORMAL SUBGROUPS In a certain sense the subject of group theory is built up out of three basic concepts: that of a homomorphism. Let G be a finite abelian group of order n for which the number of solutions of x m == e is at most m for any m dividing n. 5. and S is a finite set having n elements. [Hint: Let l/J(m) be the number of elements in G of order m. show that n == ~ml nq. Using the result of Problem 39.
(x) + q. That . then q. Define q. clearly i(xy) == xy == i(x)i(y). even + odd == odd. (a) q. Since i(x) == x for all x E G. define q. if it is. (a) == a2. Let G be any group and G' any other. of any group G into itself. (m) == 1 if m is even. Before working out some facts about homomorphisms. so q. 3. q. i. Trivially. q. 5 Homomorphisms and Normal Subgroups ~ 67 Definition.(G) == (e). It certainly is not a very interesting one. (a) == Ia I. the subgroup of the reals under multiplication. (ab) == Iab I == Ia I Ib I == q. (b). and G' the group of all reals under addition. The example of Us above suggests the socalled trivial homomorphism. (a) q. and G' the group of nonzero reals under multiplication.(ab) == q. (b).(x) == IOgloX for x E G. while the product q. Let q. and odd + odd == even. q. Another homomorphism always present is the identity mapping. Let G. again. (m) == 1 if m is odd. we'll say that it is. A short description of a homomorphism is that it preserves the operation of G.(xy) == q. We do not insist that q. (ab) == (ab)2 == a 2b2 == q.b) == a. is not too interesting as a homomorphism.function. Define q. The statement that q.) G' is In this definition the product on the left sidein q. so q. the unit element of G'.(a)q. It need not be onto. In fact. 6. (x) == e'. I}. so q.: G a homomorphism if q. Let G be the group in Example 6 of Section 1. The map i is 11 and onto. Let q. is onto. for all x E G. bEG. 2. then the mapping q. (a) q.(Ta. (ab )is that of G.: G ~ G' be defined by q.: G ~ G be defined by q. G' be two groups.: G ~ G' by q. is a homomorphism of G into G'. is a homomorphism is merely a restatement of: even + even == even. the reader should check that in Us (see Section 4) a 2 == e for all a E Us. but. we present some examples.(b) for all a. Let G be an abelian group and let q. Examples 1. Let G be the group of integers under + and G' == {I. be onto. 5.(y). (b) is that of G'. 4. Since IOglO(XY) == IOgloX + IOgloY. It also happens to be onto and 11. Let G be the group of all positive reals under the multiplication of reals. q. is a homomorphism of G into G'. is a homomorphism. is a homomorphism of G into itself. Let G be the group of all nonzero complex numbers under multiplication and let G' be the group of positive reals under multiplication. has nothing to do with the Euler q. Since q. (Note: This q.Sec. we have q. : G ~ G' be defined by q.
2 q.(m) = [m]. The fact that q. q. forgetting about its multiplication.b Tc. It is easy to verify that the image of 'fJ. we see that q. so q.(m) + q. The homomorphism q. Ta. there is one of G' onto G (see Problem 2). Let G be any group. so q. so q. a = Ta(e) = Tb(e) = b. is 11. Since the addition in 7L n is defined by [m] + [r] = [m + r]. Let G = 7L be the group of integers under + and let G' = 7L n • Define q. (ab) = Tab = TaTb = q. in point of fact. doesn't have a ghost of a chance of being onto. We shall denote that G and G' are isomorphic by writing G === G'. for a E G. F or if there is an isomorphism of G onto G'. then A( G) has order n!. is indeed 11.: G ~ A(G) by q. Two groups G and G' are said to be isomorphic if there is an isomorphism of G onto G'. (we used the associative law).: G ~ 7L n by q.ad+b' 7. but. and let A(G) be the set of all 11 mappings of G onto itselfhere we are viewing G merely as a set. Therefore. Define Ta : G ~ G by Ta(x) = ax for every x E G. if G has order n > 2.(b). Definition. An isomorphism from G to G itself is called an automorphism. . Definition. What is the product. We single them out in the following definition. The following general construction gives rise to a wellknown theorem. TaTb.d = Tac.: G ~ G' is called a monomorphism if q. The product rule for the T's translates into q. Ta = Tb. We shall discuss more thoroughly later what it means for two groups to be isomorphic.(a) = q. But now we summarize what we did in Example 8. is 11 suggests that perhaps 11 homomorphisms should playa special role. is a homomorphism follows from the product rule in G. Suppose that q. namely. and since n! > n. . that is.( G) = {Ta Ia E G}.(r). (b). (a) q. is a homomorphism of G into A(G). it is not. q. So we see that TaTb = Tab' Define the mapping q.(m + r) = q. is a subgroup of A(G). We claim that q. It is not onto in generalfor instance. is indeed a homomorphism of 7L onto 7L n • 8. A monomorphism that is onto is called an isomorphism. This definition seems to be asymmetric. One more definition.68 Groups Ch. of Ta and Tb as mappings on G? Well.(a) = Ta. is 11.
if x is labeled q. then aI xa = a 1 ya. a E G fixed in the discussion. Define q. using the label q. so this renaming of the elements is consistent with the product in G. : G ~ G' by q. invariant theory. Often. is an isomorphism of G onto itself. We go on with more examples. (y) = q. we see that xy is labeled as q. We claim that q.Sec. (x) for the element x. We shall see some in the problems to follow.) This is a good place to discuss the importance of "isomorphism. 9. It is 11 for if q. so q. y labeled q. Let G be the group of reals under + and let G' be the group of all nonzero complex numbers under multiplication. for an appropriate S. (xy). When G is finite. be an isomorphism of G onto G'. are equal.5." Let q. is onto. we can take the set S in Theorem 2. is called the inner automorphism of G induced by a. (x) q. what is xy labeled as? Since q.1 (Cayley's Theorem). in which case A (S) is Sn and its elements are permutations. is at least a homomorphism of G into itself. (x) q.(axa. q. Cayley's Theorem is usually stated as: A finite group can be represented as a group ofpermutations. The appropriate S we used was G itself. One final example: 10. (Arthur Cayley (18211895) was an English mathematician who worked in matrix theory. 5 Homomorphisms and Normal Subgroups 69 Theorem 2.(x). We saw that (cos x + i sin x)(cos y + i sin y) = cos(x + y) + i sin(x + y). so by cancellation in G we get x = y. Is this labeling consistent with the structure of G as a group? That is. First.(x) = cos x + i sin x.1 to be finite. (x) = q. The notion of automorphism and some of its properties will come up in the problems. .5. Finally. for x = aI(axaI)a = q. Every group G is isomorphic to some subgroup of A (S).: G ~ G by q. We can view G' as a relabeling of G. But there may be better choices. and many other parts of algebra. (y). Let G be any group. So two groups that are isomorphicalthough they need not be equalin a certain sense. Here q. In this case. it is desirable to be able to identify a given group as isomorphic to some concrete group that we know.(y).I) for any x E G.(x) = aIxa for all x E G. (y). as described above. Define q.
(b) 'P (a 1) = 'P (a) 1 for all a E G. Now that we have a few examples in hand. 2 hence 'P (x )'P(Y) = 'P (x + y) and 'P is a homomorphism of G into G'.5. is 'P(G) = {'P(a) I a E G}. {y E G I 'P(Y) = w'} If w' E G' is of the form 'P (x) = (Ker 'P)x. If 'P is a homomorphism of G into G'. Also. What is . This prompts the Definition. Thus the inverse image of any element w ' in 'P (G) E G' is the set (Ker 'P)x. We claim that this lack is rather uniform. We want t6 measure how far a given homomorphism is from being a monomorphism. 'P(aa. we start a little investigation of homomorphisms.1 E Ker 'P and so y E (Ker 'P)x. is defined by Ker 'P = {a E G I 'P(a) = e'}.5. but 'P(X)l = 'P(x.1). D Definition. whence yx. then 'P(x) = 'P(Y).5. We singled out certain homomorphisms and called "them monomorphisms. If 'P is a homomorphism of G into G'.5. 'P is not 11 because. 'P(D) = 'P(27T) = 1. if 'P(x) = 'P(Y) = w'. which proves that 'P(a. 'P(x) = 'P(xe) = 'P(x)'P(e).l ) = 'P(yx. for instance. if k E Ker'P and 'P(x) = w'. nor is 'P onto. then 'P(kx) = 'P(k)'P(x) = e''P(x) = w ' . so kx E W. then . where x is any element in G such that 'P(x) = w'. We state this as Lemma 2. by cancellation in G' we get 'P(e) = e'. We leave to the reader the proof of Lemma 2. w'.4.3. then the image of 'P is a subgroup of G'. If 'P is a homomorphism of G into G'. then: (a) 'P(e) = e'. We begin with Lemma 2.W = {x E G I 'P (x) = w'} for a given w' E G'? We show that if 'P(x) = w' for some x E G. Proof Since x = xe. Also.1). so e' = 'P(Y)jP(X)1 = 'P(Y)'P(x. 'P(G).2. the unit element of G'.70 Groups Ch.2.1) by Lemma 2.1) = 'P(a)'P(a. Clearly. then the kernel of 'P. Their property was that they were 11. Ker 'P. The image of 'P.1) = 'P(e) = e'. hence 'P(Y)'P(X)I = e'. then W = {kx IkE Ker 'P} = (Ker 'P)x. hence e' = 'P(aa. Ker'P measures the lack of 111 ness at' one point e'.I) = 'P(a)I.
As we shall see in the next section. 5 Homomorphisms and Normal Subgroups 71 We now shall study some basic properties of the kernels of homomorphisms. then 'P is a monomorphism if and only if Ker 'P == (e).5. then the left coset aN. .1 C N. Consequently.I)e ''P (a) == 'P (a . Ker 'P. If 'P is a homomorphism of G into G'.INa. must be equal to the right coset containing a.1 for every a E G. then N == a (aINa)a. hence aI (Ker 'P) a E Ker 'P. Of course. D Property (b) of Ker 'P in Theorem 2. We ran into this property in the text material and problems earlier on several occasions. is a normal subgroup of G. whence ab E Ker 'P. aI (Ker 'P)a C Ker 'P. D Corollary. We use it to define the ultraimportant class of subgroups of a group. For if aINa C N for all a E G. Transposing. If k E Ker'P and a E G.5.I) == 'P(a)I == e'. So N == aNa. so Ker 'P is closed under product. Although we defined a normal subgroup via aINa C N. (b) Given a E G.1 C aNa.Sec. if every left coset of N in G is a right coset.Ia) == 'P (e) == e'. If 'P is a homomorphism of G into G'. then 'P (k) == e'. its proof is easy. we actually have aINa == N. for any homomorphism. hence 'P (ab) == 'P (a) 'P (b) == e'.1 == (a. Theorem 2.5 is an interesting and basic one for a subgroup to enjoy. Ker'P is a subgroup of G.I) 'P (k)'P (a) == 'P (a .4. Therefore. If a. every normal subgroup of G is the kernel of some appropriate homomorphism of G into an appropriate group G'. This tells us that aIka E Ker 'P. that is.5. every left coset of N in G is a right coset of Nin G. and so aI E Ker 'P. then (a) Ker 'P is a subgroup of G.5.I). then 'P (a) == 'P (b) == e'. Definition. which contains a. So in a certain sense the notions of homomorphism and normal subgroups will be shown to be equivalent. Proof Although this is so important. We leave the few details to the reader. we have Na == aN. The theorem is now completely proved. Also 'P (a) == e' implies that 'P (a . b E Ker 'P. Proof This result is really a corollary to Lemma 2. On the other hand. 'P(aIka) == 'P (a .I) 'P (a) == 'P (a. The subgroup N of G is said to be a normal subgroup of G if aINa C N for every a E G.
after the Irish mathematician W. .y E G. h n . In Example 7. 2.I } is normal in G. In Example 6. We go back to the examples of Section 1.b E G I b real}. which we introduce in Chapter 4.6. we see that Ker cp = {27Tm I m any i~ger}. In Examples 8 and 9. In Example 4. We should look at some normal subgroups. We write "N is a normal subgroup of G" by the abbreviated symbol N<JG. and Ker cp = {x IlogloX = O} = {I}. where G is abelian. the kernels consists of e alone. x E G.5. from the polar form of a complex number. Ker cp is the set of all even integers.2 namely Na. Nomerely that the set of all aIna is the same as the set of all n. g3} <J G. Ker cp = {x E G I x 2 = e} The kernel of the trivial homomorphism of Example 3 is all of G. for aIxa = x for every a. then every subgroup of G is normal. we leave it to the reader to check that T. Ker cp = {a E C' Iia I = I}. In Example 9 the subgroup {i. Note that aINa = N does not mean that aIna = n for every n E N. The desired group of order 8 can be found in the quaternions of Hamilton. Nonabelian groups exist in which every subgroup is normal. H = {Ta. all the kernels above are normal subgroups of their respective groups. intrinsically in G itself. h. for the maps are monomorphisms. In Example 10 the subgroup H = Ii.y CHand so H <J G. h 2 . 3. N <J G if and only if every left coset of N in G is a Before going any further. If Tx.'~ HTx. right coset of N in G. In Example 7. Ker cp = {Tt. If G is abelian.b E G I a rational}. In Example 1. g. we pause to look at some examples of kernels of homomorphisms and normal subgroups. This we also leave to the reader. as Ker cp = {cos x + i sin x Ix real}. which can be identified. Section 1. without recourse to the kernels of homomorphism. In Example 2. R. We have proved Theorem 2. The converse of this is not true. 1. Ker cp is the set of all multiples of n. Hamilton (18051865). ••• . and cp (x) = X 2 . See if you can find such an example of order 8.72 Groups Ch. cp(x) = 10gloX. i. Of course. Here too we leave the checking to the reader. Such nonabelian groups are called Hamiltonian. aN = Na and N = aINa for all a E G. Thus. In Example 10. In Example 5. which is to say that N is normal in G.
but the ideas presented. (d) G group of all nonzero real numbers under multiplication. (fg) g(fg)l = fgggIfl = fgft = g2. is to take a stab at many of the almost infinite list of problems that follow. 5 Homomorphisms and Normal Subgroups 73 4. Define La: G ~ G by La(x) = xa. Avoid a mathematical stomachache later by assimilating this section well. One way of seeing how complete this digestion is. and gf. fgft = gt = f g 2f. g. G 3 : (a) G t ~ G t . Z(G). are quite subtle.t . (c) G abelian group. as a set. cp(r) = 1 if r is negative. (b) G I ~ G 2 implies that G 2 ~ G I • (c) G t ~ G 2 . onto itself. If G = S3' G has the elements i. PROBLEMS Easier Problems 1. the center of G. i} <l S3' As we saw earlier (or can compute now). G 2 ~ G 3 implies that G I ~ G 3 • 3. The material in this section has been a rather rich diet. I}. (b) G group. f(X2) = Xt. g(X3) = Xt. where f(XI) = X2. n > 1 a fixed integer. cp(r) = 1 if r is positive. The material of the next section is even a richer diet. . So N <l S3 follows.Example 11 of Section 3). G' = {I. (e) G an abelian group. and cp: G ~ G defined by cp(a) = an for a E G. We claim that the subgroup N = Ii. i. (a) G = Z under +. g. (b) LaL b = Lab' (C) The mapping t/J: G ~ A(G) defined by t/J(a) = La is a monomorphism of G into A (G). f(X3) = X3 and g(Xt) = X2' g(X2) = X3.t = g. fg. G' = Zn. Prove that: (a) La E A(G). and even harder to digest. 5. Let G be any group and A (G) the set of all 11 mappings of G. i. is a normal subgroup of G (see . If G is any group.Sec. Determine in each of the parts if the given mapping is a homomorphism. If so. 2. and so on. Recall that G ~ G' means that G is isomorphic to G'. f. although simple. We recommend that the reader digest the concepts and results thoroughly before going on. Prove that for all groups G b G 2 . cp: G ~ G defined by cp(a) = aI for a E G. identify its kernel and whether or not the mapping is 11 or onto. cp(a) = [a] for a E Z. cp: G ~ G defined by cp(a) = aI for a E G. It may not seem so.
If G is abelian and cp : G that G' is abelian. then K C N(H). (Hint: Acting on e E G. (c) If K is a subgroup of G such that H <l K. 7. N <J G. [So N(H) is the largest subgroup of G in which H is normal. show that for mE M. Verify that in Example 9 of Section 1. find the necessary and sufficient condition that cp be an isomorphism of G onto itself. 9. g3} is a normal subgroup of G.] 20. g2. show that if V E A (G) is such that Ta V = VTa for all a E G. n EN. the set H = {i. is a subgroup of G' . the group of all real numbers under +. then V = L b for some bEG. Find an isomorphism of G. In Problem 3 prove that for all a. is a monomorphism if and only if Ker cp = (e). bEG. ~ G' is a homomorphism of G onto G'. the dihedral group of order 8. If G is a finite abelian group of order nand cp: G ~ G is defined by cp (a) = a m for all a E G. 11. let N(H) be defined by the relation N(H) = {a E G I a1Ha = H}. Prove that if cp: G ~ G' is a homomorphism. ••• . If N <l G and M <l G and MN = {mn 1m E M. = {i. onto G'. TaL b = LbTa. then Z (G) <l G. the subgroup H is normal in G. 16. prove that N <l G. 17. If M <l G. g. mn = nm. 18. Show that cp : G ~ G'. N <l G. cp(N). bEG I a rational}. prove that M n N <l G. If H is any subgroup of G and N = n aEG aI Ha. Prove that if Z (G) is the center of G. where cp is a homomorphism. (b) H <l N(H). the image of G. h. and M n N = (e). then cp (G). and H = {T a . N <l G.h n  I } 12.) 6.2 4. prove that MNis a subgroup of G and that MN <l G. 19. In Problem 4. Verify that in Example 10 of Section 1. and cp: G ~ G' a homomorphism of G onto G'. If H is a subgroup of G. Verify that if G is the group in Example 6 of Section 1. prove 15. 8. h 2. find out what b should be. If M <l G. of N is a normal subgroup of G'. 14. 13. 10.74 Groups Ch. . the dihedral group of order 8. Prove that: (a) N(H) is a subgroup of G and N(H) ~ H. where Ta is defined as in Example 8. n EN}. the group of all positive real numbers under multiplication. prove that the image. If G is any group. 5. then H <l G.
Let G be a group and let W = G X G as defined in Problem 24. given by 'Pl (al) = (ab e2). If a.. we define H(s) = {fE A(S) I f(s) = s}. If G b G 2 are two groups. (b) Ker t/J = Z(G). Prove that Ot/J is an automorphism of G. We saw in Example 9 of this section that O"a is an isomorphism of G onto itself.f(X3) = X3· (a) Write down all the left cosets of H in G. If s E S. Let 0. prove that ba = ajb for some j. prove that O(N) <l G.. f}.f(X2) = H = {i. (b) The image 'P(G) in W [i. Prove that: (a) t/J is a homomorphism of G into A (G). b) where a E G l . If 0 is an automorphism of G and N <l G. bEG. b E G 2]. 27. G is the set of all ordered pairs (a. the center of G. 24. so that the set of all automorphisms of G is itself a group. the symmetric group of degree 3 and let x2. If G is a group and a E G. a) is a monomorphism of G into W. define O"a: G ~ G by O"a(g) = aga. . let G = G l X G 2 . 5 Homomorphisms and Normal Subgroups 75 21. t/J be automorphisms of G. Define a product in G by (al' b l )(a2' b 2) = (ala2' b lb 2). 28. (b) Write down all the right cosets of H in G. Let G be a group such that all subgroups of G are normal in G. 22.e.Sec. {(a. the group of all 11 mappings of G (as a set) onto itself. Let S be any set having more than two elements and A(S) the set of all 11 mappings of S onto itself. (c) Is every left coset of H a right coset of H? 23. (a) Prove that G is a group. so O"a E A (G).e. Prove that H(s) cannot be a normal subgroup ofA(S). Let G = S 3. and that Ol is an automorphism of G. MiddleLevel Problems *26. (e) Prove that G l X G 2 ~ G 2 X G l • 25. the Cartesian product of G b G 2 [i. Prove that: (a) The mapping 'P: G ~ W defined by 'P(a) = (a. (b) Show that there is a monomorphism 'Pl of G l into G such that 'Pl (G l ) <l G. where f(Xl) = xl. Define t/J: G ~ A(G) by t/J(a) = O"a for all a E G. (d) Using the mappings 'Pb 'P2 of Parts (b) and (c).l . a) I a E G}] is normal in W if and only if G is abelian. prove that 'Pl(G 1)'P2(G 2) = G and 'Pl(G l ) n 'P2(G 2) is the identity element of G. and let Ot/J be the product of 0 and t/J as mappings on G. where e2 is the identity element of G 2 • (c) Find the similar monomorphism 'P2 of G 2 into G.
Let G be a group. Prove that I( G) <l . 36. Use the result of Problem 38 to redo Problem 37. for bEG.:fI( G). prove that there is a normal subgroup N =I=. Let G be a group and H a subgroup of G. Suppose that N <l G and MeN is a characteristic subgroup of N. . If H is a subgroup of G of order pn. Prove that: (a) M characteristic in G implies that M <l G. (b) M. N characteristic in G implies that MN is characteristic in G.(e) of G contained in H. c E G [therefore the mapping l/J: G ~ A(S) defined by l/J(b) = T b is a homomorphism]. Suppose that IG I = pm. Define. H a subgroup of G such that n (iG(H)! where n = IG I. If N <l G and H is a subgroup of G. If H is a normal subgroup of order p in G. 31. . (c) Show that Ker l/J is the largest normal subgroup of G lying in H [largest in the sense that if N <l G and N C H. then the index of H in G. prove that G ~ S3 . 33. 39. prove that H is a characteristic subgroup of G. then N C Ker l/J]. Do Problem 31 even if G is not abelian if you happen to know that for some reason or other H <l G. (See Problem 28. then M must be normal in G. the kernel of l/J: G ~ A(S). is a characteristic subgroup of G. 35. 'P. Recall that if H is a subgroup of G. Suppose that G is an abelian group of order pn m where p (m is a prime. Harder Problems 37. If G is a nonabelian group of order 6. See Problem 50. (c) A normal subgroup of a group need not be characteristic.) 34.) Let I (G) = {U a I a E G}. prove that H is characteristic. Prove that M <J G. If G is a finite group. of W. you must find an example of a group G and a noncharacteristic normal subgroup. (It is not true that if M <l Nand N <l G.:fI( G) the group of all automorphisms of G. (This is quite hard. where p (m and p is a prime.2 * 29. T b : S ~ S by Tb(Ha) = Hab. Let S = {Ha I a E G} be the set of all right cosets of H in G. 40.) 30. 38. the center of G. iG(H). is the number of distinct right cosets of Hand G (if this number is finite).76 Groups Ch. 32. (b) Describe Ker l/J. Show that Z (G). show that H n N <l H. A subgroup T of a group W is called characteristic if 'P (T) C T for all automorphisms. where Ua is as defined in Problem 26.1• (a) Prove that TbTe = T be for all b.
52.2 is the identity automorphism of G. 50.Sec. show that both subgroups A fe. and INI = 3. prove that a group of order p2. b4 } are normal in G. a 2 } and B 48. for an arbitrary subgroup H. Prove that a group of order p2. show that any group of order 15 is cyclic.(a) = aI for all a E G. is normal in G. an automorphism of G such that q. FACTOR GROUPS Let G be a group and N a normal subgroup of G. a. = {e. Prove that A = (a). Using the result of Problem 44. Let G be a group of order 15. b3 .1 for all y E G. Prove that G is abelian and q. Prove that 'P(Y) = y. From the result of Problem 47. and N has a normal subgroup M (i. 43. [a] = Na. the right coset of N in G containing a. 6 Factor Groups 77 41. Let G be a finite group. Recall that in looking at 7L n we defined for it an operation + via [a] + [b] = [a + b]. 51. Prove that there is a subgroup N c H. H a subgroup of G such that iG (H) is finite. N <J G. Let's try this out when N is normal and see if we can say a little more than one could say for just any old subgroup. M <J N). must be abelian. 46. As we saw earlier. Prove that a group of order 9 must be abelian. (Hint: Use the result of Problem 40.. (x) = XI for more than threefourths of the elements of G. q. Let G be a group. Very Hard Problems 49. the subgroup generated by a. Suppose that you know that a group G of order 21 contains an element a of order 7. an automorphism of G such that q. show that there is an element a such that a 3 = e and an element b =1= e such that b 5 = e. P a prime.e. has a normal subgroup of order p. the equivalence relation a '"'' b if ab. Why . Let G be a finite group and q.) 42. b2 . and so G is abelian. In Problem 46. N <J G such that iG (N) is finite. 45.1 E H. Construct a group G such that G has a normal subgroup N. In proving Lagrange's Theorem we used. N c H. Prove that either H <J G or there exists a subgroup N <J G. Suppose that 'P(x) = x implies that x = e. yet M is not normal in G. P a prime. 44. Suppose that you know that a group G of order 36 has a subgroup H or order 9. 6. let a '"'' b if ab 1 E N and let [a] = {x E G I x '"'' a}. =1= e in G = 47. So. b.
Therefore. the exact thing we required to ensure that our product in M was welldefined. then [ab] = [a'b']. Finally. then [a][a.2 not try something similar for an arbitrary group G and a normal subgroup NofG? So let M = {[a] I a E G}.6. and better still. If N <l G. the associative law has been established for the product in M. where n E N. We shall soon show that M is a group under this product. equivalently. that this product of classes does not depend on the particular representatives we use for the classes.2. Thus M is now endowed with a welldefined product [a][b] = [able We now verify the group axioms for M. What we have shown is the very important Theorem 2. so n(ama. What about a unit element? Why not try the obvious choice. since N <l G. b' = mb.1 E N} = Na. One observation must immediately be made.1. what about inverses? Here. From the definition of our equivalence we have that a' = na. so [e] does act as the unit element for M.1] = [e]. ama. Closure we have from the very definition of this product. We define a product in M via [a] [b] = [ab]. namely [e]? We immediately see that [a][e] = rae] = [a] and [e][a] = rea] = [a].1) is also in N. If [a]. Therefore let us suppose that [a] = [a'] and [b] = [b ']. = then GIN is a group relative to the operation [a] [b] [ab]. The symbol we use for M is GIN (read "G over Nor G mod N") and GIN is called the factor group or quotient group of G by N. a symbol that indicates its dependence on G and N. .6. the kernel of f/J. then [a ]([ b][c]) = [a][bc] = [a (be)] = [(ab)c] (since the product in G is associative) = [ab][ c] = ([a][ b])[c]. and [c] are in M. so that a' b' ~ ab. So if we let nl = n(ama. [b]. Similarly. We want to give M a name.1 is in N. is N. Thus a'b' = namb = n(ama1)ab. too. namely Theorem 2. But first and foremost we must show that this product in M is welldefined. If a E G. But this tells us that a'b' E Nab. then nl E Nand a'b' = nlab. the obvious choice is the correct one. where [a] = {x E G Ixa. In other words.78 Groups Ch. hence [aI] acts as the inverse of [a] relative to the product we have defined in M. we must show that if [a] = [a'] and [b] = [b'].1). then there is a homomorphism f/J of G onto GIN such that Ker f/J. If N <l G and GIN = {[a]/aEG} = {NalaEG}. where mEN.1] = [aa. from which we have that [a' b'] = [ab]. for this would show that [a][b] = [ab] = [a'b'] = [a '][ b '].
So Ker t/J = N. As before. B are subsets of G. (Na)(Nb) consists of all elements of the form (na)(mb). so we see that Ker t/J C N. We try to introduce a product in W as we did for GIN by defining [a] [b] = [ab]. then HH CHis another way of saying that H is closed under the product ofG. Since every element X E GIN is of the form X = [b] = t/J(b) for some bEG. this implies that H = Hahat.6. The "some homomorphism" is the t/J defined above and the "some group" is GIN. whence aha. Ker t/J. for t/J(ab) = [ab] = [a][b] = t/J(a)t/J(b). so for the product to be well defined. H must be a normal subgroup of G. we define W= {[a]laEG} = {HalaEG} where the equivalence a '"'' b is defined by ab. that is. This construction of the factor group G by N is possibly the single most important construction in group theory. then [hb] = [b]. But Na = Ntells us that a = ea E Na = N. and so (na)(mb) = (nama1)(ab) = ntab. mEN. That N C Ker t/Jwhich is easywe leave to the reader. If H is a subgroup of G.Sec.1 E H. let AB = tab I a E A. We view this matter of the quotient group in a slightly different way. aha. we would need that [a][b] = [a][hb]. Our product as defined in GIN makes of t/J a homomorphism. and a E Ker t/J if and only if E = N = t/J(a) = Na. what is the kernel. b E B}.2 substantiates the remark we made in the preceding section that every normal subgroup N of G is the kernel of some homomorphism of G onto some group. Let GIN = {Na I a E G} be the set of all right cosets of the normal subgroup N in G. in other words.1 must be in H. . where E is the unit element of GIN. as we shall see later.1 E H. Using the product of subsets of G as defined above. 6 Factor Groups 79 Proof The most natural mapping from G to GIN is the one that does the trick. what is (Na)(Nb)? By definition. Is this product well defined? If h E H. That is. [ab] = [ahb]. So we see that in order for the product defined in W to be welldefined. of t/J? By definition. Finally. t/J is onto. If A. In other algebraic systems we shall have analogous constructions. and so Ha = Hah. Define t/J: G ~ GIN by t/J (a) = [a]. for all a E G and all h E H. D Theorem 2. But what is E? Nothing other than E = [e] = Ne = N. This gives us that Hab = Hahb. One might ask: Where in this whole affair did the normality of N in G enter? Why not do the same thing for any subgroup H of G? So let's try and see what happens. H must be normal in G. Ker t/J = {a E G I t/J (a) = E}. where n.
All the other group axioms for GIN. We shall see very soon how we combine induction and the structure of GIN to get information about G.6. Thus (Na)(Nb) C Nab.80 Groups Ch. In short. with a proof that will be much more beautiful than the one we are about to give for the special. aN = Na. whose most basic contributions were in complex variable theory.4 (Cauchy). However we view GINas equivalence classes or as a set of certain subsets of Gwe do get a group whose structure is intimately tied to that of G.2 where nl = nama. since N is normal. but it illustrates quite clearly a general technique. Proof Before getting involved with the proof. The theorem we are about to prove is due to the great French mathematician A. Thus IGINI = IG II INI. We proceed by induction on IGI. Another way of seeing that (Na)(Nb) = Nab is to note that by the normality of N. We shall prove it in the general case later. which is the set of all the right cosets of N in G. hence (Na)(Nb) = N(aN)b = N(Na)b = NNab = Nab. We state this more formally as Theorem 2. since NN = N (because N is a subgroup of G). IGI/INI· If G is a finite group and N <J G. are now readily verified from this product formula. is givenas the proof of Lagrange's Theorem showedby iG(n) = IGI/INI. Theorem 2. as defined here. we have shown that the productas subsets of Gof Na and Nb is given by the formula (Na)(Nb) = Nab. L. i G (N). The proof we givefor the abelian caseis not a particularly good one. then the number of right cosets of N in G. so that Nab C (Na)(Nb). then n(ab) = (na)(eb) E (Na)(Nb). abelian case.6. When G is a finite group and N <J G. we point out to the reader that the theorem is true for any finite group. On the other hand. if n E N. If G is a finite abelian group of order IG I and p is a prime that divides IG I. What does this mean precisely? We shall . we shall prove a special case of a theorem that we shall prove in its full generality later. But this is the order of GIN. via the natural homomorphism l/J of G onto GIN. then G has an element of order p.3. that of pulling back information about GIN to get information about G itself. Cauchy (17891857). then IGINI As an application of what we have been talking about here.1 is in N.
16. But if G has no nontrivial subgroups. In other words. p I m (see Problem 31 of Section 4). we can look at it as G. but [a] =1= [e]. again by induction. The formation of a new set from an old one by using as elements of this new set subsets of the old one is strange to the neophyte seeing this kind of construction for the first time. N. and two elements as equal if they are equal "up to N. if p divides IN I. there exists an element in GIN of order p. so IGIN I = IGI/INI < IGI. (See Problem 16 of Section 3. we must have IG I = p. So (am)P = e. But then any element a =1= e E G satisfies a P = e and is of order p. and of the greatest importance in the subject. Thus. What are we doing when we form GIN? Sure. The notion of a factor group is a very subtle one.Sec. a f!:. Since G is abelian. This completes the induction. then [a] m = [e]. 9. If IG I = 1. But if am = e. by our induction hypothesis there would be an element of order p in N. hence in G. . there exists an a E G such that [a]P = [e]. Suppose that there is a subgroup (e) =1= N =1= G. and because IGIN I = IGI/INI.) What is this "prime order"? Because p divides IGI. in forming TLI N. So we have a starting point for our induction. and so on. which you should be able to handle more easily now. Because p divides IGI and p(INI. But. So we may suppose that p (INI. then (aP)m = e. Let's look at it another way. 4. If we could show that b = am =1= e. what we are doing is identifying 1 with 6. So although GIN is not a subgroup of G. and since [a] has order p. What we are doing is identifying two elements in G if they satisfy the relation ab 1 E N. So it is worthwhile looking at this whole matter from a variety of points of view. This translates to a P E N. p (m = INI. INI > 1. with N blotted out. The nice thing about all this is that this identification jibes with addition in 7L when we go over to 7LIN. SO we are done if G has a nontrivial subgroup. D We shall have other applications of this kind of grouptheoretic argument in the problems. The group GIN is abelian." For instance. and we would be done. So if m = IN I. every subgroup is normal. we have that p divides IGINI. 6 Factor Groups 81 assume the theorem to be true for all abelian groups of order less than IGI and show that this forces the theorem to be true for G. In a sense we are blotting out N. there is no such p and the theorem is vacuously true. where 7L is the group of integers and N is the set of all multiples of 5 in 7L. and we are identifying all multiples of 5 with O. by assumption. 11. Let's look at a few examples from this point of view. we are looking at equivalences classes defined via N. so we can form GIN. since G is (Prove!) and since N =1= (e). Since IN I < IG I. it must be cyclic of prime order. We consider GIN from another angle now. then b would be the required element of order p in G. and so proves the theorem.
. So in passing to GIN every element is being identified with a positive real number. What does GI7L really look like? In forming GI7L. If G is the group of all nonzero real numbers under multiplication and N is the subgroup of all positive real numbers. Since G is abelian. and construct the multiplication in GIN. bib real} C G. 3. which is Tac. 2 1. in a sense that can be made precise. 2. and Icos () + i sin ()I = 1. of course! So GI7L is like a circle.. 2. 7L <J G. Let G = {Ta . in its polar form.. Every real number a thus has a mate.o. Now T a ... we saw that N <J G.. If G is the group of nonzero real numbers under multiplication and . b with Ta .ad+b' is identified with T ac . so in GIN we are getting an element by identifying Ta . and so we can talk about GI7L. .d = Tac. . So. since lab 1 = 1a II bl· So GIN is in a very real sense (no pun intended) the group of positive real numbers under multiplication. so it makes sense to talk about GIN.. where 0 ::::. In going to GIN we are declaring that any complex number of absolute value 1 will be identified with the real number 1.b with Ta. write out GIN by exhibiting the cosets of N in G. ~. Tc. are in the same left coset of N in G. in GI7L. So 0 is identified with 1. Let G be the group of nonzero complex numbers and let N = {a E Gila I = I} which is the unit circle in the complex plane.ad+b and if we identify Ta. then their product. 1]. ~.3. band T a . PROBLEMS 1. 3.b multiply according to Ta. and 1. . °. and in some sense (which will be made more precise in the next section) GIN can be identified with this group of real numbers. 0. can be written as a = r(cos () + i sin (). What do we get this way? A circle. . ~ is identified with ~. ii < 1.. ° 2. the whole real line has been compressed into the unit interval [0. 2. Moreover. we are identifying any two real numbers that differ by an integer. for we have also identified the end points of this unit interval. So we are bending the unit interval around so that its two end points touch and become one.82 Groups Ch. we are identifying a with r. ii. Then N is a subgroup of G and is normal since G is abelian. the Ta. Now any a E G. But a little more is true.o. and this circle is a group with an appropriate product. So in GIN multiplication is like that of the group of nonzero real numbers under multiplication. Let G be the group of real numbers under + and let 7L be the group of integers under +. and this identification jibes with the products in G and in the group of positive real numbers. The latter element just depends on a. b I a =1= 0.d with Tc. b real} (Example 6 of Section 1). ~. Let N = { T 1. In identifying a cos () + i sin () with 1.bTc. where r = 1 I.
prove that G is cyclic.1 E N for all a. prove that GIN is abelian. Do Problems 7 and 8 by observing that GI N is a homomorphic image ofG. respectively. 13. We prove it in Section 11.) This result. bEG.. 9. 12. Arguing as in the Example 2. If G is a group and Z(G) the center of G. If G is a group and N <l G. .. If G is a cyclic group and N is a subgroup of G. b) E G I a. and so we can define a product on the donut so that it becomes a group. . 8. If G is a group and N <l G is such that for all a. then M is a subgroup of G. P2 2.. (See Problem 15. S 2. show that if GIZ(G) is cyclic. you may think of a torus as the Cartesian product of two circles. consider the following: let G = {(a. which actually holds for all finite groups. d) = (a + c. show that GIN is a cyclic group. where G is the group of reals under + and lL integers. If G is an abelian group of order PIP2 . b real}. b) I a. 10. where + in G is defined by (a. bEG. and M ~ N. where Ph P2. . If G is an abelian group and N is a subgroup of G. b) + (c. What are the cosets of Nin G? 3. show that if M is a subgroup of GIN and M = {a E GINa E M}. 6 Factor Groups 83 N = {I. show that the M defined is normal in G. . show how you can "identify" GIN as the group of all positive real numbers under multiplication. p%k.. where we identified GIlL as a circle. 4. where Ph P2. and let N = {(a. Show that GIN can be identified as a torus (donut). Show that G has subgroups S 1. Pk are distinct primes. 7. then G is abelian.. Let G be an abelian group of order p~lP22 . is a famous result in group theory known as Sylow's Theorem. b + d) (so G is the plane). Here. prove that aba1b... show that GIN is an abelian group. In Problem 3. p%k. Pk. If M in Problem 3 is normal in GIN. show that MIN must equal M. (Hint: Use Cauchy's Theorem and pass to a factor group. 14. 5. . b are integers}. If G is a group and N <l G is such that GIN is abelian. Sk of orders p~l. 11. .Sec..) . I}. 6.Pk are distinct prime numbers.
b) = a is a homomorphism of G onto G' with kernel K = {T 1. m > 1 depending on a}. 2.o) = KTax. 17. (b) GIP has no elements of order p. The First Homomorphism Theorem. G was the group of nonzero complex numbers under multiplication and G' the group of all positive real numbers under multiplication. Let G be an abelian group of order pnm . prove that G has an element of order mn. Therefore. where m and n are relatively prime. 7. spells out this relationship in exact detail. where m and n are relatively prime. I a =1= 0 real}. is readily seen to be an isomorphism of GIK onto G'. 18. hence we can form GIK.2 15.o the mapping of GIK onto G'. namely Ta. which we are about to prove. But first let's look back at some of the examples of factor groups in Section 6 to see explicitly what the relationship mentioned above might be. From the product rule of these T's. On the other hand. . Let G be an abelian group (possibly infinite) and let the set T = {a E G I am = e. If K is the kernel of cp. 1. If G is an abelian group and if G has an element of order m and one of order n.84 Groups Ch. Prove that: (a) T is a subgroup of G. Let G be an abelian group of order mn. In Example 3.o)(KTx. Since ° (KTa. then K is a normal subgroup of G. in Example 1 of Section 6 we saw that GIK = {KTa.o onto a. Let G = {Ta .d = Tac. (c) Ipi = pn. such that x m = unit element of GI M. THE HOMOMORPHISM THEOREMS Let G be a group and cp a homomorphism of G onto G'. It is fairly natural to expect that there should be a very close relationship between G' and GIK.bTc. b I a =1= 0. which sends each KTa.ad+b' we determined that the mapping cp: G ~ G' defined by CP(Ta. 16. GIK::::= G'. b real} and let G' be the group of nonzero reals under multiplication. (b) GIM has no element. Let M = {a E G I am = e}. Let P = {a E G I apk = e for some k depending on a}. Prove that: (a) M is a subgroup of G. where p is a prime and p (m. other than the identity element. x. Prove that: (a) P is a subgroup of G. bib real}. (b) GIT has no elementother than its identity elementof finite order.
that is. to Theorem 2. then cp(a) = cp(b). from this discussion. Finally. here. thus x = cp(a) = I/J(Ka). But we have already seen that if Ia I = 1. the isomorphism between these being effected by the map I/J: defined by I/J(Ka) = G/K~ G' cp(a).Sec. using that cp is a homomorphism and that (Ka)(Kb) = Kab. So define I/J: G/K ~ G' by I/J(Ka) = cp(a) for a E G. So. where r > O. then cp(k) = e'. The mapping of G/K onto G' defined by sending Kr onto r then defines an isomorphism of G/K onto G'. So the set K = {cos () + i sin () I 0 ~ () < 27T}. whose elements are the cosets Ka. since Iabl = Ia I Ib I. Let cp be a homomorphism of G onto G' with kernel K. So G/K. Therefore. so we get cp(a) = cp(b). the kernel of cp. 7 The Homomorphism Theorems 85 Let cp: G ~ G' defined by cp (a) = la I for a E G. too. If a is any complex number.1 is thus in the kernel of cpwhich is Kwe have ab. cp is a homomorphism of G onto G' (can you see why it is onto?). and Theorem 2.1 is proved. Proof The best way to show that G/K and G' are isomorphic is to exhibit explicitly an isomorphism of G/K onto G'. namely. is the polar form of a. since K(cos () + i sin() = K because cos () + i sin () E K. then x = cp(a) for some a E G. e' = cp(a)cp(b)l = cp(a)cp(b. hence cp(a) = cp(kb) = cp(k)cp(b). is I/J a homomorphism? We check: 1/J((Ka)(Kb» = I/J(Kab) = cp(ab) = cp(a)cp(b) = I/J(Ka)I/J(Kb). has all its elements of the form Kr.1 ) = cp(ab. where r = Ia I. Then G' ~ G/K.7. This implies that Ka = Kb.1 E K.1 (First Homomorphism Theorem). then a is of the form cos () + i sin (). I/J is a homomorphism of G/K onto G'. In this way I/J is seen to be 11. our first task is to show that I/J is well defined. As usual. Is I/J II? Suppose that I/J(Ka) = I/J(Kb). But if Ka = Kb. Because ab. Then. given x E G'. G/K ~ G'. This boils down to showing that if Ka = Kb. to show that if Ka = Kb. then a = r( cos () + i sin (). This shows that I/J maps G/K onto G'.1). With this little experience behind us we are ready to make the jump the whole way. Since k E K. Thus the kernel K of cp is precisely K = {a E Gila I = I}. then I/J(Ka) = I/J(Kb).7. This shows that the mapping I/J is well defined. then a = kb for some k E K. Because cp is onto G'. the identity element of G'. Consequently. Thus Ka = Kr(cos () + i sin () = K(cos () + i sin ()r = Kr. D . then cp(a) = I/J(Ka) = I/J(Kb) = cp(b). The statement of the theorem suggests what such an isomorphism might be.
then q. The next result. Theorem 2.86 Groups Ch. since e E H.(b) E H'.: G ~ G' be a homomorphism of G onto G' with kernel K. Finally.3 (Second Homomorphism Theorem). and HIK == H'.(ab) = q. In the context of the theorem above. Theorem 2. again since H' is a subgroup of G'. so H is closed. this puts ab in H. and is traditionally called the Correspondence Theorem.(a) E H'. Moreover. where e' is the unit element of G'. and HI(H n N) =: (HN)/N.(a)IH' q. Further. so q. This tells us that q. Then HN = {hn I h E H. if H' <J G '.(a) E H'}. so aIHa C H. Let H be a subgroup of a group G and N a normal subgroup of G. we have that K C H. then H is a subgroup of G. since H' is a subgroup of G'. however. whence aI E H.(aI)H'q. leaving its proof to the reader in Problem 5. b E H. D It is worth noting that if K is any normal subgroup of G.(a)q.7. q. is the natural homomorphism of G onto G/K. and q. then the theorem gives us a 11 cor. Therefore. This proves the normality of H in G. Since K <J G and K C H.(K) = {e'} C H'. then H<J G. H n N is a normal subgroup of H. it follows that K <J H.) We now state the Second Homomorphism Theorem.2 Having talked about the First Homomorphism Theorem suggests that there are others.(a)I is in H'.(a). then q. (See Problem 7. H :> K.(aIHa) C H'.(a. If H' is a subgroup of G' and if H = {a E GI q. if a E H. if H' <J G' and if a E G. Because q.I) = q. restricted to H defines a homomorphism of H onto H' with kernel K.(b) E H'. Proof We first verify that the H above is a subgroup of G.2 (Correspondence Theorem). is an extension of the First Homomorphism Theorem. it exhibits a 11 correspondence between subgroups of G I and those subgroups of G that contain K. n EN} is a subgroup of G. then q. as well as the last conclusion of the theorem. Finally.(a) C H'.. this correspondence preserves normality in the sense that H' is normal in G/K if and only if H is normal in G. hence q. By the First Homomorphism Theorem we get HIK == H'. respondence between all subgroups H' of G/K and those subgroups of G that contain K. The mapping q. Let the map q.(a) C H'. If a. It is not empty.7. . hence q. H is a subgroup of G.
we are sort of "canceling out" the K in the numerator and denominator. What is the kernel. 1}. Finally. 3. by the definition of l/J. for l/J(ab) = N' cp(ab) = N' cp(a)cp(b) = (N' cp(a»(N' cp(b» = l/J(a)l/J(b). 1]. M = N. Show that M ~ N in the proof of Theorem 2. Let G be the group of all realvalued functions on the unit interval [0. that is. prove that GIN ~ real numbers under +. we go on to the Third Homomorphism Theorem. (c) GIN ~ G 2 • . Theorem 2. of l/J? If a E M. show that: (a) N = {(a.4 (Third Homomorphism Theorem). bd).7. e2) I a E G l }. D This last equality is highly suggestive. by the First Homomorphism Theorem. whence. again by Theorems 2. b) I a E G l . but this puts a in N. is a normal subgroup of G. so l/J is a homomorphism of G onto G'IN' with kernel N.2. Proof Define the mapping l/J: G ~ G'IN' by l/J(a) = N'cp(a) for every a E G. If N = {I E G I I(~) = O}. which leads us to GIN ~ G'IN' ~ (GIK)/(NIK).1]. G' ~ GIK. That N C M is easy and is left to the reader. Because N' cp(a) = N' we must have cp(a) EN'. 4. by the very definition of N.3. we conclude that GIN ~ G'IN'. Equivalently. 7 The Homomorphism Theorems 87 Finally. (b) N~ G l .7. b)(c. d) = (ac. since N' <J G'. 2. If the map cp: G ~ G' is a homomorphism of G onto G' with kernel K then. we see that l/Jmaps G onto G'IN'. PROBLEMS 1. GIN ~ G'IN'. GIN ~ (GIK)/(NIK). N' ~ NIK. then l/J(a) is the unit element of G'IN'. Let G be the group of nonzero real numbers under multiplication and let N = {1. Furthermore. g E G. Prove that GIN ~ positive real numbers under multiplication. where we define (a. l/J is a homomorphism of G onto G'IN'. where we define. addition by (I + g)(x) = I(x) + g(x) for every x E [0. for I. l/J(a) = N' cp(a). Thus MeN. if N' <J G' and N = {a E G I cp(a) EN'}. which tells us a little more about the relationship between Nand N' when N' <J G'. l/J(a) = N'. and x' = cp(x) for some x E G. On the other hand.7. If G b G 2 are two groups and G = G l X G 2 = {(a.Sec. b E G 2 }. M.7.1 and 2. Since cp is onto G' and every element of G'IN' is a coset of the form N'x'. where e2 is the unit element of G 2. Therefore.
Proof Let s E S. We prove it here to put it on the record officially. ButfP(s) = s and sincep~m. f E A(S). where m lo(a). (Prove!)] Lemma 2. Let the set HN = {hn I h E H.1. or zero). so has one element.1 andfm(s) = s. then the orbit of sunder fconsists merely of s itself. We leave it to the reader as a problem that this does indeed define an equivalence relation on S. We shall give a very neat proof of this here.) 7. we can say something about the size of the orbits under f. . (d) (HN)IN ~ HI(H n N). is a homomorphism of G onto G' and N <J G. Let m = j .. n EN}.f p. [Iffk(s) = s. . then the orbit of any element of Sunder fhas 1 or p elements. and N <J G. and define a relation on S as follows: s '"'' t if t = fi(S) for some integer i (i can be positive. of courseftk(s) = s for every integer t. Let G be a group. is called the orbit of sunder f So S is the disjoint union of the orbits of its elements. (c) N C HN and N <J HN. negative.2 5. then Na in GIN has finite order m.1. we claim that thesep elements are distinct and constitute the orbit of sunder f If not. then 0< m ~p . We return for a moment to set theory. iff(s) = s. 8.i. doing something that we mentioned in the problems in Section 4. p a prime. *6. Suppose then that f(s) =1= s.f2 (s). Let S be a set. Prove that: (a) H n N <J H. ap + bm = 1 for some integers a and b.(N) <J G'. If f E A(S) is of order p.1 (s). Consider the elements s. If G is a group and N <J G. Thus fl(S) = fap+bm(s) = fap(fbm(s» = fap(s) = S. show that if a E G has finite order o(a). then G contains an element of order p. We did point out there that Cauchy's Theorem is true even if the group is not abelian. this proof is due to McKay. H a subgroup of G..6. [s]. When fis of order p.88 Groups Ch.4Cauchy's Theoremwe proved that if a prime p divides the order of a finite abelian group G.f(s). p a prime. those of the readers who solved Problem 34 of Section 4 already know the result. then fi(S) = fj(s) for some o ~ i < j ~ p . The equivalence class of s. (b) HN is a subgroup of G. show that q. which gives us that fji(S) = s. If q. (Prove this by using the homomorphism of G onto GIN. .8. CAUCHY'S THEOREM In Theorem 2.
. e.. . We strongly urge the reader who feels uncomfortable with the proof just given to carry out its details for p = 3. we know that m 2:: 1. In this case the action of f on S becomes clear and our assertions about this action can be checked explicitly. the result amounts to Problem 18 in Section 1. then apala2 . On the other hand. Note that if ala2 ... Why? We can choose ab . a p). 8 Cauchy's Theorem 89 since fm(s) = fP(s) = s. we get that m > 1.. Let m be the number of s E S such that f(s) = s. a is the required element of order p... . If there are k such orbits where f(s) =1= s. apb ap) then satisfies so is in S. where ab a2' .. . since for s = (e. Proof If p = 2. . Note thatf =1= i... from the definition of S this implies that a P = e. .. apla p = e. and by putting ap = (ala2 .. . . and these orbits are disjoint... the orbit of s consists of p elements.. a) =1= (e. Theorem 2.. apl)t. .) So f(s) = s if and only if s = (a.··. ai =1= aj. e) in S.1 = m + kp. a) for some a E G. for we have accounted this way for every element of S. f(s) = s. then for some i =1= j.. (We leave this to the reader. f(s). so as p elements. We claim that S has n P l elements where n = IGI. and that fP = i. D We now give McKay's proof of Cauchy's Theorem. Let S be the set of all ordered ptuples (ab a2. the ptuple (ab a2.1.8. apl) is in A(S). then f(s) = s. the identity map on S. apl = e (for if xy = e in a group. apl arbitrarily in G.2 (Cauchy).kp.. a.. f 2(s). a2. . If the orbit of sunder fhas one element. Thus S has n P 1 elements. aI. we get that n P .l . . . since m = P . But pin by assumption and p I (kp). D Note that the proof tells us that the number of solutions in G of x P = e is a positive multiple of p. then G contains an element of order p. . This contradicts that f(s) =1= s. Since a =1= e. ' ap are in G and where ala2 . ap) = (a p.8.. if f(s) =1= s. So the mapping f: S ~ S defined by f(ab . for they are equivalence classes. e.. . e). If p is a prime and p divides the order of G. But this says that n there is an s = (a.. . Thus the orbit of s under f consists of s. this we have by Lemma 2. so f is of order p. we know that the orbit of sunder f consists precisely of p distinct elements. then yx = e). Assume that p =1= 2. So we must have p I m. .. apb ap).Sec. ap1a p = e. . On the other hand.. Now when is f(s) =1= s? We claim that f(s) =1= s if and only if when s = (aI' a2. . Because m =1= 0 and p I m... if f(s) =1= s. a..fP1 (s).
This says that ab is of order mn.90 Groups Ch. for some i (depending on x). u == x. B == xlAx is a subgroup of G of order p. Other consequences will be found in the problem set to follow. as claimed in the lemma. and in later material on groups. For suppose that xy == uv where x. we have ulx E A n B. If a EGis of order p and A is the subgroup of G generated by a. Proof Suppose that A is the subgroup generated by a and B that generated by b. thus (ab)i == e. Since B =1= A and A n B is a subgroup of A and A is of prime order. and xlax =1= e.8. 0 ::. q are primes and p > q.3. for IA n B I I nand IA n B II m. So a i == e. We shall present one of these. we see that mn is the smallest positive integer i such that (ab)i == e. v == y. y. Suppose that c i == e. which follows from Lagrange's Theorem. then xlax == ai. and since ulx == vyt. i < p. Since (ab )mn == a mn b mn == e. Lemma 2. where p. v E B. we claim that AB has p2 distinct elements. a are as in Lemma 2.8. then A <J G. which has only pq < p2 elements (since p > q). we are forced to conclude that A n B == (e) and so ulx == e. e == (ab)i == aib i .8. If G. where 0 < i < p. D Corollary. Proof We claim that A is the only subgroup of G of order p. In consquence. n) == 1 and m and n both divide i. in consequence of which we conclude that xlAx == A. then ulx == vyl. Thus the number of distinct elements in AB is p2. that is. xlax E A. But every element of A is of the form ai. n) == 1. Lemma 2. Because IAI == m and IB I == nand (m. in which we determine completely the nature of certain groups of order pq. Because (m. then c == ab is of order mn. Proof Since e =1= a E A and xlAx == A. Similarly. D . whence n I i. where i > 0. y E B}. this tells us that a i == b.3 and if x E G. hence A <J G. where m and n are relatively prime and ab == ba. where 0 < i < p.2 Cauchy's Theorem has many consequences. Suppose that B is another subgroup of order p. and b i == e. If a EGis of order m and bEG is of order n. vyl E B. But if x E G. With this contradiction we see that B == A and A is the only subgroup of order p in G. whence m I i. Consider the set AB == {xy Ix E A.4. u E A. where p and q are distinct primes. mn divides i. D We now prove a result of a different flavor.i E A n B == (e). Let G be a group of order pq. But all these elements are in G. But ulx E A. we get A n B == (e). xlax == ai. So i 2:: mn. Since ab == ba.
2. (Hint: Use for H a subgroup of order p. By Lemma 2. D PROBLEMS MiddleLevel Problems 1. . i = 1. show that if some two entries in s (ab a2. This means that the 15 powers e = co. Using the result of Problem 40 of Section 5. b1ab = a i = a.8. 14 C are distinct. so. G has elements b of order 3 and a of order 5. Construct a nonabelian group of order 21.8.Sec. so the powers of c sweep out all of G.8.4.) 4. 2. and the theorem is proved. iq iq 1 q But b = e. b1ab = a i = a. where p.1. ••• . Prove that a group of order 35 is cyclic.8).1 = e. Proof By Cauchy's Theorem.3. hence ab = ba.2 ab 2 = b1(b1ab)b = b1aib = (b1ab)i = (ai)i = a i2 and similarly.4. so must sweep out all of G. but the form in which we did it is the exact prototype for the proof of the more general Theorem 2. 3. In a word.8.. q are primes and p > q.1.4. then I(s) =1= s. Since a is of order 5. i P . by Fermat's Theorem. i 4 == 1(5). by Fermat's Theorem (Corollary to Theorem 2. a p) are different. since 0 < i < 5. 5 must divide i 3 .3. c. give another proof of Lemma 2. therefore.1. (Hint: Assume that a 3 = e. .8. by Lemma 2. However. By Cauchy's Theorem. Therefore.1. and since o < i < p. In the proof of Theorem 2. c2 . which means that ab = ba.= e.1 == l(p). If q (p . Thus brab r = air for all r 2:: 0 (Prove!).. we conclude that i == l(p). G has an element a of order p and an element b of order q. Thus b. i 3 == 1(5). These two equations for i tell us that i == 1(5). and so bqab q = a . a group G of order 15. b1ab = a i for some i iq with 0 < i < p.8. c = ab is of order 15. In short. b1ab = ai. then G must be cyclic. that is. which is to say. c = ab has order pq. Because a is of order p. By the Corollary to Lemma 2. so we get a i3 = a.8.3ab 3 = a i3 . 8 Cauchy's Theorem 91 Before considering the more general case of groups of order pq. G must be cyclic. Thus G is cyclic. Since q (p . Let G be a group of order pq. and the orbit of sunder 1 has p elements.3. . i = 1 follows. whence a i3 . But b 3 = e. b. i q == l(p). we conclude that p I i q . However. Since a is of order 5 and b of order 3. where 0 < i < 5. let's look at a special case. By the Corollary to Lemma 2. a = a and so a . namely.5. The argument given for 15 could have been made shorter.
Let p > q be two primes such that q I p . b E B}.) 15. where m and n are relatively prime. 2 b7 = e and find some i such that a1ba the relations a 3 = b7 = e. 14. Suppose that G has a normal subgroup P of order pn. Harder Problems 11. If G is a group and A. Prove that O(P) = P for every automorphism 0 of G.) = ai =1= a. DIRECT PRODUCTS In several of the problems and examples that appeared earlier. Prove that any two nonabelian groups of order 21 are isomorphic. and the idea needed to do Problem 4 above. Prove that there exists a nonabelian group of order pq. where p is prime and p ~ m. From the result of Problem 9. Let G be a finite group with subgroups A. then any two nonabelian groups of order pq are isomorphic. G 2 are two groups. Let G be a group of order pn m . prove that any group of order 99 is abelian. prove that the set AB = tab I a E A.1. 6. 9. B of orders m. respectively. Prove that if p > q are two primes such that q I p . n. If G is a group with subgroups A. prove that a group of order 42 has a normal subgroup of order 21. 7. Using the fact that any group of order 9 is abelian.) Very Hard Problems 13. b). where a E G 1 and b E G 2 and . (Hint: Use the result of Problem 40 of Section 4. 10. then G = G 1 X G 2 is the set of all ordered pairs (a. AB = {ab I a E A. prove that the subset of G. 8. b E B} has (IA liB 1)/ IA n B I distinct elements. 9. Prove that A n B =1= (e). which is consistent with 5. B such that IA I > VTGI and IBI > VTGI. has mn distinct elements. (See Problem 4. namely that Up is cyclic if p is a prime.1. Prove that a group of order 42 has a nontrivial normal subgroup. we went through the following construction: If G 1. B finite subgroups of G.92 Groups Ch. Prove that a group of order 99 has a nontrivial normal subgroup. 12.
an.. .(a11 . · an = blb2· · · bn.. ' an) where ai E G i .an ) 1 . by the isomorphism 1Ti: G j ~ G i defined by 1Ti (e1' e2' ..a21 . We call it external.2. Consider the subsets G j C G 1 X G 2 x·· .. · · an with aj E G j • This motivates the following Definition.Nn if every a E G has a unique representation in the form a = ala2 . G is merely the Cartesian product of the groups G 1. since the groups G 1 . Definition. not only is G j a subgroup of G but G j <J G. . .2. en) as its unit element.. X G n is a group is immediate. . where the ai E G i and bi E G i ..n. that is. b 1)(a2' b2) (a1a2' b 1b 2). then that is. . n. .. .. . G i is a group and is isomorphic to G i. . The group G is said to be the (internal) direct product of its normal subgroups N b N 2. . X G n is defined componentwise. . .. G n with a product defined in G by componentwise multiplication. then their (external) direct product G 1 X G 2 X G 3 X .. a can be written in this way in a unique manner.... From what we have discussed above we have the . That G = G 1 X G 2 X . ai' . and where the product in G 1 X G 2 X . where ei is the unit element of G i. (Prove!) Given any element a = (ab a2' .. G j consists of all ntuples where in the ith component any element of G i can occur and where every other component is the identity element. . G 2 .. every a E G can be written as a = ala2· . and h were (aba2. G 2. .a n1) . X G n is the set of all ordered ntuples (ab a2' . .. X G n = G... for i = 1. that is... G n are any groups.Sec.. 9 Direct Products 93 where the product was defined componentwise via (aI. with (e1' e2' . G n are n groups. So G is built up from certain normal subgroups. an) E G. then al = bi .... Furthermore. . If G 1. .. . the products in each component being carried out in the respective groups G 1 and G 2 • We should like to formalize this procedure here. ••• . as G = G I G 2 • • • G n in such a way that every element a E G has a unique representation in the form a = ala2 . where in other words. Clearly.. where each ai E G i • Moreover.. .. . if a = ala2· . the G j . .. an' where each ai E N i for i = 1.. . . . . G 2.an = bn. en) = a i . with no relatioh necessarily holding among them.
. a . we claim that N i n N j = (e) for i =1= j.I ) E M. Thus a E M n N = (e). N i n N j = (e).I .2.. Since M <J G and m.. However a = e . a2. e. where the a occurs in the jth place. a = m(nmIn. If G is as in Lemma 2. . mnmIn. Perhaps things would be clearer if we do it for n = 2. This gives us one representation of a in G = N I N 2 ••• N n • On the other hand. Corollary. N 2 <J G. X N n • To do so. e .I = e. so a has the second representation as an element of N I N 2 ••• N n • By the uniqueness of the representation.9. where b l = e E N b b 2 = a E N 2 . and every element a E G has a unique representation as a = al . . Section 5. So N I n N 2 = (e)..I is also in N. as required. e. .9. The result we are about to prove has already occurred as Problem 20. we .•• . so a = b l . N n . since N <J G and n E N.2 Lemma 2. Now bracket a in the other way.. G n . N n . Lemma 2. we first get some preliminary results.9. then if i have aiaj = ajai' =1= j and ai E N i and aj E N j .9. b 2. By the uniqueness of the representation we must have a I = b l. where the a occurs in the ith place. e . At any rate we have proved Lemma 2. M. we have nmIn. which is to say. then G is the internal direct product of the normal subgroups G b G 2 . The argument given above for N b . then G is isomorphic to N I X N 2 X . If G is the internal direct product of its normal subgroups N b N 2 . This gives us that mn = nm. given m E M and n E N. mnm. ••• . we get a = e. ••• . a. that is. For suppose that a E N i n N j . eae . mn = nm. Then.I E N. Viewing a as bracketed one way. . so a = (mnmI)n. for i =1= j. · .I ). Suppose that a E N I n N 2 . then. e . D If G is the internal direct product of the normal subgroups N I.1.. namely to prove that if G is the internal direct product of its normal subgroups N b N 2 . Proof Consider the element a = mnmIn.. where al E N I . For the sake of completeness we prove it here. a = e.3. G n defined above. a2 with al = a E N l . a2 E N 2 . but perhaps is less transparent. a = (mnmI)n. N 2 .3. N n is the same argument as that given for n = 2. then a = a . a2 = e E N 2 . then.I .. If G = G I X G 2 X . G 2 . .. then a = e . We want to go in the other direction.. e . X G n is the external direct product of G I . e is a representation of a = al .. So suppose that N I <J G.•.I E M and so a = m(nmIn. a = e . and so N i n N j = (e). N normal subgroups of G such that M n N = (e).94 Groups Ch.I E M. ••• .Nn . Let G be a group.
So.4. . N 2.···. . b n». that is.9... N 2 • Then G is the internal direct product of N I and N 2 if and only if G = N 1 N 2 and N I n N 2 = (e). anb n = aIa2b2a3b3 anbnb l . D Corollary.. b 2... ..· · . . .Sec. On the other hand. Now repeat this procedure with b 2. Let G be a group with normal subgroups N I .. a2.. aiaj = aja i for ai E N i .4 is complete.. an = b Ib 2 ... ••• . . b n). to get that a Ib Ia 2b 2 anb n = (aIa2 . . . For if t/J((ab a2'···' an» = t/J((b 1 ... with the ai E N i . . Proof Suppose G is an internal direct product of N b . .9. by Lemma 2. bn ) t/J((a b a2. consider t/J((a b a2.. Since the N's are normal in G..· . So we can pull the b i across all the elements to the right of it to get aIbIa2b2· . We assert that it is also 11. Then the conclusion that G is the internal direct product of N I. . N n easily follows from the fact that t/Jis onto and 11. aj E N j .an)(b 1 . With this the proof of Theorem 2.. an is an isomorphism from N I X N 2 X .. Thus t/J((a 1. N i n N j = (e) for i =1= j. ••• .9..3. · a n)(b Ib 2 . All that remains is to show that t/J is a homomorphism... b 2.. .an)(b 1 . aIa2 ...anbn» = (a 1b 1 )(a 2b2 ) • • • (anb n) Since b i E N b it commutes with ai' b i for i > 1 by the Corollary to Lemma 2.. N n. then by the definition of t/J. . an. . Hence t/Jis 11.bn» = t/J((a1b b a2 b 2. 9 Direct Products 95 Proof By Lemma 2. .bn»· In other words. N 2 .9. . an) = aIa2 . b n.9.· .. N 2 . b2 ..an»t/J((b 1 b 2. t/J is a homomorphism..2 we have that any element in N i commutes with any element in N j .an = bn. an )(b 1 b 2 . suppose that t/J is an isomorphism.. X N n (external direct product) onto G if and only if G is the internal direct product of N I.. and so on.. N n • Since every element a in G has a representation a = ala2 . 0 With these preliminaries out of the way we can now prove Theorem 2.3.bn» = a1b 1a2 b 2 . Then the mapping t/J(ab a2. · · anb n = = (a 1a2 . . By the uniqueness of the representation of an element in this form we deduce that al = b h a2 = b2. . . ••• . . we have that the mapping t/J is onto. . N n. Let G be a group with normal subgroups N I .
. we drop the adjectives "internal" and "external" and merely speak about the "direct product. . The objective is often to show that a given group is the direct product of certain normal subgroups." When notation G = N I X N 2 is used it should be clear from context whether it stands for the internal or external direct product. If one can do this. In A let T = {(g. N 2 . a2) = aIa2. N k normal subgroups of G such that G = N I N 2 ••• N k and IGI = IN I IIN 2 1·· ·INkl.. Let G be a finite group..4 and its corollary. PROBLEMS 1.2 Proof This follows easily from the fact that l/J: N l X N 2 ~ G. If G I and G 2 are groups. ••• . N I. . G = N I N 2 • •• N k. N 2 . Let G be a group. 3. prove that G I X G 2 = G 2 X G I • 2. N k • 6. is an isomorphism if and only if N I N 2 = G and N I n N 2 = (e). 4. mk > O. A = G X G. respectively. N i n (N I N 2 ••• NiINi + 1 ••• N k) = (e). G has a subgroup Pi of order pr'. Let G be an abelian group of order p'{'lp'{'2 . N 2. N I . ••• . By Problem 10 of Section 6. g) Ig E G}. 2. For each i.96 Groups Ch. If G l and G 2 are cyclic groups of orders m and n.9. . · p'!:k. P2'···' Pk are distinct primes and mi > 0. which is given by l/J(a}. . (b) Prove that T <J A if and only if G is abelian. it might seem like a nice little construction. Show that G = PI X P 2 X . Prove that G is the direct product of N}. . If we were to leave that topic at the point where we ended. N k. prove that G I X G 2 is cyclic if and only if m and n are relatively prime. N k normal subgroups of G such that: 1. X P k • 5.. 0 In view of the result of Theorem 2. N 2 . where PI.. . Prove that G is the direct product of N}. Let G be a group. but so what? To give some more substance to it. for each i. (a) Prove that T = G. m2 > 0. ••• . FINITE ABELIAN GROUPS (OPTIONAL) We have just finished discussing the idea of the direct product of groups. 10. the structure of the group can be completely determined if we happen to know those of the normal subgroups. .
b. Let G be an abelian group of order pn. X A k .. and rightfully so. This step will be fairly easy to carry out. What should this mechanism be? Our control on the elements of G lies only in specifying their orders.::: . Before getting down to the actual details of the proof. So we shall focus on groups of order pn. such that every element x E G can be written as x = b Ib 2 · . when applied after we have picked a I. a7:\ where the elements a'r i are unique. Our first step will be to reduce the problem from any finite abelian group to one whose order is pn. Fortunately. . say. the details of the proof will not be cluttered with elements whose orders are somewhat complicated. where p is a prime. For instance. b. A difficulty appears right away. b k . and each Ai is cyclic generated by ai. To consider these would be going too far afield. If this were so and x = a'. It is the order of the elementwhen properly usedthat will give us the means to prove the theorem.. . We want to show that there exist cyclic subgroups of G.. and since the group will have order involving just one prime.::: nk. C are the cyclic subgroups generated by a. ak. we want to show that x = a'.. This reduces most questions about finite abelian groups to questions about cyclic groups. A 2. we should like to give a quick sketch of how we shall go about proving the theorem... Otherwise put.. the finite abelian groups.l . . and ab. B.la'{'2 . since each Ai is cyclic and generated by ai. then G = A X B = A X C = B X C. where IGI = pn and the A's have been numbered. What we shall prove is that any finite abelian group is the direct product of cyclic groups. . where each b i E Ai' in a unique way. then we can see that if A. so that IAil = pn and nl . will allow us to pick a2. Suppose that G = Al X A 2 X . such a class of groups exists.::: n2 . . a7:\ then 1 . for there is not just one choice for these elements aI. a. The theorem we shall prove is called the Fundamental Theorem on Finite Abelian Groups. How to get around this? What we need is a mechanism for picking al and which.Sec. respectively. especially since the story for finite abelian groups is so important in its own right. 10 Finite Abelian Groups (Optional) 97 we should prove at least one theorem which says that a group satisfying a certain condition is the direct product of some particularly easy groups. .. The results on the structure of finite abelian groups are really special cases of some wider and deeper theorems. and so on. if G is the abelian group of order 4 with elements e. namely A I. So there is a lack of uniqueness in the choice of the ai. A k... . a reduction that often allows us to get complete answers to these questions. where a2 = b 2 = e and ab = ba. ab..
and similarly if n =1= 1. In other words. Thus a = asna rm is in MN. What does this translate into in G itself? We want an element a2 such that a2 requires as high a power as possible to fall into A 1.'" we shall use an induction argument and some subsidiary preliminary results. Ch. since M n N is a subgroup of both M and N. and I TI is not divisible by p. So that will be the road to the selection of the second element.2 pfl1 because nl 2:: ni.1. if neither m nor n is 1. 1M n NI divides IMI = m and INI = n. One should not confuse the basic idea in the proofwhich is quite reasonablewith the technical details. we need to show that G = MN and G = M X N. It now follows from Corollary to Theorem 2. m > 0. Proof The sets M and N defined in the assertion above are quickly seen to be subgroups of G. Moreover. thus x = e. With this sketch as guide we hope the proof of the theorem will make sense to the reader. Then one repeats it appropriately to find an element a3 .. since (asn)m = asnm = e. then a = al = a sn + rm = asna rm . then M =1= (e) and N =1= (e). However. and so on.98 Groups lpfl . hence M n N = (e). we should pick an element in G whose order is maximal. arm E N.4) we readily obtain M =1= (e). So we now begin to fill in the details of the sketch of the proof that we outlined above. which may cloud the issue. so since every a'r = e. Lemma 2. The doing of all this is the technical part of the argument and does go through. .9. Furthermore.6. What do we do for a2? If G = G/A b then to get the first element needed to represent G as a direct product of cyclic groups. Let G be a finite abelian group and let p be a prime such that p divides IG I. there exist integers rand s such that rm + sn = 1. then by Cauchy's Theorem (Theorem 2. then G = M X N. D An immediate consequence is the Corollary. a2. that N =1= (e). by Lagrange's Theorem. it may not do the trick. This is the procedure we shall be going through to prove the theorem. Moreover. Because m and n are relatively prime. we obtain 1M n NI = 1. Then G = P X T for some subgroups P and T. Let G be a finite abelian group of order mn. But to smooth out these successive choices of aI. if m =1= 1. In this way G = MN. where /PI = pm. if we pick an element a2 with this property. we have that a sn E M.10. If M = {x E G I x m = e} and N = {x E G I x n = e}. Similarly. To finish the proof.4 that G = M X N. al should be an element of G whose order is as large as it can possibly be. we can now pick al. pnil pnt. we may have to adapt it so that it will. Since m and n are relatively prime. where m and n are relatively prime. Fine. If a E G.
we have. It is easy to see that p ITI by making use of Lagrange's Theorem. Then G = A X Q. by the induction we know that G = (a) X T for some subgroup T of G. then. whence ao(a) = e. It remains to show that A n Q = (e). G = P X T and P =1= (e).0 t We now come to the key step in the proof of the theorem we seek. Proof We proceed by induction on n. we see that ao(a) E A n B = (e). and let a E G have maximal order of all the elements in G.7). this implies a i = e. We claim that G is the internal direct product A X Q. that x P E A. (See Section 11). But since o(a) = o(a). by assumption B =1= (e). On the other hand. We claim that p I i. b not in A. A n Q = (e) and we obtain that G = A X Q.1. Let B = (b). by our choice of x. Since every element in P has order a power ofp. and note that the maximality . so I pi = pm for some m. Theorem 2. thus A n B = (e) (see Problem 1). Ipi is not divisible by any other prime (by Cauchy's Theorem). Let G = G/B. Suppose that G =1= A and let x E G. hence x P = a i for some i. A have smallest possible order. Let o(a) = pS. We first show that the theorem is correct if there exists an element bEG such that b tt. In G. such that P = e. By the Correspondence Theorem we also know that T = Q/B for some subgroup Q of G. The proof is a little difficult. x f!:. We suppose the theorem to be true for all m < n. This tells us that o(a) I o(a). Because o(x P ) < o(x). ao(a) = e.10. that there is no element b in G. and since (a) n T = (e). but once we have this result the rest will be easy. we have that a i = e. where A is the cyclic subgroup generated by a. then IG I = p and G is already a cyclic group generated by any a =1= e in G. what is the order of a = Ba? We claim that o(a) = o(a).10. the subgroup of G generated by b. p a prime. 10 Finite Abelian Groups (Optional) 99 Proof Let P = {x E G I x Ps = e for some s} and let the subset T = {x E G I Xl = e for t relatively prime to p}. Therefore. in which case G is a cyclic b group.2. If n = 1. we know that o(a) I o(a) (see Problem 6 of Section 2. Let a i E A n Q. Since ao(a) E A. That G = AQ is left to the reader. Since a is an element of maximal order in G. Then a i E Q/B = T. By Lemma 2. To begin with.Sec. hence IGI < IGI. Suppose. Let G be an abelian group of order pn. Hence o(a) = o(a). Thus we really have that P is not merely some subgroup of G but is what is called apSylow subgroup of G. A = (a) and b P = e. We claim that this forces G = A = (a). so ao(a) E B.
because x f£ A. In other words. . D We are now able to prove the very basic and important Theorem 2.. Since o(a) = pS. X Tq .10. in that case we saw that the theorem was correct. it has the following three partitions: 3 = 3. nkwhich are called the invariants of Gare unique. 3 = 2 + 1.. Moreover. P = Al X A 2 X . X Ak I = IA I IIA 2 /· • ·IA k I. respectively... where IPI = pn. we may thus assume that T = T I X T 2 X . 3 = 1 + 1 + 1. D We return to abelian groups G of order pn.. Proof Let G be a finite abelian group and p a prime that divides IG I. if n = 3. two abelian groups of order pn are isomorphic if and only if they have the same invariants. X A k .. where the Ti are cyclic subgroups of T. where the Ai are cyclic groups of order pn. But x Ps = (XP)pSl = (ai)pSl = e. Arguing by induction on IGI. b f£ A such that b P = e. we have p I i.. For example.. + nk.2.. Let y = auP. it follows that the number of nonisomorphic abelian groups of order pn is equal to the number of partitions of n.10. This finishes the induction and proves the theorem. . G = P X T. which gives us that hence n = ni + n2 + . and G is a cyclic group. Then yP = aix p = aia i = e. The groups corresponding to these partitions are a cyclic group of order p3.. Thus the integers ni 2:: 0 give us a partition of n. Thus x P = ai. so there are three nonisomorphic abelian groups of order p3 (independent of p).. By the Corollary to Lemma 2.. .. the direct product of a cyclic group of order p2 by one of order p. y f£ (a) = A. Thus G = (AI X A 2 X X A k ) X (T1 X T2 X X A k X T I X T2 X X Tq ) X Tq • = Al X A 2 X This very important theorem is now proved. X A k . where p I i.2 of the order of a implies that x Ps = e. n2. IG I = IA I X A2 X . We now have at hand that G = Al X A 2 X . 2:: nk· Also. x. and the direct product of three cyclic groups of order p.10.3 (Fundamental Theorem on Finite Abelian Groups). It can be shown that these integers n 1.1..100 Groups Ch.. We can arrange the numbering so that n 1 2:: n2 2:: . A finite abelian group is the direct product of cyclic groups. Granted this. So we must have G = (a). where the Ai are cyclic subgroups of P. By Theorem 2. But this puts us back in the situation discussed above. where there exists abE G.
where fern) denotes the number of partitions of m. two partitions of 2. For each prime Pi there are as many groups of order pfi as there are partitions of ai. the path somewhat tortuous. Recall that the element b in G is said to be conjugate to a E G (or merely. Let G be a finite group. The equivalence class of a. 4 = 3 + 1. the Corollary to Lemma 2. there are 10 nonisomorphic abelian groups of order 144. Let A be a normal subgroup of a group G... Let G be an abelian group of order pn. · f(ak). and suppose that bEG is an element of prime order p. we assign only three problems to this section. Thus we know how many nonisomorphic finite abelian groups there are for any given order. then o(aN) = o(a). is called the conjugacy class of a. PROBLEMS 1. p%\ where the Pi are distinct primes and the a i are all positive. as an example of such a relation in a group G. Prove that: (a) The order of aN in GIN divides the order of a in G. which we denote by cl(a).Sec. 4 = 2 + 1 + 1.g. and that b f£ A. e. . Show that xo(a) = e for all x E G. 4 = 2 + 2. and there are five partitions of 4. p a prime. We showed in Section 4 that this defines an equivalence relation on G. For instance. So the number of nonisomorphic abelian groups of order n = p~l . and let a E G have maximal order. 3. with N <J G and a E G. Thus there are five nonisomorphic groups of order p 4 • Can you describe them via the partitions of 4? Given an abelian group of order n = p~lp~2 .. The material treated in this section has been hard.Sylow subgroups (see. that is. 2. o(aN) lo(a). To spare the reader too much further agony. 11. and the effort to understand quite intense.. 11 Conjugacy and Sylow's Theorem (Optional) 101 For n = 4 we see the partitions are 4 = 4. p%k is f(al)f(a2) . which are five in number. the notion of conjugacy. then G is the direct product of its socalled Pi.4 = 1 + 1 + 1 + 1.. (b) If (a) n N = (e).10. a conjugate of a) if there exists an x E G such that b = xlax. Show that A n (b) = (e).. how many nonisomorphic abelian groups are there of order 144? Since 144 = 24 32 .1). CONJUGACY AND SYLOW'S THEOREM (OPTIONAL) In discussing equivalence relations in Section 4 we mentioned.
then ax == xa for all x E G. only if x and yare in the same right coset of C(a) in G. So conjugacy is not an interesting relation for abelian groups. is defined by C(a) == {x E G I xa == ax}. we obtain a(xyl) = (xyl)a. cl(a) consists of all xlax as x runs over G. in other words. ic(C(a)). we need to know when two conjugates of a coincide. So to determine which are the distinct conjugates of a. that of the centralizer of a in G. hence xlax = a. and is equal to IGII IC(a)l. if a E Z(G). C(a) is a subgroup of G. Definition. This brings us to a concept introduced as Example 10 in Section 3. this depends strongly on the element a. for in that case the number of right cosets of C (a) in G is what we called the index.11. Thus there are as many conjugates of a in G as there are right cosets of C (a) in G. the center of G. xyl must commute with a. the centralizer of a in G. a itself. Given a E G. if cl(a) consists only of the element a. which is the same as asking: When is x lax = yIay? In this case. We have proved Theorem 2. then xlax = a for all x E G. As we saw above. however. This is most interesting when G is a finite group. two elements are conjugate if and only if they are equal. When C(a) arose in Section 3 we showed that it was a subgroup of G. then C (a). hence xyla == axyl. We record this now more officially as Lemma 2. So x and y give rise to the same conjugate of a if and only if x and yare in the same right coset of C (a) in G. So Z(G) is characterized as the set of those elements a in G whose conjugacy class has only one element. since G == Z(G). the two conjugates x lax and y1ay of a are equal only if xyl E C(a).1. We repeat something we did there. On the other hand. that is.102 Groups Ch. transposing. of C(a) in G. For a E G. On the other hand. . the conjugacy class of a in this case consists merely of the element a itself. then xyl E C(a). For an abelian group G.2. This yields that xlax = y1ay.2 For a finite group an immediate question presents itself: How large is cl(a)? Of course. hence a E Z(G). in other words. For instance. Let G be a finite group and a E G.11. for nonabelian groups it is a highly interesting notion. This gives us that xa = ax for all x E G. if x and yare in the same right coset of C(a) in G. If a E G. then the number of distinct conjugates of a in G equals the index of C (a) in G.
11 Conjugacy and Sylow's Theorem (Optional) 103 In other words. It is almost a sacred tradition among mathematicians to give.D This last theorem has an interesting application. as the first application of the class equation. which some readers may have seen in solving Problem 45 of Section 5. since IC(b)1 divides IGI by Lagrange's theorem. where 1 ::::. its conjugacy class contains more than one element and IC(b)1 < IGI. z ~ 1. where p is a prime. the center of G. Not wanting to be accused of heresy.11.11. G is the union of the disjoint conjugacy classes. One such consequence is a kind of bookkeeping result. Moreover. is not trivial (i. z is then the number of elements in G whose conjugacy class has only one element. there must be an element a =1= e in Z(G).Sec. Moreover. and divides Ln(b)<npnn(b). If G is a group of order pn. We shall see a few of these here. we get Theorem 2. Since conjugacy is an equivalence relation on G. the number of elements in cl(a) equals iG(C(a» = IGI/IC(a)l· This theorem.4. The net result of this is that p I z. as we pointed out previously. neb) < n. there exists an element a =1= e in G such that ax = xa for all x E G). then IGI = ~ ic(C(a» = ~ I~~)I' where the sum runs over one a from each conjugacy class. z = pn = IGI = Z + b~Z(G) IC(b)1 L lQL = Z + L ~. For any element b outside Z( G). Let IZ(G)I. Since e E Z (G). IC(b)1 = pn(b). we know how many elements there are in each class. p divides the lefthand side. So since z = IZ(G)I. We divide the pieces of the class equation into two parts: that coming from the center. This is . Putting all this information together. pn. a particular theorem about groups of order pn.3 (The Class Equation).e.2.. Proof We shall exploit the class equation to carry out the proof. where p is a prime.h) n(b)<nP = Z + n(b)<n L pn~n(h). We get. we have that z is at least p. which proves the theorem. then Z(G). If G is a finite group. and since z ~ 1. and the rest.11. although it was relatively easy to prove. this way. we follow this tradition and prove the pretty and important Theorem 2. by Theorem 2. Clearly. is very important and has many consequences.
by the Correspondence Theorem (Theorem 2. so C(x) = G. r is a group of order IGI/IAI = pk+l/p = pk by Theorem 2. Thus N is our required normal subgroup in G of order pk. then G contains a normal subgroup of order pnl. x $. p a prime.11. But then we have pkl = IMI = IN/AI = l~i ' that is. we know that r has a normal subgroup M of order pkl. G must be abelian. pkl = INI/p. hence A C C(x) for all x E G. Since r is a homomorphic image of G. Thus the subgroup A = (a) generated by a is of order p and is normal in G. yet IC(x)1 must divide p2.3. Since every element of G is in the center of G.2 Theorem 2. of order p in Z(G). a. so IC(x)1 > p. a Norwegian mathematician.11.7. Without it the subject would not get off the ground. then Proof By Theorem 2. The net result of this is that IC(x)1 = p2.6.11. Consider r = G/A. D By far the most important application we make of the class equation is the proof of a farreaching theorem due to Sylow. Theorem 2. We choose one of a wide possible set of choices to illustrate this technique. such that N/A = M. This completes the induction and so proves the theorem. It is impossible to overstate the importance of Sylow's Theorem in the study of finite groups. where p is a prime. . If G is a group of order p2.4. by Theorem 2.5. the center of G. who proved it in 1871. Since r has order pk. If n = 1. The l1atural attack on virtually all these problems follows the lines of the argumellt we are about to give. Let G be of order pk+l. Suppose that we know that for some k every group of order pk has a normal subgroup of order pkl. so that there is an element. If G is a group of order pn. If A = (a). We already showed this theorem to be true for abelian groups. whence x E Z(G}. Given x E G. D In the problems to come we shall give many applications of the nature of groups of order pn. A.2) there is a normal subgroup N in G. G is abelian.6.l = po = 1.104 Groups Ch. We shall now prove it for any finite group. then C(x) :J A and x E C(x). N :J A.4 there exists an element a of order p in Z( G). leading us to INI = pk. the subgroup generated by a. then A C Z(G).11. then G is of order p and (e) is the required normal subgroup of order p l . Z(G) =1= (e). Proof We proceed by induction on n. where p is a prime.
Then G has a subgroup of order pn. For those who want to see several proofs of that part of Sylow's Theorem which we proved above. By Cauchy's Theorem there is an element a of order p in Z(G). If A is the subgroup generated by a. contradicting our assumption that G had no such subgroup. The other two are (assuming pn m = IG I. we proceed by induction on IG I.11. D Actually. Then. then IAI = p and A <l G. . Q of G. Z(G). where p m): t 1.7 (Sylow's Theorem). An abelian group has one pSylow subgroup for every prime p dividing its order. Since p I pnm . since a E Z(G). they are called pSylow subgroups of G. so p I LaEtZ(C) ic(C(a». Z(G). Since If 1 < IGI.11. where p is a prime and p t m. We therefore assume that n 2:: 1. 3. If z = IZ(G)I. we get p I z. by the Correspondence Theorem there is a subgroup P of G such that P :J A and PIA = M. and Sylow's Theorem is established. If 1 = IGI/IAI = pnmlp = pnlm . Thus p divides IGI/IC(a)1 = ic(C(a» for a $. Z(G). this is trivial. by our induction hypothesis. Ipi = IMIIAI = pnlp = pn and P is the soughtafter subgroup of G of order pn. This completes the induction. then for some x E G. assuming the result to be true for all groups H such that IHI < IGI. if G = S3' the symmetric group of degree 3. For instance. hence pn t IC(a)l. Suppose that the result is false for G. Sylow's Theorem consists of three parts.4. Proof If n = 0. that is. then C(a) =1= G. The number of subgroups of order pn in G is of the form 1 + kp and divides IGI.Sec. pn cannot divide IHI for any subgroup H of Gif H =1= G. Any two subgroups of order pn in G are conjugate. of which we only proved the first. Q = x1px. Write down the class equation for G following the lines of the argument in Theorem 2. Here. Since these subgroups of order pn pop up all over the place. 2. 11 Conjugacy and Sylow's Theorem (Optional) 105 Theorem 2. This is far from true in the general case. and of the other two parts. Therefore. Consider f = GIA. However. if a $. then z 2:: 1 and But P I ic(C(a» if a $. again. there are three 2Sylow subgroups (of order 2) and one 3Sylow subgroup (or order 3). Suppose that G is a group of order pnm . which has order 6 = 2 . In particular. by our induction hypothesis r has a subgroup M of order pnl. they might look at the appropriate section of our book Topics in Algebra. if Ipi IQI = pn for subgroups P.
then N(H) = G. 11.106 Groups Ch.::: p2. If cp is an automorphism of G. show that G has a subgroup of order pm for alII::::. If P <l G. PapSylow subgroup of G. prove that G is abelian. 14. 15. Prove that if G is a finite group and H is a subgroup of G. show that if a1pa = P then a E P. 19. 13. if H <l G.2 PROBLEMS Easier Problems 1. If Pis apSylow subgroup of G. then the number of distinct subgroups x1Hx of G equals iG(N(H». If P is a pSylow subgroup of G. Prove that B(N) <l G. prove that cp(P) = P for every automorphism cp of G. If H is a subgroup of G. Show that a group of order 36 has a normal subgroup of order 3 or 9. If P is a pSylow subgroup of G. If a E G. Prove that N(H) is a subgroup of G. If P is a pSylow subgroup and a EGis of order pm for some m. If IGI = pn. prove that P is a pSylow subgroup of N(P) and is the only pSylow subgroup of N(P). Prove that N(x1Hx) = x1N(H)x. This does not mean that xa = ax whenever x E N(H). yet H need not be in the center of G. 6. Do Problem 1 for G the dihedral group of order 8. 9. . H C N(H) and in fact H <l N(H). show that C(cp(a» = cp(C(a» for a E G. MiddleLevel Problems 16. let B(N) = {x E G I xa = ax for all a EN}. 7. If N <l G. 4. prove that P is the only pSylow subgroup of G. Use the class equation to give a proof of Cauchy's Theorem. If P is a pSylow subgroup of G and P <l G.) 17. n. 3. For instance. let N(H) = {x E G I x1Hx = H}. show that the number of distinct XI Px cannot be a multiple of p. a E H. In S3' the symmetric group of degree 3. show that C(x1ax) = x1C(a)x. 5. 8. 12. 2. Show that a group of order 108 has a normal subgroup of order 9 or 27. show that N(N(P» = N(P). 18. (Hint: See Problem 40 of Section 5. 10. find all the conjugacy classes. If IGI = p3 and IZ(G)I . m ::::. and check the validity of the class equation by determining the orders of the centralizers of the elements of S3 .
define a relation B . b E Z(P) are conjugate in G. Using the result of Problem 25. If H is a subgroup of G of order pm show that H is contained in some pSylow subgroup of G. H a subgroup of G. Let G be a group. For Qb Q2 E S define Ql . then Q = x1px for some x E G. (b) Using the result of Problem 14. Prove that (a) this defines an equivalence relation on G. Suppose that Q =1= x1px for any x E G.) 28.Q2 if Q2 = a1Qla with a E P. that if Q =1= P. show that N(H) ~ H.A for subgroups A. (This is the third part of Sylow's Theorem. show that if G is a finite group.Q2 if Q2 = a1Qla. 26. 22. 25. show that the result of Part (a) cannot hold. 11 Conjugacy and Sylow's Theorem (Optional) 107 20. Prove. If P is a pSylow subgroup of G and a. using this relation. with a E P. 24. show that the number of distinct subgroups equivalent to A equals the index of N(A) n H in H. . let S be the set of all pSylow subgroups of G. show that Ghas a subgroup of order pm.Sec. where a E P. then [a] has m elements where m is the index of H n C(a) in H. Define for a. (a) Show that this implies that the number of distinct ylQy is a multiple ofp. Harder Problems 23. bEG. as y runs over G. 29. then the number of distinct a1Qa. (b) If [a] is the equivalence class of a. (b) If G is finite. If IGI = pn and H in G. prove that they are already conjugate in N(P). Show that any subgroup of order pnl in a group G of order pn is normal 21. and Q another one.b if b = h1ah for some h E H. a . =1= G is a subgroup of G. B of G by the condition that B = h1Ah for some h E H. (This is the second part of Sylow's Theorem. Ifpm divides IGI. If G is a group. show that the number of pSylow subgroups of G is of the form 1 + kp.) 27. (c) Prove from this that given any two pSylow subgroups P and Q of G. (a) Prove that this defines an equivalence relation on the set of subgroups of G. If P is a pSylow subgroup of G. Let S be the set of all ylQy. H a subgroup of G. For Qb Q2 E S define Ql . is a multiple of p. Let P be a pSylow subgroup of G.
If S is infinite. long before the notion of an abstract group was defined. This has the advantage of transforming G as an abstract system into something more concrete.1). asserts that any group G is isomorphic to a subgroup of A (S).3 THE SYMMETRIC GROUP 1. We shall not be concerned with the subgroups of A (S) for an arbitrary set S. namely a set of nice mappings of some set onto itself. and others. In fact. This is much too much to expect. A (S) is a very wild and complicated object.5. Historically. we would know the structure of all groups. results on groups of permutations proved in the late eighteenth and early nineteenth centuries. Since the structure of isomorphic groups is the same. Even if S is finite. groups arose this way first. Galois. the complete nature of A (S) is virtually impossible to determine. the set of 11 mappings of the set S onto itself. Nevertheless. in the proof we gave we used for S the group G itself viewed merely as a set. for some suitable S. known as Cayley's Theorem (Theorem 2. Cayley's Theorem points out a certain universal character for the groups A (S). 108 . Yet it was not until the midnineteenth century that Cayley more or less introduced the abstract concept of a group. one could try to exploit this embedding of an arbitrary group G isomorphically into some A (S). We find in the work of Lagrange. Abel. If we knew the structure of all subgroups of A(S) for any set S. This result. PRELIMINARIES Let us recall a theorem proved in Chapter 2 for abstract groups.
then G is isomorphic to a subgroup of Sn and Sn has n! elements. writing out u. . . the numbers iI. these symbols. we usually say that G is a subgroup of Sn' Since n is so much smaller than n! for n even modestly large. algebraists multiply permutations from left to right. 1 Preliminaries 109 In this chapter we consider only A (S) for S a finite set. X2. we might as well suppose that S = {x}. . 1 ~ ik ~ n. In this symbol the number in the second row is the image under (J' of the number in the first row directly above it. If the readers look at some other book on algebra. we shall denote tI:tem by lowercase Greek letters...The elements of Sn are called permutations. To be consistent with our definition of the composition of elements in Sn. can be discarded. Since we multiplied two elements u. . if j =1= k. an element u in Sn. .. n shuffled about in some order. and denote it by Sn.. we may assume that S = {I. so U(Xk) = Xi k for some ik . ~). then U(Xk) E S for k = 1. is a group relative to the product of mappings. This might be called the graph of u. It would be desirable to embed G in an Sn for n as small as possible. i2. There is nothing holy about 3 in all this. . We certainly can make it more compact by dropping the x's and writing u = (~ . x n}. then we call A (S) the symmetric group of degree n. they should make sure which way the permutations are being multiplied: right to left or left to right. . then Xi) = u(Xj) =1= U(Xk) = Xi k • Therefore. Clearly. r E A(S). 2. r E A(S) by the rule (ur)(s) = u( res»~ this will have the effect that when we introduce the appropriate symbols to represent the elements of Sn.2. Speaking loosely.. Our first need is some handy way of denoting a permutation. in particular. X2 ~ X3' X3 ~ Xl' But this is cumbersome and space consuming. n}. If u. we do it from right to left.. We showed in Section 4 of Chapter 1 that A(S) satisfied four properties that we used later as the model to define the notion of an abstract group. Let S be a finite set having n elements. it works equally well for any n. the action of U on S is determined by what U does to the subscript j of Xj' so the symbol "x" is really excess baggage and. Given the permutation u E Sn = A(S). For certain classes of finite groups this is achievable in a particularly nice way. in are merely the numbers 1. Very often. . will multiply from right to left... By Cayley's Theorem we know that if G is a finite group of order n. Let's recall what is meant by the product of two elements of A (S). as such. Recall that if S has n elements. Because U is 11.Sec. . One clear way is to make a table of what u does to each element of S. or permutations. that is. 2. In short.. say u E S3' in the fashion: u: Xl ~ X2. .. We did this earlier. n. . then we defined ur by (ur)(s) = u(r(s» for every s E S. Thus Sn. our group occupies only a tiny little corner in Sn.
3 .. in the example in S3 cited. . Find the products: (a) ( 6 1 2 3 4 5 63)(21 4 5 2 1 2 3 4 5 6) 3 4 5 6 1" . just flip the symbol for u over to get u. . ••• . This is the image of k under UTe We then run through k = 1.. We illustrate this with two permutations u = (~ ~ i ~ ~) and T= (~ ~ . l2. 0'(2) = i 2 . G~ ~} u 1 = (i ~ ~) = Gi = ~). which we shall write as eis merely e (~ ::: How does the product in Sn translate in terms of these symbols? Since UT means: "First apply T and to the result of this apply 0'. If 0' E Sn and 0'(1) 1 ~ .. U = i}.. we use the symbol = (~ ~ ~) = (~ ~ ~) = (~ ~ ~)... n and get the symbol for UTe We just do this visually.1? It is easy. ~ ~ 0 . 1.. ••• u(n) = in. For instance. as long as we carry the i/s along accordingly. (Prove!) In our example The identity element: ). We then look at the spot ik in the first row of 0' and see what is directly below it in the second row of u. .) in S 5' Then UT = G.. PROBLEMS 1.2. n. the first row is always 1 2 .. and write u = 1 o ( It 2 0 ..).110 The Symmetric Group Ch. 0) Even the economy achieved this way is not enough. ~) to represent 0' and we write 0' = (~ ~ ~)o Note ( it l2 ••• In It l2 In that it is not necessary to write the first row in the usual order 1 2 . In' ThoIS IS ne. After all.1 u = = (i ~ ~). what is u. b ut In t h e next sectIon we "fi 0 " shall find a better and briefer way of representing permutations." in forming the product of the symbols for 0' and T we look at the number k in the first row of T and see what number ik is directly below k in the second row of T. any way we write the first row. we still have u. If we know 0' = (~~ It l2 ••• In ~). so we could dispense with it. n. ••• l2 In n) (" "as ll.
5) C 5) G G (~ 2 3 4 5 3 4 5 6 2 3 4 5 1 3 4 6 5 7· 6 7) 2 3 4 5 4 5 2 1 ~} 3. Find the order of each element in Problem 2.) . (a) (b) (c) 5r C ~} . CYCLE DECOMPOSITION We continue the process of simplifying the notation used to represent a given permutation.. ••• .e. u(ik ) = i b and a(s) = s for any s E S if s is different from i l . the permutation (i l i 2 ) is called a transposition. in S 7. Evaluate all the powers of each permutation a (i.. are said to be disjoint cycles if .Sec. (Prove!) For example. (i k . Definition. 2. The symbol (i l i 2 ••• ik ) will represent the permutation a E Sn.2 ). and so on.l ). where a(i l ) = i 2 . Prove that (~ II 2 i2 Zr (1 = 1 i2 2 ~n) . we get a device to decompose any permutation as a product of particularly nice permutations. ik • Thus. ••• . find a k for all k). a(ij ) = ij + l for j < k.. ik be k distinct integers in S = {I. Note that if a = (i l i2 ••• ik ). a(i 2 ) = i 3 . i 2 . . say a kcycle and an mcycle. then a is also (i k i l i2 ••• ik . we get something more than just a new symbol. n}. (1 3 5 4) = (4 1 3 5) = (5 4 1 3) = (3 5 4 1). Find the order of the products you obtained in Problem 1. 4. ••• . 2 Cycle Decomposition 111 (b) (c) G (~ 2 3 4 1 3 4 5 5)C3 1 3 4 2 2 1 4 2 3 4 2 3 4 2 3 4 1 3252134541325· 2. Two cycles. In doing so. For the special case k = 2. . 5. the permutation (1 3 5 4) is the permutation 1 2 3 4 5 6 7) ( 3 2 5 1 4 6 7· We call a permutation of the form (i l i2 ••• i k ) a kcycle.1 ik i 1 i 2 ••• i k . 2. Let i b i 2 .
while Uk (1) = 1. T(i). 2... 2. Ukl(i l ».. then the orbit of i l under U (see Problem 27 in Section 4 of Chapter 1) is {i l . In fact. Given two disjoint cycles in Sn. we have al(l) = u(2) = 3. we see that u j (l) = j + 1 for j ~ k . k) in Sn. n}. We can get this visually by weaving through ( ~ 394 1 5 6 7 8 9) 562 7 8 T with the thin path.1. Clearly. We have that this orbit is {i. n }. 1 2 3 4 5 6 7 8 9) what is the cycle of T determined by I? We claim that it is (1 3 4).l (i)}. where e is the identity element in Sn. 1. ••• . If u... the reader should verify that (uT)(i) = (Tu)(i) for every i E S = {I. how is 3 related to I? Since u(2) = 3. T are disjoint cycles. we claim that they commute. Consider the scycle (i T(i) T2 (i) .112 The Symmetric Group Ch. If U = (i l i 2 ••• ik) is a kcycle. we see that Uk = e.. Whence (1 3 5) and (4 2 6 7) in S7 are disjoint cycles. There are two things to be concluded from the paragraph above. is k. We state this result as Lemma 3. u(l) = 2 by the definition given above. . We leave the proof of this to the reader. i 2 .. TSl(i». We take a specific example and find all its cycles. Given any permutation Tin Sn.2. Why? T takes 1 into 3. for i E {I. . ik }. What is the cycle of through determined by 2? Weaving ( 12~) 394 1 5 627 8 . then UT = TU. we call it the cycle of Tdetermined by i.. as an element of Sn. T2 (i) . Continuing. T3 (1) = T(4) = 1. .. . 3 into 4 and 4 into 1.. T2 (1) = T(3) = 4. and since T(l) = 3. Let's consider a particular kcycle U = (1 2 .. T E Sn are disjoint cycles. . So we can see that the kcycle U = (i 1 i 2 ••• ik ) is U = (i l u(i l ) u 2 (i l ) .1. Let T = ( 3 9 4 1 5 6 2 7 8. The order of a kcycle. (Prove!) 2. .. with the suggestion that if u. consider the orbit of i under T.3 they have no integer in common. where TS(i) = i and s is the smallest positive integer with this property. Ts .
There is nothing special about the permutation T above that made the argument we gave go through. respectively. 10. 2. Therefore we have that T = (1 3 4)(2 9 8 7) (5)(6). Here. recall that the least common multiple of m and n is the smallest positive integer v which is divisible by m and by n. we see that the cycle of T determined by 2 is (2 9 8 7). that is. in order that T m = e we need (1 2)m = e. (7 8 9)m = e. The cycles of T determined by 5 and 6 are (5) and (6). (7 8 9) commute. to have (3 4 5 6)m = e. 9} appears in one and only one cycle.. Chapter 1. because (3 4 5 6) is of order 4. On the other hand. (See Problem 7. Consider the permutation T = (1 2)(3 4 5 6)(7 8 9) in Sg. and to have (1 2)m = e. again. T is k. T k = e and T j =I= If T in Sn is a kcycle.2. What we did for T works for any permutation. (Prove!) To have (7 8 9)m = e.. since (7 8 9) is of order 3. (5). We leave the formal writing down of the proof to the reader. since 5 and 6 are left fixed by T. Every permutation in Sn is the product of disjoint cycles. writing u as a product of kcycles. . In other words. and (6). The same argument would hold for any permutation in Sn for any n. we must have 4 1 m. we ignore the i's such that u(i) = i. we must have 3 I m. Section 5.) Then we have .3.Sec. we omit all Icycles. the special properties of T do not enter the picture. T m = (1 2)m(3 4 5 6)m(7 8 9)m. then the order of e for 0 < j < k. To formulate this properly. and the image of any i under T is read off from the cycle in which it appears. So T is of order 12. where we view these cyclesas defined aboveas permutations in Sg because every integer in S = {I.2. Thus we write u = (1 2 3)(4 5)(6)(7) simply as u = (1 2 3)(4 5). (3 4 5 6). In writing a permutation u as a product of disjoint cycles. SO the cycles of Tare (1 3 4). with k > 1. 2 Cycle Decomposition 113 with the thin path. and 11. because (1 2) is of order 2. Thus in the group S 11 the permutation T = (1 5 6)(2 3 9 8 7) leaves fixed 4. (2 9 8 7). we assume that u leaves fixed any integer not present in any of the cycles. . Theorem 3.2. Lemma 3. that is. What is its order? Since the disjoint cycles (1 2). (3 4 5 6)m = e. This tells us that m must be divisible by 12. we must have 21 m.
. On the other hand. Ti Tj = Tj Ti. ••• . How many times must we repeat this shuffle to get the cards back to their original order? The answer is merely the order of u. if UN = e.3 Theorem 3..4 in the context of a card shuffle. Consequently. mk' disjoint cycles of length mb m2. So we must compute UT and find its order.•• Tk.. . and then follow it by the shuffle given above. 13. For instance. mk' Then the order of U is the least common Proof Let U = Tl T2 . mk' so MIN.•.. the second in that of the 4th. How many repeats are now needed to get the cards back to their original order? The first operation is the shuffle given by the permutation T = (1 12 11 10 9 8 7 6 5 4 3 2) followed by u above."" mk' then u M = (TIT2 . Suppose that we shuffle a deck of 13 cards in such a way that the top card is put into the position of the 3rd card. so m i I N. . (1 2) and (1 3). TkM = e (since Tt. Suppose that we shuffle the cards as follows. . 13. D Note that the disjointness of the cycles in the theorem is imperative. But UT = (1 X 3 5 7 9 11 12 11 13 2 4 6 8 10 12) (1 10 9 8 7 6 5 4 3 2) 12 13). therefore if M is the least common multiple of ml. where the Ti are disjoint cycles of length mi' Since the Ti are disjoint cycles. which are not disjoint. u. but their product (1 2)(1 3) = (1 3 2) is of order 3. This forces each Tf = e.2. . the order of u is at most M.. since Ti is of order mi' Thus N is divisible by the least common multiple of mb m2' . (prove!) because Ti are disjoint permutations. 2.. m2. working mod 13. that is.t= e because Ti is of order mi and mi I M). the shuffle becomes u = 1 2 3 4 5 6 7 8 9 10 11 ( 3 4 5 6 7 8 9 10 11 12 13 12 13) 1 2' and u is merely the 13cycle (1 3 5 7 9 11 13 2 4 6 8 10 12). Tk)M = T~T~ .. First take the top card and put it into the secondtoIast place. Therefore. . multiple of ml. So it takes 13 repeats of the shuffle to get the cards back to their originalorder. the ith into the i + 2 position. are each of order 2. As a permutation. Let's consider Theorem 3.•.4. = (1)(2 3 4 5 6 7 8 9 10 11 . we see that u is of order M as claimed in the theorem. then Tf T!j •• • TZ = e.. Let's give a twist to the shuffle above. . of 1.2. so u is of order 13. Let U E Sn have its cycle decomposition into . m2.114 The Symmetric Group Ch.
the order of the permutation is visible at a glance. and what would this number be? We return to the general discussion.. ik) (i 1 ik)(i 1 ik. To illustrate this point. 3. so not in a unique way. a motion of a geometric figure is a permutation of its vertices that can be realized by a rigid motion in space. 2 Cycle Decomposition 115 so is of order 12. many things become transparentfor example. Informally. 4 as below: 4 3 2 . after all. So it would take 12 repeats of the shuffle to get back to the original order. and in at least two distinct ways. Can you find a shuffle of the 13 cards that would require 42 repeats? Or 20 repeats? What shuffle would require the greatest number of repeats. there are eight motions of a square. 2.. as we shall see in Section 3. So two things are evident. This theorem is really not surprising for it says. We saw that there is a lack of uniqueness in representing a given permutation as a product of transpositions. As a final word of this section. some aspects of this decomposition are indeed unique. which are in fact permutation groups that have already appeared in Chapter 2 under different guises. Given the kcycle (i 1 i2 . then (i 1 i2 . But.2.. We can also see that (1 2 3) = (2 3)(1 3).1)··· (i 1 i2).Sec. tions. ik). When we represent elements of a permutation group as products of disjoint cycles. Because every permutation is the product of disjoint cycles and every cycle is a product of transpositions we have Theorem 3. whose vertices are numbered 1. First. For example. we see that (1 2 3) = (1 3)(1 2). Consider the permutation (1 2 3). so every kcycle is a product of k .1 transpositions (if k > 1) and this can be done in several ways. nothing more or less than that any permutation can be effected by carrying out a series of interchanges of two objects at a time.5. we now give a few examples of certain geometric groups. we would like to point out the convenience of cycle notation. Examples Every permutation in Sn is the product of transposi 1. we can write (1 2 3) as the product of two transpositions..
of course a 2 = ~4 = (1) is the "motion" that leaves the vertices unchanged. We also have the relation ~a These motions. This subgroup is often called Klein's 4group. This group (or.1 without mention of permutations. or the dihedral group of order 8. We leave it to the reader to verify that the group of all motions of an equilateral triangle is the full symmetric group S3. corresponding to a reflection . strictly speaking._  3 1 2 the reflections in the two axes of symmetry. 3. (14)(23). ~2 = ~3 = (1432) is the counterclockwise rotation by 270°. and. (12)(34).3 a = (13) is the reflection about the axis of symmetry joining vertices 2 and 4 in the original position. generated by the permutations a = (15)(24).116 The Symmetric Group Ch. or symmetries of a square. (24) is the reflection in the other diagonal axis. a~ = (12)(34) is the reflection in the vertical axis of symmetry. a group isomorphic to it) was introduced in Example 9 of Section 2. These motions can be identified with permutations (1). 2. (13)(24). The motions of a regular hexagon form the dihedral group of order 12. ~ = (1234) is the counterclockwise rotation by 90°. form a subgroup of S4 which is called the oetie group. (13)(24) is the counterclockwise rotation by 180°. and form a subgroup of the group obtained in Example 1. = a~3. 3 2 4. rotation by 180° and the identity. There are only four symmetries of a nonsquare rectangle: 4 1 u _ i. a~2 = a~3 = (14)(23) is the reflection in the horizontal axis.
2. Give a complete proof of Theorem 3. can be interpreted as the group of symmetries of a regular ngon (a polygon with n edges of equal length). 2 3 4 5)3. then UT = TU. (f) (1 . (e) (1 2 3)(3 5 7 9)(1 2 3)1. (d) (1 2 3)(1 3 2).2. 5. and J3 counterclockwise rotation by 60°. Express as the product of disjoint cycles and find the order. which was first introduced in Example 10 of Section 2. 6. (a) (1 2 3 5 7)(2 4 7 6). (a) 1 2 3 4 5 6 7 8 9) ( 3 1 4 2 7 6 9 8 5' 1 2 3 4 5 6 7) (b) ( 7 6 5 4 3 2 l ' (c) G~ . (b) (1 2)(1 3)(1 4). Show that a kcycle has order k. 2. 4. T are two disjoint cycles. 2 Cycle Decomposition 117 about one of the axes of symmetry.1. Show that if u. Find the cycle decomposition and order.Sec. Find a shuffle of a deck of 13 cards that requires 42 repeats to return the cards to their original order. corresponding to the 4 6 3 2 In general. (c) (1 2 3 4 5)(1 2 3 4 6)(1 2 3 4 7). ~ ~ ~ i) G 2 3 4 5 6 7) 3 1 5 6 7 4' 3. 5 = (123456). the dihedral group of order 2n. PROBLEMS Easier Problems 1.
19. mk' (We say that they have similar decompositions into disjoint cycles. 16. . Tof Sn.1 = (1 2 3).I is a transposition if T is a transposition. m2.3 7. 12.. Prove that there is no permutation u such that u(1 2 3)u. MiddleLevel Problems 13. Show that if T is a kcycle.. T are two permutations that disturb no common element and UT = e. If u. 17. then TI T2T3 =1= e. Find the conjugacy class in Sn of (1 2 . Prove that there is no permutation U such that u(1 2)u. If T}.I for any permutations u. n). Find an algorithm for finding UTU. 20. T2 are distinct transpositions. find a permutation 0such that u(1 2)u. 8.1 (1 2 4)(5 6 7). Prove that if TI' T2. and T3 are transpositions. 15.. Prove that there is a permutation u such that u(1 2 3)u. prove that U = T = e. T = pUpI. 9. UTU.118 The Symmetric Group Ch. Given the two transpositions (1 2) and (1 3). If TI and T2 are two transpositions...I is also a kcycle. 24.1 (4 5 6). 14. Let <I> be an automorphism of S3' Show that there is an element U E S3 such that <1>( T) = UITU for every T E S3. 21. . Do Problem 24 for U = (1 2)(3 4). Let (1 2) and (1 2 3 . 11. Prove that (1 2) cannot be written as the product of disjoint 3cycles. the identity element of Sn.1 = (1 3).. n) in Sn? 25.. 23.. n) be in Sn' Show that any subgroup of Sn that contains both of these must be all of Sn (so these two permutations generate Sn)' 18. then UTU. . 22. show that TIT2 can be expressed as the product of 3cycles (not necessarily disjoint).) Prove that for some permutation p. T be two permutations such that they both have decompositions into disjoint cycles of cycles of lengths ml. show that TI T2 is of order 2 or 3. for any permutation u. Do Problem 6 for a shuffle requiring 20 repeats. 10. Prove that for any permutation u. Express the permutations in Problem 3 as the product of transpositions. Let u. What is the order of the centralizer of (1 2 .
this decomposition is not unique. however.XT(2» (XT(I) . Let f(Xb X2' X3) = (Xl . Similarly. that certain aspects of this kind of decomposition are unique. What does .Xl) (Xl .Xl) (X 2 . So u* coming from 0' = (1 2) changes the sign of f(x).X3) (Xl . 2 ..f(x)..Sec.X2) (Xl . so T* coming from.. Then T*(f(x» = (XT(I) .2) * (f (x» = (X T2(1) = f(x).. = (1 2 3) leaves f(x) unchanged. do to f(x)? Since .... since . and 0'(3) = 3. ODD AND EVEN PERMUTATIONS Odd and Even Permutations 119 We noticed in Section 2 that although every permutation is the product of transpositions. (1 3) changes the sign off(x). we immediately see that (...(Xl .X2)(XI . What about the other permutations in S3.0'(2) = 1.XT 2(2) 2(3» Now consider U'T = (1 2)(1 2 3) = (2 3). the identity element e induces a map e* on f(x) which does not change f(x) at all.. We go into this now. . Let's look at the action of another element.. We let S3 act on f(x) = f(Xb X2.X3) (X2 .2. U7'must change the sign off(x).X2) (Xl . for. = (1 2 3). .*f(x) = f(x). We did comment.X3) be an expression in the three variables Xb X2. so that O'*(f(x» = (Xu(l) . above.X3)(X2 .X3) f(x). leaves f(x) alone and 0' changes the sign off(x).. We have accounted for the action of every element of S3 onf(x).X3) . If 0' E S3' then We consider what 0'* does to f(x) for a few of the o"S in S3. X3. how do they affectf(x)? Of course. X3) as follows.XI )(X3 .. Let's consider the special case of S3' for here we can see everything explicitly. Consider 0' = (1 2). Then 0'(1) = 2.X3) .X7{3» = = = (x 2 .. of S3 on f(x).Xu(3» (x 2 . 3 3.XT (XT (1) 2(2» (Prove!)  2 XT (3» (XT .X3) (X 2 .Xu(2» = (Xu(l)  Xu(3»(Xu(2) .XT(3»(X7{2) .X3)(X2 .
we get an odd number of changes of sign on f(x) when acted on by T*. . This happens for (xu .xv). . and j all those from 2 to n inclusive.1) + 1 such. then k must be odd.. Therefore. an odd number of them. Thus T*(f(x» = f(x). So p*(f(x» = (1)kf(x). define u* on f(x) by u* (f (x» = If u. Therefore. This tells us that t and k have the same parity. our claim that T*(f(x» = f(x) for every transposition T is substantiated. whence (1)t = (1)k. What does a transposition T do to f(x)? We claim that T*(f(x» = f(x).120 The Symmetric Group Ch.. for (Xi . which get transformed into an (x a .i .X »)) = U*(T*(f(X))) = (U*T*)(f(X» j SO (UT)* = U*T* when applied to f(x). and finally. p*(f(x» = (1)tf(x). then p acting on f(x) will change the sign of f(x) k times. we count up the number of (xu ..Xv) if i < v < j..Xj) if i < u < j. Ut are transpositions. since each Ti changes the sign off(x). for (Xi . We strive for this goal now..1 inclusive. (1)kf(x) = (1)tf(x). that is. If U E Sn. by the same reasoning. if t is odd. . As we did above. To prove this. . If p = UI U2··· Ut . x n ) to be where in this product i takes on all values from 1 to n . define f(x) = f(x}. that is. where Ub . suggesting to readers that they carry out the argument that we do for arbitrary n for the special case n = 4.Xj). assuming that T = (i j) where i < j.. T 1<] f1 (xu(i)  xU(j»· E Sn' then = u* (T* (11 (Xi . Tk. and if t is even. This suggests that although the decomposition of a given permutation U as a product of transposition is not unique. . Each of these leads to a change of sign on f(x) and since there are 2(j .Xb) with a > b.3 Suppose that p E S3 is a product p = T(T2··· Tk of transpositions Tb . with u < v. then k must be even. the parity of the number of transpositions in such a decomposition of U might be unique.
and is an even permutation if 0' is the product of an even number of transpositions. Tk)* = TiT~ . We can show that An is a subgroup of Sn in another way. Since An is thus a finite closed subset of the (finite) group Sn. we see that O'*(f(x» = (l)kf(x). For suppose that 0" E An and p E Sn. Why? Well. then larly.Sec.. but cannot be both. Simiwhere are transpositions. so they are of equal parity. What we have proved above is Theorem 3. if 0' = Tt'T2· .. we see that the parity of 0' and 0'1 is (l)k.3. then 0'* = (T1T2 . then since Til = Ti .3.. and since each T~(f(x» = f(x). We already saw that An is closed under the product of Sn. So these two decompositions of O'as the product of transpositions are of the same parity.1. The permutation 0' E Sn is an odd permutation if 0' is the product of an odd number of transpositions. A permutation in Sn is either an odd or an even permutation. For any permutation 0' we claim that 0' and 0'1 are of the same parity. Therefore.3. 3 Odd and Even Permutations 121 If 0' is any permutation in Sn and 0' = T1 T2 . .. Tk as acting on f(x).. Tk are transpositions. An is a subgroup of Sn.. This suggests the following '1'2··· 't.10"p? By the . Tb where Tb T2' . This certainly shows that 0' E An forces 0"1 E An. T E An. But it shows a little more.1 behind us we can deduce a number of its consequences.. so to know that An is a subgroup of Sn we merely need show that 0' E Sn implies that 0'1 E Sn. then we immediately have that O'T E An. 'b '2' . Let An be the set of all even permutations. and no product of an even number of transpositions can equal a product of an odd number of transpositions. An is called the alternating group of degree n. namely that An is a normal subgroup of Sn.. if 0'.. we conclude that (l)k = (l)t. Thus any permutation is either the product of an odd number of transpositions or the product of an even number of transpositions. if 0' = O'*(f(x» = (l)tf(x).. by Lemma 2. Tk' where the Ti are transpositions. ' 't Definition.2. With Theorem 3.. whence An is a subgroup of Sn. Comparing these two evaluations of O'*(f(x». What is the parity of p.
I} of order 2 under multiplication. which. shortening it to get a contradiction. permutations. From the very definitions involved we have the following simple rules for the product of permutations: 1. and if IT is an odd permutation. let ()( IT) = 1. Other proofs use other devices. 3. the group An' for n 2:: 5. An is a normal subgroup of Sn of order !n!. For n > 1. N. is a normal We look at this in yet another way. The product of an even permutation by an odd one (or of an odd one by an even one) is odd. For n > 1. What is the kernel. which is why we used it." where one tries to show that e cannot be written as the product of an odd number of transpositions by assuming that it is such a shortest product.2. The foregoing rules about products translate into ()(lTr) = ()(IT)()(r).3. Thus 2 = lEI = ISn/Anl = ISnl/IAnl. Thus An is a normal subgroup of Sn.3. we do not particularly like any of them. Sn contains !n! even permutations and !n! odd A final few words about the proof of Theorem 3. This gives us that IAnl = ! ISnl = !n!. The product of two odd permutations is even. 2. in some sense. However.3.3.3 above.3. An' the alternating group of degree n. if n > 1.122 The Symmetric Group Ch. let () (IT) = 1. Quite frankly.1 before we close this section.1 are known. and by the appropriate finagling with this product. of ()? By the very definition of An we see that N = An. Theorem 3. is an extremely interesting group. Many different proofs of Theorem 3. Some involve what might be called a "collection process. E ~ Sn/An. is extraneous to the whole affair. p and pl are of the same parity and IT is an even permutation so p1lTp is an even permutation. Therefore. Corollary. We . so () is a homomorphism of Sn onto the group E = {I. Finally. hence is in An. The product of two even permutations is even. We summarize what we have done in Theorem 3. The proof we gave exploits the gimmick of the function [(x). subgroup of Sn. So by the First Homomorphism Theorem. the proof given is probably the most transparent of them all. If IT is an even permutation.
(a) (1 2 3 4 5 6 7 8 9) 245 1 3 7 8 96· 2. .. If n 2:: 3. . and is an even permutation if k is odd. show that (]" is an odd permutation if k is even. are (e) and An itself. Prove that the parity of a permutation in Sm. PROBLEMS Easier Problems 1. m and u leaves j > m fixed. does not change.Sec. show that every element in An is a product of 3cycles. . .000 printed pages. If (]" is a kcycle.. The determination of these simple groups runs about 10. T E Sn. 4.. where the images of 5 and 4 have been lost. (b) (1 2 3 4 5 6)(7 8 9). The An for n 2:: 5 provide us with an infinite family of nonabelian finite simple groups. we can consider Sm C Sn by viewing u E Sn as acting on 1. n as it did on 1. is an even permutation. Show that every element in An is a product of ncycles. If m < n. . 3 Odd and Even Permutations 123 shall show in Chapter 6 that the only normal subgroups of An' for n 2:: 5. Find the parity of each permutation. Suppose you are told that the permutation 1 2 3 4 5 6 7 8 9) ( 312 7 896 in Sg. 3. 5.1(]"T. Prove that (]" and T. 7.. Find a normal subgroup in A 4 of order 4. m. for any u. any nonabelian finite simple group must have even order. A group with this property is called a simple group (not to be confused with an easy group). (c) (1 2 3 4 5 6)(1 2 3 4 5 7)... In the last 20 years or so the heroic efforts of algebraists have determined all finite simple groups. are of the same parity. (d) (1 2)(1 2 3)( 4 5)(5 6 8)(1 7 9). . Interestingly enough. . 2. There are other infinite families of finite simple groups. 8. 2. The abelian finite simple groups are merely the groups of prime order. What must the images of 5 and 4 be? MiddleLevel Problems 6. when viewed this way as an element of Sn.
If n 2:: 5 and (e) =1= N C An is a normal subgroup of An' show that N must contain a 3cycle. Using the result of Problem 9. very hard) 9. 10.) . show that if n 2:: 5. (Thus the groups An for n 2:: 5 give us an infinite family of nonabelian finite simple groups.3 Harder Problems (In fact. the only normal subgroups of An are (e) and An itself.124 The Symmetric Group Ch.
We were used to systems where you could both add and multiply elements and where the elements did satisfy the commutative law of multiplication ab = ba. As we proceed and see examples of rings. Naturally enough. in fact. Furthermore. the first thing we should do is to define that which we'll be talking about. we shall see some of these things occur. and for some. rational. complex. In many ways this system will be more reminiscent of what we had previously known than were groups. which plays a central role in the algebra of today. Although many of the formal rules of arithmetic hold. it ran somewhat counter to our prior experience in algebra. we have been introduced to one kind of abstract system. usually have little to do with these familiar ones. and.4 RING THEORY 1. Because a group is an algebraic system with only one operation. 125 . For one thing rings will be endowed with addition and multiplication. With this preamble over we are ready to begin. these systems of our acquaintance usually came from sets of numbersintegers. DEFINITIONS AND EXAMPLES So far in our study of abstract algebra. That was the notion of a group. real. and because a group need not satisfy the rule ab = ba. On the other hand. The next algebraic object we shall consider is a ring. and these will be subjected to many of the familiar rules we all know from arithmetic. many strangeor what may seem as strangephenomena do take place. rings need not come from our usuall1umber systems.
There exists an element 0 E R such that a + 0 = a for every a E R. We interweave them by the two distributive laws (h) a·(b+c)=a·b+a·c and (b + c) . c E R. b. that is. b E R implies that a + b E R. We look at some of the familiar rules that are not insisted upon for a general ring. b E R. for a. Many of the examples we shall encounter will have such an element. Nonassociative rings do exist. and in that case we say that R is a ring with unit.) Note that so far all we have said is that R is an abelian group under +. a = a for every a E R. In all fairness we should point out that many algebraists do demand that a ring have a unit element. This is all that we insist on as far as the multiplication by itself is concerned. . c E R. c) = (a· b) . a. a = b . b.126 Ring Theory Ch. the ring consisting of 0 alone is not a ring with unit. there exists abE R such that a + b = O." Although Axioms (a) to (h) are familiar. b. b E R. Given a E R.4 Definition. These axioms for a ring look familiar. (We shall write b as a. c for a. for the concept of ring was introduced as a generalization of what happens in the integers. We now spell out the rules for the multiplication in R. the associative law of multiplication. But the + and· are not allowed to live in solitary splendor. (a + b) + c = a + (b + c) for a. there are certain things they do not say. First. So whenever we use the word "ring" we shall always mean "associative ring. we do not postulate the existence of an element 1 E R such that a . A nonempty set R is said to be a ring if in R there are two operations + and· such that: (a) (b) (c) (d) (e) a. a + b = b + a for a. a + c . They should be. the rings we defined are usually called associative rings. (g) a· (b . We do insist that 1 =1= 0. 1 = 1 . and some of these play an important role in mathematics. c E R. (f) a. But they shall not be our concern here. Because of Axiom (g). b E R implies that a .
Sec. 1
Definitions and Examples
127
Second, in our previous experience with things of this sort, whenever a . b = 0 we concluded that a = 0 or b = O. This need not be true, in general, in a ring. When it does hold, the ring is kind of nice and is given a special name; it is called a domain. Third, nothing is said in the axioms for a ring that will imply the commutative law of multiplication a . b = b . a. There are noncommutative rings where this law does not hold; we shall see some soon. Our main concern in this chapter will be with commutative rings, but for many of the early results the commutativity of the ring studied will not be assumed. As we mentioned above, some things make certain rings nicer than others, and so become worthy of having a special name. We quickly give a list of definitions for some of these nicer rings.
Definition. A commutative ring R is an integral domain if a . b R implies that a = 0 or b = O.
= 0 in
It should be pointed out that some algebra books insist that an integral domain contain a unit element. In reading another book, the reader should check if this is the case there. The integers, 7L, give us an obvious example of an integral domain. We shall see other, somewhat less obvious ones.
Definition. A ring R with unit is said to be a division ring if for every a 0 in R there is an element b E R (usually written as aI) such that a . aI = aI. a = 1.
*
The reason for calling such a ring a division ring is quite clear, for we can divide (at least keeping left and right sides in mind). Although noncommutative division rings exist with fair frequency and do play an important role in noncommutative algebra, they are fairly complicated and we shall give only one example of these. This division ring is the great classic one introduced by Hamilton in 1843 and is known as the ring of quaternions. (See Example 13 below.) Finally, we come to perhaps the nicest example of a class of rings, the field.
Definition.
ring.
A ring R is said to be a field if R is a commutative division
In other words, a field is a commutative ring in which we can divide freely by nonzero elements. Otherwise put, R is a field if the nonzero elements of R form an abelian group under· , the product in R.
128
Ring Theory
Ch.4
For fields we do have some ready examples: the rational numbers, the real numbers, the complex numbers. But we shall see many more, perhaps less familiar, examples. Chapter 5 will be devoted to the study of fields. We spend the rest of the time in this section looking at some examples of rings. We shall drop the· for the product and shall write a · b simply as abo
Examples
1. It is obvious which ring we should pick as our first example, namely 7L, the ring of integers under the usual addition and multiplication of integers. Naturally enough, 7L is an example of an integral domain. 2. The second example is equally obvious as a choice. Let Q be the set of all rational numbers. As we all know, Q satisfies all the rules needed for a field, so Q is a field. 3. The real numbers, IR, also give us an example of a field. 4. The complex numbers, C, form a field. Note that Q eIRe C; we describe this by saying that Q is a subfield of IR (and of C) and IR is a subfield of C. 5. Let R = 7L 6 , the integers mod 6, with the addition and the multiplication defined by [a] + [b] = [a + b] and [a] [b] = [ab]. Note that [0] is the 0 required by our axioms for a ring, and [1] is the unit element of R. Note, however, that 7L 6 is not an integral domain, for [2][3] = [6] = [0], yet [2] =1= [0] and [3] =1= [0]. R is a commutative ring with unit. This example suggests the Definition. An element a for some b =1= 0 in R.
=1=
0 in a ring R is a zerodivisor in R if ab
=
0
We should really call what we defined a left zerodivisor; however, since we shall mainly talk about commutative rings, we shall not need any leftright distinction for zerodivisors. Note that both [2] and [3] in 7L 6 are zerodivisors. An integral domain is, of course, a commutative ring without zerodivisors. 6. Let R = 7L s , the ring of integers mod 5. R is, of course, a commutative ring with unit. But it is more; in fact, it is a field. Its nonzero elements are [1], [2], [3], [4] and we note that [2][3] = [6] = [1], and [1] and [4] are their own inverses. So every nonzero element in 7L s has an inverse in 7L s . We generalize this to any prime p.
Sec. 1
Definitions and Examples
129
7. Let 7L p be the integers mod p, where p is a prime. Again 7L p is clearly a commutative ring with 1. We claim that 7L p is a field. To see this, note that if [a] =1= [0], then p (a. Therefore, by Fermat's Theorem (Corollary to Theorem 2.4.8), aP 1 == l(p). For the classes [.] this says that [a P 1] = [1]. But [a P 1] = [alPI, so [a]pl = [1]; therefore, [a]p2 is the required inverse for [a] in 7L p, hence 7L p is a field. Because 7L p has only a finite number of elements, it is called a finite field. Later we shall construct finite fields different from the 7L p 's. 8. Let Q be the rational numbers; if a E Q, we can write a = mIn, where m and n are relatively prime integers. Call this the reduced form for a. Let R be the set of all a E Q in whose reduced form the denominator is odd. Under the usual addition and multiplication in Q the set R forms a ring. It is an integral domain with unit but is not a field, for ~, the needed inverse of 2, is not in R. Exactly which elements in R do have their inverses in R? 9. Let R be the set of all a E Q in whose reduced form the denominator is not divisible by a fixed prime p. As in (8), R is a ring under the usual addition and multiplication in Q, is an integral domain but is not a field. What elements of R have their inverses in R? Both Examples 8 and 9 are subrings of Q in the following sense.
Definition. If R is a ring, then a subring of R is a subset S of R which is a ring if the operations ab and a + b are just the operations of R applied to the elements a, b E S.
For S to be a subring, it is necessary and sufficient that S be nonempty and that ab, a ± b E S for all a, b E S. (Prove!) We give one further commutative example. This one comes from the calculus. 10. Let R be the set of all realvalued continuous functions on the closed unit interval [0, 1]. For f, g E R and x E [0,1] define (f + g)(x) = f(x) + g(x), and (f· g )(x) = f (x)g (x). From the results in the calculus, f + g and f· g are again continuous functions on [0, 1]. With these operations R is a commutative ring. It is not an integral domain. For instance, if f(x) = x + ~ for o : : ; x ::::; ~ and f(x) = 0 for ~ < x ::::; 1, and if g(x) = 0 for 0 ::::; x ::::; ~ and g(x) = 2x  1 for ~ < x::::; 1, thenf, g E R and, as is easy to verifY,f' g = O. It does have a unit element, namely the function e defined by e (x) = 1 for all x E [0, 1]. What elements of R have their inverses in R?
130
Ring Theory
Ch.4
We should now like to see some noncommutative examples. These are not so easy to come by, although noncommutative rings exist in abundance, because we are not assuming any knowledge of linear algebra on the reader's part. The easiest and most natural first source of such examples is the set of matrices over a field. So, in our first noncommutative example, we shall really create the 2 X 2 matrices with real entries. 11. Let F be the field of real numbers and let R be the set of all formal square arrays
where a, b, c, d are any real numbers. For such square arrays we define addition in a natural way by defining
It is easy to see that R forms an abelian group under this + with
ing as the zero element and
(~ ~) act
(a b)
_
C _
U
d the negative of c d . To make of
(a b)
R a ring, we need a multiplication. We define one in what may seem a highly unnatural way via
b)(r + bt + bU). (a d t s) = (ar + dt as + du c cr cs
It may be a little laborious, but one can check that with these operations R is a noncommutative ring with Note that
(~ ~)
acting as its multiplicative unit element.
(~ ~)(~ ~) = (~ ~)
while
so
Sec. 1
Definitions and Examples
131
Note that
(~ ~) and (~ ~) are zerodivisors; in fact,
(~ ~r (~ ~)
=
so
is a nonzero element whose square is the 0 element of R. This R is known as the ring of all 2 X 2 matrices over F, the real field. For those unfamiliar with these matrices, and who see no sense in the product defined for them, let's look at how we do compute the product. To get the top left entry in the product AB, we "multiply" the first row of A by the first column of B, where A, B E R. For the top right entry, it is the first row of A versus the second column of B. The bottom left entry comes from the second row of A versus the first column of B, and finally, the bottom right entry is the second column of A versus the second column of B. We illustrate with an example: Let
A = (
3 2
1
~)
and
Then the first row of A is 1, ~ and the first column of B is ~, 71"; we "multiply" these via 1 . ~ + ~ . 71" = 71"/2 + ~, and so on. So we see that
AB = (
~ + 71"/2 1 + 271"
~
~  71"/2)
 271" .
In the problems we shall have many matrix multiplications, so that the reader can acquire some familiarity with this strange but important example. 12. Let R be any ring and let
s=
{(:
~) a, b, dE R}
I
C,
with + and· as defined in Example 11. One can verify that S is a ring, also, under these operations. It is called the ring of2 X 2 matrices over R. Our final example is one of the great classical examples, the real quaternions, introduced by Hamilton (as a noncommutative parallel to the complex numbers).
13. The quaternions. Let F be the field of real numbers and consider the set of all formal symbols ao + ali + a2j + a3 k , where ao, ab a2' a3 E F. Equality and addition of these symbols are easy, via the obvious route
1843. We now come to the tricky part. We can write out the product now of any two quaternions. We are multiplying out formally using the distributive laws and using the product rules for the i. ki = j and ji = . declaring by definition that (ao + ali + a2j + a3 k )(f30 + f31 i + f32j + f33 k ) = 1'0 + 1'1 i + 1'2j + 1'3 k . and (ao + ali + a2j + a3 k ) + (f30 + f31 i + f32j + f33 k ) = (a o + f30) + (a 1 + f31) i + (a2 + f32) j + (a3 + f33)k. thus 0 + Oi + OJ + Ok will be written simply as 0. o + 3i + 4j + Ok as 3i + 4j. The product is based on i 2 = j2 = k 2 = 1.a3f32 + a2f30 + a3f31 + a1f32 . When Hamilton discovered it on October 6. j.a1f33 1'3 = aof33 It looks horrendous. the multiplication. where 1'1 = aof31 + a1 f30 + a2f33 . k above.1 + Oi + OJ + Ok as 1. jk = i. kj = . and so on. ik = .j. A calculation reveals that (II) = a6 + ai + a~ + a~. and going around counterclockwise we get the negatives. . doesn't it? But it's not as bad as all that.i. If we go around the circle clockwise the product of any two successive ones is the next one. he cut the basic rules of this product out with his penknife on Brougham Bridge in Dublin.a2f31 + a3f30 (I) 1'2 = aof32 .4 = f30 ao + ali + a2j + a3 k + f31 i + f32j + f33 k if and only if ao = f3o.k. a 1 = f3l' a2 = f32 and a 3 = f3 3 . according to the rules above. ij = k. If some ai is 0 in x = ao + ali + a2j + a3k. we shall omit it in expressing x.132 Ring Theory Ch.
Then. then x has an inverse in the quaternions. it expresses the product of two sums of four squares again as a sum of four squares.lil . 1'3 are as in (I). Do Problem 4 for Example 9. the 2 X 2 matrices over the reals.Sec. Show that any field is an integral domain. 3. 6. for suppose that x = ao + ali + =1= 0 (so some ai =1= 0). the quaternions above (or some piece of them) are often the only noncommutative division rings that even many professional mathematicians have eVer seen. as we mentioned earlier.. . Its verification will be one of the exercises. 4. check the associative law of multiplication. In Example 11. PROBLEMS Easier Problems * 1. 1 Definitions and Examples 133 a2j + a3 k This has a very important consequence. This way the reader will be able to acquire some skill with playing with noncommutative rings.e. Although. Thus the quaternions form a noncommutative division ring. Find all the elements in E24 that are invertible (i. 1'1. So.Ii] . One final comment in this section: If 1'0. then (a~ + ai + a~ + a~)(f3~ + f3i + f3~ + f3~) = I'~ (III) + I'i + I'~ + I'~. Then from (II) we easily get o ( ao + all. 1'2. if x =1= 0.Ii k ) 1 =. Find all its invertible elements. 5. have a multiplicative inverse) in E24 • 2. there is no lack of noncommutative division rings. We shall have many problemssome easy and some quite a bit harderabout the two examples: the 2 X 2 matrices and the quaternions. since the a's are real. This is known as Lagrange's Identity. + a3 k)(a al· a2· a3 Ii . f3 = a~ + ai + a~ + a~ =1= o. Show that En is a field if and only if n is a prime. Verify that Example 8 is a ring. + a2].
(b) (1 . . S the ring of 2 X 2 matrices over R.] 12. ~)(~ ~) . (d) i(ao + ali + a2j + a3k ) .j).2k)(1 + 2i . Find all 2 X 2 matrices (: 10. E R} is a subring of S.bi.(~ ~)(: ~). Find all matrices (: 9. Let F: C ~ C be defined by F(a + bi) (a) F(xy) = F(x)F(y) for x.4j + 6k). Work out the following: (a) ( 4 (b) 1 7 2)(1~) 0 1· 1)2 1 (1 1 .(aD + ali + a2j + a3k )i. 13.) (b) Show that {( (c) Show that ~ :) I a. In that case write down 11. (c) (2i .bd)2 + (ad + be)2. b. a . (See Example 12. [Note: F(x) is merely x. C. (b) F(xx) = Ix1 2 • (c) Using Parts (a) and (b). (a) (i + j)(i . 1 1)3 (g g . (c) (d) (: 8. (~ ~ (~ :) has an inverse in S if and only if a and inverses in R. Verify the identity in Part (e) of Problem 11 directly. ~) such that (: ~)(~ ~) = (~ ~)(: ~)~ ) that commute with all 2 X 2 matrices. Show that: (a 2 + b2)(e2 + d 2) = (ae . show that = r l C have explicitly.3j + 4k)2.) (a) Check the associative law of multiplication in S. (Remember: R need not be commutative. Let R be any ring with unit.4 7.i + 2j . Find the following products of quaternions . y E C.134 Ring Theory Ch.
19. + ali == a~ + a 2j + a3 k )(aO + ai + a~ + a~. In the quaternions. 1 Definitions and Examples 135 14. 21. G {+l . Show that the only quaternions commuting with i are of the form a 15. In the quaternions. MiddleLevel Problems 18. . (b) (x + y)* == x* + y*. in the quaternions. == 1 in the 20. Verify Lagrange's Identity by a direct calculation.Sec. [Note the reversal of order in Part (d). . Show that Ixyl == Ixllyl for any two quaternions x and y. Give an example. Show that a division ring is a domain. Define the map * in the quaternions by Show that: (a) x** == (x*)* == x. +k} . +. of a noncommutative domain that is not a division ring. Find the quaternions that commute with both i and j. (a) Prove that G is a group (under multiplication). .ali . Verify that (ao + f3i. 22. 16. (b) List all subgroups of G.] 24.l . 23.a2j . (c) xx* == x*x is real and nonnegative. Using *. consider the following set G having eight elements: +. Show that there is an infinite number of solutions to x 2 quaternions. j. (d) (xy)* == y*x*. define Show that Ixy I == Ix II y I for any two quaternions x and y.a3 k ) 17. by using Parts (c) and (d) of Problem 23. define Ixl == Vxx*. (c) What is the center of G? (d) Show that G is a nonabelian group all of whose subgroups are normal.
33. a3 E F and where equality. Use the result of Problem 24 to prove Lagrange's Identity. (b) Find the order of G. Show that for x. show that ( ~ ~) is invertible in R if and only if ad (~ ~) 1. If ( ~ ~) E R. 26. be *. show that S = {f E R Ifis differentiable on (0. 32. then x is a zerodivisor in R. In R. For x. . 29. and. show that { ( _ =1= °is ~ : ) I a. For R as in Example 10. ~) = ad  be. If x E R is a zerodivisor. Define det(: (det x)(det y). Harder Problems 31. det(xy) = det(x) det(y). the set of all aa + ali + a2j + a3k. In Problems 26 to 30. G. 35. let R be the 2 X 2 matrices over the reals. Let T be the group of matrices A with entries in the field 7L 2 such that det A is not equal to 0. let H(F) be the ring of quaternions over F. Show that if det(: ~) = ad  be *.0.O. p a prime. and multiplication are defined as for the real quaternions. I)} is a subring of R which is not an integral domain. ab a2. b real} is a field. that is. where aa. if x such that det x = 0. Show that {x E R I det x =1= o} forms a group. under matrix multiplication and that N = {x E R I det x = I} is a normal subgroup of G. Let R be as in Problem 31. conversely. In that case find 27. Prove that T is isomorphic to S3' the symmetric group of degree 3.136 Ring Theory Ch. Let R be the ring of all 2 X 2 matrices over 7L p . (d) Find a pSylow subgroup of G. then (~ ~) is invertible in R. show that det x = 0. y E R show that det(xy) 28. If F is a field. (a) Prove that G is a group. addition. y E R. 30. Let G be the set of elements x in the ring R of Problem 31 such that det(x) =1= 0.4 25. (Quite hard) (c) Find the center of G. 34.
In H(C). the complex numbers. for instance. Since a ring R is at least an abelian group under +. 40. We have used the left distributive law in this proof. 2 Some Simple Results 137 36. Prove that a finite domain is a division ring. Then o. These will allow us to avoid annoying trivialities that could beset a calculation we might be making. a3' a4 in F forces ao = al = a2 = a3 = O. 2. We begin with Lemma 4. (a)(b) = «a)b) = ((ab» = ab. Similarly. ( a) = a. hence aO = O. find an element x 0 such that x = O. aO = a(O + 0) = aO + aO. if a + b = a + c. 41. show that H(Q) is a division ring. (a) aO = Oa = Let R be any ring and let a. The results we shall prove in this section are not very surprising. since we are in an abelian group. not too interesting. and so on. SOME SIMPLE RESULTS Now that we have seen some examples of rings and have had some experience playing around with them. show that H(C) =1= is not a division ring. (c) (a)( b) = abo (d) If 1 E R. a(b) = (ab). (a + b) = (a) + (b). b E R. . there are certain things we know from our group theory background. (b) ab + a( b) = a(b + (b» = aO = 0 from Part (a). (a) Since 0 = 0 + 0. (a)b = (ab). The right distributive law gives Oa = O. Proof We do these in turn.Sec. If Q is the field of rational numbers. then (l)a = a.2. it would seem wise to develop some computational rules. 2 38. (b) a(b) = (a)b = (ab). If F = C. then b = c. Therefore. but it was something we had to do before going on to bigger and better things. (c) By Part (b). ai + a~ + a~ = 0 39. Show that H(F) is a division ring if and only if a6 + for ab a2. 37. and certainly not at all exciting. The same holds for the results we are about to prove. Use Problem 40 to show that 7Lp is a field if p is a prime.1. Neither was learning the alphabet.
Thus x 2 == x. but keeping in mind that R may be noncommutative. So ( l)a == . b E R. Thus 0 == x(xy + yx) == x 2y + xyx == xy + xyx.2.3. But (x + y)2 == x + xy + yx + y2 == X + xy + yx + y. A Boolean ring is commutative. D . We say that a ring R is a Boolean ring [after the English mathematician George Boole (18151864)] if x 2 == x for every x E R. D We close this brief section with a result that is a little nicer. b E R. One curiosity follows from the two distributive laws when R has a unit element. while 0 == (xy + yx)x == xyx + yx 2 == xyx + yx. If R is a system with 1 satisfying all the axioms of a ring. b E R. a Boolean ring. to it. y E R. we can cancel a on the left and b on the right to obtain b + a == a + b. say. since we are in a group under +. So. In any ring R. Therefore. by the left distributive law (a + b)(l + 1) == a(l + 1) + b(l + 1) == a + a + b + b.2.2. On the other hand.2. R is commutative. then (l)a + a == (l)a + (l)a == (1 + l)a == Oa == O. then R is a ring.138 Ring Theory Ch. by Lemma 4.4 (d) If 1 E R. Proof We must show that a + b == b + a for a. Proof This is clearly the analog of (a + f3)2 == Q? + 2af3 + f32 in the integers. R is therefore a ring. But then a + b + a + b == a + a + b + b. D Can you see the noncommutative version of the binomial theorem? Try it for (a + b)3. By the right distributive law (a + b)(l + 1) == (a + b)l + (a + b)l == a + b + a + b. By the right distributive law (a + b)2 == (a + b )(a + b) == (a + b)a + (a + b)b == a2 + ba + ab + b 2. The commutative law of addition follows from the rest. Lemma 4. This gives us xy + xyx == xyx + yx. D Another computational result.2. y2 == y.2. (a + b)2 == a2 + b2 + ab + ba for a. from which we have xy + yx == O. (x + y)2 == 2 X + y.4. We prove a nice result on Boolean rings. Lemma 4.a. and so xy == yx. as required. so we have (x + y) == (x + y)2 == X + xy + yx + y.a by the definition of . Lemma 4. Proof Let x. exactly what was claimed. except possibly a + b == b + a for a.
2. b E R. 5. Prove that R is commu 8. AND QUOTIENT RINGS In studying groups. and Quotient Rings 139 PROBLEMS 1. then q = pn for some integer n.) 3. 3 Ideals. and their kernelsthe normal subgroupsplayed a central role. it turned out that homomorphisms. If R is a finite integral domain. IDEALS. b are integers. (Hint: Cauchy's Theorem) 9. And it will be! Without any further fuss we make the Definition. . na has meaning for us for n E 7L. Prove that R is commutative. b are integers.. show that R is a field.1) = alb. With the background we have acquired about such things in group theory. (Hint: Consider 1Ia 2 as a runs through Up. show that if 1 + ~ + . b E R.Sec. where a. + 1/(p . Ifp is a prime andp > 3.1) = alb. a E R. If R is an integral domain and ab = (nm)(ab) if n. Homomorphisms. show that: (a) There exists a prime p such that pa = 0 for all a E F. show that b = c.. Let R be a ring in which x 4 tative. of homomorphism and normal subgroup do playa key role.. If F is a finite field. If a2 = 0 in R. show that ax + xa commutes with a. c E R. As a matter of fact. HOMOMORPHISMS. the analogs. Let p be an odd prime and let 1 + ~ + . the parallel development for rings should be easy and quick. = 3. so does aI. Let R be a ring: since R is an abelian group under +. There is no reason to expect that the same thing should not be true for rings. show that (xe .) 10. (b) If F has q elements. where a. 6. If R is a ring and e E R is such that e2 = e. b. 4. = x 7. Show that (na)(mb) and a.. for every x E R. m are integers = ac for a =1= 0. then p2 Ia.exe)2 = 0 for every x E R. in the setting of rings. Show thatp I a. Let R be a ring in which x 3 = x for every x E R. + 1/(p . (Hint: As a runs through Up.exe)2 (ex . homomorphism if The mapping 'P : R ~ R' of the ring R into the ring R' is a (a) 'P(a + b) = 'P(a) + 'P(b) and (b) 'P(ab) = 'P(a)'P(b) for all a.
enjoy? Clearly. a E I.: R ~ R' be a ring homomorphism and let Ker q.(rk) = O. then Ker q. Definition. Before going on. by "ideal" we shall always mean a twosided ideal. A nonempty subset I of R is called an (a) I is an additive subgroup of R. This property of Ker q.(k) = 0. swallows up multiplication from the left and the right by arbitrary ring elements. ideal of R. then q. Except for some of the problems. We shall soon see some examples of homomorphisms and ideals. So we can call on.(kr) = q. the image of R under a homomorphism from R to R'. then ra E L.1. as Lemma 4. We could split it and define a set L of R to be a left ideal of R if L is an additive subgroup of R and given r E R.(k)q. it is only natural and just that we demand that both these operations be preserved under what we would call a ring homomorphism.140 Ring Theory Ch. we shall not use the notion of onesided ideals in this chapter. But much more is true. certain results from this fact alone. as defined in Chapter 4. we record what was done above for Ker q. ideal of R if: Let R be a ring. is an additive subgroup of R.(r) = 0. a E L. But first we note that Part (b) in the definition of ideal really has a left and a right part. What properties does Ker q. q.4 Since a ring has two operations. from group theory Ker q. If k E Ker q. Let q. and similarly. (b) Given r E R. and expect. is an We shall soon see that every ideal can be made the kernel of a homomorphism. So Ker q. Indeed. so q. Section 1 (Prove!). = {x E R I q.(r) = Oq. the 0 being that of R'. An ideal as we defined it is both a left and a right ideal of R. Just as we saw in Chapter 2. is now abstracted to define the important analog in ring theory of the notion of normal subgroup in group theory. is a homomorphism of R viewed merely as an abelian group under + into R' (also viewed as a group under its addition). If q.3. Section 5 for groups. Shades of what happens for normal subgroups of groups! . Furthermore. So we require only left swallowingup for a left ideal. We can similarly define right ideals. here. is a subring of R'.(x) = O}. in working in noncommutative ring theory one uses this name.: R ~ R' is a homomorphism. By all rights we should then call what we called an ideal a twosided ideal of R. Property (a) in the Definition tells us that q. then ra E I and ar E I. and r E R.
We should be able to put all this together to make of R/K a ring. Because b + K = b' + K.a')b = ab . Since K is an additive subgroup of R. the first thing that comes up is to show that this product is welldefined. Furthermore. we have b .a'b) + (a'b . But this tells us (just from group theory) that ab + K = a'b' + K. Then the quotient group R/K as an additive group is a ring under the multiplication (a + K)(b + K) = ab + K. it is merely the set of all cosets a + K as a runs over R. We summarize all this in Theorem 4. So R/K is now endowed with a sum and a product.: R ~ R/K defined by q. But R is not just a group.b') = a'b . the map q. it is more than that. = (0).b') E K. then (a + K)(b + K) = ab + K = a'b' + K = (a' + K)(b' + K). since K is a right ideal of R). it is. if a + K = a' + K. after all. the mapping q. a ring. we have that if q. Nor is K merely an additive subgroup of R.: R ~ R/K defined by q. is a homomorphism of R into R'. However.a'b') = ab . Furthermore. (Prove!) This tells us right away that R/K is a ring. 3 Ideals. let K be an ideal of R. As always. then a .2. namely an ideal of R. Let K be an ideal of R. Homomorphisms. Just from grouptheoretic consideration of R as an additive group.a'b' are in K.3. So both (a . So R/K is a homomorphic image of R. As in groups. Thus (ab . and Quotient Rings 141 Finally. suppose R is the field C of complex numbers. being the homomorphic image of the ring R. An isomorphism from a ring R onto itself is called an automorphism of R.a' E K. exactly what we needed to have the product welldefined. We define Rand R' to be isomorphic if there is an isomorphism of R onto R'. since K is an ideal of R (in fact. we define a homomorphism to be a monomorphism if it is 11. A monomorphism which is also onto is called an isomorphism.a'b' E K.a')b E K. which we do. so far.(a) = a + K for a E R is a homomorphism of R onto R/K with kernel K. since K is an ideal of R (in fact. so (a . so a'(b . (Prove!) . How should we define a product in R/K in a natural way? What do we want to declare (a + K)(b + K) to be? The reasonable thing is to define (a + K)(b + K) = ab + K. Is it? We must show that if a + K = a' + K and b + K = b' + K.Sec.(a) = a + K for a E R is a homomorphism of R onto R/K having K as its kernel. then it is 11 if and only if Ker q. Then the mapping from C to C sending each element of C to its complex conjugate is an automorphism of C. the quotient group R/K exists. since K is a left ideal of R). For example.b' E K.a'b and a'(b .
3. let I = {a E R I cp(a) E I'}. Let I be those elements of R which in reduced form have an . for that is how we originally introduced 7L n . rl = rEI for every rEF.3. Then A + I = {a + i I a E A. Theorem 4. with the slightly obvious adaptation needed to make the proofs go through. Let the mapping cp: R ~ R' be a homomorphism of R onto R' with kernel K. but now. since 1 E I. what can the ideals of F be? Suppose that I =1= (0) is an ideal of F. Let the mapping cp: R ~ R ' be a homomorphism of R onto R' with kernel K. then RII = R'II'. then RII = (RIK)/(IIK). As is easily seen. Let n > 1 be a fixed integer and let In be the set of all multiples of n. Theorem 4. Let R be the ring of all rational numbers having odd denominators in their reduced form. Theorem 4. Then I is an ideal of R. Equivalently. define cp: 7L ~ 7L n by cp(a) = [aJ. Then. cp is a homomorphism of 7L onto 7L n with kernel In. then In is an ideal of I. If I' is an ideal of R'. 1 = a1a E I.5 (Second Homomorphism Theorem). if K is an ideal of R and I:> K is an ideal of R. So by Theorem 4. So F has only the trivial ideals (0) and F itself. Let A be a subring of a ring R and I an ideal of R. This sets up a 11 correspondence between all the ideals of R I and those ideals of R that contain K.3.4 One would have to be an awful pessimist to expect that the homomorphism theorems proved in Section 7 of Chapter 2 fail in this setting. since I is an ideal of F.3. We state the homomorphism theorems without any further ado. I = F. 7L n = 7Llln • (This should come as no surprise. for our first example. In short. the mapping l/J: RIK ~ R' defined by l/J( a + K) = cp (a) defines an isomorphism of RIK onto R'.3. If I' is an ideal of R' and I = {a E R I cp(a) E I'}.3 (First Homomorphism Theorem). In fact they do hold. As usual we use 7L.) 2. and (A + 1)11 === A I(A n I).3. let a =1= 0 E I. Let the mapping cp : R ~ R be a homomorphism of R onto R with kernel K. the ring of integers. I I We close this section with an inspection of some of the things we have discussed in some examples.3. If 7L n is the integers mod n. Let F be a field.6 (Third Homomorphism Theorem). i E I} is a subring of R. I:> K and 11K === I'.4 (Correspondence Theorem).142 Ring Theory Ch. Then R' === RIK. Examples 1. I is an ideal of A + I. in fact. leaving the few details needed to complete the proofs to the reader. Theorem 4.
3 to show that RII : : : : real field. Ker q. For if f(~) = f3 =1= 0. ::::::: H(7Lp ) (see . a2. Let I be those elements in R whose numerator is divisible by p. Let R be the ring of all realvalued continuous functions on the closed unit interval where (f + g)(x) = f(x) + g(x) and (fg)(x) = f(x)g(x) for f. Clearly.1 + I) (f + I) = (af31 + I) (f(~) + I) = (af31 + 1)(/3 + I) = af31f3 + I = a + I. Let R be the integral quaternions. is a homomorphism of R onto 7L 2 with kernel I. Thus we have shown directly that RII is isomorphic to the real field. So I is an ideal of R. g is in I. : R ~ 7L 2 . Then it is easy to verify that q.f(~). p a fixed prime.: R ~ ~ be defined by q. So g + I = I. in other words. We now use Theorem 4. then af31f + I = (af3. R = {aa + ali + a2j + a3 k I aa.Sec. = {f E R If(~) = O}. 1]. x E [0. then f(~) = 0.f(~» + f(~) = g(x) + f(~). We claim that I is an ideal of R. = I. by q. so (fg)(~) = f(~)g (~) = Og (~) = O. is a homomorphism. Homomorphisms. We leave it to the reader to verify that q.f(~) = 0. a real. gfis also in I. Because g(~) = f(~) . it is an additive subgroup. Thus fg E I. Give the explicit isomorphism of RII onto 7L 2 • 4. We claim that every real a comes up. then f(x) = (f(x) . since I is commutative. Define q. the integers mod 2. (alb) = 1 if a is odd. let The reader should verify that Ip is an ideal of R and that RlIp Problem 36 of Section 1 and the paragraph before it). So RII consists of all a + I. 6. I is an ideal of Rand RII::::::: 7L p . in other words. al. (Prove!) 5. is onto and Ker q. a3 E 7L}.(f) = f(~). Because f(~) is just a real number. Furthermore if f E I and g E R.3. 3 Ideals. So RII : : : : image of q. q. and Quotient Rings 143 even numerator. For a fixed prime p. Let I = {f E R If(~) = OJ. where g(x) = f(x) . Thus 7L 2 ::::::: RII. RII consists of the cosets a + I for a real. Thusf + I = (f(~) + g) + I = f(~) + I. What is RII? Given any fER. = IR.(alb) = 0 if a is even (a. b have no common factor) and q. it is easy to see that I is an ideal of R. Let R be the ring of all rational numbers whose denominators (when in reduced form) are not divisible by p. Let q. g E R. the integers modp.
Define cp: R ~ ~ bYCP ( ~ ~) a. (OxOO = (0 0 ' an ideal of R? Consider so it is in I. ~ :) (~ ~) + (~ ~) so that (~ since :) + I = ((~ ~) + (~ ~)) + I = (~ ~) + L ~) + (~ ~ ) is in 1. then c. that is. Given then ( ( ~ = :) E R . Let I = {(~ ~) I I a. Similarly. Let R reals. I is easily seen to be an additive subgroup of R. 0 b)(X Y) 0 bX) (OOOx = (0 0 ' so it. Thus all the cosets of I in R look like ( ~ ~) + 1. 1\1 ( ( ~ I) = a. b E ~}. we can check that '" is an isomorphism = onto the real field. If we map this onto a. given ( ~ ~ ). cP ( ~ ~) = a.4 7. So I is an ideal of R. So R/I :::::: IR. For. C cP ( ~ ~) = C aa 0 (o b) + (c d) = (a + b+ d) 0 a+c' . We claim that cP is a homomorphism. We see that R/I "'" ~ another way. Is it X Y)(O b) 0 Xb). What is R/I? We approach it from two points of view. ( ~ ~). is in I. R is a subring of the 2 X 2 matrices over the = {(~ ~) bE ~}. too.144 Ring Theory Ch.
Also. Let R be any commutative ring with 1. If a E R. if u E (a) and r E R. v = ya for some x. so is in (a). let (a) = {xa Ix E R}. We claim that (a) is an ideal of R. image of 'P = IR by 8. ae ~(~ ~)~(~ ~). y E R. v E (a).3.Sec. Thus (a) is an ideal of R. We leave it to the reader to verify that l/J is an isomorphism of R onto C. suppose that u. and Quotient Rings 145 and (~ ~)(~ ~) = (~ ad ~ be} hence ~((~ ~)+(~ ~))=~(a~e ~:~) = a + e = ~ ( ~ :) + ~ ( ~ ~) and ~((~ : )( ~ ~)) = ~(~ = = ad ~ be) ~) ~. but also 'P (a 0 b) b) = 0. Let R = {( _ ~ : ) I a. thus u = xa. So R/I Theorem 4. Define "': R ~ C by '" ( _ ~ ~ ) = a + bi. then u = xa. . a a ~ Thus a = O. b E ~} and let C be the field of complex numbers. SInce (a0 b) E Ker 'P. E Ker So ~ is indeed a homomorphism of R onto R What is Ker ~? If ( ~ a then'P ( 0 a = a. but it is certainly a left ideal of R.3. So R is isomorphic to the field of complex numbers. Note that if R is not commutative. whence u ± v = xa ± ya = (x ± y) a E (a). From this we see that I = Ker 'P. then (a) need not be an ideal. To see this. Homomorphisms. 3 Ideals. . 9. hence ru = r(xa) = (rx)a.
Prove Theorem 4. (Hint: Look at Example 9. : R ~ R' be a homomorphism of R onto R' with kernel K. show that I n I is an ideal of R. 17. verify that the mapping t/J given is an isomorphism of R onto C. If R. S are rings. let A = {a E R I q. 9.3. Prove that I + I is an ideal of R. su).5. s)(t. A :> K. In Example 3. If I is an ideal of R and A is a subring of R. Prove that II is an ideal of R. 11. 15. (r.3. s + u).4. 2. (b) A/K ~ A'. let II be the set of all sums of elements of the form ij. I are ideals of R. If A' is a subring of R'. I are ideals of R. define I + I by I + I = {i + j liE I. show that R/I ~ 7L p • 13. u) = (rt. 7.3. define the direct sum of Rand S.6. s) = (rb (r.146 Ring Theory Ch. Let q. j E I}. u) = (r + t. In Example 8. 16. 4. show that I n A is an ideal of A. by EB S = {(r. s) IrE R. Prove that L(a) is an ideal of R. j E I. Give a complete proof of Theorem 4. If I. 8. I are ideals of R. give the explicit isomorphism of R/I onto 7L 2 • 12. : R ~ R' is a homomorphism of R onto R' and R has a unit element. If I. Show that: (a) A is a subring of R. Show that the ring of 2 X 2 matrices over the reals has nontrivial left ideals (and also nontrivial right ideals). 6.3. R R EB S. If R is a commutative ring and a E R. If q. If R is a commutative ring with 1 and R has no ideals other than (0) and itself. Prove Theorem 4. (c) If A' is a left ideal of R'. where i E I. s) Sl) if and only if r = rb s = and where + (t. In Example 4. prove that R is a field.(1) is the unit element of R'. show that R/Ip ~ H(7L p )' 14. . In Example 6. s E S} Sb where (r.4 PROBLEMS Easier Problems 1.(a) E A '}.) * 3. Give a complete proof of Theorem 4. If I. show that q.2. 5. 1. 10. let L(a) = {x E R Ixa = OJ. then A is a left ideal of R.
Sec. n two relatively prime integers. 21. and ab = 0 for every a. and Quotient Rings 147 18. b any integers. p a prime. 1m the multiples of m in Z. If I. Let Z be the ring of integers and m. Homomorphisms. f are ideals of R. . suppose that I is an ideal of R. Let Z15 be the ring of integers mod 15. (c) R/I ~ FEB F. Show that I = (0) or I = R. 25. . Let R be the ring of 2 X 2 matrices over the real numbers. If m. (Contrast this with the result of Problem 16.. (b) I is an ideal of R. 3 Ideals.) *24. (Hint: Use a counting argument to show that the homomorphism of Problem 22(b) is onto. prove that Zmn ~ Zm EB Zn. let R1 = RII and R2 = Rlf. Show that Z15 ~ Z3 EB Z5. r + f) is a homomorphism of R into R 1 EB R 2 such that Ker 'P = I n f.. show that there are exactly 2n solutions of x 2 == x mod(p 1 ••• Pn). 0) IrE R} and {CO. If Pb P2' .creal} and I I = {(~ ~)lbreal}..Showthat: (a) R is a ring. Use the result of Problem 23 to prove the Chinese Remainder Theorem. n are relatively prime.IfR={(~ :) a. s) Is E S} are ideals of R EB S isomorphic to Rand S. (a) What is 1m n In? (b) Use the result of Problem 20 to show that there is a onetoone homomorphism from ZIImn to ZIIm EB ZIIn • 23. Let R be a ring with 1 and let S be the ring of 2 X 2 matrices over R. we can find an integer x such that x == a mod m and x == b mod n simultaneously.) Harder Problems 26. which asserts that if m and n are relatively prime integers and a. where 0 ::.b. Pn are distinct odd primes. MiddleLevel Problems 22. Prove that either R is a division ring or R has P elements. where F is the field of real numbers. respectively. 27. Show that R EB S is a ring and that the subrings {(r. b E R. x < Pl· . Show that 'P: R ~ R 1 EB R2 defined by 'P(r) = (r + I. 19. Pn· 28. 20. and In the multiples of n in Z. If I is an ideal of S show that there is an ideal f of R such that I consists of all the 2 X 2 matrices over f. Suppose that R is a ring whose only left ideals are (0) and R itself.
Suppose that there are no ideals other than K itself and R which contain K. Definition. Then (a) = {xa I x E R} is an ideal of R.1. In particular.4. (a) =1= (0). R' must be a field. then.4. by Lemma 4. Since a = la E (a). (a) = R.3. it is a result that stands on its own two feet. So R is a field. because 1 E R.148 Ring Theory Ch. MAXIMAL IDEALS This will be a section with one major theorem. A proper ideal M of R is a maximal ideal of R if the only ideals of R that contain Mare M itself and R.1. by our hypothesis on R. There are many difficult results that are of very little interest and of even less importance. However. The discussion preceding this definition has already almost proved for us . Let R be a commutative ring with unit whose only ideals are (0) and itself. This shows that a has the inverse b in R. What does this imply about R'? Since (0) in R' corresponds to K in R. and R' corresponds to all of R. 4. then ab = 1 (so b is also a right inverse of a). Lemma 4. Proof Let a =1= 0 be in R. It isn't difficult to prove. then there is a 11 correspondence between ideals of R' and ideals of R that contain K. The importance of this result will only become fully transparent when we discuss fields in Chapter 5. Thus. as we verified in Example 9 in the preceding section. So if R' is commutative and has a unit element. Show that if the left inverse b of a is unique. there are some resultsmany. but in mathematics the correlation between difficult and important isn't always that high. An element a E R is said to have a left inverse if ba = 1 for some b E R. 0 In Theorem 4. by the definition of (a). But then. every element i E R is a multiple xa of a for some x E R. and some easy results that are crucial. Let R be a ring with 1. This prompts the following definition. Of course. manywhich are of incredible difficulty and importance.4 29. Then R is a field. 1 = ba for some b E R. we must conclude that in this case R' has no ideals other than (0) and itself.4the Correspondence Theoremwe saw that if 'P : R ~ R' is a homomorphism of R onto R' with kernel K.
(a + bi)1 E N. (Prove using congruences mod 3 !) But t = r 2 + S2 = (r + si)(r . Consequently.3. 3 (r 2 + S2). Section 3).4. a subring of C (i 2 = 1). So there is an element r + si E N. we see that M is a maximal ideal of 7L if and only if M consists of all the multiples of some prime p. Examples 1. b E 7L}. D We give a few examples of maximal ideals in commutative rings. and since 1 E R we have that R' has 1 + M as its unit element.2. 0 This theorem will be our entry into the discussion of fields. Thus the set of maximal ideals in 7L corresponds to the set of prime numbers. Thus since RIM == 7L n and since 7L n is a field if and only if n is a prime.si). v. Let R be a commutative ring with 1.3.2 has a converse.4. by Lemma 4. so is in N. since N is an ideal of R. Let 7L be the integers. since r + si E N and N is an ideal of R. it has only (0) and itself as ideals. and let M be a maximal ideal of R. for all a + bi E R. hence ut E Nand 3 E MeN. 4 Maximal Ideals 149 Theorem 4. So the only ideal of R above M is R itself. This is Theorem 4. we saw in the discussion above that R' has no nontrivial ideals. and let R = {a + bi I a. (See Problem 3. by the correspondence given us by Theorem 4. Since RIM is a field.4. where 31 a and 31 b. then M is a maximal ideal of R. Proof We saw in Example 2 of Section 3 that the only ideals in a field Fare (0) and F itself. As an ideal of 7L we certainly have that M is an additive subgroup of 7L. We claim that M is a maximal ideal of R. Then RIM is a field. so 3v E N. Thus ut + 3v = 1 for some integers u. Therefore. 2. So N has an integer t = r 2 + S2 not divisible by 3. 1 = ut + 3v E N. so must consist of all multiples of some fixed integer n. We leave it to the reader to verify that M is an ideal of R. If R is a commutative ring with 1 and M an ideal of R such that RIM is a field. there can be no ideal of R property between M and R. Thus. Thus M is a maximal ideal of R. where 3 does not divide r or 3 does not divide s.4. but tEN. For suppose that N :J M and N =1= M is an ideal of R.1. for it will enable us to construct particularly desirable fields whenever we shall need them. M is a maximal ideal of R.4. Let 7L be the integers and M an ideal of 7L. This tells us that N = R. Theorem 4. Because M is a maximal ideal of R. Therefore. Let M be the set of all a + bi in R. . Proof There is a homomorphism of R onto R' = RIM. But then. R' = RIM is a field. Therefore.Sec.
150
Ring Theory
Ch.4
By Theorem 4.4.2 we know that RIM is a field. It can be shown (see Problem 2) that RIM is a field having nine elements.
3. Let R be as in Example 2 and let I = {a + bi I Sla and SIb}. We assert that I is not a maximal ideal of R. In R we can factor S = (2 + i)(2  i). Let M = {x(2 + i) Ix E R}. Mis an ideal of R, and since S = (2 + i)(2  i) is in M, we see that I C M. Clearly, I =1= M for 2 + i E M and is not in I because S {2. So I =1= M. Can M = R? If so, then (2 + i)(a + bi) = 1 for some a, b. This gives 2a  b = 1 and 2b + a = 0; these two equations imply that Sa = 2, so a = ~, b = t. But ~ fl. 7L, t $. 7L; the element a + bi = ~  ti is not in R. So M =1= R. One can show, however, that M is a maximal ideal of R. (See Problem 3.)
4. Let R = {a + bv'21 a, b integers}, which is a subring of the real field under the sum and product of real numbers. That R is a ring follows from
(a + bv'2) + (c + dv'2)
and
=
(a + c) + (b + d)v'2
(a + bv'2)(c + dv'2)
=
(ac + 2bd) + (ad + bc)v'2.
Let M = {a + bv'2 E R I Sla and 5Ib}. M is easily seen to be an ideal of R. We leave it to the reader to show that M is a maximal ideal of R and that RIM is a field having 2S elements. 5. Let R be the ring of all realvalued continuous functions on the closed unit interval [0, 1]. We showed in Example S of Section 3 that if M = {fER If (~) = O}, then M is an ideal of Rand RIM is isomorphic to the real field. Thus, by Theorem 4.4.3, M is a maximal ideal of R. Of course, if we let My = {f E R If( y) = O}, where y E [0, 1], then My is also a maximal ideal. It can be shown that every maximal ideal in R is of the form My for some y E [0,1], but to prove it we would require some results from real variable theory. What this example shows is that the maximal ideals in R correspond to the points of [0, 1].
PROBLEMS
1. If a, b are integers and 3 { a or 3 { b, show that 3 { (a 2 + b 2 ).
2. Show that in Example 2, RIM is a field having nine elements. 3. In Example 3, show that M
= {x(2 + i) Ix
E R} is a maximal ideal of R.
Sec. 5
Polynomial Rings
~
151
4. In Example 3, show that RIM
5. 6. 7. 8.
7L s. In Example 3, show that RII ~ 7L s EB 7L s. In Example 4, show that M is a maximal ideal of R. In Example 4, show that RIM is a field having 25 elements. Using Example 2 as a model, construct a field having 49 elements.
We make a short excursion back to congruences mod p, where p is an odd prime. If a is an integer such that p ~ a and r == a mod p has a solution x in 7L, we say that a is a quadratic residue mod p. Otherwise, a is said to be a quadratic nonresidue mod p.
9. Show that (p  1)/2 of the numbers 1, 2, ... , p  1 are quadratic residues and (p  1)/2 are quadratic nonresidues modp. [Hint: Show that {x 2 x =1= 0 E 7L p } forms a group of order (p  1)/2.] 10. Let m > 0 be in 7L, and suppose that m is not a square in 7L. Let R = {a + Y;;; b I a, b E 7L }. Prove that under the operations of sum and product of real numbers R is a ring. 11. If p is an odd prime, let us set Ip = {a + Y;;; b I pia and pi b}, where a + Y;;; b E R, the ring in Problem 10. Show that I p is an ideal of R.
1
12. If m is a quadratic nonresidue mod p, show that the ideal I p in Problem 11 is a maximal ideal of R. 13. In Problem 12 show that Rllp is a field having p 2 elements.
5. POLYNOMIAL RINGS
The material that we consider in this section involves the notion of polynomial and the set of all polynomials over a given field. We hope that most readers will have some familiarity with the notion of polynomial from their high school days and will have seen some of the things one does with polynomials: factoring them, looking for their roots, dividing one by another to get a remainder, and so on. The emphasis we shall give to the concept and algebraic object known as a polynomial ring will be in a quite different direction from that given in high school. Be that as it may, what we shall strive to do here is to introduce the ring of polynomials over a field and show that this ring is amenable to a careful dissection that reveals its innermost structure. As we shall see, this ring is very wellbehaved. The development should remind us of what was done for the ring of integers in Section 5 of Chapter 1. Thus we shall run into the analog of Euclid's algorithm, greatest common divisor, divisibility, and possibly most important, the appropriate analog of prime number. This will lead to
152
Ring Theory
Ch.4
unique factorization of a general polynomial into these "prime polynomials," and to the nature of the ideals and the maximal ideals in this new setting. But the polynomial ring enjoys one feature that the ring of integers did not: the notion of a root of a polynomial. The study of the nature of such rootswhich will be done, for the most part, in the next chapterconstitutes a large and important part of the algebraic history of the past. It goes under the title Theory of Equations, and in its honorable past, a large variety of magnificent results have been obtained in this area. Hopefully, we shall see some of these as our development progresses. With this sketchy outline of what we intend to do out of the way, we now get down to the nittygritty of doing it. Let F be a field. By the ring of polynomials in x over F, which we shall always write as F[x], we mean the set of all formal expressions p(x) = ao + alx + ... + an_lX n 1 + anx n, n 2:: 0, where the ai' the coefficients of the polynomial p (x), are in F. We sometimes employ the alternative notation: p(x) = aox n + alx n 1 + ... + an. In F[x] we define equality, sum, and product of two polynomials so as to make of F[x] a commutative ring as follows: 1. Equality. We declare p(x) = ao + alx + ... + anx n and q(x) = b o + b1x + ... + bnx n to be equal if and only if their corresponding coefficients are equal, that is, if and only if ai = b i for all i 2:: o. We combine this definition of equality of polynomials p(x) and q(x) with the convention that if
q(x)
and if b m + 1
=
bo + b1x + · · . + bnxn
= ... = b n = 0, then we can drop the last n  m terms and write
q(x)
=
q(x) as bo + b1x + ... + bmXm •
This convention is observed in the definition of addition that follows, where s is the larger of m and n and we add coefficients an+l = ... = as = 0 if n < s or bm + 1 = ... = bs = 0 if m < s. 2. Addition. If p(x) = ao + alx + ... + anx n and q(x) = b o + b1x + ... + bmx m , we declare p(x) + q(x) = Co + CIX + ... + csx s, where for each i, Ci = ai + bi. So we add polynomials by adding their corresponding coefficients. The definition of multiplication is a little more complicated. We define it loosely at first and then more precisely.
Sec. 5
Polynomial Rings
153
3. Multiplication. If p(x) = ao + alx + ... + anx n and q(x) = b o + b1x + ... + bmx m, we declare p(x)q(x) = Co + CIX + ... + ctX t, where the Ci are determined by multiplying the expression out formally, using the distributive laws and the rules of exponents XU XV = x u + v , and collecting terms. More formally,
We illustrate these operations with a simple example, but first a notational device: If some coefficient is 0, we just omit that term. Thus we write 9 + Ox + 7x 2 + Ox 3  14x 4 as 9 + 7x 2  14x 4 • Let p(x) = 1 + 3x 2 , q(x) = 4  5x + 7x 2  x 3• Then p(x) + q(x) = 5  5x + 10x 2  x 3 while
p(x)q(x)
=
=
(1 + 3x 2 )(4  5x + 7x 2
4  5x + 7x 2


x3)

x 3 + 3x 2 (4  5x + 7x 2 x 3 + 12x 2

x3)

= 4 =
5x + 7x 2
15x3 + 21x4

3x5
4  5x + 19x2
16x3 + 21x4
3x5 •
Try this product using the Ci as given above. In some sense this definition of F[x] is not a definition at all. We have indulged in some hand waving in it. But it will do. We could employ sequences to formally define F[x] more precisely, but it would merely cloud what to most readers is well known. The first remark that we makeand do not verifyis that F[x] is a commutative ring. To go through the details of checking the axioms for a commutative ring is a straightforward but laborious task. However, it is important to note Lemma 4.5.1.
F[x] is a commutative ring with unit.
=1=
Definition. If p(x) = ao + alx + ... + anx n and an degree ofp(x), denoted by degp(x), is n.
0, then the
So the degree of a polynomial p(x) is the highest power of x that occurs in the expression for p(x) with a nonzero coefficient. Thus deg(x  x 2 + x 4) = 4, deg(7x) = 1, deg 7 = O. (Note that this definition does not assign a degree to O. It is, however, sometimes convenient to adopt the convention that the degree of 0 be  00, in which case many degree related results will hold in this extended context.) The polynomials of degree 0 and the polynomial 0 are called the constants; thus the set of constants can be identified with F. The degree function on F[x] will playa similar role to that played by
154
Ring Theory
Ch.4
the size of the integers in 7L, in that it will provide us with a version of Euclid's Algorithm for F[x]. One immediate and important property of the degree function is that it behaves well for products. Lemma 4.5.2. If p(x), q(x) are nonzero elements of F[x], then deg(p(x)q(x» = degp(x) + deg q(x).
Proof Let m = degp(x) and n = deg q(x); thus the polynomialp(x) = ao + alx + ... + amx m, where am =1= 0, and the polynomial q(x) = b o + b1x + ... + bnx n, where bn =1= 0. The highest power of x that can occur in p(x)q(x) is x m + n, from our definition of the product. What is the coefficient of x m+ n? The only way that x m+ n can occur is from (amxm)(bnx n) = ambnx m+ n. So the coefficient of x m+ n in p(x)q(x) is amb n, which is not 0, since am =1= 0, bn =1= 0. Thus deg(p(x)q(x» = m + n = degp(x) + deg q(x), as claimed in the lemma. D
One also has some information about deg(p(x) + q(x». This is Lemma 4.5.3. If p(x), q(x) E F[x] and p(x) + q(x) deg(p(x) + q(x» ::; max(degp(x), deg q(x».
=1=
0, then
We leave the proof of Lemma 4.5.3 to the reader. It will play no role in what is to come, whereas Lemma 4.5.2 will be important. We put it in so that the "+" should not feel slighted visavis the product. An immediate consequence of Lemma 4.5.2 is Lemma 4.5.4.
F[x] is an integral domain.
Proof If p(x) =1= and q(x) =1= 0, then degp(x) ;::: 0, deg q(x) ;::: 0, so deg(p(x)q(x» = degp(x) + deg q(x) ;::: 0. Therefore, p(x)q(x) has a degree, so cannot be (which has no degree assigned to it). Thus F[x] is an integral domain. D
°
°
One of the things that we were once forced to learn was to divide one polynomial by another. How did we do this? The process was called long division. We illustrate with an example how this was done, for what we do in the example is the model of what we shall do in the general case. We want to divide 2x 2 + 1 into x 4  7x + 1. We do it schematically as follows:
where bn =1= 0 and where m 2:: n. where q(x).~) + (7x + ~) and 7 x + ~ is called the remainder in this division.. which satisfies the conclusion of the theorem.5 (Division Algorithm). then Given the polynomials f(x). This "repeat the procedure" suggests induction. g(x) E f(x) = q(x)g(x) + rex). where am =1= 0 and the polynomial g(x) = b o + btx + . Theorem 4. But keep in mind that all we shall be doing is what we did in the example above. So suppose that degf(x) 2:: deg g(x). here we call it the Division Algorithm. What exactly did we do? First. However.. where did the ~X2 come from? It came from the fact that when we multiply 2x 2 + 1 by ~ x 2 we get x 4 . rex) E F[x] and rex) = 0 or deg rex) < deg g(x). in the integers.Sec. and that is how we shall carry out the proof. where g(x) =1= 0.7x + 1 gets rid of the x 4 term and we go on to what is left and repeat the procedure. What this gives us is something like Euclid's Algorithm.. So subtracting ~x2(2x2 + 1) from x 4 .5. + a mxm . the highest power occurring in x 4 .. thus the polynomial f(x) = ao + at X + . If either f(x) = 0 or degf(x) < deg g(x). + bnx n. Proof We go by induction on degf(x). Consider :m xmng(x) n = ~m xmn(b o + b1x + · · · + bnx n ) n . F[x]. 5 Polynomial Rings 155 7x + 1 7x + 1 7x + 1~ and we interpret this as: x4  7x + 1 = (2x 2 + 1)(~x2 . then f(x) = Og(x) + f(x).7x + 1.
So why not use the degree function as the mechanism for finding g(x)? Since I =1= (0) there are elements in I having nonnegative degree. deg r(x) cannot be less than deg g(x). Theorem 4.(ami bn)xmng(x) = h(x) is such that the relation deg h(x) < degf(x) holds. an ideal of F[x] must merely consist of all multiples. As we see in the next theorem.q(x)g(x) is in I. Thus. I consists of all multiples of the fixed polynomial g (x) by the elements of F[x].5. So every element in I is a multiple of g(x). then deg t(x) 2:: deg g(x). so f(x) . On the other hand. so r(x) = t(x) . Remembering what h(x) is. we have achieved the form claimed in the theorem. So we are left with r(x) = O. we have that q(x)g(x) E I. where qt(x). Thus. we need to produce that fixed polynomial g(x). that is. then I = {f(x)g(x) I f(x) E F[x]}.n + ql(X). we get h(x) = f(x) . of some fixed polynomial.4 Thus (amlbn)xmng(x) has the same degree and same leading coefficient as does f(x). By assumption. by induction. by the division algorithm. But this says that t(x) = q(x)g(x).~m xmng(x) = qJ(x)g(x) + r(x) n so If q(x) = (amlbn)x m.D The Division Algorithm has one immediate application: It allows us to determine the nature of all the ideals of F[x]. that is. So there is a polynomial g(x) =1= 0 in I of minimal degree. by elements of F[x].6. Since g(x) has minimal degree for the elements of I and r(x) E I. If I =1= (0) is an ideal of F[x]. r(x) E F [x] and r(x) = 0 or deg r(x) < deg g(x). t(x) E I. g(x) =1= 0 is in I and if 0 =1= t(x) E I.q(x)g(x) is in I. Where are we going to dig it up? The one control we have numerically on a given polynomial is its degree. where r(x) = 0 or deg r(x) < deg g(x). Proof To prove the theorem. . t(x) = q(x)g(x) + r(x). h (x) = qt (x)g(x) + r(x).156 Ring Theory Ch. But since g(x) E I and I is an ideal of F[x]. thus t(x) .
satisfy the containing relation (f(x» c (g(x». To get some sort of uniqueness in all this. then ag(x) is in I and has the same degree as g(x). We shall write the ideal generated by a given polynomial. g(x) E F[x]. written as g(x) If(x). where g(x) is a polynomial of lowest degree contained in I.2. Moreover. We say that g(x) divides f(x). In order to get some sort of uniqueness.5. g(x). we single out a class of polynomials. with g(x) =1= O. Singling this out as the generator of I does give us a "monic" uniqueness for the generation of I. An integral domain R is called a principal ideal domain if every ideal I in R is of the form I = {xa Ix E R} for some a E I.6 can be stated as: F[x] is a principal ideal domain. (Prove!) We again emphasize the parallelism between 7L. the ideals (f(x» and (g(x» of F[x]. f(x) E F[x] is a monic polynomial if the coefficient of its highest power is 1. we shall insist that the greatest common divisor always be a monic polynomial. namely {f(x)g(x) I f(x) E F[x]}. Suppose f(x). then I = (g(x». But g(x) is not unique. Note that if f(x) =1= 0 and g(x) If(x). Our next step in this parallel development with what happens in the integers is to have the notion of one polynomial dividing another. D Definition. f(x)g(x) E I for all f(x) E F[x].5. 5 Polynomial Rings 157 since g(x) E I and I is an ideal of F[x]. The proof showed that if I is an ideal of F[x]. then there is only one monic polynomial of lowest degree in I. for if a =1= 0 E F. Definition. then deg g(x) ~ degf(x) by Lemma 4. so I = (ag(x».Sec. as (g(x». generated by f(x) and g(x). The net result of all this is that I = {f(x)g(x) I f(x) E F[x]}. respectively. Definition. the integers. if f(x) = a(x)g(x) for some a(x) E F[x]. Thus f(x) is monic means that We leave to the reader to show that if I is an ideal of F[x]. Theorem 4. and F[x] by turning to the notion of greatest common divisor. .
g(x) =1= 0 are in F[x] and f(x) I g(x) and g(x) If(x). Since [ =1= 0 is an ideal of F[x].5. since both f(x) and g(x) are in [. For any two polynomials f(x) and g(x) E F[x] (not both 0). so it. we neither know. and for t(x) E F[x]. we know that I =1= 0. 0 Another way to see the uniqueness of d(x) is from Lemma 4. This assures us that d(x) I f(x) and d(x) I g(x). as yet. If f(x) =1= 0.158 Ring Theory Ch. Thus I is an ideal of F[x]. (b) If hex) If(x) and hex) I g(x). that it exists. sex) run freely over F[x]. (rl(x)f(x) + Sl(X)g(X» + (r2(x)f(x) + S2(X)g(X» = (rl(x) + r2(x»f(x) + (Sl(X) + S2(X»g(x). Since g(x) =1= 0. So d(x) is the greatest common divisor off(x) and g(x). we have that d(x) = a(x)f(x) + b(x)g(x) for some appropriate a(x).7. Thus if hex) If(x) and hex) I g(x). they must then be multiples of d(x) by elements of F[x]. Given f(x) and g(x) =1= 0 in F[x]. For. We could define it in another.4 Definition. the polynomial d(x) E F[x] is the greatest common divisor of f(x). is again in I. then their greatest common divisor d(x) E F[x] exists. too. If we did that. then f(x) = ag(x). Because d(x) E I and [ is the set of all r(x)f(x) + s(~)g(x). Proof Let I be the set of all r(x)f(x) + s(x)g(x) as rex). d(x) = a(x)f(x) + b(x)g(x) for some a(x). where a E F. but we would not know its form. Theorem 4. the uniqueness of d(x) is guaranteed by the demand that we have made that the greatest common divisor be monic.6). t(x)(r(x)f(x) + s(x)g(x» = (t(x)r(x»f(x) + (t(x)s(x»g(x). nor what its form may be. b(x) E F[x]. so is again in I. We claim that I is an ideal of R. moreover. way as the monic polynomial of highest degree that divides both f(x) and g(x). it is generated by a unique monic polynomial d(x) (Theorem 4. its existence would be automatic. Since f(x). . then hex) I d(x). Although we defined the greatest common divisor of two polynomials. g(x) if d (x) is a monic polynomial such that: (a) d(x) If(x) and d(x) I g(x).8.5. This proves the theorem. b(x) E F[x].5. g(x) are in I. then hex) I (a(x)f(x) + b(x)g(x» = d(x). and equivalent.
0 We are now ready to single out the important class of polynomials that will play the same role as prime objects in F[x] as did the prime numbers in 7L. Definition. then f(x) and g(x) are relatively prime. if a(x)f(x) + b(x)g(x) = 1 for some a(x). then q(x) I g(x).Sec. then either p(x) If(x) or p(x) is relatively prime to f(x). q(x) divides the lefthand side of the relation in (1).5. that is. Although it is merely a very special case of Theorem 4. 5 Polynomial Rings 159 Proof By the mutual divisibility condition on f(x) and g(x) we have. g(x) in F[x] are said to be relatively prime if their greatest common divisor is 1.5. Thus q(x) divides the righthand side of (1). degf(x) :::. Therefore. degf(x). so a(x) = a. so degf(x) = deg a(x) + deg g(x) = deg a(x) + degf(x). b(x) E F[x]. If f(x).2. Proof We leave this "conversely" part to the reader as exercise. D As with the integers. to emphasize it and to have it to refer to.5. But f(x) = a(x)g(x). deg g(x) :::. an element of F.5. b(x) E F[x]. q(x) I g(x).7. g(x).5. a(x)f(x)g(x) + b(x)q(x)g(x) = = 1 for some a(x). in consequence of which deg a(x) = 0. g(x) E F[x] are relatively prime. by Lemma 4. The polynomial p (x) E F[x] is irreducible if p (x) is of positive degree and given any polynomialf(x) in F[x]. Proof By Theorem 4. Conversely. Definition.9 a(x)f(x) + b(x)q(x) b(x) E F[x]. we state: Theorem 4. we have Theorem 4. then a(x)f(x) + b(x)g(x) = 1 for some a(x). . D We leave the proof of the uniqueness of the greatest common divisor via Lemma 4. so degf(x) = deg g(x). The polynomialsf(x).8 to the reader. If q(x) and f(x) are relatively prime and if q(x) If(x)g(x). (1) Since q(x) I b(x)g(x)q(x) and q(x) I f(x)g(x) by hypothesis.9.5. the desired conclusion.10.
5. Corollary to Theorem 4. or b(x) is constant. Note that the irreducibility of a polynomial depends on the field F. (See Theorem 1. namely 1 or a monic polynomial of the form c·p(x).. then the ideal M = (p(x» is a maximal ideal of F[x]. According to the definition.. we have only two possibilities for (p(x). Theorem 4. b(x) in F[x]. Proof We first prove that if p(x) is irreducible in F[x]. where a(x) and b(x) are in F[x]. where IR is the field of real numbers.5. f(x» = 1 or p(x) I f(x).11. If p(x) E F[x] . is a maximal ideal of F[x]. the polynomial x 2 . then by Theorem 4. In other words.f(x». then b(x) must be a constant. a(x» = 1.6. an irreducible polynomial p(x) in F[x] enjoys the property that (p(x». Then.) D Aside from its other properties. where Q is the field of rational numbers. where at (x). The proof of this fact is very similar to the proof of an analogous observation concerning two equivalent definitions of a prime number. This shows that an irreducible polynomial cannot be factored as a product of two nonconstant polynomials. then the ideal (p(x» generated by p(x) in F[x] is a maximal ideal of F[x] if and only if p(x) is irreducible in F[x].10. For. and N ~ M. Proof We leave the proof to the reader. p(x) I b(x).5..2 is not irreducible in lR[x].160 Ring Theory Ch.4 We should note here that the definition implies that a polynomial p(x) of positive degree is irreducible in F[x] if and only if p(x) cannot be factored as a product of two polynomials of positive degree. an element of F). Thus (p(x). First suppose that p(x) (of positive degree) cannot be factored as a product of two noneonstant polynomials. If p(x) I a(x).v2)(x + v2). the ideal generated by p(x) in F[x]. which shows that p(x) is irreducible. ak(x) are in F[x]. .6. By Theorem 4.5. . For instance. but x 2 . where c is an element of F. suppose that N is an ideal of F[x]. given any f(x) in F[x]. then p(x) I ai(x) for some i.10. We prove this now. then either a(x) is a constant (that is.2 is irreducible in Q[x].5. Now let p(x) be irreducible in F[x] and suppose p(x) = a(x)b(x) for some a(x). and in this case a(x) must be a constant. If p(x) is irreducible in F[x] and p(x) I at (x)a2(x) . for in lR[x] x2  2 = (x . p(x) and a (x) are relatively prime. If. if p(x) = a(x)b(x). ak(x).. on the other hand. we must have p(x) I a(x) or (P(x). .
if f(x) == b E F. hence a(x) is a constant or f(x) is a constant. Since deg a(x) . we have shown M to be a maximal ideal of F[x]. then p(x) == a(x)b(x).12. Note that if p(x) == aox n + atXn1 + . This says that N == F[x]. which says that N C M. p(x) E N. N == (a(x» =1= F[x]. where deg a(x) . Therefore. . . which tells us that a(x) == f(x)p(x).::: 1. which is isomorphic to F). p(x) == a(x)f(x). then p(x) == af(x). If a(x) == a E F. Butp(x) is irreducible in F[x]. since N is an ideal of F[x]. ao =1= 0. Then either f(x) is irreducible in F[x] or f(x) is the product of irreducible polynomials in F[x]. since p(x) == a(x)b(x).Pk(X) are monic and irreducible in F[x].::: 1. is irreducible in F[x].. In fact. Because p(x) E MeN. and this field contains F (or more precisely. + anIX + an. however...5. By the maximality of M we conclude that M == N.(x). then. Since deg 1 == < deg b(x) ~ deg(b(x)f(x» == deg 1 == 0. deg b(x) . On the other hand. Thus p(x) is irreducible. F[x]/M is a field. Therefore.. mi > 0. thus g(x)1 E N for all g(x) E F[x]. The last topic in this direction that we want to discuss is the factorization of a given polynomial as a product of irreducible ones. the decency of which lies in that p(x) has a root in K. then. MeN. so f(x) = aIp(x). 5 Polynomial Rings 161 N = (f(x» for some f(x) E F[x].. the field {a + M I a E F}. This will allow us to make more precise the uniqueness part of the next theorem. f(x) == apl(X)m 1p2 (X)m 2 ••• Pk(X)mk. . where a is the leading coefficient off(x). then 1 == b1b E N. Let N == (a(x». combined with p(x) == a(x)b(x) == b(x)f(x)p(x).Sec. hence N == M. The exact statement and explanation of this we postpone until Chapter 5. since every element in N is of this form. If p(x) is not irreducible. then so is ao1p(x) irreducible in F[x]. ao1p(x) has the advantage of being monic. D ° This theorem is important because it tells us exactly what the maximal ideals of F[x] are. Thus f(x) E M. . we have reached a contradiction.Pt(x). Theorem 4. ° . But then a(x) E N == M.mk > and this factorization in this form is unique up to the order of the p. since every element in (a(x» has degree at least that of a(x). suppose that M == (p(x» is a maximal ideal of F[x]. If M is a maximal ideal of F[x]. So we have this monic irreducible polynomial trivially obtainable from p(x) itself..::: 1. In the other direction. we get that b(x)f(x) == 1. Let f(x) E F[x] be of positive degree. This allows us to construct decent fields K :) F. namely the ideals generated by the irreducible polynomials.
So the result is true in this case.5.Pk(X) = qk(X). f(x) = a(x)b(x). then. that the theorem is correct for all a(x) E F[x] such that deg a(x) < degf(x). But then f(x) is the product of irreducible polynomials in F[x]. qi(X) are monic irreducible polynomials and the mi' ni are all positive and a is the leading coefficient of f(x). It worked for 7L and F [x] because we had a measure of size in them. . then we have nothing to prove. Thus . By the induction. for which much of the argumentation works. Hence we obtain that ml .1 (so ml = nl). with a slight. r = k and P2(X) = q2(X). Otherwise.. we get Pl(X) = qi(X). a(x) [and b(x)] is irreducible or is the product of irreducibles. Since Pl(X) If(x). thenf(x) is irreducible and the uniqueness is clear. The proof is exactly the same as that of Theorem 1.. Again we go by induction on degf(x).162 Ring Theory Ch. we have that Pl(X) I ql(X)n 1 • • • qr(x)n r . Now to the uniqueness half.1 p2 (x)m z · · · Pk(X)m k apl(X)n 1 l q2 (x)n 2 • •• qr(x)n r • By induction we have unique factorization in the required form for f(X)/Pl(X).mk = nk. Suppose that f(x) = apl(X)m 1p2 (x)m 2 ••• Pk(X)m k = aql(X)n 1 ••• qr(x)nr . This completes the induction and proves the theorem. . and so proves the opening half of the theorem. Since qi(X) is monic and irreducible. If degf(x) = 1. so by the corollary to Theorem 4. a(x) and b(x) E F[x] and deg a(x) < degf(x) and deg b(x) < degf(x). as is Pl(X). that is. . Suppose. If degf(x) = 1. then f(x) = ax + b with a =1= 0 and is clearly irreducible in F[x]. . of which the two examples 7L and F [x] are special cases. where the Pi(X).10.7.. We go by induction on degf(x). This completes the induction. We can suppose (on renumbering) thatpl(x) = ql(X). either by the size of an integer or the degree of a polynomial. namely that f(x) is irreducible or the product of irreducibles. This suggests that there should be a wider class of rings. whose degree is less than degf(x). the coefficient of the highest power term of f(x). on renumbering the q's appropriately. Suppose the result true for polynomials of degree less than degf(x).1«:)) = = ap1(X)m.4 Pr~of We first show the first half of the theorem. 0 We have pointed out how similar the situation is for the integers 7L and the polynomial ring F [x]. obvious adjustment. m2 = n2.5.Pl(X) I qi(X) for some i. This measure of size was such that it allowed a Euclidtype algorithm to hold.1 = nl .. If f(x) is irreducible. .
R[x]. . or even any. If F is a field.6x + 7 and x + 4. show that the only invertible elements in F[x] are the nonzero elements of F. An integral domain R is a Euclidean ring if there is a function d from the nonzero elements of R to the nonnegative integers that satisfies: (a) For a =1= 0. The ring so constructed. addition. 5 Polynomials Rings 163 This leads us to define a class of rings. PROBLEMS In the following problems. The final comment we make here is that what we did for polynomials over a field we could try to do for polynomials over an arbitrary ring. Definition. b =1= 0 E R. 2. we shall not go any further with this interesting class of rings. (b) Given a =1= 0. d(a) ::. we introduce the ring R[x] of polynomials in x over R. 3. is a very interesting ring. It would be too much to expect that all. Find the greatest common divisor of the following polynomials over Q. Aside from a few problems involving Euclidean rings. of the theorems proved in this section would carryover to R[x] for a general ring R. and multiplication exactly as we did in F[x]. the field of rational numbers. If R is a ring. given any ring R (commutative or noncommutative). d(ab).4x 5 + 2. Defining degf(x) for f(x) E R[x] as we did in F[x]. there exist q and r E R such that b == qa + r. (b) x 2 1 and2x 7 . The interested student should try to see which of the results proved for polynomial rings (and the integers) hold in a general Euclidean ring. b =1= 0. whose structure is tightly interwoven with that of R itself. (b) There is a commutative ring R such that we can find f(x). for F a field. just as we did F[x]. we could define the polynomial ring R[x] in x over R by defining equality. show that: (a) deg(f(x)g(x» ::. F will always denote a field. That is.Sec. degf(x) + deg g(x) if f(x)g(x) =1= O. where r == 0 or d(r) < d(a). (a) x 3 . Easier Problems 1. the Euclidean rings. g(x) in R[x] with deg(f(x)g(x» < degf(x) + deg g(x).
If g(x). . let I = {f(x)a(x) + g(x)b(x)}. 12. show that p(x) is irreducible in F[x] and that F[x]/(p (x» is a field having 121 elements. MiddleLevel Problems 13. 4. Ifp(x) E F[x] is of degree 3 andp(x) = aox 3 + aIx 2 + a2x + a3' show that p(x) if irreducible over F if there is no element rEF such that p(r) = aor 3 + aIr 2 + a2r + a3 = O. and (d). (c) x 2 + X + lover F = 7L 2 • (d) x 2 + lover F = 7L 19 • (e) x 3 90verF=7L 13 • (f) x 4 + 2x 2 + 2 over F = Q. Find d(x). Prove the uniqueness of the greatest common divisor of two polynomials in F[x] by using Lemma 4. g(x) E F[x] are relatively prime in F[x]. If Fe K are two fields andf(x). Prove the Corollary to Theorem 4.x 4 + x 3 .1. Prove Lemma 4. Show that lR[x]/(x 2 + 1) ~ C. (a) Let p(x) = x 2 + 1. show that they are relatively prime in K[x]. 11. 9.1 and x 7 . Let F = 7L II . (a) x 2 + 7 over F = real field = IR. show that p(x) is irreducible in F[x] and that F[x]/(p(x» is a field having 11 3 elements.164 Ring Theory Ch. In Problem 3. 5.5. (b).3x + 3 over F = rational field = Q. show that every element in A is of the form a + bu. 6. where a.bElRandu 2 = 1.5.3. (b) Let p(x) = x 3 + X + 4 E F[x]. where f(x). the integers mod 11. let u be the image of x in A. 8. f(x) E F[x] and g(x) If(x). (b) x 3 . show that (f(x» c (g(x». show thatf(x)g(x) I h(x). g(x) run over Q[x] and a(x) is the first polynomial and b(x) the second one in each part of the problem. Let IR be the field of real numbers and C that of complex numbers.4 (c) 3x 2 + 1 and x 6 + x 4 + X + 1. 10. [Hint: If A = lR[x]/(x 2 + 1).5. 7.] 14. If f(x). (c). g(x) E F[x] are relatively prime and f(x) I h(x) and g(x) I h(x). so that I = (d(x» for Parts (a).10.8. Show that the following polynomials are irreducible over the field F indicated. (d) x 3 .
Show that p(x) and q(x) are irreducible in F[x] and that the fields F[x]/(p(x» and F[x]/(q(x» are isomorphic.. = ban = O. P2(X). Let F be a finite field.I + . If P is a prime.. 21. If R is the ring of even integers.Sec.. show that q(x) = 1 + x + x 2 + . show that F[x]/(q(x» has exactly pn elements.fuL EB F[x] EB'" EB F[x] (q(x» (PI (X» (P2(X» (Pk(X»" 18. and let q(x) E F[x] be irreducible of degree n.. Let F = 7Lp be the field of integers mod P. If a is a complex number such that a 2 + a + 1 = 0. the first by showing it to be isomorphic to something you know is a field. Let F = 7L 7 and let p(x) = x 3 . 22.) 16. Let Pl(X). Let F. 25. show that every ideal of R is principal. (See Problem 16 for a more exact statement. then. if q(x) = aox n + a1x n ... . the second by showing that if a + ba =1= 0. Let R be a commutative ring in which a 2 = 0 only if a = O. Construct a field having P 2 elements. 5 Polynomials Rings 165 15. Show that F[x]/(q(x» is a field having at most pn elements.2 and q(x) = x 3 + 2 be in F[x]. If R is a Euclidean ring..l is irreducible in Q[x]. then its inverse is of the same form. . for P an odd prime. show that R has a unit element. 20. where P is a prime. (Hint: Try to imitate Euclid's proof that there is an infinity of prime numbers.. Harder Problems 23. b E Q} is a field in two ways. 17..) 19. 24. + an' there is an element b =1= 0 in R such that baa = bal = . Pk(X). and let q(x) = x 2 + X + 1 be in Q(x). show that the set {a + ba I a. . x p .. q(x) be as in Problem 15. 26. Pk(X) E F[x] be distinct irreducible polynomials and let q(x) = PI (X)P2(X) . Show that F[x] contains irreducible polynomials of arbitrarily high degree. show that Euclid's algorithm is false in R by exhibiting two even integers for which the algorithm does not hold. Let Q be the field of rational numbers. Show that if q(x) E R[x] is a zerodivisor in R[x]. If R is a Euclidean ring. Show that ~ = .
29. If R[x] and I[x] are the polynomial rings in x over R and I. b integers} C C..) 6. respectively. One such field is that of the rational numbers. in Q[x]. are relatively prime) and (u. ao. the particular nature of F never entered the picture. . known as the Eisenstein Criterion. However. . there are results that exploit the explicit character of certain fields. . Even over 7L 2 such an algorithm is nonexistent. p a prime. m) = 1.. (b) R[x]/I[x] ~ (R/I)[x]. In this the field Q is highly particular. an have no common factor greater than 1 (i.. All the results hold for arbitrary fields. Let d (a + bi) = a2 + b 2• Show that R is a Euclidean ring where d is its required Euclidean function. relates the factorization over the rationals with factorization over the integers. Do Problem 26 even if the condition "a 2 = 0 only if a = 0" does not hold in R. POLYNOMIALS OVER THE RATIONALS In our consideration of the polynomial ring F[x] over a field F.. For instance. m. then where u. an are integers and the ao. We shall present two important theorems for Q[x]. We begin our consideration with two easy results. Let R = {a + bi I a.1. Gauss' Lemma. These results depend heavily on the fact that we are dealing with rational numbers.. Very Hard Problems *28.6. especially for coding theory. gives us a method of constructing irreducible polynomials of arbitrary degree. at will. ab . (R is known as the ring of Gaussian integers and plays an important role in number theory. Let R be a ring and I an ideal of R. Letf(x) E Q[x]. The first of these.166 Ring Theory Ch. the polynomial ring over the rational field Q. . there is no easy algorithm for obtaining irreducible polynomials of arbitrary degree n over the field 7Lp of the integers mod p.4 27. it would be highly useful to have. But it just doesn't existso far. Lemma 4. show that: (a) I [x] is an ideal of R[x].e. The second one.
..(a) = a R = RII. . the polynomial ring in x over RII. Cn (uox n + UlX n..... Ker <1>. then I[x]. Define <I> : R[x] ~ R[x] by: If aox n + a1x n. the polynomial ring in x over I. Furthermore. + an. R[x]II[x] == (RII)[x]. . Thus each ai is in . . by the very definition of what we mean by the Opolynomial in a polynomial ring.1 + .. then <I>(f(x» = 0. Since <I>(f(x» = q.1 + . .(ao)x n + q. 1.1.. + an is in Ker <1>.(ao)x n + q.2. Ul.2. + q. 6 Polynomials Over the Rationals 167 Proof Since f(x) E Q[x]. ab .. where the qi are rational numbers.6. If R is any ring and I an ideal of R. D The next result is a result about a particular homomorphic image of R[x] for any ring R. then <I>(f(x» = q. qi = bilci' where bi' ci are integers. Un' then each Ui = wa i. the element of R [x]. f(x) = qox n + qlX n. where the Ui are integers. What is the kernel. Thus clearing of denominators gives us f(x) = COC l . of <I>? If f(x) = aoxn + . + qn.(a n) = 0. ° we conclude q. defined f(x) = + I.(a n).l + · · · + un).(a1)x n.l + ..(ao) = 0. an are relatively prime integers. We leave it to the reader to prove that <I> is a homomorphism of R[x] onto R[x].. . as is claimed in the lemma.(al) = 0. .: R ~ R. .. Lemma 4... q. Then canceling out the greatest common factor of wand COCl ••• Cn gives us where U. m are relatively prime integers. q.(a n) = 0. whose kernel is I.1 + · · ..Sec.(a1)x n. + q. where ao.. then there is a homomorphism q. is an ideal of R[x]. If w is the greatest common divisor of un.. n. Proof Let by q. So for i = 0.
Of course. Let 7L be the ring of integers. b(x) E Q[x]. Proof Since 7Lp 7L. which happens to be I.6. and [(x).. since [(x) is monic. This proves the lemma. b l .6. Then [(x) = al (x)b l (x). al.. 0 As a very special case of the lemma we have the I R = Corollary.1.xr + b1'x r m 2 1 + .b1(x) .1 + . That I[x] C Ker <1> is immediate from the definition of the mapping <1>. by canceling out the common factor of UlU2 and mlm2. If w = 1. a(x) = . Hence I[x] = Ker <1>. b. More precisely.. Thus where v and ware relatively prime. .. a. . + b') r = !!1. we may assume with no loss of generality that the leading coefficients of al (x) and b l (x) are positive. . By Lemma 4. Proof Suppose that [(x) E 7L[x] and that [(x) = a(x)b(x).. al (x).!!l (a'x + a'x s .3). where a(x). + a') = .4 the kernel of 'P. are relatively prime. deg b(x) = r. Theorem 4. b l (x) are all in 7L[x].. . then. remembering that R = RII.. So Ker <1> C I[x].. where al (x). are relatively prime integers and b(x) = !!1.!!la I l ' (x) m 0 1 s m S 1 where a~. b l (x) are monic polynomials in 7L[x] and deg al (x) = deg a(x)..3. aI. the ideal of 7L generated by p.. m 2 where b~. Therefore.(b0.3 (Gauss' Lemma). .[(x) = aox n + alX n. b(x) as a product of a rational number and a polynomial with integer coefficients. . Because ao. + an is in I[x]. and deg a(x) = S. the corollary follows by applying the lemma to = We are ready to prove the first of the two major results we seek in this section. p a prime number in 7L. where a(x) and b(x) are in Q[x].. If [(x) E 7L[x] is a monic polynomial and [(x) = a(x)b(x). the ring I[x] is then an ideal of R[x] and R[x] ~ R[x]/Ker <1> = R[x]ll[x].l + . we can express each of a(x). an are in I. deg b l (x) = deg b(x).. 0 ~ TLI I. and (p). w [(x) = val (x)b l (x).168 Ring Theory Ch. By the First Homomorphism Theorem (Theorem 4. Then 7L[x]ll[x] ~ 7Lp [x]. we get that vabb~ = 1 and this leads .
D It might be instructive for the reader to try to show directly that if x 3 + 6x . geometry. similarly.7 is the product of two polynomials having rational coefficients.1 + . Theorem 4. bl(x) ¥ 0 because p ~ a. lV.. there is a j such that p ~ b j.. Let I = (p) be the ideal generated by p in Z.Sec. + an be a nonconstant polynomial with integer coefficients.6. algebra.p I an.1 + . irreducible polynomials of degree n over a given field F may be very hard to come by. and by the astronomers as one of the important astronomers of the past. since p lab . . Proof Suppose that f(x) = u(x)v(x). v the image of v in Z[x]/I[x] is not O. considered by many to be the greatest mathematician ever. w) = 1. The image of f(x) = x n + alX n. since the coefficients ab. However. are of gigantic proportions.. then it is already the product of two monic polynomials with integer coefficients. This contradicts that Z[x]/I [x] is an integral domain.p I an. then Z/I:::= 7L p and... where u(x). since p I w. but p2 ~ an. since we know by the Corollary to Lemma 4.. thus there is a prime p such that p I wand. So w =1= 1 is not possible.6. One should say something about C. is 0. at. So if u(x) is the image of u(x) and vex) . . by the Corollary to Lemma 4.. However. Suppose that there is some prime p such that pial. . p I a2. the image of w in Z[x]/I[x]. . Suppose then that w =1= 1. these exist in abundance and are very easy to construct. so is an integral domain. which is an integral domain. Gauss (17771855). a. F. and p ~ b j for the given i. since deg al (x) = deg a(x) and deg b l (x) = deg b(x). there is an i such that p ~ a i. His contributions in physics and astronomy are also of such a great proportion that he is considered by physicists as one of their greats. p ~ v. of al (x) are relatively prime. . and the theorem is proved. As we indicated at the beginning of this section. 6 Polynomials Over the Rationals 169 easily to v = 1. vex) are of positive degree and are polynomials in Q[x]. over the rationals. and consider Z[x]/I[x]. and since p~v. His contributions in number theory. Also.2 that 7L[x]/I[x] :::= (Z/I)[x] :::= Zp[x]. ab = bb = 1 and so f(x) = al (x)b l (x). . due to the next theorem. where both al (x) and b l (x) are monic polynomials with integer coefficients..6. Thus 01 (x) = v al(x)bl(x). This is precisely the claim of the theorem.4 (The Eisenstein Criterion). Let f(x) = x n + alX n. where v ¥ 0 and aleX) ¥ 0.2. and so on. Let I = (p) be the ideal generated by p in Z. j above. Z[x]/I[x] :::= 7L p[x]. since (v.. + an in Z[x]/I[x] is x n. By Gauss' Lemma we may assume that both u(x) and vex) are monic polynomials with integer coefficients. Then f(x) is irreducible in Q[x].
7 ·5 3x + 7 .175y + 875. 53 = (5X)4 .175(5x) + 875. p any prime. Since u(x)v(x) = x n + pxrh(x) + pxnrg(x) + p2g (x)h(x). The polynomial hey) is irreducible in Z[y] by using the prime 7 and applying the Eisenstein Criterion. Let f(x) = x n . and since 1 ::.r for some 1 ::. the constant term of u(x)v(x) is p2st. hence an = p2st. We pass to a polynomial g(x) closely related to f(x) whose irreducibility in Q[x] will ensure that of f(x). Let g(x) = 53f(x) = 54 4 . the Eisenstein Criterion tells us that f(x) is irreducible in Q[x]. thus g(x) is irreducible in Q[x]. then g(x) = hey) = y4 .7x + 7. The Eisenstein Criterion applies to g(x). we get that p 2 an. Since u(x)lx n . 3. for the Eisenstein Criterion applies. vex) = x n . Let f(x) = XlI . of course. using the prime 5.6. we must have that u(x) = x r . 1. x If we let y = 5x. Let f(x) = 5x 4 .4 that of vex) in Z[x]/I[x]. f(x) is not monic.r + ph(x).) . D 1 We give some examples of the use to which the Eisenstein Criterion can be put.6x 4 + 12x 3 + 36x . then x n = u(x)v(x). r < n. (See Problem 1. Since 2122. This suggests a slight generalization of the Eisenstein Criterion to nonmonic polynomials.170 Ring Theory Ch. v(x)lx n in Z[x]/I[x]. The irreducibility of h (y) implies that of g(x). Then one sees at a glance that f(x) is irreducible in Q[x]. apply the Eisenstein Criterion to f(x). Because f(x) = u(x)v(x).) 5. their constant terms are equal. a contradiction. but we can modify f(x) slightly to be in a position where we can apply the Eisenstein Criterion. In this way we see that f(x) is irreducible. (See Problem 4.p. 22 t 22 and 2 divides the other relevant coefficients of f(x). Since sand t are integers. 4. where g(x) and hex) are polynomials with integer coefficients.4x + 22. Let g(x) = f(x + 1) = (x + 1)4 + (x + 1)3 + (x + 1)2 + (x + 1) + 1 = x 4 + 5x 3 + 10x 2 + lOx + 5. We see that f(x) is irreducible in Q[x] by using either 2 or 3 to check the conditions of the Eisenstein Criterion. and so that off(x). This implies that f(x) is irreducible in Q[x]. Let f(x) = x 5 . But then u(x) = x r + pg(x) and v (x) = x n. r < n. where s is the constant term of g(x) and t the constant term of hex). 2. as it stands we cannot. Let f(x) = x 4 + x 3 + x 2 + X + 1. in Q[x].
then f(x) is irreducible in Q[x]. Let F be a field and b =1= 0 an element of F.. b =1= 0.p I anI. If p is a prime. Let F be a field.. Let 'P be an automorphism of F[x]. P I an . Prove the following generalization of the Eisenstein Criterion. Let f(x) = aox n + alX n. Prove there exist b =1= 0.30x + a is irreducible in Q[x]. ••• . such that 'P(a) = a for every a E F. Show that there is an infinite number of integers a such that f(x) x 7 + 15x 2 . then deg 'P(f(x» = degf(x). Prove that iff(x) E F[x]. . where F is a field. Define the mapping cp: F[x] ~ F[x] by 'P(f(x» = f(x + 1) for every f(x) E F[x]. pial' pi a2 . but p2 ( an. Let Fbe the field and 'P an automorphism of F[x] such that 'P(a) = a for every a E F. Let F be a field. 10.2 + . If f(x) E F[x]. + an have integer coefficients and suppose that there is a prime p such that p (a o . 6 Polynomials Over the Rationals 171 Gotthold Eisenstein (18231852) in his short life made fundamental contributions in algebra and analysis. Prove that 'P is an automorphism of F[x] such that 'P(a) = a for every a E F. Prove thatf(x) = x 3 + 3x + 2 is irreducible in Q[x]. then so is f(x). such that 'P(a) = a for every a E F. 7. Prove that 'P is an automorphism of F[x] such that 'P(a) = a for every a E F. 3. 2. PROBLEMS 1... prove that f(x) is irreducible in F[x] if and only if g(x) = 'P(f(x» is. 8. 5. Prove that 'P is an automorphism of F[x] such that 'P(a) = a for every a E F. show that f(x) = x p . 6. Define the mapping 'P: F[x] ~ F[x] by 'P(f(x» = f(bx + c) for every f(x) E F[x]. c in F such that 'P(f(x» = f(bx + c) for every f(x) E F[x]. 11.S