Professional Documents
Culture Documents
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
281
P1: GKD Revised Pages
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
form solids, the atomic density is much greater, as re- Certain general techniques are widely used for describ-
flected by the number of atoms that are nearest neighbors ing bonding and structure in solids. Tables of atomic radii
of any given atom. This number is called the coordination are available for ionic, covalent, and metallic bonding. De-
number. An example is the molecule NaCl, in which each viations of bond lengths from values predicted by these
atom has one nearest neighbor. In solid NaCl each atom radii of order 1–3% often reveal critical structural features
has six nearest neighbors. of importance to material fabrication and properties. The
Solids in their pure forms are nearly always crystalline. cohesion of solids can be connected to the cohesion of
A crystal is a periodic arrangement of atoms along lines, the elements. A binary solid Am Bn is said to have heat of
which in turn is repeated periodically along planes. Fi- formation Hf , which is the difference between m times
nally, the planes are repeated periodically to form the crys- the cohesive energy of A plus n times that of B minus the
tal lattice. cohesive energy of Am Bn . This heat of formation can be
Most of our knowledge of crystal structures comes from estimated with often remarkable accuracy from Pauling’s
the diffraction of waves of photons, electrons, or neu- table of elemental electronegativities X (A). This is proba-
trons by lattice planes. Usually all the atomic positions bly the most widely used table in science apart from the pe-
in the crystal can be determined this way. By comparing riodic table of the elements, and it is shown here as Table I.
chemical trends in bond lengths in crystals with those in
molecules, one can often infer the nature of the electronic
charge distribution responsible for chemical bonding in II. MOLECULAR CRYSTALS
the crystal. From this it may be possible to predict the na-
ture of chemical bonding at crystalline defects or even in We now turn to the differently bonded main groups of
noncrystalline solids, which are amorphous or glassy. solids. The molecular crystals are the simplest case, be-
The structures of millions of solids are known by cause the intermolecular forces are typically much weaker
diffraction. To understand these structures one begins by than the intramolecular ones. As a result the structure of
studying the simplest cases and classifying them into the molecules, as reflected, for example, by bond lengths
groups. The main groups are characterized as molecu- and vibration frequencies, is almost the same in the solid
lar, metallic, ionic, and covalent. In most solids the actual as in the gas phase. Some examples of materials that form
bonding is a mixture to some degree of these different molecular solids are the inert gases, diatomic halogens,
kinds of chemical interaction. While most solids are com- closed-shell molecules such as methane, and many planar
plex, the inorganic solids, which are best understood be- aromatic molecules such as benzene. Typically, in molec-
cause they have had the widest technological applications, ular crystals the heat of fusion per molecule per bond is at
are usually either simple examples from a main group or least 10 times smaller than the bond dissociation energy.
are closely related to them. In contrast, organic and bio- The binding forces that hold molecular crystals together
logically important molecules may be quite complex. The may arise from electric dipoles if the molecules carry per-
chemical and structural simplicity of technologically im- manent dipole moments (e.g., HCl). When the molecules
portant inorganic solids stems from the requirement of have no permanent moment, binding arises from mutually
availability of techniques for production in bulk. induced dipole moments (van der Waals interactions).
The structures of molecular crystals are determined pri- These correspond to packing cations and anions of nearly
marily by packing considerations and thus vary from ma- equal size (CsCl structure), and then successively larger
terial to material according to molecular shape. Molec- anion/cation size ratios lead to increasing anion–anion
ular solids are generally poor conductors of electric- contacts, thus reducing coordination numbers. These “ra-
ity, and even the photoconductivity is generally small dius ratio” rules do not actually describe the crystal struc-
unless metallic impurities are added to “sensitize” the tures, as shown in Fig. 1. What this means is that the
material. ions should not be regarded as hard spheres, but rather as
centers of quantum mechanically determined electronic
charge distributions. Additional evidence for the break-
III. IONIC CRYSTALS AND down of classical electrostatic models is contained in the
ELECTRONEGATIVITY elastic constants of the alkali halides. If these models were
correct, the elastic constants would satisfy certain rela-
Before discussing the structure of ionic crystals in detail, tions (the Cauchy relations) valid for central force inter-
we shall familiarize ourselves with the concept of elec- actions. These relations are not satisfied for most of the
tronegativity, defined by Pauling as “the ability of atoms alkali halides, another indication of quantum mechani-
in the bonded state to attract electrons to themselves.” cal interactions. Some simplified modern treatments of
Atoms in solids are in a variety of bonded states, and it these and related problems are discussed in the following
is due to Pauling’s insight that we have come to realize sections.
that the atomic electronegativity that he defined in terms
of heat of formation (Section I) is indeed nearly constant
for each element. His idea is that in solids charge flows
from cations with smaller electronegativity to anions with
greater electronegativity and that the heat of formation re-
sulting from this charge flow is proportional to (X c − X a )2v ,
where X c and X a are the cation and anion electronegati-
vities, respectively.
Ionic crystals are composed of cations and anions with
very large electronegativity differences, such as alkali met-
als and halides, columns I and VII of the periodic table,
respectively. In this case the charge transfer of valence
electrons is almost complete, so that the core configu-
rations become isoelectronic to those of inert-gas atoms
(e.g., Na+ to Ne, Cl− to Ar). While some energy is re-
quired to ionize the cations and transfer electrons to the
anions, this energy is more than recovered thanks to the
larger electronegativity of the anions and the mutual at-
traction of cations by their anion neighbors. In the case
of the alkali halides, the cohesive energies can be esti-
mated within a few percentage points by assuming com-
plete charge transfer and evaluating the electrostatic en-
ergies (including ion polarization energies). A core–core
repulsive energy, required by the exclusion principle, com-
pletes the calculation, which was first sketched around
1910.
As one might expect, the overall features of the crystal
structures of ionic crystals are given quite well by pack- FIGURE 1 The structures of the alkali halides M+ X− as func-
ing spherical cations and anions in the appropriate propor- tions of classical ionic radii r + and r − , respectively. (Coordination
tions indicated by their chemical formulas. However, the numbers in parentheses are those predicted by the classical ionic
ions are not quite the incompressible spheres suggested model.) In the upper left corner, for example, Li+ I− is predicted
by the classical model to have coordination number 4, but the
by their isoelectronic analogy to inert-gas atoms. If they
symbol indicates it is actually sixfold coordinated. Key: , sixfold
were, one could use simple geometrical arguments (origi- coordinated; , eightfold coordinated; ❤, six- or eightfold coor-
nating around 1930) to predict a coordination number of 8 dinated. [From Phillips, J. C. (1974). In “Solid State Chemistry”
(CsCl structure), 6 (NaCl structure), or 4 (ZnS structure). (N. B. Hannay, ed.), Vol. 1, Plenum, New York.]
P1: GKD Revised Pages
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
FIGURE 3 Sketch of electronic interactions between directed valence orbitals that produce an energy gap between
bonding states (electrons shared between nearest neighbors) and antibonding states (no electron sharing). [From
Phillips, J. C. (1970). Phys. Today 23 (Feb.), 23.]
P1: GKD Revised Pages
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
way in which the mixture occurs is of great importance The last case is primarily ionic materials with a covalent
both scientifically and technologically. We shall discuss component. Many oxides in which the oxygen atoms are
several important examples. three- to sixfold coordinated fall in this class, and this in-
The simplest case occurs for the tetrahedrally coordi- cludes many ceramic materials. These materials can have
nated covalent structure shown in Fig. 2. This structure high melting points and good chemical stability, but they
contains two kinds of atomic sites: site A with only B are brittle and for this reason their range of technological
neighbors, and vice versa. In silicon and gemanium both applications is limited.
sites are occupied by the same atom, which has four va-
lence electrons. However, one can occupy the two sites
with different atoms, such that the total number of valence VI. METALLIC BONDING
electrons is eight per atom pair (formally represented by
A N B8−N ). Many compounds of this type with N = 3 and Broadly speaking, three kinds of elements are found in
N = 2 are known. Two examples with N = 3 are GaAs metals. They are the simple s– p metallic elements from
and InP and ternary and quaternary alloys (Ga, In) (As, P). the left-hand side of the periodic table, such as lithium, alu-
These materials are transparent in the near infrared, and minum, and lead; the rare-earth and transition elements
their optical properties can be adjusted by “band gap engi- with f and d valence electrons, such as titanium, iron,
neering.” They are important as high intensity, low power and nickel; and metalloid elements, such as carbon, sil-
monochromatic light sources or as light amplifiers (lasers). icon, and phosphorus, which may also in certain combi-
The next interesting case is the triatomic material SiO2 nations form covalent solids. In metals the coordination
(silica). The electronegativity difference between silicon number (or number of nearest neighbors) is much larger
and oxygen is large, so the bonding here contains a large (usually twice as large or more) than the number of va-
ionic component. At the same time each oxygen atom lence electrons. This means that the directed valence bonds
contains six valence electrons while silicon has four, so found in molecules or in covalent crystals are much weaker
the total number of valence electrons per molecular unit (although not completely absent) in metals.
is 16 = 8 + 8. This favors covalent bonding. In the solid The high electrical and thermal conductivity of metals
each silicon atom has tetrahedral oxygen neighbors, while is a result of the absence of a gap in the energy spec-
each oxygen atom has two silicon neighbors, which is trum between filled and empty electronic states. This high
again the coordination characteristic of covalent bonding. electrical conductivity in turn reduces the contribution to
Silica is chemically stable and can be made very pure, for cohesion associated with charge transfer because the inter-
much the same reasons that silicon can. This high purity nal electric fields are limited by electronic redistribution
is essential to technological applications in the context of or charge flow on an atomic scale. Thus, ionic interactions
optical fibers for communications. are reduced in metals compared with ionic crystals.
Another feature of silica is that it can easily be cooled The reduction of covalent or molecular bonding as well
into a solid state that is not crystalline but more like a as ionic bonding in metals presents a paradox. If nei-
frozen liquid. The state is called a glass. The ductility of ther of these bonding mechanisms is fully effective, to
glasses at high temperatures is essential to the manufac- what forces do metals owe their cohesion? Modern quan-
ture of optical fibers. However, glasses are also ductile on tum theory shows a complex correlation of the motion of
a molecular scale and so do not form molecular “cracks,” metallic valence electrons, which reduces the Coulomb
which would be arrays of broken bonds that might be elec- repulsive energy between these like charges while leaving
trically active and destructive to the electronic capabilities almost unchanged the attractive Coulomb interaction be-
of semiconductor devices such as transistors. It is one of tween negatively charged electrons and positively charged
nature’s most felicitous accidents that silicon electronic atom cores. It is this correlation energy that is primarily
devices can be packaged by simply oxidizing the surfaces responsible for metallic cohesion.
of solid silicon to form a protective coating of silica, SiO2 . From studies of the structure and cohesion of metals
The silica coating is not only chemically stable (because of it appears that d valence electrons (as in the transition
its covalent-ionic bonding), but is also mechanically stable metals) contribute almost as effectively to metallic cohe-
because of the ductility of vitreous silica at the molecular sion as s and p electrons. The f electrons in rare-earth
level. Thus, the interface between the crystalline silicon metals, on the other hand, play a minor role in metallic
electronic device and the silica coating is itself almost per- cohesion but occasionally have magnetic properties. Tran-
fect. It does not store fixed charge, even when the thickness sition metals are notable for their strong magnetic proper-
of the silica is only a few molecular layers. This greatly ties (iron, cobalt, and nickel), as well as their high melting
enhances the performance of microelectronic devices (in- points and refractory properties, which result from the
tegrated circuits on silicon “chips.” large number of combined s, p, and d valence electrons.
P1: GKD Revised Pages
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
The compound with the highest known melting point is This has been done in several ways, which are substan-
tungsten carbide (WC), an interesting combination of a tially equivalent. The simplest case is A N B M compounds
transition element whose d levels are half-full with a met- where A and B have only s and p valence electrons and
alloid element whose s and p valence levels are half-full. N + M = P = 8, which means that the s and p valence
Also, here tungsten is very large and carbon is very small, levels are half-full. In this case one can separate ionic and
which makes possible an ionic contribution to the cohesive covalent crystal structures by separating the average en-
and refractory properties. ergy gap between occupied and empty electronic states
into ionic and covalent components, represented by C and
E h , respectively. Both NaCl (ionic) and diamond, silicon,
VII. QUANTUM STRUCTURAL DIAGRAMS and germanium (covalent) crystals (Fig. 2) belong in this
group, with C/E h = 0 in the latter and C /E h largely in
The description of structure and bonding in solids given the former. The quantum structural diagram for A N B8−N
in the preceding sections is largely qualitative, but it is a nontransition-metal compounds shown in Fig. 4 not only
fair (although abbreviated) account of most of what was is a huge improvement on the classical structure diagram
generally known as a result of quantum mechanical anal- shown in Fig. 1 but also is an exact separation of covalent
ysis in the period from 1930 to 1960. Starting in 1960 a and ionic crystal structures.
more quantitative description was developed that enables
us to inspect systematic trends in structure and bonding
with the aid of quantum structural diagrams.
With a structural diagram one assigns to each element
certain characteristics and then treats these characteris-
tics as configuration coordinates, which are used to con-
struct structural maps. The natural classical configuration
coordinates are atomic size and electronegativity, as de-
fined by Pauling (see Table I). To these we may add the
number of valence electrons per atom. One then takes a
class of binary compounds, say A N B M , with the same
value of P = N + M and uses size differences (or ratios)
as well as electronegativity differences as Cartesian coor-
dinates. If the characteristics or configuration coordinates
have genuine value for describing structure and bonding,
compounds composed of different elements A and B but
with similar values of their Cartesian coordinates, should
have the same crystal structure. Put somewhat differently,
the structural map should separate into simple regions,
with each region containing compounds with the same
crystal structure.
Early attempts to construct structural maps of this kind
using classical coordinates were only partially successful;
as many as 10 or 20% or more of the compounds were
misplaced. From this failure most workers concluded that
the problem of structure and bonding in solids, and espe-
cially in metals where the number of known compounds
exceeds 104 , was simply too complex to solve in any sim-
ple way. Finding a solution was left to the indefinite future,
when computers became large enough and quantum me-
chanical methods accurate enough to predict structures on
a case-by-case basis. FIGURE 4 The separation of the energy gap shown in Fig. 3 into
Recent research has shown that the idea of structural di- covalent and ionic components (Eh and C, respectively) generates
agrams is itself valid but that previous failures arose from a structural map that separates fourfold- and sixfold-coordinated
A N B8−N crystals perfectly (no transition or rare-earth elements).
the use of largely classical coordinates. In addition to the
The structures and coordination numbers (in parentheses) are as
number of valence electrons per atom (a quantum con- follows: , diamond, zinc blend (4); , wurtzite (4); , rock salt
cept), one must also use other quantum variables to replace (6); ❤, rock salt/wurtzite (6, 4). [From Cohen, M. L., Heine, V.,
the classical variables of atomic size and electronegativity. and Phillips, J. C. (1982), Sci. Am. 246 (6), 82.]
P1: GKD Revised Pages
Encyclopedia of Physical Science and Technology EN002D-71 May 17, 2001 20:25
Chemical Physics
Richard Bersohn
Bruce J. Berne
Columbia University
739
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
physics is divided into two sections, one on phenomena units of energy used to describe spectroscopic phenom-
which depend primarily on the properties of individual ena. The SI unit of energy is the joule (1 J = 107 ergs),
and pairs of molecules and the other on phenomena which but this unit is rarely used by spectroscopists. Just as
are primarily collective. the nuclear spectroscopist uses one million electron volts
(1 MeV = 9.65 × 1010 J) as a unit of energy, the X-ray
spectroscopist uses electron volts (1 eV = 9.65 × 104 J).
I. PROPERTIES OF INDIVIDUAL The ultraviolet, visible, and infrared spectroscopists use
AND PAIRS OF MOLECULES the cm−1 unit, which is not an energy unit but a reciprocal
of a wavelength (1 cm−1 × h/c = 11.96 J). The microwave
Studies in chemical physics can be loosely classified as and radiofrequency spectroscopists use the megahertz
spectroscopic, structural, and dynamic. Spectroscopy is (1 MHz × h = 3.99 × 10−4 J), and nuclear magnetic res-
concerned with the determination of molecular energy lev- onance (NMR) spectroscopists often use a dimensionless
els. Structural studies are aimed at finding the distribution quantity, the relative frequency shift in parts per million.
of particles within a molecule and molecules within a liq- Some spectroscopists use units of energy, others units of
uid or solid. The location of the nuclei defines the structure reciprocal wave length, and still others frequency units.
of the molecule, that is, the distances between nuclei and Indeed the reference to spectral features by their wave
the angles between internuclear vectors. The distribution lengths is still a very common practice, although it is to
of electrons is intimately connected with the forces that be deplored. There are natural reasons for these differ-
hold the atoms together. Dynamics involves the relation of ent choices of energy units but they are bewildering to the
the rate of molecular transformations and changes of state beginner. In a way, these units are like the light year, a non-
caused by collision to the intra- and intermolecular forces. standard unit convenient to the astronomer but to nobody
else.
Molecular spectroscopy is generally divided into four
A. Molecular Spectroscopy
branches, each corresponding to a different type of motion
Spectroscopy is the measurement of energy level differ- and, in general, to a different frequency range. Electronic
ences. This is most usually accomplished by measuring the spectroscopy is the study of the differences in electronic
frequencies of light absorbed or emitted by a molecule, but energy levels which occur in atoms and molecules; the
it is sometimes done by measurements of the energy of an corresponding frequencies are 1014−1017 Hz. Vibrational
incident photon or particle together with a measurement spectroscopy is the study of molecular vibrations, whose
of a scattered photon or particle. For example, the fre- frequencies are typically of the order of 1012 −1014 Hz. Ro-
quency of scattered light may differ from that of the inci- tational spectroscopy is the study of the rotational energy
dent light. The absolute value of the frequency difference levels of molecules; corresponding rotational frequencies
is a difference of energy levels of the molecule divided are typically in the range of microwaves, 1010 −1012 Hz.
by Planck’s constant. This phenomenon, called Raman Finally, NMR spectroscopy is the study of the magnetic
scattering, has many analogs. Electron loss spectroscopy fields acting on a nucleus.
is extensively used to measure vibrational frequencies of The given divisions are not quite as sharp as represented.
surfaces. The difference in energy between incident and Frequency ranges overlap, and electronic spectroscopy is
scattered electrons is, in general, a quantized energy left in often a source of information of vibrations and rotations
the solid. When very slow (“cold”) neutrons are scattered as well as electronic motion. Nevertheless, this classifi-
by a warm liquid or solid, the scattered neutrons move cation of spectra is of fundamental importance. The the-
faster than those in the incident beam. oretical justification for this classification is the Born–
In some spectroscopies the scattered particle whose en- Oppenheimer approximation. This is an approximation
ergy is measured is not the same as the incident particle. that exploits the fact that the electron is 104 −105 times
For example, in photoelectron spectroscopy an incident lighter than a typical nucleus. Classically speaking, the
photon with known energy whose wavelength is in the electron revolves around the molecule so rapidly that dur-
XUV (<100 nm) or X-ray region produces a photoelec- ing an electron period the nuclei do not have time to react
tron. Careful measurements of the kinetic energy of the to the different positions of the electron and barely move.
electron yield energy level spacings in the positive ion. In Thus, electronic energies can be calculated with the as-
photodissociation spectroscopy, when a molecule is dis- sumption that the nuclei are stationary. They are not sta-
sociated into fragments by light, measurement of the ki- tionary, of course, but the electronic energy levels can (in
netic energy of a fragment yields the internal energy of the principle) be calculated for any given arrangement of the
fragments. nuclei. The arrangement or configuration that produces
Before discussing spectroscopy, it might be appropri- the least electronic energy is the equilibrium structure of
ate to warn and console the reader about the multitude of the molecule.
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
Let us pause to consider the coordinates used to describe electrons occupy three types of states: bonding, nonbond-
the shape and location of an isolated set of N atoms. The ing, and antibonding. For example, in the water molecule
configuration of a molecule with N atoms is determined there are two core electrons in the oxygen inner shell, two
by 3N coordinates. Of these, six are external coordinates pairs of bonding electrons, and two pairs of nonbonding
in that they describe the location of the center of mass and electrons. Lower energy electronic transitions are often ex-
the spatial orientation of the molecule. However, the elec- plained by promotion of electrons from nonbonding to an-
tronic energy is independent of these coordinates, for ex- tibonding states. At somewhat higher energies, electrons
ample, the position and orientation of the molecule. Thus, in bonding states may be promoted to antibonding states.
there are 3N − 6 internal coordinates (3N − 5 for a linear At still higher energies, one has Rydberg series, which are
molecule) which are, in general, vibrations. a set of atomiclike transitions in which valence electrons
The electronic energy, considered as a function of the are excited to progressively higher energy states where
nuclear coordinates, behaves as a potential energy function the electron circum navigates the ion core at, typically, a
for the nuclear motion. In a stable molecule, the atoms exe- considerable distance. Excitations of inner shell electrons
cute small vibrations about the position of the minimum of require photons with energies in the X-ray region.
electronic energy. This statement implies that, in general, Electronic spectra are identified not only by their ener-
for each electronic state the molecule will have a different gies but also by two closely related quantities: their inten-
equilibrium structure. In mathematical terms, the poten- sity and their polarization. Electromagnetic fields cause
tial in the nth electronic state can be expanded in powers transition in molecules by exciting resonances in the elec-
of the displacement about the equilibrium position, tronic or nuclear motion. The interaction energy is almost
∂V invariably expanded in multipoles, that is, in powers of the
Vn (Q) = Vn (Q0 ) + (Q i − Q i0 ) size of the molecule divided by the wavelength of light.
∂Q i 0
i This ratio is so small that for practical purposes only the
1 ∂ 2 Vn electric and magnetic dipoles interacting with the external
+ electric and magnetic fields, respectively, are considered.
2 i j ∂Q i ∂Q j
0 There is a fairly sharp distinction between the domain of
× (Q i − Q i0 )(Q j − Q j0 ) spectroscopy in which the electric and magnetic dipole
moments are important in causing transitions. The elec-
where Q is the set of vibrational coordinates and Q0 is their tric dipole moment matrix elements are typically of the
equilibrium value. If Q0 is truly the equilibrium position, order of ea0 , the electronic charge multiplied by the Bohr
then all the first derivatives in the preceding equation will radius. The magnetic dipole moment matrix elements are
vanish. If the Q i are chosen to be normal coordinates, then typically on the order of eh/mc, the electronic charge mul-
all the cross derivatives vanish as well and the following tiplied by the Compton wavelength. However, h/mc is a
equation is obtained: factor of e2/hc 1371
smaller than a0 = h 2/me2 . Transition
1 3N −6 2
∂ Vn probabilities are classically proportional to the square of
Vn (Q) = Vn (Q0 ) + (Q i − Q i0 )2 + · · · the time derivative of the appropriate dipole moment, and
2 i=1 ∂Q i0 2
quantum mechanically to the square of the matrix element
A normal coordinate is a coordinate that classically will between the initial and final states of that dipole moment
vary sinusoidally in time with a single frequency. In con- operator. Hence, magnetic dipole transition probabilities
trast, an arbitrary nonnormal coordinate will vary in time are typically four to five orders of magnitude weaker than
with a number (up to 3N − 6) of frequencies. The quan- electric dipole transitions. Magnetic dipole transitions
tities (∂ 2 Vn /∂Q i2 )0 are the force constants, the curvatures are therefore rarely observed in molecular spectroscopy
of the potential with respect to the normal coordinates i. except in paramagnetic systems where only the mag-
From this definition, it is clear that the force constant may netic dipole interaction can accomplish magnetic moment
be different in different electronic states and indeed even reorientation.
the normal coordinates may differ. The simplest example The polarization of an electric dipole transition is the
is a diatomic molecule where the equilibrium internuclear direction along which the external time-dependent electric
distance is different in two different electronic states. field must lie in order to cause the transition. Transitions in
an atom placed in a potential, independent of angle, would
1. Electronic Spectroscopy
be unpolarized in the sense that the transition probability
The electronic spectroscopy of atoms is clearly separated would be independent of direction of the time-dependent
by energy domains into transitions of valence electrons optical field. In all molecules except those with tetrahe-
and transitions of inner shell electrons. When two or more dral or octahedral symmetry, the absorption is anisotropic;
atoms combine to form a molecule, to a good approxi- in some directions there will be no absorption. The vec-
mation, core electrons remain core electrons but valence tor µi f , the dipole moment matrix element between the
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
initial and final states, is often called the transition dipole. original goal, while eventually achieved, was dwarfed by
In a linear molecule, the transition dipole must lie either an unforseen discovery. Nuclei with odd numbers of neu-
parallel or perpendicular to the molecular axis. In a pla- trons or protons have a net quantized angular momentum
nar molecule, typical of the majority of dyes, the transition Ih where I = 12 , 1, 32 , . . . . The nuclear angular momen-
dipole must lie either in the plane in some particular direc- tum implies a rotation of positive charge which in turn
tion or perpendicular to the plane. The polarization of an produces a magnetic moment. The existence of nuclear
absorption is most often determined by measuring the ab- magnetic moments was manifested by the hyperfine struc-
sorption of polarized light by an oriented set of molecules ture of atoms, an interaction energy between the nuclear
as in a single crystal. magnetic moment and a magnetic field generated by the
electrons. Nuclear magnetic moments had also been de-
2. Vibrational Spectroscopy tected by atomic beam experiments in which atomic beams
were deflected in different directions by inhomogeneous
Infrared and Raman spectroscopy are used to determine
magnetic fields; radio-frequency transitions changed the
the 3N − 6 vibrational frequencies of a molecule with N
relative intensities. However, the nuclear magnetic mo-
atoms (3N − 5 if the molecule is linear). There are both
ment could be determined only if the effective magnetic
experimental and theoretical problems. On the experimen-
field of the electron could be calculated; in the late 1940s,
tal side, the intensity of a vibrational transition depends
when NMR was developed, computers adequate to calcu-
on the square of the rate of change of the dipole mo-
late these fields did not exist. Nuclear magnetic resonance
ment during the vibration. However, certain modes have
measurements on solid or liquid samples seemed like the
no accompanying change in dipole moment and therefore
perfect solution. The moments µ could be derived from
electric dipole transitions are forbidden; examples of these
the simple equation hν = µB where h is Planck’s con-
modes are the symmetric stretch of CO2 or the vibration of
stant, B is the magnetic induction, and ν the frequency
N2 . Raman spectroscopy often can supply the missing in-
used to excite the transition between the 2I + 1 magnetic
formation. On the theoretical side, the 3N − 6 vibrational
sublevels of the nucleus.
frequencies once determined must be interpreted in terms
In practice, it was found that the apparent magnetic
of (3N − 6)/(3N − 7)/2 force constants, that is, second
moment varied with the chemical environment of the nu-
derivatives of the potential function. One instructive point
clei. The effect was due to a local magnetic susceptibility,
has emerged: Vibrational frequencies of most molecules
which, in general, increased with the number of atomic
display very little differences in the condensed phase and
electrons. For the particular case of the protons 11 H and
in the gas phase. Obvious exceptions to this rule are vibra- 13
tions in such strongly interacting molecules as the water 6 C, resonance fields (or frequencies) differ at various
nonequivalent positions and this fact serves as a deci-
molecule.
sive structural tool for organic molecules. As important
One of the frontiers of chemical physics is the vibra-
as the structure determination is, the really unique feature
tionally highly excited “hot” molecule. Vibrations are no
is the ability to measure exchange rates between differ-
longer neatly divided into a set of harmonic normal mode
ent environments. For example, cyclohexane has six axial
oscillations. Vibrations are strongly anharmonic. Vibra-
and six equatorial hydrogen atoms that interchange as the
tions and rotations are no longer clearly separated and the
molecule converts from one chair form through the boat
nature of the states are difficult to describe. Yet it is pre-
form to a second equivalent chair form. At room temper-
cisely these energy-rich molecules that are the precursors
ature, only a single resonance line is seen because the two
to chemical reactions, whether they are unimolecular or
sites interchange at a rate much faster than the frequency
bimolecular.
difference between them. When the liquid is cooled, the
rate decreases and two lines are seen. The contributions
3. Radiofrequency Spectroscopy
of NMR to solid-state physics, all branches of chemistry,
Radiofrequency spectroscopy comprises many branches biochemistry, and even medicine through the imaging of
including nuclear magnetic resonance, nuclear quadrupole internal organs have been over-whelming. The original
resonance, electron spin resonance, atomic beam reso- goal targets, the magnetic moments of the stable nuclei,
nance, optical pumping of atoms, and microwave spec- have all been measured and tabulated in reference books
troscopy of rotating molecules. The results of the last tech- but a theory has not been developed to explain them.
nique are described in the section on molecular structure.
Nuclear magnetic resonance has contributed vastly more
B. Molecular Structures
to chemistry than the other techniques.
The development of nuclear magnetic resonance is an Molecular structure is most often determined by scattering
extraordinary chapter in the history of science in which the methods. A beam of photons or material particles (chosen
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
so that their wave-lengths are comparable to close inter- tions to this rule such as PCl5 , which is PCl+ −
4 PCl6 in the
nuclear distances) is directed at the sample. At particular solid state, but the rule is generally valid.
scattering angles, constructive interference will occur be- Nuclei scatter X-rays to a negligible extent because they
tween the waves scattered by pairs of atoms or, in a crystal, are so heavy that they do not react to the time-varying
planes of atoms. By measuring the relative scattered in- electric field of the photon. The electrons will contribute
tensities at the various angles at which constructive inter- a total scattering amplitude proportional to:
ference occurs, a structure may be inferred. The structure
of a condensed phase is probed by X-rays or neutrons;
the structure of a gas molecule or a surface is commonly f = ρ(r)eis · r d V
determined by electron scattering.
In several types of spectroscopy, molecular rotational where r is a vector whose origin is at the nucleus, ρ(r) is
energies are determined from which the three moments of the electron density at the point r, s = 2π (k − k0 ), and d V
inertia can be deduced. By measuring the spectra of the is a volume element. The wave vectors k and k0 are of the
same molecule with different nuclear isotopes, additional scattered incident X-rays, respectively. The atomic scat-
sets of moments of inertia can be obtained. tering factor f varies with the atom, being larger for atoms
Molecular structure is the distribution of nuclei and with more electrons. The amplitude for scattering from an
electrons within a molecule. The location of the nuclei, of- entire unit cell of a crystal is represented (approximately)
ten just called the structure, is the molecular architecture. as a sum over atomic scattering amplitudes:
Closely connected to this architecture is the distribution
of electrons circumnavigating the heavy nuclei. The tech- F= f j eis·R j
niques for determining the nuclear positions are based on j
interferences in scattering (diffraction) or spectroscopic
measurements of moments of inertia. where f j is the scattering amplitude of the jth atoms
whose mean position is at the point R j within the unit
cell. There will be constructive interference (that is, Bragg
1. Diffraction Methods
peaks) whenever s · (R j − Rk ) is an integral multiple of
When an incident wave is scattered by a molecule, the 2π . There will be a large number of planes of atoms in
scattered wave consists of a sum of contributions, large the crystal for which this Bragg condition will be satisfied
and small, from each of the electrons and nuclei in the and therefore the crystal structure is in principle overde-
molecule. The scattered intensity, which is the square of termined.
the scattered amplitude, will consist of a sum of squares As is shown by the previous equation, the atomic scat-
of the individual amplitudes plus a sum of products of tering amplitude is roughly proportional to the number
scattered amplitudes originating from different particles. of electrons in the atom. Thus, it is particularly difficult
The former sum is independent of the distances between to observe scattering interferences from hydrogen atoms.
particles, whereas the latter is structure dependent. The Neutron-scattering amplitudes depend on nuclear cross
constructive interference between the different amplitudes sections which are of comparable magnitudes for all nu-
gives rise to significant diffraction peaks when the inter- clei. An identical expression for F is obtained except that
particle distances are comparable to the wavelength of the the f
s are now nuclear scattering amplitudes. Because
incident wave. In practice, this means that the wavelength these amplitudes are small, neutron diffraction is carried
must be of the order of an angstrom, (1 Å = 10−10 m). The out with condensed phases and not with gases.
most common “particles” used for scattering have been the Electron diffraction has also been used for determina-
X-ray (λ ∼ 1–2 Å) and the velocity-selected slow neutron tion of molecular structure. Electrons interact far more
(λ ∼ 1 Å). In a few cases, especially for diffraction from strongly with matter than X-rays and can penetrate only
surfaces, hydrogen atoms and helium atoms have been a few hundred angstroms into a crystal. They have been
used. used, therefore, for structural measurements of solid sur-
X-ray diffraction from single molecular crystals has faces and gaseous molecules. The fact that the gaseous
been our most important source of knowledge of molecu- molecules exhibit all orientations to an incoming electron
lar structure. The assumption, in most cases solidly con- beam greatly reduces the information content of the exper-
firmed by experiment, is that the forces that hold molecules iment. In other words, only systems with a small number
together in the solid are much weaker than the interatomic of structural parameters can be effectively studied. The
forces that determine the molecular structure. Therefore, scattering amplitudes for individual atoms include a nu-
the structure in the solid will be negligibly different in the clear as well as an electronic term, but the essence of the
solid, liquid, and gas phases. There are, of course, excep- structurally dependent interference remains the same.
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
2. Structure Inferred from Moments of Inertia When the chemical composition of a system changes
with time, it is less clear how to express this change math-
At very high resolution, electronic spectra exhibit rota-
ematically. When dealing with a reaction, for example, of
tional structure from which the moments of inertia of both
the form:
the ground and excited electronic states can be extracted.
Similarly, vibrational spectra taken at very high resolution A+B← → X1 ← → X2 · · · ←
→ Xn → C + D
will yield the moments of inertia in the ground and the where A and B are reactants, C and D are products, and
excited vibrational states. Rotational Raman spectra have the Xi are intermediates, known or unknown, it can often
yielded moments of inertia of small symmetric molecules. fit the observations to an empirical formula with one or
The most extensive and accurate source by far of moment more terms of the following form:
of inertia data is microwave spectroscopy. This technique
d(C)/dt = k(A)a (B)b (C)c (D)d
is applicable to any polar gaseous molecule.
Frequencies in microwave spectroscopy that are di- where a, b, c, and d are not necessarily integer constants
rectly proportional to inverse moments of inertia are usu- and the parentheses indicate concentrations of the en-
ally measured to five or six significant figures. It would closed species. The constant k is called a rate constant.
thus appear that bond distances are determined to better The complexity of this equation is due to the complexity
than 0.001 Å. In reality, there are usually more structural of the reaction mechanism. If the reaction were really ele-
parameters required than measured moments of inertia. mentary (that is, if there were no intermediates), then the
For example, in CH3 I, a pyramid-shaped molecule, the equation,
structure is determined by two bond distances and a bond d(C)/dt = k(A)(B)
angle. Because of the threefold symmetry axis, only two
would express the fact that the reaction takes place as a
moments of inertia are measurable. The usual solution is
consequence of collisions of A and B molecules.
an isotopic substitution (for example, deuterium for hy-
Except for radiative and nonradiative decay of electron-
drogen). It is often found, however, that a unique structure
ically excited molecules, the approach to equilibrium of a
cannot be found consistent with all the moments of inertia.
set of molecules is always accomplished by intermolecular
The reason is that the different vibrational wave functions
interactions of some sort. In a low-pressure gas, the cen-
in the isotopically substituted molecules yield slightly dif-
tral event is a bimolecular collision. A collision changes
ferent average structures.
the states of the pair of molecules involved in it. The most
gentle change of state is the elastic scattering that involves
C. Dynamics of Molecular Processes
only a rotation of the relative momentum of the pair. The
A system at equilibrium is characterized by a balance probability of such an event is expressed by a differential
of opposing forces and reactions so that its macroscopic cross section, dσ/d, which gives the number of the pairs
properties are independent of time. Time-dependent pro- whose relative momentum is scattered into d (at a cer-
cesses are caused by deviations from thermal equilibrium. tain orientation, ) per unit time, per unit incident flux.
For example, if there is a spatial gradient in temperature, The integral of this differential cross section over all solid
electric potential, momentum, or concentration, equilib- angles is called the total scattering cross section.
rium is restored by the transport of energy, charge, mo- In perhaps the most famous experiment of atomic
mentum, or identity, respectively. The ratios between the physics, Rutherford demonstrated that the deflection of
fluxes of these properties and the corresponding gradi- α particles could be connected with the Coulomb law of
ent are called the transport coefficients—namely, the co- force between nuclei and α particles. If the potential en-
efficients of thermal conductivity, electrical conductivity, ergy was known in advance, classical or quantum equa-
viscosity, and diffusion, respectively. tions of motion could be used to determine the differen-
Even in a spatially uniform system there can be substan- tial scattering cross section. In principle, the process can
tial departures from equilibrium. The system may not be in be reversed; that is, the theoretical potential energy act-
an equilibrium distribution over its internal states and often ing between the two scatterers can be deduced from the
approaches equilibrium (“relaxes”) exponentially with a experimental differential cross section. Indeed in princi-
rate characteristic of each degree of freedom, which is the ple if the scatterers are small enough, preferably atomic
reciprocal of the relaxation time. For example, there are in nature, the potential energy can be calculated using
electronic relaxation times, vibrational relaxation times, the Schrödinger equation of quantum mechanics. A large
rotational relaxation times, and even nuclear or electronic amount of research in atomic scattering may be briefly
spin lattice relaxation times. There are great variations in summarized as follows. Atoms interact with a very steeply
the rate of approach to equilibrium of these various de- rising short-range potential and a relatively weak long-
grees of freedom, varying from 1012 sec−1 to 10−3 sec−1 . range attraction. The long-range attraction gives rise to
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
very large scattering at small angles superimposed on a The chemical reaction itself can be treated as a colli-
much less intense isotropic scattering due to the short- sion problem in mechanics. The measured temperature-
range repulsion. dependent rate constant for a chemical reaction, k(T ),
Elastic collisions can relax translational energies, pro- is an average over the Boltzmann distributions for both
ducing a Boltzmann distribution of translational energies. translation and internal energies. There is not enough in-
However, relaxations of internal degrees of freedom can formation in the function k(T ) to permit a potential en-
only be accomplished by inelastic collisions in which en- ergy function, a surface to be extracted by an inversion.
ergy is exchanged between internal degrees of freedom An area of research, chemical dynamics is the study of
and translation or between internal degrees of freedom of reactions in which some form of state selection of the re-
both molecules. The efficiency of these inelastic collisions actants or the products has been carried out. The state se-
expressed by the magnitudes of the inelastic cross sections lection can take many forms. If the reactant molecules are
depends on the nature and strength of the interaction be- formed into molecular beams, the direction and sometimes
tween the molecules. Specifically, the intermolecular po- also the magnitude of the relative velocity will be known
tential must depend on the molecular orientations in order and the differential reaction cross section can be deter-
to cause changes of rotational state and on an internal nor- mined. A reactant can be excited to a specific vibrational
mal coordinate in order to cause changes in vibrational state and the effect on the cross section can be measured.
states. As a general rule, rotational equilibrium is usually Similarly the vibrational and rotational state distribution
achieved with only five to ten collisions, but vibrational of product molecules can sometimes be determined by
equilibrium requires hundreds to thousands of collisions. laser-induced fluorescence, multiphoton ionization, or in-
The reason is simply that intermolecular potential energies frared emission. When sufficient information is obtained,
are much more dependent on molecular orientation than on it should ultimately be possible to extract a potential sur-
the relatively small vibrational amplitudes. A more subtle face on which the reactants move and are transformed into
point is the following. The probability of an inelastic colli- products. The assumption here is that the set of reactant
sion transferring an amount of energy hω between a more molecules and the set of product molecules correspond to
or less random translation and an internal degree of free- different potential minima on the same surface. If this is
dom is proportional to the spectral density of the random not the case, the reaction is said to be nonadiabatic. The
interaction at the frequency ω. The typical frequency asso- theoretical problem is more complicated; two potential
ciated with translational motion during a collision would surfaces must be extracted as well as a perturbing poten-
be the reciprocal of a collision duration. This frequency tial that couples them.
is just right for rotations, too small for vibrations, and too
large for nuclear magnetic energy level separations. Be-
D. Lasers in Chemical Physics
cause V → T (vibration to translation) relaxation is slow,
another form of vibrational relaxation occurs earlier which The laser is a light source possessing intensity, direction-
is called V → V
. These V → V
transitions between vi- ality, coherence, monochromaticity, and (often) tunabil-
brational states of nearly the same energy, for example, ity. In the past few decades, lasers have altered the face
of most branches of experimental chemical physics. The
2HBr(v = 1) = HBr(v = 0) + HBr(v = 2)
resulting enhanced knowledge of the physical world has
have much larger cross sections than a V → T process greatly extended the applicability of theory.
such as: Lasers are of two types, cw (steady state) and pulsed.
The cw laser can be made extremely narrow in frequency,
HBr(v = 1) + M = HBr(v = 0) + M
down to KHz and even less. This narrow line width per-
where M is another molecule. mits resolution of the rotational structure of all but the
There is a hierarchy of relaxation problems in dilute heaviest gas-phase molecules and the hyperfine structure
gases. One begins with macroscopic transport processes of molecules with nonzero spin. The laser line width
whose transport coefficients can be expressed in terms of is typically narrower than the Doppler-broadened spec-
averages of scattering cross sections. These in turn can, tral line of a gas-phase molecule or atom. This gives
in principle, be calculated from an intermolecular poten- rise to a new spectroscopy called Doppler spectroscopy,
tial energy. Spectroscopically the evolution of an initially which measures the distribution of the velocity compo-
nonequilibrium distribution over internal states can be fol- nent in the direction of the probing laser. In solution,
lowed toward a Boltzmann distribution. The relaxation the narrow line width permits measuring the width of the
rate can be expressed in terms of inelastic scattering cross Rayleigh scattered light, thus permitting rapid measure-
sections that can, again in principle, be calculated from ment of the molecular diffusion coefficient and molecular
the intermolecular potential energy function. mass.
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
Pulsed lasers have durations in the nanosecond, picosec- it has been shown that by varying the relative phases of the
ond, and even femtosecond domains. Strong lamps, both different modes of which the fs pulse is composed a mul-
cw and pulsed, have been used in photochemistry for a tiphoton dissociation of a metal carbonyl can be tailored
long time, and their average power can be comparable to alter the relative yields of different photodissociation
with that of a laser. However, the short pulse duration channels.
of a ns laser enables the generation by photodissociation We finish with two examples of accepted verities which
of a much higher instantaneous concentration of reactive the laser has demolished. The statement “A light beam
intermediates such as atoms or radicals. These or their cannot resolve objects whose separation is less than its
subsequent reaction products are in sufficient concentra- wavelength,” is true in the farfield region where r
λ but
tion that their nascent quantum state distribution can be is not true in the opposite nearfield region where r λ. It
probed. This measurement must be done under single- is now possible to excite molecules by laser light and then
collision conditions in a time much shorter than the time observe emission from individual molecules. Instead of
required for thermalization of the rovibrational distribu- observing an average excited state lifetime, the lifetimes
tion. No such measurements are possible with a conven- of individual molecules are measurable. The action of one
tional light source. enzyme molecule can be measured instead of the average
The simultaneous or nearly simultaneous absorption of behavior of an ensemble of enzyme molecules.
two or more quanta is a phenomenon unique to the laser. Another truism is that “proteins cannot fly.” In other
When a strong infrared laser is focused on a molecule, words, large biomolecules cannot be studied by mass
sometimes not just two but a large, somewhat indeter- spectroscopy because they have no vapor pressure and
minate number of quanta result in a super hot molecule. if heated will be destroyed. Embedding the protein or
Focusing a visible or ultraviolet laser on a molecule often polynucleotide between thin crystal layers and then pulse
results in ionization, nonresonant or resonant. The latter evaporating the crystallite with a ns laser produces gas-
process is called resonance-enhanced multiphoton ioniza- phase macromolecules. These can be ionized and their
tion (REMPI). With its aid, a molecule in a given rovibra- masses accurately determined. An important new window
tional ground state, which is extremely difficult to detect, on biochemistry has been opened.
is converted to an easily detectible ion. A technique of ion
imaging has been developed in which a molecular frag- II. COLLECTIVE PROPERTIES
ment or reaction product is ionized and then projected
down a tube by a strong electric field. An array detector Liquids and solids are systems that contain on the order
generates a pattern of rings. The radius of the ring yields of Avagodro’s number (6 × 1023 ) of strongly interacting
the transverse velocity of the detected molecule, and the molecules. How do the macroscopic properties of such
variation of intensity around the ring is a measure of the systems depend on the properties of the constituent iso-
anisotropy of molecules with this particular speed. lated molecules? The properties of collective systems can
Another nonlinear optical phenomenon unique to the be roughly divided into structural properties and dynamic
laser is second harmonic generation in which strong light properties. The theoretical study of such collective prop-
at frequency ω is partially converted to light at frequency erties falls within the province of statistical mechanics.
2ω. This is observed primarily in media lacking a center of The macroscopic properties of condensed systems are
symmetry and is especially suited to probing the interface of central importance to almost all fields of science. These
between two media. A variation of the technique, sum fre- properties are described by phenomenological theories
quency generation, permits measurement of the infrared such as thermodynamics, hydrodynamics, electrodynam-
absorption of the interface layer. ics, and chemical kinetics. Nevertheless, it should be
Picosecond lasers make possible the measurement of clear that the laws of thermodynamics, the thermodynamic
extremely fast processes, such as the lifetime of an elec- equation of state for given materials, the laws of hydrody-
tron in a conduction band in a semiconductor. The use namics and electrodynamics, and the properties germane
of femtosecond (fs) pulses allows the observation in real to these as well as all of chemical kinetics should be deriv-
time of even faster processes, such as the separation of able from the underlying classical or quantum mechanical
fragments following an initial excitation. The separation theory of atoms and molecules (in fact, the whole of chem-
time is typically <100 fs for a direct allowed transition, istry should be predictable from mechanics.)
but it can be longer if the fragments are both heavy or if
the dissociation is indirect (i.e., from a lower electronic
A. Statistical Mechanics
state than the one originally excited). Consistent with the
uncertainty principle, a fs pulse is broad in frequency. For Statistical mechanics is a molecular theory of macroscopic
example, a 50-fs pulse is about 1000 cm−1 wide. Recently, systems. It provides the bridge between the microscopic
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
world of nuclei and electrons and the macroscopic world of temperatures does the matter wave of one molecule in-
the phenomenological theories. Starting with the Hamil- terfere with the matter wave of another molecule. This
tonian of the system, the laws of equilibrium thermody- interference can give rise to astounding properties as it
namics, hydrodynamics, electrodynamics, and chemical does in liquid He which displays superfluidity (a purely
kinetics can be derived. The Hamiltonian H is the classi- quantum mechanical phenomenon). Because most liquids
cal energy as a function of the momenta and positions of exist at high temperatures, it is possible to ignore these
all the particles in the system. It consists of the sum of the quantum effects and to treat them using classical statis-
kinetic and the potential energies. tical mechanics—that is, statistical mechanics based on
Statistical mechanics can be subdivided into equilib- classical mechanics. Although the treatment of strongly
rium and nonequilibrium statistical mechanics. The for- interacting systems is still very difficult in classical statis-
mer deals with systems in thermodynamic equilibrium, tical mechanics, progress has been made on a variety of
whereas the latter deals with the time evolution of macro- fronts. Statistical mechanical perturbation theory allows
scopic systems. the properties of fluids to be calculated using the exact re-
sults for a simple reference system such as the hard sphere
fluid. This is a fluid consisting of spheres that are not al-
1. Equilibrium Statistical Mechanics lowed to interpenetrate. The structure and thermodynamic
The thermodynamic equation of state of a system can properties of fluids can be determined by integral equa-
be derived from an expression that relates the Helmholtz tions. The behavior of systems undergoing phase trans-
free energy A N (T, V ) to the microscopic properties of the formations can be determined by renormalization group
system, techniques. The properties of real imperfect gases can be
determined by diagrammatic techniques.
A N (T, V ) = −kT ln Q N (T, V ) What is meant by the term “the structure of liquids”?
Imagine an instantaneous photograph of a liquid. The
where T is the thermodynamic temperature (degrees atoms are packed together in a noncrystalline arrange-
Kelvin), V is the volume, and ment; that is, there is no long-range order. Nevertheless,
Q N (T, V ) = exp(−E n /kT ) the positions of nearby atoms are correlated. Because the
n atoms cannot overlap, the minimum distance from one
atomic center to another is the atomic diameter. Thus,
is the canonical partition function. Thus, to derive the ther- around an atom there will be an exclusion sphere outside
modynamic equation of state (that is, the relationship be- of which near neighbors are found, but these neighbors
tween the free energy, the number of particles, the temper- once again define a region of space which excludes next
ature T , and the volume V ), Q N (T, V ) must be calculated. nearest neighbors and so on. The radial distribution func-
In these equations k is Boltzmann’s constant, the sum tion gives the average density of atoms as a function of
goes over all quantum mechanical states of the system, distance from a given atom in the system. Due to the pre-
and {E n } is the set of energy eigenvalues corresponding viously stated packing effects, this function will exhibit
to these states. These energy levels are found by solving peaks and troughs as a function of distance. At very large
the Schrödinger equation of quantum mechanics, namely, distances it will be equal to the bulk density because the
H n (q1 , . . . , q N ) = E n n (q1 , . . . , q N ) presence of an atom at one place will not be felt by another
atom very far from it. These functions can be measured by
where n (q1 , . . . , q N ) is an energy eigenfunction of the diffraction methods such as neutron scattering and X-ray
system and {q1 , . . . , q N } are the coordinates specifying the scattering. They can also be determined from statistical
positions of the particles. Thus, to predict the thermody- mechanics.
namic properties the allowed energies for the system at the
given volume have to be determined. In very dilute gases it
2. Nonequilibrium Statistical Mechanics
is possible to calculate the thermodynamic properties very
accurately. This is already an impressive achievement, but In hydrodynamics, the time evolution of fluid flow depends
liquids are more difficult to treat. In quantum mechanics, not only on thermodynamic properties, but also on such
every particle has wavelike properties. It follows that the transport properties as shear and bulk viscosities, ther-
atoms and molecules that liquids are composed of occupy mal conductivity, mutual diffusion coefficients, etc. These
a region in space characterized by a diameter proportional characterise the transport of momentum, heat, and mass.
to the particle’s thermal deBroglie wavelength, a quan- In electrodynamics, the response of systems to the impo-
tity inversely proportional to (mT )1/2 . Only at very low sition of electric and magnetic fields depends on dielectric
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
response functions of the system, and in chemical kinetics tum coordinate. Starting with a given set of positions and
the approach to equilibrium is described by empirical rate momenta, these equations are solved by finite difference
laws and chemical reaction rate constants. Expressions for techniques on a computer. At the end of each time inter-
the transport coefficients in terms of the microscopic dy- val the new positions and momenta of all the particles are
namics of the quantum or classical systems can be derived recorded. After very many time steps this gives a trajectory
by modern statistical mechanics. Moreover, it is now pos- of the whole many-body system. Thermodynamic prop-
sible to derive the equations of macroscopic rate processes erties are found by time averaging dynamical properties
such as hydrodynamics and chemical kinetics starting with over the trajectory. Likewise, time correlation functions
the Hamiltonian of the system and using the methods of and thereby transport coefficients, rate constants, differ-
nonequilibrium statistical mechanics. ential scattering cross sections, spectral line shapes, etc.
Transport coefficients can be expressed in terms of time can be determined from the trajectory. This method is
correlation functions or covariance functions of sponta- called molecular dynamics. Recently, it has been possi-
neous fluctuations in equilibrium systems. For example, ble to simulate nonequilibrium systems by clever appli-
the self diffusion coefficient can be expressed as: cations of constraints and boundary conditions. Because
simulations are done on finite systems, periodic boundary
1 x
D= dtv(0) · v(t) conditions are usually adopted to bypass unusual surface
3 0
effects. Nevertheless, it is also possible to simulate sur-
where v(0) · v(t) is the autocorrelation function of the face properties. In order to use molecular dynamics, there
velocity of a labeled particle, v. Such expressions for must be knowledge of the forces that exist between the
transport coefficients are called Green–Kubo relations. molecules. Only in the simplest systems is it possible to
All transport coefficients and chemical rate constants can determine the forces between two molecules (either by ex-
be expressed as time integrals of time correlation func- periment or by ab initio quantum chemical calculations).
tions; that is, as Green–Kubo relations. In addition, spec- Thus, molecular modeling is required before molecular
troscopic band shapes, NMR, and ESR line shapes and dynamics can be applied.
differential cross sections for the scattering of light and
thermal neutrons can be expressed as space–time Fourier
transforms of appropriate time correlation functions. The 2. Monte Carlo Simulations
theory of such processes is called linear response theory. If there is interest only in equilibrium properties, an al-
To treat nonlinear processes it is necessary to use mode– ternative to molecular dynamics is the Monte Carlo tech-
mode coupling theory. nique. This ingenious method is based on the theory of
finite Markov chains. In classical statistical mechanics,
the probability distribution for finding the particles at a
B. Numerical Statistical Mechanics
given position in configuration space is proportional to the
One of the great developments in statistical mechanics dur- Boltzmann factor, exp(−U/kT ), where U is the potential
ing the past several decades is the development of methods energy of the system in that configuration. In Monte Carlo,
for simulating strongly interacting systems such as liquids, an initial configuration is chosen. A particle is next moved
liquid crystals, solids, and glasses on computers. The for- to a new position. This move is either accepted or rejected
malism of statistical mechanics provides exact analytical based on a certain criterion. This process is repeated for
expressions. Because it is often impossible to evaluate each particle. The criterion for acceptance or rejection is
these expressions analytically, numerical methods must based on a random number generator in such a way that
be used. In fact, prior to the development of computers the configurations thus generated are distributed accord-
very little progress was made in understanding liquids ing to the Boltzmann factor. These sampled configurations
and amorphous solids. There are two major techniques give a trajectory in configuration space that looks like a
used in computer simulation—molecular dynamics (MD) random walk. Averages of position-dependent properties
and Monte Carlo (MC)—and these have contributed enor- over this random walk trajectory then give thermodynamic
mously to the theory of condensed systems. properties.
Simulation techniques have been applied to a wide vari-
ety of many-body systems. For example, hydrogen bond-
1. Molecular Dynamics
ing in water and aqueous solutions, including protein solu-
A starting point for the simulation of a classical liquid is tions, has been studied in this way. Such methods are often
the Hamiltonian of the system. If there are N particles in used as experimental tests of theoretical assumptions. In
the system, 6N equations of motion must be solved: one many cases, totally new phenomena were first observed
for each position coordinate and one for each momen- using computer simulations and only later observed in real
P1: GNH
Encyclopedia of Physical Science and Technology EN002J-99 May 17, 2001 20:50
systems. The methods of statistical mechanics and numer- • LASERS • NUCLEAR MAGNETIC RESONANCE • PHYSI-
ical statistical mechanics are applicable to many fields CAL CHEMISTRY • POTENTIAL ENERGY SURFACES • STA-
outside of chemistry. For example, the question of quark TISTICAL MECHANICS • SUPERCOMPUTERS • TRANSITION
confinement in high-energy physics can be formulated as PROBABILITIES AND ATOMIC LIFETIMES
a problem in statistical mechanics that can be treated by
Monte Carlo techniques using the Feynman path integral
representation. Another example involves the fragmenta- BIBLIOGRAPHY
tion of nuclei that can be regarded as a liquid droplet of
nucleons. Berne, B. J., and Pecora, R. (1976). “Dynamic Light Scattering: With Ap-
With the advent of supercomputers these methods will plications to Chemistry, Biology and Physics,” Wiley (Interscience),
continue to play a very large role in the study of collec- New York.
Berne, B. J., ed. (1976). “Modern Theoretical Chemistry,” Vol. 6, Parts A
tive phenomena. One day very complex reactions will be and B, Plenum, New York.
simulated on computers. Bernstein, R. B., and Levine, R. D. (1987). “Molecular Reaction Dy-
namics and Chemical Reactions,” Oxford University Press, London.
Chandler, D. (1987). “Introduction to Modern Statistical Mechanics,”
Oxford University Press, New York.
SEE ALSO THE FOLLOWING ARTICLES
Forster, D. (1975). “Hydrodynamic Fluctuation, Broken Symmetry and
Correlation Functions,” W. A. Benjamin, Reading, MA.
ATOMIC PHYSICS • ATOMIC SPECTROMETRY • CHEMICAL McQuarrie, D. A. (1975). “Statistical Mechanics,” Harper & Row, New
KINETICS, EXPERIMENTATION • KINETICS (CHEMISTRY) York.
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
Cohesion Parameters
Allan F. M. Barton
Murdoch University
233
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
Cohesion parameters provide a simple method of corre- and type j. This equation can be understood in a simple
lating and predicting the cohesive and adhesive properties way by considering what happens when unit volumes of
of substances from a knowledge of the properties of the components i and j are mixed: Two i– j interactions are
individual components only. There are, of course, nu- formed for each pair of i–i and j– j interactions broken.
merous more sophisticated theories and techniques for
this purpose, but none is as easy to use in practical
applications. A. Hildebrand Parameter
Cohesion parameters provide estimates of the enthalpy At pressures below atmospheric pressure (i.e., for tem-
changes on mixing and do not explicitly treat any devi- peratures below the boiling point), the molar cohesive en-
ations from ideal entropy changes. They are based on ergy −U of a liquid can be taken as equal to the molar
J. H. Hildebrand’s definition of a special class of “reg- enthalpy of vaporization H less the pressure–volume
ular” solutions, those involving an ideal entropy change work (which for a vapor with ideal gas behavior is RT per
on mixing despite the existence of enthalpy changes, but mole, where R is the molar gas constant and T the absolute
are now applied far more widely. temperature). This is the basis of the original definition by
The simplest cohesion parameter is the Hildebrand pa- Joel H. Hildebrand and Robert L. Scott of the solubility
rameter, commonly called the solubility parameter, val- parameter, hereafter called the Hildebrand parameter δ:
ues of which are now being included by manufacturers of
polymers and solvents in the specification sheets of their δ = c1/2 = (−U/V )1/2 = [(H − RT )/V ]1/2 . (2)
products.
This parameter was intended for use only with nonpolar,
nonassociating liquid systems that form regular solutions,
but its use has been extended to all types of material. For
I. MAIN CLASSES OF COHESION
all liquids, the value of the Hildebrand parameter may be
PARAMETERS
determined directly from thermodynamic data by means
of this equation, but it is only for regular solutions that
Pure materials exist as condensed phases (e.g., liquids,
predictions using the Hildebrand parameter are reliable.
crystals, glasses, rubbers) over certain ranges of tem-
A regular solution has an ideal entropy of formation,
perature and pressure because in some circumstances
that is, a random molecular distribution, despite the exis-
these states are more stable than the gaseous state: There
tence of interactions that lead to a nonideal enthalpy of for-
are energetic advantages in the molecules being packed
mation (heat of mixing). This means that regular mixtures
closely together. In these condensed phases, strong at-
are restricted to those systems in which only dispersion
tractive forces exist between the molecules. The cohesive
forces are acting. (Dispersion forces, or London forces,
energy −U is positive (equal and opposite in sign to the
arise from the fluctuating dipoles that result from the pos-
internal energy), and the cohesive energy per unit volume
itive nucleus and negative electron “cloud” in every atom.
V of the material is defined as the cohesive pressure or
They occur in all systems and are distinct from forces as-
cohesive energy density, given the symbol c.
sociated with molecular polarity and “chemical” interac-
Similarly, many mixtures also exist as homogeneous,
tions between molecules.) For systems like this, without
condensed phases for the same reason: There is a signifi-
the orientation and ordering effects of polar molecules,
cant cohesive pressure maintaining that state in existence.
the cohesive pressure between unlike molecules is given
The basis of the cohesion parameter approach to mixtures
to a good approximation by the geometric mean of the
is that a material with a high cohesive pressure requires
cohesive pressures of the individual components,
more energy for dispersal than is gained by mixing it with
a material of low cohesive pressure, so immiscibility (sep- c = (ic j c)1/2 .
ij
(3)
aration of phases) results, but two materials with similar
cohesive pressure values gain cohesive energy on disper- From a combination of Eqs. (1), (2), and (3), it follows
sal, so mixing occurs. that the interchange cohesive pressure is given by Eq. (4):
The change in cohesive pressure associated with the A = (ic1/2 − jc1/2 )2 = (iδ − jδ)2 .
ij
(4)
process of mixing two components i and j with their re-
spective cohesive pressures ic and jc is given by the inter- By means of this fundamental equation, it is possible
change cohesive pressure, i j A, to describe the thermodynamics of the mixing process in
terms of Hildebrand parameters. For example, the mole
A = ic + jc − 2i j c,
ij
(1)
fraction activity coefficient f x at infinite dilution of com-
where i jc is the cohesive pressure characteristic of the in- ponent j with molar volume jV in component i is given
termolecular forces acting between molecules of type i by
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
FIGURE 1 Hansen parameter δp –δh locations for various classes of organic compound. [Adapted from Klein, E.,
Eichelberger, J., Eyer, C., and Smith, J. (1975). Water Res. 9, 807.]
δ is constant for all materials and that it is the relative 2. Liquids with moderate hydrogen bonding capacity,
magnitude of the three contributions (dispersion forces, including ketones, esters, ethers, and glycol
polar interactions, hydrogen bonding) that determines the monoethers
extent of miscibility. 3. Liquids with strong hydrogen bonding capacity, such
The Hildebrand parameter has been subdivided in other as alcohols, amines, acids, amides, and aldehydes.
ways. One approach was to divide it into two main com-
ponents, defining a nonpolar cohesion parameter δd and a This classification is still widely used in practical ap-
polar parameter δo . These are related by plications, and a few typical examples are presented in
Tables I and II (Section VI).
δ 2 = δd2 + δo2 . (29)
This approach neglects both induction interactions and,
more important, specific interactions. The induction “cor- B. Geometric-Mean Corrections
rection” can be taken into account to some extent by means One of the specific assumptions in the development of co-
of the factor i jb, as shown in the following expression for hesion parameter expressions is the geometric-mean ap-
the interchange cohesive pressure: proximation [Eq. (3)]. There are several equations that
2 2 permit some correction to be made for derivations from
ij
A = iδd − jδd + i jb iδo − iδo . (30) this behavior, for example,
This is equivalent to Eq. (25) with the δh term omitted. c = (ic jc)1/2 (1 − i jl),
ij
(31)
where i jl is a dimensionless constant of the order of 0.01
II. LIMITATIONS OF COHESION to 0.1, characteristic of a given pair of materials. When this
PARAMETERS is incorporated, the empirical expression for interchange
cohesive pressure corresponding to Eq. (4) becomes
Cohesion parameters provide one of the simplest methods ij
A = (iδ − jδ)2 + 2i jl iδ jδ. (32)
of correlating and predicting the cohesive and adhesive
properties of interacting materials from a knowledge of The value of the correction term can be estimated from
the properties of the individual components. It is there- either liquid- or gas-phase data.
fore to be expected that there will be severe limitations
on their use. What is surprising is that this simple cor-
relation method works as well as it does in practice. It C. Chameleonic Behavior
is really a semiquantitative version of the statement “like The most important situation requiring caution in the use
dissolves like.” In fact, of course, whether any particular of cohesion parameters, particularly Hildebrand param-
correlation or prediction method is seen to “work” depends eters and Hansen parameters, is that in which electron
on the precision that is expected in the application being donor–acceptor interactions within components are very
considered. different from those between components. Common ex-
amples are systems involving hydrogen bonding: alcohols
(particularly methanol), carboxylic acids, water, primary
A. Burrell Hydrogen Bonding Classes
and secondary amines, and glycol ethers. Most commonly,
In the use of Hildebrand parameters, the existence of hy- it is hydrogen bonding within the pure component that
drogen bonding is the most obvious cause of discrepan- causes a user the most problems, as one tends to overlook
cies from “regular” behavior. H. Burrell was one of the this, whereas one is more on the alert for new interactions
first to attempt to deal with the hydrogen bonding factor when components are mixed.
in the application of Hildebrand parameters to practical Although this has long been understood in a general
systems by the simple expedient of dividing solvents into way, it was put most clearly by K. L. Hoy, who proposed
three classes according to their hydrogen bonding capaci- the term chameleonic for those compounds that have
ties, on the assumption that complete miscibility can occur the capacity to assume a character similar to their
only if the degree of hydrogen bonding is comparable in environment.
the components: By dimerization or intramolecular association, what
would otherwise be a polar material can behave in a non-
1. Liquids with weak or poor hydrogen bonding polar manner, thus minimizing the energy. Examples are
capacity, including hydrocarbons, chlorinated carboxylic acids (structure I), glycol ethers (II and III),
hydrocarbons, and nitrohydrocarbons and diols (IV). It is clear that the cohesion parameters
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
SCHEME 1
of associated and dissociated forms are very different, 1. The extent of cohesion within the pure compound
so the particular value exhibited will depend on the sit- 2. The potential for cohesion in a mixture.
uation in which the compound occurs. Structures such
as tetramers (V) and chains (VI) have been proposed for When one says that the Hansen hydrogen bonding pa-
methanol and methanol–ethanol mixtures. rameter δh of ethanol is 20 MPa1/2 , it means that within
In the case of water, it is necessary to distinguish be- pure ethanol δh2 = 400 MPa is the extent of cohesion due
tween systems of water with low-permittivity organic liq- to hydrogen bonding, but it is also inferred that δh reflects
uids (i.e., associated water, with limited interaction be- the hydrogen bonding capability in a mixture.
tween water and organic) and systems of water with It is clear that this will work best when the two com-
hydrogen bonding organic liquids, where there is more ponents of the mixture are similar, and as the differ-
intermolecular association at the expense of intramolecu- ences increase, situations arise where the effective δh may
lar association. be much less than or much greater than that in a pure
The difficulty in all such examples is that Hildebrand liquid.
parameters and Hansen parameters are trying to indicate As far as hydrogen bonding is concerned, some pure
two or more things with one number or one set of compounds are both proton donors and proton accep-
numbers: tors. Here, the extent of cohesion due to association in a
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
mixture generally can have any value up to that in a pure used in conjunction with various other theoretical or em-
compound. Examples are alcohols, carboxylic acids, wa- pirical parameters to provide more effective predictions
ter, and primary and secondary amines. of solvent properties. Several examples of the parameters
In contrast, many compounds have a dominant capacity that have been used in this way to form “hybrid” cohesion
to accept protons: ketones, aldehydes, esters, ethers, ter- parameter scales are now described.
tiary amines, aromatic hydrocarbons, and alkenes. Here,
the potential for cohesion in mixtures due to hydro- A. Hydrogen Bonding Parameters
gen bonding is greater than that in pure compounds.
There are also some that are proton donors only, such The Burrell hydrogen bonding classification (Section II.A)
as trichloromethane. has been developed further by assigning quantitative val-
It is clear that neither Hildebrand parameters nor ues to the hydrogen bonding capabilities of liquids and
Hansen parameters are adequate to handle this problem plotting graphs of Hildebrand parameters against hydro-
in a quantitative way. There are two possible approaches: gen bonding capacities.
An alternative quantitative measure of the capacity of
organic liquids to donate and accept hydrogen bonds is
1. To treat each associated species in both the pure liq-
sound velocity in solvent–paper systems. Paper fibers are
uids and the mixtures as new compounds, with formation
held together largely by hydrogen bonds, and on wetting
constants that can be evaluated. This type of approach is
paper with a liquid, most disruption of the fiber bonding
traditional and well established, but rather cumbersome.
occurs in the presence of those solvents that preferentially
2. To use a full set of interaction cohesion parameters
form fiber–liquid hydrogen bonds. As a result, the velocity
with separate Lewis acid and base contributions as well as
of sound through the paper, which depends on the degree
dispersion, induction, and orientation terms. This method
of bonding, decreases as the liquid hydrogen bonding ca-
has the greater practical potential.
pability increases. Water, as the most common solvent for
disrupting fiber–fiber bonds in paper, was chosen as a ref-
On reference to Eq. (20), it can be seen that when i jAab
erence standard, and the hydrogen bonding parameter was
is large and negative, exothermic mixing (with evolution
defined as
of heat) may be explained, in contrast to the restriction to
athermic or endothermic processes, when only dispersion 100 × sound velocity in water-soaked paper
and polar forces exist. Thus, the answer to the criticism sound velocity in liquid-soaked paper
that Hildebrand parameters cannot cope with molecular
association or exothermic interactions is that they should B. Electrostatic Parameters
be expressed in the form of interaction cohesion param- The cohesive properties of materials are closely related
eters. The price to be paid is greater complexity in both to the various electrostatic properties, such as dipole mo-
evaluation and use. ment, relative permitivity, polarizability, ionization poten-
tial, and refractive index. Both dipole moment and relative
permittivity are of particular importance: In the absence of
III. COHESION PARAMETERS AND specific association, the dipole moment tends to determine
OTHER SOLVENT SCALES the orientation of solvent molecules around molecular so-
lutes, while dissolution of ions is promoted by high relative
Parameters describing and correlating the solvent permittivity of the solvent. The electrostatic factor, which
capacities of liquids have been based on a great variety is defined as the product of dipole moment and relative
of chemical and physical properties. Some are measures permittivity, takes both effects into account and provides
of liquid “basicity.” Others are direct determinations of the a basis for the classification of solvents. The fractional
solubility of a representative solute in a range of liquids, polarity is also noteworthy: The natures of molecular in-
for example, the solubility of hydrogen chloride in the teractions have been discussed in terms of the fraction
solvents at 10◦ C. of the total interactions due to dipole–dipole or orienta-
Once a solubility scale has been established, it is nec- tion effects. This fractional polarity can be calculated from
essary to determine the position on it of any required sol- the dipole moment, polarizability, and ionization potential
vent. If the solubility scale has a theoretical basis, it may once certain assumptions have been made.
be possible to calculate values from information on other
properties, but if it is an empirical scale, direct testing is
C. Spectroscopic Parameters
usually required.
Because Hildebrand parameter values can be deter- Numerous parameters have been developed on the ba-
mined readily and are widely available, they have been sis of spectroscopic measurements. One direct hydrogen
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
bonding parameter is based on the effect that a liquid has tio is widely used since it is a direct measure of the tol-
on a small amount of alcohol introduced into that liquid: erance of a solvent–resin mixture to added diluent, and
The greater the extent of hydrogen bonding between the qualitative correlations exist between dilution ratios and
liquid and the alcohol hydroxyl groups, the weaker the Hildebrand parameters. The heptane number of hydrocar-
O—H bond and the lower the frequency of the infrared bon solvents is a measure of the relative solvent power
radiation absorbed by that bond. If deuterated methanol of high-solvency hydrocarbons in the presence of resins
or ethanol is used, the O–D stretch band is in a spectro- not soluble in heptane. The wax number and other mis-
scopic region with little interference, permitting detection cibility numbers may also be correlated with the solvent
at low alcohol concentrations. The extent of the shift to Hildebrand parameter.
lower frequencies of the O–D stretching infrared absorp- Another type of empirical solvent classification scheme
tion of deuterated alcohol in the liquid under study thus has been developed in connection with chromatogra-
provides a measure of its hydrogen bond acceptor capa- phy, where it is useful to distinguish solvent strength or
bility. The spectrum of a solution of deuterated methanol “polarity” from solvent “selectivity.” Gas–liquid chro-
or ethanol is compared with that of a solution in benzene matographic methods are particularly convenient for
or other reference liquid, and the hydrogen bonding pa- quantitative characterization of the solvent properties of
rameter is defined as 10% of the O–D absorption shift the stationary phase, whether this is a liquid or a polymer
expressed in wave numbers. The choice of the reference (see Section V.A). One approach is to define a polarity
liquid is important. Benzene is not an inert solvent, the index, a measure of the capacity of a liquid to dissolve or
aromatic electron system having some hydrogen acceptor interact with various polar test solutes. There is, in general,
properties, and there is even the possibility of some hydro- a good correlation between the polarity index values and
gen bonding between methanol and tetrachloromethane, Hildebrand parameter values, but liquids such as diethyl
another reference solvent. It appears that alkanes such as ether and triethylamine that are strong proton acceptors
cyclohexane, heptane, and isooctane may be preferable as but have no proton donor capacity have Hildebrand pa-
standards. rameter values similar to those of alkanes, although they
Spectroscopic hydrogen bonding parameters form a show up as moderately polar on the polarity index scale.
special case of a more general type of parameter described This is because Hildebrand parameters are based on pure
by such names as “electron donating power” and “electron liquid properties, while the polarity index is based on the
accepting power.” interactions between different liquids.
There are many empirical tests in common use for quan- Often cohesion parameters are included in hybrid solvent
tifying solvent behavior. One group of tests describes the scales which incorporate several kinds of parameters. For
“solvent power” or “strength” of liquid hydrocarbons. The example, a “universal solubility” treatment includes the
aniline point or aniline cloud point, which is based on Hildebrand parameter as a measure of the energy neces-
the fact that aniline is a poor solvent for aliphatic hydro- sary to create a cavity in the bulk liquid to accommo-
carbons and an excellent one for aromatics, is defined as date a solute molecule. This contribution is combined
the minimum equilibrium solution temperature for equal with parameters allowing for solute–solvent dipole in-
volumes of aniline and solvent. An approximately quan- teractions, hydrogen-bond donor acidity, hydrogen-bond
titative correlation exists between Hildebrand parameters acceptor basicity, and an empirical coordinate covalency
and the aniline point for hydrocarbons, but it is of lim- parameter.
ited value. The kauri–butanol number (KB) is a measure
of the tolerance of a standard solution of kauri resin in
IV. APPLICATIONS OF COHESION
1-butanol to added hydrocarbon diluent. There is an ap-
PARAMETERS
proximately linear relationship between the Hildebrand
parameter and the KB number for hydrocarbons with
A. Liquids
KB > 35:
For a pair of regular liquids, the infinite dilution activity
δ/MPa1/2 = 0.040 KB + 14.2. (33)
coefficient expression in terms of Hildebrand parameters
The solvent power of “chemical” or “oxygenated” liq- is given by Eq. (5). For dilute solutions with specific inter-
uids (such as alcohols, ketones, esters, and glycol ethers) actions and size effects, resulting in nonzero enthalpies of
is much greater than that of hydrocarbons, and different mixing and nonideal entropies of mixing, the expressions
scales are necessary for their description. The dilution ra- become more complex but are still useful.
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
For systems composed of two liquids that have sub- miscibility between the phases. However, if a liquid can be
stantial but incomplete mutual miscibility, cohesion pa- obtained that specifically interacts with the component to
rameters may be used in combination with other empirical be extracted, then both the selectivity and the solvent ca-
expressions. Although satisfactory data correlation is pos- pacity are increased. For example, in the case of an alkane
sible, prediction of the exact extent of mutual solubility is i (for which both iδa and iδb are negligible), an aromatic
difficult. hydrocarbon j ( jδa negligible), and an electron-accepting
Cohesion parameter methods have been used for the liquid k (kδb negligible), the term for the specific interac-
correlation of activity coefficients of a wide range of sys- tion is given by 2 jV jδb kδa .
tems. One of the most flexible approaches to the corre- Another common situation in which cohesion parame-
lation and prediction of activity coefficients has been to ters have been employed involves solvent extraction of an
include terms for some or all of the following: electrically neutral ion pair or complex used to transport
ionic species distributed in relatively dilute solution be-
r Regular enthalpy of mixing from Hildebrand tween two very dissimilar phases such as hydrocarbon and
water. As in all other applications of cohesion parameters,
parameters it should not be expected that distribution ratios can be pre-
r Polar orientation and induction effects
r Entropy effects from hydrogen bonding equilibria dicted in detail by means of cohesion parameters, and even
r Gibbs free energy term for the breaking of hydrogen if the infinite dilution values are approximately correct,
these have limited applicability. Rather, the emphasis is
bonds in association interactions on formulating broad trends and establishing correlations.
As well as this situation of a gas possessing a hypo- entropy of melting are known for approximately regular
thetical liquid-like molar volume when it dissolves in a systems, it is possible to calculate the data necessary for
liquid at low pressures, there is also the case of gases phase diagrams, either temperature against ixs plots for
that liquefy or achieve liquid-like molar volumes because binary systems or triangular diagrams with temperature
of low temperatures and/or high pressures. The solvent contours for ternary systems.
properties of compressed gases, especially carbon diox- The more detailed and informative interaction cohesion
ide, for biochemicals and polymers have been receiving parameters have not been widely used in the correlation
more attention recently because of their application in of solid–liquid solubilities. Hansen parameters have been
high-pressure gas chromatography and supercritical fluid extended to ionic solids, but the determination of their
chromatography. values has not been pursued to any great extent. Any
Cohesion parameter concepts are being utilized for extension of interaction cohesion parameters to ionic
studies of cryogenic liquids, refrigerants, and aerosol pro- systems in the future would be very valuable in dealing
pellants, as well as fundamental dense gas and vapor– with solid–liquid systems.
liquid properties. For example, it is reported that even
at 25 K above their critical temperatures, compressed D. Polymers
gaseous helium (δ ≈ 8 MPa1/2 ) and xenon (δ ≈ 16 MPa1/2 )
at 200 MPa pressure separate into two phases because of The parameter most commonly used in the discussion of
their divergent cohesion properties. polymer solutions is the polymer–liquid interaction pa-
rameter χ . This parameter has been associated with var-
ious theoretical treatments, but it can be considered a
C. Solids general, dimensionless parameter reflecting intermolec-
The regular-solution theory and the Hildebrand parameter ular forces between a particular polymer and a particular
are based on the enthalpy changes occurring when liquids liquid. It was introduced as an enthalpy of dilution with no
are mixed. In order to extend these concepts to solutions entropy component but has been considered subsequently
of crystalline solids in liquids, it is necessary to estimate to be a Gibbs free energy parameter. As originally formu-
the thermodynamic activity of the solid referred to a hypo- lated, the polymer–liquid interaction parameter was ex-
thetical liquid subcooled below its melting point. If this is pected to be inversely dependent on absolute temperature
done, expressions such as Eq. (5) can be used to evaluate and independent of polymer concentration, but as now em-
cohesion parameters of nonvolatile solutes or to estimate pirically defined it depends in an unspecified way on tem-
their solubilities. perature, solution composition, and polymer chain length.
The Hildebrand parameter, defined in Eq. (2), has a liq- It can be shown that the enthalpy part of the interac-
uid state basis so this hypothetical liquid reference state tion parameter, χH , is related to the interchange cohesive
is necessary when the method is extended to solids. Use pressure by
of the vaporization enthalpy or sublimation enthalpy of a χH = i jAiV /RT. (36)
solid at 25◦ C in Eq. (2) uncorrected for the crystal–liquid
transition enthalpy change does not yield a Hildebrand If Hildebrand parameters are used, from Eq. (4),
parameter. It is another type of cohesion parameter that χH = (iV /RT )(iδ − jδ)2 (37)
should not be confused with any of those defined in
Section I. and in terms of Hansen parameters
Despite this complication, the solubility ( jxs on the mole 2 2 2
fraction scale) of a solid j in a liquid i has been shown χH = (iV /RT ) iδd − jδd + iδp − jδp + iδh − jδh .
to vary regularly with the Hildebrand parameter of the (38)
solvent, plots of log jxs against iδ being nearly linear or
only slightly curved. The enthalpy component of the polymer–liquid interac-
It is possible to derive more informative thermodynamic tion parameter is therefore closely related to the cohesion
expressions that involve various assumptions, and several parameters of the components of the polymer solution.
of these predict that, for solutions that approach regular Cohesion parameter predictions of thermodynamic
behavior, plots of log jxs against (iδ − jδ)2 should be ap- property values must be corrected to allow for the sub-
proximately linear, and this relatively simple method is stantial size differences between polymer and solvent
probably the most widely used method of correlating solid molecules. However, if only semiquantitative “compat-
solubilities in terms of Hildebrand parameters. ibility” information is required, cohesion parameters of
If iδ, jδ, and jV (for the subcooled liquid j correspond- amorphous polymers may be used in the same way as those
ing to the solid of interest) as well as the melting point and of liquids. When crystalline polymers are considered, it is
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
FIGURE 2 Hansen parameter three-dimensional model showing the extent of overlap between poly(phenylene oxide
phosphonate ester) (spherical) and cellulose acetate. [Adapted from Cabasso, I., Jaguer-Grodzinski, J., and Vofsi, D.
(1977). In “Polymer Science and Technology” (D. Klempner and K. C. Frisch, eds.), Vol. 10, p. 1, Plenum, New York.]
in principle necessary to calculate the activity of the crys- in Section I.C and illustrated in Fig. 2, it is possible to
talline solid relative to the real or hypothetical amorphous present this information in the form of irregular volumes.
material at the same temperature. In practice, the distinc- Three-dimensional models, stereographs, sets of projec-
tion between amorphous and crystalline polymers is not tions, and contour maps have all been used. In many cases
clear-cut, and nonthermodynamic empirical methods are the solubility behavior of a polymer can be adequately
frequently used in the estimation of polymer cohesion represented in two dimensions by δ and δh or by δ and
parameters. δp . Another useful type of plot involves the subdivision of
It is not correct to assume that the “best” solvent for the Hildebrand parameter into a “volume-dependent” part
a polymer is necessarily the corresponding monomer or δv = (δd + δp )1/2 and a “residual” or hydrogen bonding
low molecular weight liquid polymer (“oligomer”) made part δh . Triangular diagrams also provide a method of con-
up of the same repeating units. Such liquids usually have veying information in three properties on two-dimensional
cohesion parameter values lower than those of the poly- plots by making the excessively simple assumption that the
mers because their molar volumes are higher. Rather, the Hildebrand cohesion parameter is uniform for all materi-
best solvent is another liquid with a cohesive energy rather als and that it is the relative magnitude of the three con-
higher than that of the monomer but with a cohesive pres- tributions that determines the extent of miscibility. This is
sure the same as that of the polymer. illustrated in Fig. 3.
In one of the original applications of cohesion parame- Also widely used for polymers are hybrid maps of co-
ters to polymer solutions, the liquid Hildebrand param- hesion parameters with other quantities such as hydro-
eter was combined with the Burrell hydrogen bonding gen bonding parameter, dipole moment, and fractional
classification (Section II.A). Hansen’s three-component polarity.
cohesion parameter system was also first developed for Cohesion parameters are widely used to predict poly-
polymer–liquid systems. mer solubility or swelling in liquids, but some liquids also
As well as the assumption of spherical Hansen parame- affect the mechanical properties of polymers by environ-
ter “volumes” of polymer–liquid interaction, as described mental stress cracking or crazing. Whatever the cause of
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
into categories of hydrogen bonding capacity, as indicated ties, together with their ratios and products, can be esti-
in Table I. The Hildebrand parameter of a solute can then mated in terms of standard contributions from groups of
be taken as the midpoint of the range of solvent Hilde- atoms.
brand parameters that provides complete miscibility or The molar cohesive energy can be represented by the
the particular value that provides maximum solubility, or summation of atomic or group contributions:
maximum swelling in the case of a cross-linked polymer.
An ASTM test method for polymer solubility ranges uses −U = − z
U. (40)
z
mixtures of solvents to provide a spectrum of closely
spaced Hildebrand parameters. Other physical proper- Hildebrand parameters can be calculated from
ties that can be used as well as solubility and swelling
1/2
−U 1/2
include viscosity and related properties such as grease δ= = − z
U z
V . (41)
dropping points. A range of semiempirical equations is V z z
also available for correlation and prediction of cohesion
Also useful are the group molar attraction constants z F
parameters.
defined by
C. Homomorph Methods
z
F =− z
U z
V = δV (42)
z z z
for Hansen Parameters so
In any multicomponent cohesion parameter system such
as Hansen parameters, there arises the problem of eval- δ= z
F z
V. (43)
z z
uating the components of the Hildebrand parameter sep-
arately. One obvious approach is to compare the prop- Values of z F and z V have been tabulated for the most
erties of compounds that differ only in the presence or common organic molecular groups.
absence of a certain group. Here, the homomorph con-
cept is useful: The homomorph of a polar molecule is a
nonpolar molecule having very nearly the same size and VI. SELECTED VALUES
shape.
For liquids, the Hansen dispersion parameter obtained It is neither appropriate nor practicable to provide here
by homomorph methods can be subtracted from the to- a comprehensive compilation of values, but exhaus-
tal cohesion pressure using Eq. (22), with the remainder tive tables are included in the Handbook of Solubil-
being split into Hansen hydrogen bonding and polar pa- ity Parameters and Other Cohesion Parameters listed
rameters so as to optimize the description of the solubility in the bibliography. Rather, listed in Table I are typi-
and swelling behavior of a range of liquids and polymers. cal values for some liquids whose Hildebrand parameter
Both empirical methods and group methods (see next sec- values, Hansen parameter values, and interaction cohe-
tion) can be used. Once the three Hansen parameters for sion parameter values are known with reasonable re-
each liquid are evaluated, the Hansen parameters for each liability. The Burrell hydrogen bonding classification
polymer can be obtained. (Section II.A) is also included. There is considerable vari-
This method may distort the relative magnitudes of the ation in the Hansen parameters reported for water. A
intermolecular forces, but as pointed out in Section I.C, study of the solubilities of a range of organics in wa-
the theoretical bases of Hansen parameters are not good ter suggests δd = 20, δp = 18, δh = 18, and δt = 32 MPa1/2
in any case. rather than the results in Table I, which are more con-
Interaction cohesion parameters could, in principle, be sistent with the behavior of water in organic liquids.
evaluated in a similar way, but there has been little activity This variability in Hansen parameter values is a fun-
in this area. damental problem associated with the use of the sin-
gle parameter δh , rather than the pair of acid and base
parameters.
D. Group Contribution Methods
For polymers, interaction cohesion parameters are yet to
Many properties of materials change in a regular way with be determined in any detail, so the values given in Table II
increasing chain length in a homologous series, and some are restricted to ranges of polymer Hildebrand parameters
properties are conveniently linear. The miscibility behav- (for use with solvents of specified Burrell hydrogen bond-
ior of materials depends to a large extent on the cohesive ing class) and sets of Hansen parameters [together with the
and volume properties, specifically the molar cohesive interaction radius i R of Eq. (25)]. Table III presents pre-
energy −U and the molar volume V , and these quanti- ferred Hildebrand parameter values for some well-studied
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
TABLE I Typical Hildebrand Parameter, Hansen Parameter, and Component Cohesion Parameter Values
Burrell
Component cohesion Hansen
Hildebrand hydrogen Molar
parameters (MPa1//2 ) parameters (MPa1//2 )
parameter bonding volume
Liquid (δ t /MPa1/2 ) δd δo δi δa δb δd δp δh class (cm3 mol−1 )
polymers. For solid materials, in general only Hildebrand number of practical applications of cohesion parameters
parameters are available (Table IV). and of computational methods simplifying the process.
An excellent example of the determination and applica-
tion of Hildebrand parameters to a “new” solvent and its
VII. CURRENT STATUS compatibility with polymers is provided by 1,8-cineole.
This compound, present at high levels in the leaf oils of
Theoretical development of this topic appears to have some eucalypts, is proposed as a replacement for the sol-
reached something of a plateau, but there is an increasing vent 1,1,1-trichloroethane (which is now known to cause
TABLE II Typical Hildebrand Parameter and Hansen Parameter Values for Polymers, δ (MPa1//2 )
Hildebrand parameter
ranges in liquids of Burrell
Hansen parameter hydrogen bonding class
Polymer (manufacturer) δd δp δh jR Poor Moderate Strong
Pentalyn 255 alcohol-soluble resin (Hercules) 17.6 9.4 14.3 10.6 18–21 15–22 21–30
Pentalyn 830 alcohol-soluble rosin resin (Hercules) 20.0 5.8 10.9 11.7 17–19 16–22 19–23
Cellulose nitrate, 0.5 sec 15.4 14.7 8.8 11.5 23–26 16–30 26–30
Cellolyn 102 pentaerythritol ester of rosin, modified (Hercules) 21.7 0.9 8.5 15.8 16–21 17–22 21–24
Versamid 930 thermoplastic polyamide (General Mills) 17.4 −1.9 14.9 9.6 — — 19–23
Poly(methyl methacrylate) (Rohm and Haas) 18.6 10.5 7.5 8.6 18–26 17–27 —
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
TABLE V Relative Resistance of Polymers to 1,8-Cineole ing listed by companies such as Shell, DuPont, Monsanto,
(δ = 18 MPa1/2 ) Aeropres, Lal, and Eastman.
Effect of cineole A smaller but increasing number of sites deal with
on polymer compressed gases and supercritical fluids, for example,
(4-month continuous δ Polymerizations in Supercritical CO2 from the University
exposure) Polymer (MPa1//12 )
of Groningen, <http://polysg2.chem.rug.nl/>.
Soluble Natural rubber 17.0 Solvent applications of fluids have always been an
Polystyrene 18.5 important use of cohesion parameters, such as A Re-
Styrene-butadiene elastomer — view of Supercritical Carbon Dioxide Extraction of Nat-
Strongly swollen Neoprene 18.5 ural Products from Engineering World, <http://
(>100%) www.exicom.org/cew/oct97/awasthi.htm>.
Polyurethane rubber 20–21 Chemical properties, as well as physical properties,
Silicone rubber 15.5 are now being correlated by means of cohesion parame-
Little swelling Acrylic-styrene-acrylonitrile ∼22 ters, for example, Solution Effects on Cesium Complexa-
(<10%) tion with Calixarene—Crown Ethers from Liquid to Su-
Polyester urethane ∼20 percritical Fluids (University of Idaho), <http://www.
Polyester urethane ∼20 doe.gov/em52/65351.html>.
Polycarbonate/acrylic-styrene- — Liquid crystal applications are still poorly repre-
acrylonitrile sented, and studies of solids are mostly limited to
Fluoro elastomer — pharmaceuticals, such as Partial-Solubility Parameters
Resistant Acrylonitrile-butadiene-styrene — of Naproxen and Sodium Diclofenac from the Journal
Acetal 21–22 of Pharmacy and Pharmacology, <http://dialspace.dial.
Nylon-6 24–32 pipex.com/town/avenue/ax60/j98020.htm>.
Nylon-6,6 28–32 As in the printed literature, applications to polymers
Polyethylene 17 dominate internet cohesion parameter sites, for example,
Poly(butylene terephthalate) 21–22 American Polywater Corporation on polycarbonate
Poly(ethylene terephthalate) 21–22 stress cracking, <http://www.polywater.com/cracking.
Poly(methyl methacrylate) 20.5 html>; as well as Millipore, <http://millispider.millipore.
Polypropylene 16.5 com/micro/mieliq/MAL104.htm>; and Evaluating En-
Polycarbonate 19–22 vironmental Stress Cracking of Medical Plastics from
Styrene-acrylonitrile — Medical Device Link, <http://www.devicelink.com/mpb/
Poly(tetrafluoroethylene) 13 archive/98/05/001.html>.
Poly(vinyl chloride) 19.5 Polymer design is represented by IF/Prolog, <http://
www. ifcomputer.com/ Products/ IFProlog/Applications/
r Scientific and technical consultancies use cohesion PolymerDesign/home–en.html>.
parameters to promote their services. Surfaces receive less attention than homogeneous
r Institutions and individuals include reference to their fluid and solid phases, although coatings and adhesives
publications or research activities in resumes and are important, with typical sites being University of
bibliographies, such as the University of Geneva, Missouri—Rolla, <http://www.umr.edu/∼wlf/Adhesion/
<http://www.unige.ch/sciences/pharm/fagal/barra∼4. young.html>, and Shell Chemicals, <http://www2.shell-
html#pubi>, and Princeton University, <http://www. chemical.com/CMM/WEB/GlobChem.NSF/Searchv/SC:
princeton.edu/∼chemical/faculty/pubs.html>. 2023-94?OpenDocument>.
Chromatographic properties, such as Relationship
As with most other areas of the physical sciences, the Between Retention Behavior of Substituted Benzene
amount of internet-accessible material describing the Derivatives and Properties of the Mobile Phase in RPLC
theoretical background is limited. Probably the most in the Journal of Chromatographic Science, <http://www
comprehensive source is that written by John Burke .j-chrom-sci.com/353sun.htm>, provide additional possi-
(Oakland Museum of California), <http://sul-server-2. bilities for applications of cohesion parameters to complex
stanford.edu/byauth/burke/solpar/>. systems.
For values of individual liquids, by far the greatest Nanoparticles are beginning to be discussed in terms
amount of quantitative information is provided in the of cohesion parameters, for example, on the formation
pages of Charles Tennant & Co, <http://www.ctennant.co. of gliadin nanoparticles: the influence of the solubility
uk/tenn04.htm>, with smaller numbers of compounds be- parameter of the protein solvent, <http://link.springerny
P1: LDK/LSK/MAG P2: FYK/FFU QC: FYD Final
Encyclopedia of Physical Science and Technology EN003C-118 June 13, 2001 21:8
.com/link/ service/ journals/ 00396/ bibs/ 8276004/ ple, Molecular Analysis ProTM , <http://www.chemsw.
82760321.htm>, but dyes are surprisingly under- com/13051.htm>.
represented. While internet sources do not provide the comprehen-
Among environmental applications, the area of sive and integrated information available in the Handbook
materials substitution is the most potentially productive of Solubility Parameters and Other Cohension Parame-
use of cohesion parameters, for example, Toluene ters (Barton, 1992) or even the convenient summary in
Replacement in Solvent Borne Pressure Sensitive Ad- the Encyclopedia of Physical Science and Technology,
hesive Formulations (Shell Chemicals), <http://www2. they should not be overlooked for information on recent
shellchemical.com/CMM/WEB/GlobChem.NSF/Searchv/ developments.
SC:2023-94? OpenDocument>; Solvents—the Alterna-
tives from the Waste Reduction Resource Center, SEE ALSO THE FOLLOWING ARTICLES
Raleigh, NC, <http://www.p2pays.org/ref/01/00023.htm/
index.htm>; and Solvent Substitution Data Systems (U.S. BONDING AND STRUCTURE IN SOLIDS • GAS CHRO-
EPA), <http://es.epa.gov/ssds/ssds.html>. MATOGRAPHY • HYDROGEN BONDS • LIQUIDS, STRUC-
Database and compilations include ThermoDex, TURE AND DYNAMICS • MOLECULAR HYDRODYNAMICS
<http://thermodex.lib.utexas.edu/search.jsp>, and Info- • SURFACE CHEMISTRY
chem, <http://www.infochem.demon.co.uk/data.htm>.
Examples of modeling and simulation sites are Sci- BIBLIOGRAPHY
Polymer from ScienceServe, <http://www.scienceserve.
com/wwwscivision/scipolymer/prediciti.htm>; Molecu- Barton, A. F. M. (1983). In “Polymer Yearbook” (H. G. Elias, and R. A.
Pethrick, eds.), p. 149, Harwood, Chur, Switzerland.
lar Analysis ProTM , <http://www.sge.com/software/soft/
Barton, A. F. M. (1992). “Handbook of Solubility Parameters and Other
13051.htm>; The Askadskii Approach from Million- Cohesion Parameters,” 2nd ed., CRC Press, Boca Raton, FL.
Zillion Software, <http://www.millionzillion.com/cheops/ Barton, A. F. M. (1990). “Handbook of Polymer-Liquid Interaction Pa-
metho1/askadskii.htm>; Cerius2 from Molecular Simu- rameters and Solubility Parameters,” CRC Press, Boca Raton, FL.
lations Inc., <http://www.man.poznan.pl/software/msi- Barton, A. F. M., and Knight, A. R. (1996). J. Chem. Soc. Faraday Trans.
92, 753.
doc/cerius38/TutMats/tut – synthia.doc.htm1#740585 >;
Burrell, H. (1975). In “Polymer Yearbook” (J. Brandrup, and G. H.
< http:/ /www.msi. com /solutions /polymers /misc .html >; Immergut, eds.), 2nd ed., IV-337, Wiley (Interscience), New York.
and ProCAMD from Capec, <http://www.capec.kt.dtu. Hansen, C. M. (1969). Ind. Eng. Chem. Prod. Res. Dev. 8, 2.
dk/capec/docs/software/procamd/procamd.html>. Hansen, C. M., and Beerbower, A. (1971). In “Kirk-Othmer Encyclope-
QSAR (quantitative structure activity relationships) dia of Chemical Technology,” 2nd ed. (A. Standen, ed.), Suppl. Vol.,
p. 889, Wiley (Interscience), New York.
and QSPR (quantitative structure property relationships),
Hoy, K. L. (1970). J. Paint Technol. 42, 76.
which study relationships between useful chemical and Karger, B. L., Snyder, L. R., and Eon, C. (1978). Anal. Chem. 50, 2126.
Rowes, R. (1985). “Chemists propose universal solubility equation,”
ideal candidates for this kind of approach, for exam- Chem. Eng. News (March 18), 20.
P1: GJC/LOW P2: FYK Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN004G-160 June 15, 2001 12:44
Crystallography
Jeffrey R. Deschamps
Judith L. Flippen-Anderson
Laboratory for the Structure of Matter, Naval Research Laboratory
I. Introduction
II. Evolution of Crystallography
III. Structure of a Crystal
IV. Steps in Crystal Structure Analysis
V. Comparison of X-Ray and Neutron Diffraction
VI. Results
Appendix I: Factors Affecting Intensities
Appendix II: Methods of Structure Solution
Appendix III: Methods of Refinement
121
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
122 Crystallography
Miller indices A set of integers with no common factors, Unit cell Parallelepiped bounded by three noncoplanar
inversely proportional to the intercepts with the crystal vectors a, b, c with angles α, β, γ that repeats by trans-
axes of a lattice plane. lation. If this unit is the smallest volume that meets
Orientation matrix A matrix relating the crystal axes to these criteria it is referred to as the primitive unit cell.
the instrument axes such that one can predict the values X-ray Electromagnetic radiation with wavelengths in the
of the instrument angles (2θ, ω, χ , and ) for a given range 0.01 to 1.0 nm (0.1 to 10 Å). The wavelength
reflection of the crystal. range most commonly used in diffraction experiments
Patterson function A Fourier summation that uses the is between 0.71 and 1.54 Å. Shorter wavelengths re-
squares of the structure factor magnitudes as coeffi- quire longer path-lengths (distance from crystal to de-
cients. The peaks in this map correspond to vectors tector) in order to resolve adjacent peaks in the diffrac-
between atoms. The peak height is related to the scat- tion pattern. Both the sample and the air along the beam
tering powers of the atoms at the two ends of the vector. path can significantly attenuate longer wavelengths.
The region around the origin gives information about
bonded distances.
Phase problem A central problem of crystallography. CRYSTALLOGRAPHY is a broadly encompassing dis-
The intensities of the different reflections allow deriva- cipline that involves a variety of fields of study. The pri-
tion of the amplitude of the structure factors but not mary concern of modern crystallography is the three-
their phases. The phases are required in order to calcu- dimensional arrangement of atoms in matter. Although the
late the electron density, which is a “map” showing the term most often refers to studies of crystalline solids (ei-
position of atoms in the unit cell. ther single crystals or crystalline powders) using X-ray or
Point group A group of symmetry operations that leave neutron diffraction, it encompasses a much broader range
unmoved at least one point within the object to which of methodologies.
they apply.
Polar space group Space group in which the origin is not
fixed by symmetry and hence must be defined (e.g., the I. INTRODUCTION
space group P21 ).
Reciprocal lattice A set of imaginary points constructed In the late 1660s, crystallography began as the study of the
in such a way that the direction of a vector from one macroscopic geometry of crystals. Crystals were grouped
point to another coincides with the direction of a normal into systems on the basis of the symmetry of their external
to the real space planes within the crystal. The sepa- shapes. Based on these observations, seven crystal systems
ration of those points (absolute value of the vector) is were identified: triclinic, monoclinic, trigonal, tetragonal,
equal to the reciprocal of the real inter-planar distance. hexagonal, orthorhombic, and cubic. It was theorized that
Space group Identical atom groups are usually symmetri- the observed crystal shapes could be built up by stacking
cally arranged within the crystal lattice. The symmetry minute balls. This idea was refined, and in the late 18th
relating the groups may be due to rotations, inversions, century it was thought that crystals were composed of ele-
mirror planes, or some other relational operation. The mentary building blocks (now referred to as the unit cell).
space group constitutes a mathematical shorthand de- This was supported by the orderly cleavage angles of cal-
scription of the symmetry operations required to pro- cite, which suggested a regular stacking of these elemen-
duce the unit cell. tary blocks. Further studies in the 1800s led to derivation
Special position A point left invariant by at least two of the 32 different point groups, the Bravais lattices, and
symmetry operations of the space group. the 230 space groups. All of these advances were made
Structure factor Fhkl , complex quantity corresponding before any direct observations of the arrangement of atoms
to the amplitude and phase of the diffraction maximum within a crystal were possible.
associated with the reciprocal lattice point hkl: In 1912, Von Laue reasoned that the arrangement of
N atoms in crystals could help him measure the wavelength
Fhkl = f j exp[2πi(hx j + ky j + lz j )] of X-rays. Based on experiments with copper sulfate,
j=1 he demonstrated the ability of crystals to act as three-
where N is the number of atoms in the unit cell and dimensional diffraction gratings. Crystallography had en-
x j , y j , z j are the fractional coordinates of the jth atom. tered a new era: the analysis of the arrangement of atoms
Torsion angle If a group of four atoms (ABCD) is pro- in a crystal by careful analysis of the diffraction pattern of
jected onto a plane normal to the bond between B and that crystal. Crystal structures were now viewed as being
C, the angle between bonds connecting A and B, and built up from repeating units of an atomic pattern rather
C and D is the torsion angle. than the regular stacking of solid shapes.
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 123
As new types of scattering were discovered, they be- by eye, and Fourier transforms were performed with
came incorporated into the discipline, and crystallogra- Beevers–Lipson or Patterson–Tunell strips and summed
phy came to include structural studies of all classes of by hand. Three-dimensional electron-density maps were
substances. Absorption, diffraction, or other scattering plotted by hand, one section at a time, traced onto glass
methods are used to study crystals, powders, amorphous sheets, and stacked in frames for interpretation. As long
materials, surfaces, liquids, and gases. The physics and as all calculations were done by hand, small, flat molecules
methodologies associated with these techniques are also were the ones most amenable to study. Early computers
part of the science of crystallography. were difficult to program and had limited storage capacity,
Crystallographic studies play a vital role in materials but they made it possible to solve and refine crystal struc-
science, chemistry, pharmacology, mineralogy, polymer tures of molecules of moderate size in 6 months to a year.
science, and molecular biology. Accurate knowledge of The rapid advances in crystallography owe much to the
molecular structures is a prerequisite for rational drug de- development of computer-controlled diffractometers for
sign and structure-based functional studies. Crystallogra- data collection, high-speed computers for data analyses,
phy is the only method for determining the “absolute” and, most recently, powerful graphics devices for display-
configuration of a molecule. Absolute configuration is a ing structures with the ability to perform real-time manip-
critical property in biological systems, as changes in this ulations. Tasks that required months of effort can now be
may alter the response of the biologic system. accomplished in minutes with the aid of a computer.
A requirement for the high accuracy of crystallographic
structures is that a good crystal must be found, and this A. The Early Years
is often the rate-limiting step. Additionally, only limited
The first experiments in X-ray diffraction were recorded
information about the dynamic behavior of the molecule
on film. By 1913, W. H. Bragg had constructed the first
is available from a single diffraction experiment.
“X-ray spectrometer” to allow a more careful study of
In the past three decades, new developments in detec-
X-rays. This instrument also proved useful in studies
tors, increases in computer power, and powerful graph-
on crystals. Using measurements made with this X-ray
ics capabilities have contributed to a dramatic increase in
spectrometer, Bragg’s son determined the structures of
the number of materials characterized by crystallography.
fluorspar, cuprite, zincblende, iron pyrites, sodium nitrate,
Synchrotron sources offer the possibility of time-resolved
and the calcite group of minerals.
studies of physical, chemical, and biochemical processes
The first crystal structures reported were of substances
in the millisecond to nanosecond range; the ability to study
crystallizing in cubic space groups. The structure of
the nearest neighbors of cations present at parts per mil-
diamond was determined in 1913 by the Braggs, from
lion concentrations; and the possibility of recording small-
symmetry considerations using the observed intensities to
angle scattering data and powder data in seconds. Charge-
discriminate among possible structures. Their model
density studies have been made on numerous light atom
established the carbon–carbon single bond distance of
structures and are beginning to provide new insights into
1.52 Å and confirmed that bonds to carbon are directed
bonding of transition metals. Rietveld refinement is rev-
tetrahedrally. The younger Bragg combined symmetry ar-
olutionizing the study of powders and is being extended
guments with the notion that the scattering power of atoms
to fibers. Direct methods of structure solution are being
is related to their atomic weight to explain the structures
applied successfully to structures of over 1000 atoms. The
of the alkyl halides. These concepts were extended to ZnS,
Human Genome Project has created many opportunities
CaF2 , and FeS2 . The alkyl halide structures show anions
for crystallographic studies of biological macromolecules
surrounded by cations and cations surrounded by anions,
and resulted in intense activity in the areas of structural
demonstrating conclusively that discrete “molecules”
genomics and proteomics. Polarized neutrons are being
of the type NaCl do not exist in crystals of ionic
used to determine the spin structure of magnetic materials
materials. Pauling published the first intermetallic struc-
and to probe the surface structure of such materials. The
ture, Mg2 Sn, in 1923. In that same year, Dickerson and
review that follows attempts to describe the fundamentals
Raymond showed that hexamethylene tetramine consisted
of crystallography and capture the excitement and diver-
of discrete molecules, each having the same structure,
sity within the discipline.
with C–N distance of 1.44 Å in a body-centered cubic
lattice of edge 7.02 Å.
II. EVOLUTION OF CRYSTALLOGRAPHY
1. Heavy-Atom Methods
It was not very long ago that X-ray diffraction data were
collected on photographic film, intensities of spots on The elder Bragg realized that the periodic pattern in the
the film (corresponding to data points) were “measured” electron-density distribution could be represented by a
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
124 Crystallography
Fourier summation. The coefficients of this summation The crystal structure can be regarded as alternating layers
became known as the structure factors. This allowed the of Zn and S atoms perpendicular to the {111} direction.
solution of structures where positions of atoms were not Looking at one layer of Zn atoms, we find it lies closer to
restricted to special positions. The determination of the one of the two adjacent layers of S atoms. If we assume
structure of diopside, CaMg(SiO3 )2 , in 1928 was the first that the short Zn–S spacing is a “bonding” interaction and
example of the use of this method. the long Zn . . . S gap is a van der Waals contact, we do not
In 1934, Patterson showed that a Fourier series with |F|2 expect cleavage between bonded layers. This implies that
as coefficients could be summed without knowledge of the one of the {111} faces corresponds to a layer of S atoms
phases and would reveal interatomic vectors. Since the and the other to a layer of Zn atoms.
weight of a peak in the Patterson function is proportional It was fortuitous that the intensities in the Coster, Knol,
to the product Z i Z j of the atomic numbers of the atoms at and Prins study were measured using AuLα radiation
the ends of the vector, vectors involving heavy atoms stand (λ = 1.276 Å). The K-absorption edge of Zn is 1.283 Å.
out among light atom–light atom vectors. In 1936, Harker Thus, the measured intensities of Friedel pairs were not
showed that symmetry properties of the crystal caused the same. This was the first example of anomalous scatter-
vector density to accumulate on certain planes and lines ing. The difference due to anomalous scattering is greatest
(later known as the Harker section) in the Patterson func- when data are collected near an absorption edge of a heavy
tion. These two papers were the foundation of the heavy- atom in the structure. It was nearly 20 years after the ZnS
atom method for crystal structure solution. The method experiment that Bijvoet realized this principle could be
assumes that phases calculated from the heavy-atom posi- used to determine the absolute configuration of the sodium
tions will be sufficiently accurate that a Fourier synthesis rubidium salt of (+)-tartaric acid.
(using |F| as opposed to |F|2 ) will reveal the positions of
more atoms, thus allowing solution of the structure. Phase
B. Modern Crystallography
information from the new atoms could then be added to
the Fourier synthesis to locate more new atoms and so on, The early years can be characterized as a period in which
until the full structure was revealed. the size of a problem amenable to analysis was compu-
The method of isomorphous replacement was used first tationally and instrumentally limited. Data collection and
to solve the structure of the alums by Lipson and Beevers analysis were manual processes. This began to change in
in 1935. If a centrosymmetric light-atom structure and its the 1950s.
heavy-atom derivative differ only in the presence of the
heavy atom, then the differences in the intensity of equiv-
1. Data Collection
alent reflections can be used to determine the signs of
the structure factors. Robertson applied this method to the Progress in crystallographic data collection can be charted
phthalocyanines in 1936. These molecules form an iso- by examining early issues of Acta Crystallographica,
morphous series and crystallize in P21 /a. By comparing published by the International Union of Crystallography
intensities of nickel phthalocyanine and the unsubstituted (IUCr). Although a manual “diffractometer” was available
molecule, Robertson was able to assign phases to all but in 1913, the primary method for collecting intensity data
a few of the measured reflections from the h0l projection. for crystallographic studies relied on X-ray cameras and
Harker generalized the method to the noncentrosymmetric film. In 1948, about two-thirds of the crystal structures re-
case in 1956. Isomorphous replacement can be combined ported used a camera and film to collect the intensity data,
with anomalous dispersion to obtain phase information for and less than one-quarter used a combination of film and a
large molecules. diffractometer. Despite the existence of automated single-
crystal diffractometers, the situation was little changed by
1962.
2. Absolute Configuration
Use of the Cambridge Structure Database (CSD) allows
Friedel’s law states that the scattering from the front and a more systematic study of the evolution of data collec-
back sides of a plane, hkl, are the same. This means that the tion methods (Fig. 1) with the caveat that this database
measured intensities of the 111 and 111 reflections (and references only organic and organometallic structures. Al-
all other “Friedel” pairs) should be equivalent. However, though automated diffractometers were first available in
in a 1930 study on ZnS by Coster, Knol, and Prins, it about 1955 it was not until the mid-1960s that this new
was noted that the 111 reflection was not equivalent to technology made much of an impact. At that time, only
its Friedel mate. In the case of the ZnS crystal, the {111} a few hundred structures were being added to the CSD
faces are prominent but, even by visual examination, do each year. By the mid-1970s, over 1000 structures were
not appear identical. One face is shiny and the other is dull. added to the CSD each year and data for over 80% were
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 125
FIGURE 1 Changes in data-collection methods with time. Using structures entered in the Cambridge Structure
Database, changes in data collection strategies were determined by examining the fraction of structures reported
in one year for various data collection methods (photograph with visual estimation of intensities, photograph with
densitometer, diffractometer, or unknown). The large fraction listed as unknown between 1960 and 1965 is likely due
to changes occurring at that time in how data were collected and how data-collection methods were reported.
collected on a diffractometer. In 1997, over 15,000 struc- concerned the development of the mathematical aspects
tures were added to the CSD and virtually all were col- of crystal structure analysis. The first general procedures
lected on a diffractometer. The explosive growth of this for solving both centrosymmetric and noncentrosymmet-
database cannot be attributed solely to improvements in ric structures was developed in the early part of the 1960s.
data collection, but certainly the routine use of automated Use of this method grew with the power of computers and
instrumentation had a significant impact. computer programs. It is now the most widely used method
of solving crystal structures of moderate size. Efforts are
currently being made to apply direct methods to very large
2. Structure Solution structures such as proteins. For a more detail discussion
Data collection was not the only beneficiary of the post- of structure solution methods, see Section IV.H.
World War II progress in crystallography. No general
method existed for solving unknown structures without
3. Charge-Density Distribution
heavy atoms until the advent of direct methods—a means
of determining the values of phases from relationships The first papers to explore the difference between the
among the structure factor magnitudes associated with results of X-ray and neutron diffraction experiments ap-
those phases. The earliest structure solved by such meth- peared in the early 1970s. Systematic differences between
ods was decaborane by use of some inequalities derived by the positional and thermal parameters determined by the
Harker and Kasper. However, this technique was limited to two techniques were reported. These differences were at-
centrosymmetric structures. The major effort in the 1950s tributed to the difference in how neutrons and X-rays
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
126 Crystallography
interact with atoms. Neutrons are diffracted primarily systematically underestimate lone-pair peak heights and
from the nucleus; hence, neutron diffraction produces in- place hydrogen atoms too close to the atoms to which
formation about nuclear position. X-rays are diffracted by they are bonded. The experiments must be conducted at
electrons and therefore yield information about the dis- low temperature (i.e., −75◦ C or preferably less) for this
tribution of electrons in the molecule. It was logical to method to succeed.
extend these ideas and attempt to map the redistribution Other approaches include refinement of separate param-
of electron density that occurs on bonding. eters for core and valence electrons or the direct refinement
A number of different kinds of mapping have been of a deformation model. The major advantage of the di-
done. Subtracting core-electron densities from experimen- rect refinement methods is that they make no assumptions
tal electron densities (i.e., p(r)) should reveal details of about the (sin θ )/λ dependence of bonding features.
the redistribution of valence electrons on bonding. The Errors in the deformation density maps arise from er-
term “valence density” is used to describe the difference rors in both the model and the data. Demands on diffrac-
function: tion methodology and interpretation are many times more
severe than those relevant to an average stereochemical
pvalence (r) = p(r) − pi,core (r) investigation. The following considerations are extremely
atoms
important:
To define either of these functions the positions and ther-
mal parameters must be known. One way to approach this 1. The X-ray data set must be complete (all symmetry-
is to use neutrons to determine the positional and thermal related reflections measured) to a (sin θ )/λ limit of about
parameters and to use these parameters in conjunction 1.3 Å−1 . This implies the use of short-wavelength ra-
with X-ray, spherical-atom scattering factors to calculate diation such as MoKα (λ = 0.7107 Å) or AgKα (λ =
structure factors for the “promolecule” using: 0.5612 Å). The maximum value of (sin θ )/λ is 0.65 for
CuKα (λ = 1.5418 Å).
Fcalc,N = f i exp(2πiH · ri )Ti
atoms
2. The data must be corrected for absorption, extinc-
tion, and thermal diffuse scattering. Multiple reflections
Fcalc,N is the X-ray structure factor calculated from neu- must be either avoided or eliminated. If these conditions
tron positions. The deformation density or X –N map then are met, internal agreements should be of the order of 2%;
corresponds to: i.e.,
1
X −N
pdeformation (r) = × (Fobs,X − Fcalc,N ) exp(−2πiH·r)
V H
F 2 (H) − F 2 (H)
F 2 (H) = 0.02
H H
Since both Fobs and Fcalc contain the effects of thermal mo-
tion, this deformation density map is thermally smeared.
4. Rietveld Analysis
Its resolution is limited by the maximum (sin θ )/λ value
of the data. There has been a renaissance in powder diffraction in re-
Given that there are few neutron-diffraction facilities cent years because Rietveld refinement allows determina-
in the world and that it is difficult to correct adequately tions of positional and thermal parameters from powder
for systematic effects in the two experiments (i.e., ab- data, even when the diffraction peaks are not well sep-
sorption, extinction, thermal diffuse scattering, multiple arated in the recorded pattern. Rietveld analysis is not a
reflections), it would be desirable to study bonding effects method of structure solution and can only be applied when
using exclusively X-ray data. There are several approaches cell dimensions and space group are known and when a
to this problem. reasonable model exists for the structure.
The X –X formalism is similar to the X –N formal- In a polycrystalline sample, information may be lost as
ism described above, except that the calculated values for a result of the random orientation of the crystallites. A
positional and thermal parameters are derived from re- more serious loss of information can result from the over-
finement of high-angle X-ray data [(sin θ )/λ > 0.70 Å−1 ]. lap of independent diffraction peaks in the powder pat-
For deformation density maps, neutral spherical atoms are tern. Using the total integrated intensities of the separate
subtracted from the observed density; in valence density groups of overlapping peaks in the least-squares refine-
maps, Hartree–Fock core-electron densities are used to ment of a structure leads to the loss of all the information
evaluate Fcalc . Comparison of X –X and X –N maps shows contained in the often-detailed profile of these composite
that they do indeed yield the same qualitative information. peaks. Rietveld developed a refinement method that uses
Bonding density shows in the middle of bonds, and lone- the profile intensities of the composite peaks instead of
pair density is in the correct location. However, X –X maps the integrated quantities. This is a pattern-fitting method
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 127
of structure refinement and allows extraction of the max- precision of the resulting parameters. However, precision
imum amount of information contained in the powder comparable to single-crystal X-ray diffraction can now be
pattern. obtained from neutron diffraction at a fixed wavelength
A powder pattern is recorded in a step-scan mode with with Rietveld refinement. For X-rays from conventional
a step width of 0.02 to 0.03◦ 2θ . No attempt is made to sources, the precision of positional parameters is compa-
allocate observed intensity to individual reflections or to rable to the single-crystal case, but thermal parameters are
resolve overlapping reflections. Instead, the intensity of less reliable by a factor of two or three. Considerable ef-
the powder diffraction pattern is calculated as a stepwise fort is being expended to improve profile functions for the
function of the angle, 2θ . Refinement allows calculation various X-ray and neutron sources.
of the shifts in the parameters that will improve the fit of
the calculated powder pattern to the observed one. The
5. Small-Angle Scattering
quantity minimized is
Scattering at small angles is derived from large structural
wi [yi (obs) − yi (calc)]2 units—units whose dimension D is much larger than the
i
wavelength of the radiation used in the experiment. The
where yi (obs) is the observed intensity of the ith step of the acronyms SAXS and SANS refer to small-angle X-ray
profile, and yi (calc) includes the usual structural parame- scattering and neutron scattering, respectively.
ters (i.e., positional parameters xi , yi , z i ; thermal parame- Different sorts of small-angle experiments are typi-
ter Bi j ; and site-occupancy parameters p j ). However, the cal of the kind of material studied and yield a charac-
model must also include instrumental and sample parame- teristic type of pattern. Low-angle data from ordered or
ters: 2θ0 (overall scale), overall temperature factor, profile semi-ordered systems give Bragg peaks at specific values
breadth (H2 = U tan 2θ + V tan θ + W), profile asymme- of scattering vector. Examples include aligned structures
try, background, preferred orientation, lattice parameters, with long-range periodicity, such as two-dimensional bi-
and wavelength. The agreement factors most often quoted ological structures or samples such as opal that present
are long-range order.
1/2 Scattering from polymers in dilute solution or from
wi [yi (obs) − (1/c)yi (calc)]2 biological materials yields patterns that look rather like
Rweighted pattern =
wi [yi (obs)]2 the Debye–Scherrer rings observed in wide-angle data
from powders. When scattering arises from the spheri-
and cal particles in a mono-disperse system, the pattern is a
Bessel function consisting of a succession of peaks of
|Ik (“obs”) − Ik (calc)|
RBragg = diminishing magnitude that are broad relative to Bragg
Ik (“obs”)
peaks. Analysis of the pattern yields the radius of the
In RBragg , “obs” has quotation marks because Ik (“obs”) is particles.
calculated by partitioning the intensity. In some circumstances, multiple Bragg reflections give
To date, most of the papers published using this rise to scattering in the small-angle region. One of the
method have been neutron-diffraction studies from reac- advantages of the “tunable” sources of X-rays and neu-
tor sources. Advantages of neutron data include minimal trons is that multiple Bragg scattering can be avoided by
preferred orientation, no polarization, and neutron absorp- choosing wavelengths larger than the lattice spacing.
tion cross-sections smaller than X-ray values by a factor of Small-angle scattering can be applied to a wide variety
104 , scattering independent of θ, and for fixed-wavelength of materials. In polymer science, it has been used to in-
experiments the peak shape is simple. vestigate chain conformation in amorphous polymers, the
In X-ray experiments using radiation from conventional state of mixing in polymer blends, the compatibility ranges
sources, peak shape is complicated by both α1 –α2 split- of polymer blends, and the measurement of domain struc-
ting (at high angles) and by the fact that the peak shape ture and molecular conformation within those domains.
is neither Lorentzian nor Gaussian, but is better described In biological materials, examples include measurement of
by a convolution of these two functions called the Voigt the radius of gyration of proteins in solution; aggregation
function. The pseudo-Voigt function used in many pro- of chlorophyll into micelles; diffraction patterns of semi-
grams is an approximation of the Voigt function that can ordered materials such as muscle, collagen, etc.; and stud-
be evaluated much more quickly. ies of the shapes and constitution of viruses. Separation
Spallation neutron sources, time-of-flight experiments processes, such as those used in refining of metals or ex-
with reactor sources, and synchrotron sources all have spe- traction of tar sand, frequently involve micelle formation.
cial problems in defining peak shapes, and this limits the Small-angle scattering can give information regarding the
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
128 Crystallography
shape, size, and degree of polymerization of the aggre- 7. Implications of New X-Ray
gates. Studies of materials such as cements, zeolites, and and Neutron Sources
catalysts involve the measurement of size and distribution
of pores and measurements of specific surface. Synchrotron sources present a unique combination of
properties that are very attractive for X-ray scattering,
absorption, and diffraction experiments. The radiation
6. Extended X-Ray Absorption Fine Structure
produced has extreme brightness over a broad spectral
Extended X-ray absorption fine structure (EXAFS) is a range. Synchrotron sources are five to six orders of mag-
technique for studying the local environment of a specific nitude brighter than the bremsstrahlung (that part of the
atomic species in a complex matrix. Because the inter- X-ray spectrum caused by the slowing of electrons on
action of X-rays with the material under study is ab- impact with the target, also referred to as white radia-
sorption rather than diffraction, the technique can be ap- tion) available from a conventional rotating anode source.
plied to gases, liquids, and amorphous solids as well as to Monochromators currently available produce resolution
crystals. of 0.1 eV at 8 keV. The radiation is naturally collimated
In the experiment, the X-ray absorption coefficient is with a divergence of the order of 2×10−4 radians, is plane
measured from slightly below to about 1000 eV above polarized, and has a precise time structure (subnanosec-
the absorption edge for the atomic species whose environ- ond pulses repeated every 0.5 to 1 µsec). These properties
ment is to be studied. Through analysis of the fine structure allow experiments that simply cannot be done with con-
above the edge it is possible to determine the coordination ventional sources: EXAFS on dilute samples (parts per
of the atom. The actual position of the edge gives infor- million range), measurement of the magnitude and angu-
mation about the oxidation state of the absorbing atom, lar dependence of the real and imaginary components of
while structure at energies at or just below the edge pro- anomalous dispersion, and determination of the structure
vides information about bound states associated with the of a protein using only one derivative and three wave-
absorbing atom and hence about the symmetry of the en- lengths. (In this context, the use of anomalous dispersion
vironment. The technique can be applied to any atom with is formally equivalent to multiple isomorphous replace-
Z > 15 to yield a determination of radial distance to a pre- ment with the added feature that the isomorphism is ex-
cision of ±0.02 to 0.01 Å. For elements with high values act.) Determinations of cation-site distribution in minerals
of Z , it may be preferable to use the L-absorption edge and diffraction from monolayers on surfaces have many
rather than the K edge because higher fluxes are available; applications in such areas as catalysis or materials sci-
however, theory for the L edge is not yet well developed. ence. Perhaps the most exciting application is the ability
Extended X-ray absorption fine structure has applica- to do time-resolved studies of physical, chemical, and bi-
tions in many branches of science. In molecular biology, ological processes using small-angle scattering, powder
it has been used to study Ca+2 transport in membranes, diffraction, and other scattering techniques.
binding of oxygen in hemoglobin, and other coordination One early example was the study by Larson of temper-
problems. It is an invaluable tool for the study of amor- ature and temperature gradients in silicon during pulsed-
phous substances such as glass, since the manufacture of laser annealing. In this example, the duration of the laser
glasses with particular mechanical and thermal properties pulse was 15 nsec and that of the synchrotron X-ray pulse
depends on structure. Catalysis has become extremely im- was 0.15 nsec. The laser bursts were synchronized so that
portant in energy development, resource utilization, pollu- the probing X-rays arrived at 20, 55, and 155 nsec after
tion abatement, refining of metals, etc. The chemical state the laser pulse. The experiments showed that the lattice
and atomic environment of an atomic species in a catalyst temperature of silicon reaches the melting point during
in situ while reduced with hydrogen, chemisorbed with the 15-nsec pulse and remains at the melting point during
oxygen, heated, quenched, etc., can be determined with the high reflectivity phase, after which time the tempera-
EXAFS. This allows the design of heterogeneous catalysts ture rapidly subsides. Temperature gradients at the liquid–
that are tailored from precise knowledge of electronic and solid interface were measured for the first time and were
structural parameters. found to be in the range of 107 ◦ K/cm. Larson received the
Synchrotron sources have revolutionized EXAFS stud- Warren Award for Diffraction Physics for this pioneering
ies. The intensity of the source and high collimation make work in nanosecond, time-resolved X-ray diffraction.
it possible to collect the relevant data in about 20 minutes The new pulsed spallation sources (such as the Euro-
in a sample such as Cu metal with a resolution of 1 eV pean Spallation Neutron Source at Forschungszentrum
at 8.8 keV. To collect similar data with a rotating anode Jülich GmbH, or the Spallation Neutron Source at Oak
source would take about 2 weeks, and the precision would Ridge National Laboratory) provide spectra substantially
be reduced by a factor of about 100. richer in short-wavelength neutrons than those available
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 129
from the reactor sources. Pulse duration and repetition are globin, for example, at 1.8 Å resolution, the negative den-
source parameters, but the time structure can be exploited sity of the protons (which form about half of the scattering
by a variety of techniques. Essentially the same types of material in the cell) makes the polypeptide chain stand out
experiments are done at both the reactor and spallation clearly in the Fourier maps. Soaking the same crystal in
sources. The higher neutron fluxes available with the new D2 O has allowed identification of exchangeable protons.
sources allow experiments to be done on smaller samples Neutron- and X-ray-scattering experiments using non-
and/or in shorter times than was previously possible. Bragg scattering can be used to study the size, shape, and
aggregation of micelles. Contrast variation can be used to
8. Contribution of Diffraction to Molecular Biology study the internal structure of viruses.
The average scattering densities from the protein coat
Molecular biologists seek to unravel the mysteries of the
and the RNA interior of viruses are different. Each can be
cell by mapping gene location, function, and control. By
“matched” by a different H2 O/D2 O ratio. Thus, for virus
understanding all of these we gain insight into how and
particles in solution, the matched phase becomes “invisi-
when genes are turned on and how these might be used
ble” to the neutron beam and allows the radial distribution
to perform useful tasks. One goal of such studies is to al-
of scattering of the other component to be recorded. For
ter (i.e., reengineer) a natural biologic process to perform
spherical viruses, this allows measurement of the thick-
some other function. Examples of this include attempts
ness of the protein coat and the degree of interpenetra-
to modify metal-binding proteins such that a protein that
tion of the two phases. Enzymes and other proteins bind
was originally selective for calcium is selective for zinc
some substrates so well that detailed, atomic-level anal-
or copper. This reengineered metal-binding protein could
yses of the structure of the native protein can be com-
then be used to construct a sensor for zinc or copper. Other
pared with those of protein–substrate complexes, protein–
examples include modifying enzymes for use in industrial
inhibitor complexes, and proteins with catalytic groups
processes rather than biologic processes. In order to ef-
bound to allosteric sites. Detailed comparisons between
fect these changes, the relationship between structure and
such structures have greatly enhanced our understanding
function must be understood.
of the mechanisms of biological catalysis.
The structure of a macromolecule can be “known” at
Molecular graphics programs are now available that can
a number of levels. Primary structure is the linear se-
display the full three-dimensional structure of a macro-
quence of building blocks (i.e., amino acids) from which
molecule and zoom in on any portion of it. It is possible
the protein is built. For example, the β chain of human
to examine the active site and to attempt to fit known sub-
hemoglobin contains 144 amino acids, of which the first
strates and/or inhibitors into that site. The availability of
five have the sequence valine, histamine, leucine, threo-
coordinates for macromolecules in the Protein Data Bank
nine, and proline. The term “secondary structure” refers
(see Section VI.A.8) has allowed many fruitful applica-
to local interactions that determine the conformation of
tions of this type.
the polypeptide chain and the interchain hydrogen bond-
ing scheme. “Extended chain,” “α-helix,” and “β-sheet”
are terms used to describe secondary structure. Tertiary
III. STRUCTURE OF A CRYSTAL
structure is the three-dimensional arrangement of atoms
within the macromolecule, while quaternary structure de-
A. Choice of Unit Cell
scribes the arrangement and interaction of aggregates of
the macromolecules themselves. Diffraction techniques A crystal is a multifaceted solid, similar in appearance to
are widely used to study the secondary, tertiary, and some- an unpolished gemstone. Internally it consists of a basic
times quaternary structures of macromolecules. pattern, known as the repeating unit, of molecules that
Single crystals of macromolecules may be studied by repeats itself by translation, in three dimensions, to the
X-ray diffraction. A wide variety of techniques are edges of the crystal. By choosing one corner of the repeat-
now available for macromolecular structure solution. In ing unit to be the origin, one can use three translational
the past, heavy-atom multiple isomorphous replacement vectors, having both length and direction, to construct a
(MIR) was the most common method of structure solution; parallelepiped that contains the entire basic pattern. This
however, anomalous dispersion is becoming more com- parallelepiped is defined as the unit cell and one need only
mon as a method of structure solution. The relatively small determine the contents of the unit cell, the basic pattern,
protein, crambin, was an early example of using anoma- to know the structure of the entire crystal. The general
lous dispersion to effect structure solution of a macro- symbols for the unit-cell vectors are a, b, c and for their
molecule. In a few cases, neutron-diffraction studies have magnitudes a, b, c. The coordinate axes, or directions of
been carried out on single crystals of proteins. In myo- the sides of the unit cell, are referred to in general as the
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
130 Crystallography
Crystallography 131
+∞
fj = V ρ j (uvw)
−∞
× exp[2πi(hu + kv + lw)] du dv dw
where ρ j (uvw) is the electron density of the jth atom re-
ferred to x j , y j , z j as origin. Thus, if we know the position
of an atom, then we can calculate the phases. Conversely,
if we know the phases, then we can calculate the electron
density and hence the positions of the atoms. The central
problem in crystallography is that the phases are not ob-
served in the diffraction experiment. Solving a structure
consists of finding positions for the atoms or of finding
FIGURE 3 Bragg’s law. The distance AB = BC = d sin θ. Con- phases for the structure amplitudes. Details of the major
structive interference occurs if the path difference is a whole num- methods for structure solution are found in Appendix II.
ber of wavelengths. Thus, nλ = 2d sin θ .
The amplitude of the wave scattered by the plane hkl is IV. STEPS IN CRYSTAL
1 1 1 STRUCTURE ANALYSIS
Fhkl = V ρ(x yz)
0 0 0
The steps outlined here would be typical of those used for
× exp[2πi(hx + ky + lz)] d x d y dz a moderately complex structure. Notes are also included
on alternative techniques that may be more applicable to
where ρ(x yz) is the electron density at the point x, y, z
macromolecular crystallography.
in the unit cell. The quantity (hx + ky + lz) is the vector
product (H · r).
By the properties of Fourier series, Fhkl is a Fourier A. Growing Crystals
coefficient of ρ(x yz) so that:
All strategies for the growth of crystals for diffraction ex-
1 periments are aimed at bringing a concentrated solution of
ρ(x yz) = Fhkl exp[−2πi(H · r)] a homogeneous population of molecules very slowly to-
V h k l
ward a state of minimum solubility. The goal is to achieve
where the summations in h, k, and l each run from −∞ a limited degree of supersaturation, from which the system
to +∞. Note the change in sign of the exponent between can relax by formation of a crystalline precipitate. Many
the two expressions. Fhkl is the Fourier transform of the techniques developed for achieving these ends have been
electron density in the cell. The electron density is the described.
inverse Fourier transform of structure factors. Crystallization techniques used in routine synthetic
If the electron density is a superposition of N atomic methods tend not to produce crystals of the quality re-
densities, then the structure factor expression can be quired for structural work. Suitable crystals must be grown
rewritten as: slowly at near-equilibrium conditions. This implies low
N supersaturation ratios and small gradients. Generally, su-
Fhkl = f j exp[2πi(hx j + ky j + lz j )] persaturation is achieved by changing the composition of
j=1 a solution containing the sample to be analyzed (and pos-
The summation is over all atoms in the cell and the scat- sibly other additives) or by altering the temperature. In
tering factor of the jth atom is either case, the concentration of the sample is driven be-
yond its saturation limit, whereby the sample is forced out
of solution and crystal formation may result.
When crystals are grown from solution, changing the
solvent may have a pronounced effect on their habit and
size. Properties that may influence crystal growth, such
as density, viscosity, dielectric constant, and solubility,
may be varied over a wide range by mixing two or more
solvents.
FIGURE 4 Miller’s index. The plane has intercepts a/3, b/4, c/2. A small surface-to-volume ratio is useful for slow evap-
The Miller’s index is 342. oration of solvent. For small samples, an NMR (nuclear
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
132 Crystallography
magnetic resonance) tube may be a suitable crystallization with no satellite growths or twinning, and on the order
vessel. of 0.1 mm on each edge. Fine-tuning conditions, adding
Very insoluble compounds may be crystallized using detergents or counterions to the precipitant, and seed-
a method known as reactant diffusion. In this method, ing droplets with microcrystals are all techniques used
reactants A and B are allowed to mix by diffusion; the to prepare large, single crystals from initial successful
very insoluble product C will crystallize in the zone of experiments.
mixing. Sparingly soluble compounds can sometimes be
crystallized from boiling solvent in a soxhlet extractor.
B. Microscopic Examination
Crude product is placed in the thimble and the reservoir
is seeded. The crystal chosen for analysis must have a uniform in-
Sublimation is effective for some classes of compounds. ternal structure and be of an appropriate size and shape.
Vacuum sublimation reduces the temperature required and The first criterion implies that the substance is pure—the
so increases the range of compounds for which it is suit- crystal contains no voids or inclusions and is not bent,
able. If the temperature gradient is too large, only micro- cracked, distorted, or composed of crystallites. It must be
crystals will be formed. Large crystals grow at the expense a single crystal, and ideally it should not be twinned (a
of small ones only when the process is carried out slowly. twinned crystal has two or more different orientations of
Vapor diffusion is a method that works well with mil- the lattice growing together).
ligram quantities. The solute is dissolved in a solvent in The size of crystal required is determined by the con-
which it is relatively soluble. A small container of this so- ditions of the experiment. For X-ray diffraction, approx-
lution is placed inside a closed beaker with a second sol- imately 0.1 mm is preferred; for neutrons, an order of
vent in which the solute is only sparingly soluble. The two magnitude larger is appropriate. A crystal with roughly
solvents must be miscible in one another and the second equal dimensions and well-defined edges is ideal; how-
solvent should be the more volatile. Suitable solvent pairs ever, many crystals grow as plates or needles. Although a
include ethanol/ether, benzene/ligroin, and water/ethanol. crystal with a highly asymmetric shape is far from ideal,
Diffusion of one liquid into another is also effective. The useful structural information can often be obtained from
solute is dissolved in the solvent in which it is more such a crystal. In some cases, a more uniformly shaped
soluble. Crystals form at the interface between the two fragment may be cut from a larger crystal with a sharp
solvents. razor blade.
A form of vapor diffusion is commonly used for the Examination with a binocular microscope allows a rapid
growth of protein crystals. For proteins (and other macro- screening of crystals. A few that appear suitable should
molecules) the solution properties modified to achieve su- then be examined more closely through crossed polariz-
persaturation include increasing the concentration of an ers. As the crystals are rotated, they should either appear
additive (e.g., a precipitant), decreasing total solution vol- uniformly dark in all orientations or they should be bright
ume, changing the solution pH, and/or changing the tem- and extinguish (appear uniformly dark) every 90◦ of ro-
perature. Although a wide variety of experimental setups tation. An unsuitable crystal may show dark and light re-
have been used in protein crystallization, the most com- gions simultaneously, or regions that do not extinguish, or
mon technique is “hanging-drop microvapor diffusion” different regions that display different colors.
(HDMVD). In HDMVD, a droplet of 4 to 20 µl con-
taining protein and precipitant is suspended from a glass
C. Mounting a Crystal
coverslip which is sealed above a reservoir of a solution
at a higher precipitant concentration. Since the droplet is The usual way of mounting a crystal for X-ray diffraction
at a lower precipitant concentration than the reservoir, the is to glue it to the end of a glass fiber that is mounted in a
net migration of water vapor occurs from the droplet to brass or aluminum pin. For photographic work, it is neces-
the reservoir, resulting in a decrease in drop volume. The sary to align a real lattice vector (Weissenberg technique)
decrease in drop volume results in increased precipitant or a reciprocal lattice vector (precession technique) so that
and protein concentrations, which should drive the protein it is perpendicular to the X-ray beam. On a diffractometer,
out of solution. Crystals form if conditions are favorable. it is necessary for the crystal to be well centered in the
More often, protein precipitates as an amorphous solid X-ray beam, but alignment is neither necessary nor advis-
along the bottom of the droplet. able since the aligned position is the one that often leads
Once initial conditions are found where crystals have to overlapping reflections.
formed, additional experiments often must be performed Materials that are air or moisture sensitive may be sta-
to perfect crystal growth in order to produce X-ray- bilized by covering the crystal with a light coating of oil or
diffraction-quality crystals. The crystals must be single, the mounting glue (provided that the crystal is not soluble
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 133
F. Data Collection
It is always advisable to collect symmetry-equivalent
data (reflection intensities). The degree of agreement be-
tween equivalent reflections allows assessment of crys-
tal quality, absolute configuration, stability of the count-
ing chain, suitability of absorption correction, and other
systematic effects. For charge density studies, a com-
plete dataset should be collected to a resolution of at
least sin θ/λ = 1.3 Å−1 with an internal agreement of
about 2%. For determination of absolute configuration, ef-
fects of decomposition, absorption, and extinction errors
are minimized if hkl and hkl are measured consecutively
at +2θ and −2θ . This may be done for a few dozen of
the most sensitive reflections or for the whole dataset. For
FIGURE 5 Polaroid rotation photograph.
large datasets, it is common practice to collect data in
shells.
in the solvent of the glue). Very sensitive substances may
have to be mounted in capillary tubes under an inert atmo- G. Data Reduction
sphere, with all mounting operations carried out in a dry The process of deriving structure amplitudes |F hkl | from
box. the observed intensities I hkl is known as data reduction. A
number of geometrical factors influence the intensities ob-
D. Preliminary Photographs served in a diffraction experiment. The most important of
Most diffractometers are equipped with a Polaroid cam- these are Lorentz, polarization, absorption, and extinction
era for a quick check of crystal quality. Alternatively a corrections.
rotation photograph can be acquired with an area detector. The first three corrections are normally applied to the
The spots on the photograph should have similar shapes, observed intensities in the process of calculating the struc-
without tails or streaks (see Fig. 5). A more sensitive check ture amplitudes, |Fhkl |. The Lorentz factor corrects for
of crystal quality is provided by omega scans of a num- the relative speeds with which different reflections pass
ber of reflections measured with different orientations of through the reflecting position. Since X-rays are polar-
the crystal. If the diffraction pattern falls off rapidly with ized on reflection, and the degree of polarization depends
angle, low-temperature data collection is advisable; how- on experimental conditions, a correction must be applied
ever, some crystals crack when cooled. If suitable crystals to account for how the polarization affects the observed
are scarce, it may be advisable to collect a dataset at room intensities. Frequently, Lorentz and polarization correc-
temperature and then cool the sample to improve the res- tions are combined. Absorption depends on the average
olution of the experiment path-length of both the incident and the reflected beam
and hence may be very different for symmetry-related
reflections.
E. Establishing the Orientation Matrix
Extinction is an interference process. An extinction cor-
The orientation matrix and the cell dimensions are estab- rection is generally made during least-squares refinement,
lished by determining the setting angles of about eight making it a correction applied to the model rather than to
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
134 Crystallography
the observations. A detailed discussion of the absorption, Advances in the phasing of macromolecular data have
extinction, and other factors affecting diffraction intensity been made by the use of a phenomenon called anomalous
can be found in Appendix I. dispersion or anomalous scattering. Single isomorphous
replacement with anomalous scattering (SIRAS) takes ad-
vantage of both the phasing power and anomalous scat-
H. Solving the Phase Problem
tering properties of certain heavy atoms. An advantage of
The central problem in crystallography arises because the the SIRAS technique is that data, in some cases, can be
experimental data yield only the modulus of the structure collected using a conventional Cu-Kα radiation source.
factor, |F hkl | and not the phase. The phase is required in The SIRAS approach to phasing obviates the need for
order to evaluate the electron density in the unit cell, but crystallization and data collection from multiple samples.
it cannot be measured directly. The technique does require careful data collection but pro-
Several methods have been developed to determine duces all of the information needed to determine phases
the phases of the complex structure factors, |F hkl |, with from a single dataset collected from only one crystal.
no prior knowledge of atomic positions. These meth- A crystal containing an anomalous scattering atom may
ods include multiple isomorphous replacement, single- be used to collect data at multiple wavelengths and allow
isomorphous replacement with anomalous dispersion, the phases to be determined using multiple-wavelength
multiple-wavelength and single-wavelength anomalous anomalous dispersion (MAD). A single crystal can be
dispersion, heavy-atom, and direct methods. With the ex- analyzed at multiple wavelengths, generating a varia-
ception of direct methods, all of these methods take advan- tion in scattering factors that allows direct determination
tage of the scattering properties of “heavy” atoms (transi- of crystal structures. With careful measurements, even
tion, actinides, and lanthanides). For very large molecules, weak signals from a single crystal can provide the nec-
such as proteins, neither direct nor heavy-atom methods essary phasing information. Data must be collected using
are generally used. Molecular replacement (MR) also pro- “tunable” X-ray radiation available only at synchrotron
vides a powerful phasing method for the structure analysis. facilities.
A model structure similar to the structure being analyzed A paper by J. Karle (see Bibliography) offers the
is required for MR. possibility of nearly direct phasing for protein crystals.
If the positions of the atoms are known, both the mag- The potential for analysis of the data using this single-
nitude and phase of the structure factor can be calculated. wavelength anomalous dispersion (SAD) technique is yet
The heavy-atom method of structure solution depends on to be explored. Given the collection of sufficiently accurate
this. If at least one atom in the structure is heavy enough anomalous dispersion data, one dataset at one wavelength
to be located in a Patterson function, that position can may provide all the information required to determine the
be used to calculate phase angles. A Fourier summation phases.
using observed structure amplitudes and these calculated Direct methods of phase determination do not depend
phases would reveal the heavy atom and some others. The on any a priori structural information. Phases are deter-
additional atoms are included in the structure factor cal- mined from statistical relationships among the intensities.
culation, providing a better estimate of the phase angles. Powerful computer programs have facilitated the use of
This process can be facilitated by use of the tangent for- these methods for centrosymmetric and noncentrosym-
mula to extend and refine the new phases and is repeated metric structures of moderate size. Considerable effort has
until all of the atoms are located. been made to extend these methods to protein structures,
Multiple isomorphous replacement (MIR) is a powerful and structures containing over 1000 non-hydrogen atoms
method for determination of phases. MIR depends upon have been solved by direct methods.
the phasing power of heavy-metal atoms bound to a com- In general, the conformationally rigid portions of the
pound in such a fashion that the positions of other atoms molecule are most easily located. Once the major features
in the crystal are minimally perturbed (i.e., isomorphous of the molecule have been recognized, the difference map
derivatives). The MIR method requires that the crystallog- provides a powerful means both for checking the accu-
rapher prepare more than one derivative of the parent crys- racy of the partial model and for completing it. A Fourier
tal. These often turn out to be unstable, not isomorphous, summation using F = |F obs − F calc | as coefficients and
or to have the metal bound with too low an occupancy phases calculated from the known portion of the struc-
to be useful. Even when successful, this method requires ture yields a map of the discrepancies between the crystal
the collection and reduction of numerous datasets from and the model. An atom present in the crystal but not
multiple crystals. Anomalous dispersion methods can be included in the model will appear as a peak. Difference
combined with isomorphous replacement to circumvent maps are also useful for locating light atoms in the pres-
some of the disadvantages. ence of heavy ones, such as the location of hydrogen atoms
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 135
in compounds of first-row elements or carbon atoms in a trosymmetric crystal is treated as a twin by inversion and
tungsten compound. the contribution of the two components evaluated during
refinement as the Flack parameter. In the case where the
arrangement of atoms in the model and crystal are in agree-
I. Refinement
ment, the contribution of the Flack parameter is zero. If
Once approximate locations have been determined for all, the model and crystal are inverted with respect to each
or almost all, of the atoms in the structure, it must be re- other, the Flack parameter is one and the model needs to
fined or made more precise. The standard method for re- be inverted.
finement of structures, known at atomic resolution (not for
macromolecules), is full-matrix least squares. For struc-
tures that are not known at atomic resolution, or for very K. Derived Parameters
large structures, sparse-matrix or simulated annealing is The parameters produced directly by the least-squares re-
used for refinement. Since the structure factors are not finement are the positions of the atoms and their thermal
linear functions of the parameters, the process is an iter- parameters. Bond lengths, bond angles, and torsional an-
ative one. Hydrogen atoms are not generally included in gles are derived from these positions. An examination of
the early stages of refinement but are included in the final short inter- and intramolecular contacts may provide in-
cycles of refinement. formation about hydrogen bonding, van der Waals forces,
The function minimized is ω2 , where ω is the packing forces, etc.
weight assigned to a particular observation and is the Room-temperature studies of organic compounds gen-
difference between the observed and calculated values of erally show appreciable thermal motion with Ui j values
F hkl (for a refinement based on |F|) or F 2hkl (for a refine- of the order of 0.04 Å2 . This corresponds to a root-mean-
ment based on |F|2 ). A convenient parameter, referred to square vibration amplitude of 0.2 Å and is a reminder that
as the R-factor, for monitoring the progress of the refine- some caution is required in comparing bond lengths from
ment is diffraction experiments with those determined by spec-
R= (||Fo | − |Fc ||) |Fc | troscopic and theoretical work. Bond lengths can some-
times be “corrected” for thermal motion, but it is generally
Correct structures generally have R-values under 0.10, and preferable to reduce thermal motion by cooling the crystal.
those that are well behaved are frequently under 0.05. Many investigators routinely collect data at temperatures
in the range of 100 to 200 ◦ K.
J. Determination of Absolute Configuration A single crystal structure determination provides
valuable information on chemical connectivity, relative
If a particular torsional angle has a positive sign in the conformation, and, under the proper experimental condi-
right-handed enantiomer, it will have a negative sign in tions, absolute configuration. However, an understanding
the left-handed molecule. Thus, determination of the ab- of structure–function relationships requires correlating
solute configuration in a chiral molecule can be regarded features from a number of different structures. The exis-
as the determination of the correct signs for the torsional tence of computer-searchable databases of structural data
angles. A torsional angle ABCD is positive if a clockwise greatly enhances the possibilities for such comparisons
rotation will cause the bond AB to eclipse the bond CD. (see Section VI).
Determination of absolute configuration by X-ray crys-
tallography requires a structural study in the presence of
dispersive scatterers. Thus, if at least one atom in the struc-
V. COMPARISON OF X-RAY AND
ture is an anomalous (i.e., dispersive) scatterer, Friedel’s
NEUTRON DIFFRACTION
law breaks down and reflections from two sides of the
same plane are no longer equal. The differences in in-
For a structure with N atoms, each with atomic scattering
tensity are generally small, so careful measurement is re-
amplitude f i and position ri in the unit cell, the structure
quired. Coster et al. demonstrated that a noncentrosym-
factor for the Bragg reflection of index h is
metric crystal structure could be distinguished from its
inverted image using these differences. Later, Bijvoet re-
N
alized this principle was more general and used it to de- F(h) = f i (h)Ti (h) exp(2πih · ri )
i=1
termine the absolute configuration of the sodium rubid-
ium salt of (+)-tartaric acid. In 1983, Flack developed a where Ti is the temperature factor of the ith atom. Differ-
method for distinguishing a noncentrosymmetric structure ences between neutron diffraction and X-ray diffraction lie
from its inverse. Using the method of Flack, any noncen- in the scattering amplitudes f i (h) and in the temperature
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
136 Crystallography
parameters Ti (h). In X-ray diffraction, X-rays are scattered periment, positions of hydrogen atoms will be more poorly
by the electrons of an atom, and the scattering factors f i (h) determined than positions of carbon atoms. In the neutron
are strongly dependent on scattering angle. At a scattering experiment on the same compound, hydrogen would con-
angle of 0◦ , f i is proportional to Z , the atomic number of tribute 24% and carbon 76%. (Because of the negative
the scattering atom. As the scattering angle increases, the sign on the scattering length of hydrogen, a Fourier sum-
scattering factor f i (h) decreases. Thermal motion causes mation will show “holes” rather than peaks at hydrogen
the scattering to fall off even more strongly. These factors positions.) For deuterobenzene, the contributions from C
limit the resolution available in an experiment and make and D (in the neutron-scattering experiment) are virtually
it difficult to determine the positions of light atoms ac- identical, with each contributing 50% of the scattering.
curately in the presence of heavy ones or to distinguish Thus, neutron diffraction can locate hydrogen and deu-
among heavy atoms that have similar atomic numbers. terium atoms with the same precision as carbon, nitrogen,
In neutron diffraction, the scattering is primarily from and oxygen, while X-rays cannot. For this reason, stud-
the nucleus. Since the diameter of the nucleus is small ies of hydrogen bonding in biologically significant com-
relative to the wavelength of thermal neutrons, the scat- pounds such as amino acids and sugars were among the
tering factor is a constant, characteristic of the particular early experiments using neutron diffraction.
nucleus and independent of scattering angle. There is no As the precision of the experiments and the sophistica-
simple relationship between the scattering amplitudes and tion of the refinements improved, it became obvious that
the nuclear mass or charge. Nuclei with similar atomic there were systematic differences in the positional and
number can have significantly different scattering ampli- thermal parameters from the two experiments that were
tudes. Hence, neutron diffraction can distinguish among much larger than expected from the estimated standard
near neighbors in the periodic table and so is useful in the deviations. The differences are very pronounced for hy-
study of alloys. drogen atoms. Even in the most precise, low-temperature
Table II shows the relative scattering lengths of a num- studies, electron density maxima for hydrogen atoms were
ber of atoms for X-rays and neutrons. If the average con- displaced by as much as 0.2 Å from the positions of pro-
tribution to the intensity of a structure factor is tons determined from neutron diffraction. Since the C–H
and O–H bond lengths in the neutron experiment agree
n
P(i) = f i2
f i2 with spectroscopic measurements, it was recognized that
i=1 the apparent shortening of the bonds to hydrogen observed
in X-ray experiments is a bonding effect. The position of
Then, for a compound such as benzene with equal numbers
maximum electron density does not coincide with the po-
of hydrogen and carbon atoms, carbon will contribute 97%
sition of the nucleus because the formation of the cova-
and hydrogen will contribute 3%. Thus, in the X-ray ex-
lent bond perturbs the electron-density distribution in the
atom.
TABLE II Scattering Amplitudes for Selected Similar, but smaller, effects are observed for first-row
Elements (10−12 cm)a atoms, C, N, O, etc. Typically, the discrepancy is of the
order of 0.01 Å, but it depends critically on the range
Scattering amplitude
Atomic of Bragg angles included in the X-ray refinement. If the
Element number X-ray Neutron refinement is based on very-high-order data [(sin θ )/λ >
1H 1 0.28 −0.37 1.00 Å−1 ], the discrepancies will be much reduced; how-
2D 1 0.28 0.67 ever, few organic compounds scatter to such high angles
12 C 6 1.69 0.66 even at liquid-nitrogen temperature.
14 N 7 1.97 0.94 Systematic differences in temperature factors also re-
16 O 8 2.26 0.58 flect the very real differences in the scattering processes
32 S 16 4.51 0.28 in the two experiments. In aromatic molecules, thermal
Clb 17 4.79 0.96 vibration parameters are greater for X-rays than for neu-
Brb 35 9.87 0.68 trons in the plane of the ring and smaller perpendicular to
127 I 53 14.95 0.52 the ring, implying that electron density is smeared in the
238 U 92 25.92 0.84 plane of the ring by covalent bonding and contracted in
the perpendicular direction.
a For X-rays, the scattering amplitude at 0◦ is given by
Atoms with lone pairs of electrons show significant
(e2 /mc2 ) f 0 or ((0.282 × 10−12 cm) × atomic number).
differences in both positional and thermal parameters in
Values for neutrons are taken from “International Tables
for X-Ray Crystallography,” Vol. IV, 1974, pp. 270–271. the two experiments because the lone-pair density is not
b These values are for the elements in their natural centered on the nucleus. These very real differences sug-
isotopic abundances. gest strongly that neutron diffraction is the preferred tool
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 137
for determining atomic parameters, whereas the X-rays drugs, the phenomenon of one-dimensional conduction, or
measure the electron density in the solid. The combina- any other structure–property relationship) requires the de-
tion of the two techniques provides a means of studying tailed analysis of a large number of structures and correla-
bonding effects (this topic was covered in greater detail in tion with results from other disciplines. Databases provide
Section II.B.3). the kind of information required for structure–function
studies and are very valuable to the scientific community.
The topics presented here are intended to be illustrative of
VI. RESULTS the many ways that the databases have been used. There
are many other uses not documented here.
The determination of a single crystal structure may answer
A. Crystallographic Databases
a question about the connectivity of a molecule or some
detail of its conformation. The direct results of structure At present there are eight major databases for crystal-
analysis are the positional, thermal, and occupancy pa- lographic results continuously maintained and updated
rameters of atoms in the asymmetric unit. Bond lengths in different laboratories in Europe and North America.
and angles, torsion angles, and intermolecular associations In each case the data are available in machine-readable
(such as hydrogen bonding) are all derived from these ba- form, and considerable effort has been expended to de-
sic structural parameters. While the final R-value that a velop efficient computational algorithms for searching the
structure refines to is often a good indicator of the quality files and correlating the data. Perhaps the most impor-
of a structural determination, a plot showing the thermal tant feature of the databases is that the data have been
ellipsoids (Fig. 6) can also give an indication of the quality checked and errors are corrected when possible or flagged
of a structural determination. Errors in the structure deter- if uncorrected. Some of these databases have been in-
mination and thermal motion in the molecule can distort corporated into commercial packages. Some are avail-
these ellipsoids. able in both printed and computer-readable form. In the
Understanding a complex process (such as the mech- brief summary that follows, they are listed in alphabetical
anism of a reaction, the biological activity of a class of order.
FIGURE 6 Thermal ellipsoid plot from a crystallographic structure determination. Hydrogen atoms are plotted as
small balls with an arbitrary radius; non-hydrogen atoms are plotted as ellipsoids, with the axes corresponding to the
thermal parameters for the atom. Notice that the ellipsoids gradually increase in size and become more asymmetric
moving out the chain from C8 to C16. The ellipsoids can become large and more asymmetric for noncovalently bounds
groups such as the coordinated nitro group (N2 , O1 , O2 , and O3 ). In both of these cases, the axes of the ellipsoids
are aligned with expected vibrational modes and do not indicate any major problems with the structure. If the axes
were poorly aligned and/or the ellipsoids more asymmetric, this could be an indication of problems with the structural
determination.
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
138 Crystallography
1. Cambridge Structural Database The data stored include chemical name, chemical for-
mula, density, lattice parameters, space groups, atomic
(http://www.ccdc.cam.ac.uk/prods/csd/csd.html; Cam-
coordinates, oxidation state, temperature factors, remarks
bridge Crystallographic Data Center (CCDC), University
regarding conditions of measurement, R-values, and
Chemical Laboratory, Lensfield Road, Cambridge, CB2
bibliographical references. Online access to the file is
1EW, U.K.) The Cambridge Structure Database (CSD)
available. It is also possible to lease the entire database.
is the largest database of experimentally determined or-
The database now contains over 53,000 entries and is
ganic and metallo-organic crystal structures in the world;
updated twice a year.
inorganic carbon compounds (such as carbonates and
cyanides) are excluded. The CCDC also provides a suite
of graphical search, retrieval, data manipulation, and vi- 5. Metals Data File
sualization software for use with the database. The CSD
(Manager, CAN/SND, Canada Institute for Scientific and
contains bibliographic data, tables of connectivity, atomic
Technical Information, National Research Council (NRC)
positions, cell dimensions, and quality indicators for vir-
of Canada, Ottawa, Canada K1A OS2.) The Metals file
tually all three-dimensional structures of organic com-
contains structural data for metal and alloy structures de-
pounds published since 1935. The CSD currently contains
termined since 1913 based on either powder or single-
over 210,000 entries and is growing at a rate of approx-
crystal diffraction. Under an exclusive license from the
imately 15,000 entries per year. The CSD is available to
NRC, Toth Information Systems has maintained and up-
scientists throughout the world.
dated the database. If available, the following information
is included: formula, cell dimensions, structure type, Pear-
2. Crystal Data son symbol, atomic coordinate, temperature parameters,
(http://www.nist.gov/srd/nist3.htm; Crystallographic Sec- occupancy factors, R-values, method of refinement, in-
tion, National Bureau of Standards, Washington, D.C. strument used, radiation, and bibliographical information.
20234.) This database contains lattice parameters for all New software for manipulating the file has been developed
crystals whose dimensions have been reported by X-ray, by Toth Information Systems.
neutron, or electron diffraction on single crystals or fully
indexed powders. Data include name, formula, cell di- 6. Nucleic Acid Database
mensions, space groups, number of molecules in the cell,
density (measured and calculated), bibliographical data, (http://ndb-mirror-2.rutgers.edu/NDB/ndb.html; Dr. H. M.
crystal habit, melting point, etc. Crystal Data accepts data Berman, Department of Chemistry, Rutgers University,
from the other databases. 610 Taylor Road, Piscataway, NJ 08854-8087.) The goal
of the Nucleic Acid Database (NDB) project is to assem-
ble and distribute structural information about nucleic
3. Electron Density Data Base acids. Online access to the NDB is freely available. A
(Prof. H. Burzlaff, Lehrstuhl fur Kristallographie, Institut variety of tools have been developed in conjunction with
fur Angewandte Physik der Universitat, Bismarckstrasse the NDB to provide a robust user interface.
10, D-91054 Erlangen, Germany.) This database contains
accurate structure factors for crystal structures whose elec- 7. Powder Diffraction File
tron densities have been carefully determined. This is the
type of data required for studying bonding effects, co- (http://www.icdd.com/ productsold/pdf2.htm; Interna-
valency in organometallics, and other details of electron tional Center for Diffraction Data (ICDD), 1601 Park
distribution. Lane, Swarthmore, PA 19081.) The Powder Diffraction
File (PDF) is a compilation of powder diffraction pat-
terns produced and published by the Joint Committee on
4. Inorganic Crystal Structure Database
Powder Diffraction Standards of the ICDD. The PDF is
(http://crystal.fiz-karlsruhe.de/portal/cryst/ab icsd.html.) the world’s largest and most complete collection of X-ray
The Inorganic Crystal Structure Database (ICSD) was powder diffraction patterns. The 1999 release of the PDF
initiated in 1978 at the Institute for Inorganic Chemistry contains about 115,000 patterns, of which 20,000 are or-
at the University of Bonn. Today the database is produced ganic and 95,000 inorganic. Included in the database are
by FIZ Karlsruhe (P.O. Box 2465, D-76012 Karlsruhe, calculated powder patterns for almost 40,000 compounds.
Germany) in cooperation with NIST. “Inorganic” is The PDF is used for identification of crystalline materi-
defined to exclude metals, alloys, and compounds with als by matching d-spacings and diffraction intensity mea-
C–H and C–C bonds (with the exception of graphites). surements. Each pattern includes a table of interplanar
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 139
140 Crystallography
acetals, the geometry of nearly 2400 ethers and esters was Examples of this include mapping of the methionine bind-
investigated. The results show clearly that there is sub- ing site in the enzyme S-adenosyl ATP transferase by
stantial, systematic variation in C–O bond lengths. The Marshall and colleagues, and the postulating of a com-
fragment of interest is defined as R1 –O–R2 . Data are di- mon site of action for gamma-butyrolactone analogs and
vided into four categories, depending on whether R1 is picrotoxinin. On the basis of this model, it has been pro-
methyl, primary, secondary, or tertiary, and each of those posed that the convulsant activity of these compounds is
categories is then divided according to the effective elec- related to their ability to block the passage of chloride ions
tronegativity of the group R2 . The four categories of R2 through channels. The model would appear to be applica-
are alkyl, aryl, enol/ether, and acyl/carboxylic ester, giving ble to a number of convulsant and anticonvulsant drugs.
16 categories in all. The shortest C–O bonds (1.418 Å) are
found in compounds where R1 is methyl and R2 is alkyl.
E. Crystallography and Molecular Mechanics
Within each of the four categories of R1 , the length of the
R1 –O bond increases with increasing electronegativity of Molecular mechanics is an empirical method for calcula-
the R2 group. The effective electronegativity of a group tion of properties of molecules such as molecular geome-
can be estimated from the pK a of its conjugate acid. In the try, heat of formation, strain energy, dipole moment, and
subset of 2-(aryloxy)-tetrahydropyrans, there is a linear re- vibrational frequencies. Different programs use different
lationship between the length of the exocyclic C–O bond parameter sets and reproduce these physical properties
and the pK a of the leaving group. This implies a linear with different degrees of fidelity. Parameters are assumed
relationship between the bond length and the free energy to be transferable from one type of molecule to another.
of activation for the hydrolysis reaction or any other reac- The geometrical parameters used in molecular-
tion in which the C–O bond is cleaved. The consequences mechanics programs are frequently derived from crys-
of these generalizations to the chemistry and reactivities tallographic data. However, the predictive value of the
of acetals and glycosides are fully explored in these pa- method is limited by the datasets used to derive the pa-
pers. In the third paper in the series, the authors plot the rameters. For instance, a parameter set derived from un-
reaction coordinate for six aryl tetrahydropyranyl acetals. crowded hydrocarbons is not likely to predict structures of
Both relative free energy and pK a are plotted as a func- crowded hydrocarbons with satisfactory accuracy. Tran-
tion of the bond length. The Morse function is also plotted, sition states, small rings, unusual states of hybridization,
as is a reaction coordinate–energy contour diagram. This and other electronic effects may require special treatment.
work by Kirby et al. is one of the most comprehensive These caveats notwithstanding, the method has enjoyed
attempts at deriving structure–reactivity relationships so considerable success.
far available. It is not uncommon for molecular-mechanics calcula-
tions to be used to provide a starting point for the re-
finement of electron diffraction data. A recent example
D. Drug Design
is provided by the study of bicyclo[3.2.0]heptane. Many
The coupling of the resources of databases with molec- other examples are available in the literature.
ular graphics devices and QSAR (quantitative structure– Molecular mechanics can also be useful in interpre-
activity relationships) techniques raises a tantalizing pos- tation of crystal structures, particularly in differentiating
sibility that therapeutically useful new structures might be electronic and steric effects or in estimating the effects
predictable. Detailed comparison of structures of known of packing forces. In the structure of lepidoptrene, a C–C
agonists and antagonists is the major strategy currently bond was observed to have a length of 1.64 Å in the X-ray
in use. Similarities in the three-dimensional structure of experiment, while molecular mechanics predicted a length
portions of the molecules allow identification of the fea- of 1.57 Å. Mislow and colleagues demonstrated that the
tures required for binding. Differences in other portions additional lengthening of the bond beyond the amount ex-
of the structure may account for the agonist/antagonist re- pected from steric strain was caused by “through-bond”
sponse after binding. Once the pharmacophore has been coupling of adjacent π systems.
identified, binding studies can be combined with structural Although molecular-mechanics calculations can be use-
comparisons to map the receptor site. ful in assessing which aspects of a structure control con-
A sum or superposition of active molecules can be used formation, it should be pointed out that conformational
to define the available volume within a receptor. Inac- parameters from molecular mechanics tend to be less
tive molecules then define excluded volumes—volumes reliable then bond lengths or bond angles. Also, confor-
occupied either by the receptor itself or by a cofactor. mational parameters in crystal structures are most influ-
Any new drug must be designed to present the correct enced by packing forces. Molecular-mechanics calcula-
pharmacophore and to occupy only the available volume. tions have been used to suggest alternative conformations
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 141
(having energy similar to that observed in the crystal) integrated path-length for incident and diffracted beams
that might exist in the gas phase or in solution. Addi- for all reflections.
tionally, molecular-mechanics has been used to optimize For crystals with no reentrant angles, the most com-
the geometry of a pharmacophore in model studies of mon methods are the analytical method and the method
drug–receptor binding, to evaluate the interaction ener- of Gaussian quadrature. In both cases, the objective is to
gies between dinucleoside monophosphates and cationic evaluate the integral,
intercalators such as ethidium bromide, and to interpret 1
conformational polymorphism. Although some caution is A= exp(−µT ) dυ
V crystal
obviously necessary, molecular mechanics and crystallog-
This can be done analytically if the crystal is divided into
raphy can provide complementary information in a variety
a number of polyhedra in each of which the path-length
of cases.
is a linear function of the coordinates. De Meulenaer and
Tompa first programmed this method in 1965. Calcula-
tion time is independent of the severity of absorption. The
APPENDIX I: FACTORS
factor-limiting accuracy in cases of severe absorption is
AFFECTING INTENSITIES
the precision of measuring the crystal, particularly in its
shortest direction.
A number of geometrical factors influence the intensities
Gaussian quadrature is a numerical integration method
observed in a diffraction experiment. The most impor-
that evaluates an integral by summing an appropriate poly-
tant of these are Lorentz, polarization, absorption, and
nomial. It uses a nonisometric grid in which the inter-
extinction corrections. The first three corrections are nor-
val is subdivided symmetrically about the midpoint, with
mally applied to the observed intensities in the process
large spacings near the middle and smaller ones toward
of calculating the structure amplitudes |Fhkl |. Many re-
the edges. This tends to put the maximum number of grid
searchers feel that the absence of this correction is the sin-
points near the surface of the crystal where the change of
gle largest source of systematic error in crystal structures
absorption with path-length is largest. The number of grid
in the current literature. Extinction is a correction made
points determines the precision of the calculation. Thus,
during least-squares refinement and so is a correction ap-
if a 4 × 4 × 8 grid gives a precision of 2% in calculated
plied to the model rather than to the observations. How-
transmission for µ = 2.5 cm−1 , a grid of 8 × 8 × 16 is
ever, the correct formulation for extinction depends on
required to produce the same precision if µ = 5.0 cm−1 .
both the polarization and the path-length (calculated dur-
By choosing sufficient points, the Gaussian method can
ing absorption correction) and so is discussed with these
reproduce the analytical result to any desired precision;
other correction terms.
however, for strongly absorbing crystals, the analytical
method is the method of choice.
A. Absorption When a crystal has reentrant angles, neither of these
methods can be used, and one possible resort is numeri-
The absorption of X-rays by crystals obeys the relation
cal integration with an isometric grid; however, the grid
I = I0 exp(−µt), where µ is the linear absorption coef-
must be very fine in order to achieve reasonable precision.
ficient, in units of (length)−1 , and t is the thickness. The
For crystals mounted in capillaries, crystals with irregu-
general effect is to reduce the intensity of reflections at low
lar shapes, and other pathological cases, absorption may
sin θ. If the crystal is centrosymmetric in cross-section,
be dealt with by measurement of a transmission surface
neglect of this correction will have little effect on posi-
as proposed by Huber and Kopfmann in 1969. However,
tional parameters, although scale and thermal parameters
these problems can be dealt with by the program DIFABS,
may be very strongly affected. If the crystal cross-section
developed by Walker and Stuart in 1983, which models the
is not centrosymmetric, all structural parameters will be
absorption surface by a Fourier series in polar coordinates.
systematically wrong. Neglect of absorption is probably
The coefficients are obtained by minimizing the sum of
the largest single source of systematic error in published
squares of residuals between observed and absorption-
structures.
modified values of the structure factors. The chief virtue
Absorption corrections can be done by collecting
of the method is that it can be used even when the crystal
ϕ-scans and calculating an empirical correction or by
is no longer available
indexing and measuring crystal faces and calculating a
face-indexed absorption correction. For a face-indexed
B. Lorentz Factor
correction, the size and shape of the crystal must be pre-
cisely determined. Once the crystal shape is established, The Lorentz factor corrects for the relative speeds with
a number of techniques are available for calculating the which different reflections pass through the reflecting
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
142 Crystallography
position. The intensity of a reflection produced by a mov- parallel or perpendicular to the equatorial plane of the
ing crystal depends on the time taken for the corresponding diffractometer. If the monochromator is in the incident
reciprocal lattice point to pass through the sphere of re- beam and mounted so that its axis is perpendicular to
flection. Using 1/(ωS cos θ) as the angular velocity, the the equatorial plane, the component that was attenuated
definition S = 1/d, and Bragg’s law, λ = 2d sin θ , the cor- by reflection from the monochromator is attenuated again
rection takes the form: by the sample, so that the polarization correction for the
twice-reflected beam becomes:
d/(ω cos θ ) = λ/(2ω sin θ cos θ) = λ/(ω sin 2θ)
p = 1 + cos2 2θm cos2 2θ 1 + cos2 2θm
The factor 1/(sin 2θ) is the Lorentz factor.
For a monochromator mounted with its axis parallel to
the equatorial plane, reflection from the monochromator
C. Polarization
attenuates the beam normal to the equatorial plane, while
X-rays are polarized on reflection, and the degree of polar- reflection from the sample attenuates the parallel compo-
ization depends on experimental conditions. Neutrons are nent, so that
not polarized on reflection from ordinary crystals. X-rays
p = cos2 2θm cos2 2θ 1 + cos2 2θm
produced from an X-ray tube or a rotating-anode gener-
ator are unpolarized; that is, all directions of the electric These formulas assume that the monochromator crystal
vector normal to the direction of propagation are equally is an ideally mosaic crystal. For a perfect or non-mosaic
represented. Thus, the beam can be regarded as composed crystal, the factor cos2 2θm should be replaced by |cos θm |.
of two components, one polarized parallel to the reflection In practice, the polarization ratio seldom corresponds
plane and one perpendicular. The relative intensities of the to either of these ideal values and may not even lie be-
two components are tween them. This has led some investigators to recast the
equations in the form:
I|| = I⊥ = 12 I0
p = (1 + K cos2 2θ )/(1 + K )
For an ideally imperfect crystal, the intensity of radia-
tion scattered in a particular direction is proportional to for the monochromator axis perpendicular to the equato-
sin2 φ, where φ is the angle between the electric vector rial plane, and
and the direction of observation. It follows that the par-
p = (K + cos2 2θ )/(1 + K )
allel component will be attenuated by reflection but the
perpendicular component will not. Thus, the relative in- for the parallel orientation, where K is the actual measured
tensities of the two components after reflection at an an- value of the polarization ratio for the monochromator in
gle θ will be I = 12 I0 sin2 (90 − 2θ ) = 12 I0 cos2 2θ and question. The value of K is different for different wave-
I⊥ = 12 I0 sin2 90◦ = 12 I0 . Thus, the beam is partially po- lengths. For routine structural work with MoKα radiation,
larized at all angles and completely polarized at 2θ = 90◦ . the error in assuming that the monochromator is an ideal
For data monochromated by means of a β filter, the mosaic will generally be small. However, maximum error
polarization correction is occurs at θ = 45◦ and so is important in the case of very
precise studies that rely on high-angle data. Vincent and
p = 12 (1 + cos2 2θ )
Flack have developed a method for determining K , the
Frequently, Lorentz and polarization corrections are com- polarization ratio, without special equipment.
bined to give:
L p = (1 + cos2 2θ )/2 sin 2θ D. Extinction
These corrections are applied to the observed intensity to As early as 1922, Darwin realized that absorption is not
derive structure amplitudes: the only effect that attenuates the X-ray beam as it passes
through the crystal. He described two phenomena, which
|Fhkl | = Ihkl /L p
he designated as primary and secondary extinction, and
In many modern diffractometers, monochromatization of showed how they could be treated mathematically.
the primary beam is achieved by Bragg reflection from Primary extinction is an interference process. If a set
a suitable crystal. Three of the most commonly used of planes is in a position to reflect, the reflected rays may
monochromator crystals are quartz (d = 3.35 Å), LiF also be reflected a second time. Since there is a phase
(d = 2.01 Å), and highly oriented graphite (d = 3.35 Å). change of π/2 on reflection, a beam that has been reflected
The monochromator may be installed in the incident beam n times will be exactly out of phase with one that has
or the diffracted beam and may be mounted with its axis reflected (n–2) times. This causes the reflected intensity
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 143
to be proportional to |F| rather than |F|2 . A crystal for It contains both a breadth parameter and a misalignment
which this is true is called an ideally perfect crystal. Such parameter. If β g, then the broadening of the diffrac-
crystals are rare. It is much more common to encounter tion peak is dominated by mosaic spread, and we have a
crystals where I ∝ |F|n where I < n < 2 but nearer to 2. type I extinction. Type II extinction, when β g, is less
Darwin modeled the phenomena of extinction by as- commonly encountered and corresponds to the situation
suming that crystals were made up of mosaic blocks, where the misalignment is small and the breadth of the
slightly misaligned with respect to one another. In perfect diffraction peaks is controlled by small domain size. Sep-
crystals, the blocks are assumed to be large and the mis- aration of the terms is possible if two determinations are
alignments small. Secondary extinction occurs because made on the same crystal with different wavelengths. This
the planes first encountered by the incident beam reflect is rarely done.
so strongly that deeper planes receive less radiation and Coppens and Hamilton first extended the treatment of
so reflect with less power than they otherwise would have extinction to an anisotropic model in 1970. The crystal
done. This effect is pronounced for strong reflections with is modeled as if it were composed of ellipsoidal parti-
|F/V | of the order of 0.1 × 10−24 cm−3 . cles whose misorientations follow a Gaussian probability
According to the mosaic model, the effect is expected in distribution. Since there is no need for the distribution
crystals where the mosaic blocks are small with respect to of mosaic blocks to obey any symmetry in the crystal,
the size of the crystal. A mosaic crystal in which the blocks symmetry-equivalent data are not averaged in this
are sufficiently misaligned that secondary extinction is treatment.
negligible is called an ideally imperfect crystal. Since such
crystals are seldom encountered in experiments, it is con- E. Anomalous Dispersion
venient to correct for extinction in least-squares refine-
ment. In most current programs, the correction is based When the wavelength of the incident X-ray beam is close
on Zachariasen’s 1967 formalism for isotropic extinction, to the absorption edge of a scattering atom, the atomic
in which the mosaic blocks are assumed spherical: scattering factor for that atom becomes complex:
−1/4 f = f 0 + f + i f
Fc∗ = k|Fc | 1 + 2r ∗ Q 0 T̄p2 p1
where k is the scale factor, Fc is the calculated value of the where f 0 is the normal scattering factor for wavelengths far
structure factor, r ∗ = β[1 + (β/g)2 ]1/2 where β = 2t̄/3λ, t̄ from the absorption edge, and f and i f are correction
is the mean path length in a single domain, and g is re- terms. The quantity f is usually negative, and i f
lated to the mosaic spread distribution and is frequently is always π/2 radians ahead of the real part in phase.
assumed to be Gaussian. For X-rays, For structural work, the corrections are assumed to be
2 3 independent of scattering angle. In addition, f = 0 for
e FK λ wavelengths longer than the absorption edge.
Q0 =
mc2 V sin 2θ Four aspects of anomalous dispersion important in nor-
mal structural work in crystallography are
where e2 /mc2 is the classical radius of an electron, and K
is the polarization ratio. For neutrons, 1. Determination of absolute configuration
2 2 2. Solution of the phase problem (structure solution)
F λ
Q0 = 3. Distinguishing among atoms of similar scattering
V sin 2θ
power
The term T̄ is the mean path-length in the crystal and 4. Avoiding systematic errors in structures with polar
represents an integration of incident and diffracted beams space groups
over all diffraction paths in the crystal. This is normally
evaluated during the calculation of absorption correction. Knowledge of the absolute configuration is extremely im-
If absorption is small in the crystal, T̄ may be arbitrar- portant in physiologically active materials, since biolog-
ily set to some value such as 0.03 cm. Finally, the term ical systems discriminate strongly between enantiomor-
pn is a polarization term generally assumed to have the phous forms of their substrates. Anomalous scattering
form 1 + cos2n 2θ appropriate to filtered radiation. If a with phase change causes a breakdown in Friedel’s law,
monochromator is used, the appropriate form of the po- and Ihkl = Ihkl . This effect was first exploited for deter-
larization factor should be incorporated into the extinction mination of the absolute configuration of the sodium ru-
calculation. For very precise work, the actual polarization bidium salt of (+)-tartaric acid using Zr radiation (λ =
ratio of the monochromator should be determined experi- 6.07 Å), whose wavelength is slightly shorter than that of
mentally. For extinction correction, the term refined is r ∗ . the absorption edge of Rb (λ = 6.86 Å). The experiment
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
144 Crystallography
coupled with the known relationship between the stere- F. Scaling of Data
ochemistry of (+)-tartaric acid and (+)-glyceraldehyde
Virtually all methods of solving structures require a rea-
showed that Fischer’s arbitrary choice had been correct.
sonable estimate of the relative scale factor between the
With modern data collection techniques, the determination
observed and calculated structure factors. Most computer
of absolute configuration is relatively uncomplicated. In
programs that calculate scale and temperature factors are
favorable cases, the method can be applied to a compound
based on Wilson’s 1942 equation:
with no atom heavier than oxygen when the incident ra-
diation is CuKα (λ = 1.54 Å). Rabinovich and Hope have Ihkl
= f i2
determined the absolute sign of the torsional angles in i
the achiral compound 4, 4 -dimethylchalcone, C17 H16 O. which says that the local average value of the intensity is
These authors believe that the determination of absolute equal to the sum of the squares of the scattering factors. It is
configuration may be possible with hydrocarbons. assumed that the average is taken over a sufficiently narrow
In structure solution, anomalous scattering without range of (sin θ)/λ so that the f values can be treated as
phase change is formally equivalent to isomorphous re- constants:
placement. An anomalous-difference Patterson function
Ihkl = k f i2 exp[−2B(sin2 θ/λ2 )]
is analogous to an isomorphous-difference Patterson and
i
so contains peaks only for vectors between anomalously
scattering atoms and vectors between anomalous scatter- Ihkl f i2 = k exp[−2B(sin2 θ/λ2 )]
ers and normal scatterers. Vectors between normal scat- i
terers do not appear.
Anomalous dispersion with phase change ( f = 0), loge Ihkl f i2 = loge k − 2B(sin2 θ/λ2 )
can be used to determine the phase angles from noncen- i
trosymmetric crystals. In the case where the position of
Thus, a plot of loge [Ihkl / i f i2 ] versus (sin2 θ )/λ2 will
the anomalous scatterer is known, the procedure requires give a straight line of slope 2B and intercept k.
that differences in intensity between be measurable for a A number of conditions must hold so that the values of
significant number of intensities. B and k are reasonable. The sampling interval should be
Anomalous dispersion provides an elegant means to small, so 40 to 50 intervals of (sin θ )/λ are required for
distinguish among near neighbors in the periodic table three-dimensional datasets. Weak reflections must be in-
that would otherwise have similar scattering power. Al- cluded. Elimination of all reflections with I < 3σ (I ) will
loys such as β-brass (Cu–Zn) and Cu2 MnAl were early cause the average intensity to be too high in the high ranges
examples of structures determined by this technique. The of (sin θ)/λ, and that will cause an underestimate of the
tunability of the synchrotron source allows choice of a temperature factor and a corresponding overestimate of
wavelength close to the absorption edge of an element the scale factor. Including weak reflections at half of the
in the sample to maximize the anomalous component of local minimum observed intensity is better than leaving
scattering. them out, but a Baysian fill is probably a better strat-
Neglecting to correct for anomalous dispersion will ob- egy. Large excursions on both sides of the best-fit line
viously introduce a small error in the magnitudes and are quite common and reflect the facts that Wilson’s for-
in the phases of the calculated structure factors. It was mula was derived for a random distribution of atoms and
long assumed that such errors would have little effect on real structures contain many repetitions of certain bonded
atomic positions. However, Cruickshank and McDonald and nonbonded distances. The inflection points (the re-
have pointed out that neglect of the correction will always gions in the graph where the experimental points cross the
cause errors in thermal parameters, and in the case of po- straight line) are relatively constant for different types of
lar space groups very serious errors in coordinates can structures. Hall and Subramanian recommend an “inflec-
arise. The size of the error varies directly with f and tion point least squares” in which the least-squares line
inversely with the resolution of the data. For a moderately is fitted to 15 points: the five lowest angle points, five
heavy atom such as Co (Z = 27), the error in coordinates points nearest to (sin2 θ)/λ2 = 0.15, and five points near-
can be as large as 0.06 Å in the experiment done with est (sin2 θ)/λ2 = 0.26.
Cu radiation. Neglect of anomalous dispersion will cause
errors of the order of 0.005 Å for structures with atoms no
G. Thermal Diffuse Scattering
heavier than oxygen with Cu radiation or sulfur with Mo
radiation. The error caused by including f and choos- Thermal diffuse scattering (TDS) arises mainly from
ing the wrong enantiomer is twice as large. low-frequency acoustic modes in the crystal. First- and
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 145
second-order TDS cause the scattering density to peak un- postulate trial structures and see whether the pattern of in-
der Bragg peaks, with the degree of peaking related to the tensities matched. The method cannot generally be applied
velocity of sound in the crystal. The effect is not removed to molecular structures in low-symmetry space groups.
in normal data-reduction techniques and is different in
different directions in the crystal. In normal structure de-
termination, TDS is ignored. The result is a systematic B. Transform Methods
decrease in apparent thermal parameters. Since TDS in- A crystal can be regarded as convolution of the lattice
creases with increased (sin θ)/λ, it enhances the apparent with the unit-cell contents. By the convolution theorem,
intensity of high-order diffraction data. the transform of a convolution is the product of the trans-
In very precise work, such as the determination of forms. The diffraction pattern is the transform of the crys-
charge-density distribution, it is extremely important that tal structure and so must be the product of the delta func-
the effect be eliminated or accounted for. It should be noted tion representing the reciprocal lattice and the transform
that the amount of TDS included in a diffraction profile of the unit cell contents.
will generally be different for X-ray and neutron experi- From the properties of a delta function we know that
ments, since it depends on such experimental conditions the product f (x) δ(x − x0 ) has values only at x0 since
as primary beam divergence, wavelength spread crystal δ(x − x0 ) = 0 if x = x0 . Thus, the diffraction pattern of
dimensions, and counter aperture. Extensive calculations a crystal can be regarded as the transform of the unit-cell
are required to correct for TDS, and most formulations contents sampled at the points of the reciprocal lattice.
demand that the elastic constants of the crystal be known. This implies that a direct plot of the weighted recipro-
However, cooling the crystal can reduce the effect. If α cal lattice can give some information about the structure.
is defined as I (TDS)/I (Bragg), cooling from room tem- The method is used frequently to solve the structures of
perature to liquid-nitrogen temperature will reduce α by polynuclear aromatic hydrocarbons. In these compounds
a factor of 5. Cooling to liquid-helium temperature will the dominant features of the diffraction pattern are the ben-
reduce α by another factor of five. Facilities for X-ray zene transform and the fringe function showing the sepa-
diffraction experiments down to liquid-nitrogen tempera- ration of the molecules. The molecules generally crystal-
ture are fairly common. Helium cryostats are rare. lize in centrosymmetric space groups with one short axis
roughly perpendicular to the plane of the molecule.
Figure 8a shows the calculated transform of a regular
APPENDIX II: METHODS OF hexagon, the transform of a benzene ring. It is character-
STRUCTURE SOLUTION ized by positive and negative regions with strong positive
peaks at a distance 0.8 Å−1 from the origin. If the hexagon
A. Trial-and-Error Methods
is tilted along an axis, the distance perpendicular to the
The earliest structures that were determined by X-ray axis of tilt will be foreshortened in crystal space. The cor-
diffraction were mineral structures with relatively high responding distances in the transform will be elongated.
symmetry. Intensities were measured as strong, medium, Simple geometry will allow calculation of the angle of tilt.
and weak, and most of the atoms sat on special positions Determination of the separation is relatively simple.
in the cell. From knowledge of the density of the mate- Consider two centrosymmetric molecules related by a cen-
rial, its chemical formula, and the space group, one could ter of symmetry with a separation of molecular centers
FIGURE 8 Transform of a hexagon. The circle has a radius of 0.8 Å−1 . (a) Regular hexagon; (b) tilted hexagon;
(c) geometric construction to determine tilt of hexagon in crystal space.
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
146 Crystallography
FIGURE 9 (a) Transform of naphthalene with correctly oriented reciprocal lattice; inset shows naphthalene orien-
tation used to calculate the transform. (b) Weighted reciprocal lattice of naphthalene for comparison with transform.
Reflections such as 202 and 801, which lie on regions of the transform where density is changing, are much more
sensitive to orientation than are those such as 203 that lie in the middle of strong areas.
of 5 Å. Since the molecules are identical in shape and The location of the heavy atom can be determined from
orientation, the combined transform will be that of a sin- the Patterson function:
gle molecule crossed by straight fringes. Regions in the 2
combined transform are strong only if the corresponding P(uυw) = (Fhkl )2
V h k l
region in the single transform is strong, but weak regions
arise either from weak regions in the single transform or × [cos 2π (hu + kυ + lw)]
from the zeros of the fringe system. A line perpendicular
to the fringes is the direction of the line joining the cen- = p(x yz)
ters of molecules, and the separation of the molecules is volume of cell
reciprocal to the spacing of the fringes. × p(x + u, y + υ, z + w) dυ
If the transform and the reciprocal lattice are drawn on
the same scale, the correct relative orientation of one to the The Patterson function can be calculated directly from the
other can be established by matching strong areas of the intensities with no previous knowledge of the phases. It
diffraction pattern with strong areas in the transform. For is the self-convolution of the electron density. This means
fine adjustment of the orientation, attention must be paid that a peak at uυw represents a vector between two atoms
to those reflections most sensitive to orientation effects, whose separation is equal to the vector distance from the
those lying on rising or falling regions of the transform. origin to the point uυw. The weight of that peak is propor-
See, for example, reflections 202 and 801 in Fig. 9. By tional to the product of the atomic numbers of the atoms
contrast, the 203 reflection lies well within a strong area at each end of the vector. Vectors between heavy atoms
of the transform and its value will not be affected by even tend to dominate these maps and hence allow the position
fairly large changes in orientation. of the heavy atom to be determined.
Figure 10 shows a hypothetical molecule with one
C. Heavy-Atom Methods
heavy atom and its Patterson function. Note that the Pat-
In X-ray diffraction, the scattering power of an atom is terson function is always centrosymmetric. If the heavy
proportional to the square of the atomic number, Z 2 . If a atom were iodine and the light atoms were carbons, I–I
molecule contains a heavy atom (high Z ) and that atom vectors would have weight 2809; I–C vectors, 318; and
can be located, then a set of phase angles can be calculated C–C vectors, 36.
for the dataset that are approximations to the true phases. If two molecules are related by a center of symmetry as
A Fourier synthesis calculated with observed structure in Fig. 11, then in addition to the intramolecular vectors
amplitudes and phases appropriate to the heavy atom will near the origin of the vector maps, there are intermolecu-
give a map that contains the heavy atom, some light atoms, lar vectors. The I–C and C–C vectors are double-weight
and some noise. Phases based on the known atom posi- vectors because both molecules give the same pattern of
tions are better estimates of the true phase than the heavy vectors. However, the I–I vectors, which represent vectors
atom alone. The iterative procedure is repeated until all between iodine atoms across the center of symmetry, are
atoms are located. single-weight peaks.
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 147
FA = FL + FAR
FB = FL + FBR
148 Crystallography
FIGURE 12 Isomorphous replacement. (a) The difference FAB between the structure factors for the two isomorphs
A and B provides the center for |FB |. (b) The ambiguity in phase can be resolved by a third derivative.
Crystallography 149
FIGURE 13 Anomalous dispersion. The first circle is FN . The vectors FH and FH are drawn and their vector sum
becomes the center for FN H . This gives a twofold ambiguity, which can be resolved in the (a) centrosymmetric case
by noting that the phase must lie on the real axis. In the (b) noncentrosymmetric case, the ambiguity can be resolved
by using the negative anomalous component as shown in (c) or an isomorphous derivative (not shown). FN−H and
FN+H refer to Fhkl and F hkl , respectively.
a
where Q is the ratio f λ2 / f 2,h
n
, |F1,h
n
| is the magnitude of the tural information are known as direct methods. It is not
structure factor for the normally scattering atoms, |F2,h n
| is necessary to determine all of the phases. In general, about
the normal part of the structure factor for the anomalously 10× the number of non-hydrogen atoms in the cell is suf-
scattering atoms, and φ1,hn
and φ2,hn
are the associated phase ficient.
angles. There is a second equation for the Friedel mate, Virtually all of the direct-method programs currently
and it is the same as the previous one except for a minus available make use of normalized structure factors, E(h),
sign before the last term. Thus, there will be two indepen- which “correct” the structure factor F(h) for fall-off with
dent equations for each wavelength at which data are col- angle caused by the temperature factor and the scattering
lected, plus a third equation resulting for the trigonometric factors of the atoms,
identity sin2 x + cos2 x = 1. This set of equations can be 1/2
−1/2
solved if data are collected at two or more wavelengths. E(h) = F(h) εh f i2
Generally, one dataset is collected at energies below the
absorption edge so that f = 0; a second set is collected If the shapes of the scattering curves are similar, the values
above the absorption edge, with f = 0 and f hav- of E(h) can be calculated from the relation:
ing the same value as for the first experiment. The actual
−1/2
N
values for the real and imaginary parts of the scattering E(h) = εh Z j (σ2 ) 1/2
exp(2πih · r j )
factor are determined experimentally before the choice of j=1
wavelengths is made. The obvious advantage is that the
where N is the number of atoms in the cell, and εh is the av-
isomorphism is exact. In principle, the data can be col-
erage intensity multiple of the hth reflection. The average
lected on one crystal; in practice, radiation damage will
value of |E|2 (h) = 1. The most frequently used relation-
require the use of several different crystals. A generaliza-
shipin direct methods for centrosymmetric structures is
tion of the theory can take into account any number of
the 2 or triplet relationship:
types of anomalous scatterers and any number of anoma-
lous scattereres within each type. S(E h ) ≈ S E k E h−k
k
F. Direct Methods
where S(E h ) is the sign of the reflection hkl, and ≈ means
Methods of structure solution that attempt to evaluate the “probably equal to.” The probability associated with this
phases of the structure factors without recourse to struc- relationship is
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
150 Crystallography
The symbol kr implies that k ranges only over those vec-
−3/2
P+ (E h ) = + tanh σ3 (σ2 )
1
2
1
2 |E h | E k E h−k tors associated with large |E| values.
k The process of choosing initial origin-determining
N phases is similar to the centrosymmetric case, but an ad-
where σn = n
i=1 Z i and Z is the atomic number. For
example, ditional enantiomorph-determining reflection must also
be specified. The symbols are assigned in the same way
h = 633̄ and k = 790 as before and result in assignments such as φh = 2a − b.
These are converted to numerical values, and each set is
h–k = 163
then expanded and refined using the tangent formula. The
In this example from a centrosymmetric structure, if all first computer programs for structure solution used this
of these reflections are strong, it is probable that the sign method. The symbolic addition procedure generates only
of F(633) is positive. The probability increases as the a few alternatives for the values of the phases which must
magnitude of the normalized structure factors increase. be considered because the number of resulting unknown
The steps in solving a centrosymmetric structure (where symbols is usually no more than three for four.
the phases can have values of 0 or π) are In multisolution (i.e., multitrial) methods, a small num-
ber of phases are assigned arbitrarily to fix the origin
1. Evaluate E terms. This includes calculation of the and, in the case of noncentrosymmetric space groups,
normalized structure factors and sorting of the reflections the enantiomorph. Additional reflections are each as-
among eight subgroups defined by the parity of the h, k, signed many different starting values in the hope that
and l indices. one (or more) of these sets of starting conditions will
2. Form lead to a solution. Some programs use random-number
2 relationships with the strongest
reflections—those that have E values greater than some generators to set starting values for some 20 to 200
arbitrary value such as 1.2 or 1.5. phases, which are then extended and refined by the tangent
3. Determine phase. Historically, phases were deter- formula:
mined by the symbolic addition method. In this method, wk |E k E h−k | sin(φk + φh−k )
origin-determining reflections are given signs and a few tan φh = k
k k |E k E h−k | cos(φk + φh−k )
w
others,
chosen from the strongest |E| values with the most
The weighting function in the tangent formula is useful
2 interactions, are given symbols (a, b, c, etc.). Signs
can be used because, in the case of a centrosymmetric in some approaches. A new phase φh is assigned a weight
structure, the phases can only be 0 or π. Thus, Sh means the that is the minimum of 0.2αh and 1.0:
sign of the structure factor for the h reflection. The value w0 = min(0.2αh , 1.0)
for Sh could arise in one of several ways. One way is its
assignment as described above. Alternatively, Sh could ac- Although this allows rapid development of a phase set for
quire a value through the triplet relationship as follows: If each trial, it tends to lead to an incorrect centrosymmetric
Sk is know to be positive, and Sh−k is known to be a, then: solution in the case of polar space groups. For Hall–Irwin
weights, w = w0 f (α/αest ), where f = 1.0 for α < αest and
Sh = Sk Sh−k = a decreases for α > αest . This weighting tends to conserve
the enantiomorph and has been incorporated into more
During the course of the analysis, relationships appear
recent versions of the multisolution programs. The major
among the symbols, such as ac = e. Manipulation of these
difference in treatments for centric and acentric datasets
relationships usually allows the number of unknowns to
lies in the values assigned to the extra reflections in the
be reduced at the end of phase determination.
starting set. In the centric case, the possible values are 0
4. Calculate the E map—a Fourier summation using
and π ; in the acentric case, general reflections have four
E values as coefficients and phases determined in step 3.
possible values, ±π/2 and ±3π/2.
A recent modification to the multisolution approach
In the noncentrosymmetric case, the solution is more dif-
consists of phase refinement alternated with the impo-
ficult since the phase can take any value between 0 and
sition of constraints by peak picking in real space. This
2π. Hence, a different set of relationships was developed
approach, referred to as a dual-space method, differs from
for this case:
other multisolution methods in that the phases are ini-
ϕh = ϕk + ϕh−k
kr tially assigned values by computing structure factors for
a randomly positioned set of atoms. The occurrence of
kr |E k E h−k |(ϕk + ϕh−k ) two Fourier transformations per cycle results in an al-
ϕh =
kr |E k E h−k | gorithm that is computationally intensive. However, the
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
Crystallography 151
new approach also extends the limits of these programs or incorrect thermal parameters. Holes indicate electron
to much larger structures. It has proven to be capable of density in the model where none exists in the crystal (see
solving complete structures containing as many as 2000 Fig. 14).
independent non-hydrogen atoms (provided that accurate It is common to use difference maps to find hydrogen
diffraction data have been measured to a resolution of atoms once the positions of the heavier atoms have been
1.2 Å or better and some heavy atoms are present in the refined isotropically. In centrosymmetric structures, the
structure). phases are either positive or negative and are usually cor-
There has been progress in the experimental evalua- rect for all but the smallest structure factors by this stage.
tion of triplet-phase invariants. The phenomenon of si- Hydrogen atoms will have electron densities in the range
multaneous diffraction has long been considered a nui- 0.6 to 0.9 e Å−3 . Phase errors in the noncentrosymmet-
sance in single-crystal work. If three reflections lie on ric case cause difference maps to be less well defined. At
the sphere of reflection simultaneously, there is a power the end of refinement, a difference map should show no
transfer that tends to enhance such weak reflections at the significant features.
expense of strong ones. In 1977, it was discovered that Since it is much faster to calculate a Fourier synthe-
the shape of the simultaneous diffraction profile is sen- sis than to carry out least squares, refinement by differ-
sitive to the phase of the triplet that gave rise to it. The ence synthesis was popular before the advent of modern
sense of the asymmetry is opposite for positive and nega-
tive triplets. This effect has been observed with ordinary
mosaic crystals of relatively heavy scatterers (ZnWO4 )
using CuKα radiation from a fine-focus tube. This tech-
nique could be enhanced by the use of synchrotron radia-
tion, since it is desirable that the beam be monochromatic,
intense, and highly collimated. It is envisioned that exper-
imental phase determination could be used to establish a
starting set of 50 or so triplets and that the tangent for-
mula or some similar technique could then expand these
relationships.
A. Difference Fourier
A Fourier synthesis with coefficients F = |F0 | − |Fc | re-
flects differences between the crystal and the model. The FIGURE 14 Difference map. Comparison of observed, calcu-
difference map is routinely used during structure solution lated, and difference electron density for (a) an atom displaced
by 0.1 Å from its correct position, and (b) a temperature factor
to check the integrity of the model as it is being developed.
overestimated. (c) Difference density observed when an isotropic
Large peaks in the map (Z /3 to Z /2) correspond to atoms temperature factor is used in the model and the thermal vibra-
not yet included in the model. Smaller peaks may indi- tion of the crystal is actually anisotropic. Dashed contours are
cate a slightly misplaced atom or wrong scattering type negative.
P1: GJC/LOW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004G-160 June 8, 2001 19:47
152 Crystallography
Crystallography 153
atoms are uncorrelated. If systematic effects such as ab- The rigid-body models require 12 parameters for cen-
sorption and extinction have been ignored, if the qual- trosymmetric molecules and 21 for noncentrosymmetric.
ity of the crystals is poor, or if the weighting scheme is In the former, both the translational and librational axes
inappropriate, these parameters may have little physical pass through the center of gravity of the molecule. In the
meaning. noncentrosymmetric case, the libration axes do not nec-
Hirshfeld has proposed a convenient test for reason- essarily coincide, and an additional tenser, S, is required
ableness of a set of vibration parameters. Since bond- to account for the correlation between the librational and
stretching vibrations can be assumed to have much higher translational motions.
energy than other intramolecular modes such as torsion or Two indicators are used to test the goodness of fit be-
angle bending, bonds can be considered “rigid” to a first tween the refined values of the Ui j parameters and those
approximation. If two atoms A and B are bonded, it is calculated from the rigid-body model. The root-mean-
expected that the component of vibration along the bond square difference between observed and calculated values,
will be the same for both atoms. For carbon atoms, agree- given by (Ui j )2
1/2 , would be considered excellent in
ment within 0.001 Å2 is expected for low-temperature −4 2 −3 2
the
region of 2×10 Å but only fair at 2×10 Å . R =
structure determinations where bonding effects have been |Ui j |/ Ui j would be considered excellent at 0.04.
accounted for. In the spherical atom approximation, Obviously, the interatomic vectors in a crystal structure
the discrepancy is more likely to be of the order of can be interpreted as bond distances only if thermal vibra-
0.005 Å2 . In molecular crystals, vibrations of neighbor- tions are negligible. To correct for the effects of thermal
ing atoms are correlated to some degree. For relatively vibration, a model for that vibration is required. Often it is
rigid molecules, translational and vibrational oscillations better to reduce the effects of thermal motion by working
are the major contributors to the internal modes of vi- at the lowest available temperature than to attempt an a
bration. If the rigid body model is a good approxima- posteriori correction.
tion, it is possible to “correct” bond lengths for thermal
vibration.
The reason for the correction can be seen in Fig. 15. SEE ALSO THE FOLLOWING ARTICLES
If a molecule is undergoing librational motion, with a
root-mean-square amplitude of libration ω about an axis CRYSTAL GROWTH • CRYSTALLIZATION PROCESSES •
through 0, then an atom at a distance l from the center of FERROMAGNETISM • MICROSCOPY • PHASE TRANS-
FORMATIONS, CRYSTALLOGRAPHIC ASPECTS • X-RAY
libration sweeps out an arc. However, the X-ray analysis
will place the center of the atom at C, the centroid of the SMALL-ANGLE SCATTERING • X-RAY, SYNCHROTRON
AND NEUTRON DIFFRACTION
electron distribution. Thus, bond distances are systemati-
cally shortened by an amount PC = l − l cos ω ≈ lω2 /2.
In three dimensions, the corrections are additive. BIBLIOGRAPHY
Carter, C. W., Jr., and Sweet, R. M., Eds. (1997). “Methods in Enzymol-
ogy,” Vol. 277, “Macromolecular Crystallography.” Academic Press,
New York.
Dunitz, J. D. (1979). “X-Ray Analysis and Structure of Organic
Molecules,” Cornell University Press, Ithaca, NY.
Ewald, P. P., Ed. (1962). “Fifty Years of X-Ray Diffraction,” N.V.A.
Oosthoek’s Uitgeversmaatschappij, Utrecht.
Journal of Physics D, Applied Physics. (1991). Special issue on structural
aspects of crystal growth, 24(2), February 14.
Karle, J. (1994). J. Chem. Inf. Comput. Sci. 34, 381–390.
Ladd, M. C. F., and Palmer, R. A., Eds. (1980). “Theory and Practice of
Direct Methods in Crystallography,” Plenum, New York.
McPherson, A. (1982). “Preparation and Analysis of Protein Crystals,”
Wiley, New York.
Schmidt, P. W., Ed. (1983). “Proc. symp. small-angle scattering.” Trans.
Am. Crystallogr. Assoc. 19.
Schoenborn, B. P., Ed. (1985). “Proc. symp. structure determination with
synchrotron radiation.” Trans. Am. Crysrallogr. Assoc. 21.
FIGURE 15 Rigid-body motion. An atom at a distance l from the Wyckoff, H. W., Hirs, C. H. W., and Timasheff, S. N., Eds. (1985). “Meth-
center of libration sweeps out in an arc. Least-squares analysis ods in Enzymology,” Vol. 114, “Diffraction Methods for Biological
places the atom at C rather than P. Macromolecules.” Academic Press, New York.
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
Electrons in Solids
Richard H. Bube
Stanford University
I. A Bond Picture
II. Free-Electron Model of Metals
III. Energy Bands
IV. Optical Properties
V. Electrical Properties
VI. Galvanomagnetothermoelectric Effects
VII. Amorphous Semiconductors
VIII. Superconductors
IX. Conducting Polymers
X. Junctions
XI. Magnetic Properties
307
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
Mobility The proportionality constant between the drift thermal activation energy of E. We consider the behav-
velocity of a carrier in a conducting state in a solid and ior specifically in different kinds of crystalline materials as
the applied electric field. follows.
Pauli exclusion principle The recognition that no two In an ionic crystal an electron is transferred from the
electrons in the same system can have all of their quan- cation to the anion, and the electron is subsequently local-
tum numbers the same; that is, only one electron can ized around the anion. So that the electron can be removed
exist in a specific energy state. from the anion and can move freely in the ionic crystal, a
Periodic potential The kind of potential characteristic of certain basic energy E must be supplied. If we think of
a crystalline material with atoms located in a regular the electron on the anion as being characterized by a range
periodic lattice. of energies characteristic of these electronic states, then
Scattering Processes that interact with an electron mov- the next allowed state where conductivity is possible lies
ing in a solid under the influence of an electric field to at higher energies, separated from the allowed energies
counteract the effects of the field and return the carrier below by some finite energy gap E. If the electron is to
distribution toward the equilibrium condition. make the transition from the lower states where conduc-
Schroedinger equation The basic quantum-mechanical tion is not possible to the upper states where conduction is
equation used to describe the energy levels allowed for possible, energy must be supplied to overcome this energy
electrons in a particular potential energy. gap. This energy may be supplied by photons, in which
case the absorption spectrum consists of an absorption
edge at the value of the photon energy equal to this energy
SOLIDS may be conveniently separated into those that gap, which rises to large values of absorption when the
are crystalline and those that are amorphous. A crystalline photon energy exceeds this energy gap.
material consists of atoms arranged in a periodic array, Since the energy required to remove an electron from a
so that the whole crystal can be constructed by repeating covalent bond (in a covalent material) to allow it to be free
a basic unit called the unit cell. Usually this periodic ar- is in general less than the energy required to accomplish
ray is in three dimensions, but special significant cases this in an ionic material, the absorption edge for a covalent
are known of two-dimensional periodic arrays (as in the material occurs for lower values of photon energy. The
plane of an interface). Because the atoms have a periodic largest energy gap for any material known is that for ionic
arrangement, the electrical potential in the solid also is lithium fluoride, for which E = 11.5 eV, whereas in the
periodic. The allowed energies for electrons, and many of widely used semiconductor silicon, E = 1.1 eV.
the optical and electrical properties of electrons in crys- In a material with metallic binding, on the other hand,
talline solids, can be associated with the effects of this there is a constantly available “sea” of free electrons ca-
periodic potential. An amorphous material does not have pable of absorbing energy from photons to be excited to
a periodic potential or the long-range order typical of a higher energy states. The metal is not characterized, there-
crystalline material, but it does exhibit some of the same fore, by an absorption edge as the ionic and covalent insu-
basic electrical and optical properties as those of a crys- lators and semiconductors are, but rather has a continuous
talline material, insofar as these are based on the proper- absorption due to free electrons over a wide photon-energy
ties of chemical bonds rather than uniquely on long-range range.
order. This same kind of general picture can be connected
with the basic features of the electrical properties of ionic,
covalent, and metallic materials. In insulators and semi-
I. A BOND PICTURE conductors, electrons must be excited thermally from the
highest-lying filled states to the next higher-lying empty
Some of the most basic optical and electrical properties of allowed states for conductivity to be possible. An acti-
both crystalline and amorphous materials can be described vation energy for this process, with a value correspond-
qualitatively by considering the nature of the chemical ing to that of the optical absorption edge E described
bonds between atoms rather than the effects of a periodic above, can be measured approximately from the slope in a
potential. plot of the logarithm of the electrical conductivity versus
In both kinds of materials, there is a minimum en- reciprocal temperature. Therefore ionic materials have a
ergy E that must be supplied to electrons in the chem- large activation energy and a small electrical conductivity.
ical bonds of the material to give them enough energy Covalent materials, in general, have a smaller activation
to be able to move freely through the material. Thus energy and a larger electrical conductivity. Metals, having
there is a minimum photon energy equal to E for a ready supply of free electrons, have no activation energy
optical absorption and an electrical conductivity with a and the largest value of electrical conductivity.
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
II. FREE-ELECTRON MODEL OF METALS electron can occupy a particular energy state, we are led
to describe the occupation of a state with energy E by the
It is often found that the outermost valence electrons of Fermi-Dirac distribution function
atoms making up a solid can be treated as if they were
f (E, T ) = 1/[exp{(E − E F /kT } + 1], (3)
essentially free electrons. This is particularly true in the
class of materials known as metals. where E F is a particular energy called the Fermi energy,
These valence electrons in metals are of course not com- k is Boltzmann’s constant, and T is the temperature. At
pletely free, since they still move in the presence of the 0 K the allowed energy states are occupied for all energies
positively charged ions that are located on the crystal lat- up to the Fermi energy,
so that if n is the total number of
tice sites of the metal. But to an often striking degree it is free electrons, n = N (E) dE from E = 0 to E F , and as
sufficient to consider the valence electrons to be moving a result,
in a potential energy of zero, partially shielded from the
positively charged ions by the other electrons, and with E F = (h✘ 2 /2m)(3π 2 n)2/3 . (4)
the repulsive interactions between electrons canceled out From the expression for f (E, T ), it can be seen that the
by the residual attractive effects of the positively charged Fermi energy is the energy for which f (E, T ) = 12 . The
ions. When we consider such electrons from a quantum- Fermi energy can also be related to thermodynamic quanti-
mechanical view, it is appropriate to consider them as con- ties by recognizing that it is the electrochemical potential.
tained in a box the size of the metal with zero potential Using f (E, T ) makes it possible to calculate the density
energy inside the box and infinite potential energy outside of occupied electronic states n(E) as a function of energy
the box. To escape from a real metal, an electron in the by multiplying Eqs. (2) and (3), n(E) = f (E , T )N (E),
metal needs to acquire sufficient energy to overcome the with the result shown in Fig. 1.
finite potential barrier at the surface, known as the work This simple model is able to give qualitative and even
function of the metal, which is much larger than the aver- semiquantitative agreement with a number of phenom-
age electron energy and therefore approximately infinite. ena related to electrons in metals, such as photoemission
From calculations based on the Schroedinger equation (emission of electrons from a metal as the result of absorp-
for free particles in a box, it is possible to derive what tion of photons), thermionic emission (emission of elec-
energy states are allowed. For a cube with side L, the trons from a metal because of high temperatures), field
allowed energies are given by emission (emission of electrons from a metal because of
E(n x , n y , n z ) = (h✘ 2 π 2 /2mL 2 ) n 2x + n 2y + n 2z , (1) high electric fields), and heat capacity (a measure of how
rapidly the energy of a collection of electrons changes with
where n x , n y , and n z are positive integers equal to one temperature). If the concept of electron spin is included,
or greater. This quantization of the allowed energy levels this same simple model can be used to describe the mag-
arises from the necessity to have solutions with wave- netic properties of free electrons (free-electron or Pauli
length λ such that nλ/2 = L in each of the three dimen- paramagnetism). A typical energy model for thermionic
sions, to satisfy the geometric boundary conditions that emission, for example, is given in Fig. 2.
the solution must go to zero at the faces of the cube. If we
examine the magnitude of E(1, 1, 1) we see that it is of the
order of 10−14 /L 2 eV/cm−2 , so that for a cube of macro-
scopic dimensions (e.g., L = 1 cm), E(1, 1, 1) can be set
equal to zero. The same is true of the energy separation
between different energy levels, so that the distribution
of allowed energy levels can be treated effectively as a
pseudocontinuous distribution.
The variation of the density of these states N (E) per unit
volume [N (E)dE is the number of states per unit volume
between E and E + dE] can be calculated from Eq. (1)
with the result that
N (E) = (1/2π 2 )(2m/ h✘ 2 )3/2 E 1/2 , (2)
which includes a factor of 2 to account for the two possible
FIGURE 1 The total density of allowed states per unit volume
spin states (intrinsic angular momentum) of an electron. N(E ), the Fermi distribution function f (E ), and the density of oc-
When these results are coupled with the postulate con- cupied states per unit volume n(E ) = f (E )N(E ), at a finite tem-
sistent with the Pauli exclusion principle that only one perature in a metal with Fermi energy EF .
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
FIGURE 3 Four ways of describing band properties: (a) flat band E versus x, (b) electron wave dispersion relationship
E versus k, (c) density of states N(E ) versus E, and (d) equal-energy surfaces in k-space.
The basic electrial differences between solids—metals, Finally, if the gap between the top of the filled band
semiconductors, and insulators—are readily understand- and the higher-lying empty band is small, appreciable
able in terms of the band picture. Possibilities are illus- excitation of electrons into the conduction band may occur
trated in Fig. 5. If the highest-energy-allowed band occu- at normal temperatures, and intermediate conductivity is
pied by electrons is only partially occupied, then there are observed typical of a semiconductor.
available allowed states at very small energies above oc- If electrons exhibit free-like behavior, then a plot of
cupied states, and the drift of electrons in an electric field equal-energy surface in k-space, as shown in Fig. 3d, will
can be readily achieved. A partially filled valence band have spherical symmetry, since the magnitude of E does
therefore corresponds to a metal. not depend on the direction of k. Thus the equal-energy
Even if the highest-energy-allowed band occupied by surfaces in many materials are frequently spherical with
electrons is totally filled, metallic properties can be found center at either the minimum or maximum value of k for
if the next higher-lying band overlaps the filled band (pos- the energy band. A notable exception is the widely used
sible in a three-dimensional crystal) to again produce a semiconductor silicon, for which the minimum of the con-
continuum of allowed states separated by only small ener- duction band does not lie at k = 0 and for which the equal-
gies from occupied states. Overlapping bands also produce energy surfaces are ellipsoidal rather than spherical.
metallic behavior.
If the highest-energy-allowed band occupied by elec- B. Interpretation of E versus k Curves
trons is totally filled, and the next higher-lying band lies
an appreciable energy above the top of the filled band, From a knowledge of the dependence of E on k in an en-
insulator-like properties are observed. Only electrons in ergy band, two significant properties of electronic energy
the upper empty band can contribute to electrical conduc- states can be deduced:
tivity; their density is very small, since thermal excita-
tion across a large energy gap is required to raise them 1. The velocity associated with an electronic state is the
from the filled valence band to the empty conduction group velocity, given for the one-dimensional case by
band. vg = h✘ −1 ∂E/∂k. (5)
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
FIGURE 4 (a) The free-electron model of electrons as perturbed by a small periodic potential (weak-binding ap-
proximation) causing Bragg reflection conditions for k = nπ/a in the extended zone representation. (b) The reduced
representation in the first Brillouin zone achieved by translating band segments in (a) by n2π/a to bring them into the
basic zone between k = −π/a and k = + π/a. A wavenumber k
= (k + n2π/a) corresponds to the same solutions as
does k. In the reduced representation, separate bands are designated by a band index I .
For free electrons this means that vg = h✘ k/m, which illus- band in Fig. 3a, for example, we see that vg = 0 at the top
trates the fact that the momentum mvg = h✘ k, in agreement and bottom of the band, is positive for +k and negative for
with the de Broglie relationship that the momentum −k, and has a maximum some-where near the middle of
p = h/λ. If this result is applied to the upper conduction the band. We see immediately why the application of an
electric field to a totally filled band produces no electric
current; the number of occupied states with positive vg is
equal to that with negative vg , and the application of an
electric field cannot change this equality.
2. Although electrons near the top or bottom of a band
may be said to behave in a free-like manner, it is clear that
they are moving in a periodic potential and are not re-
ally free. To describe them in a free-like model, we
must usually define an effective mass m ∗ that is different
from the free mass m. From the result that the force is
the time derivative of the momentum, we obtain F = h✘
∂k/∂t. If we also desire to write F = m ∗ a, after the
classical form, then it follows that since a = ∂vg /∂t =
h✘ −1 ∂/∂t(∂E/∂k) = h✘ −1 ∂/∂k[(∂k/∂t)(∂E/∂k)],
of the lower band in Fig. 3b the effective mass for electrons IV. OPTICAL PROPERTIES
is negative. It is in this negative sign that we see most
strongly the effects of the periodic potential. Physically, Optical properties of solids include a wide range of phe-
a negative effective mass corresponds to an acceleration nomena involving either the interaction of light with
that is in the opposite direction to the applied force; it crystals or the generation of light by crystals under suitable
corresponds to a condition like that encountered in Bragg conditions.
reflection. Typical variations of vg and m ∗ are shown in The velocity of light v in a material is reduced compared
Fig. 6. Effective masses in most solids vary between about with its value in vacuum c by a factor known as the index
10−2 m(m ∗e = 0.014 m in InSb, one of the smallest values) of refraction r = c/v. In general, the value of r for near-
to values of the order of m (for free electrons in the alkali visible light can be expressed by
metals or for holes in many semiconductors).
For the filled valence band of a semiconductor to con- r 2 = εr + c2 α 2 /4ω2 , (7)
tribute to the electrical conductivity, there must be some where εr is the relative dielectric constant associated with
empty states corresponding, for example, to electrons that polarization of electrons in the material, α is the optical
have been excited up to the conduction band. Instead of absorption constant (a sample with thickness of 1/α re-
describing the electrical properties of the valence band in duces the light intensity passing through it by a factor of
terms of all the electronic states with positive and nega- e), and ω is the light frequency.
tive velocities and negative and positive effective masses
(depending on which state in the band is being examined),
A. Optical Reflection
it is conceptually much simpler to describe the conductiv-
ity in a partially empty band in terms of the empty states Reflection occurs for any wavelike phenomenon when the
themselves, called “holes.” If we treat these holes as if they wave passes from a material with one set of properties to
had a positive charge and a positive effective mass, a sym- a material with another set of properties. Light is reflected
metric treatment of electrical properties involving them on passing from a material with value εr1 and α1 to a
at the top of the valence band can be given, as is com- second mateial with value εr2 and α2 . For simple reflection
monly given for electrons at the bottom of the conduction at an air–material interface (εr1 = 1 and α1 = 0 for air)
band. with a material without absorption (α2 = 0), the reflection
coefficient R is given by
R = (r2 − 1)2 (r2 + 1)2 . (8)
If the material does have a finite α, the result is that the
higher the absorption index of the material, the more light
it reflects, until in the extreme case R approaches unity:
R = (r2 − 1)2 + 22 (r2 + 1)2 + 22 , (9)
where 2 is the absorption index expressed through the
complex index of refraction r ∗ = r + i. The absorp-
tion index is related to the absorption coefficient by the
relationship
α = 2ω/c. (10)
This effect accounts for the very high reflectivity of metals
and other materials with high values of α due to specific
absorption processes.
One application of considerable practical importance
that involves the reflectivity properties of thin films of one
material on another is the antireflecting coating. As a way
to reduce the reflection at an air–material A interface, a
second material B with suitable index of refraction (ideally
the square root of the index of refraction of A) is deposited
in thin-film form on A, with a thickness such that light
FIGURE 6 Typical variations of group velocity vg and effective reflected from the air–B interface interferes destructively
mass m∗ as a function of k. with light reflected from the B–A interface. This occurs
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
when the film B thickness is an odd multiple of quarter and variation with energy of the absorption constant de-
wavelengths of the light in the film, λB = λair /rB . pends on whether the transition involves a photon only
(direct transition) or whether it involves both a photon
and a phonon (indirect transition). The absorption con-
B. Absorption Processes
stant decreases by many orders of magnitude as the pho-
Optical absorption in solids may be categorized under six ton energy drops below the band-gap energy. Direct and
principal headings. One of these is the absorption of pho- indirect optical transitions are illustrated in Fig. 8.
tons in the excitation of optical mode vibrations of the If a direct optical transition is involved, a plot of (αh✘ ω)2
crystal lattice, known as reststrahlen absorption, which versus h✘ ω, where ω is the light frequency, yields a straight
usually occurs in the infrared between 10 and 200 µm. line with the energy intercept of the direct band gap. If an
This is the major absorption process that does not involve indirect optical transition is involved, a plot of α 1/2 ver-
electronic transitions. Other kinds of optical absorption sus h✘ ω gives two straight-line segments with intercepts
associated with the excitation of lattice vibrations may oc- of h✘ ω1 and h✘ ω2 , as shown in Fig. 8b. The upper segment
cur when an impurity is present in the material and local corresponds to an indirect transition with emission of a
vibrational modes associated with the impurity become phonon, whereas the lower segment corresponds to an in-
possible. direct transition with absorption of a phonon. The indirect
Figure 7 illustrates the other five major types of elec- band gap is given by (h✘ ω1 + h✘ ω2 )/2, and the energy of the
tronic absorption processes: participating phonon is given by (h✘ ω2 − h✘ ω1 )/2.
3. Excitation of a bound electron–hole pair, known as
1. Excitation of an electron from the valence band to an exciton, requiring less energy than that needed to pro-
higher-lying conduction bands, characterized by high- duce a free electron–hole pair by excitation across the band
absorption processes, showing only relatively small vari- gap. The exciton can be thought of as a hydrogenic system,
ations of absorption coefficient with photon energy de- capable of moving and transporting energy through the
pending on the density-of-states distributions in the bands crystal without transporting net charge. The electron and
involved. The optical absorption constant is usually in the hole making up an exciton may ultimately be thermally
range of 105 to 106 cm−1 . dissociated into free carriers or may recombine with the
2. Excitation of an electron from the valence band to emission of photons or phonons.
the lowest-lying conduction band with the minimum re- 4. Excitation of an electron from an imperfection level
quired energy of the forbidden band gap. The magnitude to the conduction band, as shown in Fig. 7, or from the
FIGURE 7 Characteristic types of optical transitions shown both for the flat-band model and for the E versus k plot.
(1) Excitation from the valence band to higher-lying conduction bands, (2) excitation across the band gap, (3) exciton
formation, (4) excitation from imperfections, and (5) free-carrier excitation.
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
FIGURE 8 (a) Band-to-band direct transitions, and (b) band-to-band indirect transitions. Inserts show the variation of
absorption coefficient with photon energy expected for each type of transition, and a typical phonon dispersion curve
for the indirect material.
valence band to an imperfection. If imperfections are It involves the absorption of photons and the absorption
present in the crystal, they create energy levels that lie in or emission of phonons, since both energy and k must
the forbidden gap. Therefore, at energies less than the band be changed in the transition, and is thus an indirect op-
gap it is still possible to excite electrons to the conduction tical transition. The specific quantitative description for
band from imperfection levels occupied by electrons, or the effect depends on the magnitude of both the electri-
to excite electrons from the valence band to unoccupied cal conductivity and the frequency of the light; for optical
imperfection levels, each process giving rise to optical ab- frequency excitation of free carriers in a nondegenerate
sorption. This absorption in turn ceases when the photon semiconductor α ∝ ω−n where n is between 2 and 3. The
energy is less than the energy required to make a transition results of a classical calculation for the dependence of ab-
from the imperfection level to one of the bands. The ab- sorption coefficient on frequency for materials with dif-
sorption coefficient may be expressed as α = So NI (where ferent low-frequency conductivity are given in Fig. 9.
NI is the density of suitably occupied or unoccupied imper- Another kind of optical absorption due to free electrons
fections, and So is the optical cross section, of the order of (or holes) corresponds to the motion of free carriers act-
10−16 cm2 ). For very high imperfection densities, the cor- ing collectively as a kind of “electron gas,” which has a
responding absorption constant may have values as high characteristic frequency ω p = (nq 2 /ε0 m ∗ )1/2 , where n is
as 103 cm−1 , but in general it is considerably smaller. The the density of the carriers, q is the electronic charge, ε0
absorption spectrum for this type of absorption consists of is the permittivity of free space in SI units, and m ∗ is the
a threshold corresponding to the ionization energy of the effective mass of the carriers. When photons with this fre-
imperfection and a relatively slow variation of absorption quency are incident on the material, resonant absorption of
for higher photon energies. energy occurs. The effect is known as plasma resonance.
Another kind of imperfection absorption may occur that The plasma frequency occurs in the ultraviolet for metals
is not illustrated in Fig. 7. It is most commonly encoun- and in the infrared for most semiconductors.
tered when an impurity with incomplete inner-shell atomic
levels is present in a material. Absorption between two
C. Photoelectronic Effects
such atomic levels yields an absorption spectrum with a
Gaussian shape, peaked at the energy separation between When light is absorbed by a material so as to raise elec-
the two levels. trons to higher-energy states, several possibilities occur.
5. Excitation of a free electron (or free hole) to a higher If the excited electrons are in the conduction band, then
energy state within the same band or to higher bands. This the conductivity of the material is increased as a result
process can occur over a wide range of photon energies. of the absorption of light, and the effect is known as
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
luminescence [the conductivity analogue is electron-beam 2. Photoconductivity excitation spectra, plotted as pho-
induced current (EBIC)], excitation by friction produces toconductivity σ (σlight − σdark ) versus h✘ ωinc
triboluminescence, and luminescence may also be excited 3. Luminescence excitation spectra, plotted as lumines-
by exposure to X-rays or high-energy particles. A material cence emission intensity at a particular photon energy as
in thermal equilibrium emits radiation due to the recombi- a function of h✘ ωinc
nation of thermally excited electrons and holes, which is 4. Luminescence emission spectra, plotted as lumines-
commonly known as blackbody radiation and is described cence emission intensity as a function of h✘ ωemit of the
by Planck’s radiation law. Luminescence is distinguished emitted light
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
V. ELECTRICAL PROPERTIES
A. Ohm’s Law
The basic equation describing electrical properties in el-
ementary discussions is Ohm’s law, I = V /R, where I is
the electrical current measured in a circuit with resistance
R when a voltage difference V is applied. This relation
can be rewritten to explicitly indicate the role of electrical
conductivity, σ :
J = σ E, (11)
where J is the electrical current density (current per unit
area), and E is the electric field. Alternatively, we can
write J = nqvd , where n is the density of free carriers, q
is the charge per carrier, and vd is the drift velocity of the
electron caused by the electric field. We define a quantity
called the carrier mobility µ so that
vd = µE; (12)
then the conductivity can be expressed as
σ = nqµ. (13)
If both electrons and holes contribute to the conductivity,
then the total conductivity can be expressed as
σ = q(nµn + pµ p ), (14)
where p is the density of holes and µn and µ p are, respec-
tively, the mobilities of electrons and holes.
For Ohm’s law to hold, it is necessary that neither n nor
µ be a function of electric field. This requirement breaks
down for high values of electric field for which n may
be increased by the electric field through mechanisms of
impact ionization, field emission, or contact injection, or
for which µ may become a function of electric field if
the mobility is a function of carrier energy that in turn is
FIGURE 10 Optical spectra for an illustrative situation. (a) Cho-
increased by high electric fields.
sen band structure involving band-to-band transitions and an im-
purity with atomic ground state and excited state, in which it is The temperature dependence of electrical conductiv-
assumed that only recombinations with energy EG or E
are ra- ity is determined by the temperature dependence of the
diative; (b) optical absorption spectrum; (c) photoconductivity ex- free-carrier density and the temperature dependence of
citation spectrum; (d) extrinsic luminescence excitation spectrum, the free-carrier mobility. In a metal, the free-carrier den-
where L ext is the emission intensity of the extrinsic emission band;
sity is independent of temperature, and therefore the tem-
(e) intrinsic luminescence excitation spectrum, where L int is the
emission intensity of the intrinsic emission band; and (f) lumi- perature dependence of the conductivity for a metal arises
nescence emission spectrum showing both extrinsic and intrinsic totally from the temperature dependence of the mobility.
emission bands. The free-carrier density in a semiconductor or an insulator
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
is thermally activated over a wide temperature range and Actually, in a metal this is true only at sufficiently high
therefore increases exponentially with temperature over temperatures where scattering by acoustic lattice waves is
this range. The temperature dependence of the mobility elastic; in other words, the change in energy on scattering
depends on the specific scattering process that limits the is small compared with the thermal energy of an elec-
drift of carriers in an electric field. tron, kT . At low temperatures (temperatures less than the
Debye temperature θD , which is defined approximately
by kθD = h✘ ωmax , where h✘ ωmax is the maximum vibra-
B. Scattering and Mobility
tional phonon energy) this is no longer true in a metal,
The effect of an electric field on a distribution of free (or and acoustic lattice wave scattering in a metal becomes
quasi-free, to be more exact) carriers in a material is to inelastic; an appropriate calculation shows that over this
shift the distribution so that more carriers have velocity in range µ ∝ T −5 .
one direction than in the opposite. The magnitude of this An illustration of the temperature dependence of elec-
shift is determined by the interaction between the effects trical resistivity ρ = 1/σ as a function of temperature for
of the electric field and the effects of a variety of scattering a metal is given in Fig. 11, with data for silver with
processes that act to return the distribution to its equilib- θD = 226 K. Above this temperature the resistivity varies
rium condition. If a force qE acts on an electron for an linearly with T , whereas below this temperature ρ changes
average time τ , then the net shift in the distribution will be to a T 5 dependence, until at very low temperatures the re-
given by qEτ = h✘ k [see discussion preceding Eq. (6)], sistivity is limited by impurity scattering. Since the total
so that k = qEτ/ h✘ . We may consider τ to be the aver- resistivity for a metal is equal to the sum of the resistivity
age time between scattering events, which acts in such a due to lattice scattering and the resistivity due to impurities
way that if n 0 carriers are unscattered at t = 0, then the
density of unscattered carriers n(t) at time t > 0 is given
by dn(t)/dt = −n/τ , or n(t) = n 0 exp(−t/τ ).
Using this approach, we may rewrite the above equation
for k, as qEτ = m ∗ vd , from which it follows that
µ = (q/m ∗ )τ. (15)
In general the scattering relaxation time is a function of
energy τ (E), and the quantity that enters Eq. (15) as τ is
a suitable average of τ (E) over electron energies.
The specific form of τ (E) depends on the scatter-
ing mechanism. These mechanisms include scattering by
acoustic lattice waves, optical lattice waves, charged im-
perfections, neutral imperfections, piezoelectric effects,
dislocations, grain boundaries, surfaces, and inhomo-
geneities. Each scattering mechanism is characterized by
its own temperature dependence of mobility. As examples
of these processes, we consider scattering by acoustic lat-
tice waves and charged imperfections.
Acoustic lattice scattering corresponds to scattering of
free carriers by interaction with lattice atoms as they move
due to thermal energy. The probability for scattering is pro-
portional to the average energy in the lattice waves, that
is, to kT . The mean free path for scattering (the average
distance traveled between scattering events, equal to the
product of the thermal velocity of the carrier νth and τ )
by acoustic lattice waves is therefore proportional to T −1 .
The relaxation time τ (E) ∝ (T νth )−1 . Since in a semicon-
ductor νth = (2kT /m ∗ )1/2 , it follows that τ ∝ µ ∝ T −3/2 .
In a metal, on the other hand, scattering events are ex-
perienced only by electrons near the Fermi energy, and
so the average value of τ (E) = τ (E F ); as a result, τ ∝ FIGURE 11 Typical temperature dependence of electrical resis-
µ ∝ T −1 . tivity for silver with a Debye temperature of 226 K.
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
C. Imperfections in Semiconductors
In most practical semiconductors, the electrical conduc-
tivity is controlled not by thermal excitation across the
band gap of the material, but by thermal excitation from
localized imperfections. The electrical behavior of imper-
fections depends on their location in the crystal and their
effective number of valence electrons.
The simplest kind of imperfection is one that differs by
one in valence from the atom for which it substitutes in
FIGURE 12 Scattering of an electron by a positively charged
impurity by Coulomb attraction, and scattering of a hole by a the crystal lattice. Such imperfections give rise to localized
positively charged impurity by Coulomb repulsion. The hyperbolic energy levels in the forbidden gap of the semiconductor
paths of electrons and holes are mirror images of each other. lying close to either the conduction or the valence band
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
E. Electrical Conductivity in Semiconductors be expected from our previous discussion of the Fermi
distribution, except for the additional degeneracy factor
In an intrinsic semiconductor (a semiconductor with prop-
of ( 12 ) that appears in Eqs. (24). This factor arises because
erties controlled by excitation and recombination across
donor and acceptor states can accommodate an electron
the band gap rather than through imperfections), the tem-
with either of two possible spin orientations; taking this
perature dependence of the Fermi level (and hence the
into account leads to the insertion of the factors of ( 12 )
temperature dependence of the free-carrier densities) can
shown in Eqs. (24).
be calculated simply by equating Eqs. (19) and (20), since
The general statement of Eq. (23) can be simplified in
in an intrinsic semiconductor n = p = n i . The result is
several special cases with physical significance. For an
that
intrinsic material, n A = (ND − n D ) = 0 and n = p, as we
(E c − E F ) = E G /2 + (3kT /4) ln m ∗e m ∗h . (22) have seen. If only donors are present, then n A = 0 and
p can be neglected in the extrinsic conductivity range,
In an extrinsic semiconductor (a semiconductor with so that n = (ND − n D ). Similarly, if only acceptors are
properties controlled by excitation and recombination present, then (ND − n D ) = 0 and n can be neglected in
through imperfections), the situation may be a little more the extrinsic conductivity range, so that p = n A . If both
complicated, but a simple rule may be stated that covers donors and acceptors are present in approximately equal
all cases. In the intrinsic case above, the requirement that densities, so that almost complete compensation occurs,
n = p may be recognized as a condition of charge neu- n A = (ND − n D ).
trality. This condition of charge neutrality may be used in Analytical results can be obtained for simple cases.
general to determine the temperature dependence of the As an example, consider the case of donors only. Two
Fermi level (and hence of the free-carrier density and the ranges may be defined. In the high-temperature range
conductivity). where all the donors are ionized, n = ND and (E c − E F ) =
Consider the case shown in Fig. 14 with one kind of kT ln (Nc /ND ) from Eq. (19). In the low-temperature
donor and one kind of acceptor imperfection present in range, the donors are only partially ionized and
the material. The general charge-neutrality condition that
governs this case is given by (E c − E F ) = (E c − E D )/2 + (kT /2) ln (2Nc /N D) (25a)
with
n + n A = (ND − n D ) + p, (23)
n = (Nc ND /2)1/2 exp[−(E c − E D )/2kT ]. (25b)
where the negative species on the left are the free-electron
−3/4
density and the density of ionized (electron-occupied) ac- Therefore a plot of ln (nT ) versus 1/T yields a
ceptors, and the positive species on the right are the density straight line at low temperatures with a slope proportional
of ionized (electron-unoccupied) donors and the free-hole to one-half the donor ionization energy. These results are
density. Each of these terms can be written in terms of summarized in Fig. 15.
the location of the Fermi level, and hence the Fermi-level The case of one kind of donor with density ND and
position satisfying the equation can be readily determined one kind of acceptor with density NA can also be solved
by a computer even if there are many different donors and analytically. Consider the case first where ND > NA . At
acceptors present. low temperatures for which n NA or (ND − NA ), a plot
The information needed to accomplish this lies in the of ln (nT −3/2 ) versus 1/T has an activation energy
appropriate expressions for electron-occupied donors and of (E c − E D ). For intermediate temperatures for which
hole-occupied acceptors. These may be summarized as NA n (ND − NA ), a plot of ln (nT −3/4 ) versus 1/T
follows: has an activation energy of (E c − E D )/2. Finally, all of the
−1 uncompensated donors are ionized at high temperatures
n D = ND 12 exp[(E D − E F )/kT ] + 1 (24a) and n = (ND − NA ). If ND = NA , we have exact compen-
and sation, and the Fermi level must lie halfway between the
1 −1 donor and acceptor levels to achieve charge neutrality; the
(NA − n A ) = NA 2
exp[(E F − E A )/kT + 1 . observed activation energy is therefore (E D + E A )/2.
(24b)
In these equations ND is the total density of donors with an VI. GALVANOMAGNETOTHERMOELECTRIC
ionization energy of (E c − E D ), and NA is the total den- EFFECTS
sity of acceptors with an ionization energy of (E A − E v ).
Notice that the expressions are similar only if we compare When any two of the following interact—an electric
the expression for electron-occupied donors with that for field, a magnetic field, and a thermal gradient—a num-
hole-occupied acceptors. Equations (24) are what would ber of electrical effects result. Some of these are small
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
B. Hall Effect
In the Hall effect, an electric field and a magnetic field ap-
plied at right angles to a material with either free electrons
or free holes produce an electric field in the third orthog-
onal direction, in order to produce zero current in that di-
rection. A magnetic field Bz in the presence of an electric
field E x producing a current density Jx causes electrons
to acquire velocity in the y-direction. Since there can be
FIGURE 15 Electron density and conductivity as a function of no net current flow in the y-direction, however, it follows
temperature for a semiconductor with donors only with indicated
activation energies. The slopes for the ln σ versus 1/T plot differ
that an electric field E y will form such that qE y = qνx Bz .
slightly from those of the ln (nT −3/4 ) versus 1/T plot because of Then the induced Hall field E y can be measured and the
the temperature dependence of Nc and µn in the extrinsic range. In carrier density deduced from it:
the intrinsic range, none of the slopes seen will be exactly −EG /2k
because in this range a plot should be made of ln (nT −3/2 ) versus E y = ±(1/nq)Jx Bz . (27)
1/T.
The polarity of E y indicated by the choice of sign tells
whether the carriers are electrons (negative sign) or holes
second-order effects (Nernst, Ettingshausen, and Righi- (positive sign). Two commonly defined parameters are
Leduc effects). Here, we consider three of the major ef- the Hall coefficient RH = ±1/nq and the Hall mobility
fects: the Hall effect, magnetoresistance, and thermoelec- µH = σ RH = ±µ.
tric power. The Hall mobility, always defined as µH = σ RH , is a
second kind of mobility, in addition to the conductivity
mobility defined in Eq. (15). A third kind of mobility often
A. Cyclotron Resonance Frequency referred to is the drift mobility µd = d/Et, obtained from
Classically, the effect of a magnetic field on free electrons a direct measurement of the time t required for carriers to
can be readily described. The force exerted on a moving travel a distance d in the material under an electric field
electric charge q with velocity v by a magnetic field B is E. If there are localized trapping states in the material
given by qv × B. This force causes an electron to move associated with imperfections, a carrier injected at x = 0
in a circular orbit in the plane orthogonal to B, that is, may spend a major portion of its time in the material in
in the x–y plane for Bz . The radius of the circular orbit a trapped state rather than in a free state, and therefore
can be determined by equating the magnetic force to the may spend much longer in reaching the detection point at
centrifugal force of the circular motion: m ∗e ν 2 /r = qνBz . x = d than simply d/(µcon E). We may express the drift
The angular frequency ωc = ν/r is called the cyclotron mobility in terms of the conductivity mobility, the density
frequency and is given by of free carriers n, and the density of trapped carriers n t by
ωc = qBz m ∗e . (26) µd = µcon [n/(n + n t )]. (28)
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
If both electrons and holes are present at the same time, metal than in a semiconductor, being of the order of micro-
since the value of E y associated with the electrons is of volts per degree in the former and of a millivolt per degree
opposite sign to that associated with the holes, the pos- in the latter. Typical expressions for α are as follows:
sibility exists for exact cancellation. By considering the
Metal α = π 2 k 2 T qE F , (30a)
total current due to both electrons and holes, we find that
in general (for small values of Bz ), n-type Semiconductor α
E y = q pµ2p − nµ2n α 2 Jx Bz , (29a) = −(k/q)[A + (E c − E F )/kT ], (30b)
RH = E y /Jx Bz and
= pµ2p − nµ2n q( pµp + nµn )2 , (29b) p-type Semiconductor α
and = +(k/q)[A + (E F − E v )/kT ]. (30c)
µH = σ RH Here A is a constant depending on the specific scattering
mechanism for the free carriers involved; for acoustic lat-
= pµ2p − nµ2n ( pµp + nµn ). (29c)
tice scattering A = 2, and for charged-impurity scattering
The Hall coefficient and the Hall mobility both become A = 4. Since the thermoelectric power in a semiconductor
zero if pµ2p = nµ2n . provides knowledge of the location of the Fermi level at a
given temperature, it can also be used to provide direct in-
formation about the magnitude of the free-carrier density
C. Magnetoresistance in the semiconductor.
Magnetoresistance is the change in resistance correspond-
ing to an applied electric field because of the effects of a VII. AMORPHOUS SEMICONDUCTORS
simultaneously applied magnetic field. In the Hall-effect
geometry, to be specific, it means a change in electrical Amorphous semiconductors are a class of semiconducting
resistance corresponding to an electric field E x because of materials that do not show the long-range order typical
Bz . To bring about Jy = 0 by the development of a specific of crystalline materials with a periodic potential that are
value of E y , as described above for the Hall effect, for all discussed primarily in this article. Amorphous materials
the carriers, essentially requires that all the carriers have are generally made by one of three methods: (1) deposition
identical properties. When all the carriers do not have iden- from the vapor phase, (2) cooling from a liquid melt (called
tical properties (e.g., some are electrons and some holes, “glasses”), and (3) transformation of a crystalline solid by
the scattering relaxation time varies with energy and carri- particle bombardment, oxidation, and so forth.
ers have a range of energies, or the energy-band structure There are three general categories of amorphous ma-
is such that the effective mass is not a scalar), the devel- terials: (1) covalent solids such as tetrahedral films of
opment of E y cannot remove all of the y-component of Group IV elements, or III–V materials; tetrahedral glasses
velocity for all the carriers. Some carriers travel a rela- formed from II–IV–V ternary materials (e.g., CdGeAs2 );
tively greater distance than others between end contacts, or chalcogenide glasses formed from the Group VI ele-
and an increase in resistance results. Although the effect ments, or IV–V–VI binary and ternary materials; (2) ox-
is small, it has been of research interest because of the de- ide glasses such as V2 O5 –P2 O5 that have ionic bonds
pendence of magnetoresistance effects on various kinds and show electrical conductivity between different valence
of energy-band structures. states of the transition metal ion; and (3) dielectric films
such as SiOx and Al2 O3 .
Because of the lack of long-range order, amorphous
D. Thermoelectric (Seebeck) Effect
semiconductors do not show the same kind of density-of-
The application of a temperature gradient to a material states band gap as described above for crystalline semicon-
causes the average energy of free carriers at the hot end ductors. A schematic density-of-states band diagram for
to increase, which thus establishes a concentration gra- an amorphous semiconductor is given in Fig. 16. The den-
dient along the material. Diffusion associated with this sity of states continues through the points E v and E c that
concentration gradient is counteracted by the buildup of would mark the band gap in a typical crystalline material,
an electric field due to the displaced charge, to satisfy the but the mode of transport changes from that characteris-
condition that the total current be zero. The magnitude tic of extended states (somewhat like that in the allowed
of the voltage per degree difference is called α, the ther- bands of a crystalline material) for electrons above E c
moelectric power in V/K. The effect is much smaller in a (holes below E v ) to that characteristic of localized states
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
VIII. SUPERCONDUCTORS
Since the capacitance C = εr ε0 A/wd where A is the area for electrons and valence band discontinuity E v for
of the junction, holes). If many quantum wells are grown on top of one
another, and the barriers are made so thin that tunneling
(C/A)−2 = 2(qφD − qφapp ) εr ε0 q 2 ND+ , (32)
between them is significant, the result is a superlattice, a
where φapp is the magnitude of the voltage applied to concept first proposed by Esaki and Tsu in 1969. Proper-
the junction from an external source. A plot of (C/A)−2 ties of superlattices can be varied, not only by the choice of
versus φapp according to Eq. (32) yields a straight line with the materials used to make up the heterojunctions, but also
intercept on the voltage axis of φD and with slope inversely by the spacing of the layers and their thickness. Thus, it
proportional to ND+ . becomes possible to fine-tune a whole range of properties
Figure 18 differs from the other three cases in that for through choice of the appropriate structural parameters.
an ohmic contact between a metal and a semiconductor,
there is an excess of free carriers in the semiconductor
such that an accumulation layer exists. The width of this XI. MAGNETIC PROPERTIES
accumulation layer is determined by the balance between
drift and diffusion currents occurring in this region. In various places in the previous paragraphs we have men-
The dependence of current on applied voltage for the tioned the fact that electrons have an intrinsic angular mo-
junctions of Figs. 17, 19, and 20 can be described by a mentum known as spin. In our discussion thus far this
relationship of general form property has played little role except in determining ap-
propriate statistics. We have also discussed the interaction
J = J0 [exp(qφapp /AkT ) − 1]. (33)
between moving free electrons and a magnetic field in such
This equation describes a current that increases exponen- effects as cyclotron resonance, the Hall effect, and mag-
tially with +φapp (forward bias) and that reaches a voltage- netoresistance, but these have been interactions between a
independent reverse current of −J0 for −φapp (reverse magnetic field and a moving charge, not with the electron
bias). Actual current transport mechanisms determine the spin or with magnetic moments associated with electron
specific dependences of J0 and A. For a Schottky barrier motion. It is a consideration of these latter interactions that
with transport controlled by thermionic emission, or for an leads to the large field known as magnetic properties.
ideal homojunction with transport controlled by diffusion, Since the subject of magnetic properties is treated in
A = 1. detail elsewhere in this Encyclopedia, our purpose in this
Because of the extreme versatility of the standard p–n section is simply to give a brief introduction and survey.
homojunction, it is useful for a wide spectrum of appli- There are two sources of an electronic magnetic mo-
cations, including rectifiers, amplifiers (transistors), pho- ment: (1) the orbital electronic motion of electrons in
todetectors, photovoltaic energy converters, and radiation atoms, and (2) the intrinsic electron angular momentum,
emitters, both noncoherent and lasers. the spin.
Properties and applications of single-crystal or thin-film A current I flowing in a loop enclosing an area A gives
p–n junction devices are being greatly expanded by the rise to a magnetic moment
ability acquired in recent years that makes it possible to
µ = µ0 IA, (34)
deposit multilayer heterojunctions with atomically abrupt
interfaces and controlled composition and doping in in- where µ0 is the permeability of free space, and the direc-
dividual layers that are only a few tens of nanometers tion of the magnetic moment is perpendicular to the plane
thick. These layers are so thin that the energy levels in in which A is defined. In the completed electronic shell of
them show the quantum effects of small potential well an atom (2 s electrons, 6 p electrons, 10 d electrons, etc.),
thickness, which causes the continuous allowed levels the net magnetic moment in the absence of a magnetic
associated with macroscopic thicknesses to become the field is zero. If a magnetic field is applied to such a ma-
discrete allowed spectrum described by Eq. (1) for small terial, however, a magnetic field is set up according to
values of L. It becomes possible to fine-tune a whole range Eq. (34) that opposes the applied field (Lenz’s law). This
of properties through choice of the appropriate structural phenomenon is known as diamagnetism and corresponds
parameters. to a slight repulsion of a material by a magnetic field. The
Structures called quantum wells are formed by sand- magnetization, the magnetic moment per unit volume, M
wiching a very thin layer of a small-band-gap material be- is related to the magnetic field H by M = κH, where κ is
tween two layers of a wide-band-gap material. The actual the magnetic susceptibility; for a diamagnetic effect, κ is
values of the discrete levels in a quantum well are deter- small and negative. All materials exhibit some degree of
mined by the thickness and the depth of the well, which is diamagnetism. The classical Langevin equation for dia-
the band discontinuity (electron band discontinuity E c magnetic susceptibility is
P1: LLL/GJP P2: FYD/GTL QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-216 June 15, 2001 20:31
FIGURE 21 Typical ferromagnetic hysteresis curve showing sat- SEE ALSO THE FOLLOWING ARTICLES
uration magnetization Ms , remanent magnetization Mr , and coer-
cive field Hc . BONDING AND STRUCTURE IN SOLIDS • POLYMERS, ELEC-
TRONIC PROPERTIES • SOLID-STATE CHEMISTRY • SOLID-
and inclusions such that domains can be easily aligned by STATE ELECTROCHEMISTRY • SUPERCONDUCTIVITY
an applied magnetic field, the area of the hysteresis loop is
small. Such materials are useful for transformer cores or
for electromagnets and are called soft magnetic materials. BIBLIOGRAPHY
Magnetic materials are playing an increasingly promi-
nent role in power distribution, conversion between elec- Bube, R. H. (1992). “Electrons in Solids,” 3rd ed., Academic Press, New
trical and mechanical energy, microwave communica- York.
tions, and data storage. Colclaser, R. A., and Diehl-Nagle, S. (1985). “Materials and Devices for
One significant development in magnetic memory has Electrical Engineers and Physicists,” McGraw-Hill, New York.
Gerlach, B., and Löwen, H. (1991). Rev. Mod. Phys. 63(1), 63–90.
been the magnetic bubble. A cylindrically shaped mag-
Hummel, R. E. (1985). “Electronic Properties of Materials,” Springer-
netic domain—a small region of oppositely magnetized Verlag, New York.
material in a matrix of magnetic material with the op- Mayer, J. W., and Lau, S. S. (1990). “Electronic Materials Science: For
posite orientation—is stable for a wide range of magnetic Integrated Circuits in Si and GaAs,” Macmillan Co., New York.
fields. These configurations allow the retention of memory Solymar, L., and Walsh, D. (1979). “Lectures on the Electrical Properties
of Materials,” 2nd ed., Oxford Univ. Press, Oxford.
when the power is turned off and also allow a high packing
Wilson, J., and Hawkes, J. F. B. (1983). “Optoelectronics: An Introduc-
density. tion,” Prentice-Hall, Englewood Cliffs, N.J.
Increasingly stringent requirements are imposed for Wolfe, C. M., Holonyak, N., Jr., and Stillman, G. E. (1989). “Physical
magnetic recording heads as the recording density in- Properties of Semiconductors,” Prentice-Hall, Englewood Cliffs, N.J.
P1: FYD Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
Excitons, Semiconductor
Donald C. Reynolds
Wright Patterson Air Force Base
Thomas C. Collins
University of Tennessee
687
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
by first-order perturbation theory of a system of N atoms the ground and excited states of the exciton lie below the
having one valence electron per atom with the following band-gap energy of the semiconductor. Hence, the exciton
properties: structure must first be determined in order to determine the
band-gap energy. The exciton binding energy can be de-
(a) The coupling between different atoms in a crystal is termined from spectral analysis of its hydrogenic ground
small compared with the forces holding the electron and excited state transitions (this also gives central-cell
within the separate atoms. corrections). Precise band-gap energies can be determined
(b) The Born–Oppenheimer approximation is valid. by adding the exciton binding energy to the experimentally
(c) The total wave function is a product of one-electron measured photon energy of the ground-state transition.
functions. Both direct and indirect exciton formation occurs in
semiconductors, depending on the band structure. The
Thus, the Frenkel exciton is a tight-binding descrip- former is characteristic of many of the II–VI and III–V
tion of an electron and a hole bound at a single site such compounds, and the latter is characteristic of germanium
that their separate identities are not lost. This model of and silicon. For indirect optical transitions, momentum
the exciton emerges as a limiting case of the general the- is conserved by the emission or absorption of phonons.
ory of excitons and is applicable to insulating crystals. The detailed nature of the valence-band structure of de-
In the case of semiconductors, nonequilibrium electrons generate and nondegenerate semiconductors is elucidated
and holes are bound in excitonic states at low tempera- by understanding the intrinsic-exciton structure of these
tures by Coulomb attraction. Semiconducting crystals are semiconductors.
characterized by large dielectric constants and small ef-
fective masses; therefore the electrons and holes may be B. Excitation of the n-Particle System
treated in a good approximation as completely indepen-
dent particles, despite the Coulomb interaction. This re- Excitons are excited states of the system in which the
sults because the dielectric constant reduces the Coulomb number of electrons does not change. Ordinarily in solid-
interaction between the hole and electron to the extent that state physics one thinks of calculations of one-body ap-
it produces a weakly bound pair of particles that still retain proximations as excitation of the system. However, this
much of their free character. The exciton represents a state only works in metals near the Fermi surface or where one
of slightly lower energy than the unbound hole–electron. adds an electron or takes one away from the system. In
The effective mass theory used to describe such weakly the case of metals, the electrons and holes are very dif-
bound particles was developed by Wannier. These weakly fuse and the interaction between the excited “particle”
bound excitons are appropriately described using the one- and the other “particles” is very small. When an electron
band electronic structure picture by adding the Coulomb is added to the system, one obtains electron affinities or
interaction between the hole and electron. Semiconduc- N + 1 solutions, and when an electron is removed one
tor materials are the heart of most modern optical and obtains ionization energies or N − 1 solutions. All so-
electronic devices; as a result, the dominant technological lutions to one-body calculations such as the one-particle
interest is focused on these materials. In view of this inter- Green’s function method, Hartree–Fock calculations, and
est, this article emphasizes the Wannier excitons, which similar approximations (unless specifically added) do not
are appropriate for these materials. contain the interaction of the excited “particle” with the
other “particles.” It is necessary to calculate solutions to
the two-body Green’s function (or some approximation to
the two-body problem) in order to have an exciton.
I. INTRINSIC EXCITON CHARACTERISTICS
Another approach is to calculate the many-body excited
states by construction of excited wave functions that are or-
A. Introduction
thogonal to the approximate ground-state wave function.
The intrinsic fundamental-gap exciton in semiconductors The excited-state approximate total energy is calculated,
is a hydrogenically bound hole–electron pair, the hole be- from which is subtracted the approximate total ground-
ing derived from the top valence band and the electron state energy. The formalism gives an effective operator
from the bottom conduction band. It is a normal mode that looks like a scaled or screened hydrogen Hamiltonian.
of the crystal created by an optical excitation wave, and This approach was used by both Frenkel and Wannier. In
its wave functions are analogous to those of the Block the case of Frenkel, the excited states were required to be a
wave states of free electrons and holes. When most semi- linear combination of very local excitations. In particular,
conductors are optically excited at low temperatures, it he required the atoms in his model to all be in the ground
is the intrinsic excitons that are excited. The energies of state except one. This one was excited, but the excitation
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
3. Indirect Transitions
Two of the most extensively studied indirect materials are
FIGURE 1 Band structure and band symmetries for the wurtzite the elemental semiconductors Si and Ge. Both of these
structure. materials have indirect band gaps, and therefore the low-
est energy electronic state is an indirect exciton. For this
lowest energy state to be optically excited, momentum
2. Direct Degenerate Semiconductors must be conserved; thus, additional momentum must be
Materials that crystallize in the diamond or the zinc-blende supplied by the creation or annihilation of an appropriate
structures are representative of degenerate semiconduc- phonon. These materials crystallize in the diamond struc-
tors. Two materials that have been extensively investi- ture and belong to the Oh point-group symmetry.
gated and that are characteristic of direct degenerate semi- The band structures of Si and Ge are similar; as a re-
conductors are GaAs and InP. These materials crystallize sult, Si will be used as the example in this discussion to
in the zinc-blende structure, which has Td point-group describe the indirect exciton. The band structure of Si is
symmetry. shown in Fig. 3. The conduction-band minimum occurs
The dipole momement operator for electric dipole ra- as a 1 symmetry approximately 85% of the way to the
diation in zinc-blende structures transforms like 5 . The zone boundary in the
100 direction. The corresponding
conduction band is s-like, while the valence band is p-like. valence band symmetry is 5 . Using the group character
This structure does not have a principal axis; therefore, the tables at = (0, 0, 0) and = (k, 0, 0),
crystal-field energy is zero and the full degeneracy of the p + + − −
1 × 5 = 15 + 25 + 15 + 25 (5)
levels is retained. Thus, disregarding spin–orbit coupling,
the following decomposition at the center of the Brillouin
zone is obtained:
conduction band S → 1
valence band P → 4
Introducing the spin doubles the number of levels. Con-
sider the 1 s-like conduction band and the triply degener-
ate p-like valence band. The states at the center of the Bril-
louin zone, which belong to 1 and 4 representations of
the single group, are shown in Fig. 2. The splitting caused
by the presence of spin is represented by the inner product
as follows:
1 × D1/2 → 6
4 × D3/2 → 7 + 8
Physically this result means that the six valence-band
states, consisting of the three p-like states, each associated FIGURE 2 Band structure and band symmetries for the zinc-
with one or the other of the two spin states, and that are blende structure.
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
− D. Perturbations
The presence of the 15 symmetry means that this is an al-
lowed transition. The maximum in the valence band occurs 1. Magnetic Fields
+ + −
at k = 0 having 25 symmetry. The 25 → 15 transition
When the crystal is placed in a uniform magnetic field,
is allowed as
there are several new terms in the Hamiltonian, which
+ −
25 × 15 = 2− + 12
− −
+ 25 −
+ 15 (6) will be described below. In this description, the band-gap
extrema are at k = 0 with their shape parabolic at least to
The momentum-conserving phonon in the valence band
+ second order in k and with only double spin degeneracy.
from 5 to 25 is
One may write the exciton equation as a simple hydro-
+
25 × 5 = (5 ) + (1 + 2 ) + 1 + 2 gen Schrodinger equation including mass and dielectric
anisotropy. In general one finds the mass anisotropy is
= (TO + TA) + (LA + LO) (7)
small, allowing first-order perturbation calculations to be
where TO is transverse optical, TA is transverse acoustic, made for the energy states as well as for the magnetic-field
and L indicates longitudinal. All phonons are allowed, as effects.
seen from Fig. 4. In this case, the energy denominator in Since in this model the valence and conduction band
extrema are at k = 0, the wave vector of the light that
creates the exciton k will also represent the position of
the exciton in k space. If one divides the momentum and
space coordinates into the center-of-mass coordinates and
the internal coordinates, the exciton Hamiltonian can be
divided into seven terms as follows:
H = H1 + H2 + H3 + H4 + Hk1 + Hk2 + Hk3 ; (9)
2
−h 1 ∂ ∂2 1 ∂2
H1 = + +
2m µx ∂ x 2 ∂ y2 µz ∂z 2
e2
− (x 2 + y 2 + η−1 z 2 )−1/2 (10)
εη1/2
Ax ∂ Ay ∂ Az ∂
H2 = −2iζ + + (11)
x ∂ x y ∂ y z ∂z
2
e2 Ax A2y A2z
H3 = + + (12)
2mc2 µx µy µz
ζ
H4 = (geγ Seγ + ghγ Shγ )Hγ (13)
2 γ =x,y,z
FIGURE 4 The vibration spectrum of silicon. [As determined by i h Kx ∂ Ky ∂ Kz ∂
Hk1 = + + (14)
Brockhouse (1959). Phys. Rev. Lett. 2, 256.] 2m x ∂ x y ∂ y z ∂z
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
showed a splitting, resulting from combined strain and but encompasses a number of neighboring lattice points
electron-hole spin exchange. The inclusion of the spin- of the host crystal. Hence, in the absorption process, light
exchange interaction also allows one to bridge the gap is absorbed by the entire region of the crystal consisting
between the description of excitons in the J –J coupling of the impurity and its surroundings.
scheme and the L–S coupling scheme. The oscillator strength of the bound exciton, Fd , relative
to that of the free exciton f ex can be expressed as
Fd = (E 0 /|E|)3/2 f ex (23)
II. EXTRINSIC EXCITON
CHARACTERISTICS where E 0 = (2h2 /m)(π/ 0 )2/3 , E is the binding energy
of the exciton to the impurity, m is the effective mass of
A. Introduction the intrinsic exciton, and 0 is the volume of the unit cell.
It has been shown in some materials that Fd exceeds
The intrinsic exciton may bind to various impurities, de-
f ex by more than four orders of magnitude. An inspection
fects, and complexes, and the subsequent decay from the
of Eq. (23) reveals that, as the intrinsic exciton becomes
bound state yields information concerning the center to
more tightly bound to the associated center, the oscillator
which it was bound. Bound-exciton complexes are ex-
strength, and hence the intensity of the excitoncomplex
trinsic properties of materials. These complexes are ob-
line, should decrease as (I /E)3/2 .
served as sharp-line optical transitions in both photolumi-
In magnetic fields, bound excitons have unique Zeeman
nescence and absorption. The binding energy of the exci-
spectral characteristics, from which it is possible to iden-
ton to the impurity or defect is generally weak compared
tify the types of centers to which the free excitons are
to the free-exciton binding energy. The resulting complex
bound. Bound-exciton spectroscopy is a very powerful an-
is molecular-like (analogous to the hydrogen molecule or
alytical tool for the study and identification of impurities
molecule–ion) and has spectral properties that are analo-
and defects in semiconductor materials.
gous to those of simple diatomic molecules. The emission
or absorption energies of these bound-exciton transitions
are always below those of the corresponding free-exciton B. Bound-Exciton Complexes
transitions, due to the molecular binding energy. in Different Symmetries
Bound excitons were first reported in the indirect semi-
conductor silicon. Here it was found that when group V 1. Degenerate Semiconductors
elements were added to silicon, sharp photoluminescent The model of the donor–bound-exciton complex is used
lines were produced, and these lines were displaced in to describe bound-exciton complexes in zinc-blende struc-
energy in a regular way. The binding energies of exciton tures. The Hamiltonian of the system may be written
complexes produced by adding different group V donors
were described by the linear relation H = Hex + Hd + Hdex (24)
E = 0.1E i (22) where Hex and Hd are the exciton and donor Hamiltonians
and Hdex describes the interaction between the exciton
where E is the binding energy of the exciton and E i is the and donor. The Hamiltonian for the exciton is given in
ionization energy of the donor. The small differences in Eq. (4). In this equation, Heh is the interaction Hamiltonian
ionization energies for different-effective-mass chemical between the electron and hole,
donors result from central cell corrections. A similar re-
lationship was found when the group III acceptors were Heh = −e2 /ε|re − rh | + Hexch (25)
added to silicon. A modified linear relationship has been
found for donors and acceptors in compound semiconduc- Here ε is the dielectric constant and Hexch is the electron–
tors. hole exchange Hamiltonian. The exchange Hamiltonian
The sharp spectral lines of bound exciton complexes Hexch is
can be very intense (large oscillator strength). The line
Hexch = A1 σ · J + A2 σx Jx3 + σ y Jy3 + σz Jz3 (26)
intensities will, in general, depend on the concentrations
of impurities and/or defects present in the sample. where σ and J are the operators for electron spin and effec-
If the absorption transition occurs at k = 0 and if the tive hole spin, respectively, and A1 and A2 are parameters
discrete level associated with the impurity approaches the describing the exchange energy.
conduction band, the intensity of the absorption line in- The model of the exciton bound to a neutral donor is
creases. The explanation offered for this intensity behavior shown in Fig. 5. In the initial state the two electrons pair to
is that the optical excitation is not localized in the impurity form a bonding state, leaving an unpaired hole. When the
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
2. Nondegenerate Semiconductors
The theory for the nondegenerate case is based on the
wurtzite structure, with the salient factors of the band
structure such as band symmetries and selection rules be-
ing derived from group theory.
FIGURE 6 Schematic representation of radiative recombination Consider any simple optical transition in which an
of an exciton bound to a neutral acceptor, where the final state is electron bound to an impurity is taken from one band
the acceptor in the ground or in the excited configuration. to another. Suppose the initial and final states can be
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
2. Stress Field
In zinc-blende-type semiconductors, the uniaxial strain FIGURE 10 Schematic plot of the splitting of the (A0 X) initial and
ground states under uniaxial stress. [From Schmidt, M., Morgan,
patterns and electric dipole selection rules have been de-
T. N., and Shairer, W. (1975). Phys. Rev. B11, 5002.]
rived for lines arising from weakly bound exciton com-
plexes. The effect of stress on excitons bound to shallow
neutral acceptors in zinc-blende structures has been rather the splitting and shifts of the initial energy states prior
thoroughly investigated. to exciton decay. The energies of these states have been
In the unstrained crystal, a hole from the J = 32 (8 ) va- calculated and compared with experimental observations,
lence band in combination with an electron from the J = 12 as shown in Fig. 11. The lines in Fig. 11. show the pre-
(6 ) conduction band gives rise to the ground-state exci- dicted energy levels (in the absence of a crystal field) for
ton. Uniaxial stress splits the J = 32 degenerate valence
band into two bands, one with M j = ± 12 , the other with
M j = ± 32 . This splitting is reflected in optical transitions
involving holes from the valence band. The shallow accep-
tor removes an energy state from the valence band and es-
tablishes it as a quantum state of lower energy in the gap re-
gion. This state is made up of valence-band wave functions
and therefore will also reflect valence-band splittings.
When an excition is captured by the shallow acceptor,
an acceptor-bound exciton complex (A0 X) is formed that
consists of two holes and one electron weakly bound to
a negative acceptor ion. In the absence of stress, three
transitions are observed, as shown in Fig. 6.
When a uniaxial stress is applied to the (A0 X) complex
described above, the degeneracy of the states is lifted due
to the splitting of the 8 hole states. A schematic plot of the
FIGURE 11 Experimental (points) and calculated (lines) line
resulting energies is shown in Fig. 10. The lower part of shifts under increasing uniaxial stress. The stress is applied in
the figure shows the splitting of the final (one-hole) state a [100] direction. [From Schmidt, M., Morgan, T. N., and Shairer,
after the collapse of the exciton. The upper part shows W. (1975). Phys. Rev. B11, 5002.]
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
transitions between the initial and final states, with the cen-
ter of gravity shift included. The σ lines are for transitions
polarized perpendicular to the applied stress, while the
π lines show transitions polarized parallel to the applied
stress. The agreement between theory and experiment is
very good.
D. Multibound Excitons
Sharp photoluminescent lines have been observed at ener-
gies less than the energy of the line associated with an exci-
ton bound to a neutral donor in silicon, germanium, and sil-
icon carbide. Similar lines have also been observed that are
associated with acceptors in silicon and gallium arsenide.
The energies and widths of these lines were such that
they could not be explained in terms of any recombination
mechanism involving just a single exciton bound to a neu-
tral shallow impurity center. A model involving a multi-
exciton complex bound to a donor (acceptor) was invoked
in which each line was associated with radiative recombi-
nations of an exciton in the bound multiexciton complex.
A series of emission lines was observed in silicon crys-
tals also lightly doped with boron or phosphorus. The
series began with the bound-exciton line and converged
toward the energetic position of the maximum of emis-
sion of the condensed electron–hole state. The emission
series is shown for both boron and phosphorus dopants
in Fig. 12. The impurities can bind a series of intermedi- FIGURE 12 Emission spectra of (a) Si : B with TO phonon and
ate “multiple-exciton states” containing the single bound (b) Si : P without phonon assistance. Excitation intensity 7.5 W
exciton and electron–hole droplet state. cm−2 , T = 2 K. The dashed lines indicate the positions of the
FEs and the maxima of EHD emission. In Si : P(NP), the FE
A model was proposed in which the multiple-exciton
does not really appear; however, its position is known from the
complex is built up by successive capture of free excitons phonon-assisted FE spectrum. The EHD emission in the NP
at neutral impurity centers. A multiple-exciton complex spectrum only appears at higher doping levels and under high
having index m can capture another free exciton and then excitation. The arrows mark the calculated values hνm ∗ given
∗ = −18.5[1 − exp(−0.21m)] meV for Si : B and hν ∗ =
have the index m + 1; the decay of an exciton would de- by hνm m
−18.5[1 − exp(−0.32 m)] meV for Si : P. [From Sauer, R. (1973).
crease the index to m − 1. The observed photon energy
Phys. Rev. Lett. 31, 376.]
hνm is the difference between the energies of the initial
and final states,
hνm = E g − E FE − E m = hνFE − E m (34) Other models have been proposed, including a shell model
in which all of the electrons and all of the holes in the
where E FE is the binding energy of the free exciton and E m bound multiexciton complex are assumed to be equiva-
that of an exciton in the m complex. The energy difference lent and therefore must conform to the Pauli principle.
between the mth line hνm and the free-exciton line is a The complex is then built up along the lines of a shell
measure of the binding energy E m . model similar to what has been used to study nuclei and
The model was successful in obtaining an empirical fit many-electron atoms.
to the series of emission lines with the series formulas The behavior of these new lines in the presence of mag-
hνm∗ = −18.5[1 − exp(−0.21m)] meV (35) netic and stress fields helped to establish the viability of
the bound multiexciton complex model.
for Si : B (except for the bound exciton line) and
hνm∗ = −18.5[1 − exp(−0.32m)] meV (36) E. Donor–Acceptor Pairs
for Si : P. The calculated line positions are shown in Donor–acceptor pairs introduce transitions in the bound-
Fig. 12, h ν0∗ = 0 corresponds to the free-exciton line. exciton region whose behavior is quite different from
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
excitons bound to foreign impurities or defects. The pairs III. INTERACTION OF EXCITONS
can produce bound states distributed in energy. The range WITH OTHER SYSTEMS
of energies results both from the possible impurities or
defects interacting as pairs and from a dependence on pair A. Phonons
separation. Discrete pair spectra were first observed as a
very complicated spectra in GaP consisting of very many Emission from bound exciton complexes has been ob-
sharp lines. The donors and acceptors will occupy substi- served in many materials. These are very sharp transi-
tutional or interstitial sites. In the case of substitutional tions which in many cases are replicated by emission
sites, both the donor and acceptor can occupy sites on the lines that are separated in energy from the parent tran-
same sublattice for a compound material such as GaP, or sition by an optical phonon energy for the particular lat-
they may be on opposite sublattice sites. Another arrange- tice in question. In crystals having the wurtzite symme-
ment is with one impurity at an interstitial site and the try there is a 1 and a 5 longitudinal optical–transverse
other at a particular lattice site. All of these arrangements optical (LO–TO) splitting due to long-range electrostatic
have been observed. forces as well as a 1 –5 LO–LO and TO–TO splitting
The energy required to bring a hole and an electron from due to anisotropic short-range interatomic forces. The 1 –
infinity to an ionized donor–acceptor pair separated by a 5 LO–LO splitting has been observed on the phonon
distance R may be written as sidebands, due to the interaction with the macroscopic
longitudinal optical phonon electric field, in both CdS and
ZnO. In CdS, the phonon-assisted the results from the col-
E(R) = E g − E A − E D + e2 /ε R (37) lapse of the exciton bound to a neutral acceptor with the
creation of an LO phonon. Both the 1 and the 5 LO
In this expression E(R) is the energy of the pair recom- phonons are created in this process. These two phonons
bination line, E g the band gap of the semiconductor, E A differ in energy by 2.4 cm−1 . This small difference in en-
and E D the acceptor- and donor-binding energies, respec- ergy is clearly resolved, showing that the phonon-assisted
tively, R the donor–acceptor separation, and ε the low- transitions are not appreciably broadened by the phonon
frequency dielectric constant. When the donor–acceptor interaction.
distances become small [R < R0 = (donor–acceptor In the case of CdS, the exciton is rather weakly bound
concentration)1/3 ], a van der Waals attractive term may (17 meV) to the acceptor. This results in a localized state
become important, and Eq. (37) becomes in K space. The phonon energies show that it is localized
near K = 0. The phonon dispersion curves were calcu-
E(R) = E g − E A − E D + e2 /ε R − (e/K )(a/R)6 (38) lated for CdS using a mixed binding model. In this model
the potential contains a short-range part corresponding to
In the case of random pair distribution, it would be covalent bonding and a long-range part due to Coulomb
expected that over a small range of R, the line intensity interactions between point ion charges. The calculations
would reflect the statistical probability of a specific pair show that the LO phonon dispersion curves are quite
occurring. In considering GaP, which has the zinc-blende flat in the vicinity of K = 0. This would account for
structure, and assuming that both the donors and accep- the very small line broadening observed in the phonon
tors result from substitutional impurities and that both interaction.
occupy sites on the same sublattice, it is possible to re- A similar interaction between the LO phonon and an
late R to a given observed line. For the preceding case, exciton bound to a neutral donor was observed in ZnO.
Rm = a0 ( 12 m)1/2 , where a0 is the GaP lattice constant In this material, the energy separation between the 1 LO
and Rm the distance to the mth nearest neighbor on the phonon and the 5 LO phonon is larger (11 cm−1 ).
radius of the mth shell. The donors and acceptors oc-
cupy face-centered cubic sites, and the number of pairs
1. Exciton–Bound-Phonon Quasi-Particle
for a given m can be tabulated. The variation in number
of pairs allows a correlation with observed spectra. For Optical transitions have been observed in a number of
the case when the donors and acceptors occupy opposite ionic crystals in which the energy separating the parent
sublattice sites, Rm = a0 ( 12 m − 16 ) and N (R) > 0 for
5 1/2
transition and its LO-phonon sideband is less than the
all m. LO-phonon energy hω0 by approximately 10%. In ab-
The value of E(R) and R are determined from experi- sorption spectra of AgBr : I, transitions associated with
ment; therefore Eqs. (37) and (38) can be helpful in identi- the bound exciton occur at energy separations approxi-
fying the donors and acceptors involved in donor–acceptor mately 30% less than hω0 . These results can be explained
pair recombination. in terms of a bound-phonon quasi-particle model. The
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
calculated binding energies and oscillator strengths for (E j − E 0 )
0|eik·r | j
j|eik ·r |0
this new quasi-particle can account for phonon interac-
(39)
j (E j − E 0 )2 − (hω)2 K K
tions whose energies are less than that of the LO phonon
hω0 . LO phonons bound to neutral donors have been ob- where |0 is the donor ground-state wave function and
served in both Raman-scattered and luminescence spectra the sum is over excited donor states j. The interaction is
of GaP crystals that were doped with S, Te, Si, and Sn. attractive when hω is less than the excitation energy of the
These results were interpreted as impurity modes associ- donor. For a spherical approximation, the interaction will
ated with dielectric effects of the neutral donors, rather produce bound states for each angular momentum of the
than as local modes associated with mass defects of the LO phonon around the donor. When the first excited state
substituents. of the donor is comparable with the phonon frequency,
The spectra due to Raman scattering from the neu- an approximation in which only the lowest donor excited
tral donor LO-phonon bound states are shown in Fig. 13. state is kept will give a reasonable lower bound to the
The binding energies associated with the exciton–bound- binding energy for HKK .
phonon quasi-particle for several different donors are Using this approximation, one obtains the following
given in Table I. wave functions and energies for the s and p states. For s
The virtual process states,
donor + LO phonon k → excited donor 2
32 ε0 e E 2s − E 1s
EB = hω0 −1
→ donor + LO phonon k 729 ε∞ 2aε0 (E 2s − E 1s )2 − (hω0 )2
produces the interaction of an LO phonon with a donor (40)
site. The effective scattering matrix element Hkk is pro- K
portional to K α
2 3 (41)
3
2
a + K2
For p states,
2
224 ε0 e E 2p − E 1s
EB = hω0 −1
6561 ε∞ 2aε0 (E 2p − E 1s )2 − (hω0 )2
(42)
cos θb
K α
2 3 (43)
3
2
a K2
In these expressions, ε0 and ε∞ are the static and high-
frequency dielectric constants of the Frohlich electron–
phonon interaction. The donor Bohr radius is a, and the
donor binding energy is e2 /2aε0 for a hydrogenic donor.
The theoretical binding energies from Eqs. (40) and (42)
are included in Table I.
B. Photons
1. Spatial Resonance Dispersion
For those states in a crystal where the photon wave vec-
tor and the exciton wave vector are essentially equal, the
energy denominator for exciton–photon mixing is small
and the mixing becomes large. These states are not to be
FIGURE 13 Raman scattering of 5145- Å Ar+ laser light from considered as pure photon states or pure exciton states,
GaP containing ∼1015 cm−3 neutral Sn, Te, or S donors, recorded but rather mixed states. Such a mixed state has been
just below the k = 0, LO phonon at 50.2 meV, showing the new called a polariton. When there is a dispersion of the di-
local modes. These modes can be seen easily at donor concen-
electric constant, spatial dispersion has been invoked to
trations as low as 1017 cm−3 , although their strengths relative to
the LO lattice normal mode decrease in proportion to the neutral explain certain optical effects of crystals. It was origi-
donor concentrations. [From Dean P. J., Manchon, Jr., D. D., and nally thought that it would introduce only small correc-
Hopfield, J. J. (1970). Phys. Rev. Lett. 25, 1027.] tions to such things as the index of refraction, until it was
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
TABLE I Experimental and Calculated Binding Energies EB of LO Phonons Localized at Neutral Donors in GaP
Observed through Raman Scattering (EB ) p, and as Sidebands in the Luminescence of Excitons Bound to Neutral
Donors (EB )sa
p p
E 1s (A1 )→E 2s (A1 ) E 1s (A1 )→E b2p (E B )scalc (E B )2exp (E B )calc (E B )exp
Donor (meV) (meV) (meV) (meV) (meV) (meV) (meV)
demonstrated that if there was more than one energy trans- from other oscillators are in included in a background di-
port mechanism, as in the case of excitons, this was not electric constant ε. In Eq. (44) one has expanded both the
true. Spatial dispersion addresses the possibility that two numerator and denominator in powers of k, keeping terms
different kinds of waves of the same energy and same to order of k 2 , m ∗ is the sum of the effective masses of
polarization can exist in a crystal differing only in wave the hole and electron that comprise the exciton, and ω0 j
vector. The one with an anomalously large wave vector is the frequency of the jth oscillator at k = 0. Eliminat-
is an anomalous wave. In the treatment of dispersion by ing k 2 from Eq. (44) and neglecting the line width of the
exciton theory, it was shown that if the normal modes of oscillators, the working equation is
the system were allowed to depend on the wave vector,
4π α0 j + α2 j ω2 n 2 c2 ω02 j
a much higher order equation for the index of refraction n =ε+
2
2 (45)
2 2 c2 m ∗ − ω2
j ω0 j + hω0 j ω n
2
would result. The new solutions occur whenever there is j
any curvature of the ordinary exciton band in the region of The sum is over excitons from the top two valence bands,
large exciton–photon coupling. These results apply to the where the “allowed” excitons have been included with
Lorentz model as well as to quantum-mechanical mod- α0 j = 0, α2 j = 0 while the “forbidden” (which are seen
els whenever there is a dependence of frequency on wave only because k = 0) excitons are included with α0 j = 0,
vector. α2 j = 0. The above equation reduces to a polynomial in n 2
It was pointed out early in the investigation of spatial whose roots give the wavelengths of the various “normal
dispersion that the specific dipole moment of polarization modes” for transfer of energy within the crystal.
of a crystal and the electric field intensity are not in di- In the classical case, α2 j = hω0 j /m ∗j = 0. For a given
rect proportion. It was found that the two were related by a
frequency, the two roots of n 2 for Eq. (45) are −n and
differential equation that resulted in giving Maxwell equa-
+n. Thus in the classical case, for a given principle polar-
tions of higher order. This led to the existence of several
ization, frequency, and direction of propagation, only one
waves of the same frequency, polarization, and direction
transverse mode exists.
but with different indices of refraction. Subsequent stud-
ies of the reflectivity of CdS demonstrated the effects of
spatial dispersion. Extensive calculations resulted in the 2. Two-Photon Processes
following expression for the index of refraction:
The concept of two-photon processes dates back more than
2 2
c k 40 years, and was first treated theoretically. The observa-
n2 = =ε tion of two-photon transitions occurred approximately 30
ω2
years later. The two-photon transition is a nonlinear pro-
4π α0 j + α2 j k 2 ω·2j cess, and as such its full potential as a tool for investigating
+
(44)
2 m ∗ − ω2 − iω
j ω0 j + hω0 j k
2 material parameters was not realized until the advent of
j j
laser sources.
In this equation the sum over j is to include the excitons The process is one in which two quanta are simulta-
in the frequency region of interest, and the contributions neously absorbed in an electronic transition. The energy
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
sum of the two quanta must be equal to the energy of the B. Excitons in Quantum Wells
electronic transition. In one-photon absorption spectra, the and Quantum Dots
absorption coefficient as a function of photon energy is
Superlattice structures have generated considerable in-
obtained. In two-photon transitions, the absorption is de-
terest for more than a decade because of the novel
pendent on two frequencies; therefore, instead of a plane
transport phenomena predicted for such structures. The
curve, the spectrum is a two-dimensional surface. The two
superlattice is a multilayered periodic structure having di-
processes are complementary; however, they yield dif-
mensions varying from a few-angstroms to hundreds of
ferent information. The single-photon process is allowed
angstroms. Carriers in semiconductor superlattices may
between states of different parity, while the two-photon
be confined to certain layers by the superlattice potential
transitions are allowed between states of the same par-
variations, resulting in new conductivity properties. The
ity. The polarization dependence of two-photon absorp-
allowed carrier energy levels are determined by quanti-
tion is more complicated than it is for one-photon pro-
zation effects when the confined regions are sufficiently
cesses. Even for isotropic materials, the mutual orientation
small. The confined regions produce new optical effects as
of the electric vectors of the absorbed fields is important.
well as electrical effects. Superlattices emerged as practi-
Thus, the two-photon spectrum contains more informa-
cal structures when the metal organic chemical vapor de-
tion than the one-photon spectrum. All two-quantum pro-
position (MOCVD) and molecular beam epitaxy (MBE)
cesses require an intermediate state in which one photon
crystal growth techniques evolved, making high-quality
is absorbed or emitted and the atom is left in an excited
structures feasible. Very thin layers with smooth surface
state.
morphology can be grown by these techniques. One of
The fine structure of the 2P states of the exciton from
the very common heterostructures produced by these tech-
the top valence level in CdS has been studied. In this ex-
niques is GaAs/Alx Ga1−x As. By cladding the GaAs layer
periment, a visible dye laser and a CO2 laser were used.
with GaAIAs barriers, the electrons and holes are con-
From the polarization selection rules, it was shown that
fined within the GaAs well, resulting in a modification of
the visible photon created the virtual 1s exciton, and the
their energy levels in the well. Repeating the growth of
absorption of the infrared photon brings it to the final p
these layers results in a multiquantum well (MQW) struc-
state. Using these results, the two-photon absorption coef-
ture, the number of wells being equal to the number of
ficient was calculated and compared with the experimen-
repeated cycles. When the layer thicknesses are small, the
tally measured results. The agreement between theory and
electrons are confined as quantized electron waves. If the
experiment is very good.
barrier-layer thicknesses are large enough, tunneling be-
tween wells does not occur. The confinement of carriers
IV. SPECIAL PROPERTIES OF EXCITONS within the GaAs well results in an effective increase in the
bandgap. The low-temperature bandgap of bulk GaAs is
A. Introduction 1.5196 eV. Alx Ga1−x As has a direct bandgap for x < 0.45,
with a bulk bandgap which is 1.25 x eV greater than
The excitonic properties of the semiconductor are key to
GaAs. In the quantum well structure, the difference in
the development of a large number of devices. In this
bandgap is divided between the conduction band and the
section five of the properties that have direct application
valence band. The percentage contribution to each band is
will be outlined. The first is the exciton states in quantum
a measure of the confining barrier for that band. Well-size
wells. The binding energy of the exciton can be adjusted
quantization results in a shift of the allowed energies for
due to the thickness of the layers of the different semi-
electrons. If infinite confining barriers are assumed, the
conductor materials. This leads to having different energy
allowed minimum energies for electrons are given by
responses, as well as many other electrical and optical
properties. E n = h 2 n 2 /8m ∗ L 2 (46)
The second area is hole–electron droplets, which are
extremely useful in the study of many-body effects and where h is Planck’s constant, n is an integer marking the
the phase transition from a gas into a liquid. The role of number of half-wavelengths of the confined electron, m ∗
excitons in developing a high-temperature superconductor is the effective mass, and L is the well thickness. These en-
is given in Section IV.D. When this property is exploited, ergy shifts for electrons in the conduction band are shown
it could have a major impact on modern electronic de- in Fig. 14.
vices. The last two areas outlined are lasing transitions The energy levels in the valence band of the quan-
and optical bistability. These two areas lead to totally op- tum well are also modified. In bulk GaAs, the light and
tical switching devices, which again may lead to many heavy hole valence bands are degenerate at K = 0. From
useful devices such as the optical computer. Eq. (46) it is seen that the energy of the confined particles is
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
1 1 1
= + (γ1 ± γ2 ) (48)
µ± me m0
and
1 1
= (γ1 ∓ 2γ2 ) (49)
m± m0
where m 0 is the free electron mass. In these equations
the upper sign refers to the JZ = ± 32 (heavy-hole) band
and the lower sign to the JZ = ± 12 (light-hole) band. The
positions of the electron and hole are designated by re and
rh , respectively, ρ, φ, and Z are the cylindrical coordinates.
The potential wells for the conduction electron, Vew (Z e )
FIGURE 14 Quantum-state energy levels in GaAs/Alx Ga1−x As and the holes Vhw (Z h ) are assumed to be square wells of
quantum well. CB and VB refer to conduction and valence bands,
respectively.
width L
0, |Z e | < L/2
Vew (Z e ) = (49a)
Ve , |Z e | > L/2
different for different masses. The layered structure has re- and
duced the cubic symmetry of bulk GaAs to uniaxial sym-
0, |Z h | < L/2
metry. The optical transitions thus become nondegenerate. Vhw (Z h ) = (49b)
Vh , |Z h | > L/2
This feature is shown in the valence band of Fig. 14.
The line shape of the absorption and photolumines- The values of Ve and Vh are determined from the Al con-
cence bands of GaAs (MQW) is excitonic. Verification centration in the Alx Ga1−x As barrier.
of these bands as being assigned to light and heavy- An exact solution of the Schrödinger equation corre-
hole free-exciton transitions has been made from polar- sponding to the exciton Hamiltonian Eq. (47) is not pos-
ization measurements. These measurements include opti- sible. A variational approach was used to calculate the
cal spin-orientation measurements and linear polarization ground-state energy E 1 of the Hamiltonian. The binding
measurements and linear polarization measurements of energy of the ground state of an exciton E 1s is then ob-
emission emanating from a cleaved edge of the MQW. tained by subtracting E 1 from the sum of the lowest elec-
The lowest level conduction and valence subband lev- tron and hole subband energies (E e + E h ). These subband
els are shown in Fig. 14. In absorption and photolumi- energies are determined by solving the transcendental
nescence, it is the excitons associated with these subband equations for finite square-wells.
levels that are observed. A summary of a calculation of The binding energy of the ground state of a heavy-hole
the energy levels of heavy- and light-hole excitons associ- exciton E 1s (h) (solid lines) and a light-hole exciton E 1s ()
ated with the lowest electron and hole subbands for finite (dashed lines) as a function of well-width L for different
values of the potential barrier heights is presented. values of the potential-barrier heights are shown in Fig. 15.
The Hamiltonian of an exciton associated with either the For a given value of X , the value of E 1s (h) increases as
heavy- or the light-hole band, in a quantum well structure L is reduced until it reaches a maximum and then drops
as shown in Fig. 14, within the framework of an effective quite rapidly. Similar behavior is exhibited by E 1s (). The
mass approximation is given by: explanation of this behavior is that as L is reduced the
exciton wave function is compressed in the quantum well,
−h 2 1 ∂ ∂ 1 ∂2 h2 ∂ 2 h2 ∂ 2
H = ρ + 2 2 − − leading to increased binding. However, beyond a certain
2µ± ρ ∂ρ ∂ρ ρ ∂φ 2m e ∂z e2 2m ± ∂z h2 value of L the spread of the exciton wave function into the
e2 barrier becomes important. This causes the binding energy
− + Vew (Z e ) + Vhw (Z h ) (47) to approach the value in bulk Alx Ga1−x As as L becomes
ε0 |re − rh |
diminishingly small.
where m e is the effective mass of the conduction electron, It is seen that for a given value of X , E 1s () is larger
ε0 is the static dielectric constant, m ± is the heavy (+) or than E 1s (h) for L greater than a certain critical value L c ,
light (−) hole mass along the Z direction, and µ± is the at which they become equal. For values below L c , E 1s ()
reduced mass corresponding to heavy (+) or light (−) hole is smaller than E 1s (h). The value of L c depends on X ;
bands in the plane perpendicular to the Z axis. Both µ± the larger the value of X the smaller the value of L c . For
and m ± can be expressed in terms of the Kohn-Luttinger X = 0.3, L c = 50 Å. This behavior can be understood as
band parameters γ1 and γ2 as follows: the value of E 1s () is greater than that of E 1s (h)
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
C. Hole-Electron Droplets
Nonequilibrium electrons and holes in semiconductors are
bound in excitons at low temperatures by Coulomb attrac-
tion. The exciton forms because it represents a state of
slightly lower energy than the unbound hole–electron. At
high exciting intensities the density of hole–electron pairs
is increased and excitons are formed at a higher rate. At
high concentrations the interaction among electrons be-
comes very important, and when a certain threshold den-
sity is reached liquid droplets are formed. These collective
droplets consist of nonequilibrium electrons and holes;
therefore, when electron–hole recombination occurs, spe-
FIGURE 15 Variation of the binding energy of the ground state, cific radiation is emitted.
E1s , of a heavy-hole exciton (solid lines) and a light-hole exciton This intense radiation was first observed in Si. The spec-
(dashed lines) as a function of the GaAs quantum-well size () for tra contained, in addition to the well-known peaks due to
A concentration X = 0.15 and 0.3, and for an infinite potential
the annihilation of free excitons with appropriate phonons,
well.
some broad bands of radiation shifted toward lower
energies.
for large L. Both increase as L is reduced, E 1s () less The formation of a condensed phase of nonequilibrium
rapidly than E 1s (h) as proportionately more of the light- carriers was considered, to account for the new radiation
hole exciton wave function penetrates into the barrier, thus peaks. If the collective interaction predominates in the
reducing the increase in E 1s (). At a certain value of L, condensed phase, survival of excitons as quasi-particles
which depends on X , the two values become equal, and is doubtful. It was shown that the condensed phase con-
then E 1s () becomes smaller as L is reduced further. This sists of a degenerate electron–hole plasma characterized
contrasts with the behavior of E 1s (h) and E 1s () for infi- by metallic properties.
nite potential barriers where E 1s () is always larger than The most convincing evidence that the new substance
E 1s (h). is in a separate phase was obtained from light-scattering
In the case of quantum dots, two general configurations experiments. The effect is analogous to the scattering of
are considered. The energy levels in the first case, type I light by drops dispersed in a fog. The range of angles
quantum dots, are similar to those in Fig. 14. Here the con- through which the radiation is scattered depends on the
duction band of the quantum dot is lower in energy than size of the particles, the larger scattering angles resulting
the bulk conduction band and the valence-band energy is from smaller particles.
higher that that of the bulk valence band. If the combi- The far-infrared absorption of the condensed phase in
nation dot size and potential barrier are large enough, an Ge was investigated. The results were explained by as-
exciton may be bound to the quantum dot. The overlap suming that the absorption is caused by the excitation of
of the electron and hole wave functions of the dot exci- plasmons in the drops of the condensed phase. From these
ton will be larger than that of the bulk exciton leading to experiments the drop radius was estimated to be R ≈ 10−3
very interesting studies of the exchange and correlation to 10−4 cm.
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
plane, another Cu-O plane, and Y3+ plane. Focusing on behind at the Cu2+ site and cannot dissipate before the
the Cu-O plane and chain, it is found that the plane valence second quasi-particle arrives to absorb the energy from
and conduction-like states are pushed to lower energy by the field. The two quasi-particles thus form a Cooper pair
the close proximity of the Y3+ compared to the chainlike mediated by the excitonic field of the Cu ion.
electronic states. One finds that some of the chain has hole- Starting with the energy band results and the correla-
like quasi-particle states at the Fermi surface. These states tion effects related to the Cu2+ (d9 ) one can add in the two
are located at the Brillouin zone boundary. These holelike quasi-particle pair states (Cooper pair states) and the po-
states are intinerant and are the current carrying states. larization field to derive an effective interaction between
There is another state found both in the chain and the the two quasi-particle states given below. These two quasi-
Cu-O plane which appears to be caused by approxima- particles are two hole states from O2− -like states that are
tions that have been made in the electronic band structure at the Fermi surface. This effective interaction is given by
calculations which do not account fully for correlation
effects. To understand this more fully, one has to resort
4π ε 2 ωexc
2
κ(q) (1 − 1/ε∞ )
to atomiclike calculations and focus on the Cu-like states Veff (q) = +1 (50)
embedded both in the plane and the chain including cor- |q|2 κ(q) q02 − ωexc
2 κ(q) + iδ
relation. One finds a ground state for the Cu to be Cu2+
with one of the d electronic states unoccupied and at a
much higher energy. Namely, one of the d (x 2 − y 2 ) states In the above expression ωexc is the Cu 3d − 4s and p exci-
is not converged and Cu2+ does not exist in the ground ton frequency and κ(q) is the screening function caused by
state. Returning now to the approximations that are made the O2− quasi-particles. q0 are the calculated band energies
in one electron band structure, one finds this artificial state of the hole states and ε∞ is the dielectric constant in infi-
arising very quickly from the center of the Brillouin zone nite frequency of the crystal. (4π e2 )/(|q|2 κ(q)) represents
to 2 to 3 eV above the Fermi surface. In this way the band the Coulomb interaction. One finds if the first term in the
calculation tries to account for these correlation effects bracket on the right-hand side is negative and has an ab-
and form Cu+2 (d9 ). There is no experimental verification solute value larger than one, superconductivity will exist.
for this state, as it is only an artifact of the approximation. It is interesting to note that the first term on the right-hand
Also, using the atomiclike calculations one finds the first side will have a different sign as long as ωexc
2
\κ(q) is larger
excited state (the primary exciton) in the 4–6 eV range than q 2 . As more and more hole states are added κ(q) be-
for the Cu electronic system. These states are seen in the comes larger; thus, the ratio of ωexc2
\κ(q) becomes smaller
optical spectra experimentally. and the superconductivity will disappear. This happens
The above discussion demonstrates the two electron- experimentally in YBa2 Cu3 O7−x as x increases, and su-
like systems with which to develop an excitonic theory for perconductivity is totally gone in these compounds by the
superconductivity. The holelike states are predominantly time x has reached 0.5. As we remove oxygen from this
related to the O2− electronic states of the chain. As this compound, the vacancy appears in the chain and has the
quasi-particle travels along the chain, it approaches the lo- effect of lowering the Fermi energy. This creates the num-
calized Cu-like states and polarizes the Cu valence states. ber of holes near the Fermi surface and, in turn, increases
The polarization states are represented by the mixing in κ and diminishes the ratio.
of the 4 s and p conduction bands of the Cu electronic There are also four other well-defined experiments that
system. This is represented in the model by the exciton to date are inexplicable without invoking a superconduct-
discussed above. ing state in which the phonon-mediated electron–electron
One would ordinarily expect that the holelike states of interaction is replaced by excitons.
O2− would be able to screen out this interaction much The first experiment is one in which an attempt was
more readily than the polarization of the d-like electrons. made to measure the effects of gravity on the position of an
However, as stated before, the planelike states of Cu-O are electron in a copper tube. A large temperature-dependent
suppressed to lower energy than those of the chain. Thus, transition in the magnitude of the ambient axial electric
when the holelike states attempt to readjust they tend to field inside the vertical copper tube was found. Above a
mix in the conductionlike states from the plane, causing temperature of 4.5 K the ambient field was 3 × 10−7 V/m
their polarization charge to be distributed into other layers or greater. Below 4.5 K the magnitude of the ambient field
away from the chain. This leaves the d-like states polarized drops very rapidly, to 5 × 10−11 V/m at 4.2 K.
without the total dampening out of the effect by the quasi- These measurements were made using time-of-flight
particles at the Fermi surface. spectra of an electron traveling in the center of the copper
Another important feature is that the polarization distur- tube, and the field effects on the tube were screened out.
bance cannot keep up with the quasi-particle, but remains The equation for the time of flight spectra is
P1: FYD Final Pages
Encyclopedia of Physical Science and Technology E005-237 June 15, 2001 20:43
LO phonon. A medium-gain process is due to an exciton– tors have high absorption coefficients, particularly at res-
exciton interaction, and a high-gain process involves an onant excitonic transitions. These materials will produce
exciton–electron interaction. absorptive and dispersive bistable devices. Useful absorp-
The interpretation of the excitation dependence of the tion in these materials is achieved in very short transvers-
spontaneous emission and of the gain for the free exciton- ing paths, making very fast switching devices achievable.
related processes are as follows: Both the free- and bound-exciton transitions in semicon-
ductors have shown promise for high-speed, low-power
1. For the excitation intensity J < 1 A/cm2 , only the switching devices. These characteristics make possible
E x –LO process yielded some gain. fast, all-optical, signal-processing devices.
2. For 1 A/cm2 < J < 3 A/cm2 , the low-energy tail
resulting from electron–exciton interaction line
dominates. SEE ALSO THE FOLLOWING ARTICLES
3. For J > 3 A/cm2 , the low-energy tail resulting from
electron–exciton interaction is the dominant gain BONDING AND STRUCTURE IN SOLIDS • CRYSTALLOGRA-
process. CdS, CdS: Se, and CdSe lasers provide a PHY • GROUP THEORY, APPLIED • LASERS, SEMICONDUC-
tunability from 0.5 to 0.7 m. In CdS, mode-locked TOR • METALORGANIC CHEMICAL VAPOR DEPOSITION
pulses shorter than 4 psec have been obtained. (MOCVD) • MOLECULAR BEAM EPITAXY, SEMICONDUC-
TORS • QUANTUM MECHANICS • SUPERCONDUCTIVITY
F. Optical Bistability
Optical bistability can be defined as any optical system BIBLIOGRAPHY
possessing two different steady-state transmissions for the
same input intensity. To achieve optical bistability, the op- Baldereschi, A., and Lipari, N. O., (1971). Phys. Rev. B3, 439.
Cardona, M. (1969). “Modulation Spectroscopy,” by F. Seitz and D.
tical device must have feedback. This implies that the Turnbull and E. Ehrenreich. Solid State Physics, Suppl. 11. Academic
transmission intensity must have some dependence on the Press, New York.
output intensity. Many of the bistable devices have been Craig, D. P., and Walmsley, (1968). “Excitons in Molecular Crystals,”
Fabry–Perot etalons containing materials having nonlin- Benjamin, Elmsford, New York.
ear indices of refraction at high input light intensities. In Davydov, A. S. (1962). “The Theory of Molecular Excitons,” McGraw-
Hill, New York.
this type of device, the cavity is tuned so that a trans- Dexter, D. L., and Knox, R. S. (1965). “Excitons,” Interscience Publish-
mission maximum lies close to the laser frequency, but ers, New York.
still having low transmission at low input intensities. As Dimmock, J. O., (1967). “Theory of Excton States,” Semicond. and
the input intensity is increased, the light penetrating the Semimetals 3, Chap. 7, Academic Press, New York.
cavity will be sufficient to cause the nonlinear index ma- Green, R. L., Bajaj, K. K., and Phelps, D. E., (1984). Phys. Rev. B29,
1807.
terial to tune the cavity toward the laser frequency. This Knox, R. S., (1963). “Theory of Excitons,” Solid State Phys. Suppl. 5,
has been termed intrinsic dispersive optical bistability— Academic Press, New York.
intrinsic because the nonlinear index material provides the Rashba, E. I., and Sturge, M. D., ed. (1982). “Excitons,” North Holland
feedback, and dispersive because the reflective or real part Publishing Co., Amsterdam.
of the nonlinear susceptibility is more important than the Reynolds, D. C., and Collins, T. C., (1981). “Excitons, Their Properties
and Uses,” Academic Press, New York.
imaginary or absorption part. Thomas, D. G., and Hopfield, J. J., (1959). Phys. Rev. 116, 573.
For practical optical bistable devices, attention has Thomas, D. G., and Hopfield, J. J., (1962). Phys. Rev. 128, 2135.
been focused on semiconductor materials. Semiconduc- Wheeler, R. G., and Dimmock, J. O., (1962). Phys. Rev. 125, 1805.
P1: FYK/FQW P2: FYK Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN005A-240 June 26, 2001 19:22
Ferromagnetism
H. R. Khan
Forschungsinstitut für Edelmetalle und Metallchemie
and University of Tennessee at Knoxville
759
P1: FYK/FQW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN005A-240 June 15, 2001 20:47
760 Ferromagnetism
induction or the magnetic flux density B (Wb/m2 ) riodic system is due to the incompletely filled M shells
through the relation of their atoms. Due to these incompletely filled shells,
the atoms behave as magnets ordered in parallel arrange-
B = µ0 (H + M) ment in ferromagnetic materials. In antiferromagnetic ma-
where µ0 is the permeability of the free space and has terials, the atomic magnets are ordered in antiparallel ar-
the value 12.57 × 10−7 Wb/Am. rangement. Ferrimagnetic materials are a special case of
Magnetization (M) Magnetic moment per unit volume. ferromagnetic materials. The neighboring atoms interact
Coercive force (H c ) Negative value of the magnetizing with each other in a material, and this interaction force
field H that makes the magnetic induction B of a fer- is dependent on the distance between neighboring atoms
romagnetic material zero. and the diameter of the atomic shell responsible for the
Remanence (Mr ) If the magnetic flux density does not atomic magnetic moment. The sign and magnitude of this
drop to zero upon reducing the magnetic field on the interaction force cause a material to show different mag-
specimen to zero, then this remained magnetic flux den- netic behavior. The usefulness of a ferromagnetic material
sity in the specimen is called the remanence. is shown by its magnetization curve and hysteresis loop.
Permeability (µ) The ratio B/H is the permeability of a The hysteresis loop provides information about the “per-
material. meability” and “coercivity” of a ferromagnetic material.
Magnetic susceptibility (χ) The ratio M/H is called the For example, a soft magnetic material to be used as a
magnetic susceptibility of a material and may be ex- transformer core should have a high value of permeabil-
pressed in mass, volume, or molar units. ity, whereas a hard or permanent magnetic material should
Curie temperature (T c ) Temperature above which the have a high value of coercivity. The physical condition,
spontaneous magnetization of a ferromagnetic material purity, and composition of a material control the useful
vanishes. magnetic properties like permeability and coercivity, and
Neel temperature (T N ) Temperature below which the they can be modified by controlling different parameters
interaction between the atomic moments affecting an- of a material. By rapid solidification of materials from the
tiparallel orientation surmounts the thermal agitation. melt, very soft magnetic and hard magnetic materials can
At Neel temperature TN , the susceptibility of a material be produced. Both the hard magnetic and soft magnetic
has its maximum. materials find applications in the electrical and electronic
industry. Some examples of their uses are transformers,
motors, generators, relays, telephone cables, audio and
A SPECIAL ARRANGEMENT of electrons in the video recording and replaying, and data memory systems
atoms causes a material to become a ferromagnetic mate- and computers.
rial. For example, the incompletely filled M shells of iron, A magnetic field gradient is generated by a magnet con-
cobalt, and nickel atoms are responsible for the ferromag- sisting of a flat north pole and a pointed south pole, as
netism in these metals. Atoms behave as small magnets or- shown in Fig. 1. The magnetic field is stronger near the
dered in parallel arrangement in ferromagnetic materials. pointed pole. lf a piece of material in the cylindrical form is
The magnetization curve and the hysteresis loop determine suspended with a string between the poles, then a magnetic
whether it is a hard or soft ferromagnetic. The parameters force is generated on this material. The kind of magnetism
determined from the hysteresis loop are the permeability, on the material is determined from the direction of the
coercivity, remanence, and the area of the loop itself. The
area of the loop gives the energy loss per unit volume of the
specimen per cycle and is dissipated as heat energy called
as hysteresis loss. The hysteresis loop and the parameter
derived from it determine the suitability of a material in a
particular application.
Ferromagnetism 761
762 Ferromagnetism
Ferromagnetism 763
The initial magnetization as shown in Fig. 7 is not re- FIGURE 8 Plot of the magnetic induction B as a function of mag-
versible. When H slowly increases, the value of B also netizing field H , hysteresis loop.
P1: FYK/FQW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN005A-240 June 15, 2001 20:47
764 Ferromagnetism
tal along which they are easily magnetized are called di-
rections of easy magnetization.
V. ANISOTROPIC MAGNETIZATION
Ferromagnetism 765
IX. MAGNONS
766 Ferromagnetism
Ferromagnetism 767
few nanometers. In these systems the magnetic layers are The iron-nickel alloys possess high values of initial and
magnetized antiparallel via exchange interaction across maximum permeability and very low hysteresis loss com-
the nonmagnetic layers, for example, copper or chromium. pared to the iron-silicon alloys. An alloy of composition
When a sufficiently large magnetic field is applied, it can with 78.5% nickel and 21.5% iron is called permalloy and
align the magnetization of the magnetic layers in a paral- has an initial relative permeability of ∼10,000 compared
lel direction. In a magnetic metal, the free electrons which to 250 for the pure iron. These alloys are in general used
carry the electric curent have their spins aligned either for the magnetic screening of the electronic equipment.
parallel or antiparallel to the magnetization of the metal. Small additions of the metals chromium or molybdenum
Electrons experience different resistance to their motion further modify the magnetic properties of these materials
depending on the direction of their spin relative to the to be used as cores in transformers or inductors work-
magnetization. When current flows in the plane of the ing at the audio or higher frequencies. For example, the
multilayers with antiparallel magnetization, the electrons magnetic cores of inductors and transformers working at
experience a high resistance; but when a magnetic field radiofrequencies (∼100 Mc/sec) show high eddy current
is applied and the magnetization of the magnetic layers losses. Used here are the ferrites, which have high resistiv-
changes to a parallel state, the electrons experience lower ity (∼106 times that of metals) and high permeability. In
resistance. In the case of conventional magnetoresistance, other applications, the ferromagnetic materials with a high
the resistance increases with the application of magnetic value of “coercivity” and large area of hysteresis loop are
field; whereas in the case of Giant Magnetoresistance, the required. These materials possess hard magnetism com-
resistance decreases with the application of magnetic field pared to the already discussed soft magnetism. The ad-
because of the change from antiparallel to parellel mag- dition of carbon to iron increases the hysteresis loss. The
netization of the magnetic layers. carbon steel was used as a material for permanent magnets
in earlier days. However, aging degenerates the magnetic
properties of carbon-steel magnets. Addition of metals
XII. FERROMAGNETIC MATERIALS such as cobalt, chromium, or tungsten improves the mag-
AND THEIR APPLICATIONS netic properties, and these materials are less susceptible to
aging. A large number of alloys composed of iron, nickel,
The hysteresis loop of a ferromagnetic material provides cobalt, aluminum, copper, platinum, manganese, and ox-
information about its usefulness in technical applications, ides of iron and rare-earth metals have been developed
as discussed in Section IV. The hysteresis loop depends on that show high values of coercivity and are suitable for
the physical condition, composition, and purity of a spec- permanent magnets.
imen. Depending on the application of a ferromagnetic Some oxides like γ -Fe2 O3 and CrO2 possess high co-
material, the important properties are the “permeability” ercivity and are used as recording tapes in the form of this
and “coercivity.” layers of fine powders. Permalloys are used to construct
When a strain-free material is cold-worked, the perme- the inductive magnetic heads to write signals as residual
ability of the material is reduced and the hysteresis loss is magnetization on the tapes or to reproduce the electri-
increased. The strain-relieving heat treatment of the cold- cal signals from the magnetized tapes. Magnetoresistive
worked specimen again brings the original magnetic prop- read heads based on permalloy are also used. Significant
erties back, for example, the permeability is increased and progress has been made in the development of very sensi-
the hysteresis loss reduced. In general, the strain-free crys- tive and small read heads based on magnetic-nonmagnetic
tals show the minimum hysteresis loss. multilayer systems. Magnetic discs or drums are made for
The presence of the impurities carbon, oxygen, nitro- the memory systems in computers. Development of mod-
gen, sulfur, etc., affects the permeability and hysteresis ifying the magnetic properties by rapid solidification of
loss of a ferromagnetic material. In general, the materials alloys from the melt has created a new field. By rapid so-
with high permeability and low hysteresis loss are pure lidification, the microstructure can be affected—in some
materials. cases the phases may be finely dispersed, and in other
The composition of a ferromagnetic material also influ- the alloys may become noncrystalline or amorphous. The
ences its magnetic properties. The addition of silicon to amorphous alloys have no crystal lattice and no magnetic
iron increases the permeability and reduces the hystere- anisotropy.
sis loss. However, high concentrations of silicon decrease There are no extended defects that would otherwise in-
the saturation magnetization. Therefore, the iron-silicon teract strongly with the domain walls in these noncrys-
alloys with low concentrations of silicon are desirable in talline materials. In certain cobalt-based noncrystalline
applications like the cores of transformers and in electric alloys, the magnetostriction can be adjusted to zero such
motors and generators. that the internal and applied stresses have minimal effect
P1: FYK/FQW P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN005A-240 June 15, 2001 20:47
768 Ferromagnetism
on the magnetic properties. Amorphous magnetic alloys SEE ALSO THE FOLLOWING ARTICLES
have high hardness and yield strength and are magnet-
ically soft. In particular, cobalt-containing alloys have CRYSTALLOGRAPHY • ELECTROMAGNETISM • GEOMAG-
vanishingly small magnetostriction. Combining the good NETISM • MAGNETIC MATERIALS • SUPERCONDUCTIVITY
mechanical and magnetic properties, they are very useful • TRANSFORMERS, ELECTRICAL
materials. They can be strained elastically over wide lim-
its, are insensitive to irreversible magnetic damage, and
are very suitable materials where the elastic deformabil- BIBLIOGRAPHY
ity is desired. They are also useful materials for making
recording heads in audio, video, and data recording sys- Baibich, M. N., et al. (1988). Phys. Rev. Lett. 61, 2472.
tems, due to their good high-frequency response and wear Binasch, G., Grünberg, P., Saurenbach, F., and Zinn, W. (1989). Phys.
Rev. B39, 4282.
resistance. Brailsford, F. (1968). “An Introduction to the Magnetic Properties of
Soft magnetic and highly elastic mechanical behavior Materials,” Longmans Green and Co., London.
has led to the development of flexible magnetic shielding. Chikazumi, S. (1964). “Physics of Magnetism,” John Wiley and Sons,
The amorphous magnetic materials consist of two main New York.
groups. In one group, the materials are composed of tran- Craik, D. J., and Tebble, R. S. (1966). “Ferromagnetism and Ferromag-
netic Domains,” North-Holland, Amsterdam.
sition metals and metalloids; in the second group, they are Della Torre, E., and Bobeck, A. H. (1974). “Magnetic Bubbles,” North-
composed of only different metals. Some amorphous ma- Holland, Amsterdam.
terials used as soft magnetic materials are Fe81 (Si, B, C)19 ; Kittel, C. (1979). “Introduction to Solid State Physics,” John Wiley and
(FeNi)78 (Mo, Si, B)22 ; (Co, Fe)70 . . . 76 (Mo, Si, B)30 . . . 24 ; Sons, New York.
and (Co, Mn)70 . . . 76 (Mo, Si, B)30 . . . 24. Morrish, A. H. (1965). “Physical Principles of Magnetism,” John Wiley
and Sons, New York.
Hard magnetic materials can also be produced by rapid Standley, K. J. (1972). “Oxide Magnetic Materials,” 2nd ed., Oxford
solidification techniques. For example, a magnetic mate- University Press, London.
rial of composition Fe14 Nd2 B produced by rapid solid- Steeb, S., and Warlimont, H., eds. (1985). “Rapidly Quenched Metals,”
ification is superior to the Co-Sm material. Other hard Vol. 11, North-Holland, Amsterdam.
magnetic materials (some transition and rare-earth metals, Vonsovski, S. V. (1975). “Magnetism,” Halsted Press, New York.
Wohlfarth, E. P. (1980–1982). “Ferromagnetic Materials,” Vol. I (1980),
and boron), have also been produced by rapid cooling. The Vol. II (1980), Vol. III (1982), North-Holland, Amsterdam.
rapid cooling technique is also less expensive compared Zeiger, H. J. (1973). “Magnetic Interaction in Solids,” Oxford University
to the conventional methods. Press, London.
P1: GPJ Final Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
Gamma-Ray Spectroscopy
R. F. Casten
C. W. Beausang
WNSL, Yale University
I. Introduction
II. Gamma-Ray Detection
III. Gamma-Ray Spectroscopy and Nuclear
Structure
IV. Conclusions
433
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
A. Interaction Mechanisms
For energies ranging from a few kilo-electron volts to a few
mega-electron volts, gamma-rays interact with matter via
one of three principal mechanisms: the photoelectric ef-
fect, Compton scattering, or for energies above ∼1 MeV,
the electron-positron pair production. Most gamma-ray
detectors exploit one or more of these effects both to de-
tect the gamma-ray and to measure its energy. Of course,
gamma-rays are electromagnetic waves and sometimes FIGURE 2 Schematic diagram showing the relative probabilities
their wave properties are also used in their measurement, for photoelectric, Compton scattering, and pair production as a
for example, with diffraction techniques. Before we dis- function of gamma-ray energy.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
gamma-ray. Compton scattering is the dominant inter- gamma-ray spectroscopy is a constant trade-off between
action mechanism for gamma-ray energies in the range these two properties. For example, scintillation detectors,
from a few hundred kilo-electron volts up to a few mega- such as NaI(Tl) detectors, typically have high efficien-
electron volts. It is important to realize that this is the cies but poor energy resolution compared to Ge detectors.
energy range of most gamma-rays produced in typical nu- Detectors based on the technique of crystal diffraction
clear structure experiments. (see below) have superb energy resolution but very small
For gamma-ray energies greater than 2m e c2 (m e c2 be- efficiency.
ing the rest mass of the electron ∼511 keV), electron-
positron pair production is possible, and, for energies
1. Scintillation Detectors
significantly higher than this threshold, pair production
begins to dominate the interaction cross section. In this The detection of gamma-rays (or other types of ionizing
case, some of the incident gamma-ray energy is used to radiation) by the scintillation light produced in certain
create the electron-positron pair while the remainder (in materials is one of the oldest techniques on record, and
excess of 2m e c2 ) is shared as kinetic energy between the it is still one of the most useful and common techniques
electron and positron. Eventually, the positron annihilates today.
with another electron in the detector medium, producing A scintillator, either a solid or a liquid, is a material
two photons each of energy 511 keV emitted back to back. which converts the energy lost by the gamma-ray into
As one can see, all three of these interaction mecha- pulses of light. The scintillation light is detected in turn by
nisms result in the production of a single energetic elec- a light-sensitive material which usually forms the cathode
tron (or, in the case of pair production, of an electron- of a photomultiplier tube. The light pulses are converted
positron pair) with a kinetic energy less than or equal to into electrons in the photocathode. These electrons are
that of the incident gamma-ray energy. These energetic then accelerated and their number vastly (and linearly) am-
electrons recoil through the bulk material of the detector, plified in the photomultiplier. The resulting current pulse
their range being typically less than a millimeter or two. is proportional to the energy of the absorbed gamma-ray.
They rapidly slow down, losing energy through many col- The energy required to produce a light pulse is fairly
lisions with other atomic electrons. In an ideal detector all large, on the order of 30–50 eV. Thus, the average number
of the incident gamma-ray energy is eventually absorbed of light pulses produced when, say, a 500-keV gamma-
in the detector material by a combination of photoelectric, ray is absorbed is on the order of 10,000. Fluctuations in
Compton scattering, and (for high enough gamma-ray en- this number and in the light collection process limit the
ergies) pair-production processes. resolution obtainable with scintillation detectors.
It is intuitively obvious that the number of collisions,
and hence the number of secondary electronic excitations
2. Semiconductor Detectors
produced, is proportional to the primary electron energy
and, hence, is directly related to the incident gamma-ray A semiconductor detector is essentially a large diode con-
energy. For example, in semiconductor detectors, which structed out of either Si or Ge. For gamma-ray spec-
are essentially diodes made out of germanium (Ge) or troscopy, Ge detectors are preferred, as Ge has a larger
silicon (Si), the electron-hole pairs produced following stopping power. The diode is operated under a reverse bias
electron–electron collisions are extracted by a high volt- and is normally fully depleted (i.e., with no free charge car-
age placed across the detector and produce a current pulse riers). The gamma-ray interaction produces electron-hole
which is proportional to the deposited gamma-ray energy. pairs in the depletion region, which are collected because
For scintillation detectors, such as sodium iodide (NaI(Tl)) of the detector bias voltage and which produce a current
detectors, the collisions of the primary electron produce pulse proportional to the absorbed gamma-ray energy.
excited atomic or molecular states. The subsequent decay In contrast to scintillator detectors, the average energy
of these states produces scintillation photons (typically in required to produce a single electron-hole pair is only
the UV range). These photons are converted into a cur- about 2–3 eV. Therefore, a 500-keV gamma-ray can pro-
rent pulse using a photocathode and photomultiplier tube. duce around 250,000 primary charge carriers, much larger
The size of the current pulse is again proportional to the than the corresponding number for scintillation detectors
deposited gamma-ray energy. with a corresponding decrease in the statistical fluctua-
tions and improvement in detector resolution. Figure 3a
shows a typical spectrum of a 60 Co source obtained using
B. Gamma-Ray Detectors
a modern Ge detector. The energy resolution obtained is
Different types of detectors have quite different efficien- about 2 keV for gamma-ray energies of about 1000 keV.
cies and energy resolutions. Indeed, generally speaking, Using a NaI(Tl) scintillation detector, the two peaks in
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
Fig. 3a (having energies of 1174 and 1332 keV) would n is the order of diffraction. Clearly, the resolution scales
be barely resolved. Following a brief discussion of crystal with n. Higher order diffraction gives greater dispersion
diffraction detectors, we will focus most of the remain- and, hence, energy precision, although the efficiency gen-
der of this article on gamma-ray spectroscopy using Ge erally falls off with n. The accuracy depends on the pre-
detectors. cision of the angle measurement. In the realization of this
technique at the Institut Laue Langevin in Grenoble, in
the GAMS (GAMma-ray Spectrometer) family of instru-
3. Ultra-High Energy Resolution Spectroscopy: ments, accuracies of the latter are typically in the milli-arc
Crystal Diffraction seconds range (Koch et al., 1980). Given the nature of the
Gamma-rays are, after all, electromagnetic photons, and technique, the gamma-ray energy spectrum is stored en-
under certain circumstances their wave properties may be ergy interval by interval rather than sampled fully at each
used in their detection and measurement. Indeed, the ul- point in time. An example of a crystal diffraction spectrum
timate in current gamma-ray energy resolution and mea- compared to the corresponding Ge detector spectrum is
surement precision is obtained by the use of crystal diffrac- shown in Fig. 3b. Generally speaking, crystal spectrom-
tion. With such techniques it is routine to measure a 1-MeV eters offer the greatest advantages over Ge detectors for
gamma-ray with an energy resolution of ∼3 eV and an en- gamma-ray energies below ∼1 MeV. At higher energies
ergy precision of better than 1 eV. The cost, however, is their efficiency drops quickly, and hence, lower orders of
low efficiency and the need to scan the energy spectrum diffraction are used with poorer energy resolution.
one small energy bite at a time.
The technique uses Bragg diffraction from a nearly per-
C. Level Scheme Construction
fect crystal, usually of Si. As for optical and X-ray transi-
tions, the gamma-ray wavelength λ and diffraction angle θ One might ask why the extraordinary energy precision of
are related by the Bragg law: nλ = 2d sin θ , where the lat- crystal diffraction techniques can be useful since nuclear
tice spacing d is known to an accuracy of 1 part in 1010 and models seldom predict nuclear states to accuracies better
FIGURE 3 (a) Typical spectrum of a 60 Co source obtained using a modern Ge detector with and without escape
suppression. The vertical scale has been greatly expanded in order to show the Compton background. The dramatic
reduction in the height of the background when using a Compton suppression shield is obvious. The insert shows the
same spectra but now with the full vertical scale to illustrate the height of the photopeaks compared to the background.
(b) An example of the very high energy resolution obtainable using a crystal diffraction system. The top spectrum,
obtained using a Ge detector, shows seven peaks, some of which are not resolved. The lower spectrum, obtained
using the GAMS crystal diffraction system, shows the same portion of the spectrum, but with a dramatic improvement
in resolution. In both spectra, the gamma-rays are labeled with their energies in kilo-electron volts. Continued.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
FIGURE 3 Continued.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
than many kilo-electron volts? The principal reason relates While coincidence spectroscopy is a powerful tech-
to the construction of reliable level schemes. One method nique, it also has limitations. For example, it does not help
of constructing level schemes utilizes the Ritz Combina- place ground state transitions or very weak transitions,
tion Principle. Here, nuclear level energies are determined and, in some cases, experimental constraints preclude its
by demanding that their energy differences be equal to use. Nevertheless, it is, by far, the most common approach
the energy of the gamma-ray transition connecting them to sorting out the plethora of gamma-rays observed in nu-
(see Fig. 1). A brief discussion of this technique enables clear de-excitation. We will further discuss coincidence
us to see both the role of the ultra-high energy resolu- spectroscopy below when we introduce advanced multi-
tion in level scheme construction and the complementary detector arrays of Ge detectors.
and much more commonly used technique of coincidence We briefly mention that another application of ultra-
spectroscopy with Ge detectors. high resolution crystal spectroscopy is in the determi-
To see the point, imagine constructing a level scheme nation of fundamental constants such as the accepted
which consists of 50 levels spanning the excitation en- standard for length measurements (the definition of the
ergy range from zero up to 2.0 MeV simply by using the meter) through the precise measurement of gamma-ray
Ritz Combination Principle in a case where one has de- wavelengths. These applications fall outside the scope of
tected, say, 500 γ -rays with energies below 1.2 MeV, with this article.
Ge detector energy accuracy of ±0.1 keV. This is a typi-
cal situation encountered in the spectroscopy of low-spin
D. The Evolution of Detector Arrays
states of heavy nuclei. In such a case the probability of
an accidental Ritz Combination, that is, a level energy To illustrate the increasing power and sophistication of
difference that inadvertently coincides within uncertain- gamma-ray detectors, particularly of Ge detector arrays,
ties with a gamma-ray energy, is about 10%. Even with it is useful to consider the nuclear reactions by which the
other experimental input, such as information on the an- nuclei to be studied are formed. One of the most common
gular momenta of the levels to rule out certain transition reaction mechanisms used to populate high-spin states in
placements from conservation of angular momentum, it atomic nuclei is the heavy-ion fusion-evaporation reac-
is clear that a large number of incorrect placements and, tion. This type of reaction has the advantage of bringing
hence, incorrect physics will result. There are two ways large quantities of angular momentum into the product
of resolving this situation: either using time coincidence nucleus (often up to the limit allowed by fission), while at
relations between successive gamma-rays to place them the same time populating only a few product nuclei with
correctly in the level scheme of a nucleus or improving significant probability.
the energy resolution significantly. For the latter, the crys- In such reactions, a heavy-ion beam is incident on a
tal diffraction approach is ideal. With an energy precision target at an energy just above the Coulomb barrier. A typ-
of, say, ±5eV, the probability of an accidental sum drops ical reaction might involve a 48 Ca (Z = 20, N = 28) beam
to negligible levels. Indeed, data from the GAMS spec- incident on a 108 Pd (Z = 46, N = 62) target at a beam en-
trometers at the ILL have often shown that existing Ge ergy of 200 MeV. Following the collision, the beam and
detector results (usually data taken without coincidences) target nuclei fuse to form a compound nucleus, in this case
are in error. 156
Dy (Z = 66 = 20 + 46, N = 90 = 28 + 62). The com-
However, the usual solution to this problem is the use pound nuclear system will be produced in a highly ex-
of coincidence spectroscopy. This technique exploits the cited and rapidly rotating state, with typically 60 MeV of
✏
fact that nuclear levels are generally short lived, with typ- excitation energy and about 70h of angular momentum.
ical half-lives in the pico- to nanosecond range, so that The initial decay of the compound system is via the
successive gamma-ray de-excitations effectively occur si- emission of a few (3–5) particles, usually neutrons and
multaneously, on the time scale of standard pulse analysis less frequently protons or alpha particles. This first stage
electronics. Therefore, if two or more gamma-rays are ob- of the decay process typically removes about 40 MeV of
✏
served in separate detectors, in time coincidence (within excitation energy and about 10h of angular momentum.
say a few nenoseconds) then they must occur in a cas- The remainder of the excitation energy and most of the an-
cade in the nuclear level scheme. For example, in Fig. 1 gular momentum is subsequently removed by gamma-ray
the 6+ + + +
1 → 41 and 41 → 21 transition would be in coinci- emission. Each gamma-ray photon removes either one or
dence, as would the 31 → 2+
+ + + +
2 and 22 → 41 or 22 → 21
+
two units of angular momentum. Thus, we can expect the
+ +
transitions. However, the 31 → 22 transition is not in co- emission of cascades of up to 30 gamma-rays following
incidence with the 6+ +
1 → 41 transition. Such coincidence each reaction. Because of this multiplicity of gamma-rays,
relations are of inestimable help in constructing complex the study of transitions following production of the com-
nuclear level schemes. pound nucleus imposes stringent requirements on detector
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
plots the population intensity of various nuclear states ered by Johnson, Ryde, and Sztarkier (1971) using just
as a function of angular momentum or spin of the state. two Ge(Li) detectors, while the structure of 160,161 Yb was
Pioneering experiments in the early 1960s were carried out investigated by Riedinger et al. (1980). up to spin ∼30 h ✏
with one or a few NaI(Tl) scintillation detectors (Morinaga using only four Ge detectors. In the last three decades,
and Gugelot, 1963). The sensitivity of these experiments the study of the properties of the atomic nucleus through
was limited, both by the poor energy resolution of NaI(Tl) gamma-ray spectroscopy has evolved through the devel-
detectors (about 80 keV at 1000 keV) and by the small opment of larger and more efficient Ge detector arrays
number and size of the detectors, to spins up to about and, indeed, has driven the development of these arrays.
Starting in the 1980s and continuing to today,
large arrays of Ge detectors such as TESSA, GASP,
Eurogam, Gammasphere (Lee, 1990) and Euroball (Gerl
and Lieder, 1992) further revolutionized gamma-ray spec-
troscopy. Future arrays such as the proposed GRETA
(Gamma-Ray Energy Tracing Array) spectrometer
(Deleplanque et al., 1999), which promise very large in-
creases in sensitivity resulting from modern manufactur-
ing techniques, electronics, and digital data processing,
are in the planning stages.
The preferred solution is to detect these scattered pho- of the detectors in these arrays are composites formed by
tons in a second, surrounding detector, termed an escape closely packing several Ge detectors together as a unit.
suppression or an anti-Compton shield, and to reject, us- Two varieties of such units, called clover (Duchene et
ing fast electronics, coincidence events between the Ge al., 1999) or cluster (Eberth et al., 1996) detectors, are
detector and the shield detector. The combination of Ge nowadays the backbone of advanced arrays such as the
detector and suppression shield is termed an escape sup- YRAST Ball array at Yale University (Beausang et al.,
pressed spectrometer (ESS). The material commonly used 2000) or the planned Exogam and Miniball arrays in
in the anti-Compton shield is bismuth germanate (BGO), Europe (Simpson et al., 2000). Even more powerful detec-
a dense, high-efficiency scintillator material. tors, termed tracking detectors, are under development.
After suppression, typically about 65% of the remaining These will be discussed below.
events are in the photopeaks (PT = 0.65). A typical ESS
configuration showing a Ge detector and shield is shown
in Fig. 5, while the improvement in the background, and 2. Counting Rates
hence spectrum quality, is illustrated in Fig. 3a.
It is informative to look at some of the numbers involved
The PT ratio is of prime importance for coincidence
in a typical nuclear physics reaction carried out in the
spectroscopy. For example, when requiring a coincidence
laboratory. Once again, we consider the example of the
between two Ge detectors, a PT ratio of 0.25 implies that 48
Ca + 108 Pd reaction, which was used in the experiment
only (0.25)2 or ∼6% of the events will be photopeak-
in which the first superdeformed band in 152 Dy was dis-
photopeak coincidences. The remaining 94% will be
covered (Twin et al., 1986).
background events. Using an ESS, however, the photo-
Typically, the beam intensity from an accelerator is
peak-photopeak coincidence fraction increases to (0.65)2
about 1010 –1011 particles per second incident on a tar-
or 42%, an improvement of a factor of 7. Even larger im-
get. This corresponds to an electric current on the order
provements are obtained when three- or higher fold coinci-
of a few nano-Amperes (nA). About one beam particle in
dence events are recorded. For example, the improvement
a million will actually strike a target nucleus and induce
is a factor of 17 for triples coincidences, 45 for quadru-
a nuclear reaction. Therefore, we expect about 100,000
ples, and 120 for quintuples. Today’s largest gamma-
reactions per second. About 20% of the reactions produce
ray spectrometers, the Gammasphere array in the United 152
Dy and about 1% of these will populate the nucleus in
States and the Euroball array in Europe, regularly record
the superdeformed state, corresponding to about 200 such
even higher fold coincidence events (Lee, 1990; Gerl and
events per second.
Lieder, 1992).
The array used in the original discovery of the super-
The efficiency and sensitivity of ESS arrays improved
deformed band in 152 Dy, the TESSA3 array (Nolan,
rapidly, so that by the mid 1980s arrays with more than
Gifford, and Twin, 1985), had a total photopeak efficiency
20 ESS having total absolute peak efficiencies of up to
of about 0.5%. Assuming that each superdeformed nu-
1% were constructed. By convention, the total photopeak
cleus decays by emitting a cascade of ∼25 gamma-rays,
efficiency is defined as the probability of measuring the
and that we require a coincidence between two detectors
full energy of the 1332-keV 60 Co gamma-ray when the
(γ 2 ) before accepting an event, we might expect to detect
source is placed at the center of the array. These ESS ar-
about 1 gamma-ray coincidence event per second originat-
rays enabled nuclear phenomena that occur at an intensity
ing from a superdeformed cascade. Since each cascade is
of about 1% of the total intensity of the nucleus to be
∼25 transitions long, we expect about 1 count per gamma-
studied. Worldwide there were about a dozen arrays with
ray transition every 20 sec or so. The background rate from
this level of sensitivity. One of the earliest of these, the
other processes is many hundreds of times greater.
TESSA3 array (Nolan, Gifford, and Twin, 1985), located
at Daresbury Laboratory in the United Kingdom, was used
in the discovery of the classic discrete line superdeformed
3. Doppler Effects and Segmentation
band in 152 Dy. Superdeformation will be discussed further
below. The lifetimes of the highest spin states populated via
In the mid 1990s the latest generation of gamma-ray heavy-ion fusion-evaporation reactions are often compa-
spectrometers with total photopeak efficiencies of up to rable to, or shorter than, the stopping time of the recoiling
∼10% came on line. These spectrometers, namely the nucleus (recoiling due to the momentum imparted by the
Gammasphere (Lee, 1990) and Eurogam/Euroball (Gerl incident beam nucleus that initiates the reaction) in the
and Lieder, 1992; Beausang et al., 1992) arrays, contain target material. Typical recoil velocities are on the order
up to 240 individual Ge elements and have sensitivities of of a few percent the speed of light. Therefore, Doppler
better than 0.001% of the production cross section. Some effects play a major role.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
FIGURE 5 Escape suppression spectrometer showing Ge detector and shield. This is the type of ESS used for clover
Ge detectors in the Eurogam/Euroball array. The clover Ge detector position is indicated inside the suppression shield.
The liquid nitrogen storage dewar is also shown.
The Doppler shifted energy of a gamma-ray emitted beam direction, the resulting energy resolution can be very
from a nucleus in flight is given by poor.
One solution is to use very thin targets in order to mini-
v
E γ = E 0 1 + cos θ , mize slowing down effects and detectors that subtend only
c a small range of angles. Knowing the detector angles, one
where E 0 is the unshifted energy and θ is the detector can correct for the Doppler shift and recover most of the
angle. If a detector records the same gamma-ray emit- resolution. The limit on detector resolution now becomes
ted from different nuclei having a wide range of velo- the finite opening angle of the Ge detector itself, in other
cities or traveling at different angles with respect to the words the uncertainty in knowing in which part of the Ge
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
detector the gamma-ray actually interacted. For a constant center of this element and dθ is the opening angle of this
recoil velocity, the Doppler broadening is given by segment. A gamma-ray may also scatter between two el-
v ements of the clover detector. In this case simulations and
dE = E 0 sin θ dθ, measurements have shown that the gamma-ray interaction
c
usually takes place close to the boundary between the two
where dθ is the opening angle of the Ge detector, typi- crystals. Thus, one is justified in taking θ as the angle of
cally about 5–10◦ . For experiments on very high spin nu- the boundary. The other enormous advantage of the clover
clear states, the energy resolution is dominated by Doppler detector is that the energy is measured accurately, even for
broadening effects and is often a factor of two or more such scattering events. The energies measured in each sep-
worse than the intrinsic resolution of the Ge detector. For arate crystal may be added together while preserving the
very high recoil velocities the problem is much worse. Be- good energy resolution of the individual crystals. Because
cause of the sin θ dependence, the Doppler broadening is of this add-back feature, the efficiency of a clover detec-
worst for detectors placed at θ = 90◦ to the beam direction tor, consisting of four individual Ge crystals, is actually
(even though the Doppler shift is zero at 90◦ ). about six times the efficiency of the individual detector
Various methods have been developed to minimize crystals.
Doppler broadening effects. Most involve the concept of
detector segmentation in which one determines in what
part of a detector a given photon was detected. The devel- 4. Tracking Detectors
opment of the clover detector, for example, with four sepa- Recently, further advances in detector manufacturing tech-
rate Ge detectors closely packaged in a single vacuum ves- nology allow the electronic segmentation of a single crys-
sel, was driven by such concerns (Duchene et al., 1999). tal into smaller elements, thus further localizing the inter-
A schematic diagram of a clover Ge detector is shown in action site within the volume of the detector. The ultimate
Fig. 6. The idea is that by using four small detectors, one goal of these developments is the development of a track-
effectively has a much larger detector while preserving ing detector array, which actually allows one to follow the
the smaller opening angle for each individual segment. trajectory of each individual gamma-ray as it traverses a
Gamma-rays may interact in only a single element of the detector, even if it undergoes multiple scattering events en
clover detector. In this case one takes the angle θ to be the route.
Ideally, such an array needs to cover all the available
solid angle and localize each gamma-ray interaction to
within 1–2 mm in three dimensions. A variety of tracking
detectors are under development worldwide, including the
Gamma-Ray Energy Tracking Array (GRETA for short)
in the United States (Deleplanque et al., 1999).
Such an array would be very efficient. Simulations for
the proposed GRETA array indicate that it may be up to
a thousand times more sensitive than the best of today’s
spectrometers. This sensitivity comes about because of the
high-count rate capability (the relatively low-count rate in
each segment is the limiting factor, rather than the high
rate in the entire detector), excellent PT ratio, resolution,
and efficiency.
A prototype detector for the GRETA array has already
been extensively tested in Lawrence Berkeley National
Laboratory, Berkeley, CA. One key test involved the de-
termination of the gamma-ray interaction position by use
of a closely collimated source. The interaction positions
are determined by detailed measurements of pulse shapes
on an event-by-event basis. A comparison of measured
pulse shapes, with calculations show excellent agreement,
FIGURE 6 Schematic diagram showing the four Ge crystals of
which is a major first step in a proof of principle for the
a segmented clover Ge detector. In this type of clover detector,
to further improve position information, signals are taken from the detector. The next step in this project is to purchase a
center contacts of each crystal (labeled 1–4) and also from the mini-array of such detectors. These multiple detectors,
left, right, and middle parts of the outer electrical contacts. assembled into a closely packed array, allow one both to
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
do physics and to prove the principle of practical gamma- 1. Recoil Distance and Doppler Shift Attenuation
ray tracking for the first time. The proposal to construct Methods
this array is currently awaiting funding.
Typical recoil velocities following heavy-ion fusion-
evaporation reactions are a few percent the speed of light.
5. Pair Spectrometers The associated Doppler shifts of emitted gamma-rays
can be used to obtain level lifetimes. The recoil velocity
High-energy gamma-rays (with energies, say, in the
corresponds to an easily measured, maximum Doppler
1–10 MeV range) can interact with matter to produce
shift of about 20–30 keV in a 1000-keV gamma-ray. The
positron-electron pairs. When the positron annihilates,
fraction of the gamma-ray intensity which lies in the
two photons, each of 511 keV, are emitted at an angle
Doppler shifted peak can be proportional to the lifetime of
of 180◦ to each other. When such a pair-production pro-
the nuclear state. Two Doppler-based techniques are com-
cess occurs in a Ge detector, one or both of the 511-keV
monly used. The first, termed the Recoil Decay Method
photons may escape from the detector without being de-
(RDM), utilizes two parallel foils separated by a distance
tected. Hence, each gamma-ray transition leads to three
d. The nuclei of interest are produced in the first, thin foil
peaks in the spectrum, the full energy peak plus the so-
and recoil out of the foil with a well-defined recoil veloc-
called single and double “escape” peaks. This proliferation
ity. Having flown a distance d, they are rapidly stopped
of peaks can significantly complicate spectral analysis and
in the second, thicker stopper foil. If the nuclear state of
adversely affect nuclear level scheme construction.
interest decays while the nucleus is flying between the two
To improve such spectra one often uses a pair spectrom-
foils, then the gamma-ray will be emitted with the appro-
eter, which, in essence, is the inverse of the anti-Compton
priate Doppler shift. On the other hand, if the lifetime is
shield spectrometer discussed earlier. Whereas in an anti-
long enough that the nucleus reaches the stopper foil and
Compton shield any event detected in the shield is used
is stopped, the gamma-ray will be emitted from a nucleus
to veto the coincident event in the Ge detector, in a pair
at rest, without a Doppler shift. Changing the distance
spectrometer the simultaneous detection of 511-keV γ -
d between the foils can access different lifetime ranges.
rays on opposite sides of the Ge detector is used to posi-
Typically, the RDM technique is used to probe lifetimes
tively trigger (i.e., to select) the double escape peak in the
in the nanosecond to picosecond range.
Ge detector spectrum.
The Doppler Shift Attenuation Method (DSAM) is sim-
ilar to the RDM in that two foils are used. However, in
F. Measurements of Nuclear Level Lifetimes this case, the foils are placed in intimate contact with each
other. Now the recoiling nuclei immediately enter the sec-
Aside from the measurement of gamma-ray energies and
ond foil and begin to slow down and stop. If the nuclear
intensities, and the determination of gamma-ray transi-
lifetime of interest is of the same order of magnitude as
tion placements in nuclear level schemes by coincidence
the slowing down time of the nuclei in the foil, around
and Ritz Combination techniques, gamma-ray detectors
1–2 ps, then the gamma-ray transitions will be emitted
can be used to measure another extremely important ob-
with a range of Doppler shifts, ranging from the maximum
servable, namely, nuclear level lifetimes. These lifetimes
shift down to zero. Level lifetimes may be extracted by
are proportional to squares of quantum mechanical quan-
carefully analyzing the resulting, complicated peak shapes
tities called transition matrix elements and therefore can
and comparing them to model calculations. Of course the
directly reveal insights into nuclear structure and the prop-
calculations also have to include the slowing down process
erties of nuclear excitations.
itself. The DSAM method is sensitive to level lifetimes on
Two general classes of techniques are used: those in-
the order of picoseconds to femtoseconds, i.e., somewhat
volved in directly measuring the time difference between
shorter than those accessible with the RDM method.
successive gamma de-excitations in a nucleus and those
based on Doppler effects. The former has traditionally
been limited by the rise time of voltage pulses from de-
2. The GRID Technique
tectors to the nanosecond range, but advances using faster
scintillation detectors have pushed the frontiers of elec- Another Doppler-based method is used at the ILL in
tronic time measurements farther down to nearly the pi- Grenoble, referred to earlier (Koch et al., 1980), using
cosecond range. Doppler techniques are typically used to the ultra-high resolution crystal diffraction instruments
measure lifetimes from hundreds of picoseconds down to GAMS4 and GAMS5. In this approach, a thermal neu-
the few femtoseconds range. These techniques cover a tron from a reactor is captured by a target nucleus which
range of lifetimes characteristic of a wide variety of nu- then emits a series of gamma-rays from the capture state
clear decays. (typically lying at an excitation energy of about 6 MeV) to
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
lower lying states. Each emitted gamma-ray carries a small nique has proven to be quite useful in studies of nuclei off
linear momentum, p = E/c. Hence, the emitting nucleus the line of nuclear stability in β-decay experiments.
recoils in a direction opposite to that of the gamma-ray. If
the same nucleus then emits a subsequent gamma-ray prior
to stopping in the target material, the second gamma-ray
will be Doppler shifted. III. GAMMA-RAY SPECTROSCOPY
The shifts are exceptionally small. Typical recoil ener- AND NUCLEAR STRUCTURE
gies are a few electron volts, and therefore, the technique
relies of measuring Doppler broadening effects of this The atomic nucleus is a unique, many-body quantum
order using crystal diffraction techniques. Note that one mechanical system. When describing nuclei, numbers of
measures a Doppler broadening rather than a shift be- the order of 100 seem to occur frequently. For example,
cause the gamma-ray emission of the ensemble of nuclei the depth of the potential holding the protons and neu-
is effectively isotropic. The technique is known as the trons, collectively known as nucleons, together is about
GRID (Gamma-Ray Induced Doppler) technique (Borner 50 MeV. The maximum angular momentum the nucleus
and Jolie, 1993) and, like the DSAM, is useful for nuclear can hold before centrifugal forces break it apart is about
✏
level lifetimes shorter than or on the order of the stopping 100 h, which occurs for nuclei around mass 100.
time, typically ∼1 ps. Typical nuclei have a few hundred constituent nucleons.
This number implies that the nucleus occupies a unique
position in the plethora of quantum systems found in na-
ture. A few hundred particles grouped together is sufficient
3. Fast Timing Spectroscopy
to allow one to contemplate macroscopic nuclear proper-
The coincidence measurements with Ge detectors de- ties such as shape and surface area and thickness. One the
scribed above are typically used to establish nuclear level other hand, it is few enough that the addition or subtrac-
schemes. Such measurements utilize coincidence resol- tion of a single proton or neutron can radically change the
ving times on the nanosecond time scale (10−9 sec). How- behavior of the whole system. Indeed, one of the appeal-
ever, the time response of BaF2 scintillation detectors is ing features of the nucleus is that it is a many-body quantal
much faster than that of Ge detectors and, with special system in which the number of interacting bodies can be
care, can be reduced to the few picosecond range. Hence, precisely controlled, measured, and varied. We will see a
coincidence timing can also be used to directly measure stunning example of the microscopic nature of the nucleus
nuclear level lifetimes in the few tens of picoseconds below when we discuss the phenomenon of backbending.
range, which is typical of the lifetimes of many collective This mixture of macroscopic and microscopic behavior in
excitations in medium mass and heavy nuclei. In practice, a strongly interacting system (the nucleons are after all
the technique, called FEST (Fast Electron Scintillation bound together in the nucleus by the effects of the strong
Timing) [see Buescher et al. (1990) for a simplified dis- force) is nearly unique in nature.
cussion and for references to more technical literature], The behavior of the nucleons inside the nucleus can be
is most commonly used in β-decay experiments where likened to the behavior of a herd of wild animals. The
the time is measured between the emission of a β-ray (de- herd clusters together for protection, defining a shape and
tected in a thin, fast plastic scintillator) and the subsequent form. (The Hungarian word for such a herd is gulyas, so
gamma-ray emission in the daughter nucleus. the nucleus is a bit like a goulash soup of nucleons.) How-
The technique must be used with great care. One prob- ever, the behavior of a single animal can have dramatic
lem is that the BaF2 detectors have very poor energy res- effects on the collective motion of the whole system. In
olution (∼10%). Additional gamma-ray selection, by co- the following sections, we will describe some of the fea-
incidence with cascade gamma-rays using Ge detectors tures of the excited atomic nucleus and attempt to describe
(with “normal” nanosecond timing), is normally needed a few of the many manifestations of its macroscopic and
to simplify the BaF2 spectra to one of two gamma-rays at microscopic behavior.
most. Therefore, most applications are in low multiplicity Generally speaking, atomic nuclei can be excited from
experiments. Another serious problem relates to the en- the “bottom up” using reactions such as Coulomb excita-
ergy dependence of the timing. A gamma-ray moves at tion, inelastic scattering, or direct reactions, or from the
the speed of light and in 3 ps travels ∼1 mm. Since typical “top down” using β-decay, neutron capture, and heavy-
BaF2 detectors have sizes on the order of centimeters, it is ion fusion-evaporation reactions. The former approach
clear that the timing is sensitive to the exact position in the most often excites states selectively, while the latter ap-
crystal where the gamma-ray absorption occurs. Hence, proach is much less selective, tending to populate most
the time properties of such detectors are energy dependent states along a myriad of possible de-excitation routes,
and must be carefully calibrated. Nevertheless, the tech- subject only to constraints due to angular momentum
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
selection rules or phase space considerations. Gamma-ray duction of β-decay parent nuclei can be achieved by sim-
spectroscopy is most often used in this second approach. ple reactions such as ( p, n) or by heavy-ion reactions. The
When gamma-ray spectrometry is used in “bottom up” simpler, lower energy reactions tend to form only one or a
techniques, such as Coulomb excitation, it is exploited pri- couple of parent nuclei, whereas heavy-ion reactions may
marily as an indicator of the excitation probability of par- form many times more, and, in that case, selection tech-
ticular levels rather than as a study of de-excitation modes niques are needed to select the decay products of interest.
per se. A popular technique in β-decay is the use of moving
In this section, we will discuss a number of aspects tape collectors in which the activity is collected on a tape
of gamma-ray spectrometry. Although the distinction is a (e.g., movie reel tape or aluminized Mylar) for some pe-
bit artificial, it is convenient, and historically pertinent, to riod of time (typically ∼1.8 times the half-life for the
break the discussion up into the study of low- and high- desired β-decay). The tape is then moved to a low back-
spin states. ground area for detection of gamma-rays following decay.
Collection of a new activity at another spot on the tape pro-
ceeds simultaneously.
A. Low-Spin States
Gamma-ray spectroscopy following β-decay was for
The study of the low-spin nuclear states dates back to many years in the 1950s–1970s a standard technique used
the beginning of nuclear structure and is the basis for our to elucidate nuclear structure. Since β-decay itself carries
understanding of the equilibrium structure of nuclei and off little or no angular momentum, the spin states acces-
its evolution with nucleon number. Low-spin states are sible with this technique are generally those within ±2–3 h ✏
typically populated following β-decay, neutron capture, of the parent (ground or isomeric) state.
Coulomb excitation, or photon scattering reactions. In recent years the technique has enjoyed a renaissance
with the use of arrays of much higher efficiency Ge detec-
tors (e.g., clover or cluster detectors). Since the gamma-
1. Beta-Decay
ray multiplicity following β-decay is low and there is no
Nuclei formed off the valley of stability decay back toward Doppler effect, the detectors can often be mounted in close
stable nuclei via β-decay (which includes the processes of geometry to maximize count rates and achieve consider-
β − , β + , and electron capture decay). Typically, β-decay able coincidence efficiencies.
populates several excited levels in the daughter nucleus. One current setup for such studies is the Yale mov-
Half-lives near stability range from seconds to days. Pro- ing tape collector (Casten, 2000). Illustrated in Fig. 7,
FIGURE 7 Diagram of the Yale moving tape collector showing the target box, counting area, and tape holding box.
Activity is deposited on the tape in the target box, with the beam entering from the left. It is then transported to the
counting area. The holding box provides a delay to let unwanted extraneous activity decay away before the tape once
more returns to the target box.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
(n, γ ) studies have used the ultra-high energy resolution band, where the spacing between adjacent transitions is
GAMS crystal diffraction detectors (see Fig. 3b). Studies constant, is shown in Fig. 8 (the most intensely popu-
of nuclei such as 196 Pt (Cizewski et al., 1978) and 168 Er lated superdeformed band in 150 Gd). However, usually,
(Davidson et al., 1981) with (n , γ ) have provided some of dramatic changes in structure occur (e.g., due to centrifu-
the most comprehensive and complete level schemes ever gal forces or quenching of pairing) as a nucleus rotates
produced and have provided key tests of nuclear models, faster and faster. These are manifest as deviations from
such as the Interacting Boson Model. Today, most fore- the simple linear dependence outlined above. For exam-
front (n, γ ) work is carried out using the GRID technique ple, a spectrum of the ground state rotational band of 158 Er
to measure lifetimes with GAMS detectors. is illustrated in Fig. 9, where the lines indicate transitions
linking states with increasing spin. Notice that at gamma-
ray energies of about 400 keV the transitions double back
B. High-Spin States on themselves. This phenomenon is called backbending
1. Backbending and the Pauli Principle and corresponds to a dramatic change in the internal struc-
ture of the nucleus.
One of the fundamental questions to ask about a nucleus is: The origin of this structural change lies in the effects of
What is the shape? Is the nucleus spherical, like a soccer the familiar Coriolis force on the microscopic structure of
ball, or deformed, stretched out like an American foot- the nucleus. As we have stressed, the nucleus is not a rigid
ball or perhaps flattened like a Frisbee? It turns out that body, but instead is made up of only a few hundred protons
some nuclei are spherical; some are deformed like foot- and neutrons that orbit the center of mass in orbits char-
balls; and some are deformed like Frisbees. The excitation acterized by particular angular momenta. We know that
spectrum of deformed nuclei is particularly easy to un- many medium mass and heavy nuclei exhibit properties
derstand. A deformed system has a defined orientation in similar to those of a superconductor. In the ground state
space (it is not isotropic), and rotations of this shape can be of an even-even nucleus, all of the protons are coupled
observed. A quantum mechanical rotor has an excitation pairwise, in identical but time-reversed orbits, so that the
energy given by total angular momentum of each pair is zero. Similarly, the
h2 ✏
neutrons are also paired. Hence, the total angular momen-
E(I ) =I (I + 1), tum of the ground state of any even-even nucleus is zero.
2J
As an interesting aside, it follows that in an odd-proton or
where I is quantum number counting the angular mo- odd-neutron nucleus, the ground state spin and parity is
mentum of the state (I = 0, 2, 4, . . . for the ground state usually determined by the quantum numbers of the final
rotational band, the odd spins are missing for symmetry unpaired proton or neutron.
reasons which are not relevant here) and J is the moment The question, therefore, becomes, what happens to
of inertia of the nucleus. The gamma-ray energy (which is these pairs of protons and neutrons as the nucleus as a
measured in the experiment) is just the energy difference whole begins to rotate? Just as a person walking on a
between adjacent states. merry-go-round experiences a force on the moving plat-
h2
✏
form, the so-called Coriolis force, the nucleons in the nu-
E γ (I → I − 2) = [I (I + 1) − (I − 2)(I − 1)] cleus also experience the effect of the rotating bulk. Just
2J
as with the merry-go-round, the Coriolis force increases
h2 ✏
=[4I − 2]. the faster the nuclear rotation or the orbital velocity. In-
2J deed, the size of the Coriolis force is such that at moderate
Thus, the gamma-ray energy increases linearly with angu- nuclear rotational frequencies it perturbs the orbits of the
lar momentum. For gamma-rays linking adjacent levels, particles sufficiently that the pairs of nucleons will begin
the energy difference is given by to break apart. This has the effect of dramatically chang-
ing the excitation energies of the states and the gamma-ray
E γ = E γ (I → I − 2) − E γ (I − 2 → I − 4) energies for transitions between them. It is this breaking
h2
✏
of the superconducting pairs that is responsible for the
= [(4I − 2) − 4((I − 2) − 2)] backbending observed in Fig. 9.
2J
An illustration of the effects of the Pauli exclusion
4h2 ✏
FIGURE 8 Spectrum of the most intensely populated superdeformed band in 150 Gd. In addition to the regular
picket-fence pattern of gamma-rays associated with decays of superdeformed states, the spectrum also shows,
at lower energies, the complex pattern of transitions depopulating the nearly spherical normal deformed states in
150 Gd.
unpaired proton is in different orbits about the nucleus. major to minor axes, typically 2:1 or 3:2. The observation
In one of these cases the odd-proton acts like a spectator of the first high-spin SD bands in 152 Dy and 132 Ce, by the
to the underlying even-even nucleus, and in this case the Liverpool University groups of Peter Twin and Paul Nolan,
above backbending phenomenon occurs as before. In the respectively (Twin et al., 1986; Nolan et al., 1985),
other band, however, the odd-proton occupies one of the sparked an enormous worldwide effort to discover addi-
orbits of the pair of aligning nucleons. The pair breaking tional examples of highly deformed nuclei and to charac-
is therefore prohibited by the Pauli exclusion principle, terize the properties of such highly stressed systems
and the backbending is delayed until higher rotational fre- (stressed both by the application of very high angular
quencies when it becomes possible to occupy higher lying momenta and by extreme values of deformation). Today,
orbits. about 40 nuclei in four main mass regions, or islands, have
been shown to exhibit SD behavior. Most of these nuclei
have more than one known SD band.
2. Superdeformation
Superdeformed rotational bands are generally charac-
One of the forefront areas of research in high-spin nu- terized by extremely regular gamma-ray energy spacing.
clear structure physics over the last decade has been the The energy spacing from one transition to the next in the
study of superdeformed (SD) nuclei. These states exist in rotational band is either constant or varying slowly and
a second minimum in the nuclear potential energy surface regularly from one transition to the next. For example,
in which the nucleus takes on an ellipsoidally deformed the strongest SD band in 150 Gd is illustrated in Fig. 8.
shape which roughly corresponds to an integer ratio of The regular picket-fencelike pattern of SD transitions is
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
FIGURE 9 Spectrum illustrating the ground state rotational band of 162 Er illustrating the backbending phenomenon.
(Figure courtesy of Mark Riley).
unmistakable in this spectrum as is the irregular pattern of the rotational band being perturbed up and down in en-
transitions de-exciting lower lying normal deformed states ergy by very small amounts, on the order of 60 eV. This
(150 Gd is nearly spherical in its ground state). staggering essentially separates the rotational band into
Due to this regularity, which is the rule rather than the two
I = 4 h sequences. The origin of the perturbation,
✏
✏
exception for SD bands, one can feel confident in predict- which affects states differing in spin by 4 h, is still unclear.
ing where transitions in a given band should occur. How- Several theoretical models have been proposed to explain
ever, detailed measurements of the strongest SD band in this phenomena, none of which, however, can reliably pre-
149
Gd, using the Eurogam Ge detector array, revealed a dict which SD bands should exhibit staggering and which
very small deviation from this smooth behavior (Flibotte should not.
et al., 1993), Indeed, it was found that every second
energy spacing was larger/smaller than the average. The
3. Magnetic Rotation and Chiral Symmetry
deviation, illustrated in Fig. 11, is very small, only about
0.25 keV, and is measurable only due to the very high Interesting effects have also emerged from the study of
quality spectra available from the Eurogam array. It is be- near-spherical nuclei. One of the consequences of quan-
lieved that the deviation is caused by alternate states in tum mechanics is that the rotation of a spherical shape
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
FIGURE 12 A partial level scheme of the odd-odd nuclei 136 Pm (61 protons and 75 neutrons) and 138 Eu (63 protons
and 75 neutrons). The proposed chiral twin bands are shown on the left of each level scheme.
right- or a left-handed coordinate system. The so-called been coupled to high-transmission magnetic separators.
3d-tilted axis-cranking model, developed by S. Frauendorf A magnetic separator is a device placed behind the target
and J. Meng (1997), addresses such a system and position which will selectively transport nuclei, produced
predicts a doubling of energy levels, one corresponding to in a reaction, to its focal plane where they can be de-
each chirality or handedness. For complete symmetry, the tected and identified using a variety of different detectors.
levels of the same spin and parity would be degenerate. If Residual nuclei that are not of interest, or scattered beam
the solutions for different chiralities mix, then the degen- particles, will not be transmitted through the separator.
eracy will be broken, and one set of states, corresponding Very small fractions of the total reaction cross section can
to a
I = 1 h rotational band, will be lifted with respect
✏
be selected using this method. Nuclear structure informa-
to the second band. Indeed, two
I = 1 h bands in the
✏
tion is obtained by detecting gamma-rays produced at the
doubly-odd nucleus 134 Pr have been proposed as a pos- target position, in coincidence with recoils detected at the
sible chiral candidate (Frauendorf and Meng, 1997). Fol- focal plane. One example of the use of this technique is
lowing on this suggestion, several other candidate bands illustrated here.
have been observed in nearby nuclei (Starosta et al., 2001; One of the goals of nuclear physics is to understand
Beausang et al., 2001; Hecht et al., 2001), while candidate the limits of nuclear existence as functions, for example,
bands have also recently been reported in doubly-odd 188 Ir of angular momentum, isospin, or indeed mass. For ex-
(Balabanski et al., unpublished). The proposed chiral twin ample, what are the heaviest nuclei that can exist? For
bands in 136 Pm and 138 Eu are shown in Fig. 12. many years now, various models have predicted that an
island of superheavy nuclei should exist. However, most
models disagree as to the exact proton and neutron num-
bers categorizing this island and indeed on the extent of
C. Spectroscopy in Coincidence
the island. Recently, models have predicted that these su-
with Separators
perheavy nuclei might indeed be deformed. Therefore, it
A great deal of exciting new spectroscopy of nuclei far is very relevant to inquire as to what is the structure of
from stability or with very large Z has been achieved over the heaviest nuclei accessible to gamma-ray spectroscopy
the last several years when large Ge detector arrays have and to ask the simplest type of questions about them, for
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
example, are they spherical or deformed? Unfortunately, the limits of accessibility, down to 1 particle per second
the production cross sections for superheavy nuclei are or even less. Therefore, detectors will have to be corre-
such that, even using very intense beams, only one or spondingly more efficient. Second, because the nucleus
two nuclei are produced per week or two. These small to be studied is sometimes the one produced as a beam
numbers are clearly beyond what we can measure with by the radioactive beam facility, most experiments will be
existing gamma-ray facilities. Therefore, we cannot ad- done in inverse kinematics in which the roles of beam and
dress the spectroscopy of the superheavy elements (yet). target are interchanged.
However, we can look at the structure of very heavy nuclei In inverse kinematics, m b > m t where m b and m t are the
lying just below these unattainable regions. masses of the beam and target nuclei. Therefore, the reac-
Recently, groups at Argonne National Laboratory in tion products all go forward in the laboratory system. For
the United States and at the University of Jyvaskyla in m b m t , this forward focusing results in a quite narrow
Finland carried out tour de force experiments to study the cone of reaction products. For example, for elastic scat-
excitation spectrum of 254 No (Leino et al., 1999). With tering of 62 Ni on 12 C, the maximum allowed scattering
Z = 102, No is the heaviest nucleus for which gamma- angle is ∼10◦ . This has two principal effects. First, mea-
ray spectroscopy has ever been carried out. The gamma- suring angular distributions of reaction products is much
ray spectrum of transitions de-exciting states in 254 No is more difficult. Second, on the other hand, it is possible to
shown in Fig. 13. A rotational band structure is clearly capture much larger percentages of the reaction products
visible, indicating that 254 No is in fact a deformed nu- in the acceptance angles of various types of charged par-
cleus. A very surprising feature of the spectrum is that ticle spectrometers and mass separators, thereby enhanc-
the rotational band is observed up to very high spins ing counting rates. These considerations impose design
∼18 h, (an amazing number for such a heavy, fissile nu-
✏
constraints on gamma-ray detectors surrounding the tar-
cleus). The existence of a rotational cascade up to spin get. First of all, ultra-high efficiency is needed. Second,
∼18 h, well beyond the classical fission barrier limit, indi-
✏
generally, a forward angled cone needs to be left free of
cates that 254 No is held together primarily by microscopic detectors.
shell effects, rather than macroscopic liquid drop binding, The requirement of maximal gamma-ray counting ef-
as in normal nuclei. Shell effects, for certain favorable pro- ficiency generally means a close geometry and detectors
ton and neutron numbers and for favorable deformation, that subtend large solid angles. However, Doppler effects
can provide an additional 1–2 MeV of binding energy. It can then be very large, especially when using inverse kine-
is this binding energy, which does not depend strongly on matics, and high detector granularity will generally be
angular momentum, which holds 254 No together to such critical. This granularity can currently be achieved in two
high spin. ways and considerable development in both directions is
needed. One is the use of highly segmented tracking arrays
such as GRETA discussed earlier. The other is the use of
D. Experiments with Radioactive Beams
position-sensitive Ge detectors of the type developed by
Today, a new era in nuclear structure physics is opening up Glasmacher and colleagues for use in intermediate energy
with access to a much wider selection of nuclei, extending Coulomb excitation experiments at MSU (Muller et al.,
far beyond the valley of stability and encompassing nu- in press). In these detectors a resistive readout at the two
clei that are expected to be exotic in both proton/neutron ends of a linear Ge crystal allows the localization of the
composition and structure. The physics opportunities with γ -ray interaction to an accuracy of ∼2 mm. This detec-
such beams have been discussed elsewhere (RIA Physics tor is capable of measurements even with beam intensi-
White Paper, 2000) and need not be repeated here. What ties of ∼1 particle per second or even less. An example
are relevant are the particular methods of carrying out of a gamma-ray spectrum (corrected for Doppler effects)
gamma-ray spectroscopy on exotic nuclei. Basically, the from intermediate energy Coulomb excitation taken with
techniques to be used will be familiar ones, such as β- an early generation detector system (using NaI(Tl) detec-
decay, Coulomb excitation, and fusion-evaporation reac- tors) is shown in Fig. 14. This data was taken on 40 S in
tions. High-, medium-, and low-spin states will all present order to test predictions of the underlying particle mo-
topics of interest. tion in exotic nuclei with a high excess of neutrons over
Experiments with radioactive beams differ primarily in protons (the heaviest stable isotope of sulfur has 20 neu-
two critical respects from their stable beam siblings. First, trons, 40 S has 24 neutrons). The Coulomb excitation was
beam intensities will often be much lower than with sta- accomplished in this case using a 197 Au target.
ble beams. Instead of beams of 1011 particles per second, For specialized experiments, such as low-energy
many experiments will need to be carried out with inten- Coulomb excitation in inverse kinematics designed
sities that are less than 106 particles per second and, at explicitly to excite only the lowest one or two states,
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
FIGURE 13 Spectrum of gamma-rays depopulating excited states in 254 No. With Z = 102, 254 No is the heaviest
nucleus for which gamma-ray spectroscopy has ever been accomplished. The gamma-rays are labeled by their
transition energies in kilo-electron volts and also by the spin of the state they depopulate. The inserts show the
population intensity as a function of spin for the two beam energies, 215 (top) and 219 MeV (bottom). More angular
momentum is brought into the system at higher beam energies, and this is reflected in the stronger population of
higher spin states in the lower spectrum (Leino et al., 1999).
the gamma-ray spectra are particularly simple. Therefore, are both unimportant and undetected. One typical design,
energy resolution is not a problem and high efficiency the GRAFIK detectors (Sheit et al., 1996) actually incor-
can be obtained with, for example, low-resolution NaI(Tl) porates the target inside an annular hole in the detector,
detectors placed in close geometry. Here, Doppler effects achieving ∼80% of 4π solid angle coverage.
P1: GPJ Final
Encyclopedia of Physical Science and Technology En006G-273 June 29, 2001 21:10
Riley for providing the figures we have used. This work was supported
in part by the U.S. DOE grant number DE-FG02-91ER-40609.
BIBLIOGRAPHY
High-Pressure Research
E. F. Skelton
A. W. Webb
Naval Research Laboratory
I. Introduction
II. High-Pressure Research Environments
III. Measurement of Pressure
IV. Research at High Pressure
345
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
treatments have been successful in producing this transfor- (∼50 cm3 ) can achieve pressures up to 1 GPa, while the
mation. In this cubic phase, the superconducting transition more recently developed diamond-anvil cells can be used
temperature of Nb3 Si is 18.5–19 K, as compared to 0.29 K to subject microscopic-size samples (∼10−8 cm3 ) to pres-
in the tetragonal phase. sures in excess of 500 GPa. Each of these systems is dis-
The role of pressure in understanding physical proper- cussed in more detail below.
ties of materials can also be of importance. An example Much higher pressures can be achieved for brief periods
where it may be vital is in understanding the origin of of time by using conventional or nuclear explosive-driven
superconductivity in certain organic charge transfer salts shock waves. Current research in this area also involves
based on the cation molecule ditetramethyltetraselenoful- the study of shock waves produced by high-powered laser
valenium (TMTSF), since all but one of these salts is only beams. These shock pressures are usually accompanied
superconducting at elevated pressures. by significant elevations in the sample temperature. The
pressures achieved by shock waves can be in range of a
few tesla Pascals.
I. INTRODUCTION
B. Historical Review
A. Definition of Pressure
The father of modern high-pressure research is Percy W.
Pressure is defined as the ratio of a force divided by the Bridgman, a man who dedicated his professional life to
area over which that force acts; thus, the units of pressure, high-pressure experimentation. Working at Harvard Uni-
force per unit area, are newtons per square meter (=1 Pa), versity, he published over 200 research papers and, in ad-
dynes per square centimeter (=10−6 bars), or pounds dition to a wealth of basic scientific research, was also
per square inch (at sea level 1 atm pressure = 1.0133 × responsible for several important technological discover-
105 Pa = 1.0133 bars = 14.696 lb in.−2 ). In this article we ies: the principle of unsupported area, known today as
shall use the S.I. unit of pressure, the Pascal, abbreviated the Bridgman seal, and the principle of massive support,
Pa. Since the pressure, or more correctly the stress state, upon which the Bridgman anvil is based. In 1946 he was
will vary depending on the direction of the force relative awarded the Nobel Prize for his pioneering work in this
to the area over which it is applied, in a strict sense, it is field.
necessary to consider the six independent components of The work following Bridgman can be divided into
the stress tensor. In terms of a working definition, how- three parts based on the experimental technique employed:
ever, most researchers usually presume that they are deal- dynamic or shock pressures, large-volume studies, and
ing with hydrostatic pressure or something close to it, in diamond-anvil cell work. Each of these methods will be
which case the three diagonal elements of the stress ten- discussed in detail in the body of this article.
sor are the same value and all off diagonal terms, which
represent the shear components, are zero.
In modern high-pressure research, the force is usually II. HIGH-PRESSURE RESEARCH
transmitted to the sample of interest via some medium. If ENVIRONMENTS
that medium is a fluid, then a true hydrostatic pressure en-
vironment does exist, although today much high-pressure A. Static Pressures
research is carried out in the range of tens of gauss Pascals
1. Fluid Systems
pressure or above (1 GPa = 10 kbar), where few materials
remain in the fluid state. Consequently, much work is done The application of pressure by means of a fluid, whether
today under conditions of quasi-hydrostatic pressure (i.e., liquid or gas, has generally proven to be the most satis-
since the forces are transmitted through solidified fluids factory method. These hydrostatic environments will not
or relatively soft solids, shear stress components can be a subject a test specimen directly to shear stresses; how-
problem). ever, oftimes the reaction in the sample may be anisotropic
Natural pressures found in our universe range from and internal shear components can exist. The maximum
100 kPa in the atmosphere at sea level, to about 100 MPa at pressure obtainable in a fluid system is limited by the de-
the deepest part of the oceans (the bottom of the Marianas sign and strength of the container, often referred to as a
trench in the Pacific Ocean), to 0.36 TPa at the center of the “pressure bomb.” The ultimate strength of even compound
earth, to tens of tesla Pascals at the center of our sun. To- cylinders poses a limit, but more often the sealing of such
day, synthetic, or man-made, pressure environments can systems presents the defining pressure limit.
be produced in the laboratory to span this range. In terms One of the best solutions to the sealing problem is
of static pressures, the relatively large volume systems the “unsupported seal” designed by Bridgman: in this
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
mechanism, the confining pressure is itself used to help pumped to the system capability. This flooded leg is then
seal the system. A mushroom-shaped piston or plug faces pumped with oil, thereby raising the pressure in the gas
the system on its full diametrical face, but in turn trans- to the maximum attainable with the liquid system. This
mits the total resultant force to soft annular packing that is often takes several cycles of the system with appropriate
placed between the plug and the driving rod. The smaller valving, since the gas is significantly more compressible
stem of the plug projects into a well in the rod for align- than the pump fluid.
ment purposes, but the well has sufficient depth that the A second method of increasing gas pressure is through
packing must bear the full load. Therefore, the pressure the use of a stepped piston intensifier. In this device, a
on the packing is always larger than the fluid pressure by large oil-driven piston is mechanically linked to a smaller
a constant factor. This factor is related to the relative areas diameter piston, which uses the mechanical advantages
of the piston face and the annulus. In practice, the packing of the ratios of the areas to pump the gas to pressures
material may vary from rubber, for lower-pressure sys- perhaps as much as 10 times that of the oil. As in the
tems, to stacks of softer and harder metals, with In, Pb, or previous case, several cycles may be needed in order to
even annealed Cu serving as the sealing agent. attain the maximum effect.
Although the working pressures obtainable in fluid sys-
tems are generally below 1 GPa, even at these modest
3. Experimental Probes
pressures, the viscosity of many fluids will have increased
to the point where small-bore plumbing no longer al- One subject receiving extensive study is the fluids that
lows equilibration of the pressure throughout the system. serve as compression agents, as discussed above. Fusion
We shall return to this problem in the discussion of the curves, especially at low temperatures, are, of course, of
diamond-anvil cells. considerable interest. The transformation from the Fluid to
the solid state requires assumptions concerning the magni-
tude of the strains present and their effect on the pressure–
2. Means of Pressure Generation
temperature status.
Liquids can be pumped on by any of several meth- Experiments designed to map out equations of state for
ods. Hand-operated piston pumps have served for small- selected fluids are more ambitious. These require simulta-
volume systems, although special modifications for the neous determination of the pressure, volume, and temper-
sliding seal and check valves may be needed as the ulti- ature (P, V , T ) of a given system. The measurement of
mate pumping pressure increases. In some cases, double the volume is the most difficult, since even heavy walled
checks have been found to be effective, and all parts must containers will undergo some, albeit small, deformation
be machined to extremely close tolerances and must re- under load. One technique is to seal a known amount of
ceive fine finishes. For larger systems or very high pres- the fluid of interest in a system separated from the pressure
sures, these pumps are tedious to operate. fluid by a bellows. A sensing system, either internal or ex-
Another manually operated system employs a large ternal, monitors the change in the extension of the bellows
hand-operated wheel driving a screw that forces a piston of the system and, thus, the volume as the pressure and
into the pump cylinder. This type of system is limited in temperature are varied.
maximum pressure by the relative volumes of the pump, If the pressure containers employed are fabricated of
the system, and the stroke of the piston. nonmagnetic materials, such as Be–Cu, then changes in
Pumps adapted from manually operated units and the magnetic properties of the contained material can be
driven electrically are common at hydraulic system pres- assessed through the walls, thereby obviating the need for
sures. Air-driven diaphragm pumps using normal air are direct contact into the pressure chamber. Although this
convenient for the operation of relatively large systems to will greatly simplify the experiment by removing the need
pressures approaching 200 MPa. to pass electrical leads through the pressure walls, it can
Liquid systems are also encountered in which the liquid have the drawback of a large experimental error since the
is sealed in a small cylinder with a piston that is loaded sample usually represents only a small portion of the ex-
with a hydraulic press and then clamped. Such systems ternal sensing coil volume.
can be made quite small and, for this reason, often find The issue of making electrical contact to the contained
applications in research involving cryogenic temperatures sample is not a simple one. Insulated wires must be
or high magnetic fields. brought through the pressure wall into the fluid-filled vol-
Gas systems can be pumped directly to pressures ap- ume without pinching off or extruding the wires them-
proaching 100 MPa; pressures above this must be obtained selves or destroying the insulation. Extrusion is usually
indirectly. One of the oldest methods is to use a U-shaped, controlled by placing a conical portion of the lead on the
high-pressure chamber with Hg forced into one leg by gas high-pressure side with an insulating sleeve that is only
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
driven tighter into its seat with the application of pres- of friction, such as some of the transition metal dichalco-
sure. A preload is useful in setting the seal before the ini- genides, for example, MoS2 or WSe2 . In these latter cases,
tial application of pressure. Liquids will not pass through the aforementioned unsupported seal can be employed.
the insulation as readily as gases, especially He, and
therefore, this type of probe is often used in clamp-type b. Design principles. Normally, a piston is ground
cells. with a very small clearance in the lapped cylinder, on
Another sealing technique involves the use of leads the order of 0.0005 in. This is usually sufficient to seal
swaged in high-temperature insulation with stainless steel most solids, except as noted above. For lower pressure ap-
jacketing. These leads are common with thermocouple as- plications, the piston and cylinder are made of hardened
semblages; other special conductors can also be used. In tool steel, with the piston made somewhat harder than
these applications, the metal sheathing is silver soldered the cylinder. With quality steels, pressures approaching
into a threaded plug that passes the leads into the pressure 2.5 GPa can be attained in these systems. Higher pres-
cavity. The interior end is sealed with epoxy. sures require harder materials, with WC being the most
A third technique that has been successfully used to tractable. This is a brittle material, and while it does have
pressures up to 1 GPa involves passing the leads from a high compressive strength (∼800,000 psi), it is weak in
the high-pressure environment through a second length of tension—its tensile strength is usually only about 2% of
high-pressure tubing, with the first being used to pressur- its compressive strength. Therefore, it needs support that
ize the system. A large U-bend is placed in this tubing is usually provided by two or three interference fit support
that is filled with an oil and then submerged in a liquid rings or hardened steel. These will serve to prestress the
nitrogen dewar, thus freezing the oil through which the WC and allow it to be worked to much higher pressures.
leads pass. Care must be taken to insure that liquid oil is These binding rings are designed to maintain a compres-
present above the frozen solid on the high-pressure side sive load on the WC to its maximum working pressure.
in gas-pressurized systems.
c. Failure modes. Shortly after the internal pressure
exceeds the sum of the interference compression and the
4. Piston-Cylinder Devices modest WC tensile strength, radial tensile fracture occurs.
a. General. Simpler, in principle, than fluid bombs If the problems with radial fracture can be forestalled,
are the piston-in-cylinder devices. In their simplest form, the next mode of failure is breakage along a plane per-
a hole is drilled in a block of solid material and, after pendicular to the cylinder axis. The solution to this is to
plugging one end or using a blind hole, the test sample again prestress the WC, this time with end clamping of
is inserted (see Fig. 1). A strong, close-fitting piston is the cylinder. Some researchers have utilized sophisticated
then inserted and the pressure is applied. Solid materials hydraulic clamping systems to adjust the end loading pro-
are readily studied in such a device, unless they are very portional to the sample pressure; however, the equipment
soft, such as Pb, In, or polymers, or have a low coefficient needed for such work can be costly, and the ultimate pres-
sures achieved can usually be attained more easily by other
techniques. Heavy clamping bolts and support rings can
also be used for such loading.
Some designs, such as a cell developed at the University by a screw-driven wedge, have also proven effective. In
of Rochester of single crystal X-ray studies, are quite sim- order to achieve remote and/or programmable control of
ple, consisting of two triangular plates clamped together the load, the piston anvil can be driven by a hydrauli-
by three adjustable screws. cally operated mechanism or by an electric stepping mo-
Critical in the design and operation of the DAC is the tor. One DAC designed for operation at cryogenic tem-
alignment of the anvils themselves. It is very important peratures employs a metal bellows chamber pressurized
that the culet faces be parallel and normal to the uniaxial with He gas cooled from room temperature; this system
force direction. In earlier work, one anvil face was typi- could achieve pressures of 10 GPa at temperatures as low
cally made much larger than the other, for example, 1.2 vs as 30 mK.
0.6 mm diameters. This frequently led, at higher loads, to As with Bridgman anvils, the pressure across the dia-
surface fracture of the larger diamond face due to penetra- mond faces is not uniform. If a sample is placed directly
tion by the smaller. The modification to avoid this was to between the two anvils, as the load is applied, the sample
use a matched pair of diamond anvils with approximately will extrude laterally from the containment region until the
equal surface areas and to align their axes as well as their frictional forces between the sample and the anvil faces
planar surfaces. become sufficient to contain the load. Under these condi-
The principle of massive support is being applied to the tions, the sample will be exposed to an extreme pressure
diamond anvils by the addition of culet faces that optimally gradient, ranging from a maximum near the center to al-
make an angle of 5–10◦ with the culet flat; the load on these most atmospheric pressure at the periphery. This carries
faces decreases as one travels radially out from the center. the added feature of subjecting the material under study to
The highest pressure attained with these beveled anvils is large shear strains. An advantage of this, however, is that
on the order of 550 GPa, as of this writing. the response of the material to a wide range of pressures
In achieving these two alignment criteria, two designs can be examined at a single setting. In earlier work, phase
have become somewhat generally adopted. One design transitions were detected optically with this technique.
has the anvils mounted in hemicylindrical rockers that In more recent studies to the highest static pressures, re-
can be translated along their cylindrical axes, which are searchers are using highly collimated beams of extremely
set orthogonally in the cell body. Thus, axial alignment is intense radiation, for example, that produced with syn-
achieved by appropriate translational adjustment, and fa- chrotron storage rings, to study small portions of samples
cial alignment is achieved by appropriate rocking motions; under pronounced pressure gradients.
both of these can be carefully controlled with small adjust- It is more common in the operation of DACs, however,
ment screws. In the other design, one diamond is mounted to employ a hardened metal gasket for containment of
in a hemispherical support whose orientation can be care- the sample. High-strength metals, such as stainless steel,
fully adjusted for the facal alignment; the other diamond Inconel, Waspaloy, or hardened Be–Cu, are often used. For
is mounted in a flat plate whose position can be translated purposes of alignment and extra hardening, the gasket is
in a plane normal to the diamond axes, thus allowing the usually prestressed by compressing it between the anvils,
axial alignment. frequently to about 50% of its original thickness. A hole
It is important to maintain the alignment of the dia- with a diameter of 12 to 23 that of the culet face is then drilled
monds during operation of the DAC. This is usually ac- in the center of the indentation. The indentation serves to
complished by placing one diamond on a piston that slides allow reasonably rapid recentering of the gasket between
in a closely fitting cylinder equipped with a guide pin to the anvils. This cylindrical hole, whose linear dimensions
prevent rotation during loading. Some designs provide for are typically a few hundred micrometers, constitutes the
alignment during operation, but this is generally unneces- sample chamber.
sary for sub-50 GPa work. The sample of interest is then loaded in the gasket hole,
In operation, one of the anvils is usually kept station- frequently mixed with some standard material to serve
ary and the other is driven toward it. The magnitude of as a pressure calibrant; NaCl or Au are frequently used
the force required for this is modest and can be devel- calibrants for X-ray studies, while the wavelength shift
oped in one of several ways. In the earliest design, a lever of fluorescence from ruby is also a commonly used pres-
arrangement is used to provide a mechanical advantage sure gauge. Although the gasket will help in providing a
of 3 or 4, and the force is manually generated through more uniform distribution of pressure, if a truly hydro-
compression of a load screw, either directly of through a static pressure environment is required, the sample and
stack of Belleville spring washers. Similarly, in the three- calibrant must be immersed in a suitable fluid. A mixture
bolt design referenced above, manual advancement of the of 16 parts methanol, 4 parts ethanol, and 1 part water will
screws will generate sufficient load to achieve the limits remain fluid to pressures just above 10 GPa at room tem-
of the DAC. Other mechanical systems involving levers perature. For hydrostatic conditions at higher pressures,
and pivots with screw adjustments, or a lever operated liquified gases are required. He, Ar, or N2 , condensed and
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
sealed at cryogenic temperatures, provide near hydrostatic ber of the refrigerator is built directly into the DAC. A
conditions to pressures well above 10 GPa. metal bellows chamber pressurized with He gas was used
Another application of diamond anvils should also be to generate the compressive force for the anvils.
noted here: researchers at Cornell University have been
experimenting with a spherical diamond indentor with a c. Experimental probes. As noted above, the major
tip radius of about 10 µm that is pressed into a flat diamond advantage of the DAC is its virtual transparency to a broad
face. The sample is contained between the two diamond spectrum of electromagnetic radiation. Some of the earlier
surfaces, and, as in the case of the ungasketed DAC, the studies in the DAC were made in the visible portion of
sample will extrude to the point where frictional forces be- the spectrum; refractive index changes, optical absorption,
tween it and the diamonds cause the sample to effectively and birefringence are several types of measurements that
form its own gasket. Since the contact area is controlled were performed on samples as they underwent various
by the tip radius, contact force, and deformation of the pressure-induced phase changes.
diamonds, it is very small, and modest loads can produce Optical studies were quantified by incorporating the
extreme pressures: values in excess of 100 GPa have been DAC into a spectrometer system, including appropriate
claimed. One major drawback with this arrangement is focusing optics. Work in the infrared region is usually
that because the sample is small, it is difficult to detect. performed using type-II diamond anvils; these allow trans-
The reported pressures have been determined from the mission studies to be carried out in the 1–4 and 5.5–15 µm
model used to represent the tip and flat, and therefore, regions, as well as the visible and near ultraviolet. Fluores-
reported pressures are vulnerable to the accuracy of the cence and spectroscopic measurements, both absorption
model. and Raman, have also been carried out in the DAC. An
abundance of research has been performed in the higher
b. Temperature. Elevated temperature studies with a photon energy regions as well: both angular and energy
DAC must be approached with the realization that the sta- dispersive Bragg scattering measurements have been used
ble form of carbon at atmospheric pressure is graphite, not to detect structural phase transitions, as well as measure
diamond. At normal temperatures though, the diamond-to- thermal expansivities and compressibilities. Extended X-
graphite transition is not observed because of the very high ray absorption (EXAFS) studies have been undertaken
activation energy associated with this transformation. But, with limited success in the DAC, with the major difficulty
if sufficient thermal energy is provided, this activation bar- being interference in the EXAFS patterns by Renninger
rier can be surmounted. Therefore, most high-temperature scattering from the diamond anvils. Using appropriate
research with DACs is restricted to below 800◦ C. gamma-rays, Mössbauer studies have also been performed
High temperatures in a DAC have been attained by two in a DAC.
methods: resistive heating and laser heating. In the for- Magnetic susceptibility and microwave absorption
mer case either the gasket itself can be used as a heating measurements have been performed in a DAC; this work
element or, more commonly, resistive heaters are used to has been directed primarily toward superconducting ma-
envelope the diamonds and gasket material. Temperatures terials. In this same context, some researchers have also
are usually limited to below 200◦ C in the former and to equipped DACs with electrical leads extending into the
below 800◦ C with the latter. Most recently, scientists from pressure cavity, thereby permitting electrical resistance
Los Alamos National Laboratory and the Naval Research measurements under conditions of varied pressure and
Laboratory have collaborated to develop a DAC that is op- temperature. Additional details of the various measure-
erated in a vacuum oven equipped with two concentric Ta ments performed in a DAC are discussed in the later parts
heaters. Pressures in excess of 10 GPa have been attained of this article.
at temperatures above 1200◦ C with this system.
The other heating technique employs a high-powered
laser beam that is focused on the sample. Using a pulsed B. Dynamic Pressures
laser, a team at Cornell university has achieved temper-
1. Fundamental Principles
atures in excess of 5400◦ C for brief periods of time, ac-
tually melting diamond, and using a pulsed YAG laser, The most obvious difference between static and dynamic
temperatures of 2000◦ C have been sustained at pressures pressures is that of duration: in most static systems, once
of 2.5 GPa. the pressure is set, it is generally considered to be con-
Research has also been carried out with DACs at cryo- stant in time, whereas shock pressures result from large
genic temperatures. Researchers at the Naval Research amplitude waves passing through matter and are necessar-
Laboratory have attained temperatures down to 30 mK at ily of brief duration, typically on the order of microsec-
pressures up to 10 GPa by coupling a DAC to a liquid-He onds. Another difference between the two techniques is
dilution refrigerator. In this apparatus, the mixing cham- that static pressures can be applied isothermally, whereas
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
shock waves are generally accompanied by large thermal (2) uniformity of the disturbance across the sample face,
excursions. Final, static pressures are three dimensional in and (3) the decompression or release processes. The most
character and often hydrostatic, whereas the shock pres- common methods for the production of shock waves are
sure is often considered to be uniaxial, the result of a two- detonation of explosives, either directly in contact or driv-
dimensional shock front passing through the sample. The ing a flyer plate, and impact of a projectile from a gun. Ex-
development of pressure results from the inertial response ploding foils or wires and imploding magnetic fields have
of matter to a rapid acceleration. Although several differ- been used occasionally. Laser-generated shock waves are
ent shock techniques will be discussed, all depend on the becoming more common, especially in the light of cur-
sudden application of force to a surface of the target for rent national defense strategies. Electric rail guns promise
the initiation of a shock wave. interesting results at the upper pressure limits, but devel-
A shock wave represents a thin region, typically 1 nm opment has been sporadic. Finally, nuclear-driven shock
wide, of a material over which there is a discontinuity studies have resulted in the highest dynamic pressures and
in stress, density, material velocity, and internal energy. have tested the extension of existing equation-of-state the-
This region travels at supersonic velocity with respect to ories to pressures in the tens of tesla Pascal regime.
the material into which it progresses. Although impacts
normally generate a region of rapidly increasing pressure,
2. Experimental Techniques
building to a maximum, a shock wave results because the
pulse velocity increases with increasing pressure. This is a a. Explosively driven shock waves. Interest in the
fundamental requirement for the establishment of a shock effects of explosives on materials originated with military
wave; it leads to a sharpening of the disturbing pulse to a concerns for the consistent performance of these devices
steplike discontinuity. Since all forms of shock-generating and later with the related issue of armor penetration. In
force are short lived, or the target moves away from the the latter case, spalling became a major issue. This is the
disturbed region, a low-pressure region, or release wave, fracturing of part of the target on the opposite side from
is launched into the target following the shock wave. The the impact due to a tensile failure in the interior where two
release wave travels in a denser material because of the release waves have met. Interest in this area was markedly
preceding shock wave; therefore, it travels faster and even- increased by the need for predictable, well-controlled use
tually over takes and destroys the shock front. of explosives in the original atomic bomb triggering se-
Shock studies have depended heavily on the concurrent quence. It was this factor too that stimulated the accumu-
development of theoretical models or codes that allow the lation of large amounts of data on the effects of shocks on
researcher to predict the effects of shock waves on specific various materials, and that forms the basis for many of the
materials. Materials are frequently treated as if they were research programs currently underway.
liquids in these codes; however, the errors resulting from There are two ways of explosively generating shock
such approximations are generally small because of the waves: one in which the explosive is in direct contact with
extreme pressures involved (i.e., generally above 10 GPa). the target and the other in which the explosive launches a
The parameters that are typically measured in a shock flyer plate that then impacts the target. In the first case, it is
experiment include the shock velocity, the particle velocity important that the explosive impact reach all points on the
behind the shock front, and, more recently, the pressure front of the target simultaneously. Since detonation usu-
and temperature. Within the experimental uncertainties, ally starts at a point, or along a line, the explosive burn will
the shock velocity is usually found to be a linear function take place along a spherical or cylindrical front, respec-
of the particle velocity. tively. Some commercial suppliers of explosives provide
Using the principles of conservation of mass, energy, triangular sheet line generators that are perforated with
and momentum across the shock front, a set of three equa- an array of holes that serve to break up the curved shock
tions can be derived that relate the final density, internal front into a series of many smaller fronts that approximate
energy, and pressure (or stress) to the initial values for a line.
the material and the shock. The locus of values derived One form of plane wave generator is called a “mouse-
from a number of experiments forms a curve that defines trap” (see Fig. 7). It consists of a sheet of explosive
the final states that can be reached for a given material in material laid on a thin, inert (glass or metal) driver plate
the shock velocity-particle velocity plane, or the stress- that is inclined above the main charge at an angle θ , such
volume plane. This curve is generally referred to as the that sin(θ ) equals v/d, where d is the detonation velocity
Hugoniot, since it defined the Rankine-Hugoniot equation down the sheet over the driver plate and v is the resultant
of state for the material with respect to its initial state. velocity of the plate from the pressure generated by the
Loading methods will influence the precision of three detonated gases. Initiated at its upper edge, the driver plate
parts of the shock experiment: (1) control of the pressure, strikes the main charge at all points simultaneously and
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
the latter case is the mousetrap where the driver plate hits
the target rather that a main charge.
The pressures that can be obtained using these tech-
niques range up to a few tens of gauss Pascals. The primary
advantage is the relatively simple set up and correspond-
ingly low cost.
One advantage of the laser initiation is that the energy beam itself can be used for diagnostic purposes at the
deposited by the laser beam results in a shock wave that target.
is initiated simultaneously over the irradiated area. Typ- Plasmas also generate X-radiation with wavelengths
ically, the laser beam is focused to a spot size with a varying according to the target composition. This radia-
diameter of up to 1 mm. Irradiances greater than about tion can be used either directly, by recording the shock on
108 W/cm2 are required to ignite a plasma and produce an X-ray streak camera, or indirectly, by exciting X-rays
a shock wave. These small impact areas lead to large from a second target that are then used to monitor the shock
energy losses due to two-dimensional expansion of the passage. Shock velocities can be determined by monitor-
plasma. Edge effects will also be significant in evalua- ing the light generated when the shock reaches the back
tion of the shock. The facts that laser pulses normally surface of the target using a streak camera.
last no more than a few hundreds of nanoseconds and
the plasma dissipates very rapidly after the energy in- e. Nuclear-driven shock waves. Nuclear explo-
put ceases result in the launching of the aforementioned sions have been used to obtain equation-of-state data for
release wave, which rapidly overtakes and destroys the many years. Pressures obtained with this method have
shock front. As a result of these constraints, effective approached 7 TPa, and although this seems to be less
sample thicknesses have been limited to a few tens of than those attainable with laser beams, the target area was
micrometers, although the pressures attainable with these nearly 30 cm in diameter and the nuclear device was det-
systems can range up to 10 TPa. This pressure range is of onated about 3.5 m away from the sample.
considerable interest in the development of equations of Optical techniques are preferred for the initial signal
state. generation because the electromagnetic interference is ex-
Not all of the laser energy is effective in the gen- treme. Usually, the shock velocity is measured for a ref-
eration of the shock wave; some is lost in the forma- erence material and several samples mounted on the ref-
tion of the plasma, some is carried off with the plasma, erence material target plate. Impedance matching is used
and some is reflected without effect. The absorption effi- to derive the particle velocities of the samples from the
ciency of most materials increases with the photon en- known equation of state of the reference material.
ergy, ranging from about 30% in the near infrared to Access to nuclear tests is difficult, and although the
about 90% in the near ultraviolet. Short wavelengths of- technique appears to be comparatively simple, not only
fer another advantage, namely, the decrease in produc- the sample, but most of the expensive signal processing
tion of suprathermal electrons. This, in turn, lessens the equipment is lost in the process; thus, experiments are
electron preheat of the target prior to passage of the limited to materials of the greatest interest.
shock front. X-ray production, however, is enhanced at
shorter wavelengths and for higher Z materials, which may f. Electric rail guns. These devices represent a vari-
cause some target preheating. Laser beams may have lo- ation of the gas gun; however, here the force used to ac-
cal fluctuations in energy density, or “hot spots,” which celerate the projectile is electrical. The “gun” is made of
can lead to nonplanarity of the shock front. These ef- two parallel, electrically conducting rails. The projectile,
fects are minimized at longer wavelengths by thermal con- a conductor shorting the rails, rests between them. The
duction, but are more pronounced at shorter wavelengths system forms a linear dc motor, and the acceleration of
because the energy is deposited closer to the ablation the projectile is enhanced by the formation of a plasma
surface. behind it, thus aiding in its acceleration. Although several
The flyer plate concept has also been applied to laser- groups have experimented with this method, difficulties
generated shocks. Carbon disks have been irradiated with have prevented its fully successful implementation. Theo-
a 3-nsec laser pulse, generating a shock pressure of 0.5 TPa retical estimates indicate that ultimate impactor valocities
and accelerating the disk to a velocity of 100 km/sec. Im- as high as 40 km/sec should be achievable; these would
pact of this disk on a second disk delivers the energy in lead to shock pressures up to 10 TPa.
a much shorter time, resulting in the production of 2-TPa
shock pressures.
3. Measurement Techniques
A conventional shock diagnostic apparatus is electrical
in nature. The intense electromagnetic storm generated by Evaluation of shock experiments requires the measure-
laser and particle beams is an extremely hostile environ- ment of any two of the four shock variables: Up , particle
ment of these sensitive detectors. Lasers, however, bring velocity; Us , shock wave velocity; P, pressure; and V ,
their own solution. Because the timing of the laser pulse specific volume. The three conservation laws (i.e., that
can be precisely established, the incoming laser pulse can of mass, linear momentum, and energy across the shock
be used to trip optical diagnostics, or a portion of the laser front) are then used to calculate the other two parameters:
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
ρ0 /ρ1 = 1 − (Up /Us ) of the target; this produces a bright flash of visible radi-
ation that can be detected with a streak camera. Since it
P1 − P0 = ρ0 Us Up
may be difficult to establish the timing of the impact, a
E 1 − E 0 = [P1 + P0 ][V0 − V1 ]/2, stepped target is often used that provides two signals of
shock arrival with a well-defined separation, that, along
where E refers to the energy, ρ refers to the density, and with the timing information from the streak record, gives
the subscripts 0 and 1 refer to the variables ahead of and the shock velocity. This concept is also employed by using
behind the shock front, respectively. gaps that are prefilled with a gas, such as air, Ar, or Xe, or
Shock velocities can be measured in several ways. The a liquid, such as CCl4 . These materials all emit light when
average valocity can be calculated from the sample thick- impacted by a shock wave; the brightness of the light in-
ness and by timing the impact and exiting of the shock creases with the intensity of the shock, thus making this
front. Electrical pins that short upon the arrival of the shock technique useful in the lower pressure regions. If lucite
front are used to obtain timing data. These pins can con- is used as the transparent rear cover for the flash gap, it
sist of an insulated conductor separated from a grounded becomes opaque when the shock reaches it, quenching the
metallic target or a metallic conductor in a plastic, insu- signal to the recording device.
lated rod that is coated with metal forming a grounded Some polished surfaces change their reflectivity when
sheath. The end of the rod is capped with a thin insula- shocked. This allows the use of mirrors at the impact and
tor, and the central conductor is charged with a dc voltage rear surfaces of the target. A streak camera is used to
of up to several hundred volts. When the shock wave en- record the shock events. The shock-opacity principle can
counters the end cap, the center conductor is shorted to be applied here also: when the shock wave reaches the rear
ground, generating a rapidly rising pulse on a line, which surface, the reflectivity of the polished surface is greatly
is then recorded. The separation of the bare contact from altered and readily detected.
the grounded target or the thickness of the insulating cap Optical measurements are frequently carried out in
is minimized, consistent with the need to prevent shorting evacuated systems in order to minimize the effects of air
for the level of readout voltage used. If closing time is to shocks around the rear or free-surface of the target. An-
be no more than 100 nsec, then the gaps must be no more other type of optical technique involves using the mirrored
than 10 µm wide. rear surface of the target as an element in and optical lever
When electrical noise is a problem, optical pins, con- system. With this method, a streak camera is used to record
sisting of small microspheres containing a pressurized gas, reflections of a series of point light sources recording the
such as Ar, are affixed to the end of optical fibers. As the passage of a shock wave, or the motion of the image of
shock front passes, the trapped gas emits light that is car- a fine wire by the moving free surface. If a liquid cell is
ried along the fibers to remote detection equipment. These used, a thin foil may be suspended in it at some angle to
pins can also be multiplexed onto a single streak camera the expected shock front. The shock velocity in the fluid
record. can be determined, if the fluid remains transparent and the
Pins of either type can be placed at various steps in foil remains reflective when shocked.
the target or in holes of carefully determined depth. They Lasers, because of their brightness, monochromaticity,
should, of course, be separated by sufficient distance so and coherence, have led to many new measurement tech-
that the release wave generated by one hole or step will niques. They can be used as light sources in those tech-
not interfere with the readings of adjacent pins. niques in which brilliance is important. Their monochro-
Electrically conducting targets can be used as one plate maticity allows them to be used as interferometers for
of a capacitor; in this case, the velocities are determined velocity measurements. Interference fringe patterns are
by measurement of the variations in capacitance as the established when partially reflected light from the front
target moves. This techniques provides information about surface of the target interferes with light reflected from
the position, the free-surface velocity, and structure of the the moving face at the rear. After the effects of the initial
wave front. shock pass, the steady fringe reading corresponds to the
Application of a strong, uniform magnetic field orthog- particle velocity in the window material. The timing of the
onal to both the direction of shock propagation and to a passage of the shock through the window material gives
conductor through which the shock will pass will generate the shock velocity. These two pieces of data define a point
an electrical voltage when the conductor is moved by the on the Hugoniot curve.
shock. The large magnetic field requirements limit more Since X-ray interactions with matter generally in-
routine application of this technique. volve low-level electronic processes, they are usually con-
The shock arrival at the rear surface can be detected sidered to be unaffected even by severe shock waves.
optically for higher shocks because of the intense heating Pulse X-ray sources are needed to study transient
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
phenomena. Laser-generated plasmas have been used as or the variation of the free energy with change in volume.
pulse sources. The free piston gauge, or dead-weight piston gauge,
Since pressure is also one of the primary variables, sev- makes use of the former. Presuming an exact knowledge
eral methods have been used to obtain estimates of its of the piston area, the force is fixed by a set of calibrated
value in the shock front. The variation of the electrical re- weights balanced on the piston. Friction between the
sistance of manganin wire has long been used as a means piston-cylinder interface is minimized by rotation of the
of measuring static pressures; the linearity of this variable piston.
with pressure has been shown to be valid to 30 GPa un- Several techniques are used to compensate for distor-
der dynamic conditions. The resistance of other materials tions at elevated pressures: (1) extension of the cylin-
will also vary with pressure, but their resistance–pressure der under pressure is restricted by a separately, pressur-
curves are either not linear or they involve large temper- ized jacket surrounding the cylinder; (2) calculations of
ature coefficients. Both tourmaline and quartz have been changes from engineering principles; (3) comparison of
used as piezoelectric pressure transducers. They have been two identical systems using different materials. These sys-
calibrated up to 2 GPa, probably the limit of their elastic tems are limited in pressure by the strength of the materials
behavior. Polyvinyl fluoride is a polymer film that can be used in their fabrication; with tungsten carbide, measure-
poled to give a piezoelectric gauge, which is finding ap- ments are routinely reported to 2.5 GPa, although some
plication. Ferroelectrics, such as lead zirconate titanate, work has been carried to pressures as high as 6 GPa. In a
have also been used. recent program to compare pressure scales between 13 in-
Another electrical technique is based on the observation ternational metrology laboratories, variances in raw data
that many materials either lose polarization or become amounted to no more than 78 ppm. Although this is within
polarized upon the passage of a shock. In the first case, the combined uncertainities, the derived calibration con-
electrodes on the two faces of the material being shocked stants (i.e., the slopes of calibration curves) disagreed in
will generate a voltage through an external resistance; the a more marked manner, suggesting fundamental differ-
value of the voltage will be proportional to the pressure ences, particularly between the controlled displacement
and sample area and inversely proportional to the sample and other systems.
thickness. A thermodynamic pressure scale, similar to the thermo-
Since each of these techniques requires electrical con- dynamically defined temperature scale, would be another
tact to sensors in the shock regime, they are each prone approach to an absolute pressure scale. Although this has
to many difficulties, including shorting by shock-induced been proposed, it has not been implemented as of this
conductivity in the gauge or support material, or loss of writing. The volume can be defined as the variation of the
contact. free energy, G, with pressure at constant temperature. Us-
A number of other shock phenomena have been noted ing an electrochemical cell, this is also proportional to the
in shock studies; some of these are unique. Two examples variation of the cell potential φ, with pressure; that is,
are phase transformations, either reversible (and there-
V = [(∂G/∂ P)T ] = −n(∂φ/∂ P)T ,
fore present only in the compressed state) or metastable
(such as the graphite-to-diamond transition), and changes where the parameter n is dependent on the particular elec-
in conductivity, particularly of semiconductors and insu- trochemical cell employed. This method has been infre-
lators that become conducting in the vicinity of the shock. quently used. One of the more difficult problems is the
Shifts in the Curie point with both pressure and tempera- variation in the ionic conductivity in the cell as the pres-
ture lead to shock-induced demagnetization of ferromag- sure is varied.
netic materials. Luminescence has been observed for some
materials, while some transparent materials such as poly-
B. Secondary, Practical Scales
methyl methacrylate (PMMA) and NaCl become opaque
during shock compression. In some cases, the mechanical Like its thermodynamical counterpart, temperature, pres-
strength of a substance increases markedly upon the pas- sure is most frequently determined from the variation of
sage of a shock. Post-shock studies span the full range of some previously calibrated state coordinate. Variations in
possibilities and are considered below. crystallographic volume, electrical resistivity, or fluores-
cence wavelengths have all been used with success.
underway to develop such equations from first principles Similar devices have also been introduced as pressure
calculations (i.e., from basic physical laws and interatomic sensors. The ideal material for pressure calibration would
potential functions), most materials today are treated have a small thermal coefficient of resistivity, a minimal
with semi-empirical EOSs. These employ one or a few hysteresis, and preferably a linear pressure response over
parameters derived from measurements, for example, the as large a range as possible. Manganin, a Cu–Mn alloy, is
volume dependence of the Grüneisen parameter. a material that has long been used for this purpose. Unfor-
Shock wave data have been used for EOS development tunately, the pressure coefficients tend to vary somewhat
at much higher pressures and temperatures than those en- from sample to sample, and therefore, each lot must be
countered in static experiments, for example, to thousands separately calibrated. Manganin pressure gauges have
of Kelvin temperature and hundreds of gauss Pascals pres- been used in static systems to pressures of 6 GPa. They
sure. Researchers have carried out combined studies on the have also been used in shock work, but significant correc-
same materials at the upper end of available static pres- tions for shock temperatures and errors derived from the
sures and the lower end of the dynamic pressure range to creation of point defects tend to limit the accuracy of the
provide reliable calibration materials. results.
Other materials used for this purpose include Au–Cr
with about 2% Cr and zeranin, formed of Cu, Mn, and Ge.
2. Calibrated Fixed Points These materials were originally used as resistance stan-
Structural phase transitions that occur in materials and can dards. The first has an appreciable temperature coefficient
be readily detected from discontinuous changes in their that limits its utility; the latter has not been in use for
physical properties are often used as specific pressure cal- as long, but seems to have the same desirable qualities
ibration points. For example, the freezing pressure of Hg at as manganin in addition to a more rapid recovery and a
0◦ C, which can be detected from a discontinuous change higher resistance to oxidation.
in the volume, has been accepted to be 0.75692 GPa for
many years. Also, Bi exhibits discontinuous changes in its c. Superconducting transition temperatures.
electrical resistivity at room temperature and 2.5499 and Early calibration techniques at cryogenic temperatures
7.7 GPa. One difficulty in employing these phase transi- grew directly from interest in the effect of pressure
tions is that they can sometimes be sluggish and can be on superconducting transition temperatures, Tc . These
affected by the hydrostatic nature of the pressure. measurements can either be performed by monitoring
the sample resistance, which drops to zero at Tc , or the
magnetic susceptibility, which will change abruptly at Tc
3. Other Pressure Scales due to the Meissner effect.
In, Pb, and Sn have all been calibrated and used in this
a. Ruby fluorescence. An excellent secondary
regard. However, one difficulty in this work is the lack of
pressure scale that has become very popular with the ex-
other low-temperature calibrants against which to fix the
tensive use of the diamond-anvil cell is one based on the
Tc ’s. Typically, a known fixed pressure is sealed in a bomb
pressure-induced shift in the wavelength of the R2 fluores-
at room temperature, after which the system is cooled to
cence peak from ruby. Since temperature will also cause
determine the shift in Tc . The problem is that there is likely
a shift in this wavelength, a thermal correction factor of
to be a change is sealed pressure due to differences in
0.068 Å/K must also be applied. Based on EOS studies
the thermal expansivity of the various components of the
with a number of materials (NaCl to 30 GPa and Ag, Cu,
system. It is difficult to accurately correct for this change.
Mo, and Pd to 100 GPa), the ruby line shift is given by
P(GPa) = 380.8 (λ/λ0 )5 − 1 , d. Semiconductors. Since the resistivity of semi-
conductors is also sensitive to pressure, they too can be
where λ and λ0 are the ruby fluorescence wavelength at used as pressure calibrants. These have the advantage that
elevated and atmospheric pressures, respectively. Based they will typically be significantly more responsive than
on Au and Cu EOS studies, the foregoing calibration has metallic resistors; however, they also tend to have large
been found valid to 200 GPa to within 5%. Most recent temperature coefficients. In one case, Sb-doped Te has
experiments in a diamond-anvil cell have led to and ex- been shown to have an exponentially decreasing resistiv-
tension of the ruby scale to ∼500 GPa, although the ruby ity up to 0.65 GPa: specifically, the ratio of the resistance
signal tends to be obscured by diamond fluorescence in at pressure P to that at atmospheric pressure has been
the 150–300 GPa range. reported to be exp[−1.122 P(GPa)]. Doping GaAs has the
effect of decreasing its temperature sensitivity while
b. Electrical resistance. Calibrated electrical resis- increasing its pressure sensitivity to a value approaching
tance sensors have long been used as temperature gauges. that of manganin.
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
e. Capacitance. Pressures can also be determined scattered radiation must be analyzed over a spatial range
from variations in capacitance. In this case the dimensions to determine the values of θ for which the Bragg equation
of the solid dielectric used in the capacitor will change is satisfied. Alternately, if heterochromatic or white radia-
in both area and thickness as a function of increasing tion is used, then the diffraction geometry (i.e., θ) is fixed
pressure. These gauges will typically have a resolution and the scattered photons are analyzed in terms of their
as small at 0.07/MPa. However, they too must be operated respective energies.
under conditions of extreme temperature control and ne- An important experimental detail in this work is to pro-
cessitate at least three electrical contacts into the pressure vide a window into the pressurized region with a relatively
chamber. These requirements, along with their relatively low absorption coefficient for X-rays. As noted above, the
low range of applicability, have limited the use of such diamond anvils of the DAC satisfy this requirements very
gauges. well: the X-ray absorption cross section for carbon de-
creases by a factor of 4000 over the photon energy range
from 1 to 20 keV. Other materials that also have been
IV. RESEARCH AT HIGH PRESSURE used as high-pressure X-ray windows are pryophyllite and
B4 C. The latter material has the advantage of being amor-
High-pressure research involves examining the response phous and therefore will not contribute substantially to the
of materials to the environment of extreme pressures. This diffraction pattern of the sample.
examination, usually in the form of a measurement of one Nevertheless, it is often difficult to bring a sufficient
or more physical parameters, can either be made in situ number of X-ray photons in and out of the pressurized
(i.e., while the pressure is changing) or upon completion environment in a short time period. Consequently, X-ray
of the pressure excursion. Although the latter experiments diffraction experiments can require very long exposure
are easier to perform, since they do not require access to periods. Typically, tens to hundreds of hours are required
the high-pressure environment during pressurization, they for a single measurement with a DAC when conventional
have the undesirable drawback of requiring a metastable X-ray tubes are employed.
product. One means of accelerating this process is to employ a
The objective of most of this research is to assess the ef- much brighter source of radiation. In the past few years,
fect of altering interatomic distances on a variety of phys- most of the high-energy synchrotron storage rings in the
ical phenomena, for example, local, short range order; world have been used for high-pressure structural re-
crystalline structure; electrical resistivity; and mechani- search. Since the X-ray flux available with these machines
cal, magnetic, and optical properties. In short, most phe- is many orders of magnitude greater than that available
nomena that can be measured at atmospheric pressure can with conventional radiation sources, the measurements
also be studied at elevated pressures. can usually be completed in much shorter time intervals;
exposure periods of minutes or seconds are typical. In
addition to speeding up the entire process, these brighter
A. Structural Measurements
X-ray sources also permit measurements that would not
One of the most fundamental properties of a condensed otherwise be feasible, for example, phase transition ki-
matter system pertains to the arrangement of atoms of netics. Experiments performed recently at the Stanford
which that system is formed. In the majority of materials, Sychrotron Radiation Laboratory involved monitoring the
this is the crystallographic structure. A wealth of high- structure of a number of alkali–halide salts in 1-min time
pressure research is directed toward the examination of intervals as they were driven through a first-order struc-
changes in crystal structure under pressure. Such studies tural phase transition with increasing pressure. Parameters
generally involve the application of basic X-ray diffrac- in the equations describing these phase transitions can be
tion techniques. A material is illuminated with either determined from these measurements.
monochromatic or heterochromatic X-rays, and the scat- In the quest for higher pressures, researchers using
tered photons are analyzed to determine crystal structure. DACs are employing diamond anvils with beveled tips;
This can be accomplished through utilization of Bragg’s the consequence of this, in addition to pressures in excess
equation: of 500 GPa, is a further diminution of the sample volume.
Thus, even brighter X-ray sources will be needed for future
Edhkl sin θ = hc/2,
work. Efforts have been made to employ devices inserted
where E is the energy of the X-ray photon, dhkl is the in the synchrotron rings to further increase the emitted
interatomic planar spacing, θ is the Bragg diffraction an- photon flux, so-called wigglers and undulators, for mate-
gle, h is Planck’s constant, and c is the speed of light. If rials studies on samples contained in a region only a few
monochromatic radiation is used, then E is fixed and the microns in diameter.
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
Another type of high-pressure X-ray measurement that lations indicate that at elevated pressures, hydrogen will
has recently been advanced through the utilization of syn- undergo two transitions from its highly ordered diatomic
chrotron radiation sources is EXAFS, or extended X-ray insulator state, first to a nonmetallic monatomic structure,
absorption fine structure. Analyses of X-ray absorption followed by a transition to a metallic state. Unfortunately,
data is complementary to standard crystallography in that there are many possible high-density phases for hydrogen,
it allows determination of nearest and next-nearest dis- all with very similar values for the free energy. Therefore,
tances of specific atoms, coordination numbers, and ther- it is difficult to forecast, with certainty, the properties of
mal vibrational properties. A difficulty encountered in metallic hydrogen or whether it may be metastable un-
EXAFS measurements performed with a DAC is the pres- der normal pressure and temperature conditions once it
ence of Bragg or Renninger scattering in the absorption is formed. The imagination and excitement of scientists
spectrum. To circumvent these problems, amorphous ma- throughout the world, however, have been captured by
terials such as B4 C are used to contain the pressure and some serious theoretical predictions of very high temper-
provide and X-ray window. ature superconductivity in metallic hydrogen . . . if it can
These structural measurements are pursued for several be created.
scientific reasons: to determine compressibilities or equiv- As a prelude to this, other systems that are more easily
alently bulk moduli, to detect structural phase transitions, studied are being examined. Iodine, for example, at pres-
and to identify new crystalline phases. Frequently, experi- sures of 13 to 17 GPa undergoes an insulator–metal tran-
mental work is closely coordinated with theoretical model sition, and at about 21 GPa it converts from a diatomic to
calculations, the latter often predicting the existence of a monatomic conductor. Similar studies on other halides
possible new and interesting phases. would also seem appropriate (i.e., the metallization and
subsequent dissocation in Br2 , Cl2 , and possible F2 ). Ma-
terials that have been metallized under pressure include
B. Electrical Measurements BrI, HI, CsI, BaTe, BaSe, BaS, and BaO.
Another interesting pressure-induced electronic phe-
1. Nonsuperconducting Materials
nomenon involves valence changes. The valence states in
Electrical measurements are usually directed toward mon- Yb, Eu, Pr, Sm, and Ce can all be shifted through applica-
itoring the electrical resistance of a sample under pressure, tion of pressure. In Ce, this valence change manifests itself
although other measurements may also require the incor- in a very interesting, isomorphic phase transition. As the
poration of electrical connections to the pressure chamber. material changes valence state, there is a first-order phase
For example, the pressure dependences of elastic mod- transition with approximately a 15% volume change but
uli have been measured ultrasonically with transducers no change in crystalline structure (i.e., each phase is face-
bonded to the pressurized sample. The most difficult as- centered cubic).
pect of this work is providing a feed-through that will not
significantly attenuate the electrical signal and will con-
2. Superconducting Materials
tain the pressure. In some cases the anvils themselves are
used as probes, for example, as noted above, tetrahedral The practical applications of superconducting materials
presses are constructed with each of the four rams elec- are almost limitless. Power transmission lines; electrical
trically isolated, thus four-probe resistance measurements motors and generators; magnetic levitation for, for exam-
can readily be carried out. ple, high-speed transit systems, and computer electronics
The electronic properties of solids are related to the are but a few. A major problem limiting the utilization
characteristics of the ground state and various excited of superconducting wires is the simple fact that the best
state energy levels. Since these levels can be affected by known superconductor, Nb3 Ge, has a transition tempera-
pressure, much of the research in this area is concerned ture, Tc , no higher than 23.2 K. Therefore, any realistic
with how the states will move under pressure. Professor application of superconductivity must also include an ap-
H. G. Drickamer of the University of Illinois has intro- propriate means for cryogenic refrigeration, and therefore,
duced the phrase “pressure tuning of electronic energy in most cases, the gains that would otherwise be realized
levels.” are obviated by the temperature demands.
An area that is currently receiving considerable atten- Since there is no theoretical reason why the coop-
tion in this regard involves pressure-induced metallization. erative electronphonon coupling mechanism that is re-
Although this phenomena has been studied in a number quired for normal (BCS) superconductivity cannot take
of materials, perhaps none holds more interest than the place at higher temperatures, there are major research
prospect of metallizing hydrogen. Originally predicted by programs underway worldwide to discover new materi-
Wigner and Huntington in the 1930s, recent model calcu- als with higher Tc ’s. Guidelines for these efforts can be
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
either empirical or theoretical. As an example of the for- organic salts at cryogenic temperatures. It is believed that
mer, in V-based A3 B compounds that crystallize in the such information would prove extremely useful in char-
A15 structure, such as V3 Sn, V3 Ge, and V3 Si, it has been acterizing the nature of the origin of superconductivity in
−1/2
noted that a plot of Tc vs m B , where m B is the mass of these materials.
the B-atom, tends to be linear. On the basis of this it has
been predicted that the Tc of Nb3 Si should surpass that
C. Melting/Freezing Phenomena
of Nb3 Ge. Since the A15 phase of Nb3 Si is expected to
the denser than its normal Ti3 P structure, it is expected The transition between the condensed and liquid phases
that pressure should be favorable to conversion of Nb3 Si of matter is perhaps one of the most important and least
to the A15 structure. Shock experiments have produced understood in the field. It is a problem that touches a broad
this transformation. Unfortunately, the measured Tc is not cross section of disciplines: condensed matter physics,
as high as expected; the explanation is believed to lie in rheology, metallurgy, and the geosciences. From a techno-
defects introduced during the shock conversion. logical viewpoint, an understanding of these issues is very
One of the most promising classes of materials in terms important for a variety of materials-related industries, for
of potential high-Tc superconductors is the recently dis- example, those dealing with semiconductor devices, ce-
covered organic salts. These are often one dimensional ramics, and optical components. For example, it is still
or quasi-one dimensional and are promising because they not known why pressure will enhance the crystal growth
do not involve the usual electron–phonon (BCS) mecha- rate in some systems.
nism for superconductivity. Rather, the electron–electron The pressure dependence of the melting temperature,
interactions are mediated through excitations of a Peierls Tm (P), has been measured for a wide variety of materials.
ground state or excitons. The first organic supercon- It is generally found that dTm /dP is positive and the Tm (P)
ductors are from the ditetramethyltetraselenofulvalenium curve is usually fit to some phenomenological or empirical
(TMTSF)2 X family, where X represents a suitable anion. relation, for example, the Lindemann equation. Although
At room temperature and ambient pressure, all of these recent advances in first-principles calculations of Gibbs
salts, independent of the anion, exhibit the same crys- free energies from effective interatomic or intermolecular
tallographic structure, electrical resistivity, and thermo- potential functions have been most encouraging, a micro-
electric power. However, at low temperature and/or high scopic theory explaining the solid–liquid transition is still
pressure, a variety of ground states may exist. Many of lacking.
these salts have a spin density wave at low tempera- An area related to this concerns crystallization phenom-
ture and ambient pressure, but at elevated pressures of ena in amorphous solids, for example, metallic glasses.
0.6–0.8 GPa, they become superconducting with a Tc of “Met-glasses” represent an important new class of mate-
1.2 K. One of these salts, (TMTSF)2 CIO4 , becomes super- rials with certain improved physical properties, for exam-
conducting at ambient pressure with about the same Tc . ple, superior radiation and corrosion resistance. It has been
Other TMTSF-salts exhibit high-temperature metal– demonstrated that hydrostatic pressure can raise the crys-
insulator transitions associated with anion ordering. How- tallization temperature of metallic glasses by 10–20◦/GPa;
ever, it has been demonstrated that pressure can be used this effect is still not understood. Moreover, detailed struc-
to effectively suppress these transitions, thereby leav- tural analyses of high-pressure crystallization phenomena
ing the material in the metallic state at low tempera- have yet to be carried out. A number of outstanding ques-
tures. In these cases the materials will undergo a super- tions in this area remain to be answered.
conducting transition. These salts also appear to pass 1. Why does pressure inhibit crystallization? Is this
through a glassy phase in which the resistivity coeffi- related to the instability of favorable crystalline phases
cient is negative at intermediate temperatures; they then thought to allow formation of metallic glasses from the
undergo superconducting transitions at lower tempera- melt?
tures. One of these, (TMTSF)2 FSO3 , exhibits the highest 2. Are the crystalline phases produced at high pressure
values of Tc known to date for organic compounds, ≥3 K. different from those formed at ambient pressure, and, if
The variables that determine under what conditions su- so, what does this imply?
perconductivity occurs or the nature of the “glassy” phase 3. What are the changes that occur on cooling and
are not at all understood. The facts that the proposed spin release of pressure, and can they be understood in terms
density wave ground state of these systems is very sensi- of more subtle structural changes that occur during
tive (1) to pressure and (2) to the type of anion used sug- heating?
gest that this is a structural issue. For this reason, it would 4. What is the crystallization nucleation mechanism,
be important to carryout extended high-pressure, single and are there precrystallization phenomena that may shed
crystal, X-ray diffraction experiments on many of these light on the mechanisms involved?
P1: GNB Final Pages
Encyclopedia of Physical Science and Technology EN007G-315 June 30, 2001 17:33
D. Materials Modification by Shock Treatments flyer plates, or impact by gun-driven projectiles. How-
ever, shock conditions can also be achieved by subjecting
It is recognized that the state of some materials can be
materials to intense pulses of radiation, for example, from
significantly altered through application of shock pres-
electron beam accelerators, X-ray or neutron sources, or
sures. Although no comprehensive theory has yet been
high-intensity lasers. The pulsing methods hold an impor-
developed to explain this phenomenon, it is presumed that
tant advantage of relatively high repetion rate, in contrast
the rapid, massive shear deformation induced by the shock
to methods dealing with explosives.
waves and the concomitant defect state that follows is re-
Perhaps an even more important advantage of laser-
sponsible for what has been perceived to be a unique state
driven shock studies is that the same laser pulse can be used
of matter in the post-shock material.
to produce both a shock and a high-temperature plasma
Anomalous behavior in post-shocked materials has
that will emit X-rays to probe the shock. Exquisite timing
been identified in a number of separate areas: enhanced re-
will be possible using time-of-flight delay methods. These
activity in, for example, structural phase transitions, chem-
advantages suggest that laser-driven shock states may be
ical reactions, and sintering processes; enhanced atomic
uniquely useful in some material processing or testing
migration as seen in radioactive tracers and thin films of
purposes. Hardening, welding, and compaction have been
materials deposited at interfaces; shock-induced polariza-
demonstrated using shocks generated by explosives; the
tion in polymers and ionic crystals; shock-induced opacity
possibilities of using lasers for these purposes has yet to
in optical materials; bleaching of color centers; formation
be assessed.
of color centers; anomalous shifts in absorption bands;
saturation of dislocation densities without deformations.
It is likely that many, if not all, of these anomalies are
related to the massive defect state left in the wake of the SEE ALSO THE FOLLOWING ARTICLES
shock wave. A wide variety of micro- and macrostruc-
tural effects have been observed. Dislocation multiplica- DENSE MATTER PHYSICS • DIAMOND FILMS, ELECTRICAL
tion, twinning, and void formation are examples of some PROPERTIES • HIGH-PRESSURE SYNTHESIS (CHEMISTRY)
of the small-scale effects; spallation and flow are promi- • PULSED POWER SYSTEMS • SUPERCONDUCTIVITY
nent among large-scale effects of shock loading.
Shock effects influence the surface and near-surface
characteristics of materials. The surface hardness of
2024 Al alloys was found to increase with increas-
BIBLIOGRAPHY
ing shock pressures produced by flyer plates. Explosive
Akimoto, S., and Manghnani, M. H. (1982). “High Pressure Research in
welding leads to hydrodynamic and thermodynamic in- Geophysics,” Center for Academic Publications Japan, Tokyo.
terpenetration of colliding metallic surfaces. Compaction Bridgman, P. W. (1958). “The Physics of High Pressure,” G. Bell and
of powders, which have high strength and resist ordinary Sons, London.
consolidation or are chemically unstable and thus cannot Drickamer, H. G., and Frank, C. W. (1973). “Electronic Transitions and
be sintered, can be accomplished with shocks. Interiors of the High Pressure Chemistry and Physics of Solids,” Chapman & Hall,
London.
the particles remain relatively cold, while the surfaces are Ferraro, J. R. (1984). “Vibrational Spectroscopy at High External Pres-
heated to cause interdiffusion, welding, or melting. More sures,” Academic Press, Orlando, FL.
ductile materials may be used to bond together stronger Hazen, R. M., and Finger, L. W. (1982). “Comparative Crystal Chem-
particles or fibers in order to form composite materials. A istry: Temperature, Pressure, Composition and the Variation of Crystal
strong and oscillatory dependence on hardness of Au–Ge Structure,” Wiley, New York.
Jayaraman, A. (1983). “Diamond anvil cell and high pressure physical
alloys has been observed for shock treatments with dura- investigations,” Rev. Mod. Phys. 55, 65.
tions of 0.1–1 µsec. Minomura, S. (1985). “Solid State Physics under Pressure,” KTK Sci-
Although the precise mechanisms are not yet under- entific Publishers, Tokyo.
stood, there is clear evidence to indicate that shock waves Schilling, J. S., and Shelton, R. N. (1981). “Physics of Solids under High
do, in fact, produce an altered state in many materials. Pressure,” North-Holland, Amsterdam.
Skelton, E. F. (1978). “High Pressure Science and Technology in Japan,”
The classic means of subjecting materials to extreme Office of Naval Research, Arlington, VA.
shock states is chemical explosive techniques: detonation Spain, I. L., and Paauwe, J. (1977). “High Pressure Technology,” Dekker,
of contacting explosives, impact by explosively driven New York.
P1: GLQ/LOW P2: ZCK Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
Impedance Spectroscopy
J. Ross Macdonald
University of North Carolina
703
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
are sometimes made in the time domain and then Fourier on electrolyte double-layer response, the technique of AC
transformed to the frequency domain. Impedance spec- polarography pioneered by D. E. Smith, and the electrolyte
troscopy is by no means limited to the measurement and studies of Randles and Somerton, Sluyters and Oomen,
analysis of data at the impedance level (e.g., impedance vs. R. P. Buck, and J. E. Bauerle. Since the late 1960s, IS has
frequency) but may involve any of the four basic immit- developed rapidly, in large part because of the availability
tance levels; thus, most generally, IS stands for immittance of new, accurate, and rapid measuring equipment.
spectroscopy.
Since IS deals directly with complex quantities, its his- There are two main categories of IS: electrochemical
tory really begins with the introduction of impedance into IS (EIS) and everything else. EIS involves measure-
electrical engineering by Oliver Heaviside in the 1880s. ments and analysis of materials in which ionic conduc-
His work was soon extended by A. E. Kennelly and C. P. tion strongly predominates. Examples of such materials
Steinmetz to include vector diagrams and complex rep- are solid and liquid electrolytes, fused salts, ionically
resentation. It was not long before workers in the field conducting glasses and polymers, and nonstoichiometric
began to make use of the Argand diagram of mathemat- ionically bonded single crystals, where conduction can in-
ics by plotting immittance response in the complex plane, volve motion of ion vacancies and interstitials. EIS is also
with frequency an implicit variable. Electrical engineer- valuable in the study of fuel cells, rechargeable batteries,
ing examples were the circle diagram introduced by C. W. and corrosion.
Carter (1925) and the Smith-Chart impedance diagram of The remaining category of IS applies to dielectric ma-
P. H. Smith (1939). These approaches were soon followed terials: solid or liquid nonconductors whose electrical
in the dielectric response field by the introduction in 1941 characteristics involve dipolar rotation, and to materials
of the Cole–Cole plot: a plot of ε on the y (or imagi- with predominantly electronic conduction. Examples are
nary) axis versus ε on the x (or real) axis. Such complex single-crystal or amorphous semiconductors, glasses, and
plane plots are now widely used for two-dimensional rep- polymers. Of course, IS applies to more complicated sit-
resentation of the response of all four immittance types. uations as well, for example, to partly conducting dielec-
Finally, three-dimensional perspective plots that involve a tric materials with some simultaneous ionic and electronic
log-frequency axis were introduced to the IS area by the conductivity. It is worth noting that although EIS is the
author and his colleagues in 1981; these plots allow com- most rapidly growing branch of IS, nonelectrochemical
plete response at a given immittance level to be shown in IS measurements came first and are still of great value
a single diagram. and importance in both basic and applied areas.
Because IS analysis generally makes considerable use In the EIS area in particular, an important distinction
of equivalent circuits to represent experimental frequency is made between supported and unsupported electrolytes.
response, the whole history of lumped-constant circuit Supported electrolytes are ones containing a high concen-
analysis, which particularly flowered in the first third of the tration of indifferent electrolyte, one whose ions generally
century, is immediately relevant to IS. Since then, much neither adsorb nor react at the electrodes of the measuring
work has been devoted to the development of theoreti- cell. Such an added salt can ensure that the material is
cal physicochemical response models and to the defini- very nearly electroneutral everywhere, thus allowing dif-
tion and analysis of various distributed circuit elements fusion and reaction effects for a low-concentration ion of
for use in IS-equivalent circuits along with ideal, lumped interest to dominate the AC response of the system. Sup-
elements like resistance and capacitance. The preferred port is generally only possible for liquid electrochemical
analysis method for fitting of IS data to either equivalent materials; it is often, but not always, used in aqueous elec-
circuits or to a mathematical model is complex nonlin- trochemistry. Solid electrolytes are unsupported in most
ear least squares fitting (CNLS), introduced to the field in cases of interest, electroneutrality is not present, and Pois-
1977 by Macdonald and Garber. In this procedure, all the son’s equation strongly couples charged species. Because
parameters of a fitting model are simultaneously adjusted of this difference, the formulas or models used to analyze
to yield an optimum fit to the data. supported and unsupported situations may be somewhat
Early experimental work in the IS field is discussed or completely different.
in the book on IS listed in the bibliography (Macdonald, Another important distinction is concerned with static
1987). Here it will suffice to mention the work of Grahame potentials and fields. In a material-electrode system
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
interfacial processes, up to 10 MHz or higher, sometimes the applied ac signal, they can yield 0.1% accuracy, and
needed to characterize bulk response of the material of they carry out measurements automatically.
interest. Although FRAs such as the Solartron 1260 cover a fre-
Although IS measurements on solids or dielectric liq- quency range from 10 µHz to 32 MHz, impedance re-
uids usually involve cells with two identical-plane parallel sults using them are not sufficiently accurate above about
electrodes, the situation is often much more complicated 50 kHz when potentiostatic control is used. A typical FRA
for measurements on liquid electrolytes. There, one usu- determines impedance by correlating, at each frequency,
ally employs one or more small working electrodes, a very the cell response with two synchronous signals, one in
small reference electrode, and a large counter electrode. phase with the applied signal and the other phase-shifted
Such an arrangement ensures that everything of interest by 90◦ . This process yields the in-phase and out-of-phase
(related to immittance) happens at or near the working components of the response and leads to the various im-
electrode(s). Further, a rotating-disk working electrode is mittance components. A useful feature is autointegration,
frequently used to control hydrodynamic conditions near a procedure that averages results over an exact number of
the electrode. cycles, with the amount of such averaging automatically
Because the kinetics of electrode reactions often depend selected to yield statistically consistent results. Recently, a
strongly on the static (dc) potential difference between the dielectric front end has become available for FRAs. It has
working electrode and the bulk, or, equivalently, the work- an extremely high input impedance and makes possible
ing electrode and the reference electrode, a potentiostat is accurate measurements on dielectrics and on very-high-
needed to fix this p.d. to a known and controllable value. resistivity solids containing mobile charges.
The simultaneous application of both ac and dc signals to
a three- or four-electrode cell makes it particularly diffi-
cult to obtain accurate frequency-response results above B. Analysis and Interpretation of Data
50 kHz or so.
1. Graphics
Although a calibrated double-beam oscilloscope, or
the use of Lissajous figures with a single-beam instru- Before carrying out a detailed analysis of IS immittance
ment, can be used to determine immittance magnitude data, it is a good idea to examine the data graphically,
and phase, such measurements are generally insufficiently both to search for any outliers and to examine the struc-
accurate, are time consuming, and apply only over a lim- ture of the data, structure that will usually reflect, at least
ited frequency range. A superior alternative is the use of in part, the physical processes present that led to the data.
audio-frequency or high-frequency bridges. Several such From the experimental situation one will generally know
bridges are discussed in the IS book. Of particular interest whether one is dealing with an intrinsically insulating ma-
is the Berberian–Cole bridge, which can cover a wide fre- terial, such as a nonconducting or a leaky dielectric, or
quency range and can allow potentiostatic dc bias control. whether the situation is of intrinsically conducting char-
Another important technique using a bridge and special er- acter: mobile charges dominate the response but may be
ror reduction procedures has recently been developed by completely or only partially blocked at the electrodes. For
Schöne and co-workers that allows potentiostatic control complete blocking, no DC can pass, a case that could be
and yields very accurate impedance results up to 3 MHz. confused with dielectric response. In the intrinsically con-
But manual balancing of a bridge is often disadvantageous ducting situation, dielectric effects are generally minimal,
because of its slowness, especially for corrosion studies and Z and M representations of the data are often most
where the properties of the system itself may be slowly useful. In the nonconducting case, Y and ε are frequently
changing. most appropriate, but it is nevertheless a good idea ini-
Manual balancing is avoided in various automated tially to examine plots of the data for all four immittance
network analyzers and impedance analyzers now com- levels, whatever the conducting/nonconducting situation.
mercially available. But the measuring instrument that When mobile charges are present, five principal phys-
has virtually revolutionized IS measurements and prin- ical processes may influence the data; these are bulk
cipally led to the burgeoning growth of the field in the resistive-capacitive effects, electrode reactions, adsorp-
past 20 years is the frequency-response analyzer (FRA). tion at the electrodes, bulk generation-recombination
Typical examples are FRAs produced by Solartron and by effects (e.g., ion-pairing), and diffusion. The double-
Zahner. Although space does not allow a full description layer capacitance is the reaction capacitance CR , and the
of their many features, such instruments allow potentio- reaction resistance RR is inversely proportional to the
static control for three- or four-terminal measurements, reaction rate constant. It is important to distinguish CR
they are highly digitized, they incorporate automatic fre- from the usually much larger low-frequency psudocapac-
quency sweeps and automatic control of the magnitude of itance associated with the diffusion of mobile charge or
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
FIGURE 4 Three-dimensional perspective plots of Na β-alumina data at (A) the impedance level, (B) log impedance
level, and (C) complex modulus level. (Reprinted by permission of John Wiley & Sons, Inc., from “Impedance
Spectroscopy—Emphasizing Solid Materials and Systems,” J. R. Macdonald, ed. Copyright 1987, John Wiley
& Sons, Inc.)
should not rely on the conventional log[ f ] curves alone. of measuring devices without adequate cross-calibration;
Since the diagram shows that much high-frequency data and nonphysical behavior is now apparent at the highest
are not resolved by this kind of plot, graph B involves the frequencies. These results make it clear that even when
logarithms of the data. Although high-frequency response 3-D plots are used, it is always desirable to explore the
now appears, the error in the low-frequency point is nearly results of different transformations of the data and to pick
obscured by the reduced resolution inherent in a log plot. the one with the best resolution.
Much improved results appear in graph C, a 3-D M
plot. Resolution over the full frequency range is greatly
2. Complex Nonlinear Least Squares Data Fitting
increased; the error in the lowest frequency point is
clearly shown; a midfrequency glitch now appears that a. Strengths and weaknesses. Although graphic
is not evident in the other plots and arises from a switch examination of IS data is an important analysis step, only
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
over a range of temperatures and/or potentials, can help b. Recent developments. Currently, the capability
one establish which of the possible circuits is most phys- of using various types of weighting involving model pre-
ically reasonable. dictions instead of data values exists only in LEVM, first
Figure 6 shows all possible potentially equivalent con- released in the summer of 1989. Although weighting such
ducting circuits involving two resistances and two capac- as FPWT is somewhat more complicated than PWT be-
itances. Specific parameter value choices that make them cause it varies with each nonlinear least squares iteration
all have exactly the same response are also indicated. Here as the parameter estimates change during the fitting proce-
the values for circuit D were taken exact, and approximate dure, its bias reduction potential makes such complexity
values for the other elements are denoted with a ∼ sign. often worthwhile. Although LEVM allows the fitting of
Let the units of these elements be M
for resistances and real or imaginary parts of the data separately, fitting both
µF for capacitances. Note that the two RC time constants together, as in CNLS, ensures that the best use is made of
of circuit A, a series connection, differ by less than 17% all the data in determining the parameter estimates and is
and are thus very close together. Can IS procedures re- thus preferred when both parts are available.
solve such a situation? Figure 7 shows the exact complex- Real IS data often have independent random errors that
plane response of these circuits at both the Z and the M have both an additive term and one that depends on the true
levels, compared to single time-constant Debye response. model predictions. A rather general error-variance model
The M curve shows much better separation of the two incorporating these possibilities is included in LEVM. For
response regions than does the Z curve. Thus, adequate a specific angular frequency ω j , the real and imaginary
graphical resolution is indeed possible. Further, it turns out parts of ν j , the error variance used in determining the
that CNLS fitting of synthetic data calculated from any of weighting, may be written as
these circuits with appreciable proportional random errors
added still yields excellent parameter estimates. In fact, ν j = U 2 + |F (ω j )|2ξ
with reasonably good data, CNLS can resolve response and
ν j = U 2 + |F (ω j )|2ξ ,
where U is associated with the additive random errors and
ξ is an arbitrary positive fractional exponent.
When U = 0, ξ = 1, and F is a data value, one has PWT;
whereas when F is a model prediction the result is FPWT.
Another widely used weighting, modulus weighting, fol-
lows when the same values of U and ξ are used but both
F and F in the above equations are replaced by |F|.
It is usually inconsistent, however, with the types of er-
rors likely to be present and generally leads to appreciably
more bias in parameter estimates even than PWT. CNLS
fitting yields a standard deviation Sf of residuals, which is
FIGURE 7 Complex-plane immittance responses, at the Z and a measure of the overall goodness of fit. For proportional
M levels, of the circuits of Fig. 6. random errors having a proportionality constant of σr , Sf
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
is an unbiased estimate of σr for FPWT, is nearly so for related work of the past 40 years. More details appear in
PWT, and is appreciably biased on the low side for FMWT the IS book.
and MWT, types of modulus weighting. Because of the charge decoupling present in a supported
When the data involve one or more inductive-type loops, situation, it is often an excellent approximation to treat
such as may arise from adsorption of a mobile charge at the effects of the various physical processes present in-
an electrode, it is desirable to use a nonzero U along with dependently. On the other hand, for unsupported condi-
PWT or FPWT or to use modulus weighting. Otherwise, tions where strong coupling is present, a unified treat-
because values of the imaginary part of the data may be- ment of all the processes together is necessary. The most
come very small and even pass through zero in the loop complete such theory, which incorporates all five of the
region, PWT or FPWT alone can strongly overemphasize processes mentioned in section III.B.1, was published by
the effect of these values near zero and thus lead to poor Franceschetti and Macdonald in 1978. It is a continuum
fitting. (i.e., averaged, not microscopic) theory, includes intrinsic
Although U and ξ may be given fixed values during and extrinsic charge effects, and applies to either ionic or
CNLS fitting, a way has been found to incorporate them as electronic conduction conditions. Even though it strictly
free parameters in LEVM least squares fitting. When this is applies only to flat-band conditions, its results are still suf-
done, the data themselves determine the most appropriate ficiently complicated that only in simplified cases does it
weighting for their fit, thus removing an appreciable part lead to responses that may be modeled by an equivalent
of the subjective element present in prior weighting ap- circuit.
proaches. Further, Monte Carlo fitting studies have shown It is useful to separate the electric processes present
that the statistical uncertainties of U and ξ in CNLS fits into bulk- and electrode-related groups whenever possible,
with them both taken free, or with only ξ free, are usually something which is usually indeed possible using CNLS
quite small compared to their estimates, and their biases fitting. The first group includes bulk resistance and dielec-
are even smaller. Thus, their fit estimates may be used with tric effects, the homogeneous reactions associated with
confidence. dissociation and recombination of the charges present, and
Although for much IS data one would expect that ξ even possible dispersive response. It is generally associ-
would be close to unity, this need not always be the case. ated with response effects at the high end of the frequency
Consider, for example, a set of real data arising from the range, while significant electrode effects often occur near
sum of the radioactive decay of two different species. Now the low end, possibly at very low frequencies. Bulk resis-
the statistics of such decay follows a Poisson process, one tance and capacitance are extensive quantities, dependent
for which ξ = 0.5. The radioactive background count will on the effective separation between electrodes.
also involve such a process. Thus the appropriate variance The second group involves what happens in the neigh-
model would involve U = 0 and three terms, each with borhood of the electrodes (within a few Debye lengths of
2ξ = 1. The first two would be the two exponential decays them) and is thus intensive. No net charge is transferred
and the last, the background. In such a fitting situation, to an electrode if it is completely blocking for all mobile
where ξ is known absolutely, it should be held fixed at its charges. The next simplest EIS situation is that where a
proper value. mobile metallic ion is of the same species as the atoms
of a metallic electrode: a parent-ion electrode. Then, in
a symmetrical-electrode situation there is a sink/source
IV. APPLICATIONS of ions at each electrode, since electron transfer at an
electrode can transform ions into atoms and vice versa,
A. Basic Analysis of Material Properties depending on the polarity of the electric field at the elec-
and Electrode Effects trode. Such a reaction can be written
1. Bulk and Reaction Response Mez+ + ze− ,
Me
Although IS is of great value for the characterization of where Me denotes a metal atom and z the number of elec-
the electrical properties of material-electrode systems, its trons transferred. An example of a symmetrical cell of this
use for this purpose requires that connections be known type is Ag|AgCl|Ag.
between model and/or equivalent circuit parameters and Particularly important for the aqueous electrolyte area
the basic characterization parameters. One must be able is the redox electrode, where charge crosses the interface
to pass from estimates of macroscopic quantities, such at the electrode only in the form of electrons. The species
as resistances and capacitances, to estimates of average Red and Ox are usually soluble in the electrolyte, satisfy
microscopic quantities. Here only a brief overview will
be given of some of the large amount of theoretical IS- O x z+ + ne− ,
Red(z−n)+
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
and involve the forward and reverse reaction rate con- No diffusion effects appear when only charge of a single
stants kf and kr , respectively. If z = n, the Red species is sign is mobile; this often is an excellent approximation for
uncharged and may diffuse in the electrode, or may evolve solid electrolytes.
if it is a gas. For supported conditions, matters are different. Con-
sider a single species with diffusion coefficient D and va-
lence number z (possibly zero). Then (ld )2 = 4D/ω, a re-
2. Distributed Circuit Element Response sult that follows from the above expression when one sets
a. Diffusion. Since diffusion is not localized at a Dn = Dp = D and z n = z p = z. Further, when both positive
point in space but is distributed over a finite region, it and negative charges are mobile, diffusion under unsup-
leads to electrical response characteristic of a distributed ported conditions leads to a single expression involving
circuit element (DCE). Such elements cannot be described tanh, as above, but for supported conditions, as in the re-
by means of a finite number of ideal elements, such as re- dox case, two such terms appear, one for each species—in
sistances and capacitances. The response of several DCEs keeping with the lack of coupling between the species.
important to IS will be discussed.
In addition to possible diffusion of uncharged species b. Other DCEs. A characteristic signature of diffu-
within an electrode, diffusion of mobile species in the elec- sion is (iω)±0.5 response, but IS data more often exhibits
trolyte may contribute significantly to the impedance of a CPE response (iωτ )±ψ , where 0 ≤ ψ ≤ 1. But such re-
IS system. Generally, diffusion response is neither inten- sponse is not physically realizable over all frequencies,
sive nor extensive. At sufficiently high frequencies, diffu- and so other DCEs have been introduced that approxi-
sion effects are confined to the immediate neighborhood mate such behavior over a limited frequency range. They
of the electrode (or within a hydrodynamic boundary layer may be written as impedances or complex dielectric con-
at a rotating electrode) and so are intensive; whereas at low stants, depending upon which I level is appropriate. Here
enough frequencies, diffusion occurs throughout the mate- they will be given at the Z level.
rial between electrodes, and the response becomes exten- An empirical DCE of the above type is Havriliak–
sive as the frequency decreases and the effective diffusion Negami (HN) response, written as
length ld , proportional to (ω)−0.5 , becomes comparable to Z HN (ω) = RHN /[1 + (iωτ )α ]β ,
the size of the cell.
The diffusion impedance, appropriate when there is a where 0 ≤ α ≤ 1 and 0 ≤ β ≤ 1. It reduces to Cole–
fast electrode reaction, is of the form Davidson response when α = 1 and to Cole–Cole response
0.5 0.5 (termed ZC response at the Z level) when β = 1. The first
Z W (ω) = Z W (0) tanh i(l/ld )2 i(l/ld )2 , of these yields an asymmetric arc in a complex-plane plot
and the second one a symmetric arc. Both shapes appear
where l is the separation between symmetrical electrodes often in practice, and ZC fitting is frequently used to rep-
and Z W (0) is a resistance proportional to l and thus is ex- resent data that yield an arc of a semicircle with its center
tensive. Such response is known as finite-length Warburg below the real axis. Such behavior is usually ascribed to
behavior. At high enough frequencies that the tanh term the presence of a distribution of some physical quantity
goes to unity, Z W (ω) becomes proportional to the intensive in space, time, or energy. Rough electrodes are one ex-
quantity ld and is termed (ordinary) Warburg response. ample. Although fitted values of α and/or β often show
As the electrode reaction rate decreases toward zero, a appreciable temperature dependence, there exists no the-
more complicated expression for Z W (ω) must be used, but ory yielding such dependence for HN response.
it reduces to the form Another important DCE is that of Kohlrausch–
0.5 0.5 Williams–Watts (KWW) response. It yields a stretched
Z W (ω) = Z C ctnh i(l/ld )2 i(l/ld )2
exponential in the time domain, response proportional to
when the electrode is completely blocking (open-circuit exp[−t/τ ]ψ , with 0 ≤ ψ ≤ 1. Here there are, if anything,
diffusion). Here Z C is given by (l/ld )2 /ωCDOC , and too many different theories yielding such response, but
CDOC /CC is the effective low-frequency limiting dielec- again they do not usually predict the temperature depen-
tric constant associated with the process. dence of ψ. The corresponding frequency response is very
For general unsupported situations, those with positive difficult to calculate accurately, but an excellent approxi-
and negative charged species mobile and having diffu- mation for it is available in LEVM. Complex plane plots
sion coefficients of Dn and Dp and valence numbers of z n of KWW response yield an asymmetric arc for any ψ < 1
and z p , until ψ = 1, when a Debye semicircle appears.
Another DCE category is associated with the presence
(ld )2 = (4Dn Dp /ω)[(ηn + ηp )/(ηn Dn + ηp Dp )]. in a material of a distribution of activation energies (DAE).
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
Such distributions are likely in IS materials and may be theoretical analysis of the situation. Since only Rs and Cs
expected even in single crystals when there are many com- are involved, ambiguity is present, and many other cir-
peting possibilities for the individual motion of mobile cuit structures with the same elements and the same fre-
charges. Both Gaussian and exponential distributions have quency response are possible. Nevertheless, the present
been considered in detail and can lead to either symmetric hierarchical ladder-net-work connection is more physi-
or asymmetric complex-plane arcs. But only an exponen- cally reasonable than the others for homogeneous mate-
tial DAE yields CPE-like fractional-exponent frequency rial. It ensures that bulk charging and conduction effects
response over a finite frequency region. This exponent, take place before reaction/adsorption ones. For polycrys-
φ, is not limited to the range from 0 to 1 but satisfies talline materials, however, circuits involving series rather
−∞ < φ < ∞. Further, unlike the other DCEs considered, than hierarchical connection of parallel RC subcircuits are
an exponential DAE predicts temperature dependence of often found appropriate.
φ in good agreement with many experimental results. For the conditions above, no diffusion DCE element is
present. The Z W (ω) one discussed earlier appears, how-
ever, in the Z D3 position when charges of both signs are
3. Equivalent Circuits
mobile and at least one of them reacts at an electrode.
Many different equivalent circuits have been proposed When static fields are present in the material, either intrin-
over the years for IS fitting, and no one circuit structure sic and/or externally produced, numerical analysis of the
is appropriate for all situations. Figure 8 shows a circuit, nonlinear transport equations governing the IS response
however, that has been found useful for a variety of materi- shows that the Fig. 8 circuit still applies to good approx-
als and experimental conditions. Bulk properties are repre- imation but elements such as CR and RR then depend ap-
sented by Cg , the geometrical compacitance, and R∞ , the preciably on the static p.d. present. Finally, the circuit of
high-frequency limiting resistance. CR , associated with an Fig. 8 has often been found appropriate for the fitting of
electrode reaction, is the double-layer capacitance (possi- data for supported conditions as well as unsupported ones.
bly including both a compact inner-layer capacitance and The Fig. 8 circuit is particularly appropriate for ana-
a diffuse double-layer capacitance), and RR is the reaction lyzing electrode–interface effects in low-resistivity EIS
resistance. Finally, CA and RA are associated with adsorp- situations. For high-resistivity solids, however, such as
tion at an electrode. The Z D elements, when present, are ion-conducting glasses, one is usually more interested
DCEs. Also, not all the other elements need be present; for in isolating and inerpreting bulk dispersion behavior.
example, in the absence of adsorption CA and RA would Figure 9 shows a circuit appropriate for such materials,
not appear. where te DCE block is a conductive-system dispersive
For an unsupported, fully dissociated material with DE, and the DED block is a dielectric-system dispersive
charges of only a single sign mobile, the Fig. 8 circuit one. The circuit thus allows one to account for electrode
with all Z D s absent has been found to yield an accurate effects when important, and either conductive dispersion,
representation of the impedance resulting from a flat-band or dielectric dispersion, or both.
B. Uses of IS for Evaluation and Control under different anodizing and sealing conditions. Discrim-
of Electrochemical Processes ination was possible between properties of the dense bar-
of Industrial Interest and Importance rier layer and the porous outer layer, and changes arising
from aging and from the effects of natural environmental
1. Corrosion and Surface Protection
conditions were reflected in the results.
Corrosion of metallic structural materials leads to im- An EIS monitor has been used for the detection of paint
mense damage each year (an estimate for the United States degradation under atmospheric exposure. A model is being
for 1988 is $200 billion); thus its control and amelioration developed to help predict the lifetime of protective or-
are of tremendous economic importance. EIS has played ganic coatings on steel based on short laboratory tests.
and is playing a valuable role in quantifying and mitigating The model includes the steps of defect formation, trans-
corrosion effects. For example, it has been successfully ap- port of corrodents, loss of adhesion, and corrosion. EIS
plied to complicated corrosion systems to determine cor- helps elucidate how these four processes interact and de-
rosion rates as well as the mechanisms and efficiency of pend on coating processes and environmental effects.
corrosion inhibitors. The use of EIS has broadened the Although IS analysis should properly be carried out
range of corrosion phenomena that can be studied us- only on time-invariant data—data obtained from a sys-
ing electrochemical techniques and has been particularly tem whose properties are independent of time—some of
valuable in evaluating the corrosion behaviors of polymer- these properties are often not time-invariant during mea-
coated metals and anodized aluminum alloys. In addition, surement of a corroding system. If the change is slow
it has been incorporated into a quality control test for an- compared to the required measurement time and/or if it is
odized aluminum surfaces and for chromate-conversion- approximately linear in time, imporved results may be ob-
coated aluminum alloys. tained by making a set of measurements from low to high
The application of EIS techniques has resulted in a great frequencies immediately followed by one from high to low
deal of information on methods of corrosion protection frequencies. Averaging of the results will then eliminate
that are difficult or impossible to study with traditional dc much of the variation with time.
techniques, such as conversion and polymer coatings, an- A test of time invariance can be made by analyzing the
odic films, and inhibitors. Not only can EIS measurements data with the Kramers–Kroning (K–K) relations, integral
provide greater sensitivity and more information about transforms connecting real to imaginary parts of the data
the processes investigated than can conventional dc meth- and vice versa. They are only applicable for time-invariant
ods, they are particularly appropriate when impedances systems. All useful fitting models and equivalent circuits
are high and/or when low-conductivity media are used. are minimum phase and so automatically satisfy the K–K
EIS measurement and analysis has been used to provide relations. Thus, a good fit is evidence of time invariance.
fast and sensitive information on the protection properties Strong failure of the K–K relations for a given set of data
of chromated galvanized steel. Such measurements may is immediate evidence of unwanted time variation, and,
be used as a quality control procedure, since the charge unlike CNLS fitting, no model or circuit is required to
transfer resistance has been found to be well related to carry out such a test. Although ordinary K–K analysis re-
the corrosion rate. EIS has been used to detect corroding quires much computation and uncertain extrapolations as
areas of large structures accurately and has been applied well, an alternate program available in LEVM avoids such
for corrosion monitoring of steel reinforcing bars in con- difficulties, uses only measured data, and caan readily
crete to yield a nondestructive estimate of the amount of test for time-invariance as well as estimating the imagin-
corrosion damage. ary-part response associated with real-part data, or
Since the roughness of an electrode surface is reflected vice versa.
in the results of EIS measurements involving the electrode,
EIS may be used to identify surface inhomogeneities pro-
2. Batteries and Fuel Cells
duced by corrosion. It provides (averaged) information on
surface morphology on a much smaller scale than does EIS studies have been made of the kinetics of the insertion
even electron micrography. EIS has been employed as a reaction in solid-state batteries based on such reactions. A
means of nearly continuous evaluation of localized cor- single EIS experiment allows information to be obtained
rosion processes such as pitting, crevice corrosion, stress about the electrode-interface reaction and diffusion in the
corrosion, cracking and fatigue corrosion, abrasion, and electrolyte or electrode. Measurements at different battery
corrosion under a porous surface layer. voltages to determine the dependence of the results on the
EIS measurements over a relatively wide frequency charge of the battery have led to increased understanding
range have been found to yield valuable detailed informa- of the discharge process and thence to improved battery
tion about the properties of aluminum oxide layers formed design.
P1: GLQ/LOW P2: ZCK Final Pages
Encyclopedia of Physical Science and Technology EN007C-333 June 30, 2001 15:18
An interesting EIS study has been carried out on elec- beyond potential and current, can add valuable additional
trochemically impregnated Ni electrodes from four differ- information to IS studies. It is likely that considerable fu-
ent manufacturers of Ni/H cells. The EIS measurements ture development will be concerned with such possibilites.
were made in KOH electrolyte, and large differences were
found in the impedance behavior of the electrodes from
the different manufacturers. The results indicated a prob- SEE ALSO THE FOLLOWING ARTICLES
able correlation between impedance parameters and cell
life and performance. BATTERIES • CORROSION • ELECTROCHEMICAL ENGI-
EIS studies of molten carbonate fuel cells have in- NEERING • ELECTROCHEMISTRY • ELECTROLYTE SOLU-
creased the understanding of processes going on under TIONS, THERMODYNAMICS • ELECTROLYTE SOLUTIONS,
operating conditions of the cell. In particular, they have TRANSPORT PROPERTIES • FUEL CELLS, APPLICATIONS
helped identify and elucidate the reactions that occur both IN STATIONARY POWER SYSTEMS
at the anode and at the cathode.
Electrolyte-insulator-semiconductor sensors meld inte- Archer, W. I., and Armstrong, R. D. (1980). The application of
grated circuit technology with traditional chemical tech- A. C. impedance methods in solid electrolytes. Electrochemistry 7,
nology. They can be used to monitor pH changes, for 157–202.
Armstrong, R. D., Bell, M. F., and Metcalfe, A. A. (1978). The AC
example, and can be constructed with ion-selective mem-
impedance of complex electrochemical reactions. Electrochemistry 6,
branes to make them sensitive to a specific ion. IS 98–127.
measurements and analysis can yield, in favorable cases, Franceschetti, D. R., and Macdonald, J. R. (1978). Theory of small-signal
information on the electrical characteristics of the elec- AC response of solids and liquids with recombining mobile charge. J.
trolyte, the insulator, the semi-conductor, and the vari- Chem. Phys. 68, 1614–1637.
Gabrielli, C., ed. (1990). Proceedings of the First International Sympo-
ous interfaces and on interface states. The IS approach
sium on Electrochemical Impedance Spectroscopy. Electrochim. Acta
allows very low surface-state densities at the insulator- 35, 1483–1670.
semiconductor interface to be determined. Measurements Macdonald, D. D. (1991). Mechanistic Analysis Using Electrochemical
have shown, however, that it is the electrolyte–insulator Impedance Spectroscopy, Proceedings of the Symposium on High
interface that responds to pH changes. Temperature Electrode Materials and Characterization, 91-6, pp. 1–
43. The Electrochemical Society, Inc., Pennington, NJ.
Solid-electrolyte chemical sensors are electro-chemical
Macdonald, D. D., ed. (1993). Proceedings of the Second Interna-
cells designed to measure the concentration or pressure of tional Symposium on Electrochemical Impedance Spectroscopy, Elec-
chemical species in gases or fluids; for example, zirconia- trochim. Acta 38, 1797–2143.
based solid electrolytes have been used to measure oxygen Macdonald, J. R., ed. (1987). “Impedance Spectroscopy—Emphasizing
concentration. Such sensors are employed to measure the Solid Materials and Systems,” Wiley–Interscience, New York.
Macdonald, J. R. (1987). Impedance spectroscopy and its use in ana-
oxygen concentration in steel melts and the air–fuel ratio
lyzing the steady-state AC response of solid and liquid electrolytes.
in automobile engines. EIS has been found very useful J. Electroanal. Chem. 223, 25–50.
to study (and to help optimize) electrode materials and Macdonald, J. R., and Potter, L. D., Jr. (1987). A flexible procedure for
appropriate pretreatment preparation for such sensors. analyzing impedance spectroscopy results: Description and illustra-
In recent years a number of variants on and extensions tions. Solid State Ionics 23, 61–79.
Macdonald, J. R. (1999). A full list of categorized JRM publications,
of IS have been developed. An important one is electro-
with titles, is available in http://www.physics.unc.edu/∼macd/.
hydrodynamic impedance. Here the speed of a rotating- Mansfeld, F. (1988). Don’t be afraid of electrochemical techniques—but
disk electrode is modulated sinusoidally, resulting in use them with care. Corrosion 44, 856–868.
modulation of the mass transport in a liquid electrolyte. Schöne, G., Wiesbeck, W., and Lorenz, W. J. (1987). High-frequency
Such modulation allows the minimization of the coupling impedance spectroscopy of fast electrode reactions. J. Electroanal.
Chem. 229, 407–421.
with interfacial kinetics. Modulation of numerous other
Sluyters-Rehbach, M., and Sluyters, J. H. (1984). AC Techniques. In
quantities in an IS experiment is also possible, such as “Comprehensive Treatise of Electrochemistry” (E. Yeager, J. O’M.
light, temperature, or magnetic field. Thus analysis of Bockris, B. E. Conway, and S. Sarangapani, eds.), Vol. 9, pp. 177–292.
other transfer functions, cause and effect relations that go Plenum Press, New York.
P1: GNB/FEE P2: GLM Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
Incommensurate Crystals
and Quasicrystals
Uwe Grimm Max Scheffer
The Open University Chemnitz University of Technology
I. Aperiodic Crystals
II. The Structure of Quasicrystals
III. Physical Properties of Quasicrystals
IV. Concluding Remarks
731
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
Bragg peaks, and is supported on the (usually dense) set crystals, apart from defects that exist in any real solid, is
of integral linear combinations of a finite set of vectors periodic in space. In other words, associated to a crystal
in Fourier space. there is a three-dimensional periodic lattice, and this lat-
Quasiperiodic tiling Quasiperiodic space-filling tiling tice also determines the possible symmetries that may be
that serves as the analogue of the periodic lattice in apparent in the crystal shape.
structure models of perfect quasicrystals.
Random tiling Space-filling tiling in which tiles are ar-
ranged randomly. Random tilings are believed to be a A. Periodic Crystals
more realistic description of the structure of real qua- A “conventional,” periodic crystal is thus characterized by
sicrystals than perfect quasiperiodic tilings. a periodic lattice. Once the distribution of atoms in a single
fundamental domain of the lattice, a unit cell, is known, the
entire structure is determined by periodicity. In addition
SOLIDS have traditionally been classified as either crys- to this translational symmetry, crystalline structure may
talline or amorphous. The basic property that distinguishes have other symmetries such as a rotational symmetry with
a crystal from an amorphous or glassy material is the long- respect to a certain axis. However, these symmetries have
range positional order of its microscopic constituents. to be compatible with each other, and this restricts the pos-
Classical crystallography deals with lattice-periodic struc- sible symmetries of a periodic three-dimensional crystal
tures that can be described by a space-filling periodic repe- to one of 230 crystallographic space groups classified by
tition of a single microscopic building block, the so-called Schoenflies and von Fedorow in the late 19th century. In
unit cell. However, order does not imply periodicity, and particular, the crystallographic restriction only concedes
over the last decades it has become evident that aperiodi- two-, three-, four-, and sixfold rotational symmetry. Other
cally ordered materials not only are theoretically possible, symmetries, such as fivefold rotational symmetry or icosa-
but commonly realized in nature. Aperiodic crystals can be hedral symmetry, cannot be reconciled with lattice peri-
classified into incommensurate crystals, known since the odicity in three-dimensional space, and thus cannot be
1950s, on the one hand, and quasicrystals, discovered in accommodated in periodic crystals.
the early 1980s, on the other. In the former, aperiodicity is In general, the atomic structure of a material need not
due to the combination of several periodic structures with manifest itself in the shape of its surface; for instance, the
incommensurate periods. Quasicrystals are, in a sense, structure of gold cannot be guessed by the morphology of
a more radical manifestation of aperiodic order, as the a gold nugget. Only if the surfaces correspond to special
atomic positions cannot be interpreted in terms of under- planes, for instance, planes parallel to the faces of a cube
lying periodic lattices in three dimensions. They are usu- in a cubic crystal, does the shape of the crystal reflect
ally identified by symmetries that are incompatible with its atomic structure. This happens if special surfaces of a
lattice periodicity and hence forbidden in classical crys- crystal are energetically favored and if the growth velocity
tallography, such as icosahedral symmetry. The beautiful of these surfaces is high. For example, minerals frequently
symmetry, the peculiar aperiodic order, the rather intricate develop beautiful facets.
and subtle atomic structure, the unique (and only partially
understood) physical properties, and, last but not least,
the quest for technical applications have made quasicrys- B. Diffraction
tals an important topic of crystallography, mathematics, Direct access to the underlying atomic structure and its
physics, chemistry, and materials science. long-range order is provided by diffraction experiments.
An X-ray, electron, or neutron beam is scattered by the
sample, and interference gives rise to a diffraction pat-
I. APERIODIC CRYSTALS tern that can be recorded, providing information about the
structure of the material. Let ρ(r) denote the density of
Crystals, in our common perception, are characterized by scatterers in space, and q = kout − kin the scattering vec-
their morphology, their faceted shape, and have tradition- tor, i.e., the momentum diffierence between the incoming
ally been be classified according to their symmetry. Ob- and the scattered radiation. Provided that the scattering
viously, the regularity of crystals reflects the underlying is elastic and that multiple scattering can be neglected,
order of their microscopic structure. It can be visualized the measured intensity I (q) is proportional to the Fourier
by the beautifully ordered patterns of sharp diffiraction transform ĝ(q) of the pair correlation or Patterson function
spots, so-called Bragg peaks, as observed, for instance, in
X-ray diffiraction experiments. For a long period of time, g(r) = d 3 r ρ(r + r )ρ(r ). (1)
it was taken for granted that the microscopic structure of
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
In many situations, I (q) can also be expressed directly in C. What is Aperiodic Order?
terms of the absolute square of the Fourier transform ρ̂(q)
At first sight, the term “aperiodic order” may appear para-
of the scattering density,
doxical. However, there exists a wealth of possible struc-
I (q) ∼ |ρ̂(q)|2 . (2) tures that, in a sense, are intermediate between the peri-
odic order of a perfect crystal and the disorder that one
If the scattering density ρ is periodic with respect to the may find realized in amorphous intermetallic alloys. Ape-
three-dimensional lattice spanned by the base vectors a1 , riodic order is realized in incommensurate crystals and
a2 , and a3 , the diffraction intensity I (q) is concentrated quasicrystals, which are discussed in detail below; while
on a lattice, the reciprocal lattice in momentum space. the discovery of these structures came quite as a surprise
This lattice is spanned by the three dual base vectors a∗1 , at the time, it is now apparent that these are not merely
a∗2 , and a∗3 . The diffraction pattern thus consists of sharp rare caprices of nature, but, on the contrary, occur quite
peaks, so-called Bragg peaks, at positions commonly.
3 The kind of aperiodicity encountered here is known as
q= h j a∗j (3) quasiperiodicity, and may be most easily understood by
j=1 a one-dimensional example. The paradigm of a periodic
function is the trigonometric function sin(x), which is pe-
in momentum space, where h j , j = 1, 2, 3, are integer
riodic with period 2π , i.e., sin(x + 2π ) = sin(x). Now,
numbers indexing the diffraction spots.
consider the sum of two sine functions
Incommensurate crystals and quasicrystals, unlike
amorphous solids, also display sharp Bragg diffraction 1+c 1−c
patterns, but in contrast to periodic crystals, the positions f (x) = sin(x) + sin(cx) = 2 sin x cos x ,
2 2
of the diffraction spots do not lie on a periodic lattice in (5)
three-dimensional momentum space. Indexing the Bragg
peaks as linear combinations of three base vectors as in where c is some fixed number. Is the function f (x) pe-
Eq. (3) would require irrational coeficients. How- riodic? Well, this depends on the values of c. If c is a
ever, upon enlarging the number of vectors a∗j , j = rational number, c = m/n with coprime integers m and n,
1, 2, . . . , D > 3, one recovers an indexing scheme then the periods 2π and 2π c = 2π m/n are commensu-
rate, and the function is periodic with period 2π n because
D
sin[c(x + 2π n)] = sin(cx√+ 2π m) = sin(cx). However, if
q= h j a∗j (4)
c is irrational, say c = 2, the two frequencies are in-
j=1
commensurate, and f (x) is aperiodic. This can also be
with integer coefficients h j , but now D > 3 integers are seen from the product form of the function f (x) that is
required to index a Bragg spot. Assuming that the base also given in Eq. (5). Looking, for instance, at the set of
vectors a∗j are linearly independent with respect to integral solutions of f (x) = 0, we see that the sine and the cosine
linear combinations (otherwise one could do with fewer functions in Eq. (5) each contribute zeros at equally spaced
vectors), one finds that the set of integral linear combina- positions xk = 2π k/(1 + c) and x = 2π ( + 1/2)/(1 − c),
tions of the a∗j , in general, densely fills space. The gen- respectively, but the two spacings are incommensurate if
eralization from Eq. (3) to Eq. (4) may appear innocent, c is irrational. Still, the function f (x) retains a lot of its
but the question remains of how the aperiodic order in regularity—after all, it is just the sum of two sine func-
real space that looks like that produces such diffraction tions. In fact, it is almost periodic in a sense that may be
patterns. inferred from Fig. 1, which shows two different sections
−2
0 10π 20π 30π
√
FIGURE 1 Graphs of f (x) (gray) and f (x + x 0 ) (black) on the interval 0 ≤ x ≤ 30π , with x 0 = 58π/(1 + 2). The grey
curve has been widened to a “tube” of four times the width of the black line, indicating the size of the deviation between
the two curves on the interval shown.
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 2 The quasiperiodic√ function f (x) of Eq. (5) as a cut of the two-dimensional periodic “egg carton” function
F(x , y), Eq. (6), for c = 2.
of the graph of the function f (x) that almost agree on a F(x , y) = sin(x) + sin(y) (6)
large interval.
In Fourier space, the function f (x) is represented by along the line y = cx, i.e., f (x) = F(x , cx). This is shown
two frequencies, either commensurate or incommensu- in Fig. 2. The quasiperiodic function emerges as a section
rate. In general, a function is periodic if all its frequencies through a higher dimensional periodic function along a
in Fourier space are situated on a periodic lattice, thus direction that induces the incommensurability. It is pre-
are integer linear combinations of d base vectors in d cisely the same idea that underlies the higher dimensional
space dimensions. A quasiperiodic function is a general- description of incommensurate crystals and perfect qua-
ization of this concept where, again, all frequencies are sicrystals.
integer linear combinations of D vectors, but this number
may be larger than the spatial dimension, D ≥ d, where
D. Incommensurate Crystals
equality implies periodicity. Thus, the Fourier transform
of a quasiperiodic structure that is not periodic will be Incommensurate structures in crystals have been known
supported on the set of all integer linear combinations of since the 1950s. These are magnetic crystals that exhibit
D > d vectors in d-dimensional Fourier space. Thus, the a helical ordering of the spins which is incommensurate
diffraction pattern of a quasiperiodic structure consists of with the underlying periodic lattice structure. A sketch of
Bragg peaks located on a particular dense set of points in such a situation is shown in Fig. 3a. The existence of these
Fourier space, and the Bragg peaks can be indexed by D in- systems may not seem too surprising, as the incommensu-
teger numbers as in Eq. (4). In mathematical terminology, rability occurs due to an additional degree of freedom, the
the Fourier transform is finitely generated over the inte- spin, in an otherwise perfectly periodic crystal. However, it
gers; its support is a module of rank D. One can also con- did not take long until evidence for incommensurability of
struct ordered structures whose Fourier transforms consist the structure itself was found in form of so-called satellite
of Bragg peaks that cannot be indexed by a finite number reflections in diffraction patterns. The satellite peaks show
of integers; however, such structures have not yet been up next to the main reflctions, hence the name, but their
observed in nature. coordinates with respect to the lattice of main peaks are
It is instructive to think of the function f (x) as a cut of not simple fractions, and may even depend continuously
the two-dimensional periodic function on temperature. In particular this continuity invalidates the
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
E. Quasicrystals
Quasicrystals entered the scene only in the early 1980s
when icosahedral symmetry was found in a selected-area
electron diffraction analysis of a rapidly cooled Al–Mn al-
loy (see Fig. 4). Figure 5 shows the two Platonic solids with
icosahedral symmetry, the icosahedron and the dodecahe-
dron. The 20 faces of the icosahedron are equilateral trian-
gles; they meet in 30 edges and 12 vertices. The dodeca-
hedron consists of 12 faces that are regular pentagons, and
comprises 30 edges and 20 vertices. Both polyhedra show
the same symmetry. There are six fivefold axes, connect-
ing opposite vertices of the icosahedron or the centers of
opposite pentagons of the dodecahedron, respectively. The
10 threefold axes connect the centers of opposite faces of
FIGURE 3 Examples of incommensurate structures. (a) Incom- the icosahedron or opposite vertices of the dodecahedron,
mensurately ordered degree of freedom. (b) Displacive modula- respectively; in both cases, the 15 twofold rotational axes
tion. (c) Occupational modulation. (d) Composite structure. connect midpoints of opposite edges. The three different
types of symmetry axes and their relative orientations in
interpretation of the satellite peaks in terms of structurally space are perfectly recovered in the diffraction pattern of
stable periodic superstructures, which refers to another pe- Fig. 4.
riodic structure with a large unit cell that is superimposed Apparently, the diffraction pattern consists of sharp
on the original lattice. spots, and thus the structure must be ordered. However,
Sketches of several possible scenarios of incommensu- as icosahedral symmetry is incompatible with lattice pe-
rability in a crystalline structure are compiled in Fig. 3. riodicity, it cannot be a conventional periodic crystal.
Most important are the modulated structures depicted in Even though incommensurate crystals had been known
Figs. 3b and 3c. These crystals are characterized by a peri- for about 30 years and an explanation of the observed
odic deviation, the modulation, from their underlying pe- diffraction pattern in terms of a quasiperiodically ordered
riodic lattice structure. The structure is incommensurate structure was readily available, this discovery came as a
if the period of the modulation does not match the lattice surprise and gave rise to prolonged controversy. However,
periodicity. In Fig. 3b, the modulation is displacive, i.e., it soon became clear that alternative interpretations of the
the positions of the atoms are shifted. Figure 3c shows an- results in terms of periodic crystals were either inconsis-
other scenario where the deviation is occupational. In this tent or required periodic structures with huge unit cells
case, the periodic modulation determines the occupation comprising thousands of atoms. Strictly speaking, this
probability of the perfect lattice positions. Finally, Fig. 3d question can never be resolved completely because the
shows an example of an incommensurate composite struc- peaks observed in experiments cannot be infinitely sharp
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 4 The first reported experimental evidence of a quasicrystal: Selected-area electron diffraction patterns
of a rapidly cooled Al–Mn alloy showing icosahedral symmetry. Different symmetries were observed by tilting the
sample by the angles given in the figure, exactly corresponding to the angles between different symmetry axes of
isosahedral symmetry. [From Shechtman, D., Blech, I., Gratias, D., and Cahn, J. W. (1984). “Metallic phase with
long-range orientational order and no translational symmetry.” Phys. Rev. Lett. 53, 1951–1953. Copyright 1984 by the
American Physical Society.]
due to the finite size and disorder of the crystal and due models of quasicrystals consist of two parts: The geomet-
to the limited resolution. Thus, one may always describe ric arrangement of the building blocks, which takes care of
the experimental data in terms of a hypothetical periodic the quasiperiodic long-range order, and the location of the
structure; when the number of atoms in a unit cell be- atoms within each building block, their “decoration.” The
comes too large, however, the description as an aperiodic geometric arrangement of building blocks is convention-
crystal is not only more elegant and appealing, but also ally encoded in a space-filling tiling with a finite number of
much simpler. In particular, it can easily account for the prototiles. To some extent, these tilings can be visualized
symmetry of quasicrystals (Fig. 6). directly by high-resolution electron microscopy of qua-
Soon after the discovery of icosahedral quasicrys- sicrystals. Whereas much is known about quasiperiodic
tals, intermetallic alloys with further crystallographically tilings of space and their symmetry properties, the actual
forbidden symmetries were found, showing either a 12-, distribution of atoms in quasicrystalline solids remains
10-, or 8-fold symmetry axis. The corresponding diffrac- largely unknown. Diffraction data alone do not suffice to
tion patterns reveal that these quasicrystals are periodic in derive the atomic density unequivocally, and electron mi-
one direction of space, which coincides with the symmetry croscopy methods are just on the verge of reaching the
axis, i.e., they consist of a periodic stacking of planes with required atomic resolution. For several systems, sophisti-
12-, 10-, or 8-fold rotational symmetry. Accordingly, they cated models have been proposed, although many details,
are referred to as dodecagonal, decagonal, and octagonal for instance, the kind and the amount of inherent disor-
quasicrystals. der, need to be unraveled. For this reason, the following
discussion focuses on the geometric part.
FIGURE 6 A single-grain icosahedral HoMgZn quasicrystal, grown from the ternary melt, on a millimeter-scale
background. [Figure courtesy of P. C. Canfield and I. R. Fisher; reprinted with permission from Fisher, I. R., Cheon,
K. O., Panchula, A. F., Canfield, P. C., Chernikov, M., Ott, H. R., and Dennis, K. (1999). “Magnetic and transport
properties of single-grain R–Mg–Zn icosahedral quasicrystals [R = Y, (Y1−x Gdx ), (Y1−x Tbx ), Tb, Dy, Ho, and Er].”
Phys. Rev. B 59, 308–321. Copyright 1999 by the American Physical Society.]
pentagon tiling is shown in Fig. 11, exhibiting perfect 10- ing is of the same kind no matter at what length scale it is
fold symmetry. probed.
It is worth mentioning that the golden mean τ , and in Besides this nice property, the Penrose tiling has another
fact the Fibonacci sequence itself, reappears in 10-fold “magic” property that it shares with a number of other
and icosahedral tilings as well as in their diffraction pat- tilings used in description of quasicrystalline structures.
terns. This is related to the fivefold rotational symmetry It is the existence of perfect matching rules, which means
common to these structures because τ = 2 cos(π/5). In that there exist a marking of the basic prototiles and a
the diffraction pattern of Fig. 11, it can be recognized set of local rules that determine the possible local neigh-
as the length ratio of distances between peaks of similar borhoods of a marked tile such that, if the marked tiles
intensity. This is related to a rescaling symmetry of are assembled as in a jigsaw puzzle, the resulting tiling is
the quasiperiodic tiling, the so-called inflation/deflation aperiodic and, in fact, is indistinguishable from a Penrose
symmetry. In essence, it is the higher dimensional version tiling. However, these matching rules do not provide a
of the substitution rule that was used to construct the constructive instruction to produce a perfect quasiperi-
Fibonacci sequence: in an inflation step, each tile is dis- odic tiling: in general, after assembling a number of tiles,
sected into several parts of tiles such that a tiling emerges one meets the situation that it is impossible to add a tile
whose tiles are just scaled copies of the original prototiles without violating the rules, and there is no information on
and that, upon rescaling, is equivalent to the original tiling. the location where the arrangement of tiles needs to be
Deflation is the reverse process, in which a number of altered to rectify the problem.
tiles is replaced by a larger tile. For the Penrose tiling, the Quasiperiodic tilings are a natural generalization of
linear rescaling factor associated to this symmetry is just periodic lattices that can account for the noncrystallo-
τ again. This property means that the quasiperiodic order- graphic symmetries found in diffraction experiments. The
L S L L S L S L L S L L S L S L L S L S L
FIGURE 7 (a) The Fibonacci quasicrystal and (b) its diffraction pattern.
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
L
S
L
S
L
L
S
L
L
S
L
S
L
L
S
L
FIGURE 8 Projection of the Fibonacci quasicrystal from the two-dimensional square lattice.
beautiful example of several difierent modifications of the sic constituents of their structure. These may be conceived
decagonal phase in AlCoNi is shown in Fig. 12. Unfortu- as particularly stable local configurations of atoms, and a
nately, the information contained in the diffraction pattern simplistic picture of a quasicrystal would be a conglomer-
is not sufficient to reconstruct the structure, as the phase ate of such clusters held together by “glue atoms.” While
information is lost. This is why holographic methods this approach can easily be combined with an underlying
that can access the phase information have attracted tiling picture, an alternative concept has recently attracted
increasing interest. For quasicrystals with one periodic growing attention, according to which a covering of space
direction, however, high-resolution transmission electron rather than a tiling is employed.
microscopy along the periodic direction yields direct The difference between a covering and a tiling is the
information on the spatial distribution of structural units. possibility of overlaps. An example of a covering by a
An example for a decagonal AlCoNi quasicrystal is shown regular decagon is shown in Fig. 14. Here, the possible
in Fig. 13a. Connecting similar contrasts in the image, one overlaps are restricted by the markings of the decagon
arrives at the tilings of Fig. 13b, at difierent length scales and the resulting structure is in fact completely equivalent
connected by multiplication with τ . This observation is to the Penrose tiling of Fig. 10. This shows one of the
related to the inflation/deflation symmetry of the Penrose advantages of the covering picture: only a single “quasi
tiling mentioned above. For this particular alloy, the tilings unit cell” is needed, whereas at least two different tiles
obtained in this way are very close to ideal quasiperiodic are required in a Penrose tiling. Furthermore, the shape of
tilings. This is corroborated by Fig. 13c, which shows the quasi unit cell resembles the typical motives observed
the projection into “internal space” corresponding to the in electron microscopy, whereas the tiles usually have to
cross section of the strip in the projection of Fig. 8. If be imposed artificially. Last but not least, the picture is
the tiling were perfect, all projected points would come very much reminiscent of interpenetrating atomic clusters,
to line inside the decagons. Indeed, for the three experi- which is rather appealing from the physical point of view.
mentally derived tilings of Fig. 13b, only few points fall In fact, the structures that are equivalent to Penrose tilings
outside the decagons. For this reason, this particular high- are characterized by a maximal density of clusters.
temperature AlCoNi phase is referred to as a highly perfect An example of a structure model of a decagonal AlCoNi
quasicrystal. quasicrystal based on the decagon covering is shown in
Fig. 15. The model was chosen such that it fits the features
seen in atomic resolution Z-contrast scanning transmission
C. Clusters and Coverings
electron microscopy. Here, the size of the basic decagonal
Right after the discovery of quasicrystals, the idea cluster is 2 nm. Note the asymmetric decoration, matching
emerged that highly symmetric atomic clusters are the ba- the asymmetric contrast in the Z-contrast image.
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
(a)
(b)
FIGURE 9 (a) Scanning tunneling microscopy image of a 1.5-nm silver film on a GaAs(110) surface and (b) a detail
showing two phason defects marked by arrows. [Reprinted with permission from Ebert, Ph., Chao, K.-J., Niu, Q., and
Shih, C. K. (1999). “Dislocations, phason defects, and domains walls in a one-dimensional quasiperiodic superstruc-
ture of a metallic thin film.” Phys. Rev. Lett. 83, 3222–3225. Copyright 1999 by the American Physical Society.]
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 11 Fourier transform of the Penrose pentagon tiling. The diffraction peaks are represented by disks whose
areas are proportional to the intensity.
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 12 Electron diffraction images of four of the eight known structural modifications of the decagonal phase in
AlCoNi. [From Ritsch, S., Beeli, C., Nissen, H.-U., Gödecke, T., Scheffer, M., and Lück, R. (1998). Phil. Mag. Lett. 78,
67–75.]
FIGURE 13 (a) High-resolution electron microscopy image of a decagonal AlCoNi quasicrystal, and tiling analysis
(b) in physical and (c) in internal space. [From Ritsch, S., Beeli, C., Nissen, H.-U., Gödecke, T., Scheffer, M., and Lück,
R. (1996). “Highly perfect decagonal Al–Co–Ni quasicrystals.” Phil. Mag. Lett. 74, 99–106.]
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 14 This covering of the plane by marked regular decagons was shown to be equivalent to the Penrose tiling
of Fig. 10 by P. Gummelt.
flips to obtain a random tiling such as shown in Fig. 16. As a further benefit, the random tiling picture can also
The disorder may not be apparent on first view, but, for resolve the somewhat mysterious growth mechanism of
instance, the star-shaped tiles are distributed in a less reg- quasicrystals. The problem with quasicrystal growth is
ular way and may appear at short distances in the ran- that a perfect quasiperiodic structure such as the Penrose
domized tiling, while they are separated by at least two tiling in Fig. 10 cannot be grown by local growth rules.
pentagons in the perfect tiling. In other words, the ran- Thus, even though the perfect matching rules would allow
dom tiling comprises local configurations that are absent for an assignment of energies to local configurations that
in the perfect case. In the projection setup, this can be result in a perfect quasiperiodic ground state, the random
interpreted as an arbitrary deformation of the projection tiling scenario seems to be much more realistic, with the
strip of Fig. 8. It turns out that arrangements that lead to system choosing among many possible local configura-
the highest statistical symmetry are entropically favored, tions that just differ slightly in energy.
which may be interpreted as an entropic mechanism that
stabilizes quasicrystals. Although the tilings are stochas- III. PHYSICAL PROPERTIES OF
tic, their diffraction images are still believed to be point- QUASICRYSTALS
like. Moreover, most electron microscopy investigations
of the local order in quasicrystals support this stochastic In contrast to the incommensurate crystals, quasicrystals
picture. constitute a fairly coherent class of materials which share
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 15 Structure model of decagonal AlCoNi and Z-contrast images. (a) Decoration of the Gummelt decagon
by transition metal atoms (large circles) and aluminum atoms (small circles). Dark spots refer to positions c = 1/2
along the periodic axis, light symbols to c = 0. The arrows denote positions that moved significantly during a first-
principles relaxation of the structure. (b) The structure superimposed on a Z-contrast image. (c) Lower resolution
Z-contrast image, several clusters, and their overlaps. [From Yan, Y., and Pennycook, S. J. (2001). “Chemical ordering
in Al72 Ni20 Co8 decagonal quasicrystals.” Phys. Rev. Lett. 86, to appear. Copyright 2001 by the American Physical
Society.]
similar physical properties. These are briefly summarized metallic systems, which have to be prepared synthetically.
below. A compilation of the concentrations of the components
and the temperature ranges where such structures exist
is provided by phase diagrams, which are particularly
A. Appearance of Quasicrystals in Nature
important for sample preparation. Most of the systems
One may wonder why it took about 30 years after the forming quasiperiodic and incommensurate crystals show,
first investigations of incommensurably modulated crys- in addition, a variety of phases with different struc-
tals until quasicrystals were finally discovered. Arguably, tures. Since the complexity of phase diagrams increases
one reason for this is the natural appearance of incom- with the number of phases, phase diagrams of such
mensurate crystals. Whereas incommensurate phases are systems are generally intricate. An example is given
found in minerals like plagioclase feldspars, the predom- in Fig. 17, which shows a cut through the aluminum-
inant number of the known quasicrystals occur in inter- rich part of the three-dimensional phase diagram of the
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
ternary Al–Pd–Mn system at a constant temperature of material happens by the migration of defects. Quasicrys-
873 K. talline structures possess special kinds of defects not ex-
isting in crystalline structures. Besides dislocations, one
finds structural rearrangements, so-called phasons, which
B. Morphology
do not generate structural misfits but destroy the per-
In certain cases, it is possible to grow single-grain qua- fect quasiperiodic order. Quasicrystals are mostly very
sicrystals from the melt. An example is shown in Fig. 6. hard and brittle, a very common property of intermetal-
The pentagonal surfaces of the dodecahedral crystal are lic alloys. For example, the Vickers hardness of Al-based
perpendicular to the fivefold axis of the icosahedral struc- quasicrystals is comparable to the hardness of steel and
ture. The existence of other planes with different sym- slightly lower than the hardness of silicon. The brittle-
metries was corroborated by the investigation of voids in ness of the quasicrystalline alloys is expressed in their
quasicrystalline alloys. The magnificent electron micro- low toughness, which is around 40 times lower than that
scopic picture in Fig. 18 depicts the surface of a hole of other Al-based alloys. This circumstance changes at
inside an icosahedral quasicrystal. A variety of differ- higher temperatures, above about 900 K, where a brittle-
ent polygons emerge, indicating twofold, threefold, and to-ductile transition was experimentally observed in which
even more complex surfaces. For decagonal quasicrystals, a softening of the material occurs. For the explanation of
a prismatic morphology prevails. These needle-shaped this behavior, the knowledge of the structure and the ki-
crystals, which often show a decagonal cross section, netics of the defects is required. Due to the high symmetry
form as a result of an anisotropic growth of the quasicrys- of icosahedral quasicrystals, a higher isotropy of the me-
talline grains, which usually grow considerably faster in chanical properties compared to crystals was expected and
the periodic direction than in the quasiperiodic planes. was verified by experiments.
C. Mechanical Properties
D. Electronic Properties
The mechanical properties of metallic alloys are strongly
influenced by the type and the concentration of struc- As it is the electronic interaction of the charged con-
tural defects. For instance, the plastic deformation of the stituents that holds a solid together, the electronic structure
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 17 Cut through the Al-rich part of the three-dimensional phase diagram of the ternary system Al–Pd–Mn at
a constant temperature of 873 K. The concentration of the three components is represented by a plane. Lines in the
diagram separate regions of thermodynamic equilibria of one or more phases, depending on the concentration. For
most compositions, no structure can exist (light and middle gray). Those alloys will decompose into several phases
with different compositions. Regions where only a single phase exists (dark gray) are always separated by regions in
which several phases are coexistent. Remarkable is the coexistence of a decagonal (D) and an icosahedral (i) phase
in the two-phase region (D + i) in one ternary system. [Figure courtesy of T. Gödecke and R. Lück; From T. Gödecke
and R. Lück (1995). “The aluminum–palladium–manganese system in the range from 60 to 100 at. % Al.” Z. Metallkd.
86, 109–121.]
of a solid is important for stability. If, in turn, the spa- tion is assumed. Its main fingerprint is the development
tial arrangement of the atoms influences the electronic of a pseudogap in the electronic density of states at the
structure, a complex interplay between the electrons and Fermi level, which may also explain the transport anoma-
the structure will result. This is often observed in in- lies observed in quasicrystals. For instance, the electric
termetallic phases, where, under special conditions, the conductivity is very low, so quasicrystals are poor con-
electronic system favors special atomic structures. For ductors. The conductivity of quasicrystals, contrary to
many quasicrystals, a so-called Hume–Rothery stabiliza- the conductivity of metals, decreases enormously as the
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
FIGURE 18 A beautifully faceted hole in an icosahedral quasicrystal. [From Beeli, C., Gödecke, T., and Lück, R.
(1998). “Highly faceted growth shape of microvoids in icosahedral Al–Mn–Pd.” Phil. Mag. Lett. 78, 339–348.]
temperature is lowered, and it also appears to decrease often show diamagnetic behavior even though they con-
with increasing structural perfection of the sample. Sim- tain a fair proportion of transition metal atoms. Concern-
ilar anomalies are also observed in other transport prop- ing the ZnMgRE quasicrystals, the situation is different.
erties, such as thermal conductivity, Hall coefficients, and In contrast to the 3d states of the transition metals, the
thermopower. 4f states of the rare earth metals cannot be filled by the
electrons of the other constituents, and the magnetic mo-
ments survive. However, neutron diffraction experiments
E. Magnetic Properties show that these moments are only short-range ordered; no
long-range magnetic order in a quasicrystalline alloy has
As mentioned above, magnetic moments can form incom-
been found. At very low temperatures, around approxi-
mensurable phases, even if the moments are situated on
mately 4 K, these phases behave like spin glasses, which
a periodic lattice. Ordering phenomena of magnetic mo-
means that the short-range order of the spins becomes
ments in quasicrystals could be rather interesting due to the
frozen.
geometric frustration that may be caused by the aperiodic
structure. Experimentally, magnetic properties were in-
vestigated mainly for Al-based quasicrystals and for qua-
F. Applications
sicrystals with the composition ZnMgRE, where RE de-
notes a rare earth metal. Besides the approximately 70 at % An exceptional property of some quasicrystalline phases
Al, many Al-based quasicrystals contain transition metals is their very low surface energy, which results in a wetting
such as Mn, Fe, Ni, and Co. In the pure metals, these atoms of the surface which lies in between PTFE (Teflon) and
show magnetic moments, which originate from partially normal metals. Their high resistivity against scratches
occupied 3d states. As a consequence of the changed elec- makes quasicrystalline materials well suited for coatings.
tronic structure in the quasicrystal, however, these mag- The hardness as well as the low weight of quasicrystalline
netic moments vanish together with the partial occupation materials can be exploited in composite materials, in
of the 3d states. Thus, high-quality Al-based quasicrystals which advantageous properties of the components can be
P1: GNB/FEE P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN007D-335 June 30, 2001 15:29
combined. Icosahedral quasicrystals based on titanium SEE ALSO THE FOLLOWING ARTICLES
can store up to two hydrogen atoms per metal atom,
which makes them good candidates for use in hydrogen CRYSTAL GROWTH • CRYSTALLIZATION PROCESSES •
technology. CRYSTALLOGRAPHY
BIBLIOGRAPHY
IV. CONCLUDING REMARKS
Axel, F., and Gratias, D. (eds.). (1995). “Beyond Quasicrystals,” Editions
Aperiodic crystals not only form a fascinating chapter of de Physique, Les Ulis, France, and Springer, Berlin.
modern crystallography, but are also of importance for a Baake, M., and Moody, R. V. (eds.). (2000). “Directions in Mathematical
Quasicrystals,” AMS, Providence, RI.
variety of scientific disciplines. On the mathematical side,
Blinc, R., and Levanyuk, A. P. (eds.). (1986). “Incommensurate Phases
one is interested in the aperiodic ordering and its math- in Dielectrics,” North-Holland, Amsterdam.
ematical description as well as in a characterization of Grünbaum, B., and Shephard G. C. (1987). “Tilings and Patterns,”
the plethora of possible structures that still may be found Freeman, New York.
to exist. As far as the physics of quasicrystals is con- Hippert, F., and Gratias, D. (eds.). (1994). “Lectures on Quasicrystals,”
Editions de Physique, Les Ulis, France.
cerned, the understanding of the physical properties on
Janot, C. (1994). “Quasicrystals: A Primer,” 2nd ed., Clarendon Press,
the basis of their structure is at the center of the interest. Oxford.
However, this may first require a more detailed knowl- Janssen, T., and Janner, A. (1987). “Incommensurabilty in Crystals.”
edge about the structure than is available to date, and, in Adv. Phys. 36, 519–624.
particular, a thorough account of the type and amount of Moody, R. V. (ed.). (1997). “The Mathematics of Long-Range Aperiodic
Order,” Kluwer, Dordrecht.
the inherent disorder in quasicrystals. Current technolog-
Patera, J. (ed.). (1998). “Quasicrystals and Discrete Geometry,” AMS,
ical applications of quasicrystals, partly still in a prelimi- Providence, RI.
nary stage, look promising, and further research should be Senechal, M. (1995). “Quasicrystals and Geometry,” Cambridge Univer-
rewarding. sity Press, Cambridge.
Aperiodic crystals, like other surprising discoveries, Stadnik, Z. M. (ed.). (1999). “Physical Properties of Quasicrystals,”
Springer, Berlin.
have again taught us that even long-held beliefs in science
Steinhardt, P. J., and Ostlund, S. (eds.). (1987). “The Physics of Qua-
may eventually prove wrong. Who knows—even though sicrystals,” World Scientific, Singapore.
it appears improbable today, maybe some day someone Yamamoto, A. (1996). “Crystallography of Quasiperiodic Crystals.” Acta
will come up with a sevenfold quasicrystal. Cryst. A 52, 509–560.
P1: GPB Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008c-349 June 29, 2001 12:30
25
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN008c-349 June 29, 2001 12:30
I. DESCRIPTION OF THE MODEL range of the attractive strong interaction, that is, oppo-
sitely aligned angular momenta of the alike nucleons pro-
The shell model treats the nucleus as a system of neu- duce maximum overlap of the nucleons’ wave functions
trons and protons interacting through the strong interac- and so the largest interaction. By similar reasoning, the
tion. Neutrons and protons are collectively referred to as next lowest energy states of the two alike nucleons are
nucleons and are fermions, because they have an intrin- J = 2, then J = 4, etc. Figure 1 shows the relative strength
sic spin angular momentum of one-half. As fermions, they of the alike-nucleon interaction versus the total angular
satisfy Fermi–Dirac statistics and obey the Pauli exclusion momentum J of the nucleons. For two alike nucleons
principle, which states that no two fermions can occupy in the same level, only even total angular momenta can
the same state in the same system, that is, they cannot have occur because of the Pauli exclusion principle, which re-
the same set of classifying quantum numbers. The Pauli quires their total wave function to be antisymmetric un-
exclusion principle leads to the filling of shells (or levels) der particle interchange. The above empirical observation
produced by the mean field of the nucleons. As in atoms, suggests that building blocks of nucleon pairs of angu-
the filling of a shell leads to a highly stable structure, with lar momentum zero and two may play an important role
all the angular momenta of the nucleons in the shell sum- in determining low-lying nuclear properties. A system of
ming up to zero. In the shell model, such structures are fermion pairs is symmetric under the interchange of any
assumed to be inert, and nuclear properties are described two pairs. Consequently, such pairs are boson-like objects.
in terms of the remaining nucleons (that is, the valence These observations, together with the known phenomeno-
nucleons) moving outside the closed shells. logical usefulness of angular-momentum-two bosons in
When two alike nucleons occur outside a closed shell, the geometrical model, provide the motivation for the
it is observed that their angular momenta couple to zero IBM.
in the nuclear state of lowest energy, that is, the ground The basic assumption of the IBM is that an even–
state. In fact, it is found empirically that the ground- even nucleus of Np valence protons and Nn valence neu-
state angular momenta (J ) of all even–even nuclei are trons in the shell model can be treated as a system of
zero. The physical explanation of this result is the short n p = Np /2 valence proton bosons and n n = Nn /2 valence
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN008c-349 June 29, 2001 12:30
odd–odd nuclei and β-decay, which cannot be done with is possible if some groups in the fermion classification
the IBM-2. (that is, group chain) coincide with some groups in the
boson classification [see Eq. (1)]. Combined bose–fermi
groups can then be introduced corresponding to particular
V. INTERACTING BOSON–FERMION couplings of the bosons and fermions.
MODEL It was, of course, known that the conservation of the
total angular momentum requires the combined system
The empirical success of the IBM for even–even nuclei en- to be invariant under the total angular momentum oper-
couraged its developers to expand it to odd–even and even– ator (the sum of parts acting on the bosons and on the
odd nuclei by the addition of one fermion. This odd- A fermions), generating SOBF (3). But it was found that spec-
version is known as the interacting boson–fermion model tra are often closer to those associated with combining
or IBFM. The IBFM Hamiltonian contains a boson term, the larger groups in Eq. (1), such as SO(6), with their
a fermion term, and a third term representing the interac- fermion counterparts, implying the conservation of less
tion (or coupling) between the bosons and fermion. As in well-understood quantities.
the IBM, the IBFM can be discussed in symmetry limits, Figure 4 illustrates the spectrum of an odd-A nucleus
as in Eq. (1), in which the odd fermion is coupled to the related by such a bose–fermi symmetry to the spectrum
valence bosons in either the U(5), SU(3), or SO(6) limits. of its even–even neighbor. It was claimed that cases such
The different limits can be related to particular cases in as those shown in Fig. 4 are examples of supersymmet-
the collective model, such as the strong-coupling or weak- ric structure in nuclei. However, supersymmetry conven-
coupling limits. tionally refers to a description involving a superalgebra,
The addition of an odd fermion greatly increases the which is an algebra containing operators that transform
number of possible parameters in this model; the number bosons into fermions and vice versa. In fact, the cases
of possible states also greatly increases. For these reasons, and examples given then and in later work are for bose–
the IBFM has been applied mainly to special cases, such as fermi symmetries rather than true supersymmetries. Nev-
a nucleon in a single j level or in several j levels with the ertheless, the fact that the properties of certain neighbor-
imposition of some boson–fermion symmetry (see Sec- ing even–even and odd-A nuclei can be related by the
tion VI). This model has also been used for studies of same group-theoretical chains and the same Hamiltonian
β-decay between odd–even and even–odd nuclei. is of significant interest and provides new insight into the
structure of complex nuclei. Present investigations regard-
ing high-spin superdeformed bands indicate that superde-
VI. BOSON–FERMION SYMMETRIES formed bands in certain neighboring even–even and odd-A
nuclei may prove to be the best examples of bose–fermi
In 1980, Iachello observed that in certain cases new sym- symmetries in nuclei. Recent work with superalgebras in-
metries, corresponding to simultaneous transformations of dicates that examples of real supersymmetries may exist
the boson and fermion systems, can be introduced. This in nuclei.
FIGURE 4 Comparison of the theoretically predicted (theory) and observed (experiment) energy spectra for 190 76 Os
and 191
77 Ir, as an example of a bose–fermi symmetry in nuclei. Each energy level is labeled by the value of the total
angular momentum J. The dashed line enclose levels of the same symmetry. The solid lines indicate levels between
which strong electromagnetic radiation occurs.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN008c-349 June 29, 2001 12:30
for use of the IBM, means that the validity of the pair in- SEE ALSO THE FOLLOWING ARTICLES
terpretation cannot be definitively tested. It simply is not
known whether the many-pair condensate analogues of the GROUP THEORY • NUCLEAR PHYSICS • PARTICLE
many-boson IBM states would indeed allow a reasonable PHYSICS, ELEMENTARY
approximation to the shell model eigenstates, nor whether
the shell model operators mentioned above would in fact
reproduce the data. As discussed above, the shell model BIBLIOGRAPHY
interpretation of the bosons must be appropriate for the
levels the IBM is used to model: for instance, the inter- Arima, A., and Iachello, F. (1984). “Advances in Nuclear Physics” (J.
W. Negele and E. Vogt, eds.), Vol. 13. Plenum, New York.
pretation for a description of all the low-energy rotational Barrett, B. R. (1984). “Nucleon–Nucleon Interaction and Nuclear Many-
bands of 156 Gd (Fig. 3.) would be different from that ap- Body Problems” (S. S. Wu and T. T. S. Kuo, eds.). World Scientific,
propriate for a description of the first, fourth, and fifth Singapore.
bands only. (For example, if the strength of the interac- Bonatsos, D. (1988). “Interacting Boson Models of Nuclear Structure,”
tion in the theoretical boson calculations were increased Clarendon Press, Oxford.
Casten, R. F. (ed.) (1993). “Algebraic Approaches to Nuclear Struc-
by around 50%, then the second and third model bands ture: Interacting Boson and Fermion Models.” Contemporary Conc
obtained would correspond to the fourth and fifth bands Physics, 6. Harwood Academic Publishers.
in the experiment, while the observed second and third Casten, R. F. and Feng, D. H. (1984). Nuclear dynamical supersymme-
bands would have no IBM counterparts). A possible fail- try. In “Physics Today,” Vol. 37. American Institute of Physics. New
ure of the S D pair interpretation is then apparent in exact York.
Casten, R. F., and Warner, D. D. (1988). The interacting boson approx-
calculations for lighter nuclei, where it is found that the imation. In “Reviews of Modern Physics,” Vol. 60. The American
many-pair states describe only some of the levels that the Physical Society, New York.
IBM would be used to model. A similar conclusion has Dieperink, A. E. L., and Wenes, G. (1985). “Annual Review of Nu-
been obtained in an approximate calculation for 156 Gd clear and Particle Science,” Vol. 35. Annual Review Inc., Palo Alto,
itself. If this situation does indeed persist in heavy nuclei, California.
Iachello, F., and Arima, A. (1987). “The Interacting Boson Model.”
it would have to be concluded that the simple interpreta- Cambridge Univ. Press, London and New York.
tion of the bosons as pairs is inconsistent with the use of Iachello, F., and Talmi, I. (1987). Shell-model foundation of the inter-
the IBM to model all the collective low-energy levels, as acting boson model. In “Reviews of Modern Physics,” Vol. 59. The
is invariably the case (Fig. 3). American Physical Society, New York.
There is much controversy in this area, which only fur- Iachello, F., and Van Isacker, P. (1990). “The Interacting Boson-Fermion
Model.” Cambridge University Press, London and New York.
ther research can resolve. A truly valid shell model inter- Mizusaki, T., and Otsuka, T., (ed.) (1996). Microscopic Study of the
pretation of the elegantly simple IBM would reveal a cor- Interacting Boson Model. In “Progress of Theoretical Physics: Sup-
respondingly simple latent structure amid the complexity plement,” Number 125. Yukawa Institute and the Physical Society of
of realistic shell model calculations. Japan, Kyoto.
This said, it is worth noting that after 25 years the Inter- Scholten, O. (1985). “Progress in Particle and Nuclear Physics,” Vol. 14.
Pergamon, Oxford.
acting Boson Model approach to describing the properties Talmi, I (1993). “Simple Models of Complex Nuclei: The Shell Model
of medium-to-heavy-mass nuclei has held up extremely and Interacting Boson Model.” Contemporary Concepts in Physics,
well and has proven itself to be quite versatile and robust. Vol. 7. Harwood Academic Publishers.
P1: GLQ Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
I. Brief History
II. Description of Liquid Crystal Phases
III. Properties of Liquid Crystals
IV. Applications
717
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
range of temperatures between those at which the ma- change dramatically with changes in temperature or other
terials are normal liquids and solid crystals. For other aspects of the environment.
liquid crystals, it is mainly the concentrations of differ- Early attempts to explain the turbid appearance of the
ent components of a solution that determine the phases. nematic liquid crystals included the idea that the molecules
Many solutions or pure compounds form several distinct in these materials grouped into swarms. The boundaries
phases at different temperatures or concentrations. Sev- between swarms would represent variations in the optical
eral thousand compounds have liquid-crystalline phases properties of the medium that could scatter light. It was
in their pure states. Many biological materials, such as later found that nematic liquid crystals, which are most of-
cell membranes, also display liquid-crystalline behavior. ten formed from rod-like molecules, are generally homo-
Since liquid crystals have both the ability to flow and the geneous throughout, with the long axes of the molecules
anisotropy of crystals, they display many properties of lining up parallel to one another. Long-wavelength ther-
great interest and, often, of practical importance. mal fluctuations in the direction of the alignment scatter
visible light.
Classification and identification of liquid crystal phases
I. BRIEF HISTORY were first done by using a polarizing microscope with a
heating stage. Observations of the textures and how they
The study of liquid crystals began with observations by an changed from phase to phase were useful in determin-
Austrian botanist, Friedrich Reinitzer, in 1888. Reinitzer ing some of the properties of the crystals. By mixing two
found that cholesteryl benzoate melted from a solid at compounds in various proportions and determining the re-
145◦ C into a liquid having a murky or turbid appearance. sulting phase transitions, it was possible to compare the
At 179◦ C the liquid cleared. The color of the turbid liquid phases they formed in their pure states. The present names
changed from red to blue as the temperature increased. for the different smectic states (A, B, C, etc.) were merely
On cooling, the reverse occurred. Reinitzer sent some of assigned as they were observed and do not necessarily
this material to O. Lehmann, who was able to make fur- bear a logical relation to their structures. For many of the
ther studies with polarized light on a microscope equipped states formed in lyotropic materials (those whose concen-
with a heating stage, which allowed him to vary the tem- trations determine the phases), there are several names
perature of samples being observed. Lehmann discovered arising from different lines of research.
that the turbid liquid actually displayed optical anisotropy The study of liquid crystals continued fairly strongly
or birefringence, as do solid crystals. The combination of into the 1930s, including measurements of viscosity co-
the ability of the material to flow like a liquid and yet retain efficients and development of theoretical models for the
the anisotropic optical properties of a crystal led Lehmann elastic and flow properties of nematics. The renewed in-
to coin the name “liquid crystal” to describe this state of terest in liquid crystals in this country owes much to Glenn
matter. H. Brown of Kent State University, who organized a se-
The turbid appearance, resembling that of a colloidal ries of international conferences starting in the 1960s
solution, gave rise to early ideas that the liquid crystal was and founded the Liquid Crystal Institute at Kent State
no more than such a solution, but it was later found that the University. Low-power electronic liquid crystal displays
liquid-crystalline state is a distinct phase of matter with (LCDs) are now found in a very wide range of devices.
fixed transition temperatures into the solid and normal or Further developments in displays and other uses of liq-
isotropic liquid states and that the molecules in a liquid uid crystals have stimulated a wide range of investiga-
crystal have orientational order. tions in this country and abroad into the nature of these
In 1922, G. Friedel proposed a classification system that phases.
is still used extensively, dividing liquid crystals into three
classes: smectic, nematic, and cholesteric. In a smectic
phase, the molecules are arranged in sets of parallel planes.
The smectic phases generally have high viscosity and a II. DESCRIPTION OF LIQUID
soapy appearance; hence the name was derived from the CRYSTAL PHASES
Greek word for “soap.” The word “nematic” is derived
from the Greek word for “thread.” A nematic liquid crystal The most obvious difference between liquids and solids
often shows a thread-like pattern when placed between is the ability of a liquid to flow or to adapt its shape to its
crossed polarizers and viewed through a microscope. The container under the influence of small external forces. On
third class was named “cholesteric” since the molecules the molecular level, a crystalline solid has long-range or-
forming these phases commonly contained cholesterol. A der, that is, a strong correlation between the positions and
cholesteric has a characteristic iridescent color, which can orientations of molecules that are far apart, whereas such
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
Smectic E
Diethyl p-terphenyl- p- p”-carboxylate 173–189
C2H5OOC COOC2H5
Smectic G
N 2-( p-Pentylphenyl)-5-( p- SG 79–103
n-H11C5 O C5H11-n pentyloxyphenyl)pyrimidine
N
continues
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
TABLE I (Continued )
Liquid crystalline
Formula Name range (◦ C)
Smectic C
p-n-Octyloxybenzoic acid 108–147
n-H17C8 O COOH
Smectic D
p -n-Octadecyloxy-3 -nitrodiphenyl- p-carboxylic acid 159–195
n-C18H37 O COOH
O2N
Smectic F
N 2-( p-Pentylphenyl)-5-( p-pentyloxyphenyl)pyrimidine SF 103–114
n-H11C5 O C5H11-n
N
infinite pitch, which correspond to nematics. A cholesteric c. Blue phases. Some cholesteric compounds ex-
is formed by adding a small amount of a cholesteric, or hibit a phase between the cholesteric and isotropic phases,
even a substance that is not in itself liquid crystalline but usually in a narrow temperature range. The local molec-
is optically active, to a nematic substance. The pitch of ular orientation shows a three-dimensional periodicity
a cholesteric can vary sensitively with temperature, va- and is perhaps a stable lattice of defects in the uniform
por pressure of certain substances, and other influences. cholesteric structure. Mechanically, the behavior is sim-
When the pitch corresponds to the wavelength of visible ilar to that of a cubic crystal with a large resistance to
light, the scattered light is highly colored. A cholesteric shear (a shear modulus of several hundred to several
also rotates the direction of linearly polarized light, that thousand ergs per cubic centimeter). Several blue phases
is, is optically active. This activity is roughly 1000 times exist.
stronger than the activity of an ordinary optically active
substance such as quartz. d. Discotic. Disklike molecules may have a discotic
phase in which the molecules are stacked aperiodically,
forming liquidlike columns having nematiclike ordering
of the symmetry axes of the molecules. Some of these
materials have smectic phases similar to the smectic A in
which the columns form a hexagonal array. These phases
thus exhibit translational order in two dimensions, but not
in the third. The appearance under a microscope is similar
to that of an ordinary nematic.
2. Smectic Phases
The various smectic phases have, in addition to the ori-
entational order found in nematics, different degrees of
positional and, in some cases, bond-orientational order-
ing (Fig. 3). On the basis of appearance under a po-
larizing microscope, miscibility with known phases, and
X-ray scattering, at least nine thermotropic smectic phases
have been identified, although not all are truly liquid crys-
tals. Of these nine phases, eight have a characteristic pack-
ing of the molecules in layers. The ninth phase, known as
smectic D (the letters used to denote the different phases
FIGURE 2 A cholesteric or twisted nematic. On a small scale,
the molecules behave as in the nematic phase. Over longer dis- were assigned in chronological order as the phases were
tances the director rotates along a helix whose pitch is sensitive first observed), has a cubic packing. In the smectic A, B,
to changes in temperature, pressure, etc. [Courtesy of Nuno Vaz.] and E phases, the molecules align with their long axes
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
I. Nematics
Ordinary nematic Uniaxially positive Schlieren; threaded Parallelism of long p-Azoxyanisole; p-methoxybenzylidene
marbled; molecular axes p-n-butylaniline
pseudoisotropic;
homogeneous
Cholesteric nematic Uniaxially negative; Focal conic with Nematic packing in planes; Cholesteryl nonanoate
or isotropic Grandjean steps; superimposed twist in
optically active homogeneous; direction perpendicular
isotropic to long axes of molecules
II. Structured smectics
Smectic B Uniaxially or Mosaic; stepped drops; Layer structure; molecular Ethylethoxybenzylideneaminocinnamate;
biaxially positive pseudoisotropic; axes orthogonal or tilted terephthal-bis-butylaniline
homogeneous; to layers; hexagonal
schlieren arrangement within layers
Smectic E Uniaxially positive Mosaic; pseudoisotropic Layer structure; molecular di-n-Propylterphenyldicarboxylate
axes orthogonal to layers;
ordered arrangement
within layers
Smectic G Uniaxially positive Mosaic Layer structure with ordered 2-(4-n-Pentylphenyl)-5-
arrangement within layers (4-n-pentyloxy-phenyl)pyrimidine
III. Unstructured smectics
Smectic A Uniaxially positive Focal conic (fanshaped Layer structure; molecular Diethylazoxybenzoate
or polygon); stepped axes orthogonal to layers;
drops; homogeneous; random arrangement
pseudoisotropic within layers
Smectic C Biaxially positive Broken focal conic; Layer structure; molecular Dodecyloxyazoxybenzene
schlieren; axes tilted to layers;
homogeneous random arrangement
within layers
Smectic D Isotropic Isotropic; mosaic Cubic structure 4 -Octadecyloxy-3 -nitrodiphenyl-4-
carboxylic acid
Smectic F Uniaxially positive Schlieren; broken Layer structure 2-(4-n-Pentylphenyl)-5-(4-n-pentyloxy-
focal conic with phenyl)pyrimidine
concentric axes
a From Brown, G. H., and Wolken, J. J. (1979). “Liquid Crystals and Biological Structure,” pp. 30–31, Academic Press, New York.
perpendicular to the layers. In the tilted smectics, C, F, G, Extremely thin films, down to one molecular layer thick,
H, and I, the long axes are at an angle with the layer nor- have been made with smectics. These films are being used
mals. The smectic A, C, D, and F phases are unstructured to probe surface effects as well as new thermodynamic
smectics: They do not show an ordered arrangement of phase behavior.
molecules within layers. The smectic A, B, and C phases In the discussion that follows, the smectic phases
are the best known of the smectic phases. are treated in alphabetic order, which does not always
One variation on the ordinary smectic structure can oc- correspond to the sequence of phases observed on heating
cur in optically active compounds having a tilted smectic or cooling.
phase or in a tilted smectic phase to which a small amount
of a chiral compound has been added. A macroscopically
chiral structure can form in which the directors of adjacent a. Smectic A. Smectic A liquid crystals (Fig. 3) are
layers of molecules form a small angle, giving an uniform the least ordered of the untilted or orthogonal smectic
twist and resulting in a strongly optically active substance. phases. The molecules are arranged in layers with the
Some of these chiral smectics are ferroelectric and have director perpendicular to the layers. Except over short
the potential for applications in displays with fast response distances, the molecules show no correlations in position
times (microseconds). within the layers. Studies of X-ray scattering by smectic
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
N O
CH3O N N OCH3
p-Azoxyanisole
N H
CH3O C N C4H9-n
4-Methoxybenzylidene-4 -n-butylaniline
O
Diallylazoxybenzene-4, 4 -dicarboxylate
Ch,A CH3
CH(CH3)(CH2)3 CH(CH3)2
H3C
CH3(CH2)7COO
Cholesteryl nonanoate
A,B H
C2H5O C N CH CHCOOC2H5
Ethyl-4-ethoxybenzylidine-4 -aminocinnamate
N,A,C
n-C6H13 O COOH
O2N
4 -n-Hexyloxy-3 -nitrobiphenyl-4-carboxylic acid
A,C,B H
n-C10H21O C N CH CH C O-n-C5H11
O
n-Amyl-4-n-decyloxybenzylidene-4 -aminocinnamate
N,A,C,B H H
C2H5 O C CH HC N C C N CH CHC OC2H5
O O
Diethyl terephthalylidene-bis-(4- aminocinnamate)
a From Brown, G. H., and Wolken, J. J. (1979). “Liquid Crystals and Biological Structure,” pp. 32–33, Academic
B. Lyotropic Phases
1. General Characteristics FIGURE 4 Three molecular models of the same lipid in different
configurations. The lipid, dipalmitoyl phosphatidylcholine, consists
Mixtures of two or more components that change phase of two hydrocarbon tails, (CH2 )14 CH3 , linked to a head group,
with changes of concentration are called lyotropic. Al- (CH3 )3 N+ (CH2 )2 PO− 4 , by ester linkages and a glycerol back-
bone, (OCO)2 (CH2 )2 CH. The molecule on the left has a single jog
though compounds that form thermotropic phases are by
(gauche rotation) in one tail, while the other tail is all trans. The
no means uncommon, lyotropic mixtures are very famil- middle molecule has a kink (gauche–trans–gauche sequence) in
iar; soap-and-water being one of the prime examples of one tail. [Courtesy of H. L. Scott, J. F. Nagle, and the American
a two-component system with lyotropic phases. Mixtures Institute of Physics; from “Biomembrane phase transitions,” Phys.
of three or more components are also common, one exam- Today 31(2), 38–47 (1978).]
ple being the use of an emulsifier to mix the oil and water
or vinegar of a salad dressing. The soap-and-water mix-
ture, of course, is most useful when the soap molecules When a crystalline amphiphile is added to water, sev-
allow grease to “dissolve” in the water, forming a three- eral mesophases can be observed, ranging from a true so-
component system. Although water is a very common lution to the crystal state. Some of the phases may show
component of lyotropics, other solvents can be used as smectic or lamellar packing (in layers) or even cubic or
well. Most lyotropic phases involve the solution of rod- hexagonal molecular packing. The amphiphile can also
like molecules or aggregates of molecules in a normally aggregate into structures such as spherical or cylindri-
isotropic solvent such as water. It is principally the rod-like cal micelles. Micelles have the ability to solubilize an
entities that become ordered. otherwise insoluble chemical by encapsulating it. This
Soaps are a simple example of a whole class of mol- is what soap does to dissolve grease. Similarly, water
ecules, called surfactants or amphiphiles, which form ly- can be dissolved in oil by inverse micelles formed by
otropic phases in water (Fig. 4). These molecules are called surfactant molecules with the tails pointing outward in-
amphiphiles (from the Greek amphi, meaning “of both stead of inward. One common sequence of mesophases
kinds,” and philo, meaning “loving”). One part of an am- obtained on adding water is as follows: solid, lamel-
phiphilic molecule, the polar “head,” has an affinity for lar liquid crystal, cubic liquid crystal, hexagonal liquid
polar solvents such as water (hydrophilic), while the other crystal, micellar, homogeneous solution. Table IV gives
end, the organic “tail,” is relatively insoluble in water (hy- some of the properties of some of the common lyotropic
drophobic). The result of these opposite tendencies is that phases.
the molecules prefer to organize themselves into surfaces
(hence the name “surfactants”) with the polar heads point-
2. Lamellar
ing toward the water. Such systems may form a number
of possible structures, depending on the concentrations of These phases, also called “neat” or G phases, correspond
the components and the shapes of the molecules involved to the thermotropic smectic phases; that is, they are
(Fig. 5). The amphiphilic molecule is usually represented characterized by layers having a well-defined thickness
in a figure with a circle as the polar head and a wiggly but no structure within the layers (Fig. 6a). Different
line as the organic tail. Simple soaps often have a string lamellar phases have been found in the same system. In
of hydrocarbons forming a hydrophobic tail attached to a the phase designated Lα , the α is used to indicate that
polar head. A second type of molecule has two such tails the hydrocarbon chains in the tail of the amphiphile are
attached to a polar head. fluid or flexible. It is possible to observe a transition from
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
4. Cubic
A cubic structure, also referred to as the viscous isotropic
or V1 phase, sometimes forms at amphiphile concen-
trations between those producing lamellar and hexago-
nal phases. Ordinary optical observations show only an
isotropic structure. X-ray diffraction studies show that the
surfactant molecules pack into spheres, which then pack
into a face- or body-centered-cubic lattice. The inverted
structure (V2 ) can also form between the lamellar and in-
verse hexagonal phases. Another viscous isotropic phase
(S1c ) has been observed at concentrations of amphiphile
lower than that of the hexagonal phase.
5. Nematic
Lyotropic nematics have been observed for which the
optical axes are easily oriented, as is the case for ther-
motropic nematics. The basic units that align are not sin-
FIGURE 5 Characteristic phase diagrams of amphiphile–water gle molecules, but aggregates of molecules whose sizes
systems. Two-phase regions are shown shaded. (a) Strongly po-
lar amphiphiles (e.g., soaps, alkyl sulfates, quaternary ammo-
are comparable to those of micelles—20 to 100 Å.
nium salts, and lysolecithins). (b) Amphiphiles with relatively large
hydrophobic regions (e.g., monoglycerides and lecithins). [From
Friberg, S., and Larsson, K. (1976). “Liquid crystals and emul-
C. Polymeric Liquid Crystals
sions.” In “Advances in Liquid Crystals,” Vol. 2, Academic Press, Examples of liquid-crystalline order have been found in
New York, by permission.]
fluid polymer melts and solutions. Such polymers may
play an important role in the spatial organization of bio-
logical macromolecules, for instance, in the packaging of
one lamellar phase, such as Lα , to another in which the DNA in chromosomes and in the aggregation of micro-
tails “freeze,” that is, lose much of their flexibility. The tubules, which form the structural framework of cells. A
water and surfactant molecules in the lamellar phase are fiber formed from the liquid crystalline phase of a poly-
in alternate layers with the surfactant molecules forming mer, Kevlar, is an ultra-high-strength polymer that has a
double-thickness layers called bilayers in which the stiffness comparable to that of steel with a much lower
hydrophobic tails are separated from the water layers by mass density.
planes of polar heads. Single bilayers of lipid molecules Examples of both lyotropic and thermotropic polymeric
form the underlying structure of biological membranes phases have been studied. Solutions of synthetic polypep-
(Fig. 7). tides can form a helicoidal cholesteric structure in which
the polymers form twisted rods with pitches between 10−7
and 10−3 m. The rods may be separated by several mil-
3. Hexagonal
limeters. Unlike the monomeric thermotropics, however,
As water is added to a lamellar phase, the layer struc- the rods can be untwisted into a nematic phase by changing
ture can be replaced by one in which the surfactant temperature. Polymers having thermotropic liquid crys-
molecules apparently form cylindrical structures with talline mesophases have been made by adding single liquid
the polar heads forming the outer shell. The cylinders crystalline molecules to polymer chains to form either
line up in hexagonal arrays with the water between comb-like or linear polymers. In a comb-like polymer, the
(Fig. 6b). This phase is also called the middle, or M1 , liquid crystal monomers are attached by flexible links to
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
Suggested structural
arrangement
the main chain, like the teeth of a comb. All the com- crystals end to end have been formed with stable liquid
mon thermotropic phases have been obtained in this way, crystalline phases in the range 100–400◦ C. The phases
with the possibility of locking the structure by quench- have properties in common with the monomeric liquid
ing (rapid cooling) in the presence of applied magnetic or crystals, but the response to external stimuli can be much
electric fields. Linear polymers formed by linking liquid slower.
FIGURE 6 Molecular arrangements in two lyotropic phases, shown in cross section. (a) A lamellar phase, in which
the amphiphile molecules form bilayers with their hydrophobic tails toward the layer centers, away from the water. (b)
The hexagonal phase, showing cross sections of rod-like structures having their axes perpendicular to the plane of
the drawing. Water is in the region between the cylinders. [Courtesy of Ging-Sheng Yu.]
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
splay: δn x /δx and δn y /δy Some of the concepts and results of the continuum the-
twist: δn y /δx and δn x /δy ory can be applied to other phases. In the smectic A phase,
for instance, the smectic layers are easily bent, correspond-
bend: δn x /δz and δn y /δz.
ing to a splay deformation, so K 1 has values similar to
those found in nematic phases. Twist and bend deforma-
Making use of the symmetries of the nematic phase, that tions of the director, on the other hand, are nearly ruled out,
is, invariance under rotations about the director and the as they require changes in the layer thickness comparable
equivalence of n and −n, the free energy density of a to the compression of a normal liquid. One would expect
nematic in bulk can be written to second order as follows: K 2 and K 3 to increase anomalously in a nematic phase as a
smectic A phase is approached, especially when the tran-
F = 12 {K 1 (∇ · n)2 + K 2 (n · ∇ × n)2
sition is nearly second order, that is, when the transition
+ K 3 (n × ∇ × n)2 }. (2) has a small latent heat.
angle. Most of the remarks below refer to nematic liquid of geometries. For instance, a simple twist can be cre-
crystals. ated close to a surface at which the director is anchored in
one direction contained in the plane, for example, along
the z axis of an x–z plane. If the magnetic field is ap-
1. Surface Interactions plied along the x axis, the director aligns along a right-
Surface forces are often strong enough to impose a well- or left-handed twist, eventually lining up with the mag-
defined direction to the director at boundaries of the liquid netic field far from the surface. In a typical case, this
crystal with other materials. By treating a cleaned glass distance can be about 3 µm with a field of 1 T. It is in-
surface with certain detergents, it is possible to align the teresting that a weak external perturbation can be used
nematic director perpendicular to the surface. It is thus to create a distortion on a scale approaching an optical
easy to prepare a sample of nematic liquid crystal be- wavelength. A magnetic field applied at right angles to
tween two parallel plates of glass that has a “homeotropic the helical axis of a cholesteric can distort the structure
texture,” that is, a single-domain crystal with its optical and eventually “untwist” the cholesteric into a nematic
axis perpendicular to the walls. Other surface treatments, structure.
such as rubbing the glass with a tissue or evaporating films
at oblique angles of incidence, can give rise to other an-
chorings of the director at the surface. in some cases, 3. Electric Fields
as with the free surface of MBBA (the first example of The director of an insulating nematic liquid crystal tends to
Table I) in the nematic phase, there is a continuous set align either parallel or perpendicular to an electric field,
of directions that the director can take on, such as the depending on the structure of the molecule. For a typ-
cone of directions that make a constant angle with the sur- ical nematic that aligns parallel to the electric field, an
face normal. In such a case, transitions in the anchoring electric field of about 1 V/cm is equivalent in effect to
have been observed on changing the temperature of a sam- a magnetic field of 1 G (roughly the strength of the
ple with a thickness on the order of 50 µm. earth’s magnetic field). In some LCDs an electric field
Surface alignment of the nematic director is used to is used to switch from the twisted nematic configuration
form the twisted-nematic liquid crystal used in many set up by the surface alignment to a nearly homeotropic
LCDs. A nematic liquid crystal having a macroscopic twist alignment when the field is turned on. A potential dif-
is created by placing an ordinary nematic between two sur- ference of only a few volts is sufficient to cause this re-
faces, each of which has been treated to align the director alignment. The devices use very little energy as only a
parallel to a particular direction in the surface (see Fig. 17). rotation of molecules is involved. Two display devices
The two surfaces are parallel to one another, but the align- that make use of the alignments due to surfaces and
ment directions are at an angle of 90◦ , causing the director electric fields are discussed in Section IV (see Figs. 17
to twist slowly through a right angle. and 18).
E. Optical Properties
The optical behaviors of liquid crystals give rise to useful
and often spectacular effects such as the optical switch-
ing characteristics used for liquid crystal displays and the FIGURE 12 Focal conic fan-shaped texture in a smectic A liquid
vivid temperature-dependent colors of the cholesterics. crystal. The fans in different regions may appear in different colors.
The appearance of a bulk sample of liquid crystal in or- [Courtesy of Dr. Mary E. Neubert.]
dinary light ranges from transparent through translucent
and from turbid to brightly and iridescently colored. By
using polarized light, a very wide range of appearances liquid crystalline molecule together with the tendency for
or textures can be created by means of surface, flow, and the long axes of the molecules to align with each other
field alignment of the various phases (see Figs. 10–16). along the director. This makes the electric polarizabil-
Transitions from one phase to another can often have a ity and, in turn, the index of refraction of the medium
remarkable appearance. anisotropic. Phases such as the ordinary nematic and the
The microscopic basis for the many optical effects is untilted smectic A are optically uniaxial with the optic
the elongated shape and electronic structure of the typical axis along the director. In fact, the cylindrical symmetry
of the phases implies that any macroscopic physical prop-
erties, optical ones included, have identical values when
measured in any orientation perpendicular to the director.
For light propagating along the optic axis, all directions of
polarization are equivalent, so there is no birefringence.
For light traveling along other paths, birefringence is ob-
served. Most commonly, the uniaxial phases have positive
birefringence; that is, the refractive index is at its maxi-
mum for light polarized along the director (also the optic
axis).
The colors that are produced with liquid crystals can
arise in several ways.
FIGURE 11 Sample undergoing a transition from a nematic (top)
1. When the pitch of a cholesteric is in the range of
to smectic A (bottom). The temperature is lower at the bottom of
the picture. One of the “threads” from which the nematic takes its wavelengths of visible light, Bragg reflections occur for
name is indicated by an arrow in top half of the picture. [Courtesy visible light, similar to the Bragg scattering of X-rays from
of Dr. Mary E. Neubert.] solid crystals.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
FIGURE 13 Fan-shaped texture in a smectic E. The striations of- FIGURE 15 A cholesteric with large, visible pitch, created by
ten bound areas having different colors. [Courtesy of J. W. Doane, adding an optically active material to a normally nematic material.
Liquid Crystal Institute.] In the dark areas the molecules are perpendicular to the plane of
the picture. [Courtesy of J. W. Doane, Liquid Crystal Institute.]
2. Dichroic dye molecules such as methyl red, when tures that are observed are due to the interference charac-
dissolved in liquid crystals, tend to orient along the di- teristic of birefringent materials.
rector. These molecules will absorb light polarized in one
direction. By aligning the director of such a solution in A cholesteric is uniaxial on a local scale of tens or hun-
different directions with respect to the polarization of a dreds of molecules in length with the optic axis rotating
light beam, different colors may be produced. to describe a helix identical to that described by the direc-
3. For optical studies of the nature of phases and the tor. Light incident along the pitch axis can be thought of
temperatures of transitions, liquid crystals are placed be- as a sum of two waves, one with electric field rotating in
tween crossed polarizers. The beautiful and revealing tex- the opposite sense to the helix and one with the electric
field rotating in the same sense. The first wave behaves as
it would in a normal medium, having an effective index
of refraction equal to the average of the refractive indices
for light polarized along and perpendicular to the optic
axis. The second wave shows anomalous behavior, with
nearly perfect reflections in a narrow band of wavelengths
close to the pitch of the cholesteric. This band is typi-
cally only 25 nm (25 × 10−9 m) wide, producing a very
pure colored appearance similar to the colors sometimes
seen on beetle wings. In the blue phase of cholesterics
an unusual platelet structure can be observed when dif-
ferent crystal domains have different faces alighned with
the sample surface. Distinct colors are reflected from each
FIGURE 14 Mosaic textures in a smectic H. The different areas domain, corresponding to the wavelengths satisfying the
are usually of different colors. [Courtesy of Dr. Mary E. Neubert.] conditions for Bragg scattering.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
light. Two laser beams of the same frequency can be di- magnetic susceptibilities, neutron scattering, and fluores-
rected at a liquid crystalline sample to produce a “phase cence recovery after photobleaching (FRAP, used to study
grating,” which causes a spatial variation in the refrac- diffusion). Besides the study of optical textures, X-ray
tive indices and, in turn, diffracts some of the original scattering, NMR, and thermal measurements are of par-
light. This phenomenon may be usable for holographic ticular importance in determining the structure and na-
imaging. ture of phases and phase transitions and are discussed
briefly.
F. Homologous Series
1. X-Ray Scattering
Since it is ultimately intermolecular forces that give rise
to the formation of mesophases and determine which of As in the study of solids, X-ray scattering is used to deter-
them form and at which temperatures, it is important to mine the symmetries of the phases as well as to measure
study the relationships between physical properties and the separation of the planes of smectics, intermolecular
chemical structure. Although the shape of a molecule has distances, packing of molecules, and degree of long- and
a large bearing on the phases it forms and the tempera- short-range molecular order. For many thermotropic liq-
tures at which it transforms, other considerations, such as uid crystals it is possible to study oriented samples, but
the rigidity of the bonds and the imbalances in attractions for the lyotropics the studies are more often restricted to
between different parts of neighboring molecules, play powder methods. The diffraction patterns seen are neither
strong roles. These roles can be investigated by studying the sharp Bragg peaks of monodomain solid crystals nor
series of compounds differing in, for instance, the nature the diffuse reflections characteristic of isotropic liquids.
of the terminal groups or the polarizability of different sub- In oriented samples one finds combinations of sharp and
stituents. One of the chemical variables that lends itself to diffuse rings, arcs of rings, and spots.
such investigation is the length of the alkyl chains that The recent availability of synchrotron sources of high
commonly terminate liquid-crystalline molecules. The intensity and low line width (typically under 10−3 Å−1
compound MBBA is one of a series of molecules differing as opposed to 0.04 Å−1 for a rotating-anode generator)
only in the number of carbon atoms appearing in the end has enabled studies of free-standing films of liquid crys-
chains. Another molecule in the series is the 10th exam- tals with thicknesses down to several molecules. This
ple of Table I, 4-butyloxybenzal-4-ethylaniline. Typically, has yielded information relevant to the study of two-
the earlier members of such a series of molecules, called dimensional phases and phase transitions and, in fact, ev-
a homologous series, will have nematic phases, the later idence that the thin films have the expected properties of
members will have smectic phases, and intermediate mem- a two-dimensional crystal.
bers will display both smectic and nematic phases. Plots
of quantities such as transition temperatures versus carbon
2. Nuclear Magnetic Resonance
number will often show an even–odd effect. For instance,
a plot of the nematic-to-isotropic transition temperature The phenomenon of NMR offers a number of tools use-
versus carbon number will show an alternation between ful in elucidating the nature of phases and the motions
two curves, one for odd numbers of carbons and the other of the molecules in those phases. The spectra obtained
for even numbers. The separation of the two curves tends from the hydrogen nuclei in most liquid crystal phases are
to diminish at higher carbon numbers. This suggests that not made up of sets of well-defined absorption lines, as
the attraction between ends of molecules plays a role in the in isotropic liquids, but are typically as broad as 40 kHz,
stability of nematic versus isotropic phases, since besides with widths proportional to the order parameter, s. This
affecting the length of the molecule, the addition of one is due to dipole–dipole interactions between neighboring
carbon to an alkyl chain changes the orientation of the last hydrogen nuclei. In isotropic liquids, such interactions are
carbon–carbon bond with respect to the molecule’s long rapidly modulated by the tumbling of the molecules, rapid
axis. enough on the NMR time scale (10−7 sec, say, for pro-
ton NMR at 100 MHz) to average to zero. In most liquid
crystals the tumbling is not isotropic, and the interactions
G. Other Studies
are not averaged out, although they are somewhat reduced
Many of the tools available to the physicist and chemist compared to the crystalline solid. In the smectic D and vis-
have been used to study the properties of liquid crys- cous isotropic phases, chemical shift spectra are obtained
tals and determine the nature of the phases. Many have as in isotopic liquids, apparently because the molecules
been mentioned above. Others include measurements diffuse rapidly between areas in which they have different
of refractive indices, dielectric constants and relaxation, orientations.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
The NMR spectra and relaxation times observed in liq- clei of molecules according to their positions. Such exper-
uid crystalline phases depend on the alignment of the sam- iments have allowed direct measurements of self-diffusion
ple in the magnetic field. Especially in a smectic A phase, coefficients in a few nematics and smectics as well as the
it is possible to create a single-domain liquid crystal for diffusion coefficients of probe molecules such as benzene
which the director can be rotated with respect to the mag- or tetramethylsilane (TMS) dissolved in liquid crystals.
netic field; this drastically changes the spectrum and even Since the method is sensitive to the direction of molecular
reduces it to a single line when the director makes a “magic displacements, anisotropies can and have been measured.
angle” of 54.74◦ with the magnetic field. In a tilted smec- It is also possible to vary the diffusion times, allowing
tic, such as smectic C, the limited freedom of the director investigations into nonlinear diffusion, for instance, that
to reorient can be seen in the variation of the NMR signal occurring between restrictions such as cell walls.
(the free induction decay in a pulsed NMR experiment)
when the sample is rotated in the magnetic field. The di-
rectors can align in any of a cone of directions centered 3. Thermal Measurements
on the layer normal. Thus, when rotating a sample orig- Most of the measured values of properties of liquid crys-
inally having a uniform alignment of directors but not tals depend on temperature. Studies of these temperature
necessarily layers (this is accomplished by cooling from a dependencies are of the utmost importance in determin-
nematic phase while in a strong magnetic field), the direc- ing the correctness of models used to predict them. De-
tors “follow” the field, although not to the extent seen in terminations of phase diagrams and measurements of heat
the nematic phase. Unless the sample consists of a single capacities and latent heats are also necessary to discover
domain (all the layers are parallel), the final spectrum will the nature of phases and the differences between them.
be made up of a superposition of spectra corresponding to Because of the large number of mesophases and possible
the parts of the sample having various angles between the parameters that can be manipulated to effect transitions,
director and the magnetic field. studies of liquid crystals allow investigations into many
For a nematic sample that has been suddenly rotated, the predictions of statistical physics, even extending to studies
realignment of the director can sometimes be observed in of the phases of two-dimensional systems. For example,
the changing NMR signal, especially near room temper- by mixing a compound that has nematic, smectic A, and
ature. An effective rotational viscosity can be measured smectic C (or only nematic and A) phases with various
with this effect. proportions of another that has only nematic and smec-
The NMR signals from other nuclei are also studied. tic C, one obtains a phase diagram (with temperature and
The signals from deuterium nuclei that have replaced hy- concentration as variables) having a “multicritical point,”
drogen are especially useful, as the difference in chemi- where the nematic–A, nematic–C, and A–C phase bound-
cal behavior is usually slight and the spectra from aligned aries meet. As the point is approached by varying con-
samples show many individual, in some cases nonoverlap- centrations, the nematic–C transition entropy decreases to
ping, lines that can be assigned to the nuclei at particular zero, as does the “bump” in the specific heat curve near the
locations in the molecules. This allows detailed investiga- nematic A transition, while the corresponding bump near
tion into the alignment of the molecules in various phases, the A–C transition increases. Although the phase diagrams
including the more solidlike smectic phases, in which even for different mixtures differ, high-resolution investigations
the rotations of the molecules about their long axes may close to the multicritical point have shown universal fea-
be restricted. tures in the shapes of the diagrams. It has been difficult
Measurements of spin–lattice relaxation times (T1 ’s) to make theoretical predictions for these shapes because,
have been a rich source of information pertaining to the among other reasons, the smectic A and C phases lack
motions of the molecules in liquid crystal phases. For ne- long-range translational order. As confirmed with X-ray
matics at temperatures above roughly 50◦ C, for instance, scattering, fluctuations in layer positions diverge logarith-
the relaxation of hydrogen nuclei is dominated by mo- mically with the sample size in those phases.
tions due to cooperative fluctuations in the local director,
giving a characteristic frequency and angular dependence
to the measured times. Similar effects can be seen in the
IV. APPLICATIONS
rotating-frame relaxation rates in smectic liquid crystals.
With deuterated samples (samples in which deuterium
A. Liquid Crystal Displays
atoms have replaced some hydrogen atoms), it is possible
to study the motions of different parts of the molecules. Liquid crystal displays are used in a wide number of ap-
It is possible to study translational motions of molecules plications, from clocks to oscilloscopes. Their popularity
by using pulsed magnetic field gradients to “label” the nu- is due to the conveniently thin, flat shape and the very low
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008E-966 July 3, 2001 14:53
corresponding appearance of new hydrodynamic modes. SEE ALSO THE FOLLOWING ARTICLES
Studies of the importance of fluctuations for different
symmetries, ranges of interactions, and the spatial di- FERROMAGNETISM • LIQUID CRYSTAL DEVICES • LIQ-
mensions of ordering are useful in understanding critical UIDS, STRUCTURE AND DYNAMICS • MACROMOLECULES,
phenomena. STRUCTURE • NUCLEAR MAGNETIC RESONANCE • SUR-
Near a phase transition, many properties of materials FACTANTS, INDUSTRIAL APPLICATIONS • ULTRASONICS
should change in ways characteristic of the symmetries of AND ACOUSTICS
the phases involved as opposed to the specific materials
being used. This universality can be tested in many ways
with liquid crystals. An example is the shape of phase di- BIBLIOGRAPHY
agrams for mixtures of a compound that has nematic and
smectic A phases in its pure form with another compound Chandrasekhar, S. (1993). “Liquid Crystals,” 2nd ed., Cambridge Univ.
Press, Cambridge, UK.
that has nematic and smectic A and smectic C phases. Chigrinov, V. G., and Blinov, L. M. (1996). “Electrooptic Effects in
On a plot of temperature versus composition, the curves Liquid Crystal Materials,” Springer-Verlag, Berlin.
forming the boundaries between nematic, smectic A, and Collings, P., and Hird, M. (1997). “Introduction to Liquid Crystals:
smectic C regions meet at a multicritical point. A second Chemistry and Physics,” Taylor & Francis.
example of this behavior has been found in systems that Demus, D., ed. (1998).“Handbook of Liquid Crystals,” Wiley, New York.
Dong, R. Y. (1997). “Nuclear Magnetic Resonance of Liquid Crystals,”
have a reentrant nematic phase; that is, they display the 2nd ed., Springer-Verlag, Berlin.
sequence of phases nematic–smectic A–nematic on heat- Kumar, S. (2000). “Liquid Crystals: Experimental Study of Phys-
ing at constant pressure. Although a phenomenological ical Properties and Phase Transitions,” Cambridge Univ. Press,
model for this behavior is successful in predicting certain Cambridge, UK.
properties of the phase transitions, the microscopic model Lebedev, V. V., and Kats, E. I. (1994). “Fluctuational Effects in the
Dynamics of Liquid Crystals,” Springer-Verlag, Berlin.
to account for such behavior is less certain. Mark, H. F., ed. (1987). “Encyclopedia of Polymer Science and Engi-
Free-standing films of liquid crystals have been used to neering: Liquid Crystalline Polymers to Mining Applications,” 2nd
investigate theories of two-dimensional phase transitions ed., Vol. 9, Wiley, New York.
and have applications to studies of membrane biology and Vij, J. K., ed. (2000). “Advances in Chemical Physics: Advances in
chemical catalysis. Even narrow strands formed by colum- Liquid Crystals,” Vol. 113, Wiley, New York.
Vill, V. (1995). “LiqCryst: Liquid Crystal Database,” Springer-Verlag,
nar liquid crystals have been studied. These experiments Berlin.
may be of relevance to possible one-dimensional nematic Virga, E. G. (1995). “Variational Theories for Liquid Crystals,” CRC
phases. Press, Boca Raton, FL.
P1: GLQ Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence
J. N. Demas
S. E. Demas
University of Virginia
I.Introduction
II.Origins of Luminescences
III.Excited State Types
IV. Methods of Studying and Characterizing
Excited States
V. Processes Affecting Luminescence
VI. Types of Luminescence
799
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
800 Luminescence
Luminescence 801
802 Luminescence
Luminescence 803
from promotion of an electron from an occupied to an is the subscript. For example the 253.65-nm emission line
unoccupied orbital. arises from a transition from 3 P1 to the 1 S0 ground state
For example, the ground-state configuration of atomic (S = 1 → S = 0; L = 1 → L = 0; J = 1 → J = 0).
mercury is [Xe](4 f )14 (5d)10 (6s)2 , where [Xe] stands for Molecular excited states are derived in much the same
the closed-shell xenon core. If everything but the 6s way, except that the orbitals of the system are described
electrons are denoted as core, the lowest excited states by molecular orbital theory. The single-electron molecular
of atomic mercury are given by (core)(6s)1 (7s)1 and orbitals are made up of combinations of atomic orbitals
(core)(6s)1 (6 p)1 . derived from the different atoms in the molecule. Thus, the
The state diagram for atomic mercury and some of the molecular orbitals extend over the entire molecule and are
radiative transitions responsible for emissions of Fig. 2 not localized on a single atom. This delocalization makes
are shown in Fig. 4. The other transitions can be derived for very rich bonding and spectroscopy. As with the atomic
from energy differences between states. The state des- case the electrons are added to fill up the lowest energy
ignations based on the quantum numbers S, L, and J are orbitals in order to derive the ground-state configuration.
shown above each set of states. The superscript denotes the Excited states usually arise from orbital promotions of
spin multiplicity of the state, M, and is related to the spin electrons from occupied to unoccupied orbitals.
angular momentum quantum number S by M = 2S + 1. Excited states of molecular systems are derived from
The orbital symmetry of the state is determined by the a variety of electron configurations. In organic systems
orbital angular momentum quantum number L and is the configurations responsible for the low-energy states
given by the upper case letter. The J quantum number, generally involve π -π ∗ and n-π ∗ states. The π -π ∗ states
which arises from coupling of spin and orbital angular are derived from the promotion of an electron from a
momentum and represents the total angular momentum, π-bonding to a π -antibonding orbital (e.g., anthracene).
The n-π ∗ excited states are derived from the promotion of
an electron in a nonbonding orbital to a π ∗ antibonding
orbital; an example is ketones, where an electron in one of
the nonbonding oxygen orbitals is promoted to the anti-
bonding π orbital between the carbon and oxygen atoms.
Metal complexes introduce more new states. The co-
ordinating ligands can contribute low-lying π -π ∗ or
n-π ∗ states. Splittings of the degenerate d orbitals by a
nonspherical ligand environment can give rise to metal-
localized d-d transitions in metal complexes with d
electrons. In addition, there are charge-transfer transitions
derived from the promotion of an electron from a metal-
localized orbital to a ligand-localized orbital or from lig-
and to metal orbitals.
A comparison of Figs. 1 and 2 shows a remarkable dif-
ference between the molecular and the atomic emission
spectra. The atomic spectrum is incredibly sharp, while
the molecular spectrum is very broad and exhibits regu-
lar progressions. The atomic states are simple because of
the absence of any other vibrational or rotational states.
In contrast, large molecules have a large number of vi-
brational and hindered rotational states superimposed on
the simple energy level diagrams of Figs. 3 and 4. Fur-
thermore, the molecule can exist in a large number of
conformations in the solvent matrix, each with a charac-
FIGURE 4 Energy level and state diagram for atomic mercury. teristic absorption and emission. These factors result in a
The term symbols for the states are indicated across the top. broadening of the molecular transitions.
Some of the radiative transitions are indicated by solid lines. The A more complete energy level diagram is given in
orbital configuration is indicated on each state. For example, 6s 6 p
Fig. 1, where a dominant molecular vibration has its en-
denotes a (6s)1 (7 p)1 outer-shell configuration, and 7d denotes a
(6s)1 (7d )1 . The core is omitted for clarity. [Reprinted with permis- ergy levels superimposed on each electronic state. This
sion from Leverenz, H. W. (1950). “An Introduction to Lumines- figure shows why the absorption and emission tend to
cence of Solids,” John Wiley & Sons, New York.] overlap with, and be mirror images of, one another. A
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
804 Luminescence
well-defined vibrational progression is characteristic of tally unexpected species suddenly appear. A classic exam-
systems in which there is little distortion on going from ple of this is encountered in electrically excited mixtures
the ground to the excited state. Where distortions occur, of Ar and F2 . There are no known stable Ar compounds.
the vibrational structure is smeared out and the emission However, one sees an intense ∗ ArF emission derived from
band is broadened and red-shifted. ∗
Ar + F2 → ∗ ArF + F (1)
An interesting type of hybrid atomic molecular sys-
tem is exemplified by rare-earth ions in crystal lattices where asterisks denote excited species. The reason for the
or in molecular complexes. The electronic configuration existence of ∗ ArF but not of ArF is that ∗ Ar is not the same
of rare-earth ions is (core)( f )n (n = 0–14). The lowest ex- chemical species as Ar; it has a completely different elec-
cited states are derived, not by orbital promotions, but tronic configuration. Ar has a closed-shell [Ne](3s)2 (3 p)6
by rearrangement of the electrons within the f shell. Fur- electronic configuration with no free bonding electrons
thermore, these f electrons are so well shielded within the and so does not form ArF. The lowest excited state of ∗ Ar,
atom that the excited state transitions are very insensitive however, is [Ne](3s)2 (3 p)5 (4s)1 , which has unpaired s and
to the environment around the atom. Thus, the transitions p electrons. Chemically this configuration is very similar
of rare-earth elements look more like atomic transi- to that of potassium metal; a free s electron is bound to a
tions than molecular ones. Atomic-state classifications are singly charged core. Not surprisingly the bonding in ∗ ArF
used because of the small perturbations on the atomic is ionic and very much like that of KF.
transitions. Rare gas chemistry can be even more complex. At high
Figure 5 shows emission spectra for a neodymium(III)- pressures the rare gas halide can react to give triatomic
doped glass at room and liquid-nitrogen temperatures. The species
quasi-atomic line spectra are very clear, especially at 77 K; ∗
RgX + Rg → ∗ Rg2 X (2)
compare these spectra with Figs. 1 and 2. Emission nar-
rowing on cooling is common and one of the reasons why where Rg stands for a rare gas and X for a halogen.
emissions are frequently studied at low temperatures. In Figure 6 shows the emission spectra of several triatomic
this case the 77 K emissions are only 0.16 nm wide. rare gas compounds.
Excimers and exciplexes are chemically stable excited-
state species that can exist only in the excited state and
E. Excimers and Exciplexes
do not have a corresponding ground-state form. Excimers
Even if one fully understands ground-state chemistry, one are excited state dimers formed by the association of two
may find surprises in the excited-state manifold, where to- identical subunits. Exciplexes are excited state complexes
FIGURE 5 Emission spectra of Nd3+ in Y3 Al5 O12 at room and liquid-nitrogen temperatures. [Reprinted with permis-
sion from Van Uitert, L. G. (1966). In “Luminescence of Inorganic Solids” (P. Goldberg, ed.), p. 516, Academic Press,
New York.]
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 805
806 Luminescence
Luminescence 807
detector. These directly obtained data are not corrected In principle, multifrequency phase shift and pulse mea-
for the transmission characteristics of the optics or for surements provide similar information. Commercial pack-
the variations in the detector’s sensitivity with wave- ages are available for both types of measurements. Both
length. Uncorrected spectra may bear little resemblance types of measurements appear regularly in the literature,
to true luminescence spectra and must be corrected, usu- and the choice depends on the nature of the problem
ally by calibrating the response of the system with a (i.e., wavelength range, lifetime, required dynamic range,
source of known spectral distribution such as a standard funds) and the researcher’s personal preference. Many
lamp. workers like to see actual decay curves (the sample im-
Excitation spectra are obtained by measuring the rel- pulse response) rather than a set phase shift versus mod-
ative emission intensity at a fixed wavelength while ulation frequency. However, with the advent of inexpen-
scanning the excitation source. For weakly absorbing sive, bright, easily modulated LEDs for frequencies into
solutions, the amount of light absorbed will be directly the megahertz range and sophisticated high frequency sig-
proportional to the sample absorbance. If the emission ef- nal processing, the phase-shift measurement will certainly
ficiency is independent of excitation wavelength, then the dominate analytical instruments based on lifetime mea-
excitation spectrum will match the absorption spectrum. surements for the forseeable future.
As with emission spectra the directly obtained spectra are As it turns out, many practical and fundamentally in-
distorted by the variations in light output of the source teresting systems are not characterized by a single de-
versus wavelength. Data are corrected by measuring the cay time, but rather by sums of multiple exponentials or
excitation source intensity as a function of wavelength. even more complex decays. This problem is mirrored in
Excitation-emission matrices (EEM) are two-dimen- phase-shift measurements where the lifetimes determined
sional plots of excitation and emission spectra. These are at different frequencies differ because the decay is not
invaluable for characterizing complex mixtures. The ma- a single exponential. Such complexity is the rule rather
trix can provide a unique fingerprint for complex mixtures, than the exception in biological systems, solid-state com-
and, as such, it is useful in identifying and tracing complex posite sensors, or dynamic multicomponent molecules in
mixtures such as oil spills. solution. Fortunately, the mathematical tools and the ex-
Excited state lifetime measurements are extremely use- traordinary power of inexpensive desktop computers are
ful diagnostic tools of excited-state processes. The stan- allowing a successful assault on these important systems.
dard method is to excite the sample with a pulse that is In terms of the rate constants and paths indicated in
shorter in duration than the decay phenomena and then Fig. 3, the fluorescence and phosphorescence lifetimes
watch the relaxation by monitoring the luminescence. It are given by
is also possible to monitor the decay by following the
excited state absorption spectrum or the electron spin res- τf = 1/(kf + kqS + kisc ) (5a)
onance spectrum. Using mathematical tricks, one can also τp = 1/(kp + kqT ) (5b)
measure lifetimes appreciably shorter than the excitation.
For extremely short decays, picosecond pulse probe tech- where the subscripts f and p denote the fluorescence
niques are used. Here, a sample is probed using an optical and phosphorescence processes, respectively, q denotes
delay line where time between excitation and monitor- a quenching path, and S and T denote processes from the
ing is set by adjusting the distance the probe pulse travels singlet and triplet states, respectively; kisc is the rate con-
before striking the sample. Lifetimes in the low nanosec- stant for intersystem crossing between S1 and the triplet
ond range are readily measured using emission relaxation manifold.
methods, while picosecond methods measure subpicosec- Luminescence quantum efficiencies (photons emitted
ond decays. per photon absorbed) are given by
An alternative approach to lifetime determinations is f(S1 ) = kf τf (6a)
the phase shift measurement where one excites the sam-
ple with a sinusoidal excitation. The emission is sinusoidal p(S1 ) = isc kp τp (6b)
and phase-shifted from the excitation. The phase shift is p(T1 ) = kp τp (6c)
related to the lifetime and the modulation frequency. A
variation is to use a very short duration, high repetition ic = kic /(kic + kqS ) (6d)
source such as a mode-locked laser or a synchrotron. Such isc = ic /(S1 )/p (T1 ) (6e)
a source can be decomposed in the fundamental at the rep-
etition frequency and the higher harmonics. The individual where ic is the efficiency of internal conversion from
Fourier components of the excitation can be used to simul- the upper excited singlet to S1 and isc is the efficiency of
taneously evaluate the decay times at multiple frequencies. intersystem crossing between S1 and T1 . The parenthetical
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
808 Luminescence
Luminescence 809
810 Luminescence
Luminescence 811
H2 O is a much more efficient deactivator of many molec- Finally, Eq. (7e) indicates any other type of chemical
ular systems than D2 O, because of the higher vibrational reaction. Examples include the addition of singlet oxygen
frequencies that more easily bridge the gap between the to an olefin or the hydrogen abstraction of a proton from
ground and excited states. For example, crystalline Eu3− a protic solvent by a triplet ketone.
and Tb3+ hydrates are more than 10 times less lumines- The kinetics of Eqs. (7) yield two Stern-Volmer equa-
cent than the corresponding deuterates. This behavior has tions relating emission intensity or lifetime to the concen-
been used for counting the number of water molecules tration of the quencher,
around metal ions in proteins or the average solvent ex-
posure of metal complex sensitizers in organized media (0 /) − 1 = K SV [Q] (8a)
such as micelles. (τ0 /τ ) − 1 = K SV [Q] (8b)
K SV = k2 τ2 (8c)
B. Bimolecular Processes
where the ’s are emission intensities and τ ’s are excited
Excited states that persist for any appreciable time are state lifetimes. The subscript 0 denotes the value in the ab-
also susceptible to quenching by other chemical species. sence of quencher. The K SV ’s and k2 ’s are Stern-Volmer
Excited-state deactivators are called quenchers. Quench- and bimolecular quenching constants, respectively. Thus,
ing can occur by a variety of processes plots of the experimentally determined left-hand side ver-
∗ sus [Q] provide K SV ’s and k2 ’s if the unquenched lifetime
D + Q − k2 → D + Q + (7a)
is known. The k2 ’s, in particular, provide a great deal of
∗
D+ Q (7b) fundamental information concerning the interaction of ex-
D+ + Q− (7c) cited states with quenchers.
Figure 13A shows a typical intensity Stern-Volmer plots
− +
D +Q (7d) for luminescence quenching by molecular oxygen of a
reaction products (7e) metal complex in a polymer support. In solution, the plot
would be a straight line, but in the polymer different
where k2 is the bimolecular rate constant for deactivation molecules are in different environments, which leads to
of the excited state. If more than one deactivation pathway the characteristic downward curvature of the plot. Even
is present, k2 is the sum of all processes affecting the though dissolved in a solid elastomer, the emission is
state. strongly quenched by oxygen (25-fold at 1 atm of pure
Equation (7a) denotes catalytic deactivation of the ex- oxygen and 8-fold at 1 atm of pure air). This sensitivity
cited state. For example, external heavy atoms can increase to oxygen has led to the development of these systems as
spin-orbit coupling in a luminescent molecule during a
collision and enhance intersystem crossing from a lumi-
nescent singlet to a nonemissive triplet or quench a triplet
state back to the ground state.
Equation (7b) indicates energy transfer from a donor to
an acceptor molecule. Efficient energy transfer quenching
must generally be an exothermic reaction. The photody-
namic effect is an example of bimolecular energy trans-
fer. A dye in the presence of both light and oxygen will
kill organisms. Chemically very reactive singlet oxygen
is formed by energy transfer deactivation of dye triplet
states. The chemically reactive singlet oxygen then kills
the organism.
Electron transfer quenching denoted by Eqs. (7c) and
(7d) can be either oxidative or reductive depending on
whether the excited species is oxidized or reduced. Elec- FIGURE 13 (A) Intensity Stern-Volmer quenching plot for
tron transfer forms the basis of a number of solar energy [Ru(4,7-Ph2 phen)3 ]2+ (4,7-Ph2 phen = 4,7-diphenyl-1,10-phen-
conversion schemes and can occur in the excited state anthroline) in silicone rubber [Carraway, E. R., Demas, J. N.,
even though the ground-state species are thermodynam- DeGraff, B. A., and Bacon, J. R. (1991). Anal. Chem. 63, 337.]
The solid line is the best fit for a two-site model. (B) Lumines-
ically stable. This again points out the great differences
cence intensity of Ru(4,7-Ph2 phen)2+3 in silicone rubber while be-
in chemistry that can arise between ground- and excited- ing breathed over [Bacon, J. R., and Demas, J. N. (1987). Anal.
state reactions, and stresses the chemical uniqueness of Chem. 59, 2780. Copyright 1991 and 1987 American Chemical
the excited state relative to the ground state. Society.]
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
812 Luminescence
oxygen sensors in biomedical and industrial applications the energy and then transfer it by a resonance mechanism
where they are rapidly replacing the traditional Clark elec- to the active chlorophyll.
trode sensors. The term quenchometric sensors is used to In crystals composed of a single component, long-range
describe systems that detect an analyte by quenching of energy transfer can occur by a contact mechanism. The
an excited state. close proximity of the molecules permits energy to hop
Figure 13B demonstrates the high sensitivity of the sys- from one molecule to its neighbor. By a series of hops
tem to changes in oxygen concentration. The photolu- the excitation can sample a very large volume. If there are
minescence is being monitored while the film is being any quenchers in this volume, the energy can be trans-
breathed on. At one point the breath is held and then ferred to the quencher, and no luminescence of the major
breathing is resumed. The burst in emission intensity is component is observed. The classic example of this is the
caused by the increased exchange time in the lungs and, phosphorescence of naphthalene, where it was necessary
thus, the lower oxygen concentration and higher emis- to reduce the concentration of β-methylnaphthalene to be-
sion intensity. The irregularities of the following few low 10−7 mole fraction to see the host emission.
breaths are a result of the higher blood carbon dioxide
concentration.
C. Stimulated Emission
Quenchometric oxygen sensors also function as pres-
sure sensors if the gas composition is always the same. Light absorption occurs if the photon energy matches the
This led to the development of PSPs for monitoring pres- energy gap between the ground excited states. The pho-
sure over entire aircraft or automobile models. These PSPs ton, in effect, induces a transition between the ground and
are largely supplanting pressure tapes, which are expen- excited states with the loss of a photon from the radiation
sive and time-consuming to put on models. field and the production of an excited state. However, the
If two partners in a collision are excited, new excited- inverse process is also possible. If the excited molecule is
state reactions are observed. For examples, if two excited exposed to photons the energies of which exactly match
triplets collide, one can have an energy-pooling, spin- the energy gap between the excited state and a lower en-
conserving reaction that yields an excited singlet state. ergy state, the photon induces a transition between the
Also, two excited singlets can annihilate one another to upper and lower states. The net result is the addition of a
form ground state species: photon to the radiation field rather than a loss. This pro-
cess is called stimulated emission. Not only is the photon
T + T → ∗S + S (9a) of the same energy as the stimulating photon, but it is also
∗
S + ∗S → S + S (9b) of the same phasing and is traveling in the same direction
as the original photon.
Triplet-triplet annihilation is a common form of triplet de- Stimulated emission is not generally observed because,
cay under intense excitation conditions where high triplet for it to occur efficiently, a significant fraction of the
concentrations exist. Because of the long lifetimes of the molecules must be in the excited state. Furthermore, the
triplet state, triplet-triplet annihilation can also yield a population of the upper state must have a higher popu-
P-type delayed fluorescence, which can persist for many lation than the terminating state (population inversion).
microseconds after the cessation of irradiation. Singlet- Because of the symmetry of the absorption and the stim-
singlet annihilation is less common because of the diffi- ulated emission process, absorption will always be more
culty of achieving sufficiently high concentrations of the efficient than stimulated emission if the terminal state con-
short-lived species. However, with the advent of modern centration exceeds that of the upper state.
high-flux lasers it is readily observed and can be a signif- If stimulated emission occurs in a resonant cavity with a
icant nonradiative pathway. high degree of optical feedback, very intense highly direc-
Energy transfer does not require that the molecules be tional monochromatic radiation results. Such systems are
in contact with one another. Resonance energy transfer called lasers (light amplification by stimulated emission
can occur at distances far exceeding the physical contact of radiation) and have countless practical and fundamental
distance if the emission spectrum of the donor overlaps applications including surveying, weaponry, excited life-
the absorption of the acceptor and the acceptor absorp- time determinations, and luminescence studies.
tion spectrum is intense or highly allowed. Dipole-dipole
resonance energy transfer, which is also known as Förster
D. Multiphoton Processes
transfer, can occur at distances approaching 100 nm in
favorable cases, and 30- to 50-nm transfers are common. At the low fluxes obtained with most conventional light
The energy collection in the photosynthetic unit generally sources, the only absorption processes generally noted in-
consists of hundreds of antenna chlorophylls that collect volve single photons. With the high fluxes available from
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 813
814 Luminescence
photoluminescence is in quantitative analysis. The excita- be studied similarly since there is a big difference in the
tion and emission spectra of materials varies greatly. For diffusion coefficients of the bound and unbound ligand.
example, the characteristic excitation and emission spec- Surface detection of single molecules frequently used
tra provides “fingerprints” of different crude oils and have near-field scanning optical microscopy (NSOM) to reduce
been used to trace the sources of oil spills or illegal dump- the detection volume or area. The resolution of normal
ing. Furthermore, many nonluminescent materials can be microscopy is limited to the diffraction limit, which is
made luminescent by suitable, and frequently highly se- about half the wavelength of the light. However, by mak-
lective, chemical reactions. Concentrations can then be ing the source of collection optically smaller, it is possi-
determined from photoluminescence intensities. ble to excite or collect from a far smaller region (i.e., a
A major advantage of luminescence over absorption few nanometers). The source/collector is a fiberoptic tip
methods of analysis is the frequently much greater sensi- drawn down to sub-wavelength dimensions. If the tip is
tivity of the emission methods. This sensitivity enhance- very close to the detection volume, the light does not have
ment relies in part on the nature of the measurement. distance enough to broaden by diffraction. Imaging is done
Absorption measurements of weakly absorbing samples by raster scanning the tip over the surface much as a scan-
depend on being able to measure small differences in the ning tunneling or atomic force microscope does. Not only
intensities of large transmitted signals, which is an intrin- can single molecules be readily detected, but motion pic-
sically difficult problem. On the other hand, in emission tures of surfaces can be measured to follow such effects
measurements one is looking for small signals on essen- as surface diffusion.
tially zero backgrounds. The selectivity of luminescence methods is best exem-
Emission measurements are so sensitive that it is pos- plified by fluoroimmunoassay (FIA), by which specific
sible to detect single atoms in the gas phase. Indeed, it antigens can be detected even in a messy medium such as a
is possible to hold, study, and cool to cryogenic temper- serum. Radioimmunoassay (RIA) has dominated the field
atures a single atom by using optical tweezers. Due to when the ultimate in sensitivity is required (pico- to femto-
the background, detection in solution has proved much moles). The problems of working with and disposing of ra-
more difficult.. The supporting solvent has strong scatter- dioactive tracers have given impetus to the use of the some-
ing (Raleigh and Raman) and it is extremely difficult to what less sensitive FIA. Furthermore, advances in flow
get impurity emissions down to low enough levels. The cytometry and laser-based detection schemes promise to
solution is to reduce the sampled volume. Scattering and make FIA fully competitive with RIA.
impurity emission scales with the volume, but the emission
of a single molecule is independent of the volume. So by
reducing the sampled volume to pico- or femtoliters, sin- B. Chemiluminescence
gle molecule detection becomes possible. Single molecule In chemiluminescence (CL) the energy necessary for
detection by luminescence is now routine in solution and excited-state generation is derived from the energy re-
on surfaces. leased in a chemical reaction. Excluding flames, proba-
In solution, two methods are used. The analyte may bly the first man-made CL was the air oxidation of phos-
be constrained to a hydrodynamically stabilized flowing phorus, discovered by Brand in 1669. This discovery is
stream of a few microns. A tightly focused laser beam in- the subject of a classic and beautifully detailed engraving
tersects the stream forming the analysis region of a few (Fig. 15).
picoliters or less. As the molecules flow through a fo- Three basic processes can initiate CL: (1) decomposi-
cused laser beam, they are excited and fluoresce multiple tion of a high-energy species to lower-energy ones; (2)
times. This burst of photons is collected with a high nu- exothermic reaction of two or more components; and
merical aperture microscope objective, focused through a (3) electron transfer reactions. These processes are rep-
pin-hole spatial filter to remove any unwanted background resented by
regions and then focused onto a single photon-counting
photomultiplier tube or avalanche photodiode. In the al- A →∗ B (10a)
ternative implementation, a confocal microscope is used or
to image an exceptionally small solution volume (sub-
∗
femtoliter). Then, as molecules diffuse in and out of the B+C
sensing volume, bursts of photons occur. One can use this
strategy for determining the diffusion coefficient of the A + B → ∗C + D (10b)
fluorophore since time spent in the vicinity of the sens-
or
ing element is determined by the diffusion. Kinetics of
∗
reactions such as ligand receptor binding on proteins can C
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 815
C C C O∗ + C O (12)
O O
The excited ketone can then luminesce or transfer en-
ergy to another luminescent species.
The long-lived afterglow in electrical discharges in ni-
trogen is an example of recombination CL [Eq. (10b)],
N + N + M → ∗ N2 + M (13)
O O
where R NH2 for Luminol. That the emissive species is
The CL arises on cleavage to ketones, with one being the phthalate dianion is shown by the agreement between
excited and the other being in the ground state: the CL and photoluminescence spectra of the product.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
816 Luminescence
Thus, although paths to the excited state are quite dif- F2 + hν (or electrical energy) → 2F (19a)
ferent, the final excited state is the same.
H2 + F → H + ∗ HF (19b)
Only a few milligrams of Luminol produce an impres-
∗
sive light show, but the CL efficiency (photons emitted per F2 + H → 2F + HF (19c)
number of molecules reacted) is, in fact, only about 1%.
This result emphasizes the impressive quantity of light where ∗ HF is vibrationally excited HF produced with pop-
that a mole of photons represents. ulation inversion. The laser transitions are near-IR emis-
The third class of CLs is initiated by electron transfer sions arising between vibrational levels of HF.
processes. This type is exemplified by the one-electron
transfer reduction of [Ru(bpy)3 ]3+ (bpy = 2,2 -bipyri- C. Bioluminescence
dine), which yields a highly visible orange CL with hy-
drazine, water, and a variety of other reductants. Indeed, Bioluminescence (BL) is a CL arising from living mat-
with NaBH4 as the reductant, the spectacular emission is ter. Fireflies and the glow of disturbed microorganisms
highly visible in a well-lighted room. in a ship’s wake are probably the best known. Al-
The complexity of simple CL systems is demonstrated though grouped separately, BLs are actually CLs in which
by the simple [Ru(bpy)3 ]3+ /oxalate system: the reactants are produced by, and organized in, living
organisms.
[Ru(bpy)3 ]3+ + C2 O2−
4 → [Ru(bpy)3 ]
2+
+ C2 O−
4 (16a) Not surprisingly, one of the best known BLs is firefly
[Ru(bpy)3 ]3+ + C2 O− ∗
4 → [Ru(bpy)3 ]
2+
+ 2CO2 reaction, which involves the enzymatic oxidation of a lu-
ciferin. This reaction can have an incredible efficiency,
(16b) approaching 100%. The luciferin molecule and a num-
The initial reduction step does not appear to be energetic ber of synthetic analogs have been studied to elucidate the
enough to excite the complex, and a secondary reaction mechanism. The mechanism appears to involve a peroxide
with the energetic C2 O−4 free radical is required for excita-
decomposition with free radical intervention.
tion. This type of behavior, in which the initial step is not O
sufficiently energetic to initiate CL, is fairly common, and
thus reactive radicals are common in the actual CL step. N N
OH O2
A special case of electron transfer CL is that in Enzymes
HO S S
which the reductant is the ultimate reductant, an electron.
[Ru(bpy)3 ]3+ exhibits an intense CL on reaction with hy- Photinus pyralis
drated electrons: (20)
O
[Ru(bpy)3 ]3+ + e− → [Ru(bpy)3 ]2+
•
(17) N N
+ CO2 + hv
The efficiency of excited state production approaches HO S S
100%.
Applications of CL are diverse. There are commercial Bioluminescence is much more pervasive than origi-
emergency lights that require only the breaking of a seal nally suspected. Low-level BLs are common to many fun-
to permit the mixing of solutions. The long-lived CL is damental and essential biological processes such as lipid
bright enough to read by. peroxidation, intracellular redox processes, and catalase
One of the simplest and most sensitive NO analyzers decomposition of poisonous H2 O2 .
utilizes the very efficient CL of the reaction Bioluminescence has had a crucial role in direct stud-
ies of cellular and biochemical processes. For example,
NO + O3 → ∗ NO2 + O2 (18) there is a CL associated with the formation of the ul-
With very simple instrumentation, part per billion levels timate carcinogen benzo[a]pyrene-7,8-dihydrodiol 9,10-
of NO can be accurately and precisely measured. epoxide from benzo[a]pyrene. Also, a very sensitive CL
Another useful analytical application uses metastable assay for the benzo[a]pyrene-7,8-diol has been developed.
∗ Bioluminescence also continues to play a pivotal role in
N2 to transfer energy to luminescent metals or molecules.
Concentrations as low as 106 atoms per cubic centimeter the development of the fundamental concepts of CL.
are readily measured.
Chemical lasers use CLs. In a chemical laser the excited
D. Electroluminescence
state population inversion is produced directly by chemi-
cal energy. One of the most efficient chemical lasers uses Electroluminescence (EL) is luminescence occurring on
the hydrogen-fluorine reaction, which is initiated by the electron flow in a solid-state device or electrochemical
dissociation of F2 , cell. The mechanisms of these two ELs are quite different.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 817
818 Luminescence
E. Cathodoluminescence
Luminescence 819
screen in operation will clearly show the three color ele- to molecule until the energy comes close enough to the
ments. The red phosphor is based on the intense red line scintillator molecule for efficient energy transfer. The scin-
emission of Eu(III); the ghastly flesh tones of early color tillator molecule must have an excited singlet state that is
TV were the result of the poor red characteristics of earlier below the energy levels of the solvent molecules. Thus,
phosphors. the scintillator traps the energy and emits.
Organic scintillators form the basis of the important
liquid scintillation counter. The penetrating power of β
F. Radioluminescence
particles is very limited. Therefore, counting weak β emit-
Radioluminescence arises from interactions between ion- ters such as tritium is very difficult with windowed de-
izing radiation and matter. This radiation may be particles, tectors because the window or the sample itself absorbs
such as α or β particles, and cosmic rays or photons, such most of the radiation. The penetration problem is solved
as X-rays or γ rays. Materials exhibiting radiolumines- by homogeneously dissolving the material to be counted
cence are called scintillators, named after the scintilla- in a “cocktail” consisting of the solvent and the scintil-
tions, or bursts of light, that arise when radiation strikes lator. Since the particles are emitted directly in the pres-
the material. ence of the scintillator, the penetrating power becomes
The mechanisms of excitation by ionizing radiation are irrelevant.
very similar to those listed under cathodoluminescence.
The X- and γ -ray excitations are initiated by energy loss
of the photons as they interact with matter. For higher en- G. Thermoluminescence
ergies, energy loss occurs by photoionization. Substrates Thermoluminescence (TL) is luminescence that arises on
of high atomic number absorb energy more efficiently than gentle warming of a material and usually occurs below
those of low atomic number because of the high concen- incandescence. Reading the sample by heating it destroys
tration of electrons. the activation, so the readout is a single-shot experiment.
Charged particles and secondary electrons arising from Most TL materials can, however, be reactivated and used
γ rays behave like the charged particles of gas discharges repeatedly.
striking a surface, and the processes of excited species The mechanism of TL is shown in Fig. 18. The acti-
generation are the same. Due to the penetrating power of vation process consists of trapping an electron in a trap
radiation, however, excitation can penetrate much more
deeply.
An important by-product of ionizing radiation in solids
is that electrons are frequently trapped in lattice sites
(Fig. 8). Depending on the depth of the trap sites, the
electron may either recombine rapidly or be trapped for
an extended period of time, even for millions of years.
Release of these electrons can then yield a delayed re-
combination luminescence. This delayed luminescence in
thermoluminescence is discussed in Section VI.G.
Radioluminescence forms the heart of many radiation
detectors. The traditional γ detector is thallium-activated
sodium iodide. The high atomic number of iodide in-
creases the efficiency of energy absorption, and the energy
is used to excite the Tl+ luminescent center.
β-Particle detectors are crystals or organic molecules
(e.g., anthracene) or solutions of highly fluorescent or-
ganic scintillators dissolved in solid aromatic polymers
(e.g., polystyrene) or aromatic solutions. The substrate FIGURE 18 Thermoluminescence glow curve (a) of quartz ex-
of an aromatic polymer or solvent is an essential com- tracted from pottery for use in TL dating. The sample has been
ponent. The fluorescent indicator is generally present in artificially irradiated with 550 rads of β radiation as part of the
relatively low concentrations, and much of the excitation calibration process. Curve (b) is the red-hot glow measured on
reheating the sample. The TL is composed of both the stored and
occurs away from the scintillator. The bulk of the initial
irradiated components. [Reprinted with permission from Aitken,
excited states formed are, thus, the easily excited bulk M. J., and Fleming, S. J. (1972). In “Topics in Radiation Dosime-
aromatic solvents. Due to the closeness of the aromatic try,” Supplement 1 (F. H. Attix, ed.), p. 1, Academic Press, New
species, the excitation can migrate or hop from molecule York.]
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
820 Luminescence
site. The trap site must be deep enough to prevent rapid problems of sensitivity and storage times. When the work
removal of the electron at the storage temperature. Activa- was resumed in the early 1960s, it was discovered that
tion can be accomplished by ionizing radiation or by light. all the earlier TLs of LiF arose from impurity centers that
Once the sample is activated, the readout is performed by were no longer present in commercially purer LiF. This led
heating the sample. When the temperature becomes suf- to a considerable effort to elucidate the impurity problems
ficiently high, the electrons are thermally excited into the and eventually led to usable systems. Also, other useful
conduction band. These conduction electrons can migrate systems were discovered by other groups.
through the lattice until they find a hole to relax into. If Another type of sensitive, and at one time widely used,
the hole is an emissive center, or is near one, the center is dosimeter utilized radiophotoluminescence. Irradiation of
excited and emits. Alternatively, the relaxation to the hole a crystal produces color centers. When photoexcited,
can be emissive. many of these color centers emit. Since the emission inten-
The term thermoluminescence is misleading since it sity is directly proportional to the concentration of color
does not mean thermal generation of the excited system. centers, a dosimeter is available. An advantage of photolu-
Thermal-stimulated luminescence much more accurately minescent dosimeters is that the readout is not destructive
describes the phenomenon. and can be repeated.
In TL measurements or development, readout is per- Another important use of TL is in archeological and
formed by heating the sample at a uniform rate and view- geological dating when radiocarbon dating is unsuitable
ing the sample emission. A filter may be added over the de- (Fig. 18). Heating destroys any TL and initializes the phos-
tector to restrict the viewing wavelength. Figure 18 shows phor. As all rocks have some degree of radioactivity from
238
a typical TL glow curve. The development supplies infor- U, 235 Th, and 40 K, the TL can be used to measure the
mation on the distribution and depth of traps. Deeper traps length of time that the sample was irradiated since cool-
require higher temperatures to boil the electrons out. For ing. For igneous rocks this is the time since formation.
example, a 375◦ C glow curve corresponds to a trap life- For pottery it is the time since the pottery was fired. Such
time of several million years. Indeed, elaborate models dating can give results accurate to better than 10%.
have been developed to extract quantitative information
about trap depth and distribution from glow curves.
H. Flame Emissions
In most examples of TL, one is examining a delayed
photoluminescence. Particularly in solids subjected to ra- With the exception of hydrogen-oxygen flames, virtually
diation chemistry, the nature of the traps and of the emis- all flames exhibit pronounced visual emissions. We shall
sive centers is modified by irradiation. Therefore, the TL discuss briefly the origin of some of these emissions.
emissions may look quite different from the photolumi- Luminescence can arise from the major components of
nescence of the unactivated solid. the reaction or from trace materials. The orange glow of
The principle use of TL is in radiation dosimetry. The candle flames and oxidant-starved gas flames arises, not
TL signal is proportional to dose over a very wide range. from luminescence, but from the incandescence of carbon
Modern TL dosimeters are sensitive in the microrad re- particles.
gion. Thermoluminescence dosimeters have a number of Luminescence from a state is independent of how the
advantages. They are inexpensive, robust, reusable, and state was populated. Many flames are hot enough and the
sensitive, and they intrinsically integrate the dose. Fur- excited state of elements and compounds low enough that
thermore, they require no power source or connections. a significant excited-state population can be achieved ther-
At one time they were widely used for in vivo monitor- mally. For example, the emitting state of atomic sodium is
ing during therapy since they could be easily implanted or at 589 nm. For flames of various temperatures the per-
swallowed. centages of molecules in the excited state are as fol-
Lithium fluoride is one of the most popular personal lows: 2000 K, 1 × 10−3 %; 3000 K, 6 × 10−2 ; and 4000 K,
dosimeters. It is sensitive down to 10−2 rads and linear to 0.4%. While these excited-state populations may seem
>103 rads. The average atomic number of LiF is similar very small, they are in fact very large in comparison with
to that of tissue, which gives the dosimeter a response the concentrations that could be achieved by all but the
indicative of tissue irradiation. The traps are deep enough most intense laser sources in photoluminescence experi-
to give long storage times without appreciable fading. ments. This efficiency is readily seen by the intense yellow
The early work on TL demonstrates the difficulty of sodium fluorescence when even tap water is introduced
studying emissions that intrinsically rest on impurity sites. into a relatively cool Bunsen burner flame.
The first attempts at TL dosimetry date back to the early An interesting aspect of flame spectroscopy is that a
1950s, when Farrington Daniels attempted to use LiF as flame can be too hot to yield good elemental emissions.
a dosimeter. The original work was abandoned owing to Too hot a flame can thermally ionize many of the atoms
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 821
and reduce the population of the emissive element. This than 50 ps and occur during the collapse phase of the bub-
propensity of luminescent species to ionize can be sup- bles. Each optical pulse can generate more then a million
pressed by increasing the concentration of free electrons photons and the phenomena is readily visible to the eye as
in this flame. For example, ionization of calcium can be a steady star-like point.
suppressed by spiking the flame with cesium or potas- Water is far and away the best solvent for SL and there
sium, which have lower ionization potentials and increase is a strong temperature dependence. Sonoluminescence is
the concentration of free electrons. Many of the products 100-fold brighter at 0◦ C compared to 40◦ C. Dissolved rare
of the chemical reactions contributing to the flame are gases appear to be critical to SL; it is the 1% Ar in air that
produced in excited states and emit (see Section VI.B). is responsible for SL under ambient air conditions.
While not, strictly speaking, flames, inductively cou- For a phenomenon known for over 60 years, our under-
pled plasma torches produce plumes that appear to be standing of its precise origins is still uncertain. While there
flames. Furthermore, these plasmas behave like extremely are several explanations for SL, the area is still amazingly
high temperature flames, which are capable of exciting all contentious with articles appearing regularly in support
but the most recalcitrant elements. of different mechanisms. The fact that the emission is a
One of the most useful applications of flames is for continuum with no unique molecular or atomic character
elemental quantitative analysis. High-temperature flames exacerbates definitive modeling.
reduce most complex matrices to their elemental compo- While there seems to have been no direct practical ap-
nents. Many elements, especially metals, then emit in the plications of SL to date, there are needs for short-lived,
flames, and the emission can be used for analytical quan- inexpensive, very compact light sources in such areas as
tification. Even many elements that do not emit directly luminescence lifetime measurements. Since SBSL sys-
can still be analyzed in flames by photoluminescence or tems can be built extremely inexpensively, SL may find
by atomic absorption. use in these areas.
Because of the chemical generation of many of the ex- Although not a direct application of SL, ultrasonic-
cited states in flames, population inversion over the ground induced chemistry is a growing area with potentially sig-
state can result. This population inversion can form the nificant applications. Ultrasound has been found to accel-
basis of a laser. The best known chemical laser is the erate and control a number of chemical reactions. Some of
HF or DF laser, made by burning hydrogen or deuterium this chemistry may be induced by the high energy photons
in fluorine. Visible or UV chemical lasers are still being of SL.
sought.
J. Fracto-Emission or Triboluminescence
I. Sonoluminescence
Fracto-emission (FE) is the emission of particles and pho-
Sonoluminescence (SL) was first observed in 1934 by H. tons before, during, and following the propagation of a
Frenzel and H. Schultes at the University of Cologne. crack in a stressed material. Particles include electrons,
As an indirect result of wartime research on sonar or ions, and neutral atoms and molecules. Light emission
acoustic radar, they observed SL in an ultrasonic water under mechanical stimulation is often called tribolumines-
bath. Very strong ultrasonic fields were found to yield cence (TL). Grinding and breaking of crystal and glasses
clouds of chaotic flashing bubbles, which are now termed can produce FE. However, even pulling tape off of a sur-
“multi-bubble sonoluminescence.” Such systems were not face can produce FE.
amenable to systematic study until D. Fellipe Gaitan Sir Francis Bacon in 1605 first reported FE by grinding
succeeded in trapping a single sonoluminescing bubble sucrose, although FE could not have been missed innumer-
at the acoustic resonance in the center of a flask. This able times earlier. Amusing, if not practical, is the parlor
single-bubble sonoluminescence (SBSL) opened the way demonstration of grinding a Wint-o-Green Life Saver
for an explosion of research on SL. between one’s teeth or with a pair of pliers in a darkened
The majority of SL is a broad continuum, with effective room to produce bright blue-green flashes. The Wint-o-
black body temperatures in excess of the sun (e.g., well Green Life Saver is pressed sucrose flavored with methyl
beyond 10,000 K). In many cases the continuum emission salicylate. Triboluminescence is, in many cases, due to
is still rising at the 200-nm cutoff of air. Indeed, some the production of strong electric fields along fracture lines
have suggested that temperatures suitable for fusion may with a concomitant electrical breakdown. The observed
be achieved this way, but this seems an extremely unlikely luminescence is then a mechanically introduced cathodo-
possibility. In addition, under some conditions, broad lines luminescence. Another easily observed example of TL
are present. Even though the typical ultrasonic transducer arises when you strike two pieces of quartz together. Bright
operates at about 25 KHz, the light pulses are much less orange flashes result that can be viewed in a dimly lighted
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
822 Luminescence
room. For either demonstration eye protection should be cite. Atomic sodium emission can arise from the breaking
worn because of the likelihood of flying particles. sodium silicate glasses.
Figure 19A shows the photoluminescence of methyl Fracto-emission is a valuable tool for studying the phys-
salicylate, the flavoring of emission of Wint-o-Green Life ical and chemical processes that occur before, during,
Saver which is pressed sucrose. Figure 19B is the TL and following fracture. It allows examination of failure
from grinding pure sucrose. The UV-rich band structure mechanisms including fatigue, microcrack initiation, and
can be clearly associated with the emission of nitrogen growth. Such tools are invaluable in materials research
gas. Figure 19C shows the TL of a Wint-o-Green Life for studying the failure mechanisms of such materials
Saver . The emission clearly is the nitrogen band struc- as glasses, metal ceramics, polymers, and composites.
ture augmented by a broad visible emission. This broad In addition, FE is of interest to geologists and geophysi-
spectrum is the same as the PL of methyl salicylate, the cists for detecting fractures in mines and along geological
flavoring. faults.
There can be an enormous amount of chemistry going Crystalloluminescence is emission during the growth
on in FE. Extensive bond breaking can occur. For exam- of crystals and may arise from cleavage. It is, thus, a form
ple, carbon dioxide is formed during the grinding of cal- of TL from internally generated breakage.
ACKNOWLEDGMENT
BIBLIOGRAPHY
Adam, W., and Cilento, G., eds. (1982). “Chemical and Biological Gen-
eration of Excited States,” Academic Press, New York.
Alkemade, C. Th. J., Hollander, Th., Snellman, W., Seeger, P. J., and ter
Harr, D., eds. (1982). Metal vapours in flames, Int. Ser. Nat. Philos.
103, Pergamon Press, New York.
Ambrose, W. P., Goodwin, P. M., Jett, J. H., Van Orden, A., Werner,
J. H., and Keller, R. (1999). Single molecule fluorescence spec-
troscopy at ambient temperature. Chem. Rev. 99, 2929–2956.
Cundall, R. B., and Dale, R. E. (1983). “Time-Resolved Fluorescence
Spectroscopy in Biochemistry and Biology,” Plenum Press, New York.
Demas, J. N. (1983). “Excited State Lifetime Measurements,” Academic
Press, New York.
De Silva, A. P., Gunaratne, H. Q., Gunnlaugsson, T., Huxley, A. J. M.,
McCoy, C. P., Rademacher, J. T., and Rice, T. E. (1997). Signaling
recognition events with fluorescent sensors and switches. Chem. Rev.
FIGURE 19 (A) Photoluminescence and excitation spectrum of 97, 1515–1566.
methyl salicylate, the flavoring of a Wint-o-Green Life Saver
R
. (B) Dickinson, J. T. “Fracto-Emission, Encyclopedia of Materials,” Elsevier,
Triboluminescence of sucrose. (C) Triboluminescence of a Wint-o- in press.
Green Life Saver R
. [Adapted with permission from Linda Sweet- Dunn, R. C. (1999). Near-field scanning optical microscopy. Chem. Rev.
ing. Copyright 1998.] 99, 2891–2927.
P1: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN008B-389 June 29, 2001 15:35
Luminescence 823
Fouassier, J. P., and Rabek, J. F., eds. (1990). “Lasers in Polymer Science Schaefer, F. P. (1990). “Dye Lasers,” 3rd ed., Springer-Verlag, New
and Technology: Applications,” CRC Press, Boca Raton, FL. York.
Harvey, E. N. (1957). “A History of Luminescence,” American Philo- Schulman, S. G., ed. (1988). “Molecular Luminescence Spectroscopy,”
sophical Society, Philadelphia. Vol. 2, Wiley, New York.
Herman, B. (1998). “Fluorescence Microscopy,” 2nd ed., Springer- Sweeting, L. M., Cashel, M. L., Dott, M. L., Gingerich, J. M., Guido, J.
Verlag, New York. L., Kling, J. A., Pippin III, R. F., Rosenblatt, M. M., Rutter, A. M.,
Horowitz, Y. S., ed. (1984). “Thermoluminescence and Thermolumines- and Spence, R. A. Spectroscopy and mechanism in triboluminescence,
cent Dosimetry,” Vols. 1–3, CRC Press, Boca Raton, FL. Mol. Cryst. Liq. Cryst. 211, 389.
Lakowicz, J. R. (1999). “Principles of Fluorescence Spectroscopy,” Sec- Sze, S. M. (1981). “Physics of Semiconductor Devices,” 2nd ed., John
ond Editon, Plenum Press, New York. Wiley & Sons, New York.
Pawley, J. (1995). “Handbook of Biological Confocal Microscopy,” 2nd Weber, M. J. (1982). “CRC Handbook of Laser Science and Technology,”
ed., Plenum Press, New York. Vols. 1–3, CRC Press, Boca Raton, FL.
Periasamy, A. (2001) “Methods in Cellular Imaging,” Oxford Univ. Yen, W. M., and Selzer, P. M., eds. (1981). Laser spectroscopy of solids.
Press, New York, in preparation. Top. Appl. Phys. 49.
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
Magnetic Materials
R. C. O’Handley
Massachusetts Institute of Technology
919
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
Examples are the saturation magnetization and Curie after saturation. Further, permanent magnets are char-
temperature. Fundamental properties reflect the elec- acterized by large coercivities so that the remanent
tronic structure that in turn is a consequence of chem- magnetization is reduced only by a very large field. Ma-
istry and crystallography. terials with coercivities in the range of about 25–500
Hard magnetic material See permanent magnet. Oe (2–40 kA/m) are considered to be semihard mag-
Magnetic anisotropy The preference for the magnetiza- netic materials. Greater and smaller coercivities gen-
tion vector in a material to point along one or more spe- erally characterize hard and soft magnetic materials,
cific axes. The preferred direction(s) of magnetization respectively.
can be governed by sample shape (magnetostatic en- Remanence The magnetization or flux density that re-
ergy), cyrstal structure (magnetocrystalline anisotropy mains in a sample at zero field after being exposed to
energy), strain in the structure (magnetoelastic energy), a specific field.
as well as surface or interfacial interactions (surface or Soft magnetic materials Materials that are relatively eas-
Néel energy). ily magnetized and demagnetized. They are typically
Magnetic glasses Noncrystalline or amorphous mate- characterized by high relative permeabilities and low
rials that exhibit strong or useful magnetic proper- coercivities (from a few tens of Oe down to as low as
ties. Magnetic glasses can be either oxide glasses or milli-Oe in some cases).
metallic glasses (amorphous magnetic alloys such as Superparamagnetism A type of magnetic behavior
Fe40 Ni40 B20 ). characterized by zero remanence and zero coercivity.
Magnetization The process by which a magnetic ma- It is observed in ferromagnetic particles or clusters of
terial is magnetized, generally either by domain wall atoms that are so small that thermal energy overrides
motion or by rotation of the magnetization within the tendency of the magnetization to lie in a particular
the domains. The term magnetization is also used as direction due to crystallography or shape. Specifically,
a noun to describe the state of a material in terms K u V < kB T.
of the vector sum of its magnetic moments per unit Technical magnetic properties Those properties of
volume. magnetic materials that depend strongly on processing
Magnetoresistance The change in resistance of a ma- and microstructure. Examples are coercivity, rema-
terial depending upon its state of magnetization. Like nence, and permeability.
magnetostriction, it can have isotropic components that Zeeman energy The potential energy of a magnetic mo-
show up on heating through the Curie temperature as ment in a field.
well as anisotropic components that vary with the rel-
ative direction of current and magnetization.
Magnetostatic energy The energy of a magnetic moment A MATERIAL is said to be magnetic when it possess
in a field caused by other moments. This energy, some- atomic-scale magnetic moments that show long-range
times referred to as dipole energy, is particularly strong ordering (ferromagnets, antiferromagnets, and ferrimag-
and positive inside a material magnetized normal to a nets) below a Curie temperature. In a ferromagnetic ma-
thin dimension. terial, all magnetic moments order parallel to each other.
Magnetostriction The strain associated with the state of In antiferromagnetic and ferrimagnetic materials the mo-
magnetization of a material. The isotropic part of this ments on one crystallographic site order with their mo-
strain, that decreases sharply on heating through the ments antiparallel to those on another, crystallographically
Curie temperature, is called the volume magnetostric- different site. In antiferromagnets, the atomic moments
tion. The anisotropic part of the strain, called the Joule on the two types of sites are equal and no net moment
magnetostriction, is a function of the direction of mag- results; in ferrimagnets, the atomic moments are not
netization relative to the crystal axes. equal and a net moment results. A material might also
Nanomagnetic material A material with variations be said to be magnetic if its atomic-scale magnetic mo-
in magnetic properties over a length scale of a few ments show no spontaneous ordering but do respond to
nanometers to hundreds of nanometers. Nanomagnetic an applied magnetic field with an increase in its magnetic
materials may be manmade by thin film deposition moment density (paramagnetic materials). The class of
and/or lithographic techniques or they may be natural magnetic materials could as well include diamagnetic ma-
consequences of thermodynamics and kinetics, such terials which need not have atomic-scale magnetic mo-
as the fine microstructures that can result from heat- ments but do respond to an applied field by producing
treating certain amorphous magnetic alloys. a weak magnetization directed opposite the applied field.
Permanent magnet A magnetic material which retains a Given this broad definition, it can be said that all materials
large magnetization upon removal of an applied field are magnetic.
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
A more restrictive definition of magnetic materials This article begins with an overview of some of the im-
might start with those that show ferromagnetic ordering portant manifestations of magnetism in materials (mag-
at room temperature, that is, they have Curie tempera- netic domains, magnetic anisotropy, magnetostriction)
tures which are greater than room temperature. Of all the and outlines the underlying science that explains the prop-
elements, only four, Fe, Co, Ni, and Gd, are ferromag- erties of magnetic materials. Magnetic materials can be
netic at room temperature. Several other elements show classified either by their functional properties (e.g., mag-
antiferromagnetism (e.g., Cr and α-Mn) or other forms netically soft or hard) or by the nature of their bonding
of magnetic ordering such as helical or spiral moment ar- and structure (oxides, alloys, intermetallic compounds).
rangements (Tb, Sm, Ho, etc.). Yet, starting with these Table I summarizes some of the important crystal struc-
few elements, the class of ferromagnetic, antiferromag- tures and magnetic materials systems that occur in a ma-
netic, and ferrimagnetic alloys and compounds expands trix of functional classes and bonding types. In this article,
to include as significant components about two-thirds of the major magnetic functional classes are described and
the elements of the periodic table. At least three materials, material examples are given. Also, an attempt is made to
Cu2 MnAl, ZrZn, and InSb, are ferromagnetic (the latter include descriptions of the major bonding and structure
two only at cryogenic temperatures), although they con- types of magnetic materials. Magnetic effects at surfaces,
tain none of the abovementioned strongly ferromagnetic in thin films, and fine particles are also described.
elements. This article surveys elements, alloys, and com- The magnetic fields B and H are related in MKS units
pounds that are ferromagnetic or ferrimagnetic at room by B = µ0 (H + M) where H is the applied field due to
temperature. macroscopic current densities, J = I /area. Ampere’s law
Magnetic materials are said to be soft if they respond to relates the H field to the current:
relatively weak external fields by changing their state of
magnetization. This change in magnetization is accompa- ∇ × H = J or H · dl = I, (1)
nied by a change in force on the material if the external
field is inhomogeneous. Magnetic materials are said to thus H has units A/m. M is the magnetic moment den-
be hard when they retain a state of net magnetization in sity or magnetization in the material, M = N µm /V ,
the presence of a significant opposing field. Hard mag- where µm is the average magnetic moment per atom
netic materials produce a field that can attract other soft or or molecule in the system and N /V is the number of such
hard magnetic materials. Half a century ago, most of the entities per unit volume. The magnetization, M, while hav-
interest in magnetic materials was associated either with ing the same MKS units as the H field, is the field due to
these forces (motors, electromagnetic actuators) or with microscopic currents. B is the flux density, φ/A (with units
their ability to concentrate magnetic flux (shielding) or of Tesla), whose time rate of change induces a voltage by
enhance flux change (inductors and transformers). Today, Faraday’s law:
engineering interest in magnetic materials has expanded
−∂φ
to include their ability to store information at high density ∇ × E = −∂B/∂t or E · dl = V = (2)
∂t
in a nonvolatile way (the information is retained when the
power to the storage device is off), as well as their varied The microscopic currents that give rise to the magne-
electrical and spin transport properties. tization are the spin and orbital angular momenta of the
While exciting new magnetic compositions are still be- electrons.
ing discovered, the range of properties exhibited by exist- The magnetic susceptibility χm is usually used to de-
ing magnetic compositions is now often expanded and/or scribe weak magnetic responses to H as in paramagnetic
tailored to specific needs by making them in thin film, mul- and diamagnetic materials. The magnitude of χm is typ-
tilayer, or nanoparticle form. In these reduced-dimension ically ±10−4 –10−6 (dimensionless in MKS units). Dia-
structures, novel magnetic properties can be obtained be- magnetism is not a matter of aligning preexisting atomic
cause of the altered atomic and chemical structure near magnetic moments. Rather, it is an electronic response
an interface and by arranging magnetic and sometimes to an applied field that creates a new component of or-
nonmagnetic components at small length scales so that bital angular momentum and thus a magnetic moment.
new magnetic interactions and transport effects can be The diamagnetic response is always negative as it can be
engineered. Thus a modern overview of magnetic materi- traced classically to Faraday’s law or Lenz’s law. A ma-
als must include not only materials whose properties vary terial with a paramagnetic (diamagnetic) susceptibility of
with composition, but also bond type and/or crystal struc- χ = +(−) 10−5 would show a magnetization M = 10 A/m
ture. Reference should also be made to the exciting new in a field of H = 106 A/m (applied field B = µ0 H ≈ 1.25
developments achieved by rendering old compositions in Tesla). A magnetization of 10 A/m corresponds to a flux
new structures. density B = µ0 M of 1.25 × 10−5 T which is five orders
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
TABLE I Important Examples of Magnetically Soft and Hard Materials can be Found in
each of the Major Bonding Types: Oxides, Metallic, and Covalent
Crystal structure and examples
Bond type Soft Hard
of magnitude less than the spontaneous magnetization of In a ferromagnetic material, the H field in Eq. (3) in-
the ferromagnetic materials listed in Table I. cludes not only the applied field but also the strong internal
The reason for the weak magnetic response of param- field called the Weiss molecular field, λM: B = µo (H +
agnetic materials to an external field is that thermal en- λM). The Weiss molecular field expresses the tendency
ergy k B T overwhelms the energy that favors alignment of for long-range magnetic order due to the exchange
a paramagnetic moment with the field, g µ B B. The field interactions between local moments. The constant λ is
and temperature dependence of the magnetization in para- called the molecular field constant. Because λM H ,
magnets (or ferromagnets, see below) is described by the the energy of the magnetization in the exchange field,
Brillouin function, B J (x): namely g µ B J µo λM, is comparable in magnitude to
k B T and magnetic ordering occurs spontaneously in the
M(H , T ) 2J + 1 2J + 1 absence of an applied field. Thus, for a ferromagnet, Eq.
= B J (x) = coth x
M(∞, 0) 2J 2J (3) becomes a transcendental equation that can be solved
graphically or numerically to describe the results in Fig.
1 x
− coth , (3) 2b. Figure 2b also compares the experimental data for
2J 2J Ni with the form derived from Eq. (3) L(x) for J = ∞).
as shown in Fig. 1. In Eq. (3), J is the total (spin plus
orbital) angular momentum quantum number and x =
g µo µ B J H/k B T , expresses the energy of the magnetic
moment in the H field relative to the thermal energy
kB T .
Ferromagnetic materials are characterized by a long-
range ordering of their atomic moments, even in the ab-
sence of an external field. The observed field dependence
of the magnetization extrapolates from its high-field val-
ues to a nonzero value, called the spontaneous magnetiza-
tion (Fig. 2a). The spontaneous, long-range magnetization
of a ferromagnet is observed to vanish above an ordering
temperature called the Curie temperature TC (Fig. 2b).
FIGURE 1 Brillouin function versus x = µµ B/kB T for various val-
Experimental curves of M versus H do not always show
ues of J. The infinite-spin limit is given by the classical Langevin
true saturation. Thermodynamics suggests that M(H ) ap- function, L(x). [From O’Handley, R. C. (2000). “Modern Magnetic
proaches saturation like 1 − H −1 . For polycrystalline sys- Materials, Principles and Applications.” Reprinted with permission
tems, M(H ) approaches saturation like 1 − H −2 . of John Wiley and Sons.]
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
FIGURE 2 (a) Magnetization of a ferromagnetic material versus field. (b) Reduced magnetization versus reduced
temperature for nickel (open data points, from Weiss and Forrer, Ann. Phys. 5, 153, 1926) and Brillouin function, B 1/2 (x)
(solid line). [From O’Handley, R. C. (2000). “Modern Magnetic Materials, Principles and Applications.” Reprinted with
permission of John Wiley and Sons.]
Saturation magnetic moments and Curie temperatures are I. INTERACTIONS AND ENERGIES
listed in Table II for some representative ferromagnetic IN MAGNETIC MATERIALS
materials.
The relative magnetic permeability µ = µr /µ0 = B/H
A. Magnetostatics
is used more often than susceptibility (χm = M/H ) to de-
scribe the response of ferromagnetic materials to H . This Magnetostatics refers to the consequences of the magnetic
is because ferromagnets are useful in electromagnetic de- fields that appear near the surfaces of magnetized bodies. If
vices where it is B rather than H that is important in the magnetization of a polycrystalline ferromagnetic sam-
inducing a voltage [Eq. (2)]. ple measuring, for example, 1 × 1 × 5 mm is measured,
µ0 Ms Ms µ0 Ms Ms n B /FU T C (T N )
emu emu
(T) (T) = M s /µ B N v
cm3 cm3
(µ B /FU)
Substance Structure (290 K) (0 K) (0 K) (K)
The quantity n B is called the magneton number, the number of Bohr magnetons per atom
or per formula unit (FU) in a material.
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
FIGURE 5 Crystal structures for Fe, Ni, and Co showing easy and hard magnetization directions with respective
magnetization curves below. [From O’Handley, R. C. (2000). “Modern Magnetic Materials, Principles and Applications.”
Reprinted with permission of John Wiley and Sons.]
plane are N x + N y = 0, Nz = 1. For a rod infinite along z , to the electron’s orbital motion via quantum mechanical
N x = N y = 0.5, Nz = 0. For a sphere N x = N y = Nz = 1/3. spin-orbit interactions, ξ L · S. Hence, the spin is coupled
to the symmetry of the crystal field. Thus, for strong mag-
netic anisotropy, a low-symmetry crystal field and strong
B. Magnetic and Magnetoelastic Anisotropy spin-orbit coupling are required.
The shape of a material is an extrinsic factor affecting the Figure 5 depicts the hexagonal-close-packed crystal
form of the M–H curve for a field applied along differ- structure of cobalt and shows the M–H curves for fields
ent directions in a sample. Several intrinsic factors can applied in a “hard,” base-plane direction [1000] as well as
also give rise to anisotropy in magnetic properties. These along the “easy” c axis, [0001]. The uniaxial anisotropy
are (1) magnetocrystalline anisotropy (the preference for of HCP cobalt or other uniaxial magnetic materials can
M to lie along certain high-symmetry crystallographic di- be described phenomenologically by the anisotropic free
rections), (2) magnetoelastic anisotropy (the preference energy density
for M to lie in a direction dictated by the symmetry of f ahex = K ul sin2l θ. (8)
an imposed anisotropic deformation of the material), and i=0
(3) field-induced or directed-pair anisotropy (the prefer-
ence for magnetization along an axis determined by a field The field energy density, −µo M · H, must be added to
present during prior heat treatment). Eq. (8) to give the total free energy
2. Magnetoelastic Interactions
The most common manifestation of magnetoelastic (ME)
interactions is magnetostriction, λ = l /l, the strain in
a material that accompanies a change in its direction of
magnetization. Figure 8 shows the field dependence of FIGURE 8 Representations of a demagnetized sample (left) and
the magnetostrictive strain measured parallel and perpen- two states of magnetization with the sense of magnetostrictive
strain shown for positive magnetostriction constant: (l /l )|| > 0. At
dicular to the applied field direction, starting from a ran-
the bottom is illustrated the field dependence of strain parallel and
domly demagnetized, isotropic sample. Below saturation, perpendicular to the magnetization direction. [From O’Handley,
the strain is generally quadratic in the field-induced mag- R. C. (2000). “Modern Magnetic Materials, Principles and Appli-
netization; above the anisotropy field, the strain saturates. cations.” Reprinted with permission of John Wiley and Sons.]
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
3d metals
Fe 5.2 × 104 −1.8 × 104 4.8 × 104 −1.0 × 104
Cou 7.0 × 105 1.8 × 105 4.1 × 105 1.5 × 105
Ni −12 × 104 3.0 × 104 −4.5 × 103 −2.3 × 103
Ni80 Fe20 — — −3 × 102 —
Fe50 Co50 −1.5 × 104 a
4 f metals
Gdu −1.2 × 105 +8.0 × 104 +1.3 × 104 —
Tbu −5.65 × 107 −4.6 × 106 — —
Spinel ferrites
Fe3 O4 −2 × 104 — −0.9 × 104 —
CoFe2 O4 +106 — 2.6 × 105 —
Garnets
YIG −2.5 × 103 — 1 × 103 —
Hard magnets
BaO·6Fe2 Ou3 4.4 × 105 — 3.2 × 105 —
Smu Co 5 7 × 106 — 1.1–2.0 × 107 —
Fe14 Nd2 Bu −1.25 × 107b — 5 × 106 —
a Disordered; K 1 ≈ 0 for ordered phase.
b Uniaxial materials are designated with a superscript “u” and their values K u and K u
1 2
are listed under K 1 and K 2 , respectively. The sign convention for the uniaxial materials
is based on the sin θ notation of Eq. (8): K 1 > 0 implies easy axis. Units are J/m ;
2 3
the magnitude of the cubic anisotropy. For example, in Eqs. (14) and (15) show that the magnetostrictive strains
Ni, B1 ≈ 6.2 × 106 N/m2 , so for ε ≈ 10−3 , | f me
c
| ≈ 6.2 × λijk are related to the ME stress coefficients Bi by the
10 J/m which is comparable to |K 1 | ≈ 4.8 × 103 J/m3 .
3 2
elastic constants in a way that parallels the mechani-
Some values of Bs and λs are listed in Table IV for cal stress strain relations, εij = Cjkl σkl , (except for the
representative materials.
The ME free energy in Eq. (12) also implies that if the TABLE IV ME Coupling Coefficients or Stresses (MPa) and
direction of magnetization changes (i.e., the αi s change) Magnetostriction Constants λ100 and λ111 in PPM at Room
Temperature for Several Materials
then the material may change its equilibrium state of strain.
This is the phenomenon of anisotropic magnetostriction λ100 B1 λ111 B2 Polycrystal,
shown in Fig. 8. The dependence of the magnetostrictive (λγ,2 ) (MPa) (λε,2 ) (MPa) λs
strain on the direction of magnetization may be derived
3d metals
by adding to Eq. (12) the form of the elastic energy for a
BCC-Fe 21 −2.9 −21 +2.9 −7
cubic crystal
HCP-Cou (−140) +6 (50) +13 (−62)
2
f el = 12 C11 e11 + cycl + C12 (e11 e22 + cycl) FCC-Ni −46 +6.2 −24 +4.3 −34
2
a-Fe80 B20 — — +32
+ 12 C44 e12 + cycl . (13) 4f metals
Minimization of Eqs. (12) and (13) with respect to the εij ’s TbFe2 2600 1753
leads to the form of the magnetostrictive strains λ100 and Oxides
λ111 : Fe3 O4 −15 56 +40
2 B1 CoFe2 O4 −670 120 −110
λ100 = − (14) Yttrium-iron garnet −1.4 −1.6 −2
3 C11 − C12
See Eqs. (14) and (15). Some polycrystalline magnetostriction values
and
are also listed. The prefix “a-” designates an amorphous material. For
1 B2 uniaxial materials (superscript “u”) where λγ ,2 or λε,2 was reported, their
λ111 = − (15)
3 C44 values are given in parentheses in the λ100 and λ111 column, respectively.
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
FIGURE 11 Schematic valence electronic structures of atomic oxygen (a), T atom (b), and how they combine to form
T-metal oxides (c), and T metals (d, e).
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
3. Intermetallic Compounds
and Amorphous Alloys
The dependence of magnetic moment on the local envi-
ronment (number, type, distance, and symmetry of nearest
FIGURE 13 Temperature dependence of the magnetization in neighbors) is shown clearly for a series of Mn-containing
NiFe2−x Vx O4 . [From Blasse, G., and Gorter, E. W. (1962). J. Phys. intermetallic compounds. The magnetic moment per Mn
Soc. Jpn. 16, Suppl. B-1, 176.] The schematic M–T curves at right atom shows a linear decrease with increasing Pauling
illustrate how the magnetization of two sublattices having different
temperature dependences combine to give the unique features
valence (Fig. 15). The Pauling valence is a function of
observed in the data. interatomic distance and coordination and is a measure of
covalent (as opposed to ionic) bonding. Increased covalent
2. Transition Metals and Alloys bonding hinders magnetic moment formation because co-
valent bonds have spin-paired electrons. A similar model
In 3d transition metal alloys, the saturation magnetic mo-
(Corb, B. W. et al., 1983) has been shown to apply to 3d
ment µ B (per average T species) varies as indicated in
metalloid alloys; it also shows that increasing the number
Fig. 14, which is called the Slater-Pauling curve. For alloys
of bonding orbitals per T atom decreases its moment.
to the right of the peak moment, the Fermi level cuts across
The variation of magnetic moment with valence elec-
only the spin-down d band. As the d electron concen-
tron concentration in crystalline transition-metal borides
tration, n d = n v − n s ≈ n V − 1, increases from 7.5 e/atom
TB and T2 B, shows a characteristic variation that appears
oward 10 e/atom, the number of holes per atom (unpaired
similar to that of the Slater-Pauling curve but with the
spins), and hence the moment per atom, decreases linearly
peak moment suppressed and shifted more toward lower
FIGURE 20 (a) Magnetic properties of BCC Fe-Co alloys. Anisotropy and magnetostriction. [After Hall, R. C. (1959).
J. Appl. Phys. 30, 816; dotted lines are for CsCl-ordered phase.] (b) Magnetic properties of amorphous Fe-Co-B alloys
for which the magnetocrystalline anisotropy is essentially zero over the entire composition range. [From O’Handley,
R. C. et al., (1979). J. Appl. Phys. 50, 3603; O’Handley, R. C. (2000). “Modern Magnetic Materials, Principles and
Applications.” Reprinted with permission of John Wiley and Sons.]
of these alloys, grain size and texture are the primary fac- species reduces the saturation magnetization but often has
tors determining the technical magnetic properties actu- other beneficial effects (increased resistivity, decreased
ally attained. magnetostriction). The high electrical resistivity of
Amorphous metallic alloys are materials that are rapidly amorphous alloys (120–150 µcm) compared to Si-Fe
quenched from the melt or deposited from the vapor (30−50 µcm) and iron-nickel alloys (20 µcm) makes
states as thin films so that their atomic structure lacks them attractive for high-frequency operation. Reasonably
the long-range order of a crystalline solid. Without long- strong magnetization can be realized in a variety of
range order, amorphous alloys have no magnetocrystalline amorphous alloys based on iron, cobalt, and/or nickel (cf.
anisotropy. Thus amorphous metallic alloys based on tran- Figs. 16 and 20b for Fe-Co-B amorphous alloys).
sition metals can show a very easy magnetization process. The most widely used soft magnetic ferrites are based
Magnetoelastic anisotropy remains as the major impedi- on manganese-zinc ferrite and nickel-zinc ferrite. These
ment to easy magnetization in amorphous alloys. Figure 21 materials have the spinel structure of Fe3 O4 . Figure 22
shows that the coercivity of amorphous Co80−x Fex B20 al- shows the dependence of magnetostriction and permeabil-
loys is minimized near the composition x = 4 for which ity on Fe2 O3 content in [(MnO)0.7 (ZnO)0.3 ]1−x ·(Fe2 O3 )x .
λ ≈ 0. In these compositions, K 1 is small so the permeability
The presence in amorphous magnetic alloys of sig- peaks near the composition at which λs = 0 (cf. Fig. 21).
nificant concentrations of non-magnetic, glass-forming The insulating character of oxide magnets makes them
useful at frequencies in excess of the MHz range.
Starting with amorphous Co-Nb-B alloys, the coerciv- Fe44 Co44 Zr7 B4 Cu1 (Willard, M. A. et al., 1998) crystal-
ity increases by nearly four orders of magnitude through lize to high-magnetization α-Fe nanostructures. The latter
various stages of devitrification corresponding to various show saturation flux densities in excess of 2 T.
nanostructure length scales. The coercivity initially in-
creases consistent with the D 6 power law derived theoret-
D. Hard Magnetic Materials
ically (Herzer, 1993). Above a peak, Hc drops off more
gradually, consistent with the 1/D behavior predicted for Permanent magnets are used to produce strong fields with-
defect sizes greater than the domain wall width (Fig. 10). out having to apply a current to a coil. Hence they should
Iron-rich amorphous and nanocrystalline alloys gener- exhibit a strong net magnetization. It is also important
ally show larger magnetization than those based on Co or that the magnetization be stable in the presence of ex-
Ni. However, iron-rich amorphous alloys generally have ternal fields. These two conditions indicate that the B-H
fairly large magnetostriction, limiting their permeability loop should have large values of remanent induction, Br ,
(Fig. 20b). Formation of a nanocrystalline iron-rich al- and coercivity Hc , respectively. Permanent magnets have
loy can lead to a dramatic reduction in magnetostriction, coercivities in the range of 104 –106 A/m.
thus favoring easy magnetization (Hasegawa, N. et al., The shapes of the M-H loops in the second quadrant
1993). (which determines demagnetization behavior) are com-
Most magnetically soft nanocrystalline systems are pared in Fig. 25 for some common permanent magnets.
based on DO3 crystallites (e.g., Fe3 Si) in an amorphous The permanent magnets with the highest energy products
matrix. However, the presence of 25 atom% Si signif- are based on Fe14 Nd2 B1 . Figure 25b shows the evolution
icantly reduces the saturation flux density of the alloy of the maximum energy product in permanent magnets
(Hasegawa, N. et al., 1991). Spinodal decomposition in over recent decades.
metastable amorphous transition metal-carbon alloys can The earliest permanent magnets were the natural forms
be used to form nanocrystalline alloys of the general for- of magnetite, Fe3 O4 . More recently, the high-anisotropy
mula (Fe, Co)81 Ta9 C10 . Annealing at 550◦ C for 20 min hexagonal ferrites (barium or strontium hexaferrite)
results in primary crystallization of α–Fe (or α−FeCo) (Kojima, H. et al., 1982) and the magnets of the alnico
particles measuring 5–10 nm in diameter and dispersed family were developed (McCurrie, R. A., 1982). The
transition metal carbide nanocrystals (generally at triple hexagonal ferrites derive their magnetic hardness from
junctions). The primary nanocrystals share grain bound- their very large magnetocrystalline anisotropy (Table I)
aries making grain-to-grain exchange coupling stronger. and relatively weak magnetization. The alnico magnets
The softest magnetic properties are obtained for the small- achieve high-coercivity by the formation of a high-aspect-
est nanocrystalline grain sizes. In these alloys the stable ratio structure consisting of columns of nonmagnetic
carbide grain boundary phase inhibits grain growth just as Ni3 Al in a matrix of α-Fe by spinodal decomposition
Nb does in FeBSiCuNb. from a Heulser composition such as Fe2 NiAl. Alnico
Other magnetic nanocrystalline materials show po- magnets can achieve high remanence by processing
tential for increased saturation flux density. Alloys of in a mgentic field to achieve strong orientation of the
the type Fe88 Zr7 B4 Cu1 (Suzuki, K. et al., 1991) and columnar microstructure.
FIGURE 25 (a) Second quadrant M-H loops of some common permanent magnets. (b) Increase in (BH )max of
permanent magnets over recent decades. [From O’Handley, R. C. (2000). “Modern Magnetic Materials, Principles
and Applications.” Reprinted with permission of John Wiley and Sons.]
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
TABLE V Comparison of Some Magnetic Properties for SmCo5 , Sm2 (CoFe)17 , and
Fe14 Nd2 B Permanent Magnets at 25◦ C
the easy and hard axis magnetization curves for the in Eq. (23) is most often measured in thin film systems and
two principal Co-Sm magnets and Fe14 Nd2 B (see also is clearly a function of the nature of the material across
Table V). the interface from the magnetic species. Compared to en-
Many permanent magnets are essentially nanostruc- hanced surface magnetic moments, surface anisotropy is
tured materials consisting of high-anisotropy, single- more readily determined experimentally but is harder to
domain particles (or multidomain particles with pinned calculate from first principles. The literature abounds with
domain walls) that are magnetically decoupled from each observations of surface anisotropy effects in thin films and
other. at surfaces.
The first term in Eq. (23) is often expressed as:
K eff sin2 θ, with the contributions to the effective aniso-
IV. RELATED MAGNETIC PHENOMENA tropy defined by
AND APPLICATIONS Ks
K eff = −2π Ms2 + 2B1 εx x + 2 . (24)
t
A. Thin Film and Surface Magnetism Here B1 is the magnetoelastic coupling coefficient, εx x is
Research in magnetic thin films has grown in sophistica- the in-plane film strain, K s is the surface anisotropy [−K s
tion as advances in ultrahigh vacuum technology and sur- in Eq. (23)]. Measured values of K s are typically of order
face characterization techniques kept pace with improve- 0.1–0.5 mJ/m2 .
ments in theoretical understanding and computational Figure 29 shows the measured effective, first-order uni-
accuracy. axial anisotropy for Cu/Ni/Cu(001) epitaxial films. The
At a clean surface, the atoms of a material have re- positive values of K eff over a relatively wide thickness
duced nearest-neighbor coordination, reduced symmetry range, 2 nm < tNi < 140 nm, imply perpendicular magne-
and generally different bond lengths compared to inte- tization there. Here the perpendicular magnetization re-
rior atoms. These changes generally result in narrower d sults from small magnetostatic energy and large positive
bands and increased density of states, which in turn favor magnetoelastic energy.
increased saturation magnetic moment and lower Curie There has been considerable interest in the switching
temperature. First principles band structure calculations at of the easy axis of magnetization in ultrathin, epitaxial
surfaces bear this out. It is very difficult experimentally to Fe films (tFe < 1 nm). This phenomenon has been studied
isolate the magnetic moment of surface atoms sufficiently as a function of film thickness and temperature and the
to detect a 10% increase in moment. results point to the origin of perpendicular magnetization
Near a surface, the narrower, more atomic-like d bands in surface anisotropy.
give rise to increased angular momentum; the reduced Co thin films layered with Au (den Broeder, F. J. A.
symmetry near a surface generally leads to stronger mag- et al., 1988) or with Pd (Engle, B. et al., 1991) can show
netic anisotropy. The ratio L 2z /(L 2x + L 2y ) is increased be- strong perpendicular magnetic anisotropy up to 2 nm in
cause L = r × p and momentum perpendicular ot the sur- thickness. In these systems both magnetoelastic energy
face must be reduced. The surface magnetic anisotropy and interface anisotropy appear to be important for per-
can be expressed as an energy density in powers of the di- pendicular magnetization.
rection cosines of M [cf. Eq. (11)] having the appropriate
uniaxial symmetry: B. Electronic Transport in Magnetic Materials
σ = K 1s α32 + K 2s α32 + K 32 α12 α22 + ··· (23)
The electrical transport properties of magnetic materials
With this convention, K 1s > 0 (<0) favors in-plane (per- (called galvanomagnetic effects) are interesting because
pendicular) magnetization near the surface. The first term the response of the conduction electrons can depend on
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
FIGURE 29 Variation of effective anisotropy times Ni thickness (K eff × t) versus Ni thickness in Cu/Ni/Cu(001)
sandwiches [after Jungblut, R. et al. (1994). J. Appl. Phys. 75, 6424; Bochi, G. et al. (1995). Phys. Rev. B 52, 7311;
and Ha, K. et al. (1999). J. Appl. Phys. 85, 5282], using in situ magneto-optic Kerr effect, ex situ vibrating-sample
magnetometry, and ex situ torque magnetometry, respectively, to determine the effective anisotropy. K eff × t > 0
implies perpendicular magnetization in the uniaxial approximation. [From O’Handley, R. C. (2000). “Modern Magnetic
Materials, Principles and Applications.” Reprinted with permission of John Wiley and Sons.]
their spin direction. The unique features result from the Thus, an electron of spin-up before scattering can only
presence of both s and d electrons at the Fermi energy scatter into a spin-up state (no change in spin-angular mo-
(Fig. 11e) as well as differences in the density of 3d ↑ mentum), if it does not change its orbital angular momen-
and 3d↓ states at E F . In magnetic transition metals, the tum; it can scatter into a spin-down state if it increases
two equally-populated sub-bands of s electrons (↑ and ↓) its orbital angular momentum. These two spin-dependent
carry most of the current in two parallel spin channels. scattering mechanisms can therefore serve to mix the
In each spin channel there is a small resistivity associated carriers in the two spin sub-bands. Opening a path be-
with s electron scattering and a larger resistivity associ- tween two parallel conduction paths always lowers the net
ated with scattering of s electrons into localized d states, resistivity.
s-d scattering. To a first approximation, there is no mix- The “ordinary” galvanomagnetic effects, i.e., the Hall
ing between the two spin channels. For iron, the Fermi effect, E = RH (J × µ0 H) and magnetoresistance, are ob-
energy cuts across both spin-up and spin-down d bands, served in most materials. These effects are classical and
hence there is a large s-d contribution to the resistivity in arise from the Lorentz force, F = q(v × B), on the charge
each channel. In Ni, the Fermi energy is above the top of carriers. In ferromagnetic materials, the ordinary effects
the majority-spin d band and hence the resistivity in that are present but are generally overshadowed by phenom-
band is less than in the minority-spin sub-band where s-d ena with similar symmetries. The galvanomagnetic effects
scattering can occur. unique to ferromagnets are called “extraordinary,” “spon-
Magnetotransport effects in a given magnetic mate- taneos,” or “anomalous” because of their greater strength
rial can arise from two types of magnetic interactions— relative to the ordinary effect. The extraordinary galvano-
exchange coupling and spin-orbit interaction. A con- magnetic effects derive their strength from the fact that the
duction electron generally has a greater scattering cross role of the external field is replaced by the internal field,
section at sites having spin opposite its own. This is re- which is proportional to the magnetization and hence is
ferred to as exchange scattering or spin disorder scattering. much stronger than an applied field. The mechanism by
Conduction electrons can also interact with a scattering which M couples to the electron trajectory in ferromag-
site by the spin-orbit interaction. The total angular mo- nets is the spin-orbit interaction between the current carrier
mentum must be conserved during the scattering process. (orbit) and the magnetization (spin).
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
In ferromagnetic materials the Hall resistivity may be antiferromagnetically through the Cr layers and hence are
written difficult to saturate. If electrons bearing the polarization
EH of one ferromagnetic layer drift into the other (through a
ρH = = ρoH + ρs H
J thin 2–3 nm of noble metal) the scattering probability is
= R o B + R s µo M (25) greater or less depending on whether the polarizations of
the two layers are antiparallel or parallel. In order for the
where the first term is the ordinary Hall resistivity pro- difference in scattering probability to show up as a large
portional to the external field and the second term is R /R, two general conditions should be satisfied.
the spontaneous effect, proportional to the magnetiza-
tion. We can write these two terms as a simple sum 1. The thicknesses of the three layers must be small
because the vector symmetry of the spin-orbit energy, enough that a large fraction of the charge carriers
L · s, responsible for the spontaneous Hall effect, is com- from one ferromagnetic component diffuse into the
patible with the energy of the classical Lorentz force, other before experiencing spin-dependent scattering
r · F ∝ r · (v × B) = (r × v) · B ∝ L · M ∝ L · s . 2. The magnetization in the two components should be
The anisotropic magnetoresistance (AMR) may be de- able to be controlled independently either by having
termined by extrapolation of high field MR (ordinary MR) different coercivities or different anisotropy fields
data to H = 0 (see Fig. 30). The fractional change in re-
sistance with field due to anisotropic magnetoresistance, A spin valve is a simple embodiment of the GMR effect
while only a few percent in the best cases (Ni90 Fe10 ), is in which there are only two magnetic layers separated by
used in numerous sensor applications. While the Hall ef- a nonmagnetic conductor. The magnetic layers are uncou-
fects are linear in B = µo (H + M), the MR effects are pled or weakly coupled in contrast to the generally strong
quadratic in B. AF exchange operating in Fe/Cr-like multilayer systems.
There is a more-recently-discovered galvanomagnetic The magnetoresistance can be made to change in fields
effect that is unique to electronic transport in thin of a few tens of Oe rather than tens of kOe. One of the
films. Called giant magnetoresistance (GMR), it is layers is magnetically soft and the other is magnetically
of a different physical origin than AMR. GMR arises hard. Thus, a modest field can cause a change in the angle
from spin-spin exchange scattering and requires small between the moments of these two magnetic layers.
dimensions in the components of a ferromagnetic/noble In operation of the spin valve, cycling the field causes
metal/ferromagnetic sandwich or multilayer. In 1988, M1 and M2 to lie antiparallel or parallel to each other. The
Baibich, M. N. et al. (1988) reported an MR ratio of resulting M-H loops are shown schematically in Fig. 31.
order 50% at 4.2 K in multilayers of the Fe-Cr alloys. The sharp magnetization reversal near H = 2 Oe is due to
This magnetoresistance was approximately an order the switching of the soft NiFe layer 2 in the presence of its
of magnitude greater than the highest values known weak coupling to layer 1. The more rounded magnetiza-
to that time. The Fe layers in these experiments were tion reversal near 100 Oe is the switching of the hard layer.
typically 30–60 Å thick and separated by Cr layers The relative orientations of layers 1 and 2 are indicated by
from 9 to 60 Å. The iron layers are strongly coupled the pairs of arrows in each region of the M-H curve. In
FIGURE 30 (a) Resistivity of Ni0.9942 Co0.0058 at room temperature versus applied field (McGuire, T. 1975). (b) Low-
field magnetoresistance for cobalt thin film showing even field symmetry and hysteresis. [After Parkin, S. S. P. (1994).
In “Ultrathin Magnetic Structures,” Vol II (Heinrich, B. and Bland, J. A. C. eds.) Springer-Verlag, Berlin; O’Handley,
R. C. (2000). “Modern Magnetic Materials, Principles and Applications.” Reprinted with permission of John Wiley and
Sons.]
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
FIGURE 34 Schematic of the read and write processes in a pseudo-spin-valve random access memory. The write
process involves current pulses through both the word line and the sense line such that the field at the PSV exceeds
Hc2 . The read process involves a field pulse that takes the device to the high-resistance state without switching
the semihard layer (Hc1 < H < Hc2 ). [From O’Handley, R. C. (2000). “Modern Magnetic Materials, Principles and
Applications.” Reprinted with permission of John Wiley and Sons.]
material. Such bits should be arranged periodically in or- MRAM is basically an array of spin valves, pseudo-spin
der to be synchronized with the signal channel. This can be valves or spin tunnel junctions (Fig. 34), each of which
achieved using high resolution lithography. The term pat- can be set to a given bistable state by a relatively large
terned media is used to refer to media for which each bit write field; that state can be read without destroying it by
consists of a single, lithographically defined
√ grain. Such a using a smaller read field.
recording medium eliminates the random N noise asso- The write process consists of magnetizing both the free
ciated with multigrain bits. It also eliminates the noise as- and semihard layers in one direction or another by an ap-
sociated with irregular or saw-tooth transitions that cause propriately directed word current pulse (and simultaneous
noise in thin-film media. Patterned media will allow for sense-line current). After the write process, the two layers
high bit densities because the superparamagnetic limit will are in their remanent states and the resistance of either
thus apply to a single bit, not to each of the many grains state has the same minimum value (Fig. 34).
in a multigrain bit. Finally, the patterning process defines The MRAM read process consists of applying a bipo-
a sharper transition between bits, and dispersion of easy lar current pulse to the word line. This pulse produces a
axes can be minimized relative to that in thin-film media. field sufficient to switch the soft layer but not the hard
Thus patterned media have relaxed conditions on coerciv- layer: Hc1 < Hword < Hc2 . Thus, depending upon the state
ity and Mr t product. of the element, “0” or “1,” the resistance in the sense line
Considerable current interest is focused on a class of changes in phase or out-of-phase, respectively, with the
storage devices called magnetic random access memories word current pulse. After application of the read pulse,
(MRAMs). They have several advantages over hard disk the MR element reverts to its original remanent state (↑↑
drives: they have no moving parts, need no read or write or ↓↓). This is only possible if the two magnetic layers
heads, and offer the ability to access information at an are ferromagnetically exchange coupled through the Cu
arbitrary sequence of addresses (random access) as op- spacer, or if the read pulse is followed by a small reset
posed to sequential access as in tape and disk storage. An pulse of opposite polarity.
P1: GSS/GJK P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN008N-393 June 29, 2001 15:46
The materials requirements for MRAMs are similar to to supplant semiconductors in some microelectronic
those of spin-valves and spin-tunnel junctions plus the applications.
additional resrtrictions imposed by nanofabrication and
multimaterial compatibility.
SEE ALSO THE FOLLOWING ARTICLES
D. Postscript FERROMAGNETISM • GEOMAGNETISM • MAGNETIC
The recent history of magnetic materials research and de- FIELDS IN ASTROPHYSICS • MAGNETIC RECORDING •
velopment is characterized by frequent reinvention of the THIN FILM TRANSISTORS
field. The decade of the 1960s was dominated by work
on permalloys, ferrites, transport phenomena, and mag-
BIBLIOGRAPHY
netic resonance in bulk and thin film samples. In the1970s,
magnetic “bubble films” (perpendicularly magnetized do-
Baibich, M. N. et al. (1988). Phys. Rev. Lett. 61, 2742.
mains in which information could be stored and manip- Bozorth, R. M. (1955, 1993). “Ferromagnetic Materials,” Van Nostrand,
ulated) and rapid developments in amorphous magnetic New York; IEEE Press, New York.
alloys attracted considerable attention. The 1980s saw Cadeville, M. et al. (1996). J. Phys. (Paris) 27, 29.
continued improvements in amorphous magnetic alloys, Chikazumi, S. (1997). “Physics of Ferromagnetism,” Oxford University
rapidly accelerating activity in magnetic thin films and Press, Oxford.
Corb, B. W. et al. (1983). Phys. Rev. B 27, 636.
surfaces, and the magnetic recording developments those Diedrichs, P. H. et al. (1991). In “Magnetism in the Nineties,” (Freeman,
activities supported, as well as the emergence of Fe-Nd- A. J. and Gschneider, K. eds.), North-Holland, Amsterdam.
B permanent magnets. The 1990s witnessed enormous den Broeder, F. J. A. et al. (1988). Phys. Rev. Lett. 60, 2769.
growth in magnetic recording research, development of Engle, B. et al. (1991). Phys. Rev. Lett. 67, 1910.
nanocrystalline magnetic materials, and extensive basic Hasegawa, H. et al. (1993). J. Mater. Eng. Perf. 2, 181.
Hasegawa, H. et al. (1991). J. Appl. Phys. 70, 6253.
and applied work on a host of new magnetotransport phe- Herzer, G. (1993). J. Mater. Eng. and Perf. 2, 193.
nomena. Jin, S. et al. (1994). Science 264, 413.
The immediate future of magnetic materials will Livingston, J. D. (1996). “Driving Force: The Natural Magic of Mag-
involve greater use of nanofabrication (thin films, phase nets,” Harvard Univ. Press, Cambridge MA.
separation) to engineer new properties and devices. Moodera, J. et al. (1996). Appl. Phys. Lett. 69, 708.
O’Handley, R. C. (2000). “Modern Magnetic Materials; Principles and
Many of these new magnetic materials technologies will Applications,” John Wiley and Sons, New York.
converge in the emerging field of spin-tronics, where Willard, M. A. et al. (1998). J. Appl. Phys 84, 1.
spin-dependent transport phenomena hold the potential Yoshizawa, Y. et al. (1988). J. Appl. Phys. 64, 6044.
P1: FMX/LSU P2: GQT Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN011B-553 July 25, 2001 16:54
Permittivity of Liquids
J. Barthel
R. Buchner
University of Regensburg
I. Phenomenological Aspects
II. Analysis of Complex Permittivity Spectra
of Liquids
III. Experimental Methods
IV. Molecular Interpretation of Relaxation Modes
697
P1: FMX/LSU P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN011B-553 July 25, 2001 16:54
FIGURE 6 Dielectric permittivity, ε (ν), and loss, ε (ν), of FIGURE 8 Dielectric permittivity, ε (ν), and loss, ε (ν), of the
0.0345 mol dm−3 La[Fe(CN)6 ] in water at 25◦ C. Experimental data monohydrate of trifluoromethanesulfonic acid at 40◦ C. Experi-
(FD: •, TDR: ) are fitted to a superposition of two Debye pro- mental data (FD: •, TDR: ) are fitted to a superposition of four
cesses (solid lines) attributed to the ion pair (IP) and to the solvent Debye processes (solid lines).
(H2 O). Also indicated is the total loss, η (ν), of the solution.
FIGURE 10 Waveguide apparatus for the determination of ε̂(ν) in the E-band (60–90 GHz) range with transmission
measurements. 1–9: waveguide interferometer with cell C and movable probe P; PLO, PLO-D, PLO-P: microwave
signal source and control unit; 8, MMC, S, RE: signal detection unit; HH, MT, SMD, SM, PM, SP: probe position
control unit; PD: interface enabling the control of four interferometers (E-, A-, Ku-, X-band) in the frequency range 8.5
to 90 GHz; MC: personal computer.
P1: FMX/LSU P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN011B-553 July 25, 2001 16:54
i
self
+ µi (t) · µ j (0) (26)
i j=i
distinct
material analysis and the characterization of pharmaceu- Barthel, J., Buchner, R., Eberspächer, P. N., et al. (1998). “Dielectric re-
tical systems. laxation in electrolyte solutions. Recent developments and prospects,”
J. Mol. Liq. 78, 82–109.
Barthel, J., and Buchner, R. (2000). “Relative permittivities of elec-
SEE ALSO THE FOLLOWING ARTICLES trolytes,” In “Experimental Thermodynamics” (Goodwin, A. R. H.,
Marsh, K. N., and Wakeham, W. A., eds.), Vol.6, Ch.9c, Blackwell,
ELECTROLYTE SOLUTIONS, THERMODYNAMICS • Oxford.
ELECTROLYTE SOLUTIONS, TRANSPORT PROPERTIES Buchner, R., and Barthel, J. (1994). “Dielectric relaxation in solutions,”
Annu. Rep. Progr. Chem., Sect. C 91, 71–106.
• HYDROGEN BOND • INFRARED SPECTROSCOPY • Craig, D. Q. M. (1995). “Dielectric Analysis of Pharmaceutical Sys-
LIQUIDS, STRUCTURE AND DYNAMICS • MICROWAVE tems,” Taylor & Francis, London.
COMMUNICATIONS Davis, J. L. (1990). “Wave Propagation in Electromagnetic Media,”
Springer, Berlin.
Kaatze, U. (1997). “The dielectric properties of water in its different
BIBLIOGRAPHY states of interaction,” J. Solution Chem. 26, 1049–1112.
Madden, P., and Kivelson, D. (1984). “A consistent molecular treatment
Bagchi, B., and Chandra, A. (1991). “Collective orientational relaxation of dielectric phenomena,” Adv. Chem. Phys. 56, 467–566.
in dense dipolar liquids,” Adv. Chem. Phys. 80, 1–126. Scaife, B. K. P. (1989). “Principles of Dielectrics,” Clarendon, Oxford.
P1: GSS Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
Perovskites
C. N. R. Rao
Nehru Center for Advanced Scientific Research
I. Crystal Chemistry
II. Electrical and Magnetic Properties of
Perovskite Oxides
III. Magnetic Properties of Perovskite Fluorides
IV. Metal–Insulator Transitions
V. Oxides of K2 NiF4 Structure
VI. Ferroics
VII. High-Temperature Superconductors
VIII. Colossal Magnetoresistance
IX. Why Are Perovskites Special?
707
P1: GSS Final Pages
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
708 Perovskites
FIGURE 2 (a) Close-packed AO3 layer in perovskites. (b)–(f) BO6 octahedra in different perovskite polytypes: (b)
3C, (c) 2H, (d) 6H, (e) 4H, and (f) 9R.
P1: GSS Final Pages
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
Perovskites 709
710 Perovskites
FIGURE 4 Periodic table of perovskites. [From Goodenough, J. B. (1974) In “Solid State Chemistry” (C. N. R. Rao,
ed.), Dekker, New York.]
and bσ = bc (bc is the critical value of the transfer energy) ≈S (S + 1) decreases the covalent mixing, it is natural that
separate oxides exhibiting localized electron behav- maxima in the curves bπ = bc and bσ = bc corresponding
ior from those with collective electron properties. to smallest values of bπ and bσ occur in the middle of the
Compounds in column 1 are insulators because the B columns with S = 52 . Rare earth orthoferrites with S = 52
cations are of d ◦ electron configuration. Most of the are antiferromagnetic insulators and exhibit parasitic
compounds in column 2 (spin S = 12 ) are metallic and ferromagnetism. The important contributions here are:
Pauli paramagnetic; the line bπ = bm separates LaTiO3 (a) Fe3+ spins canted in a common direction either by co-
from GdTiO3 because GdTiO3 is a semiconductor with a operative buckling of oxygen octahedra or by anisotropic
ferromagnetic Curie temperature (Tc ) of 21 K. AMoO3 superexchange and (b) canting of an antiferromagnetic
(A = Ca, Sr, Ba) and SrCrO3 in the third column (S = 1) rare earth sublattice because of interaction between two
are metallic and Pauli paramagnetic. Other compounds in sublattices.
this column are semiconducting and antiferromagnetic. LaCoO3 is shown twice in Fig. 4, both in the S = 2 and
The line bπ = bm separates metallic and Pauli para- S = 0 columns at the end, since Co3+ in this solid can have
magnetic SrCrO3 from the antiferromagnetic semimetal either the low-spin or the high-spin configuration. The
CaCrO3 . The line bπ = bc separates PbCrO3 from LaVO3 compound exhibits a transition from a localized electron
because the latter exhibits a crystallographic transition at state to a collective electron state (metal-insulator transi-
a temperature lower than the Neel temperature (TN ) char- tion). In the ninth column of Fig. 4, perovskites contain-
acteristic of localized electrons. The region bm > bπ > bc ing d 4 cations are placed. Of the three compounds in this
appears to be quite narrow as revealed by electrical, mag- column, SrRuO3 is a ferromagnetic metal (Tc = 160 K);
netic, and associated properties. Pressure experiments are CaRuO3 is antiferromagnetic (TN = 110 K) with a weak
valuable in the study of this region; thus, dTN /d P < 0 ferromagnetism. Since both the compounds have the same
in CaCrO3 , while dTN /d P > 0 in YCrO3 and CaMnO3 . RuO3 array, the change from ferromagnetic to antiferro-
Since increasing pressure increases bπ (by decreasing magnetic coupling is of significance. SrFeO3 is placed in
lattice dimensions), dTN /d P > 0 for bπ < bc (localized the same column on the assumption that Fe4+ (3d 4 ) is in the
behavior) and dTN /d P < 0 for bm > bx > bc (collective low-spin state, but recent work suggests that Fe4+ in this
behavior). Compounds in columns 4, 5, and 6 are anti- oxide is in the high-spin state down to 4 K. CaFeO3 , on the
ferromagnetic insulators. Since the intraatomic exchange other hand, shows disproportionation of Fe4+ to Fe3+ and
P1: GSS Final Pages
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
Perovskites 711
Fe5+ below 290 K. In the last but one column containing magnetism at low temperatures. Neutron diffraction and
S = 12 B cations, metallic and Pauli paramagnetic LaNiO3 specific heat measurements show that the 3D transition
should be separated from antiferromagnetic YNiO3 and occurs at 2.65 K. The 3D magnetic structure consists
LuNiO3 indicating that in LaNiO3 bσ > bm and in YNiO3 of ferromagnetic planes parallel to the c-axis that are
bσ < bm . Similarly, in the last column, LaCoO3 should be coupled antiferromagnetically.
separated from LaRhO3 because the latter is a narrow gap Among the fluorides with layered structures, the
semiconductor with a filled t2g (π ∗ ) band and an empty K2 NiF4 family has been widely investigated. K2 NiF4
eg (σ ∗ ) band. is a two-dimensional Heisenberg antiferromagnet with
TN = 97 K and J/k = − 50 K. The isostructural K2 CoF4
behaves as a 2D, S = 12 , Ising system (TN = 107.8 K,
III. MAGNETIC PROPERTIES OF J/k = − 97 K). K2 CuF4 (and its rubidium and cesium
PEROVSKITE FLUORIDES analogs) crystallizing in an orthorhombic-distorted
K2 NiF4 structure are 2D Heisenberg ferromagnets. The
In oxides, ferrimagnetism is common in spinel, garnet, distortion of the structure and ferromagnetic properties
and magnetoplumbite structures because of the occupa- arise from the ordering of the elongation axis of the
tion of tetrahedral, octahedral, or dodecahedral sites by CuF6 octahedra alternately in the a and b directions. The
magnetic ions. In fluorides, the transition metal cations intralayer exchange constant J/k is ∼11 K and the value
invariably occupy octahedral sites. Ferrimagnetism there- for interlayer coupling is ∼0.03 K.
fore results from the manner in which the octahedra
are linked. In fluorides of hexagonal BaTiO3 structure,
for example, one of the sublattices consists of octahe- IV. METAL–INSULATOR TRANSITIONS
dra linked by corners and the other octahedra linked by
faces; one third of the metal ions are present in corner- LaCoO3 , which is an insulator at ordinary temperatures,
shared octahedra and two-thirds in face-shared octahera. becomes metallic at high temperatures ( 1200 K). More
Both the sublattices have different magnetic moments interesting are the transitions from the metallic to the in-
that are coupled antiferromagnetically leading to ferri- sulating state brought about by compositional changes.
magnetism. CsFeF3 is a typical material possessing this Oxides of the type La1−x Srx MO3 (M = V or Co) show
structure; Fe–Fe interaction between face-shared octa- metal–insulator transitions with an increase in x. Thus,
hedra is ferromagnetic and that between corner-shared La1−x Srx CoO3 becomes metallic at all temperatures when
octahedra is antiferromagnetic, giving rise to ferrimag- x > 0.3, while LaCoO3 (x = 0) is an insulator at room tem-
netism with a Tc of 60 K. Isostructural RbMnF3 is, perature. When M = Mn or Co, the oxide becomes fer-
however, antiferromagnetic since all interactions between romagnetic at the same composition when the d-electrons
the neighboring octahedra are antiferromagnetic in this become itinerant. Another interesting system showing
compound. compositionally controlled metal–insulator transitions is
KCrF3 is an antiferromagnetic solid adopting an LaNi1−x Mx O3 , where M = Cr, Mn, Fe, or Co. In this sys-
antiferro-distortive structure, the magnetic structure tem, the metallic resistivity of the x = 0 oxide gives way
being similar to that of LaMnO3 (A type). Superexchange to a semiconducting or an insulating behavior above a par-
interaction dx02 − y 2 − p − dz12 leads to ferromagnetic ticular value of x. Such a change–over occurs essentially
layers that are antiferromagnetically coupled through the at a universal value of resistivity (2000 µ
cm) in all these
empty dx 2 − y 2 orbitals. KCuF3 is a 1D antiferromagnet (A oxide systems, the value corresponding closely to Mott’s
type) with the spins lying in the ab plane. The magnetic minimum metallic conductivity. A metal–insulator transi-
behavior is again a consequence of antiferro-distortive tion is brought about in the La4−x Ba1+x Cu5 O13 + δ system
ordering of distorted octahedra. Interaction between by a change in the La:Ba ratio or oxygen stoichiometry.
two half-filled dx 2 − y 2 orbitals occurs along the c-axis,
while interchain coupling is through filled dz 2 –half-filled
dx 2 − y 2 interaction in the ab plane. There are two forms V. OXIDES OF K2 NIF4 STRUCTURE
of tetragonal KCuF3 , one with I4/mcm symmetry and
the other P4/mbm symmetry; the 1D character is more Quasi-two-dimensional oxides of the type A2 BO4 pos-
pronounced in the latter. CsNiF3 is the only fluoride sessing the K2 NiF4 structure contain the ABO3 perovskite
crystallizing in the hexagonal 2H perovskite structure, layers in between the rock-salt AO layers with B–O–B
where infinite chains of face-sharing NiF6 octahedra interaction occurring only in the ab plane. Electrical
parallel to the c-axis exist. It exhibits 1D ferromagnetism and magnetic properties of the A2 BO4 oxides are con-
at high temperatures (70 < T < 300 K) and 3D antiferro- siderably different from those of the ABO3 perovskites.
P1: GSS Final Pages
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
712 Perovskites
Accordingly, LaNiO3 is metallic and Pauli paramagnetic, stress. In a ferroelectric, spontaneous electric polarization
while La2 NiO4 exhibits two-dimensional antiferromag- is altered by the application of an electric field. These
netic ordering around 200 K and a semiconductor–metal three ferroics are primary ferroics since they are gov-
transition around 600 K. In the LaO(LaNiO3 )n family, erned by switchability of the properties. Perovskites pro-
the electrical conductivity decreases with an increase vide many examples of ferroics. BaTiO3 , Bi4 Ti3 O12 , and
in n, becoming essentially metallic when n = 3. SrRuO3 KNbO3 are well-known ferroelectrics, while PbZrO3 and
is a metallic ferromagnet, but Sr2 RuO4 is a paramag- NaNbO3 are antiferroelectric. Some of the perovsk-
netic insulator. A more interesting comparison is pro- ites exhibit paired properties: ferroelectric-ferroelastic,
vided by LaCuO3 and La2 CuO4 . The former is a metal KNbO3 ; ferroelectric-antiferromagnetic, HoMnO3 ; ferro-
and the latter is an antiferromagnetic insulator. Oxygen- electric-superconducting SrTiO3 ; and antiferroelectric-
excess La2 CuO4 and La2−x Srx (Bax )CuO4 are supercon- antiferromagnetic, BiFeO3 .
ductors, (Tc ∼ 30–40 K), although in the normal state, There are several secondary ferroic properties that occur
they are marginally metallic. A strict comparison of the as induced quantities, and the orientation states differ in
properties of three- and two-dimensional oxides can derivative quantities that characterize the induced effects
be made only when the d-electron configuration of (for example, induced electric polarization by dielectric
the transition metal ion B is the same. A comparative susceptibility). Thus, SrTiO3 is a secondary ferroic show-
study of the two systems has been made with respect ing ferrobielectricity.
to their electrical and magnetic properties. For example, Materials such as Pb(Mg1/3 Nb2/3 )O3 are relaxor ferro-
members of the La1−x Sr1+x CoO4 are all semiconduc- electrics. Materials such as Pb(Zr1−x Tix )O3 or PZT are
tors with a high activation energy for conduction unlike electro-optic materials. Aurivillius oxides of the formula
La1−x Srx CoO3 (x ≥ 0.3) which is metallic; the latter Bi2 An−1 Bn O3n+1 are high-Tc ferroelectrics.
oxides are ferromagnetic. La0.5 Sr1.5 CoO4 shows a
magnetization of 0.5µB at 0 K (compared to 1.5µB
of La0.5 Sr0.5 CoO3 ), but the high-temperature suscep- VII. HIGH-TEMPERATURE
tibilities of the two systems are comparable. In SrO(La0.5 SUPERCONDUCTORS
Sr0.5 MnO3 )n , both magnetization and electrical con-
ductivity increase with an increase in n approaching Superconductivity in perovskite oxides has been known
the value of the perovskite, La0.5 Sr0.5 MnO3 . LaSrMn0.5 for some time, the highest Tc observed until 1987 being
Ni0.5 (Co0.5 )O4 shows no evidence of long-range ferro- ∼13 K in Ba(Bi, Pb)O3 . The oxide system where high
magnetic ordering, unlike the perovskite LaMn0.5 Ni0.5 Tc was first reported in the 30–40-K region, La2−x Bax
(Co0.5 )O3 ; high-temperature susceptibility behavior of (Srx )CuO4 , has the tetragonal K2 NiF4 structure at ordinary
these two insulating systems is, however, similar. LaSr1−x temperatures, but becomes orthorhombic around 180 K.
Bax NiO4 exhibits high electrical resistivity with the Oxygen-excess La2 CuO4 also shows superconductivity
resistivity increasing proportionately with the magnetic in the 30-K region. Superconducting oxides of the 123
susceptibility (note that LaNiO3 is a Pauli paramagne- type (Tc ∼ 90 K) with the general formula LnBa2 Cu3 O7−δ
tic metal). High-temperature susceptibilities of LaSrNiO4 (Ln = Y, La, Nd, Sm, Eu, Gd, etc.) are defect perovskites
and LaNiO3 are comparable. Susceptibility measurements containing Cu–O sheets as well as Cu–O chains, the latter
show no evidence for long-range ordering in LaSrFe1−x imparting the orthorhombic structure. The 123 oxides
Nix O4 , unlike that in LaFe1−x Nix O3 (x ≤ 0.35), and are the x = 1 members of the Ln3−x Ba3+x Cu6 O14+δ
the electrical resistivity of the former is considerably defect perovskites. The bismuth and thallium
higher. cuprate superconductors conforming to the formu-
las Bi2 (Ca, Sr)n+1 Cun O2n+4 , Tl2 Can+1 Ba2 Cun O2n+4 ,
TlCan−1 Ba2 Cun O2n+3 , with Tc s reaching 125 K contain
VI. FERROICS defect perovskite layers and rock-salt-type layers.
In Fig. 5, the schematic structures of n = 2 cuprates
Ferroics are materials possessing two or more orientation of different families are shown in order to illustrate how
states or domains that can be switched from one to another they arise from the intergrowth of perovskite and rock
through the application of one or more appropriate forces. salt layers. The family of mercury cuprates is also related
In a ferromagnet, the orientation state of magnetization to these intergrowths. The highest Tc to date is in a Hg
in domains is switched by the application of a magnetic cuprate of the formula HgBa2 Ca2 Cu3 O8 which becomes
field. In a ferroelastic, the direction of spontaneous strain superconducting at 165 K under pressure. It may be noted
in a domain is switched by the application of mechanical that the highest Tc found in a copper-free oxide material
P1: GSS Final Pages
Encyclopedia of Physical Science and Technology EN011A-554 July 14, 2001 21:50
Perovskites 713
714 Perovskites
Radiation Physics
John H. Hubbell
National Institute of Standards and Technology
561
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
material. The scientific disciplines employed in radiation in liquids flowing inside closed pipes can be monitored
physics center around the interactions of photons or par- by radiometric transmission and backscattering arrange-
ticles with solids, liquids, and gases. The primary transfer ments, to name a few examples. The design of equipment
of energy is complex but well understood; but the energy for airport x-ray inspection of luggage utilizes radiation
transferred then dissipates via secondary interactions with physics to produce useful images with minimal damage
the target material. Secondary processes are complex and to camera film and other sensitive items.
not well understood except in a few cases, such as the var-
ious results of the displacements of atoms in silicon by an
electron beam. II. RADIATION SOURCES
The technological aspects of radiation physics are very
wide. Instruments for measuring radiation (dosimeters and A. Overview
counters) are widely used. The prevention of degradation With reference to radiation effects, Fig. 1 shows a survey
in electronic and optical materials is important. Further- of the broad range of energy values that have to be con-
more, in some cases, controlled irradiation can improve sidered under the term radiation. Most of the radiations of
commercial products. For example, food preservation and interest have energies above 1 keV (103 electron volts), but
toughening of plastics are sometimes most economically neutrons with much lower energies are still described as
achieved by irradiation with noncontaminating radiation particle radiation. Ultraviolet photons also have sufficient
(electrons or γ rays). Infectious materials such as sewage energy to cause the same chemical effects as we see with
can be sterilized by radiation, and in-package sterilization x rays and very high energy photons. The shaded areas in
of medical supplies using radiation is now done routinely. Fig. 1 show that different fundamental effects in matter
An equally important subtopic of radiation physics is occur at different radiation energies.
noninvasive interrogation of systems. In addition to med-
ical imaging by conventional film and by the increasing
B. Photons
number of tomographic modalities, hot-rolling of steel
now employs noncontact radiometric monitoring of the Photons that have energies in the keV and MeV (mega-
thickness, and control of the rollers, and voids (bubbles) volt) range (x rays and γ rays) have the ability to penetrate
FIGURE 1 Energy values for various radiation environments and threshold energies. Approximate threshold for
producing ionization , for atomic displacements in solids , and for nuclear transmutations . Note: F & F = fission
and fusion. (Courtesy of Hughes Aircraft Co.)
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
matter deeply and, when absorbed, to produce strong ef- disintegration of atomic nuclei. A well-known example is
fects. X rays are generated when an electron beam strikes the pair of photons created in the spontaneous disintegra-
matter; an x-ray generator consists of a powerful electron tion of the cobalt-60 atomic nucleus. These photons have
gun and a metal target in which the photons are generated. energies of 1.1732 and 1.3325 MeV, and cobalt-60 (60 Co)
For x-ray crystallography applications, the photons will is frequently used in plaque and other geometrical con-
be in the energy range from 5 to 30 keV, including line- figurations in irradiation facilities. γ Rays are created by
energies characteristic of the atomic number of the metal nuclear chain reactions such as those that occur in a nu-
in the target, superimposed on a bremsstrahlung contin- clear reactor core or a nuclear explosion. The isotopes
uum spectrum. For imaging and irradiation applications, contained in nuclear fuel represent concentrated sources
the energies commonly range up to 3 MeV, mostly in the of γ rays. These can be used for experimental irradiation
form of bremsstrahlung. For research applications, elec- in spent-fuel ponds. Table I presents data on 60 Co and
tron accelerators such as synchrotrons and linacs (linear some of the other radioisotopes useful in medical therapy
accelerators) produce photons up to the GeV (109 elec- and industrial irradiation applications, also on 90 Sr which
tron volts) region, but for imaging and irradiation pur- can be important as an environmental hazard.
poses the energy is usually kept below 5 MeV to avoid In recent decades, synchrotron radiation has become a
producing radioactivities in the sample due to the photonu- major high-flux source of photons for research and analyti-
clear effect which has a resonance peak in the region 5 to cal applications. This radiation is produced in high-energy
40 MeV. accelerators from the bending of electron orbital trajecto-
γ Rays are high-energy monoenergetic photons. The ries in the confining magnetic field, sometimes by mag-
term is used specifically for photons created during the netic “wigglers and undulators” interposed in the electron
TABLE I Radioisotopes Important in Medical Therapy and Industrial Irradiation Applications, also as Environmental Hazards
(90 Sr)a
Photons Particles
Percentage Transition
Nuclide Half-life (year) Type of decay Energy (MeV) emitted (%) Energy (MeV) probability (%)
path. The photon energies thus produced range from tens reactions also produce protons in a material sample. Pro-
of eV from accelerators in the hundreds of MeV range, tons are emitted from the sun, and protons are also found
to above 100 keV for electron accelerators in the multi- trapped in planetary magnetic fields. High-energy protons
GeV range. Another recently developed source of photons are emitted from the sun in bursts associated with solar
in the γ -ray energy range is by inverse Compton scatter- flares. A less energetic, steady stream of protons emitted
ing. In such devices, intense laser beams in the visible or from the sun is called the solar wind.
ultraviolet (eV range) are collided with GeV-range elec-
trons in an accelerator, boosting the eV laser photons up
to MeV energies. F. Ions
α Particles (which are ions) consist of high-energy helium
C. Electrons nuclei; these too are emitted by radioisotopes and stars
and are thus found in interplanetary space. In addition to
Electron beams for research and for irradiation purposes
helium ions, very energetic ions of all atomic masses are
are produced by a variety of orbital and linear accelera-
found in space. These are called cosmic rays. Ion beams
tors, with output energies from the keV to the GeV re-
can be generated in accelerators. One use for ion beams is
gions, and beam currents from microamperes to kiloam-
the ion implantation of solids to modify their properties.
peres. Linear accelerators (linacs) consist of an electron
High-current ion implanters are available for industrial
gun injector at one end, followed by a series of accelerat-
use. These beams cause large amounts of radiation damage
ing cavities containing rf (radio frequency) power. Natural
in the solids so treated.
sources of high-energy electrons include β rays from ra-
dioisotope decay, and also from electrons in space which
have been accelerated in magnetic fields. There is a par- G. Neutrons
ticularly high concentration of electrons trapped around
planets that have high magnetic fields, such as Earth and The main sources of neutrons are nuclear fission and nu-
Jupiter. These trapped-electron concentrations, called Van clear fusion reactions. Of these, the fission of uranium
Allen belts, offer a radiation damage hazard to components is the most common. The primary product of fission is
in unmanned space vehicles traversing these altitudes, and fast neutrons having an energy distribution described as a
are avoided as much as possible in the case of manned fission spectrum. This spectrum has a large content with
flights. energies above 1 MeV. This is the spectrum that would be
observed near a nuclear explosion. In a nuclear reactor, in-
teraction with surrounding material, most efficiently with
D. Positrons light nuclei, reduces the neutron energy. Thermal neutrons
The emissions from decay of radionuclides can also in- or cold neutrons of much lower energy are produced. On
clude electrons of the opposite charge sign, from that of collision with matter, fast neutrons produce much damage
the orbital atomic electrons and β’s (β − ), which are called while thermal neutrons produce radioactivation. Neutrons
positrons (e+ or β + ), and these can also be accelerated to reduced to cryogenic temperatures, with their long effec-
provide positron beams. Positrons are also produced in the tive wavelength, are useful in surface physics studies. Nu-
process of pair production by photons of energy higher clear fusion reactions produce neutrons having a much
than that equivalent to the rest-mass of two electrons higher energy than fission. For example, one commercial
(1.022 MeV total). The signature of positrons is the anni- generator of fusion neutrons produces a beam of D-T neu-
hilation radiation resulting from the collision between an trons of a single energy of 14 MeV.
electron and a positron, usually, if the positron has come to
rest, consisting of two photons in opposite directions, each
with one rest-mass energy of 0.511 MeV. This radiation is
III. QUANTIFYING RADIATION
important in medical diagnostic imaging by PET (positron
emission tomography). Variants of this method include
A. Flux and Fluence
SPECT (single-photon emission computed tomography).
A parallel beam of radiation passing through free space can
be quantified by quoting the number of particles passing
E. Protons
through a unit area. See flux and fluence in this article’s
Proton beams can also be produced by accelerators. One Glossary. If the particles come from a variety of directions
especially well-known form of proton accelerator is the (as in outer space) then we quote the number intersecting
cyclotron, which uses radio-frequency energy. Nuclear a sphere of unit cross-sectional area.
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
FIGURE 2 Central axis depth dose curves for some typical photon, electron, neutron and proton radiotherapy treat-
ment beams. [Reproduced courtesy of D. T. L. Jones “Present status and future trends of heavy particle raadiotherapy,”
pp. 13–20 in Cyclotrons and their Applications 1998 (Proceedings of the 15th International Conference on Cyclotrons
and their Applications, Caen, France, pp. 14–19 June 1998) (E. Baron and M. Lieuvin, eds), Copyright Institute of
Physics, Bristol 1998.]
interactions of photons or particles with the atoms of the the pair (and triplet) production process. Among the other
material. This section describe the primary processes, for less-probable processes is photonuclear interaction which
the various types of radiation. can induce radioactivity.
general-purpose compilations of x-ray attenuation coef- E will never exceed mc2 /2 = 0.2555 MeV. In collimated
ficients for medical and shielding applications. However, gamma-ray sources, this energy can show up in a spectrum
EXAFS (x-ray absorption fine structure) is a widely used of the primary beam, in addition to high-energy photons
and important analytical tool. from the source itself, due to 180◦ Compton scattering
i. Fluorescence (characteristic) x rays. In addition to from material behind the source.
the ejection of a photoelectron, this process results in the Over most of the region where the Compton cross sec-
emission of one or more fluorescence x rays, due to an tion is a major part of the total cross section, the tar-
outer-shell electron falling into the inner-shell vacancy get electron can be assumed to be free and at rest, in
created by the departure of the ejected photoelectron. The which case the elegant Klein-Nishina equations apply. At
fluorescence x-ray energy is equal to the difference in lower energies, where electron motion and binding ener-
binding energy between the participating inner and outer gies are a significant fraction of the incident photon en-
electron shells or subshells, hence a fluorescence x-ray ergy, the Klein-Nishina theoretical cross section can be
spectrum consists of a number of discrete lines, each cor- modified by use of an incoherent scattering function S(x,
responding to one of the many transition possibilities. Z) in which x is a momentum transfer variable related
ii. Auger effect. The fluorescence x rays do not nec- to the incident photon energy E and its subsequent de-
essarily emerge from the collided atom, but can instead flection angle θ , and Z is the atomic number of the tar-
dislodge outer electrons, similar in effect to a secondary get atom. Calculations of S(x, Z) require knowledge of
photoelectric absorption. This additional emission of elec- atomic wave functions, and values of S(x, Z) are usu-
trons is called the Auger effect, and the emitted elec- ally taken from available tables of the incoherent scat-
trons are called Auger electrons. The probability that the tering function, an example of which is provided in the
primary fluorescence x ray will escape from the atom bibliography.
without undergoing the Auger process is called the flu- Compton scattering (σinc ) is the dominant interaction
orescence yield, ωi , in which i refers to the electron shell process in the intermediate energy range from a few tens
or subshell from which the primary photoelectron was or hundreds of keV up to a few MeV, this range being the
dislodged. broadest for the lowest-Z elements.
The photoelectric effect (τ ) is the dominant interaction
process for low photon energies.
c. Rayleigh scattering (σcoh ). In this interaction,
b. Compton scattering (σinc ). In this interaction, also called coherent or elastic scattering, a photon is
also called incoherent or inelastic scattering, only part of scattered by the atomic electron cloud as a whole, with
the energy of the photon is transferred to an electron. Both the entire atom, including the heavy nucleus (compared to
the electron and photon are scattered from the collision, the electrons) taking up the recoil. Thus, from energy and
with energies and directions related as determined by con- momentum conservation considerations, the deflected
servation of momentum and energy between the deflected photon undergoes negligible loss of energy or change in
photon and the recoiling electron. The energy E of the de- wavelength.
flected photon is reduced from that of the incident photon Since the angular distribution Rayleigh scattering is
energy E according to the relation sharply peaked in the forward direction for photon en-
ergies in the gamma-ray region, and the energy loss of the
E = E/{1 + (E/mc2 )(1 − cosθ )}
deflected photon is negligible, the contribution of this pro-
in which mc2 = 0.5110 MeV is the rest mass energy of cess is sometimes omitted in radiation transport calcula-
an electron (or positron) and θ is the deflection angle of tions. However, at the lower photon energies (4 to 30 keV)
the scattered photon. This equation can also be written in utilized in x-ray crystallography, the photon wavelength is
terms of the shift in wavelength of the photon comparable to the lattice spacing between atoms in crys-
tals, and this effect becomes of prime importance. Due
λ − λ = 1 − cosθ
to the coherence between the incident and deflected pho-
in which the wavelengths λ and λ of the deflected and ton, interferences from waves scattered from successive
incident photons are in Compton units crystalline layers result in reflections only into sharply
discrete directions, making possible studies not only of
λ = mc2 /E = 0.5110/E[MeV].
inorganic crystalline materials, but also of biological struc-
It can thus be seen that the maximum shift in photon wave- tures such as DNA.
length is two Compton units, at the photon backscattered Rayleigh (coherent) scattering (σcoh ) is never the dom-
angle of θ = 180◦ , at which angle, no matter how high the inant photon interaction process, but for heavy elements
incident photon energy E, the backscattered photon energy in the energy region just below the photoelectric K-shell
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
absorption edge (threshold) it can account for up to 10% higher, is somewhat the analog of the atomic photoelec-
of the total attenuation coefficient. tric effect, but with the photon absorbed by the atomic nu-
cleus, rather than by an electron in the shells surrounding
d. Pair and triplet production (κn and κe ). In this the nucleus. The most likely result of such an interaction
type of interaction, which can happen only for photons is the emission of a single neutron, in which case we can
with energy in excess of two electron rest-mass energies express the interaction as
(2mc2 = 1.022 MeV), the photon interacts with the elec-
trostatic field of a charged particle, such that the photon Ni (γ , n)Nf
disappears and in its place is created an electron–positron
pair (e− , e+ ). If this process happens in the field of the in which Ni is the target nucleus and Nf is the final nucleus,
positively charged atomic nucleus (κn ), with negligible re- with one less neutron, and may be a radioactive isotope
coil due to its effectively infinite mass in comparison with of the target nucleus Ni . Besides single neutron emis-
the electron or positron, only these two particles emerge sion, other possibilities include the emission of charged
from the atom as the products of the primary interac- particles, gamma rays, or more than one neutron. Emis-
tion. Any photon energy in excess of 1.022 MeV appears sion of charged particles, such as protons, changes the
as kinetic energy, divided between the electron and the atomic number Z of the target element, as well as its
positron. mass.
i. Triplet production. If the target charged particle for The most characteristic feature of the photonuclear ab-
this process is an atomic electron (κe ), the target elec- sorption cross section is the “giant resonance.” This is
tron recoils in the forward direction, along with the cre- a broad peak in the absorption cross section centered
ated electron-positron pair, with all three equal-mass parti- at about 24 MeV for light nuclei, decreasing in energy
cles (2e− , e+ ) sharing as kinetic energy the excess photon with increasing mass number to about 12 MeV for the
energy. In the early cloud-chamber observations of this heaviest stable nuclei. The width “” (energy difference
process, the three particles appeared as a three-pronged between the points at which the cross section drops to
trident, or “triplet.” Due to the kinematics, with the recoil- one half its maximum value) varies from about 3 MeV to
ing electron target, the threshold energy for triplet pro- 9 MeV depending on the detailed properties of individual
duction (κe ) is four electron rest-mass energies (4mc2 = nuclei.
2.044 MeV). The magnitude of the photonuclear cross section, even
ii. Annihilation radiation. The produced positron at the resonance peak energy, is small in comparison
(e+ ), as was mentioned earlier, eventually collides cata- with the sum of the above “electronic” cross sections,
strophically with an electron (e− ) and both are anni- and in no case contributes more than 10% to the to-
hilated, reverting usually to a pair of photons, each of tal cross section. However, photonuclear absorption can
energy of one electron rest-mass energy (mc2 = 0.511 be important in shielding technologies since the emit-
MeV), higher if the positron annihilates while still in ted neutrons are usually far more penetration than the
flight, and (rarely) to a single or to three photons. incident photons, and obviously important in irradiation
The probability for pair (or triplet) production is ap- technology due to the induced radioactivities in the target
proximately proportional to the square of the charge of materials.
the target particle. Hence, the cross section for pair pro-
duction in the field of the nucleus (κn ) varies as Z2 , whereas f. Other photon interactions. A number of other
for triplet production (κe ) the cross section varies simply less-probable things can happen to photons as they tra-
as Z, the number of unit-charge electron target particles verse and encounter tangible matter, but the interactions
per atom. Thus for hydrogen (Z = 1), at photon energies listed above are at present the only ones of practical
well above the thresholds, the cross sections for κe and κn interest in medical and technological applications. One
are approximately equal, whereas for higher-Z elements of these less-probable but interesting interactions is
κe is smaller approximately as Delbrück scattering, which is scattering of photons in the
Coulomb (electrostatic) field of nuclei as a consequence of
κe /κn ≈ Z/Z2 = 1/Z.
vacuum polarization. Delbrück scattering is considered to
Pair and triplet production (κn and κe ) are the dominant be “scattering of light by light,” and has been observed for
processes for photon interactions with atoms at high en- photon energies from a few hundred keV up to a few
ergies, from a few MeV upwards. GeV. Other small effects include, for example, resonance
scattering and Thomson scattering by the nucleus,
e. Photonuclear absorption (σph.n. ). This type of Compton scattering by nucleons, meson production (at
interaction, with a threshold of the order of 5 MeV or a few hundred MeV), resonance scattering associated
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
FIGURE 3 Contributions of (a) atomic photoeffect, τ , (b) incoherent (Compton) scattering, σinc , (c) coherent (Rayleigh)
scattering, σcoh , (d) nuclear-field pair production, κn , (e) electron-field pair production (triplet), κe , and (f) nuclear
photoabsorption, σph.n. , to the total measured cross section, σtot (circles) in carbon over the photon energy range
10 eV to 100 GeV. The measured σtot points, taken from 90 independent literature references, are not all shown in
regions of high measurement density.
with meson production, and nucleon–antinucleon pair different regions of photon incident energy, can be seen in
production. Nucleon–antinucleon pair production, of Figs. 3 and 4 for carbon (Z = 6) and for lead (Z = 82).
interest in cosmogenic modeling, has a threshold which The photonuclear cross section σph.n . is at present not
varies from 3.75 GeV (4 nucleon masses) in the field of a amenable to systematic compilation and tabulations, due
proton at rest down to 3.0 GeV (3.2 nucleon masses) for to its complex dependence on the irregular variations in
protons moving in a complex nucleus. nuclear structure as a function of the atomic number Z and
of the mass numbers A of the variable isotopic mixes of
the stable elements. Hence, despite its importance (pro-
g. Total cross section (σtot ) and the mass atten-
duction of radioactivities) in biological and technological
uation coefficient (µ/ρ). The total photon interaction
applications involving photon energies above 5 or 10 MeV,
cross section σtot , for a given material and a given photon
this cross section is still omitted from systematic general-
energy is given as the sum over the photoelectric effect
use tabulations of x-ray attenuation coefficient and energy-
cross section τ , the incoherent (Compton) and coherent
absorption coefficient data. Hence, in currently available
(Rayleigh) scattering cross sections σinc and σcoh , the pair
tabulations of the mass attenuation coefficient, µ/ρ, the
and triplet production cross sections κn and κe , and the
composition will be found to be
photonuclear absorption cross section σph.n. :
µ/ρ = (τ + σinc + σcoh + κn + κe )/(u · A)
σtot = τ + σinc + σcoh + κn + κe + σph.n.
with µ/ρ in units of cm2 /g if the five component
The relative importance of all six of these different cross sections are in units of cm2 /atom, where u
types of interactions of photons with atoms, in the ( = 1.660 538 73(13) · 10−24 g) is the atomic mass unit
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
FIGURE 4 Contributions of (a) atomic photoeffect, τ ,(b) incoherent (Compton) scattering, σinc , (c) coherent (Rayleigh)
scattering, σcoh , (d) nuclear-field pair production, κn , (e) electron-field pair production (triplet), κe , and (f) nuclear
photoabsorption, σph.n. , to the total measured cross section, σtot (circles) in lead over the photon energy range 10 eV
to 100 GeV. The measured σtot points, taken from 121 independent literature references, are not all shown in regions
of high measurement density.
(1/12 of the mass of an atom of the nuclide 12 C) and A is h. Other factors governing photon attenuation.
the relative atomic mass of the target element. Although the idealized pencil beam (collimated source,
i. Additivity. For compounds and mixtures, the mass collimated detector) geometry is useful for performing
attenuation coefficient µ/ρ at a particular photon energy basic measurements of the mass attenuation coefficient
E can be obtained according to simple additivity: µ/ρ in support of theoretical modeling of the individual
types of interaction cross sections and their sum σtot , in
µ/ρ = wi (µ/ρ)i practical situations two other factors modify the expo-
i nential attenuation law. One of these is the inverse square
in which wi is the fraction by weight of the ith elemental law, in which the radiation flux at various radial distances
constituent and (µ/ρ)i is the mass attenuation coefficient r from a point isotropic (PTI) source of photons varies
for that element for photons of energy E. according to 1/r 2 . For a point isotropic source embedded
in an absorbing and scattering medium, the flux detected
ii. Narrow-beam attenuation. For an idealized pencil
by an uncollimated detector will also be modified by
beam of photons and a detector shielded by a collimator
a buildup factor BPTI (E 0 , Z , r ), due to scattering and
such that secondary radiations (e.g., scattered and fluores-
other secondary radiations reaching the detector. In this
cence photons) from the interposed target material cannot
representation of the buildup factor, E 0 is the photon
be seen by the detector, the exponential attenuation law
source energy, Z represents the element or mixture
mentioned earlier holds exactly:
comprising the medium, and r is the distance from the
I/I0 = e−(µ/ρ)x source, usually in units of mfp (mean-free-path), in which
1 mfp = 1/{(µ/ρ)[E 0 , Z ]}. For applications in radiation
in which x is in units of g/cm2 if µ/ρ is in units of cm2 /g field modeling for extended sources, BPTI (E 0 , Z , r ) has
as mentioned above. been parametrized as a function of r in a variety of
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
available to include this process in currently available data ally taken up by other electrons; these ejected electrons
tales. are termed secondary electrons. Occasionally, atoms are
displaced from their lattice positions by the collision, and
a defect is left in the solid. Secondary electrons, after fur-
3. Neutron Absorption Mechanisms
ther collisions, generate electron-hole pairs or electron-ion
Neutrons are uncharged particles that can interact easily pairs, depending on the medium in question. In a liquid
with a positively charge atomic nucleus where various or gas, a complex track of electron-ion pairs is produced,
capture processes can occur. The probability of such an which has a core of intense ionization and small spurs
interaction is low, and hence neutrons are regarded as called delta rays, where secondary electrons have departed
highly penetrating radiation. In elastic scattering the neu- radially from the original track.
tron loses part of its energy by displacing atoms. It can
be shown that the energy transferable during collisions b. Electromagnetic (bremsstrahlung) radiation.
is greatest for atoms of low atomic mass, especially hy- High-energy electrons, when deflected or slowed in
drogen. Therefore, hydrogenous materials are useful for Coulomb (electrostatic) fields of nuclei, produce pho-
neutron shielding and detection. Energetic protons are cre- tons known as bremsstrahlung (“braking” radiation).
ated during elastic collisions with hydrogenous materials Electron–electron bremsstrahlung is also possible, but
such as polymers and water. this is a less-probable process. Thin-target bremsstrahlung
After a number of collisions in a material at room tem- has energy and angular distributions well described math-
perature, the neutrons become thermalized, that is, their ematically according to equations developed by Schiff,
kinetic energy is of the order of 0.025 eV, the average and this broad continuum is the major photon output of
vibrational energy of the atom in a solid at room temper- x-ray machines for imaging and for therapy, particularly
ature. If the solid is cooled below room temperature, then in the case of high-energy machines using electron linacs
neutrons of even lower energies are produced. These are or other high-energy accelerators as sources of electrons
called cold neutrons and have uses in materials physics. to impinge on heavy-element bremsstrahlung converter
The National Institute of Standards and Technology, from targets.
its reactor, produces neutrons at cryogenic temperatures,
valuable in surface studies due to the effective long wave- c. Cherenkov radiation. The speed of light has a
length of the neutrons. constant value of c in a vacuum. However, in a transparent
Thermal neutrons can be captured by atomic nuclei. material with a refractive index n > 1, the velocity of light
This usually leads to the emission of a γ photon from the in the material is reduced to c/n. Thus for a high-energy
excited nucleus and the creation of a new isotope, a pro- particle traveling at velocity v = βc, if β > 1/n, we can
cess known as transmutation. The new isotope may or may say that the particle is “traveling faster than light” in the
not be radioactive. If it is, then the process is termed ra- material. As first observed experimentally by Cherenkov
dioactivation. If the neutron capture results in the emission in 1934 and explained by Frank and Tamm in terms of
of more neutrons, and a moderator material is present, a Maxwell’s theory, when a charged particle traverses matter
chain reaction can occur. In particular, in the case of heavy at a velocity in excess of the speed of light in the medium,
elements such as uranium which can undergo fission from radiation in the optical-wavelength region is emitted. This
neutron capture, large amounts of energy can be released light, called “Cherenkov radiation,” is subject to construc-
in either controlled or uncontrolled modes. tive interference such that it is confined to a cone whose
axis coincides with the direction of propagation of the
charged particle, in a manner analogous to the production
4. Electron (β-Ray) Absorption
of a shock wave from a supersonic aircraft. The aperture
The two principal mechanisms by which electrons shed half-angle θ of this propagating cone of light is given
their kinetic energy are by collision, which yields ionized by
or excited atoms, and the production of bremsstrahlung
cos θ = 1/(βn)
radiation. A third mode of interaction of electrons with
matter is the Cherenkov effect which plays a minor role in which β is the ratio of the particle’s velocity to that
in energy deposition, but is quite visually dramatic in re- of light in a vacuum and n is the refractive index of the
fractive transparent media. medium. In water moderated and shielded research reac-
tors, where the fuel elements are visible, the eerie blue
a. Collisions. As they pass through matter, electrons glow in the water surrounding the elements is Cherenkov
gradually lose energy by means of collisions with atoms, radiation, mostly from Compton electrons induced by
along a wandering path. The collision energy is then usu- high-energy γ rays.
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
5. Proton Radiation Absorption the nuclear fusion processes energizing stars including
our Sun as well as in supernovae and other astrophysi-
Protons interact primarily with atomic electrons, hence
cal events. The neutrino was first postulated by W. Pauli
their effects, including biological, are similar to those of x
in 1930 to avoid an apparent non-conservation of energy
rays. However, as seen in Fig. 2, their depth-dose pattern
and linear momentum in radioactive β-decay. This elusive
is quite different. The proton depth-dose curve, here
“massless” uncharged particle interacts with matter princi-
shown for 200-MeV protons in water, is characterized by
pally through the weak nuclear force, hence the cross sec-
a relatively low entrance dose plateau region, followed
tion is so small that it can pass through the Earth with only
by a sharp dose peak (the Bragg peak) near the end of
a small probability of collision. Detectors consist of large
the range. Beyond the Bragg peak is a very sharp cut-off,
tanks of water with surface studded with inward-looking
with no protons and dose deposition reaching the areas
photomultiplier tubes to observe Cherenkov or other radi-
deeper into the material. Protons scatter very little, hence
ations from secondary products of neutrino collisions.
the lateral dose fall-off is also very sharp. Thus, in cancer
The three known types of neutrinos, associated with
therapy involving tumors adjacent to critical structures
electrons νe , muons νµ , and with tau particles ντ , although
which need to be spared, use of protons can avoid the
the latter has not been observed directly.∗ The Sun’s power
use of isocentric treatments in which the source-beam
is thought to be generated principally by hydrogen nuclei
is rotated around the patient. If the tumor is larger than
(protons, p) undergoing fusion into helium (He, Z = 2)
the typically 1–2 cm width of the Bragg peak, the peak
and two positrons (e+ ), according to
can be spread by rotating variable-thickness absorbers
intercepting the incident beam. 4p → 4 He + 2e+ + 2νe + 26.7 MeV
from which the neutrinos νe , even those produced deep
6. Charged Nuclei in the core of the Sun, can easily escape to enable terres-
trial detection and analysis. Currently, the observed Solar
Besides protons (hydrogen nuclei, Z = 1), nuclei of heav- neutrino flux is less than the theoretical predictions.
ier atoms moving at high energy can cause trails of intense A rare (but observed in geochemical experiments) ex-
ionization. Examples of such particles include α parti- ample of the neutrino participation in radioactivity is dou-
cles from radon and its short-lived daughter products (5, ble β-decay, which would be forbidden except for the neu-
6, and 7.7 MeV), cosmic ray particles with energies ex- trino production is:
tending up to the TeV region, ions in accelerator beams,
and fission fragments. Light atoms produce a track of sig- Nucleus(Z, A) → Nucleus(Z + 2, A) + 2e− + 2νe
nificant length. For example, a neon (Z = 10) ion at 40 in which Z is the atomic number and A is the atomic mass.
MeV has a track length in polymer plastics approaching
100 µm. Over a core region having a diameter of a few 8. Radiation Fields from Point
nanometers, the dose imparted to the material in the track and Extended Sources
is 106 rad. Over a penumbra region having a diameter of
about 10 µm the dose falls off to about 100 rad. Thus, This section will apply mostly to photon sources, but for
the details of the effects of different energetic atoms on sources embedded in a medium with negligible scatter-
a solid vary strongly according to atomic weight. Heavy ing or absorption, also negligible magnetic or electrostatic
atoms in motion are encountered in cosmic rays and in fields in the case of charged particles, the analytical geo-
fission fragments. During the fission of an atomic nu- metrical treatments here will also be applicable.
cleus, a large amount of momentum is given to the two
fragments of the nucleus. If the fission occurs in a solid, a. Point source. For a point isotropic (PTI) source of
then the fragments are stopped rapidly. The resulting bulk monoenergetic photons embedded in a homogeneous ab-
damage to the solid is heavy and is localized in a small sorbing and scattering medium, as schematized in Fig. 6,
volume. the total response D of an isotropic detector at distance r
from the source is
D = D ◦ + Ds
7. Neutrinos, ν
in which D◦ represents the response of the detector to
It is beyond the scope of this article to list and classify the
photons arriving from the source without suffering a scat-
entire “zoo” of known charged and uncharged particles
tering interaction, and Ds the response of the detector to
and anti-particles which could be considered to be within
the domain of “radiation physics.” However, the neutrino ∗See, however, B. Schwarzschild (2000). “The tau neutrino has finally
is here singled out for mention since it plays a role in been seen,” Physics Today 53(10), 17–19.
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
FIGURE 6 Schematization of a point isotropic monoenergetic ra- FIGURE 7 Finite plane isotropic source S uniformly covered with
diation source, and an isotropic radiation detector, embedded in isotropically radiating material.
a medium characterized by an attenuation coefficient µ and a
buildup factor BPTI which accounts for scattered and other sec-
ondary radiations “seen” by the detector.
σ exp(−µr )
D= × B PTI (µr ) d S.
4π s r2
photons arriving indirectly, via scattering or other modes For regular geometries such as the rectangular plaque
of secondary radiations. The unscattered response can be source shown in Fig. 8 found in irradiation facilities, a con-
calculated according to venient formulation for the buildup factor, to separate the
D◦ = (σ/4π )[exp(−µr)]/r2
in which the source-strength constant (σ/4π) is the
detector response at unit-distance from a unit-strength
point source in the absence of attenuation [e.g., σ/4π =
1.27 rads/hr in tissue 1 m from a 1 Ci 60 Co source, from
Table VI. C]. Defining the buildup factor BPTI for a given
photon energy Eo , material Z and distance r, as
BPTI = D/D◦ = (D◦ + Ds )/D◦
we have
D = D◦ × BPTI = (σ/4π )BPTI [exp(−µr)]/r2 .
The build-up factor BPTI has been expressed in a vari-
ety of parametrized formulations suitable for different ap-
plications, most of them listed and compared in the ar-
ticles by Hubbell (1963) and by Harima (1993) in the
bibliography.
FIGURE 9 Schematization for obtaining the detector response when the detector is not over a corner, but is displaced
by distances α and β, respectively, from the corner of a plaque source with width a and length b.
shape (geometry) variables from the material-dependent c. Rectangular plaque source. For a detector op-
penetration variables, is posite a corner of a rectangular plaque source, analytical
expressions and tabulations of the geometry coefficients
N
or ∞
qn are available in Hubbell et al. (1960). For the detec-
BPTI (µr ) = exp(+µr ) bn (−µr )n /n!
n=0
tor not opposite a corner, but displaced by distances α
and β from the corner of a rectangular source of width
in which the bn coefficients are tabulated in Hubbell (1963) a and length b, in units of the detector distance from the
which also provides transforms for obtaining bn s from source-plane (h = 1), as shown in Fig. 9, corner-position
other formulations given in termsof polynomials or ex- data can be combined according to
ponential combinations. The detector response from the
extended source S can then be computed according to D(a, b; α, β) = D(α, β) + D(a − α, b − β)
N
or ∞ + D(α, b − β) + D(a − α, β).
D = (σ/4π) bn × qn (geom) × (µx)n
n=0 For a “bare” rectangular plaque source (embedded in a
in which x is a fixed distance for a given geometry. The medium with negligible absorption and scattering), the
geometry (shape) coefficients qn can be calculated analyt- corner-position (h = 1) detector response is given by the
ically for a regular shape such as the rectangular source in zeroth term of the above series
Fig. 8 according to
D(a, b) = (σ/4π ) × q0 (a, b).
qn (geom) = (−r/x)n (1/r2 n!) dS in which
S b
1 a
or in polar coordinates q0 (a, b) = tan−1 dβ.
0 β2 + 1 β2 + 1
qn (geom) = [(−secθ )n+1 /n!] dS Although the rectangular source geometry coefficients
s
qn (a, b) are available in closed form (Hubbell et al., 1960)
in which S is the solid angle subtended by the source S for all n ≥ 1, the above integral for n = 0 is not soluble in
with respect to the detector, and θ is the angle between closed form. However, a series solution which is rapidly
r and x as indicated in Fig. 7 or between r and h as in convergent for all values of a and b, i.e., for 0 ≤ a ≤ b ≤ ∞,
Fig. 8. given in Hubbell et al. (1960) is
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
A. Radiation Units
1 day = 1.440 × 103 min Physical Constants, Physics Today 53(No. 8, Part 2), BG6–BG13.
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
SEE ALSO THE FOLLOWING ARTICLES Franklin, A. (2000). “The Road to the Neutrino,” Physics Today 53(2),
22–28.
Fuller, E. G., and Hayward, E. (1976). “Photonuclear Reactions,” Dow-
ATOMIC PHYSICS • COSMIC RADIATION • DOSIMETRY •
den, Hutchinson & Ross, Stroudsburg, PA.
HEALTH PHYSICS • NEUTRINOS • NUCLEAR PHYSICS • Greening, J. R. (1981). “Fundamentals of Radiation Dosimetry,” Adam
NUCLEAR REACTOR MATERIALS AND FUELS • NUCLEAR Hilger Ltd., Bristol, UK.
SAFEGUARDS • RADIATION SHIELDING AND PROTECTION Harima, Y. (1993). “An historical review and current status of buildup
• SOLAR PHYSICS factor calculations and applications,” Rad. Physics Cheml/5),
631–672.
Henke, B. L., Gullikson, E. M., and Davis, J. C. (1993). “X-ray inter-
actions: Photoabsorption, scattering, transmission, and reflection at
BIBLIOGRAPHY E = 50–30,000 eV, Z = 1–92,” Atomic Data Nucl. Data Tables 54(2),
181–342.
Berger, M. J., and Hubbell, J. H. (1987). “XCOM: Photon Cross Hubbell, J. H., Bach, R. L., and Lamkin, J. C. (1960). “Radiation field
Sections on a Personal Computer,” National Bureau of Stan- from a rectangular source,” J. Res. Nat. Bureau Stand. 64C(2), 121–
dards (now National Institute of Standards and Technology) Re- 138.
port NBSIR 87-3597. Current version of this database available at: Hubbell, J. H., Bach, R. L., and Herbold, R. J. (1961). “Radiation field
http://physics.nist.gov/PhysRefData/Xcom/Text/XCOM.html from a circular disk source,” J. Res. Nat. Bureau Stand. 65C(4), 249–
Chilton, A. B., Shultis, J. K., and Faw, R. E. (1984). “Principles of 264.
Radiation Shielding,” Prentice-Hall, Englewood Cliffs, NJ. Hubbell, J. H. (1963). “A power-series buildup factor formulation. Ap-
Christophorou, L. G. (1971). “Atomic and Molecular Radiation Physics,” plication to rectangular and off-axis disk source problems,” J. Res.
Wiley, New York. Nat. Bureau Stand. 67C(4), 291–306.
P1: GPJ/GLT P2: GQT Final Pages
Encyclopedia of Physical Science and Technology EN013A-634 July 26, 2001 19:56
Hubbell, J. H. (1969). “Photon Cross Sections, Attenuation Coefficients, ICRU (1993). “Stopping Powers for Protons and Alpha Particles,” ICRU
and Energy Absorption Coefficients from 10 keV to 100 GeV,” Na- Report 49, ICRU Publications, Bethesda, MD.
tional Bureau of Standards Reference Data Series NSRDS-NBS 29. ICRU (1998). “Fundamental Quantities and Units for Ionizing Radia-
Hubbell, J. H., Veigele, Wm. J., Briggs, E. A., Brown, R. T., Cromer, tion,” ICRU Report 60, ICRU Publications, Bethesda, MD.
D. T., and Howerton, R. J. (1975). “Atomic form factors, incoherent Jenkins, T. M., Nelson, W. R., and Rindi, A. (eds.) (1988). “Monte Carlo
scattering functions, and photon scattering cross sections,” J. Phys. Transport of Electrons and Photons,” Plenum, New York.
Chem. Ref. Data 4(3), 471–538, Erratum in 6(2), 615–617 (1977). Johns, H. E., and Cunningham, J. R. (1983). “The Physics of Radiology,”
Hubbell, J. H., Gimm, H. A., and Øverbo/, I. (1980). “Pair, triplet, and 4th ed., Charles C Thomas, Springfield, IL.
total atomic cross sections (and mass attenuation coefficients) for 1 Knoll, G. F. (2000). “Radiation Detection and Measurement,” 3rd ed.,
MeV–100 GeV photons in elements Z = 1 to 100,” J. Phys. Chem. Ref. Wiley, New York.
Data 9(4), 1023–1147. Koch, H. W., and Motz, J. W. (1959). “Bremsstrahlung cross-section
Hubbell, J. H. (1982). “Photon mass attenuation and energy-absorption formulas and related data,” Rev. Mod. Phys. 31(4), 920–955.
coefficients from 1 keV to 20 MeV,” Int. J. Appl. Rad. Isotopes 33(11), Lamarsh, J. R. (1983). “Introduction to Nuclear Engineering,” 2nd ed.,
1269–1290. Addison-Wesley, Reading, MA.
Hubbell, J. H. (ed.) (1993). “Radiation physics at 1993: A topical com- Morin, R. L. (ed.) (1988). “Monte Carlo Simulation in the Radiological
pendium,” Rad. Phys. Chem. 41(4/5), 579–789. Sciences,” CRC Press, Boca Raton, FL.
Hubbell, J. H., Trehan, P. N., Singh, N., Mehta, D., Garg, M. L., Garg, Motz, J. W., Olsen, H. A., and Koch, H. W. (1969). “Pair production by
R. R., Singh, S., and Puri, S. (1994). “A review, bibliography, and photons,” Rev. Mod. Phys. 41(4), 581–639.
tabulation of K, L, and higher atomic shell x-ray fluorescence yields,” Schwarzschild, B. “The tau neutrino has finally been seen,” Physics
J. Phys. Chem. Ref. Data 23(2), 339–364. Today 53(10), 17–19.
ICRU (1969). “Neutron Fluence, Neutron Spectra and Kerma,” ICRU Seltzer, S. M. (1993). “Calculation of photon mass energy-transfer and
Report 13, ICRU Publications, Bethesda, MD. mass energy-absorption coefficients,”Rad. Res. 136(2), 147–170.
ICRU (1984). “Stopping Powers for Electrons and Positrons,” ICRU Winter, K. (ed.) (1991). “Neutrino Physics,” Cambridge University
Report 37, ICRU Publications, Bethesda, MD. Press, Cambridge, U. K. and New York.
P1: GPA Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
Superconductivity
H. R. Khan
FEM and University of Tennessee at Knoxville
I. Introduction
II. Superconducting Materials
III. Correlation: Tc with the Electronic Structure
of a Solid
IV. Flux Quantization
V. London Equation and Coherence Length
VI. Coherence Length and Energy Gap
VII. Thermodynamics of Superconductivity
VIII. Magnetic Superconductors
IX. Tunneling and the Josephson Effect
X. Theory of Superconductivity
XI. Applications of Superconductivity
XII. Recent Developments: High-Transition
Temperature Superconductivity
235
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
236 Superconductivity
large alternators, and much gain in efficiency and power ing state. The current flows without any attenuation in
density is obtained. Future fusion reactors will use su- this state, and it has been estimated that the decay time
perconducting magnets for confining deuterium and tri- of a current in a superconductor is about 100,000 years.
tium plasma. Superelectron pairs in a superconductor can The temperature below which a material loses its resis-
tunnel through a nonconducting thin layer. Based on this tance is called the superconducting transition or critical
“Josephson effect,” superconducting Josephson junctions temperature Tc .
are used as sensors, as high-energy electromagnetic radi-
ation detectors, and in high-speed digital signal and data
processing. B. Effect of a Magnetic Field on
Superconductivity and the Meissner Effect
FIGURE 1 Electrical resistance R () as a function of tempera- FIGURE 2 Normal and superconducting states of a material in
ture for mercury metal (Hg). an external magnetic field.
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
Superconductivity 237
FIGURE 4 Magnetization 4π M as a function of externally applied The distribution of superconducting elements in the pe-
magnetic field Ba of a type I superconductor. riodic system is shown in Fig. 8. Some elements do not
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
238 Superconductivity
Superconductivity 239
FIGURE 8 Distribution of the superconducting elements in the periodic table. [From Khan, H. R. (1984). Gold Bull.
17(3), 94.]
C. Intercalated Compounds als can be obtained in the form of thin films by evapora-
tion deposition on cold substrates. Amorphous materials
A typical example of this class of materials is
in bulk can also be obtained by rapidly cooling an alloy
TaS2 (C5 H5 N)1/2 . This compound is formed when TaS2
melt. The amorphous materials obtained in this way are
is intercalated with pyridine (C5 H5 N), and metallic layers
called metallic glasses. Materials of this class also ex-
about 6 Å thick are separated by pyridine layers of the
hibit superconductivity but are completely different from
same thickness. This intercalcated compound becomes
their crystalline counterparts. The superconducting tran-
superconducting at 3.5 K. A large number of transition
sition temperature Tc and the electron per atom ratio (e/a)
metal chalcogenides exist that crystallize in the layered
of some amorphous nontransition metals and alloys are
structures. These types of compounds show anisotropic
listed below. One sees that the Tc values in the amorphous
superconducting properties parallel and perpendicular to
state are higher than those in the crystalline state.
the layer surface. The critical magnetic field is about 30
times higher in the direction parallel to the layer surface Alloy T c (K) e/a
compared with the perpendicular direction, as shown in
Fig. 11. Be 9.95 2.0
Be90 A10 7.2 2.1
Ga 8.4 3.0
D. Amorphous Superconductors Pb90 Cu10 6.5 3.7
Bi 5 5
Unlike crystalline materials, amorphous or noncrystalline
materials consist of atoms that do not form regular arrays Amorphous films of transition metals and alloys have also
and are randomly distributed. These amorphous materi- been obtained, and the Tc values are lower than those of the
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
240 Superconductivity
a
E. Organic Superconductors
f.c.c., face-centered cubic; hex., hexagonal; orth., orthorhombic;
rhom., rhombohedral; tetr., Tetrahedral; b.c.c., body-centered cubic. In 1964 Little proposed that an organic polymer can also
become a superconductor. His theory of superconductivity
is based on a mechanism entirely different from that of the
same alloys in crystalline form. Metallic glass supercon- Bardeen et al. (1957) theory of metals and alloys. Little
ductors are classified into two main groups. One group suggested that the electrons on the spine of the polymer
consists of metal–metal compositions and the other of chain are attracted to each other by an indirect process
metal–metalloid compositions. The superconducting tran-
sition temperatures of some of the metallic glass supercon-
ductors are listed in Table IV. These metallic glass super- TABLE III Superconducting Transition Temper-
ature Tc of Some β-W Structure Compounds
conductors show some desirable properties. For example,
they are ductile and possess a high strength, whereas their Compound T c (K)
crystalline counterparts are brittle. The metallic glass su-
V3 Ga 14.2–14.6
perconductors also possess very high values of the critical
V3 Si 17.1
magnetic field.
Nb3 Au 11.0–11.5
A practical superconductor capable of producing a mag-
Nb3 Sn 18.0
netic field of about 10 T should have a critical current den-
Nb3 A10.8 Ge0.2 20.7
sity of about 106 A/cm2 . In general, the amorphous super-
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
Superconductivity 241
242 Superconductivity
F = Fs + E kin + E mag ,
Superconductivity 243
244 Superconductivity
FIGURE 18 Energy gap 2ε as a function of temperature T for a FIGURE 20 Tunneling of electrons through a thin insulating layer,
superconductor. B, between two superconductors, A and C.
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
Superconductivity 245
246 Superconductivity
so that or
Cn − Cs = T ∂(Sn − Ss )/∂ T , γ = H02 2π Tc2 . (7)
and Eq. (3) reduces to Combining Eqs. (6) and (7),
Hc ∼
= H0 1 − (T /Tc )2 .
Cs = BT 3 .
where B is a constant, whereas Cn is given as
Cn = AT 3 + γ T . VIII. MAGNETIC SUPERCONDUCTORS
Because
d Q = dU + Pd V = T d S, Ferromagnetism and superconductivity have been consid-
ered to be mutually exclusive phenomena. lt was assumed
that the large internal magnetic field present in a ferro-
S = d Q/T = C dT /T . magnetic material would not allow it to become a super-
Therefore, conductor. This is true, and so far none of the magnetic el-
ements (for example, chromium, manganese, iron, cobalt,
Ss = BT 2 dT = (BT 3 /3) and nickel) have exhibited superconductivity. A search
was made to find a material that exhibits superconductivity
and and ferromagnetism at different temperatures. Among the
Sn = (AT 2 + γ ) dT = (AT 3 /3) + γ T , rare-earth elements, lanthanum is superconducting at 6 K.
The other rare-earth elements are either paramagnetic or
and the difference is ferromagnetic, with magnetic moments that are due to 4 f
1 electrons. Matthias and co-workers dissolved gadolinium
Sn − Ss =(A − B)T 3 + γ T . metal in lanthanum and measured the superconducting
3
transition temperatures as a function of dissolved gadolin-
At the transition temperature Tc (in zero field)
ium. Figure 23 shows a plot of the superconducting tran-
Sn = Ss , sition temperature as a function of gadolinium dissolved
thus
1
(B − A)Tc2 = γ
3
and
Sn − Ss = γ T − T 3 Tc2 . (5)
From Eq. (4) it follows that
Superconductivity 247
248 Superconductivity
Superconductivity 249
applications where normal iron magnets are inadequate. mechanism, a superconducting transition temperature of
These superconducting magnets can be used as exciter ∼35 K could be achieved. These conclusions were based
magnets for homopolar generators or rotors in large alter- on research on about 24,000 superconducting inorganic
nators, and a large gain in efficiency and power density is phases. In 1986, J. G. Bednorz and K. A. Müller pub-
obtained. lished a paper in Zeitschrift f ür Physik on the possibil-
Fusion reactors will employ superconducting mag- ity of a superconducting transition temperature as high
nets to confine plasma in which deuterium and tritium as 30 K in a mixture of lanthanum and barium–copper
will be fused to produce energy. Soon the six D-shaped oxide (La2 −x Bax –CuOx )(x ∼ 0.15) of tetragonal K2 NiF4
superconducting coils of dimensions 2.5 × 3.5 m man- structure. This discovery broke all previous records and
ufactured in the United States and in European coun- received world attention, and the two authors received the
tries will be tested to produce a magnetic field of 1987 Nobel Prize.
8 T, which will be used to confine deuterium–tritium In a short time, superconducting oxides in the ranges
plasma to produce fusion energy. Superconducting mag- 30–40, 90–100, and above 100 K were discovered. At
nets have found use in particle beam accelerators for present, the highest achievable superconducting transition
high-energy particle physics research. Another applica- temperature under normal conditions is about 133 K. The
tion of superconducting magnets is in nuclear magnetic superconducting oxides of ∼90 K superconducting tran-
resonance tomography, which requires a homogeneous sition temperature are rare-earth barium–copper oxides of
magnetic field; superconducting magnets are ideal for this orthorhombic structure. The oxygen content in these ox-
purpose. ides plays a major role in the superconductivity. When
Another application of superconductors is as magnetic the oxygen content is reduced, the oxides transform to
sensors. As mentioned earlier, a Josephson junction is an a tetragonal structure and become semiconducting. Su-
extremely nonlinear detector which, when connected to a perconducting transition temperatures above 100 K are
loop of a superconducting wire, forms a superconducting observed in thalium-, bismuth-, strontium-, calcium-, and
quantum interference device (SQUID). These SQUIDs are copper-based oxides.
extremely sensitive to small changes in magnetic fields. All these materials are ceramics and brittle, not ductile
Based on the Josephson junction, high-frequency electro- like metals or alloys, and the electronic properties are
magnetic radiation detectors for frequencies in the range highly anisotropic. The critical current density is high
of microwaves have been developed. Josephson junction in one direction and low in the other, perpendicular,
technology also finds applications in digital signal and direction. The epitaxial thin films of some of these ox-
data processing due to the high-speed and low-power ides show critical current densities of 106 A/cm2 at liquid
dissipation compared to semiconductor technology. The nitrogen temperature. The critical current density of poly-
Josephson junction can replace semiconductor technol- crystalline materials in the polycrystalline state is very
ogy where high speed, ultrahigh performance, reliabil- low and not suitable for technical applications. The co-
ity, lower power, and compactness are required. Other herence length in these ceramic superconductors is quite
applications of superconductors include lossless trans- small and is comparable to the lattice constants. These ma-
port of electrical energy and generation of magnetic terials show rather strong electron–electron interactions,
fields for levitation and propulsion for high-speed ground for example, as reported by Steiner et al. (1988). There-
transportation. fore there is increasing evidence that the electron pair-
ing in the superconducting state is of a pure electronic
nature as suggested by Anderson (1987), and not caused
XII. RECENT DEVELOPMENTS: by electron–phonon interaction.
HIGH-TRANSITION TEMPERATURE The mechanical properties of these ceramic supercon-
SUPERCONDUCTIVITY ductors as well as their superconducting properties may
be improved by the addition of silver metal as reported
Until April 1986, the maximum superconducting transi- by Khan et al. At present, a worldwide effort is ongoing
tion temperature measured in Nb3 Ge was ∼23 K. This to improve the mechanical properties and to increase the
limited superconducting transition temperature allowed critical current densities of these materials for large-scale
large- and small-scale applications of superconductors applications. Once the mechanical properties of ceramic
only with the use of liquid helium. Decades of experi- superconductors are improved and the critical current den-
mental and theoretical research work showed that the phe- sity is increased to a practical value, it is expected that
nomenon of superconductivity could be explained by the these superconducting materials will revolutionize vari-
attraction of electrons caused by electron–phonon inter- ous technologies by working at liquid nitrogen, rather than
action (BCS theory). lt was suggested that, based on this liquid helium, temperatures.
P1: GPA Final Pages
Encyclopedia of Physical Science and Technology en016b-749 July 31, 2001 15:25
250 Superconductivity
SEE ALSO THE FOLLOWING ARTICLES Barone, A., and Paterno, G. (1982). “Physics and Applications of the
Josephson Effect,” Wiley, New York.
Bednorz, J. G., and Müller, K. A. (1986). Z. Phys. B64, 189.
CRYOGENIC PROCESS ENGINEERING • CRYOGENICS • Buckel, W. (1972). “Supraleitung,” Physik Verlag GmbH, Weinheim,
FERROMAGNETISM • RARE EARTH ELEMENTS AND Germany.
MATERIALS • SUPERCONDUCTING CABLES • SUPER- Khan, H. R. (1984). Gold Bull. 17(3), 94.
CONDUCTING DEVICES • SUPERCONDUCTIVITY MECH- Khan, H. R. (1998). J. Superconduct 11, 1.
Khan, H. R., and Loebich, O., (1995). Physica C. 254, 15.
ANISMS • SUPERCONDUCTORS, HIGH TEMPERATURE •
Khan, H. R., and Raub, C. J. (1985). Annu. Rev. Mater. Sci. 15, 21.
THERMOELECTRICITY Kittel, C. (ed.) (1976). “Introduction to Solid State Physics,” 5th ed.
Wiley, New York.
Newhouse, V. L. (1964). “Applied Superconductivity,” Wiley, New York.
BIBLIOGRAPHY Putlin, S. N., and Antipov, E. V. (1993). Nature 362, 226.
Roberts, B. W. (1976). J. Phys. Chem. Data 5(3), 581–821.
Anderson, P. W. (1987). Science 235, 1196. Saint-James, D., Sarma, G., and Thomas, E. J. (1969). “Type II Super-
Bardeen, J., Cooper, L. N., and Schrieffer, J. R. (1957). Phys. Rev. 108, conductivity,” Pergamon, Oxford.
1175. Steiner, P., et al. (1988). Z. Phys. B69, 449.
P1: GQT/GUE P2: GLQ Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
Superconductivity Mechanisms
Jozef Spalek
Jagiellonian University and Purdue University
I. Introduction
II. The Bardeen–Cooper–Schrieffer (BCS) Theory:
A Brief Summary
III. Normal and Magnetic States of Correlated
Electrons
IV. Novel Mechanisms of Electron Pairing
V. Conclusions
251
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
exchange interaction provides a mechanism of mag- by electrons with their spin down, and vice versa. The
netic ordering in Mott insulators; it may also correlate parent stochiometric materials for high-temperature su-
electrons into singlet or triplet pairs in the metallic state, perconductors (e.g., La2 CuO4 and YBa2 Cu3 O6 ) are an-
particularly when the pair-exchange coupling J of an tiferromagnetic Mott insulators with Néel temperatures
electron pair is comparable to the kinetic energy of each (TN = 250 and 415 K, respectively).
of its constituents, as is the case for strongly correlated Real-space pairing Source of attraction or superconduct-
electrons. The superexchange (or kinetic exchange) is ing correlations that is not induced by lattice defor-
induced by a strong electron correlation. mation (phonons). Such pairing may be provided by
Fermi liquid Term describing the state of interacting the density fluctuations within the interacting electron
electrons in a metal. Equilibrium properties of such subsystem (e.g., by spin fluctuations or other excita-
systems are modeled by a gas of electrons with renor- tions). By real-space pairing we mean the pairing of
malized characteristics such as the effective mass (they electron spins in correlated metals caused by exchange
are called quasiparticles). The properties at low tem- interactions (e.g., kinetic exchange) among electrons in
peratures are determined mainly by electrons near the coordinate space. The essence of the real-space pair-
Fermi surface. The electron–electron interactions lead ing, not resolved as yet, is contained in the question,
to specific contributions to the transport properties of Can a strong short-range part of the coulomb repulsion
such a system producing, e.g., sound-wave and plas- (of range a0 ) lead to an attraction (an effective bind-
mon excitations. ing) at intermediate distances (2 ÷ 10a0 ), where strong
High-temperature superconductors Oxide materials of singlet–spin correlations prevail?
the type La2−x Srx CuO4 or YBa2 Cu3 O7−x , which have Strongly correlated electrons Electrons describing the
a layer structure, with the principal role of elec- metallic state of high-temperature superconductors,
trons confined to the CuO2 planes. The term “high- some heavy-fermion systems (non-Fermi liquids),
temperature superconductors” (HTS) was coined to and, particularly, systems of low dimensonality, d = 1
distinguish these and other oxide superconductors with and 2. In these systems, the concept of a Fermi liq-
a critical temperature Tc 20 K from “classical” super- uid is inapplicable, and for d = 1, at least, the charge
conductors, which comprise metals and intermetallic and spin degrees of freedom lead to separate quasi-
compounds such as Nb3 Ti with Tc 23 K. At present, particle representations—holons and spinons, respec-
this class of materials (HTS) is characterized by the tively. The quantum liquid describing strongly corre-
quasi-two-dimensonal structure of the normal metallic lated electrons composes a new quantum macrostate.
state above Tc and strong deviations from either the
normal Fermi-liquid or the BCS superconducting type
of behavior in the corresponding temperature regimes I. INTRODUCTION
T > Tc and T < Tc , respectively.
Hubbard subband Term describing each of the two parts Superconductivity remains among the most spectacu-
of an energy band in a solid which splits when the lar manifestations of a macroscopic quantum state of
electron–electron repulsion energy is comparable to electrons in a metal or plasma. Experimentally, one ob-
(or larger than) their kinetic (band) energy. The Hub- serves below a characteristic temperature Tc a transition
bard splitting of the original band induced by the inter- to a phase with nonmeasurable dc resistance (or with a
action explains in a natural way the existence of the persistent current), a perfect diamagnetism of bulk sam-
Mott insulating state in the case of an odd number ples in a weak magnetic field, and quantum tunneling be-
of electrons per atom (that is, when the atomic shells tween superconductors separated by an insulating layer of
would normally form an only half-filled band; cf., e.g., mesoscopic (∼1-nm) thickness. In the theoretical domain,
CoO). one studies the quantum–mechanical (nonclassical) mech-
Mott insulator An insulator containing atoms with par- anisms of pairing of the microscopic particles (fermions)
tially filled 3d or 4f shells. These systems order mag- at a macroscopic scale. Here, we summarize briefly
netically (usually antiferromagnetically) when the tem- our present understanding of the Bardeen–Cooper–
perature is lowered. Thus, they differ from ordinary Schrieffer (BCS) theory of “classical” superconductors
(Bloch–Wilson) or band insulators, which are weakly (see Section II) and we review the current theoretical
diamagnetic, and are characterized by filled atomic approaches to new superconductors: the heavy-fermion
shells, separated from empty states by a gap. In the an- materials and the high-Tc magnetic oxides. The latter sub-
tiferromagnetic phase of the Mott insulators each elec- ject is discussed in Section IV, after we summarize normal-
tron with its (frozen) spin oriented up is surrounded state properties of correlated electrons in Section III.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
A brief characterization of the recent studies of super- atomic scale become crucial. Second, a whole class of
conductivity is in order. From the time of the first dis- models (discussed in Section IV) relies on the electron
covery (1911) of superconductivity in mercury (at tem- pairing induced by short-range electron–electron interac-
perature Tc 4.2 K) by Kammerlingh Onnes until 1986, tions. These interactions are strong and also present in the
studies were limited to low temperatures, T < 25 K. Dur- normal phase. This is the reason one must develop a co-
ing the next 5 years, six classes of new superconduct- herent theoretical picture of the correlated metallic state
ing compounds with critical temperatures Tc = 30 K (for that undergoes a transformation either to the Mott insu-
Ba1−x Kx BiO3 ), 40 K (for La2−x Srx CuO4 ), 90 K (for lating or to the superconducting state. Such a theory does
YBa2 Cu3 O7−δ ), 110 K (for Bi2 Sr2 CaCu2 O8 ), and 125 K yet not exist.
(for Tl2 Ca2 Ba2 Cu3 O10−y ), and 135 K (HgBa2 Ca2 Cu3 O8 ) In this chapter, the properties of correlated electrons in
were discovered and/or thoroughly studied in a number of normal, insulating, magnetic, and superconducting phases
laboratories. In recent years the idea has also been applied are reviewed and related to the parametrized models, start-
to new systems such as Fermi condensated dilute gases ing from either Hubbard or Anderson-lattice Hamiltoni-
and quark–gluon plasma in high-energy physics. Apart ans. These are the models that describe the properties of
from the discovery of spin–triplet pairing in liquid 3 He, correlated metallic systems in terms of a few parameters,
evidence for it in Sr2 RuO4 also opens new possibilities for such as the band width W of starting (uncorrelated, bare)
pairing studies. electrons, the magnitude U of short-range (intraatomic)
The starting point in both classical and new supercon- coulomb interactions, etc. Such models provide an over-
ducting materials is the electronic structure that deter- all understanding of both the nature of correlated metallic
mines the metallic properties in the normal phase (that and insulating ground states and the underlying thermody-
is, that above Tc ). In this respect, the classical supercon- namic properties of these systems. However, the guidance
ductors are well described by band theory and, in some of detailed band structure calculations is often needed in
cases, starting from the concept of the Fermi–liquid con- choosing appropriate values for the microscopic parame-
cept. In contrast, the new materials are characterized as ters, as well as to understand the specific features of the
those whose electrons are close to localization, that is, compounds.
those close to the metal–insulator transition of the Mott–
Hubbard type. The latter transition may be induced by
a relatively small change in compound composition (cf. II. THE BARDEEN–COOPER–SCHRIEFFER
the behavior of La2−x Srx O4 or YBa2 Cu3 O7−x as a func- (BCS) THEORY: A BRIEF SUMMARY
tion of x). It is quite interesting to note that oxides such
as YBa2 Cu3 O7−x may be synthesized in either insulat- The BCS theory [1–10] relies on three features of metal-
ing (x 0.65) or metallic states. Additionally, antiferro- lic solids: (1) the electron–lattice interaction; (2) the for-
magnetic ordering of the 3d electrons is observed close mation of an electron-pair bound state (the so-called
to the insulator–metal transition; the magnetic insulat- Cooper pair state) due to the coupling of the electrons to
ing state transforms into a superconducting state when the lattice; and (3) the instability of the normal metallic
0 ≤ x 0.65. Therefore, an account of our understand- state with respect to the formation of a macroscopic con-
ing of the almost-localized metallic state in a normal or densed state of all pairs (k↑, −k↓) with antiparallel spins
magnetic (that is, nonsuperconducting) phase is highly in momentum (k) space. The condensed state exhibits the
desirable and summarized in Section III. The antiferro- principal properties of superconductors, such as a per-
magnetic insulating, normal metallic, and superconduct- fect diamagnetism, zero dc resistance, etc. We first dis-
ing states must all be treated on the same footing for a cuss these three features briefly and then summarize some
proper characterization of high-Tc oxides. In this manner, consequences of the BCS theory. The BCS theory not only
the studies of those systems must incorporate the descrip- deals with one of the possible (phonon-mediated) mecha-
tion of different quantum phase transitions. One can say nisms for superconductivity, but also provides proper lan-
that the theory of strongly correlated electrons and of the guage for the description of such a condensed state in
superconductivity in those systems poses one of the most general terms, independent of the particular pairing mech-
challenging problems for physics of the 21st century. anism. One should also remark at the beginning that such a
Details of the electronic structure in high-Tc oxides condensed state of pairs cannot be regarded as a Bose con-
are also important for two additional reasons. First, as densed state if the size of the bound-state wave function
discussed later, in these superconductors the coherence ξ (the coherence length) is much larger than the interpar-
1
length is quite small, that is, comparable to the lattice ticle distance a = (V /N ) 3 ; this happens for the “classic”
constant. Hence, the details of the wave function on the superconductors.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
FIGURE 1 (a) Scattering diagram of electrons with wave vectors k → k + q, accompanied by emission of the phonon
of wave vector −q. (b) Virtual emission and subsequent reabsorption of the phonon by electrons. The two processes
drawn combine into the contribution [Eq. (2)] leading to the effective electron–electron attraction.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
The coefficients of the transformation fulfill the condition D. Summary of the Properties:
u 2k + vk2 = 1. One should note that the transformation does The Homogeneous State
not conserve the particle number, so one has to add the
The solution of Eq. (9) provides the following properties.
term (−µN ) to the Hamiltonian (b), where µ ≡
F is the
chemical potential in the superconducting state.
1. At T = 0, Eq. (1) reads
The single-particle excitations in the superconducting −1
phase are specified by 1 = (V /2N ) Ek . (11)
k
1 The value of ≡ (T = 0) for ρV 1 is given by
E k = (
k − µ)2 + |k |2 2 , (7)
hω p 1
where µ is the chemical potential of the system and |k | o = ≈ 2hω p exp − , (12)
sinh(1/ρV ) ρV
is the so-called superconducting gap determined from the
self—consistent equation where hω p ≈ kB θD 1 and ρ is the density of States at the
Fermi energy. One notes a striking similarity between
Eq. (12) and Eq. (4), particularly for ρV 1 (this condi-
k βE k
k = − Vkk tanh , (8) tion represents the so-called weak—coupling limit); The
k
2E k 2
absence of factor 2 in (12) provides an enchancement of
the gap in the condensed state due to the presence of other
with β ≡ (kB T )−1 . One should note that if Vkk is approx-
electrons.
imated by a negative constant, then k = ; Eq. (8) then
2. We can choose the origin of energy at µ. Then Eq. (9)
yields as a solution either ≡ 0 or = 0, obeying the
can be transformed into an integral form:
equation h ωp
ρ(
) d
β
V 1 β Ek 1=V 1 tanh
2 + 2 . (13)
1= tanh , (9) o (
2 + 2 ) 2 2
N k 2E k 2
Since hωp µ, we may take ρ(
) ≈ ρ(
F ) ≡ ρ within the
range of integration. This allows for an analytic evaluation
where now
of the critical temperature for which = 0:
1
E k = (
k − µ)2 + 2 2 . (10) 1
Tc = 1.13θD exp − . (14)
ρV
The primed summation in Eq. (9) is restricted to the regime
of k states where V = 0. Equations (9) and (10) constitute In all these calculations, it is implicitly assumed that
the simplest BCS solution for an isotropic (k-independent) ρV 1. Because of the presence of the exponential factor
gap. One sees that E k is always nonvanishing and reaches a in Eq. (14), the critical temperature Tc is much lower than
minimum E k = for electrons placed on the Fermi level, the Debye temperature characterizing the average energy
where
k = µ. Thus, the meaning of the gap becomes obvi- of acoustic phonons. This is the principal theoretical rea-
ous: it is the gap for the single—electron excitations from son that Tc is so low in the superconductors discovered
the superconducting (condensed) phase to a free—particle in the period 1911–1986. The exponential dependence of
state. The presence of a gap > kB Tc in the spectrum of Tc on the electronic parameter ρV also explains why the
single—particle excitations suppresses the scattering of parameters pertaining to the electronic structure, which
electrons with acoustic phonons. The thermally excited are of the order of 1 eV or more, respond to phase transi-
electrons across the gap do not yield nonzero resistivity tions on an energy scale that is three orders of magnitude
because their contribution is short-circuited by the pres- smaller (kB Tc ∼ 1 meV). Effects with such a nonanalytic
ence of the pair condensate that carries a current with no dependence of transition temperature on the coupling con-
resistance. The same holds true even for the superconduct- stant cannot be obtained in any order of perturbation theory
ing systems for which the gap vanishes along some lines starting with the normal state as an initial state. A similar
or at some points in k space. type of effect is obtained in the studies of the Kondo effect
One should emphasize that all thermodynamic prop- (cf. Section IV).
erties are associated with thermal excitations; the en- 3. Combining Eqs. (14) and (1.12) one obtains the uni-
ergies that are specified by Eq. (7) contain |k | or versal ratio
as a parameter to be determined self-consistently from 2o
= 3.53, (15)
Eq. (8) or (9), respectively. Next, we provide a brief sum- kB Tc
mary of the results that may be obtained within the BCS 1 In actual practice, one assumes that hω ≈ 0.75k θ (cf. Meservey
p B D
theory. and Schwartz in Ref. 9).
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
which is frequently used as a test for the applicability of the spin–singlet bound state is destroyed by the thermal
the BCS model. However, this value can also be obtained fluctuations. The pair binding energy is then effectively
in the strong-coupling limit [15] for a particular strength overcome by the magnetic energy, so that the pairs break
of electron–phonon coupling. up into single particles. Strictly speaking, this type of be-
4. By regarding energies E k as representing electron havior characterizes the so-called superconductors of the
excitations across the gap, one can write the expression first kind.
for the entropy of a superconductor in the standard form: 6. By calculating the specific heat from the stan-
dard thermodynamic analysis [CS = −T (∂ 2 FS /∂ T 2 )V ],
S = −2kB [ f k n f k + (1 − f k ) n(1 − f k )], (16)
k
one obtains at T = Tc a discontinuity of the form
where f k ≡ f (E k ) = [1 + exp(β E k )]−1 is the Fermi– CS − CN
= 1.43, (19)
Dirac distribution function [1 + exp(β E k )]−1 . Hence, the CN
free energy of the superconducting state is where CN is the specific heat at Tc for the material in
FS = 2 E k f k − TS. (17) its normal state. At low temperatures, the specific heat
k decreases exponentially:
One should note that the thermodynamic properties are de-
0
termined fully only if the chemical potential µ = µ(T ) and CS ∼ exp − , (20)
kB T
the temperature dependence of the superconducting gap
k = k (T ) are explicitly determined, since only then is for the special case of an isotropic gap. However, if the gap
the spectrum of single—particle excitations (characterized is anisotropic [ = k (T )] and has lines of zeros (along
by the energies {E k }) uniquely determined. The quantity which k = 0), then the low—temperature dependence of
(T ) is determined from Eq. (13). The chemical poten- CS does not follow Eq. (20) but rather a power law T n ,
tial is determined from the conservation of the number Ne with n depending on the details of the gap anisotropy.
of particles, that is, from the condition k f k = Ne . The The specific heat grows with T because the number of
temperature dependence of the gap in the isotropic case is thermally broken pairs increases with rising temperature;
shown schematically in Fig. 3. eventually, at T = Tc (kB Tc ∼ o ), all bound pairs disso-
5. By calculating this difference of the free energies ciate thermally, at which point Cs reaches a maximum.
FS − FN in superconducting (FS ) and normal (FN ) phases If the temperature is raised further (above Tc ), the spe-
and equating the difference with the magnetic free energy cific heat drops rapidly to its normal-state value since no
Hc2 V /8π (V is the volume of the system), one can obtain pairs are left to absorb the energy. This type of behavior
an approximate relation of the form is observed in superconductors with an isotropic gap (cf.,
2 e.g., Hg and Sn). One should note that this interpretation
Hc (T ) T
≈1− . (18) of the thermal properties is based on the single—particle
Hc (0) Tc excitation spectrum [Eq. (10)]; we have disregarded any
For the applied field H > Hc , superconductivity is de- fluctuation phenomena near Tc , as well as collective ex-
stroyed because in the thermodynamic critical field Hc citations of the condensed system. It can be shown that
the large coherence length ξ ∼ 103 /104 Å encountered in
classic superconductors [8] is related to the absence of
critical behavior near Tc . This is not the case in high-Tc
superconductors (discussed in Section IV); hence, the new
materials open up the possibility of studies of critical phe-
nomena in superconducting systems.
7. The spin part of the static magnetic susceptibility
vanishes as T → 0. This is a direct consequence of the
binding of electrons in the condensed state into singlet
pairs. Therefore, the Meissner effect (the magnetic flux
expulsion from the bulk of the sample) at T = 0 is present
because the orbital part of the susceptibility is diamag-
netic (roughly, it represents an electron-pair analogue of
the Landau diamagnetism of single electrons in a normal
FIGURE 3 Schematic representation of the temperature depen- electron gas). The expulsion of the magnetic flux from the
dence of the superconducting gap for the isotropic change. Tc is bulk is measured in terms of the so-called London pene-
the critical temperature for the transition, and 0 ≡ (T = 0). tration depth λ = λ(T ), which characterizes the decay of
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
interatomic distance, thus reducing W , which is directly quantity for the atomic state |iσ . This simple Hamiltonian
related to the wave–function overlap for the two states describes the localization versus delocalization aspect of
located on the nearest-neighboring sites. The splitting of electron states since the first term provides the gain in en-
the original band into two Hubbard subbands eliminates ergy (
k < 0) for electrons in the band state |kσ , whereas
the paired (spin–singlet) occupations of the same energy the second accounts for an energy loss (U > 0) connected
state
. Effectively, this pattern reflects the situation of with the motion of electrons throughout the system that is
electrons being separated from each other as far as pos- hindered by encounters with other electrons on the same
sible; however, the correspondence between the Hubbard atomic site. The competitive aspects of the two terms are
split–band situation, shown schematically in Fig. 7, and expressed explicitly if the first term in Eq. (24) is trans-
the electron disposition in the spin lattice in real space (cf. formed by the so–called Fourier transformation to the site
Fig. 5b) is by no means obvious and requires a more de- {|iσ } representation. Then Eq. (24) may be rewritten
tailed treatment that relates these two descriptions of the
+
Mott insulator. This problem is dealt with in the following H= ti j aiσ a jσ + U n i↑ n i↓ , (26)
i jσ i
section.
where
1. The Hubbard Model 1
ti j =
k exp[ik · (R j − Ri )], (27)
N k
In discussing narrow band systems, one usually starts from
+
the model Hamiltonian due to Hubbard [25], which ap- is the Fourier transform of the band energy
k and aiσ (ai σ )
pears to be complicated but can really be interpreted in is the creation (annihilation) of electrons in the atomic
simple terms, namely, (Wannier) state centered on the site R j . The first term in
Eq. (25) represents the motion of an electron through the
H=
kσ n kσ + U n i↑ n i↓ , (25)
kσ i
system by a series of hops j → i, which are described
in terms of destruction of the particle at site j and its
where
k is the single-particle (band) energy per electron
subsequent recreation on the neighboring site i. The width
with the wave vector k, U is the magnitude of intraatomic
of the corresponding band in this representation is given
coulomb repulsion between the two electrons located on
by
the same atomic site i, n kσ is the number of electrons in
the single–particle state |kσ , and n iσ is the corresponding W =2 |ti j | ≈ 2z|t|, (28)
j(i)
1
p o (
) = , (28a)
π (
− t0 )2 + 2 FIGURE 9 The position of the Fermi level
F as a function of the
band filling n for different values of interaction (from the bottom to
where t0 determines the position of the center of the band the top curve), U/ = 0,0.5, 2, and 10. For U/ = 10, the Fermi
(usually chosen as t0 = 0). Detailed calculations [28] level jumps between the subbands when n ≈ 1. [From Ref. 26.]
show that with a growing magnitude of interaction (U/),
the DOS [Eq. (28a)] splits into two parts described by the
density of states:
To display the of Mott insulator as a two-band system
1 − (n/2) n/2 in which the Hubbard subbands assume a role similar to
ρ(
) = + .
π (
− t0 )2 + 2 (
− t0 − U )2 + 2 that of the valence and conduction bands in an ordinary
(28b) semiconductor, we have plotted in Fig. 9 the position of
the Fermi level as a function of the numbers of electrons n
The first term describes the original DOS [Eq. (28a)], with per atom in the system. As n moves past unity, a jump in
the weighting factor (1 − (n/2)), whereas the second rep-
F occurs for U/ 1. This is exactly what happens in
resents the upper subband (on the energy scale), with the the ordinary semiconductor when the electrons are added
weighting factor (n/2) and shifted by an amount U . These to the conduction band. This feature shows once more
two terms and the corresponding two parts of the DOS in that the states near the upper edge of the lower Hubbard
Fig. 8 describe the Hubbard subbands. The dashed line subband (that is, the states near
F for n close to but less
in Fig. 8 represents the inverse lifetime of single–electron than unity) can be regarded as hole states. We will see that
states placed in the pseudogap, while the arrows point those states are the ones with a high effective mass.
to the position of the Fermi energy in each case. For n = 1, It should be emphasized that the Hubbard subband
the Fermi level falls in a pseudogap, where the lifetime of structure is characteristic of magnetic insulators and can-
those quasiparticle states is very short. This is reminiscent not be obtained with a standard band theoretical approach
of the behavior encountered in an ordinary semiconductor, to the electron states in solids. The N states in the lower
where the states in the band gap are those with a complex Hubbard subband are almost singly occupied; this is di-
wave vector k. The lifetime may qualitatively simulate the rectly related to the picture of unpaired spins in Fig. 5b
atomic disorder-producing spread (Lorentian-shape) form and is one of the reasons for calling the electron states
of the bare density states. for such interacting systems correlated electronic states.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
The other reason (discussed in detail later) arises because perature for a canonical system (V1−x Crx )2 O3 (the data
the proper description of electronic states near the local- are from Ref. 27). The number of transitions (one, two, or
ization threshold (the Mott transition) requires that one three) depends on the Cr content. Note the presence of an
incorporates two-particle correlations into the quasiparti- intervening metallic state between the antiferromagnetic
cle states. The Hubbard split-band picture is only the first insulating (AFI) and the paramagnetic insulating (PM)
step in the proper description of the electron states. Those states, as well as the reentrant metallic behavior at high
additional correlations will lead to a very heavy mass of temperatures for 0.005 x 0.0178. To rationalize these
quasiparticles near the Mott transition; the heavy mass data, we discuss the physical implications of a model of in-
indicates a strong reduction of the bare bandwidth W as teracting narrow-band electrons for U ∼ W starting from
the localization threshold is approached from the metallic the Hamiltonian [Eq. (25)]. We summarize here the main
side. features of the detailed discussion presented in Refs. 28–
30, which provide the main features of the ground-state
3. Localized versus Itinerant Electrons: and thermodynamic properties.
Metal–Insulator Transitions In the absence of interactions (U = 0), the band energy
per particle is
¯ = −(W/2)n(1 − n)/2), where 0 ≤ n ≤ 2
The Hubbard split–band picture of unpaired electronic is the degree of band filling; for n = 1, this reduces to
states in a narrow band, shown in Figs. 7 and 8, provides
¯ = −W/4. When the interactions are present, the band
a rationale for the existence of a paramagnetic insulating narrows; this is because of a restriction on the elec-
ground state of the interacting electron system. The corre- tron motion caused by their repulsion, as described ear-
sponding experimentally observed metal–insulator transi- lier. One way of handling this restriction is to adjoin to
tions (MITs) at a finite temperature are very spectacular, the bare bandwidth a multiplying factor . This leads
as demonstrated in Fig. 10, where the resistivity (on a log- to a renormalized DOS for quasiparticles, as illustrated
arithmic scale) is plotted as a function of the inverse tem- in Fig. 11. The factor is a function of the particle–
particle correlation function η ≡ n i↑ n i↓ , the expecta-
tion value for the double occupancy of a representative
lattice site. The quantity η is calculated for T = 0 self-
consistently by minimizing the total energy E G (per site),
composed of the band energy E B =
and the coulomb
repulsion energy U η, where the parameter is specified by
= 8η(1 − 2η) [28, 29]. These two energies represent the
expectation values of the two terms in Eq. (25) for the case
of a half-filled band. The optimal values of the quantities moments is neglected, then each site in the paramagnetic
are given by state is randomly occupied by an electron with its spin
1 either up or down. The free energy FI for such an insulating
η0 = (1 − (U/Uc )), (29a) system of N moments is provided by the entropy term for
4
randomly oriented spins, that is,
0 = 1 − (U/Uc )2 , (29b) FI
= −kB T n 2 (32)
and N
E G = 1 − (U/Uc )2
¯ , (29c) Now, a system in thermodynamic equilibrium assumes
the lowest F state. The condition for the transition from
with Uc = 8|¯
| = 2W . Thus, as U increases, η0 decreases the metallic to the local-moment phase is specified by
from 14 to 0. At the critical value U = Uc , E B = 0 and there F = FI . The phase transition determined by this condition
are no double occupancies for the same lattice site; this sig- can be seen explicitly when we note that the free energy
nals the crossover by the system from the itinerant (band) varies with T either parabolically [Eq. (31)] or linearly
to the localized (atomiclike) state. The point U = Uc corre- [Eq. (32)], depending on whether the system is a param-
sponds to a true phase transition at T = 0; the last statement agnetic metallic (PM) or a paramagnetic simulating (PI)
can be proved by calculating the static magnetic suscepti- phase. As illustrated in Fig. 12, several of those curves
bility, which is [29]
(1 + I )/2
χ = χ0 0 1 − Uρ , (30)
(1 + I )2
where I ≡ U/Uc , ρ is the density of bare band states at
=
F , and χ0 is the magnetic susceptibility of band elec-
trons with energy
k at U = 0. As → 0 (that is, U → Uc ),
the susceptibility diverges. The localized electrons are rep-
resented in this picture by noninteracting magnetic mo-
ments for which the susceptibility is given by the Curie
law χ = C/T → ∞ as T → 0. Thus, the MIT is a true
phase transition; η0 may be regarded as an order parameter,
and the point U = Uc as a critical point. We concentrate
now on a more detailed description of the metallic phase,
which permits a generalization of the previous results to
the case T > 0. First, as has been said, the increase in mag-
nitude of interaction U reduces the band energy accord-
ing to E B = −W 0 /4. Eventually, E B becomes compa-
rable to the interaction part U η; they exactly compensate
each other at U = Uc . The resultant electronic configu-
ration (localized versus itinerant) is then determined at
T > 0 by the very low entropy and the exchange interac-
tion contributions. The entropy of the metallic phase in
the low–temperature regime may be estimated by using
the linear specific heat expression for electrons in a band
narrowed by correlations, namely, Cv ≡ γ T = (γ0 | 0 )T ,
where γ0 = 2π 2 kB2 ρ/3 is the linear specific heat coefficient
(per one atom) for uncorrelated electrons (that is, U = 0).
Hence, the entropy S = γ T = Cv . Combining this relation FIGURE 12 (a) Plots of the free energies for the paramagnetic
with the resultant energy at T = 0, given by Eq. (29c), one Mott insulator (the straight line starting from the origin) and the
can write an explicit expression for the free energy of the correlated metal (the parabolas). The parabolic curves’ points of
metallic phase [30]: crossing at L and J correspond to a discontinuous metal–insulator
transition, while those crossing at K and M correspond to the re-
F U 2 1 γ0 2 verse. (b) Schematic representation of the phase diagram be-
= 1−
¯ − T . (31) tween paramagnetic metallic (PM and PM and paramagnetic in-
N Uc 2 0
sulating (PI) phases. The points of crossing from a are also shown.
This is the free energy per one atomic site. On the other The vertical arrow represents a sequence of the transitions shown
hand, if the exchange interaction between the localized in Fig. 10 for 0.005 x 0.018 and in the paramagnetic phase.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
intersect at one or two points depending on the value U/W . been discussed extensively (e.g., Gebhardt, Ref. 30). A
These intersection points determine the stability limits of central peak is located between the Hubbard subbands,
the PM and PI phases. The lowest curve for the PM phase which carries the main part of the quasiparticle weight.
lies below the straight line for the PI state; there is no There are two problems with the application of this solu-
transition, that is, the metallic Fermi liquid state with the tion to concrete systems. First, the upper critical dimen-
effective mass enhancement m ∗ /m 0 2.5 is stable at all sonality is not known for Mott systems. Second, the disap-
temperatures. As U/W increases, the parabolas fall higher pearance of the central peak at the localization threshold
on the free energy (F/WN ) scale and the possibilities for is being debated.
transitions open up. The higher two curves illustrate the
case in which the intersections with the straight lines oc-
cur at J and K and at L and M, respectively; at low and 4. Strongly Correlated Electrons: Kinetic
high temperatures the parabola lies below the straight line Exchange Interaction and Magnetic
for FI /WN , so that the metallic phase is stable in those T Phases in Three-Dimensonal Space
regions. At intermediate temperatures, the PI phase is sta- In the limit W U , the ground state of the interacting
ble. The loci of the intersections move farther apart on the electron system will be metallic only if the number of
kB T /W scale as U /W is increased, as shown in Fig. 12b, electrons Ne in the system differs from the number N
where the phase boundaries are drawn; this part of the of atomic sites. Simply, only then can charge transport
figure represents the temperature of the transitions (the take place via the hole states in the lower Hubbard sub-
intersection points in Fig. 12a) versus the relative mag- band (for Ne < N ), that is, when the transport of elec-
nitude of interaction U/W . We see that the PM phase is trons can be represented via hopping from site to site,
stable at low temperatures; thus, reentrant metallic behav- avoiding the doubly occupied configurations on the same
ior is encountered at high T . The explicit form of (the site. This restriction on the motion of individual elec-
curve in Fig. 12b) is obtained from the coexistence condi- trons is described above in terms of the band narrowing
tion F = FI , which leads to the following expression for factor , which, in the normal phase, is now of the or-
the transition temperatures [29]: der [28, 29] = (1 − n)/((1 − n)/2). This shows that the
2
effective quasiparticle bandwidth W ∗ ≡ W is nonzero
kB T+ 3 U
= 2
1− n 2 only if the number of holes δ ≡ 1 − n > 0.
W 2π Uc For W U , there is one class of dynamic processes that
12 is important in determining the magnetic interactions bet-
π 2 1 − (U/Uc ) ween strongly correlated itinerant electrons, namely, the
± (n 2)2 + . (33)
3 1 + (U/Uc ) virtual hopping processes, with the formation of a dou-
bly occupied site configuration in the intermediate state.
The root T− represents the low-temperature part, that is, Such processes are depicted in Fig. 13, where one electron
that for kB T /W ≤ 0.049. The T+ part is the one above the hops onto the site occupied by an electron with opposite
point where both curves meet; this takes place at the lower spin and then hops back to the original site. During such
critical value of U = Uc such that processes, the electrons can exchange positions (and the
√
Uc 3 2 1 yields to the spin reversal of the pair with respect to the
=1− ≈ 0.75. (34) original configuration) or the same electron can hop back
Uc 2π (ρ|¯
|) 12
and forth. The corresponding effective Hamiltonian, in-
Below the value of U = Uc , the correlated Fermi liq- cluding the virtual-hopping processes in first nontrivial
uid is stable at all temperatures. Ultimately, for 1.58 ≤ order, has the form
U/W ≤ 2.0, only one intersection (at low T ) of the curves
remains. This means that in this regime of U/W the reen-
trant metallic behavior is achieved gradually as the temper-
ature increases. The above-described transitions are ob-
served when changing the magnitude of interaction (U/W
ratio). In the case of high-temperature superconductors we
observe the transition from a Mott insulator to a supercon-
ductor as a function of doping (carrier concentration). This
FIGURE 13 Virtual hopping processes between singly and dou-
case is discussed next.
bly occupied atomic sites that lead to an antiferromagnetic ex-
change interaction between the neighboring sites. This interac-
Note added in August 2000. In recent years, the tion is responsible for the antiferromagnetism in most of the Mott
Mott localization in the limit of infinite dimension has insulators.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
1 1 W
H= σ
k n kσ + 2ti2j U Si · S j − n i n j , nc 1 − ∼ 0.02 ÷ 0.05.
kσ ij
4 2z U
(35) In Fig. 14, we have plotted schematically the commonly
where in general the band narrowing factor σ = (1 − n)/ accepted phase diagram for three dimensional systems
(1 − n σ ), n σ = n iσ is the average number of particles describing the possible magnetic phases on the plane
per site with the spin quantum number σ , and n i ≡ n i↑ + n − (U/W ). Close to the case of one electron per atom,
n i↓ is the operator of the number of particles on given site the antiferromagnetic (AF) phase is stable for any arbitrary
i. Note that in the paramagnetic state n σ = n −σ = n/2, strength of interaction. At intermediate filling, the ferro-
and σ reduces to = (1 − n)/((1 − n)/2), the value for magnetic (F) phase may be stable. On the low-interaction
the normal state. side (W/U > 1), the ferromagnetic phase terminates at
One should note that the effective Hamiltonian points where the Stoner criterion is met, that is, when
[Eq. (35)] represents approximately the original Hub- ρ 0 (
F )U = 1, where ρ 0 (
F ) is the value of the bare density
bard Hamiltonian for W U (for more precise treatment, of states (per spin) at the Fermi level
F .
see Ref. 31 and Section IV.A). When n → 1, φ → 0, and Peculiar features appear in the corner where n ≈ 1, and
Eq. (35) reduces to the Heisenberg Hamiltonian with an- W/U 1, that is, where the number of holes is small,
tiferromagnetic interaction, which is the reason why most so that the exchange interaction contribution to the total
Mott insulators order antiferromagnetically. In the limit system energy is either larger than or comparable to the
of a half–filled band, we also find that the effective band- band energy part ¯
. In such a situation, a mixed ferro-
width W ∗ ≡ W = 0, thus proving that the electrons in magnetic–antiferromagnetic phase is possible [32]. When
that case are localized on atoms. The nature of the wave the number of holes is very small, each hole may form
function for these quasi–atomic states has not yet been a magnetic polaron with a ferromagnetic cloud accom-
satisfactorily analyzed, though some evidence given later panying it: the hole is self-trapped within the cloud of
shows that they should be treated as soliton states. ferromagnetic polarization it created. We consider those
For n < 1, the normalized band (the first term) and the objects next.
exchange parts in Eq. (35) do not commute with each other.
This means that for the narrow-band system of electrons B. Magnetic Polarons
represented by the spin dynamics influences the nature of
1. The Classical Approach
itinerant quasiparticle states of energies
k . What is even
more striking is that, as n → 1, the two terms in Eq. (35) It has been proved by Nagaoka [32] that in the limit
may contribute equally to the total energy. The critical W/U → 0 the ground state of the Mott insulator with one
concentration of electrons n c for which these two terms hole involves ferromagnetic ordering of spins. This is be-
are comparable is cause in this limit the antiferromagnetic exchange term
FIGURE 14 Commonly accepted magnetic phase diagram for strongly correlated electrons on the n–(W/U ) plane.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
in Eq. (35) vanishes and the band energy is lowest when Equation (36) holds for a three-dimensional system; for
σ =↑ = 1 and σ =↓ = 1 − n. We can thus choose an equi- a planar system, the factor (4/3)π (R/a)3 in the last term
librium state with n ↑ = n, n ↓ = 0, that is, a state with all should be replaced by the area π (R/a)2 . One then obtains
spins pointing up. the corresponding optimal values,
Mott, and Hertier and Lederer [23], has pointed out 1
that if W/U is small but finite, a hole may create locally 2πU 2
N =π (38a)
a ferromagnetic polarization of the spins in a sphere of W
radius R, surrounded by a reservoir of antiferromagneti- and
cally ordered spins. The situation is shown schematically 12
in Fig. 15. The energy of such a hole accompanied by a W 2π 2 W
cloud of saturated polarization can be estimated roughly as E0 = − 1− . (38b)
2 z 2πU
2
W a 4π R 3 zt 2 These size estimates will be needed later when discussing
E(R) = − + π |t| 2
+ , (36)
2 R 3 a U the hole states at the threshold for the transition from an-
where a is the lattice constant and t is the hopping integral tiferromagnetism to superconductivity in high-Tc oxide
ti j between the z nearest neighbors. In this expression, the materials. One should note that U/W must be apprecia-
first term is the band energy of a free hole in a completely bly larger than unity to satisfy the requirement N 1. In
ferromagnetic medium, the second represents the kinetic other words, the condition R a must be met, so that the
energy loss due to the hole confinement, and the third spin subsystem (and the hole dynamics) may be treated
involves the antiferromagnetic exchange energy penalty in the continuous–medium approximation, the condition
paid by polarizing the spins ferromagnetically within a under which Eq. (36) can be derived.
volume 4π R 3 /3. Minimizing this equation with respect
to R, we obtain the optimal number of spins contained in 2. The Quantum Approach: Two Dimensions
the cloud,
The motion of a single hole in the Mott insulator is
4π πU 3/5 much more subtle than the formation of the polaron dis-
N= , (37a)
3 W cussed above. Namely, if we consider n holes in the lower
and the polaron energy, Hubbard subband, then the probability of electron hop-
ping around is ≈n(1 − n), so effectively, the bandwidth
W 5π 2 W 2/5 of such itinerant states is Weff = zt(1 − n). For small n,
E0 = − 1− . (37b) we have W ≤ J , where J is the magnitude of the kinetic
2 3z U
exchange. In the limit of a single hole the dynamics is
determined by the magnitude of exchange interactions J ,
since Weff → 0. In effect, we have a hole moving slowly
in the background of antiferromagnetically ordered spins.
This picture seems to be a good representation of the hole
motion in highly insulating magnetic oxides such as NiO
and CoO. Instead, in high-temperature superconductors
individual polaronic states must overlap appreciably for
(n c ∼ 0.95) when the magnetic insulator → metal tran-
sition takes place. Therefore, some sort of homogeneous
state must be formed in the metallic phase. This is partic-
ularly so since high-temperature superconductors evolve
from a charge-transfer insulator, for which the gap for
2p → 3dn+1 (O2− → Cu1+ ) transitions is smaller than the
Hubbard gap U − W . In effect, the hole states are
hybridized 3d–2p states (in proportions 2:1), not pure
3d states due to copper ions. As a result, few alterna-
tive pictures of the fermionic liquid of strongly correlated
electrons in the normal (metallic) phase arise, starting
FIGURE 15 Representation of the magnetic polaron state, that
is, one hole in the antiferromagnetic Mott insulator. This hole pro- from the phenomenological pictures of marginal Fermi
duces ferromagnetic polarization around itself and may become liquid (Varma et al. [33a]) and nearly antiferromagnetic
self-trapped. Fermi liquid (NAFL) [33b, 33c] to a mean-field picture of
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
1 +
+√ Vk eik·Ri aiσ ckσ + H.C. . (43)
N kσ
In this Hamiltonian, the first term describes the energy of
atomic electrons positioned at
f , the second represents
the energy of band electrons, the third represents the in-
traatomic coulomb repulsion between two electrons of op-
posite spins, and the last describes the mixing of atomic
with band electrons due to the energetic coincidence (de-
generacy) of those two sets of states (H.C. refers to the
Hermitian conjugate part of the hybridization part). In
FIGURE 18 Schematic representation of singlet–spin pairing
forming the RVB state. All paired configurations should be taken
heavy fermions, the atomic states are 4 f states, whereas
to calculate the actual ground state. (a) The RVB state for the they are 3d states of Cu2+ ions in high-Tc systems; the
Mott insulator; (b) that with one hole. The latter case will contain band states are 5d–6s and 2p states, respectively. Note that
+ +
an unpaired spin, as indicated. Niσ = aiσ aiσ and n kσ = ckσ ckσ are the number of particles
on given atomic (i) or k states, respectively. In this Hamil-
tonian, the following parameters appear: the atomic–level
now arises as to what happens when the spin part of the
position
f , the width W of starting band states with en-
form of the second term in Eq. (35) is explicitly included.
ergies {
k }, the magnitude U of the coulomb repulsion
The problem of the resultant quantum ground state of holes
for two electrons located in the same atomic site, and the
in a Mott insulator is a matter of intensive debate [36–38].
degree of hybridization (mixing), Vk , characterized by its
The state called the resonating valence-bond (RVB) state
magnitude V .
has been involked [36] specifically to deal with this prob-
Two completely different situations should be distin-
lem; this state is shown schematically in Fig. 18 for the
guished from the outset: (1) U > W > |
f | |V |, and
case without holes (a) and with one hole (b). The connect-
(2) U > W > |V | |
f |. Case 1 applies when the starting
ing lines represent bonds, across which the two electron
(bare) atomic level is placed deeply below the Fermi level
from spin–singlet pairs. The resonating nature of bonds
and the atomic states admix weakly to the band states.
is connected with the idea that the RVB ground state is a
In case 2, the hybridization is large and is responsible
coherent superposition of all such paired configurations.
for strong mixing of the two starting sets of states. The
The dynamic nature of this spin dimerization is connected
band structure corresponding to the hybridized band states
with the terms (Si+ S −j + Si− S +j ) in the exchange part
in the absence of electron–electron interactions (that is,
of the Hamiltonian [Eq. (35)]. There is the possibility
U = 0) is depicted in Fig. 19. We observe a small gap
that the RVB state [which, for obvious reasons, differs
in the hybridized band structure; it occurs around the
from the ordinary (Néel) antiferromagnet] is a ground
bare atomic level position
f and separates two hybridized
state for the planar CuO2 planes in high-Tc oxides, such as
bands. Those two bands, which have the energies
La2−x Srx CuO4 , where the long–range magnetic order is
destroyed for x ≈ 0.02 ÷ 0.03. We return to this problem
k +
f
k −
f
in Section IV when discussing the boundary line between E k± = ± + |Vk | ,
2
2 2
antiferromagnetism and superconductivity for high-Tc
oxides. correspond to the bonding and antibonding types of states
in molecular systems. The structure of the hybridized
bands is demonstrated explicitly in Fig. 20 of the DOS for
C. Hybridized Systems
each band. One sees that strongly peaked structures occur
Most of the strongly correlated systems are encountered in in the regions near the gap. If the Fermi level falls within
oxides and in several classes of organic and inorganic com- these peaks, a strong enhancement of the effective mass
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
k = 2t(cos k x a + cos k y a), (44)
FIGURE 23 The configuration of the 3dx2 – y2 and pσ orbitals for FIGURE 24 The shape of the two—dimensional Fermi surface for
bonding configurations. The reverse signs for the two p—orbitals band energy of the form of Eq. (44). The values specified represent
(that is, those in parentheses) represent the hybridized configura- µ/t as a parameter. The square shape corresponds to µ = 0 or,
tion for the antibonding state. equivalently, to n = 1. [From Ref. 44.]
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
the tilded operators mean that they are projected onto sub- the average distance between holes in the normal phase is
space of singly occupied lattices sites. The three param- ≈5a, a magnitude comparable to ξ . These circumstances,
eters take the values t −0.5 eV, t 0.05–0.1 eV, and combined with antiferromagnetism and localization ef-
J 0.13 eV. fects, render the new superconducting materials unique in
On the basis of the facts that the present-day band cal- the sense that their description requires a unification of
culations do not provide paramagnetic insulating states theoretical approaches to phenomena previously regarded
for stochiometric materials, such as La2 CuO4 , and that as disparate.
the antiferromagnetic ground state is difficult to achieve The accumulated evidence for rather strong electron–
within the local–density approximation [49], we conclude electron interaction in high-Tc oxides [36,48] and in
that an approach based on the parametrized models dis- heavy–fermion systems [39–43] makes it unlikely that
cussed in the preceding two sections should be treated electron pairing in these materials is caused by ex-
in detail. The microscopic parameters obtained from the tremely strong electron–phonon interaction. Furthermore,
band–structure calculations should be treated as input pa- the electron–phonon interaction does not allow for a con-
rameters in those models. A review of properties obtained nection (or, strictly speaking, competition) between the
within the parametrized models and relevant to high-Tc observed superconductivity and antiferromagnetism [24].
systems is given in Section IV. This is one of the reasons for an intensive search for a
purely electronic mechanism of pairing. We now discuss
some of the mechanisms that have been proposed. The
IV. NOVEL MECHANISMS OF main emphasis so far has been placed on an exchange–
ELECTRON PAIRING mediated pairing for strongly correlated electrons [45],
since for such systems, the pairing, antiferromagnetism,
The binding of two fermions into either a bosonic or a and MITs to the Mott localized phase are derived from
more complicate bound state is a prerequisite for the con- a single theoretical scheme. The latter two phases have
densation of microscopic particles into a coherent (super- been discussed in Section III; here, we concentrate on the
fluid) macroscopic state. This condensation may take the spin–singlet pairing among strongly correlated electrons.
form of Bose–Einstein condensation if the interaction en- Later we discuss charge transfer- and phonon-mediated
ergy between the pairs is much lower than the binding pairings. Finally, we classify the types of correlated states
energy of a single pair (also, the pairs must be well sepa- and metallic states in solids. This classification provides a
rated spatially). Such a Bose condensed state of charged concise way of characterizing specific properties of these
particles may exhibit the principal properties of the super- systems by which the almost-localized systems differ from
conducting state such as the Meissner–Ochsenfeld effect ordinary metals.
[52]. In the BCS theory (discussed in Section II), pair con-
densation occurs under a completely different condition,
namely, when the states of different pairs overlap strongly A. Exchange Interactions and
so that the motion of one widely separated pair takes place the Real-Space Pairing
in the mean field of almost all other pairs.
The pairing of particles in the BCS theory is described 1. Narrow–Band Systems
in momentum (reciprocal) space, where it is assumed that In Section III.A we provided an approximate Hamiltonian
the quasiparticle states with a well-defined Fermi surface [Eq. (34)], which includes the antiferromagnetic exchange
are formed first; the pairing involves electrons from the interactions between the correlated electrons in the limit
opposite points on the Fermi surface (k, −k) and gener- U/W 1. The precise form of this Hamiltonian to second
ates either a simple spin–singlet state (as in the classic su- order in W/U is [31, 55]
perconductors) or a higher angular-momentum state, e.g.,
L = S = 1 (as in superfluid 3 He [53]). Because of the small 2ti2j 1
+
coherence length (ξ ∼ 10 Å), the new superconductors of- H = ti j biσ b jσ + Si · S j − νi ν j
i jσ ij
U 4
fer an opportunity for exploring the possibility of pairing
in real (coordinate) space. Moreover, since the carrier con- + (three–site terms), (46)
centration determined from the Hall-effect measurements
[54] for high-Tc oxides is at least one order of magnitude where the primed summation means that i = j. In this
lower than that for ordinary metals, it is tempting to de- Hamiltonian, doubly occupied site i configurations |i↑↓
scribe the onset of the superconducting state as a Bose con- are excluded. This exclusion is reflected by the presence of
+
densation of preexisting pairs. In fact, the situation is not creation (biσ ) and annihilation (biσ ) operators for electrons
that simple. For example, in La2−x Srx Cu4 , with x = 0.04, in the state |iσ , which are defined as
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
+ +
biσ ≡ aiσ (1 − n i−σ ) and biσ ≡ aiσ (1 − n i−σ ), (47) and
1
so that b̃i j = √ (bi↓ b j↑ − bi↑ b j↑ ), (49b)
2
+
νiσ = biσ biσ and νi = νiσ . (48) then the Hamiltonian [Eq. (46)] with inclusion of the
σ
three–site part can be written in the following very sug-
The spin operator is defined as gestive closed form [55]:
+
Si ≡ Si+ , Si− , Siz ≡ ai↑ +
ai↓ , ai↓ ai↑ , (n i↑ − n i↓ )/2 . H= +
ti j biσ b jσ − (2ti j t jk /U )b̃i+j b̃k j . (50)
i jσ ij
Note that the same representation of the operator Si can
+
be written in terms of projected operators biσ and biσ . The The first term represents, as before, the dynamics of sin-
factor (1 − n i−σ ) in Eq. (47) imposes explicitly the restric- gle electrons moving between the empty sites regarded
tion that the creation or the annihilation of electrons in the as holes; the second term combines the last two terms in
state |iσ can take place only if there is no
second electron Eq. (46) and expresses the dynamics of the singlet pairs
already on the same site. Thus, νi = σ n iσ (1 − n i−σ ) [cf. Eqs. (49a) and (49b)]. The division in Eq. (50) into
enumerates only the singly occupied sites (νi = 0 or 1). In single–particle and pair parts is in analogy to the BCS
other words, the N states corresponding to the doubly oc- Hamiltonian; however, here, the operators are expressed
cupied site configurations have been projected out. Thus, in coordinate space. The term with i = k in the pairing part
Eq. (46) describes the dynamics of strongly correlated enumerates the spin–singlet pairs of neighboring spins; the
electrons for Ne ≤ N of electrons. Also, in performing terms with i = k represent pair hopping of such singlet pair
the summations in Eq. (46), one usually considers only the bonds. Thus, in the language of operators [Eqs. (49)], one
pairs i j of nearest neighbors; in this approximation the adds the bond dynamics to that of single electrons. More-
parameters Ji j = J and ti j = t can be chosen as constants. over, the forms of Eqs. (46) and (50) are completely equiv-
The first term in Eq. (46) describes the single–particle alent; hence, the pairing effect and the antiferromagnetism
hopping of electrons from the singly occupied to the empty should be directly linked within this formalism (they are
atomic sites; the second describes the exchange interaction two different expressions of the same part of H ).
induced by virtual hopping between site i and site j, while It is difficult to diagonalize the Hamiltonian [Eq. (50)]
the three-site part describes the motion of electron with to obtain the eigenvalues of the system. Part of the prob-
spin σ from the singly occupied site located at i to the lem arises from the fact that the single–particle operators
next-nearest neighboring empty site k via the occupied biσ and b+jσ do not obey the fermion anticommutation re-
configuration (with electron of opposite spin) located at lation and that the pair operators b̃i+j and b̃i j do not obey
site j. The various contributions to Eq. (46) are represented boson commutation relations. Additionally, the two terms
graphically in Fig. 26. in (3.5) do not commute, so that the itinerant characteris-
If one introduces a new pair of creation and annihilation tics of the electrons and the pair-binding effects combine
operators in coordinate space by and produce a paired metallic phase, particularly if the
two terms are of comparable magnitude. We have seen
1 + +
b̃i+j = √ (bi↑ + +
b j↓ − bi↓ b j↑ ) (49a) in Section III that if the number of holes δ ≡ 1 − n <
2 δc ∼ 0.02, then the pairing (or exchange) part dominates
and antiferromagnetism sets in. Detailed calculations [32]
lead to the boundary line between the antiferromagnetic
and the ferromagnetic phase, as shown in Fig. 27. The
energy of the completely saturated ferromagnetic phase
(CF) indicated does not depend on the value of exchange
integral Ji j ≡ 2ti2j /U .
We now discuss the superconducting phase for which
the pairing part in Eq. (50) plays a crucial role. To make the
problem tractable at this point, one replaces the operators
[Eqs. (49)] by fermion operators [45, 56], that is,
+ +
FIGURE 26 Various hopping processes in narrow-band systems biσ → aiσ , biσ → aiσ , (51a)
in a partial band-filling case: (a) virtual hopping processes leading
to a kinetic exchange interaction; (b) single-particle hopping rep- and introduces the replacement
resenting the band energy of correlated electrons; (c) contribution
1 + +
to the pair hopping—this process gives the pairing contribution in b̃i+j → bi+j = √ ai↓ + +
a j↑ − ai↓ a j↑ (51b)
Eq. (50) with k = i . 2
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
(sd)
k = s(s) (d)
k + dk . (55c)
+ λi bi+ bi + f iσ+ f iσ −1
i σ
−µ n iσ + (three–site terms). (57)
iσ
2. Hybridized Systems
The electron states near the Fermi surface in high-Tc
oxides such as La2 CuO4 involve hybridization of elec-
trons of atomiclike 3dx 2 −y 2 states of copper with 2pσ of
oxygen (cf. Fig. 23 and Section III.D). These electron
states can be described by the Anderson lattice Hamilto-
nian of the type of Eq. (43), with a width of the bare p–
band W ≈ 4 ev, the position of the 3d 9 level at
f ≡
d − FIGURE 30 Division of the charge-transfer ( p–d ) processes into
p ∼ 1 eV, U ≤ 10 eV, and hybridization magnitude low- and high-energy parts. The processes labeled II give rise to
|V | 1.5 eV [68]. The hybridization is intersite in na- Kondo and superexchange interactions when treated perturba-
ture, that is, it involves the 2p and 3d orbitals located on tionally to second and fourth order, respectively.
different sites. Therefore, the effective hybridization en-
ergy is V z 6 eV, where z = 4 is the number of nearest-
called hybrid interorbital pairing with the pairing opera-
neighboring O atoms in the plane for a given Cu atom. We
tors
see that V z >
f ; hence, the 3d and 2p states mix strongly,
that is, the d electrons can be promoted to 2p–hole states, + 1 + + + +
Bim = √ bi↑ cm↓ − bi↓ ci↑ (59a)
and vice versa. Additionally, 2p electrons can be promoted 2
to form the 3d10 configurations of the excited states. If and
V z
f , but |V |z
f + U , the above two promotion–
1
mixing events are low– and high–energy processes, re- Bim = √ (bi↑ cm↓ − bi↓ ci↑ ). (59b)
spectively. The situation is shown schematically in Fig. 30, 2
where the parameter U is assumed to be by far larger
than |
f |, W , or |V |z. We consider this limiting situation
first [68].
The high–energy processes take place only as virtual
events, that is, with electron hopping from the p state to
the highly excited 3d state and back. Such virtual p–d– p
processes are shown schematically in Fig. 31, where site
m labels the 2pσ state of the oxygen anion O2− centered at
Rm and site i labels 3dx 2 −y 2 due to the Cu2+ ion centered
at Ri . Then the effective Hamiltonian can be rewritten in
the real–space language and for large U reads [68]
+
+
H =
k n kσ +
f biσ biσ + Vim biσ cmσ
kσ iσ imσ
∗ +
2Vmi ∗
Vim +
+ Vim cmσ biσ − B Bin . (58)
imn
U +
f im
kσ iσ i jmσ
f 3. An Overview
2V ∗ Vin U
+
× biσ b jσ (1 − n mσ ) mi +
bim bin . (60) Two alternative models and mechanisms of exchange–
(
imσ f f
+ U ) mediated pairing have been discussed so far: the narrow–
band model, with d–d kinetic exchange-mediated pairing,
We now have a two–band system: the 3d electrons acquire
and the hybridized model, with d– p Kondo interaction-
a bandwidth W ∗ ∼ (V 2 /
f )(1 − n mσ ). The spin–singlet
mediated pairing. The hybridized model should be
pairing is again of the interband type. The part with m = n
regarded as a basis of narrow–band behavior in real ox-
in the last term is equivalent to the Kondo interaction de-
ides and in heavy–fermion systems since the direct d–d (or
rived a long time ago for magnetic impurities [69]. Here,
f – f ) overlap of the neighboring atomic wave functions
the lattice version of this Hamiltonian provides both pair-
is extremely small. Next, we give a brief overview of the
ing and itinerancy to the bare atomic electrons.
Note that the hybrid pairing introduced in this section
expresses both the Kondo interaction (the two–site part)
and pair hopping. It is therefore suitable for a discussion
of the superconductivity of Kondo lattice effects in heavy-
fermion systems. The pairing part supplements the cur-
rent discussions of the Anderson lattice Hamiltonian in
the U → ∞ limit [40–42]. One may state that the Kondo
interaction-mediated pairing introduced above represents
the strong–coupled version of spin fluctuation-mediated
pairing for almost-localized systems introduced previ-
ously [70].
An approach using the slave-boson language for hy- FIGURE 33 Superconducting transition temperature Tc versus
bridized systems has also been formulated [71] and con- hole concentration for La2−x Srx CuO4 over the full range. LM, the
tains a principal feature of the effective Hamiltonian regime of local moments (insulating phase). [From Ref. 72.]
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
narrow–band properties of the correlated electrons start- by virtual d–d transitions; cf. Section III.A) is expressed
ing from the hybridized (Anderson lattice) model. as a fourth-order effect in the hybridization V since the
First, we discuss the quasiparticle states in the U → ∞ virtual d–d transition involves a sequence of d– p and p–d
limit. The simplest approximation is to reintroduce ordi- transitions in the fourth order.
+
nary fermion operators aiσ and aiσ in Eq. (58) and readjust The possible macroscopic states of hybridized systems
the hybridization accordingly [73]. In effect, one obtains are illustrated in Fig. 34 as a schematic classification of
the hybridized bands of the form of Eq. (45), that is, possible states of hybridized systems modeled by the pe-
12 riodic Anderson Hamiltonian [Eq. (43)]. The parameter
f +
k
f −
k 2 W/U characterizes the degree of correlation of quasi–
E k± = ± + 4|Ṽ k |2 , (61)
2 2 atomic electrons that may acquire a nonzero bandwidth
1 due mainly to hybridization; the parameter V /
f charac-
where Ṽ k ≡ q 2 Vk , and q ≡ (1 − n f )/(1 − n f /2) for 0 ≤ terizes the degree of mixing of the states involved. If the
n f ≤ 1, while Vk is the space Fourier transform of Vim . d (or f ) atomic level lies deeply below the top of the val-
For the case of the CuO2 layers [74], ence band (V /
f 1), then we have either Mott–Hubbard
(M–H) or charge–transfer (C–T) insulators; for the for-
2 kx a 2 kya
|Ṽ k | = q V sin
2 2
+ sin . (62) mer the band gap is due to d n → d n+1 excitations (that
2 2
is, ∼ U − W ), whereas for the latter it is due to d n p 2 →
If the Fermi level falls into the lower hybridization band d n+1 p 1 charge–transfer transitions. The atomic 3d (or 4f )
and n f = 1 − δ, with δ 1, then it can be shown that electrons are unpaired in both the C–T and the M–H states.
the quasiparticles describing the hybridized states are of If V /
f 1, and W/U 1 then we enter mixed valent
mainly quasi-atomic character. In other words, the ef- (M–V) and (close to the border with M–H) heavy–fermion
fective Hamiltonian [Eq. (58)] is approximately of the regimes. On the other hand, if W/U 1, then irrespec-
narrow-band form [Eq. (52)]. The pairing takes place be- tive of the value of V /
f , we encounter the correlated–
tween heavy quasiparticles. This limiting situation de- metal regime that we call an almost–localized Fermi liquid
scribes qualitatively the situation in heavy fermions with (AL–FL). Both heavy–fermion and high-Tc systems are
Kondo interaction mediating the pairing. In contrast, if the close to the line separating M–H and M–V regimes. Such a
Fermi level falls close to the top of the upper hybridiza- classification scheme for transition-metal oxides has been
tion band (as is the case for high-Tc superconductors, since proposed in Ref. [77].
the p band is almost full and the 3d level is almost half- The classification shown schematically in Fig. 34 pro-
filled), then the pairing is due mainly to the band electrons vides only a distinction between insulating and metallic
(2p holes in the case of high-Tc oxides). These results are states. A complete magnetic phase diagram for the high-Tc
obtained by constructing explicItly the eigenstates corre- system La2−x Srx CuO4 is shown schematically in Fig. 35
sponding to the eigenvalues [Eq. (61)] and taking the lim- (taken from Ref. 79a). Stoichiometric or doped La2 CuO4 ,
its corresponding to heavy fermions (n f → 1) and high-Tc
systems (n = n d + n p ≈ 3, which also corresponds to the
situation of one hole in the system).
clear experimental evidence that these two phases coex- Three specific features of Ba1−x Kx BiO3 compounds
ist in a high-Tc system, though there is some evidence should be noted. First, the diamagnetic nature of the par-
from muon–spin rotation that it is so [82]. A clear detec- ent compound BaBiO3 distinguishes the systems from the
tion of such coexistence would demonstrate directly the parent compounds La2 CuO4 and YBa2 Cu3 O7 , which are
importance of exchange interactions in a superconduct- both antiferromagnetic. Second, the Ba1−x Kx BiO3 sys-
ing phase. Namely, within exchange-mediated supercon- tems are copper-free and have a truly three–dimensional
ductivity, one can show [55] that close to the half-filled cubic structure in the SC phase [92]. Third, their main su-
narrow-band case, TN /Tc ∼ 6 ÷ 8 (this is a mean-field- perconducting properties are in accordance with the pre-
approximation estimate). The analysis of AF–SC coexis- diction of the standard BCS theory [96].
tence conditions within the Anderson lattice Hamiltonian The theory of the Ba1−x Kx BiO3 compound must in-
has not yet been performed satisfactorly, even though those corporate three additional obvious facts. First, the pair-
two phases coexist in heavy-fermion compounds such as ing process 2Bi4+ → Bi3+ + Bi5+ is possible when the
UPt3 and URu2 Si2 . electron–lattice coupling leads to an attraction overcom-
ing the e–e repulsion in the Bi3+ state relative to the Bi5+
state [89]. It involves a relaxation of the O2− octahedra,
B. Phonons and Bipolarons
that is, an optical, almost dispersionless, breathing mode.
After the discoveries of superconductivity in the 40 and This can provide a local (on-site) attractive interaction be-
90 K ranges [83], the obvious question was posed whether tween 6s electrons of the type λn i↑ n n↓ , which leads to a
the phonon–mediated mechanism of pairing, so successful scalar (k–independent) pairing potential Vkk = λ, which,
in the past, can explain the superconductivity with such a in turn, provides a justification for the observed properties
high value of Tc . It was realized from the outset that one reflecting an isotropic shape of the gap (k ≡ ), as in the
should include specific properties of these compounds, standard BCS theory (cf. Section II).
such as the quasi-planar (CuO2 ) structure with a logarith- Second, from the fact that the parent compound BaBiO3
mic (Van Hove) singularity in the density of states ρ(
) is an insulator, we conclude that either the magnitude
at the middle point of the two–dimensional band [83–85], V of the coulomb repulsion between the electrons on
the polar nature of the CuO bonds rendering applicable nearest-neighboring Bi atoms exceeds the width W ∗ of
the tight–binding representation of the electronic states the bipolaron band [96] or the small bipolarons are self-
[49, 84], and strong electron lattice coupling [85–87], trapped in the potential created by interaction with nearest-
leading to the local formation of small bipolarons (that neighboring oxygens. The onset of the metallic phase at
is, two–electron pairs) [89] that may undergo Bose con- concentrations near the percolation threshold xc ∼ 0.1 for
densation when the metallic state is reached [in a more re- n.n. interaction means that both effects may be important.
fined version a mixed-fermion model is used (cf. Ranniger, In either case, the CDW state will set in, so the entropy
Ref. 90)]. of the bipolaron lattice vanishes at T = 0 (at least, for
There is no clear evidence for the phonon–mediated x = 0). The CDW phase plays the same role here as does
mechanism of pairing in classic high-Tc superconduc- AF ordering in La2 CuO4 and YBa2 Cu3 O6 . The proper-
tors since the isotope effect in both La systems [90] ties of Ba1−x Kx BiO3 are instead similar to those of the
and Y systems [91] is quite small. However, the recen- Ba1−x Pbx BiO3 compounds discovered over a decade ear-
tly discovered superconductors Ba1−x Kx BiO3 [92] exhi- lier [98].
bit a large isotope effect [93] and superconductivity Third, the fact that the onset of the superconductivity
with 20 K ≤ Tc 30 K in the concentration range 0.25 coincides with the transition from the CDW insulator to
x 0.4. Also, the proximity of superconductivity and the an SC metal speaks in favor of preexisting electron pairs
charge density—wave (CDW) state is observed [94]. already present in the insulating phase. However, the bipo-
The last property, as well as the observed diamagnetism laron concentration is large, and hence, the interpretation
in the insulating phase x 0.25, is very suggestive [95] of the superconducting transition as Bose condensation
that small trapped polarons are formed before the elec- of bipolarons may be inapplicable even when the coher-
tron subsystem condenses into a superconducting phase. ence length is small. The overall theoretical situation is
Condensation takes place when the percolation threshold nonetheless much clearer for Ba1−x Kx BiO3 compounds
for the insulator–metal transition is reached3 (at x ∼ 0.2). than for either the La2−x Srx CuO4 or the YBa2 Cu3 O7−δ
series since the accumulated (so far) experimental evi-
3 The actual percolation threshold for the onset of the metallic phase
dence indicates that (optical?) phonon–mediated pairing
is xc /2 ∼ 0.12 since the bipolarons reside on every alternate Bi lat-
tice site. Also, the holes introduced by K doping must be present in
takes place [99].
a Bi–O hybridized band for x > xc to render the bipolarons mobile for The Ba1−x Kx BiO3 compounds seem to be natural can-
x > xc . didates for a bipolaronic mechanism of electron pairing
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
indicated [107], the temperature dependence of the Lon- as supplying some details concerning the specific features
don penetration depth is close to [1 − (T /Tc )4 ]− 2 over
1
of the system under consideration: the gap anisotropy, the
a wide temperature range [108], and the electron pair- role of hybridization, etc. It remains to be seen if some
ing is in the spin-singlet state [109]. Additionally, the qualitative differences arise if superconductivity should
shape of the Fermi surface for YBa2 Cu3 O7 , as deter- occur as a result of Bose condensation of the preexist-
mined by the positron annihilation technique [110], agrees ing pairs. This question is particularly important in the
with the predictions of the band-structure calculations for case when the coherence length is small, as in high-Tc
an even (n ≈ 4) number of electrons. These results do systems.
not necessarily eliminate the principal features obtained In the coming years, one should see detailed calcula-
from the theory of strong electron correlations. We think tions within the exchange mechanism and comparisons
that before discarding the theory based on electron cor- with the experiment concerning the complete phase dia-
relations, we must show clearly that the stoichiometric gram, as well as the thermodynamic and electromagnetic
La2 CuO4 or YBa2 Cu3 O6 compounds are not insulating in properties of the new superconductors La2−x Srx CuO4 and
the paramagnetic phase; actually, they seem to be param- YBa2 Cu3 O7−δ . It would not be surprising if the final
agnetic insulators with well–defined magnetic moments answer for these systems came from a detailed analy-
(that is, with the Curie–Weiss law for the magnetic sus- sis of the model outlined in Section IV.A. The systems
ceptibility obeyed), which supports strongly the view that Ba1−x Kx BiO3 will probably be described satisfactorily
they are Mott insulators. In this respect, the situation in within the standard phonon–mediated mechanism. On the
heavy–fermion systems is rather clear since the recent other hand, it is too early to say anything definite about Bi
theoretical results [39–42] based on the theory of strongly and T compounds with Tc > 100 K, though the suggested
correlated and hybridized states provide a reasonable ra- influence of the electronic structure near
F by the CuO2
tionalization of most of the properties of their normal state. planes seems to indicate a nontrivial role of the exchange
The mechanism of pairing in superconducting heavy– interactions also in those systems when coupled with in-
fermion systems has not yet been determined fully; but terlayer pair tunneling. One of the missing links between
in view of the circumstances that some of the supercon- the properties of the last two classes of compounds and
ductors (for example, UPt3 ) are antiferromagnetic and ex- those of La2−x Srx CuO4 is the conspicuous lack of evi-
hibit pronounced spin fluctuations in the normal state, dence for antiferromagnetism in the Bi2 Sr2 CaCu2 O8 and
the spin–fluctuation mechanism in the version outlined the Tl2 Ca2 Ba2 Cu3 O10−y compounds.
in Section IV.A.2 is a strong candidate [111]. In the com-
ing years one should be able to see a clarification of these Note added in August 2000. This article was orig-
problem. inally written almost 12 years ago. During those years
Let us end with a methodological remark concerning the a tremendous number of papers has been published, but
analogy of the studies of magnetism and superconductiv- the questions concerning either the pairing mechanism or
ity. In 1928, Heisenberg introduced the exchange inter- the non-Fermi liquid behavior have not been clearly re-
action Ji j Si · S j between the magnetic atoms with spins solved, either for high-TC or for heavy-fermion supercon-
{Si }. The ferromagnetic state was understood in terms of ductors. Nonetheless, a number of experimental results
a molecular field Hi ∼ Si which was related to the direct have been obtained in a clear form for high-TC systems.
exchange integral Ji j . Later, various other exchange in- Let us mention two additional results. First, the role of
teractions have been introduced, such as superexchange, the hopping between the next neighbors is important for
double exchange, RKKY interaction, the Bloembergen– obtaining the open Fermi surface (cf. Fig. 41). This Fermi
Rowland interaction, Hund’s rule exchange, and kinetic surface is obtained from the photoemission and encom-
exchange, to explain magnetism in specific systems, such passes all electrons [112], i.e., not only the hole states
as oxides, rare-earth metals, and transition metals. How- in the doped Mott insulator. Thus, the principal question
ever, all these new theories provided a description in terms is how to reconcile the strong-correlation nature of the
of a single–order parameter—the magnetization Si ; the electrons in the CuO2 plane, as reviewed above with the
particular feature of the electron states in each case (local- Luttinger theorem, which seems to be obeyed in optimally
ized states, itinerant states, or a mixture of the two states) doped and overdoped systems, as concluded from the pho-
is contained only in the way of defining this order param- toemission data. Does this mean that the photoemission
eter or the exchange integral. By analogy, the BCS theory experiment samples states physically different from those
provided a concept of a superconducting order parameter involved in thermally induced transport properties? The
(k ), which is universal for all theories of singlet super- Fermi-liquid features seem clearly to break down in un-
conductivity. New mechanisms of pairing should provide derdoped systems [113], where a pseudogap related to the
a novel interpretation to the coupling constant Vkk as well superconducting gap is also observed [114].
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
✭
28. Spal ek, J., Oleś, A. M., and Honig, J. M. (1980). Phys. Rev. B 28, 44. For a review of experimental properties of heavy fermions, see
6802. Stewart, G. R. (1984). Rev. Mod. Phys. 56, 755. Fisk, Z., et al.
29. Brinkman, W. F., and Rice, T. M. (1970). Phys. Rev. B 2, 4302. (1986). Nature (London) 320, 124. Steglich, F. (1955). In “The-
✭
30. Spal ek, J., Datta, A., and Honig, J. M. (1987). Phys, Rev. Lett. 59, ory of Heavy Fermions and Valence Fluctuations” (T. Kasuya and
✭
728. Spal ek, J., Kokowski, M., and Honig, J. M. (1989). Phys. Rev. T. Saso, eds.), p. 23ff, Springer-Verlag, Berlin, New York.
B 39, 4175. For d = ∞ solution see, e.g., Gebhard, F. (1997). “The 45. Wróbel, P., and Jacak, L. (1988). Mod. Phys. Lett. B 2, 511.
Mott Metal–Insulator Transition,” Springer Verlag, Berlin. 46. Anderson, P. W. (1987). Science 235, 1196.
31. For the Mott insulators, the kinetic exchange interaction was intro- 47. Shirane, G., et al. (1988). Phys. Rev. Lett. 59, 1613.
duced by in Anderson, P. W. (1959). Phys. Rev. 115, 2. This result 48. Chakravarty, S., et al. (1988). Phys. Rev. Lett. 60, 1057.
was extended to the case of a strongly correlated metal by Chao, 49. Nucker, M., et al. (1987). Z. Phys. B 67, 9. Fujimori, A., et al.
✭
K. A., Spal ek, J., and Oleś, A. M. (1977). J. Phys. C 10, L271. (1987). Phys. Rev. B 35, 8814. Steiner, P., et al. (1988). Z. Phys. B
✭
Cf. also Spal ek, J., and Oleś, A. M. (1976). Jagiellonian University 69, 449.
preprint SSPJU–6/76, Oct. Zaanen, J., and Sawatzky, G. A. (1990). 50. Mattheis, L. F. (1987). Phys. Rev. Lett. 58, 1028. Yu, J., Free-
J. Solid State Chem. 88, 8ff. man, A. J., and Xu, J.-H. (1987). Phys. Rev. Lett. 58, 1035.
✭
32. Vischer, P. B. (1974). Phys. Rev. 10, 943. Spal ek, J., Oleś, A. M., Szpunar, B., and Smith, V. H., Jr. (1988). Phys. Rev. B 37, 2338.
and Chao, K. A. (1981). Phys. Stat. Sol. (b) 108, 329. Nagaoka, Y. For review see Hass, K. C. (1989). In “Solid State Physics” (H.
(1966). Phys. Rev. 147, 392. Ehrenreich and D. Turnbull, eds.), Vol. 42, Academic Press, San
33. (a) Varma, C. M., et al. (1989). Phys. Rev. Lett. 63, 1996; Diego, CA.
(1989). Int. J. Mod. Phys. B 3, 2083–2118; Littlewood, P. B., and 51. Fulde, P. (1988). Physica 153–155, 1769. Hybertsen, M. S., et al.
Varma, C. M. (1991). J. Appl. Phys. 69, 4979. (b) For review see (1994). Phys. Rev. B 41, 11068.
Pines, D. (1998). In “The Gap Symmetry and Fluctuations in High- 52. Zhang, F. C., and Rice, T. M. (1988). Phys. Rev. B 37, 3759.
TC Superconductors” (J. Bok et al., eds.), Plenum Press, New York, 53. Schafroth, M. R. (1955). Phys. Rev. 100, 463.
NATO ASI Series, Ser. B, Vol. 371, and references therein. (c) 54. For a review see Leggett, A. (1975). Rev. Mod. Phys.
Moriya, T., and Ueda, K. (2000). Adv. Phys. 49, 555–606, and ref- 55. Shafer, M. W., Penney, T., and Olson, B. L. (1987). Phys. Rev. B
erences therein. (d) Lee, P. A., and Nagaosa, N. (1992). Phys. Rev. B 36, 4047.
✭
46, 5621; for review see Nagaosa, N. (1999). “Quantum Field The- 56. Spal ek, J, (1988). Phys. Rev. B 37, 533. Acquarone, M. (1988).
ory in Strongly Correlated Electronic Systems,” Chap. 5, Springer Solid State Commun. 66, 937.
Verlag, Berlin; Lee, P. A. (1996). J. Low Temp. Phys. 105, 581. (e) 57. Baskaran, G., Zou, Z., and Anderson, P. W. (1987). Solid State
van den Brink, J., et al. (1995). Phys. Rev. Lett. 75, 4658; (1996). Commun. 63, 973.
✭
Phys. Rev. Lett. 76, 2826. (f ) Bal a, J., et al. (1995). Phys. Rev. B 58. Cyrot, M. (1987). Solid State Commun. 62, 821.
52, 4597. (g) Dagotto, E. (1994). Rev. Mod. Phys. 66, 763, and 59. Ruckenstein, A. E., Hirschfeld, P. J., and Appel, J. (1987). Phys.
references therein; Dagotto, E., and Rice, T. M. (1996). Science Rev. B 36, 857.
271, 618; Dagotto, E. (1998). J. Phys. Chem. Solids 59, 1699. (h) 60. Kotliar, G. (1988). Phys. Rev. B 37, 3664.
Zaanen, J. (1998). J. Phys. Chem. Solids 59, 1769; Castellani, C., 61. Isawa, Y., Maekawa, S., and Ebisawa, H. (1987). Physica 148B,
et al., ibid., p. 1694; Kivelson, S. A., ibid. p. 1705, and references 391.
therein. 62. Zou, Z., Anderson, P. W. (1988). Phys. Rev. B 37, 627.
34. (a) Takagi, H., et al. (1992). Phys. Rev. Lett. 69, 2975. (b) Rossat- 63. Inui, M., Doniach, S., Hirschfeld, P. J., and Ruckenstein, A. E.
Mignot, J., et al. (1991). Physica C 185–189, 86; Bourges, P., et al. (1988). Phys. Rev. B 37, 2320.
(1997). Phys. Rev. B 56(11) 439. (c) Anderson, P. W. (1997). “The 64. Suzumura, Y., Hasegawa, Y., and Fukuyama, H. (1988). J. Phys.
Theory of Superconductivity in the High-TC Cuprates,” Prince- Soc. Jpn. 57, 2768.
ton University Press, Princeton, NJ; Byczuk, K., Spalek, J., and 65. Kotliar, G., and Liu, J. (1988). Phys. Rev. B 38, 5142.
Wójcik, W. (1998). Acta Phys. Polonica B 29, 3871; Varma, C. M. 66. Nagaosa, N. (1966). In “Proceedings of the 10th Anniversary HTS
(1997). Phys. Rev. B 55(14), 554. Workshop on Physics, Materials, and Applications” (B. Batlogg
✭
35. Spal ek, J., and Wójcik, W. (1988). Phys. Rev. B 37, 1532. et al., eds.), pp. 505ff, World Scientific, Singapore, and references
36. Kivelson, S. A., Rokhsar, D. S., and Sethna, J. P. (1987). Phys. Rev. therein.
B 35, 8865. Anderson, P. W., and Zou, Z. (1988). Phys. Rev. Lett. 67. (a) Labbé, J., and Bok, J. (1987). Europhys. Lett. 3, 1225;
60, 132. Newns, D. M., et al. (1992). Comments Cond. Mat. Phys. 15, 273.
37. Anderson, P. W. (1988). Cargese 1988, Lecture Notes. Also (1987). (b) Bouvier, J., and Bok, J. (1998). In “The Gap Symmetry and
In “Proceedings of the International School ‘Enrico Fermi’—1987: Fluctuations in High-TC Superconductors” (J. Bok et al., ed.), pp.
Frontiers and Borderlines in Many—Particle Physics,” North- 37–54, Plenum Press, New York. (c) Van Harlingen, D. J. (1995).
Holland, Amsterdam. Rev. Mod. Phys. 67, 515. (d) Annett, J. F., Goldenfeld, N., and
38. For a recent review, see Fukuyama, H., Hasegawa, Y., and Suzu- Legett, A. J. (1996). In “Physical Properties in High-Temperature
mura, Y. (1988). Physica C 153–155. Superconductors,” Vol. 5 (D. M. Ginsberg, ed.), pp. 375–461, World
39. Anderson, P. W. (1988). Physica C 153–155. Scientific, Singapore. (e) Chakravarty, S., et al. (1993). Science 261,
✭
40. Lee, P. A., Rice, T. M., Serene, J. W., Sham, L. J., and Wilkins, J. W. 337. (f ) Byczuk, K., and Spal ek, J. (1996). Phys. Rev. B 53, R518.
(1986). Comments Condens. Matter. Phys. 12, 99. (g) Leggett, A. J. (1998). J. Phys. Chem. Solids 59, 1729; Tsvetkow,
41. Rice, T. M. (1987). “In Proceedings of the International School A. A., et al. (1998). Nature 395, 360. (h) Lawrence, W. E., and Do-
of Physics ‘Enrico Fermi’—1987: Frontiers and Borderlines in niach, S. (1971). In “Proc. 12th Int. Conf. Low Temp. Phys.” (E.
Many—Particle Physics,” North-Holland, Amsterdam. Kanda, ed.), Keigaku, Tokyo; see also Tinkham, M., in Ref. 4.
42. Newns, D. M., and Read, N. (1987). Adv. Phys. 36, 799. 68. Mila, F. (1988). Phys. Rev. B 38, 11358, and references therein.
✭ ✭
43. Fulde, P., Keller, J., and Zwicknagl, G. (1988). In “Solid State 69. Spal ek, J. (1988). Phys. Rev. B 38, 208. Spal ek, J. (1988). J. Solid
Physics” (H. Ehrenreich and D. Turnbull, eds.), Vol. 41, Academic State Chem. 76, 224.
Press, San Diego, CA. 70. Schrieffer, J. R., and Wolff, P. A. (1966). Phys. Rev. 149, 491.
P1: GQT/GUE P2: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-750 July 31, 2001 15:31
71. Anderson, P. W. (1984). Phys. Rev. B 30, 1549. Miyake, K., Robaszkiewicz, S. (1990). Rev. Mod. Phys. 62, 113; Ranninger, J.
Schmitt–Rink, S., and Varma, C. M. (1986). Phys. Rev. B 34, 6554. (1998). J. Phys. Chem. Solids 59, 1759, and references therein.
Scalapino, D. J., Loh, E., and Hirsch, J. E., (1986). Phys. Rev. B 91. Batlogg, B., et al. (1987). Phys. Rev. Lett. 59, 912. Faltens, T. A.,
34, 8190. Norman, M. R. (1987). Phys. Rev. Lett. 59, 232. Nor- et al. (1987). Phys. Rev. Lett. 59, 915.
man, M. R. (1988). Phys. Rev. B 37, 4987. 92. Leary, K. J., et al. (1987). Phys. Rev. Lett. 59, 1236.
72. Newns, D. M. (1987). Phys. Rev. B 36, 5595. Newns, D. M. (1988). 93. Mattheis, L. F., Gyorgy, E. M., and Johnson, D. W., Jr. (1988).
Phys. Scripta T 23, 113. Phys. Rev. B 37, 3745. Cava, R. J., et al. (1988). Nature 332, 814.
73. Torrance, J. B., et al. (1988). Phys. Rev. Lett. 61, 1127. Hinks, D. G., et al. (1988). Nature 333, 6176.
74. Rice, T. M., and Ueda, K. (1985). Phys. Rev. Lett. 55, 995. 94. Hinks, D. G., et al. (1988). Nature 335, 419.
Rice, T. M., and Ueda, K. (1986). Phys. Rev. B 34, 6420. 95. Pei, S., et al., preprint.
75. Miyake, K., Matsuura, T., Sano, K., and Nagaoka, Y. (1988). J. 96. Rice, T. M. (1988). Nature 332, 780.
Phys. Soc. Jpn. 57, 722. 97. Da̧browski, B., personal communication.
76. Anderson, P. W. (1959). Phys. Rev. 115, 2. Also (1963). In “Solid 98. Varma, C. M. (1988). Phys. Rev. Lett. 61, 2713.
State Physics” (F. Seitz and D. Turnbull, eds.), Vol. 14, pp. 99–213, 99. Sleight, A. W., Gillson, J. J., and Bierstedt, P. E. (1975). Solid
Academic Press, New York. Cf. also Zaanen, J., and Sawatzky, State Commun. 17, 27.
G. A. (1990). J. Solid State Chem. 88. 100. For critical estimates of isotope shifts of the Tc value, see
77. For a review see, e.g., Vonsovskii, S. V. (1974). “Magnetism” Wiley, Allen, P. B. (1988). Nature 335, 396.
New York. 101. Chakraverty, B. K. (1979). J. Phys. Lett. 40, L99, Alexandroy, A. S.,
78. Zaanen, J., Sawatzky, G. A., and Allen, J. W. (1985). Phys. Rev. and Ranninger, J. (1981). Phys. Rev. B 23, 1796. Alexandrov, A. S.,
Lett. 55, 418; (1986). J. Magn. Magn. Mat. 54–57, 607. Ranninger, J., and Robaszkiewicz, S. (1986). Phys. Rev. B 33,
79. (a) Aharony, A., et al. (1988). Phys. Rev. Lett. 60, 1330. (b) 4526.
Tranquada, J. M., et al. (1988). Phys. Rev. Lett. 60, 156. 102. Rice, T. M. (1988). Nature 332, 780.
✭
80. Spal ek, J., and Honig, J. M. (1990). In “Studies of High- 103. Prelovšek, P., Rice, T. M., and Zhang, F. C. (1987). J. Phys. C 20,
Temperature Superconductors” (A. Narlikar, ed.), Vol. 4, Nova L229.
Science, New York. 104. Little, W. A. (1964). Phys. Rev. 134A, 1416. Ginzburg, V. L.
81. The p–p pairing was discussed first by Emery, V. J. (1987). Phys. (1970). Sov. Phys. Uspekhi 13, 335.
Rev. Lett. 58, 2794, and also by Emery, V. J., and Reiter, G. (1988). 105. Varma, C. M., Schmitt-Rink, S., Abrahams, E. (1987). In
Phys. Rev. B 38, 4547. “Proceedings of the Conference on Novel Mechanisms of Super-
82. The coexistence of SDW and SC states has also been discussed conductivity” (S. A. Wolff and V. Z. Kresin, eds.), p. 355, Plenum
within BCS theory: Baltensperger, W., and Strassler, S. (1963). Press, New York; (1987). Solid State Commun. 62, 681.
Phys. Kondens. Mater. 1, 20; Nass, M. J., et al. (1981). Phys. Rev. 106. Weber, W. (1988). Z. Phys. B 70, 323.
Lett. 46, 614. Overhauser, A. W., and Daemen, L. (1988). Phys. 107. Scalpino, D. J., Loh, E., Jr., and Hirsch, J. E. (1987). Phys. Rev. B
Rev. Lett. 61, 1885. The corresponding problem for exchange– 35, 6694. Schrieffer, J. R., Wen, X. G., and Zhang, S. C. (1988).
mediated superconductivity has been outlined by Parmenter, R. H. Phys. Rev. Lett. 60, 944. White, S. R., and Scalpino, D. J. (1997),
(1987). Phys. Rev. Lett. 59, 923. Phys. Rev. B 55, 6504.
83. Weidinger, A., et al. (1989). Phys. Rev. Lett. 62, 102. Brewer, J. H., 108. For a critical review see Little, W. A. (1988). Science 242, 1390.
et al. (1988). Phys. Rev. Lett. 60, 1073. 109. Fiory, A. T., Hebard, A. F., Mankiewich, P. M., and Howard, R. E.
84. Bednorz, J. G., and Muller, K. A. (1986). Z. Phys. B 64, 189. (1988). Phys. Rev. Lett. 61, 1419.
Chu, C. W., et al. (1987). Phys. Rev. Lett. 58, 405. Uchida, S., 110. Niemeyer, J., Dietrich, M. R., and Politis, C. (1987). Z. Phys. B
et al. (1987). Jpn. J. Appl. Phys. 26, L1. Wu, M. K., et al. (1987). 67, 155.
Phys. Rev. Lett. 58, 908. 111. Smedskjaer, L. C., Liu, J. Z., Benedek, R., Legnini, D. G.,
85. abbé, J., and Bok, J. (1987). Europhys. Lett. 3, 1225. Lam, D. J., Stahulak, M. D., and Bansil, A. (1988). Physica C
86. Prelovšek, P., Rice, T. M., and Zhang, F. C. (1987). J. Phys. C 20, 156, 269.
L229. 112. Cf. also Mathur, N. D., et al. (1998). Nature 394, 39; Fisk, Z., and
87. Jorgensen, J. D., et al. (1988). Phys. Rev. Lett. 58, 1024. Pines, D. ibid., p. 22.
88. Weber, W. (1987). Phys. Rev. Lett. 58, 1371. Barisic, S., Batistic, J., 113. For review see Shen, Z.-X., and Dessau, D. S. (1995). Phys. Rep.
and Friedel, J. (1987). Europhys. Lett. 3, 1231. 253, 1ff; Ding, H., et al. (1995). Phys. Rev. Lett. 74, 2784; (1996).
89. For review of phonon- and bipolaron-mediated superconductivity Phys. Rev. B 54, R9878; Campuzano, J.-C., et al. (1998). In “The
see, e.g., Oguri, A. (1988). J. Phys. Soc. Jpn. 57, 2133; de Jongh, Gap Symmetry and Fluctuations in High-TC Superconductors”
L. J. (1988). In “Proc. 1st Int. Symp. Superconduct.,” Nagoya, (J. Bok et al., eds.), Plenum Press, New York.
Springer—Verlag, New York. 114. Fujimori, A., et al. (1998). J. Phys. Chem. Solids 59, 1892, and
90. Alexandrov, A., and Ranninger, J. (1981). Phys. Rev. B 24, 1164. references therein.
Alexandrov, A., Ranninger, J., and Robaszkiewicz, S. (1986). Phys. 115. Ding, H., et al. (1998). J. Phys. Chem. Solids 59, 1888, and
Rev. B 33, 4526. For review see Micnas, R., Ranninger, J., and refrences therein; Norman, M. R., et al., ibid. p. 1902.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
Superconductors, High
Temperature
John B. Goodenough
University of Texas
I. Introduction
II. Superconductivity
III. Metallic Oxides
IV. High-Tc Superconductors
291
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
sample; superconductivity is an intrinsic property of the 3. The highest values of Tc are associated with partially
material. filled d bands, and Tc varies sensitively with the
Since 1911, many materials have been surveyed to de- electron–atom ratio for the partially filled band.
termine whether they are superconductors and, if so, the 4. Tc is suppressed where the conduction electrons
value of their Tc . Although there is still little understanding exhibit magnetic order at low temperatures.
to guide in the search for, or design of, high-Tc supercon- 5. Tc is suppressed where the electron–phonon coupling
ductors, extensive investigations of the elements, of alloys, becomes static, inducing a phase transformation to a
of compounds, and of polymers had, by 1985, resulted in nonmetallic state.
several empirical guidelines.
From 1911 to 1986, the critical temperature Tc remained
1. Only metals are superconductors. below 25 K, increasing by less than 0.3 K per decade
2. Superconductivity is associated with a dynamic (see Fig. 1). Moreover, the existing theory—applicable
electron–phonon coupling. to nearly all known superconductors—predicted a ceiling
II. SUPERCONDUCTIVITY
A. Phenomenology
1. Nomenclature
A superconductor is any material that undergoes a tran-
sition from the normal state to the superconductive state
below a critical temperature Tc . It is superconducting when FIGURE 2 Energy versus (a) the density N(ε) of one-electron
it is carrying a resistance-free (Rs = 0) current (i.e., a su- states for a semiconductor and a metal and (b) the Fermi–Dirac
percurrent) in the superconductive state. distribution function f (E).
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
FIGURE 3 Temperature dependence of (a) the resistance and (b) the thermopower of a superconductor.
excited from occupied to unoccupied states in the band: each with its own set of properties. The superconductive
this process gives rise to a Fermi–Dirac statistical distri- state is distinguished from the normal state by its electric,
bution that, for kB T E F (kB is Boltzmann’s constant), magnetic, thermodynamic, and tunneling properties.
leaves intact the concept of a Fermi surface (Fig. 2b).
As the ratio of interatomic/intraatomic interaction en- a. Electric. The dc resistance R of a superconductor
ergies decreases toward unity, the band of one-electron wire drops abruptly at Tc , from Rn > 0 in the normal state
energies narrows, increasing N (E F ) for a given E F ; single to Rs = 0 in the superconductive state (Fig. 3a).
electrons become transformed into quasiparticles that can- In the normal state, the potential difference V between
not be described by the average potential of all other elec- the ends of a wire of length l and resistance Rn is, by
trons because correlations between the electrons on near- Ohm’s law,
est like neighbors reflect the intraatomic electron–electron
V = IRn (1)
interactions and/or because electron–phonon interactions
“dress” the electrons in local crystallographic distortions. if the wire carries a current I . By definition, a constant
Although these interactions reduce the discontinuity in the electric field E = V /l then exists in the wire. In the su-
electron population at E F , an identifiable Fermi surface re- perconductive state, on the other hand, Rs = 0 makes
mains as long as the specimen is metallic. V = E = 0. There is no constant electric field in, or po-
Partially occupied, narrow energy bands of quasiparti- tential difference across, a superconducting wire. Con-
cle states may lose their Fermi surface at E F by induc- sequently all the thermoelectric effects present in the
ing a diffusionless phase transition at low temperatures normal state vanish abruptly at Tc . For example, in the
that splits the band into bands of occupied and empty normal state an applied temperature gradient T gives
states separated by a finite energy gap E g . Three types of rise to an electric field E in the conductor; the thermoelec-
transitions cause such a splitting: (1) an atomic clustering tric power, defined as E/ T , vanishes with E below Tc
that changes the translational symmetry of the crystallo- (Fig. 3b).
graphic structure, (2) a magnetic ordering that changes The resistance Rs of the superconductive state is strictly
the translational symmetry and/or the degree of electron zero only for direct currents of a constant value. If the cur-
localization at atomic positions, and (3) the onset of super- rent changes with time, as in an alternating-current (ac)
conductivity caused by a pairing of one-particle states hav- application, then Rs is not zero. Nevertheless at tempera-
ing energies near E F into an ordered condensate of two- tures T Tc , Rs remains much less than the resistance Rn
particle states. Since the first two processes compete with of the normal state for frequencies ν < E g / h, where E g is
the onset of superconductivity, any realization of a high Tc the energy gap (see Section II.C) at the Fermi energy of
must involve a mechanism that suppresses the stabiliza- the superconductive state. The ratio Rs /Rn increases from
tion of atomic clustering and of magnetic ordering of the a small value to nearly 1 in a finite frequency interval ν
conducting electrons. (Fig. 4). The width ν broadens and its midpoint shifts to
a lower frequency as T increases to Tc .
FIGURE 6 (a) The magnetic field in a superconductive cylinder and sphere induced by a uniform applied field B a .
(b) The total induced plus applied field.
beyond which the Meissner effect is less than complete; from type II superconductors by the way in which the
and Hc2 , beyond which it disappears completely. normal-state regions penetrate the superconductive state
In the absence of a magnetic field, the transition at Tc with increasing Ba /µ0 .
is always sharp; the material is, essentially, wholly su- The distinction between the two types of superconduc-
perconductive at temperatures T < Tc and wholly normal tors is illustrated in Fig. 8 for the case of a Ba applied per-
at T > Tc . At temperatures T < Tc , the change from the pendicular to a plane slab of a superconductor. If the super-
superconductive to the normal state at the critical field conductor is type I, the normal regions enter as relatively
strength Hc (T ) is not sharp in type II superconductors, and thick, parallel laminae; and if both normal and supercon-
for most geometries it is not sharp even in type I super- ductive states coexist, the superconductor is said to be in
conductors. Penetration of magnetic flux occurs between an intermediate state. If the superconductor is type II, the
Hc1 and Hc2 in Fig. 7b; in this range of applied field, parts normal regions enter as numerous, extremely thin tubular
of the specimen are in the normal state and parts are in the filaments separated by small distances (≤10−5 mm), and
superconductive state. It is possible to distinguish type I for Hcl < H < Hc2 the superconductor is said to be in a
FIGURE 7 Magnetization versus applied magnetic field B a for a bulk superconductive cylinder with its axis parallel
to B a for (a) type I and (b) type II superconductivity.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
c. Thermodynamic: Order parameter. In the ab- which shows that any initial current circulating in the ring
sence of an applied magnetic field, the transition from the decays exponentially to zero in the normal state. How-
superconductive to the normal state is second order: there ever, in the superconductive state an R = Rs = 0 makes
is no discontinuity at Tc in either entropy (no latent heat) I (t) = I (0), and the initial current I (0) continues to circu-
or volume (no thermal hysteresis), but there is a sharp late around the ring without any change in its magnitude.
discontinuity C in the heat capacity C (Fig. 9). Such currents are known as persistent currents; and any
A decrease in entropy on going from the normal to the current I circulating around the ring produces a magnetic
superconductive state shows that the superconductive state flux threading the ring equal to LI. In the presence of Ba ,
is more ordered and can be described by an order pa- the total flux threading the ring is = Ar Ba + LI, and
rameter that varies smoothly with temperature from unity Eq. (10) reduces to
at 0 K to 0 at T = Tc . A natural choice for the order d/dt = −IR. (12)
parameter in classical physics is n s /n 0 , the local den-
sity of superconductive electrons normalized to its value In the superconductive state, R = 0 gives
at 0 K. However, superconductivity is a quantum—not a
= constant = Ar Bs , (13)
classical—phenomenon, and a more profound choice is
the corresponding quantum physics wave function where Bs is the ring magnetic field.
FIGURE 9 Temperature dependence of the specific heat capacity for (a) a BCS superconductor and (b) a typical
antiferromagnet.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
FIGURE 13 Variation with temperature of (a) Hc of type I and (b) Hc2 of type II superconductors.
one metal to the other. However, the conduction-electron which means that the current oscillates with a frequency
wave functions extend beyond the metal surface, decay-
ν = 2eV / h. (27)
ing exponentially in magnitude with the distance from the
surface. If the insulating layer is thin enough (less than 10 This is the ac Josephson effect.
to 20 Å), a significant amplitude extends through the in- These predictions have been verified experimentally
sulating layer into the other metal. If an empty electronic and shown to apply to any sufficiently thin “weak link” in
state of equal energy is also available in the other metal, a superconducting circuit. A weak link can be any planar
then there is a finite probability that an electron impinging defect at which Tc is sharply reduced from its value in the
on the barrier will pass through the insulating layer. This bulk superconductor. Such weak links appear to limit the
phenomenon is called tunneling. supercurrents in the new high-Tc superconductors.
If both metals are superconductors, two types of par-
ticles may tunnel: single quasiparticles and paired super-
conductive particles. Tunneling of single quasiparticles B. Applications
has been used to measure the energy gap in the super- The technical applications of superconductivity have ex-
conductive state; tunneling of superconductive particles— ploited all its basic properties. However, an extensive
called Josephson tunneling—exhibits unusual quantum commercial potential has been made possible only by
effects that have been exploited in a variety of quantum the discovery of type II superconductors and Josephson
devices. tunneling.
In 1962, Josephson proposed that a tunnel junction bet-
ween two superconductors—each in their superconduc-
tive state—should exhibit a zero-voltage supercurrent in 1. High Magnetic Field, High Direct Current
the direction x perpendicular to the junction, The discovery of zero dc resistance, which makes possible
macroscopic persistent currents, immediately raised the
Ix = I0x sin γ , (24)
hope of building a solenoid magnet of superconductive
due to the tunneling of superconductive electron pairs. wire capable of generating an intense magnetic field at
Both the phase differences φ2 − φ1 of the wave function manageable power levels. Although no energy is expended
on either side of the insulating layer and the canonical by a static magnetic field, the energy required to create and
momentum of Eq. (15) in the presence of a magnetic flux sustain an intense magnetic field with a normal conductor
enter into is prohibitive.
Attempts to exploit this concept encountered the intrin-
2π 2
γ = (φ2 − φ1 ) − A x d x. (25) sic limitation imposed by Hc . A cylindrical wire of radius
0 1 rw carrying a current I has, at its surface, a magnetic field
A maximum dc flows in the absence of any electric or strength produced by the current
magnetic field. This is the dc Josephson effect. Hsurf = I /2πrw . (28)
Josephson further predicted that if a voltage difference
V is applied across the junction, the parameter γ becomes A supercurrent may increase until Hsurf = Hc ; for any cur-
time dependent, rent higher than the critical current,
γ (t) = γ (0) − (4π eV t/ h), (26) Ic = 2πrw Hc . (29)
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
2. Alternating-Current Devices
A low Rs /Rn ratio requires ac operation at T Tc and
ν E g / h. Type I superconductors may retain an Rs ≈ 0
up to 100 MHz. This property has enabled the realization FIGURE 14 Temperature variation of resistance for a wire
of very high-frequency linear electron accelerators with carrying a current I 1 < I 2 < I 3 .
magnifications up to 1010 ; they can operate continuously
with only a fraction of the power requirements of conven- The ability to amplify R electronically makes the su-
tional accelerators. perconductive bolometer an extremely sensitive radiation
Larger energy gaps E g in type II superconductors permit detector; it is particularly important in the far-infrared re-
low-loss ac transmission over superconductive strip lines gion of the spectrum, where most other types of radiation
to even higher frequencies. On the other hand, attempts to detectors are inoperative.
use superconductors in ac power devices remain restricted
to specialty applications such as space vehicles where, 5. Josephson Tunneling
with type II superconductors, high current densities in high
fields permit significant reductions in weight and size. Practical application of the dc Josephson effect has been
realized in very sensitive galvanometers and magnetome-
ters. The SQUID (superconducting quantum interference
3. Levitation device) magnetometer, for example, is used for measur-
The Meissner effect is demonstrated in the classroom by ing small magnetic fields, with extensive use in geologi-
levitation of a bar magnet over a superconductive bowl. cal surveying. A laboratory SQUID was the first practical
The experiment begins with a bar magnet resting on the demonstration of the high-Tc superconductor oxides.
bottom of a shallow bowl of superconductor in its normal The ac Josephson effect has been used in precision
state. The bowl is then cooled to below Tc ; expulsion of a determinations of the value of h/e.
magnetic field from the bowl creates an “image” magnet Applications in the computer field promise higher-
that exerts a repulsive force on the real magnet, causing density, lower-power components; however, their realiza-
it to rise until this force is balanced by the weight of the tion in practice requires an exquisite control of materi-
magnet. als processing that is particularly demanding for the new
A most spectacular application of this principle is the high-Tc superconductor oxides.
“levitated train,” which requires high magnetic fields and,
therefore, a hard, type II superconductor. C. Theory
1. History
4. Bolometer
Once the basic phenomena of zero resistance and the
A bolometer detects electromagnetic radiation by an ab- Meissner effect had been established, the experimental
sorption of radiation that increases its temperature. The strategies responsible for our understanding of supercon-
temperature increase T is related to the energy E ab- ductivity were guided by theory. The theory began with
sorbed per unit mass via the specific heat capacity Cv : purely phenomenological equations; these equations in-
troduced fundamental length parameters as well as the or-
T = E/Cv . (30)
der parameter, and their application permitted Abrikosov
At low temperatures, a low Cv enhances T for a given to explain the distinction between type I and type II
E. A type I superconductive detector is designed to op- superconductivity.
erate in the intermediate state where a small T gives rise The quantum theory of Bardeen, Cooper, and Schrieffer
to a large resistance change R as illustrated in Fig. 14. (BCS) introduced numerical values for three universal
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
4. BCS Theory
In 1956, Cooper showed that so long as there exists an at-
tractive interaction between pairs of electrons, the Fermi
sea of electrons is unstable against the formation of at least
one bound pair formed from states with k > kF . Moreover,
he argued that the two-electron wave function for a super-
FIGURE 16 Interface between superconductive and normal do- conductive pair is a singlet (paired spins), spherical state
mains in the intermediate state; h(x) is the local magnetic-field containing a weighted sum over k > kF of product wave
strength.
functions with momentum k, −k for each product and
that the maximum contribution comes from states with
introduced the order parameter ψ(r ) in Eq. (9) defined k ≈ kF . The two-electron binding energy relative to 2E F
by n s = |ψ|2 to obtain an equation for the supercurrent was shown to be (hωc = hνc )
density js in terms of ψ(r ). (This theory was later shown E bind ≈ 2hωc exp[−2/V N (E F )] (40)
to be a limiting form of the microscopic BCS theory first
presented in 1957.) With this formalism they were able in the weak-coupling limit V N (E F 1. In this derivation,
to treat two features that were beyond the scope of the Cooper made the approximation that the coupling energy
London formalism: (1) nonlinear effects in fields strong V is a constant for all values of k out to a cutoff energy
enough to change n s and (2) the spatial variation of n s . hωc away from E F . Since E bind is of the order of kB Tc ,
A major triumph of the formalism was its description an argument similar to that preceding Eq. (36) suggests
of the intermediate state of a type I superconductor in that the size of the Cooper-pair state is approximately ξ0 ,
which superconductive and normal domains coexist in an which is much larger than the interparticle distance. Thus
applied field strength Ba /µ0 ≈ Hc (Fig. 16). The interface the Cooper pairs are strongly overlapping, which is why
between the two domains is characterized by two lengths: they form a rigid condensate.
the penetration depth λL (T ), over which the local magnetic The interaction between electrons of a pair always con-
field is varying, and the coherence length tains an electrostatic repulsive energy Up between the two
electrons; a high dielectric constant introduces an elec-
ξGL (T ) = h/|2m ∗ α(T )|1/2 , (38)
tronic screening that reduces Up , but it is always repulsive.
over which ψ(r ) can vary without an undue energy in- The problem is to identify an attractive mechanism.
crease. In pure superconductors at T Tc , the Ginzburg– In 1950 Frölich suggested that electron–lattice interac-
Landau coherence length approaches the Pippard co- tions were responsible for the attractive potential, and this
herence length [i.e., ξGL (T ) ≈ ξ0 ], but ξGL (T ) diverges idea was confirmed experimentally with the discovery of
as (Tc − T )−1/2 near Tc since α vanishes as (Tc − T ). the isotope effect. The physical idea in the BCS treatment
Since λL (T ) also diverges as (Tc − T )−1/2 , the ratio of this suggestion is that the first electron polarizes the
λL /ξGL is nearly independent of temperature. Therefore crystal by attracting the positive atomic cores; the polar-
the Ginzburg–Landau parameter is ization in turn attracts a second electron provided that it
arrives in the polarized region of the crystal before the
κ = λL /ξGL . (39) lattice has had time to relax to its initial state. This time
In type I superconductors, a κ 1 results in a positive constraint limits the size of a Cooper pair to a charac-
interface energy, which stabilizes a macroscopic domain teristic length of order ξ0 . Moreover, the cutoff energy
pattern. hωc in Eq. (40) is, for this mechanism, the Debye energy
Abrikosov investigated what would happen if the hωD = kB D , which characterizes the cutoff of the phonon
Ginzburg–Landau parameter is greater than unity. He spectrum. If the attractive energy VC exceeds the electro-
√
found that for κ > 1/ 2, the energy of the interface be- static repulsive energy in magnitude, then the net BCS
tween normal and superconductive domains becomes potential
negative and the superconductor is type II. With a negative VBCS = VC − Up (41)
interface energy, field strengths Ba /µ0 ≥ Hc1 perpendicu-
lar to a superconducting slab cause flux to penetrate within is attractive.
cylindrical, normal-state domains; persistent supercur- The BCS theory involves a calculation of the ground
rents surrounding the normal-state regions form vortices. state of the system in the presence of a net attractive poten-
The vortex √concentration increases with Ba /µ0 ≥ Hc1 un- tial VBCS . Condensation of Cooper pairs changes the state
til Hc2 = 2κ Hc ; above Hc2 the vortices are merged into of the Fermi sea (the collection of one-particle states), and
a single normal-state phase. at some point the binding energy for an additional pair has
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
gone to zero. The simplest form of the theory contains one the BCS universal ratios [Eqs. (43)–(45)] have become
adjustable parameter, VBCS ; all other parameters entering transformed to
the theory are independently measurable. Its principal de-
ductions are the following: 2 0 /kB Tc = 3.53[1 + 12.5m 2ln(1/2m)], (46)
C(Tc )/γ Tc = 1.43[1 + 53m ln(1/3m)],
2
(47)
Tc ≈ 1.13D exp[−1/VBCS N (E F )], (42)
γ Tc2 H02 = 0.168[1 − 12.2m 2ln(1/3m)], (48)
where D ≈ M −1/2 is the Debye temperature, and
where m = kB Tc / hωln contains a parameter ωln that rep-
resents a weighted measure of the significant phonon
2 0 /kB Tc = 3.53, (43) frequencies appearing in α 2 F(ω). The agreement between
C(Tc )/γ Tc = 1.43, (44) theory and experiment for elemental and alloy supercon-
ductors, as given by Carbotte, is displayed in Fig. 17 for
γ Tc2 H02 = 0.168, (45) conventional weak to strong coupling regimes m ≤ 0.25.
Within the Eliashberg theory,
where 0 and H0 are the gap parameter E g = 2 and
the critical field strength Hc at T = 0 K, C(Tc ) is the 2 0 /kB Tc < 9 (49)
discontinuity in the specific heat at T = Tc , and γ is the
Sommerfeld constant of the electron gas in the normal would reach its maximum value only in the unrealistic
state (i.e., γ Tc is the electronic specific heat of the nor- situation that the entire spectral weight occurs at an op-
mal state at T = Tc ). It is found experimentally that α in timum vibrational energy hωE = 0.75 meV. The critical
Eq. (19) is not universally 12 and that Eqs. (43) to (45) do temperature
not hold quantitatively in many high- and intermediate-Tc
materials, which indicates a need to extend the simplest Tc = C(µ∗ )Ap /k (50)
BCS theory.
increases with the strength of the effective electron–
phonon interaction
∞
5. Beyond BCS
Ap = α 2 F(ω) d(hω), (51)
The limitation of the BCS theory is that it is a one- 0
parameter theory in which VBCS is assumed to be constant where C(µ ) decreases smoothly with increasing µ∗ . Tc
∗
in an energy region about E F of width ±hωD ; and there increases with Ap until a lattice instability freezes out a
is no prescription available for calculating VBCS from mi- static distortion of the structure. Thus the theory of su-
croscopic theory. An important extension of BCS theory perconductivity itself does not put an upper limit on Tc ;
has been given by Eliashberg. however, the conditions for a high Tc appear to be the same
Whereas BCS theory simply postulates an attractive as those for the stabilization of competitive mechanisms.
potential VBCS , Eliashberg theory treats properly the mi- The electronic density of states N (E F ) at the Fermi en-
croscopic electron–phonon interactions responsible for ergy plays an important role, as in the BCS theory, since
the pairing potential in conventional superconductors. The
Eliashberg theory contains two parameters. One is the Ap = N (E F )g 2 , (52)
pseudopotential µ∗ for the Coulomb electron–electron
repulsions; it is adjusted to give the correct value of Tc where g 2 involves a double Fermi-surface average of the
for a given electron–phonon interaction. The other is the square of the electron–atomic core interaction. In the ab-
electron–phonon spectral density α 2 F(ω), where F(ω) is sence of any physical intuition as to how to enhance g 2 ,
the number of phonon modes (lattice vibrations) having an efforts to increase Tc have traditionally concentrated on
energy between hω and h(ω + dω); α 2 F(ω) is a phonon increasing N (E F ), but this strategy is frustrated by the ap-
frequency distribution weighted by the strength of the pearance of spontaneous magnetism or atomic-clustering
electron–phonon interaction for that mode. It is possible to lattice instabilities as competing processes. Significantly
measure α 2 F(ω) accurately with superconductive-state– the high-Tc copper oxides have a relatively small N (E F ),
insulator–normal-state tunneling experiments. which implies that a large g 2 is enhancing the ratio m
The Eliashberg equations determine not only the tun- into a very strong-coupling regime. This observation re-
neling characteristic of a tunnel diode, but also all of the quires, in turn, either an electronic enchancement of the
thermodynamics of a particular superconductive material electron–phonon interaction or an entirely novel mecha-
provided that α 2 F(ω) and µ∗ are known. In this theory, nism for the formation of electron pairs.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
✏
FIGURE 17 The ratios (a) 2 0 /kB Tc , (b) C(Tc )/γ Tc , and (c) γ Tc2 /H02 versus m = kBTc / h ωl n . [After Carbotte, J. P.
(1987). Sci. Prog. Oxf. 71, 327.]
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
III. METALLIC OXIDES model gives a good first approximation to the binding
energy of the solid. The covalent component to the bond-
The new high-Tc superconductors are oxides. Since only ing introduces an O-2 p character into the Mg-3s states
metals are superconductors, this fact may appear anoma- and a Mg-3s character into the O-2 p states, but without
lous, as most of the common oxides are either insulators changing the number of electron states at each energy
or low-mobility semiconductors. However, many oxides level. Even where this “mixing” is large, it is customary
are metallic, and a few of these have extensive com- to identify the energy levels by their ionic component only
mercial application. For example, the metallic cathode (i.e., as O2− :2 p 6 and Mg2+ :3s 0 levels); the wave functions
of the lead–acid battery is PbO2 ; and Bi2 Ru2 O7 has a describing the mixed Mg-3s and O-2 p states are referred
temperature-independent resistivity in the vicinity of room to as crystal-field orbitals so as to distinguish them from
temperature, which makes it an important resistor material the atomic orbitals of a point-charge model.
in the electronics industry. Discussion of the high-Tc su- The crystal-field energies reflect the point-group sym-
perconductors rightfully begins, therefore, with a review metry of the near-neighbor Mg O interactions. The final
of the conditions that must be satisfied if an oxide is to be step is to introduce the like-atom interatomic interactions,
metallic. which broaden the energy levels of the crystal-field or-
bitals into energy bands of one-electron states. Whereas
the crystal-field orbitals are localized to discrete atomic
A. Electron Energies in a Typical Insulator sites, the one-electron states are itinerant with a well-
Figure 18 shows schematically the construction of an en- defined momentum p = hk in the absence of a magnetic
ergy diagram for the insulator MgO. The O2− ion is not field Ba = ∇ × A. Each band contains 2N /n one-electron
stable in free space; a negative electron affinity places the states per atomic orbital (the factor 2 reflects the twofold
O−/2− redox energy above the lowest energy E vac for a spin degeneracy of an orbital) for an array of N like atoms
free electron in a vacuum. An energy E 1 is required to re- containing n atoms per primitive unit cell. Thus the band
move the outer Mg = 3s 1 electron from a free Mg+ ion to states reflect the translational space–group symmetry of
a free O− ion to create free Mg2+ and O2− ions. This cost the crystal.
in energy is more than compensated by the electrostatic In MgO there is one magnesium and one oxygen atom
energy E M gained by ordering the Mg2+ and O2− ions into per primitive unit cell, so that O2− :2 p 6 and Mg2+ :3s 0 lev-
a crystal structure; the Madelung energy E M is calculated els are each broadened into single bands with a band-
for a lattice of point charges. The crystalline electric field width much broader than the small spin-orbit splitting
raises the Mg2+/+ level and lowers the O−/2− level; cross- of the threefold degeneracy of the oxygen 2 p crystal-
ing of these two energies ensures stabilization of the crys- field orbitals. Therefore the highest occupied band of
talline phase with a charge transfer from magnesium to one-electron states is represented as an orbitally threefold-
oxygen. degenerate O:2 p 6 band, which is full; the lowest unoccu-
In the real MgO crystal, transfer of an integral elec- pied band is identified as the Mg2+ :3s 0 band.
tronic charge does not occur; a quantum-mechanical co- In tight-binding theory, the width of a band of one-
valent component in the Mg O bond transfers a frac- electron states is
tion of the O2− -ion electronic charge back onto the Mg2+ W ∼ = 2zb, (53)
ion. However, the reduction in E M caused by this lower-
where z is the number of like nearest neighbors on ener-
ing of the effective ionic charges is compensated by the
getically equivalent lattice sites and
quantum-mechanical covalent-mixing repulsion between
the two ionic energy levels. Therefore the point-charge b ≡ (ψi , H ψ j ) ∼
= εi j (ψi , ψ j ) (54)
FIGURE 18 Electron energies for MgO: (a) free ions; (b) point-charge model; (c) band model; (d) density of states.
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
obtaining partially filled d or f bands. However, the wave mixed-valent state thus become “dressed” in a local lattice
functions of partially filled d or f shells have smaller ra- deformation, which introduces an activation energy into
dial extensions, which reduces the interatomic-interaction their mobility. These “dressed” electrons are called small
parameter b of Eq. (54) and increases the intraatomic- polarons; they move diffusively, so k is no longer a good
interaction parameter U , so Eq. (55) may no longer be quantum number. The rare earth mixed-valent systems are
satisfied. not metallic.
On the other hand, where a broad conduction band over-
1. Problems with Outer 4f Electrons laps a 4 f n energy level and E F intersects the 4 f n en-
ergy level to give a 4 f n+1 /4 f n mixed valence (in this
The 4 f electrons of a rare earth ion are tightly bound case denoted an “intermediate valence”), hybridization of
to their atomic nucleus and they are screened from near the 4 f wave functions with the conduction-band wave
neighbors by closed 5s 2 5 p 6 shells, so the condition functions may lead to “heavy-fermion” metallic behav-
W U (57) ior. Superconductivity has been observed in some heavy-
fermion compounds (not oxides), but in all of them Tc
prevails in the rare earth oxides. Under these conditions is low.
the 4 f orbitals remain as crystal-field orbitals; they are
not transformed into itinerant-electron states. Therefore
4 f n configurations remain localized; they impart localized 2. Outer d Electrons
atomic moments to the ions that are essentially identical a. General considerations. The 4 f electrons of a
to the localized atomic moments they impart to the free rare earth ion in an oxide are only weakly perturbed from
ions or atoms. In this situation, successive redox potentials their free-ion behavior; the outer s and p electrons are so
are separated by a large energy U > E g , which restricts strongly perturbed that they are transformed into itiner-
the accessible 4 f n configurations on a given rare earth ant electrons. The perturbations of the d wave functions
cation to at most two, and two only if one happens to lie in a transition-metal oxide are of intermediate strength.
within E g . In this case also it is convenient to consider first the per-
If a 4 f n /4 f n+1 redox couple does lie within E g , as turbations imposed by the nearest-neighbor metal–oxygen
illustrated in Fig. 20a, then it is possible to obtain mixed- (M O) interactions; these give rise to crystal-field orbitals
valent compounds in which E F intersects the redox energy. containing the quantum-mechanical covalent mixing be-
In this case, metallic conduction could be expected were tween overlapping cation and oxygen orbitals. Whether
b > hωR , where ωR is the frequency of a breathing-mode the crystal-field orbitals remain localized or are trans-
lattice vibration. However, b is so small that the near neigh- formed into itinerant-electron band states depends on the
bors have time to relax about a mobile electron, thereby relative strengths of the crystal-field intraatomic interac-
trapping it in a local potential well. These lattice relax- tions U and the bandwidth W due to interatomic interac-
ations stabilize the occupied states at the expense of un- tions between crystal-field wave functions on neighboring
occupied states, as does the molecular reorganization in a metal atoms M.
liquid for a given redox couple. The mobile electrons of the Covalent mixing between cation d and oxygen 2s and
2 p orbitals in a transition-metal oxide has two important
consequences. First, the fivefold orbital degeneracy of the
free ion is at least partially removed by a crystal-field
splitting of the energies of the crystal-field orbitals. Sec-
ond, mixing of oxygen wave functions with the d wave
functions spreads the crystal-field orbitals of d wave-
function symmetry out over the oxygen atoms, which
both reduces the intraatomic energy U and increases the
bandwidth W ; it also allows M O M as well as M M
interactions.
For half-filled crystal-field orbitals, the addition of one
more electron to a cation costs an intraatomic energy
U = U + ex , (58)
FIGURE 20 Electron energies for (a) EuO and (b) Gd2 O3 . EF where U is the energy required to add an electron to an
moves into the 4 f 7 energy level in Eu1−δ O containing a Eu3+/2+ empty orbital and ex is the additional electrostatic en-
mixed valence. ergy required to add it to a half-filled orbital. The term
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
FIGURE 23 Electron energies for MnO: (a) free ions; (b) point charge; (c) crystal.
for MgO except for the appearance of a filled crystal- the O-2 pπ orbitals but are orthogonal to the O-2 pσ and
field d 5 configuration within the large energy gap be- O-2s orbitals. The admixture wave functions φs , φσ ,
tween the empty Mn:4s 0 and the filled O:2 p 6 bands. Be- and φπ are, respectively, linear combinations of nearest-
cause E F lies above the top of the O:2 p 6 band, the formal neighbor O-2S, O-2 pσ , and O-2 pπ orbitals having the
valence Mn(II) unambiguously assigns five d electrons same symmetries as the atomic f e or f t orbitals with
per Mn. which they mix. The covalent-mixing parameters are de-
The Mn atom sits in an octahedral interstice of oxy- fined as
gen atoms in the rock-salt structure of MnO; in this
configuration the fivefold d-orbital degeneracy is split in λσ ≡ bσca E and λπ ≡ bπca E, (64)
two by a cubic crystalline field. The wave functions for where E = (E d − E p ) is the energy required to transfer
the two crystal-field manifolds have the form an electron from an O-2 p orbital to an empty d orbital at
the point-charge Mn(I) = 3d 6 energy. Because the overlap
ψe = Nσ [ f e − λs φs − λσ φσ ], (62)
integrals entering
ψt = Nπ [ f t − λπ φπ ], (63)
bca ≡ (ψcat , H ψanion ) ≈ ε(ψcat , ψanion ) (65)
where, from Fig. 24, the twofold-degenerate e√ orbitals of
are larger for the σ -bonding orbitals, a λσ > λπ raises the
symmetry x 2 − y 2 and [(z 2 − x 2 ) + (z 2 − y 2 )]/ 2σ -bond
energy of the antibonding crystal-field e orbitals relative
with 2 pσ and 2s orbitals at neighboring oxygen atoms
to that of the t2 orbitals. The crystal-field splitting
but are orthogonal to the O-2 pπ orbitals; the threefold-
degenerate t2 orbitals of symmetry x y, yz, zx π -bond with
10Dq ≡ c = m + 1
2
λ2σ −λ2π (E d − E p )
+ 12 λ2s (E d − E s ) (66)
between e and t2 crystal-field energies contains only a
relatively small electrostatic term m .
The energy difference between the Mn(I):3d 6 and the
Mn(II):3d 5 manifolds is
U = Ut + ex , (67)
where Ut is the energy required to add an electron to
an empty t2 orbital. In MnO, a ex ≈ 3 eV makes both
U and E large, and a large E makes λσ and λπ rel-
atively small. Therefore a W U and a c < ex stabi-
lize a localized t23 e2 configuration at a Mn(II) ion, and a
direct exchange interaction between spins in orthogonal
orbitals couples the spins parallel—in accordance with
FIGURE 24 Illustration of cation d and anion p orbitals in the Hund’s highest multiplicity rule for free ions—to give a
(001) plane of the rock-salt structure. localized Mn(II)-ion magnetic moment of 5µB .
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
e. Ti4 O7 and the bipolaron. The mixed-valent com- and anion vacancies in its rock-salt structure that become
pound Ti4 O7 is obtained by removing oxygen from TiO2 . ordered at lower temperatures. Here, also, the intraatomic
The oxygen vacancies order into shear planes so as to leave energy U = Ut contains no additional term c or ex ,
a close-packed tetragonal oxide-ion array as in TiO2 , but and some hybridization of titanium 4s and 3d orbitals
with the Ti atoms arranged to give TiO2 slabs connected increases W .
every four Ti atoms along the TiO2 c axis by a shear plane
across which Ti atoms share common octahedral-site faces g. Oxides with only M O M interactions. The
rather than octahedral-site edges. A strong electrostatic re- perovskite and pyrochlore structures provide systems hav-
pulsion between cations across a shear plane displaces the ing 135 to 180◦ M O M interactions and no M M in-
shear-plane Ti atoms away from each other. The possibil- teractions. A survey of the oxides with these structures
ity of a ferroelectric-type displacement of a Ti(IV) ion in shows that the condition W > U may be fulfilled in single-
an octahedral site stabilizes the shear-plane structure and valent oxides and the condition W > hωR may be found
locates the Ti–3d electrons within the slabs. in mixed-valent oxides with these structures.
At room temperature, both the Ti Ti and the Ti O Ti The relevant bandwidth W ∼ = 2zb arising from
interactions within a rutile slab satisfy the condition M O M interactions is proportional to either λ2π or λ2σ .
W > hωR , and Ti4 O7 is metallic. However, a first-order A large covalent-mixing parameter λπ or λσ requires,
semiconductor–metal transition occurs at a Tt ≈ 150 K according to Eq. (64), a small E = (E d − E p ) and/or
due to Ti Ti dimerization within the slabs; the electrons a large bca . The energy E decreases with increasing
are trapped out as pairs within specific Ti Ti homopolar formal charge on the cation and, for a given charge, on
bonds. Such a transition would be typical of cation clus- going to the right in any long period, provided that c
tering except that in the mixed-valent compound Ti4 O7 or ex is not introduced into U on adding another d
the homopolar bonds are mobile in the temperature range electron to compensate for the increased nuclear charge.
130 < T < 150 K; they become stationary only below However, bca also decreases on going to heavier atoms,
130 K. These mobile homopolar bonds represent strongly but it increases on going from 3d to 4d to 5d orbitals.
coupled, localized electron pairs that, like small polarons, The overlap integrals are also sensitive to the M O M
move diffusively in the crystal. Such a mobile electron pair angle and to the character of the countercation A in the
is called a bipolaron. Formation of spin-paired bipolarons AMO3 perovskites and the A2 M2 O7 pyrochlores.
causes a sharp drop in the paramagnetic susceptibility. But Perovskite and pyrochlore oxides containing partially
the compound becomes a semiconductor, not a supercon- filled 5d orbitals are generally metallic, whether stoichio-
ductor, and there is no Meissner effect. metric as in ReO3 or mixed-valent as in the Nax WO3
bronzes; but single-valent metallic conductivity, as in
f. TiO, a single-valent compound with W >U. The ReO3 , does not guarantee that the compounds are
titanium atom may be stabilized in single-valent oxides as superconductors.
Ti(II) in TiO, Ti(III) in Ti2 O3 , and Ti(IV) in TiO2 . This is Perovskite and pyrochlore oxides containing partially
possible, even though TiO2 is an insulator with empty d or- filled 4d orbitals are intermediate in character; the 4d elec-
bitals some 3 eV above the top of the O:2 p 6 band, because trons are itinerant, but a W ≈ U may result in spontaneous
the energy separating the Ti(III):t21 e0 and Ti(II):t22 e0 lev- magnetism.
els is a relatively small U = Ut . Moreover, the absence of The perovskite and pyrochlore oxides containing par-
antibonding e electrons allows the Ti O bond to be short tially filled 3d orbitals commonly contain localized 3d n
relative to the radial extension of the t2 crystal-field or- configurations; however, several of these oxides have
bitals, so the overlap of t2 orbitals on neighboring Ti atoms a W ≥ U and are either metallic or stabilize itinerant-
sharing a common octahedralsite edge is large enough to electron antiferromagnetic order. In the perovskites, the
ensure a W > Ut for Ti Ti interactions and the overlap of homogeneous electronic picture in Fig. 21 does not
Ti-t2 and O-2 pπ orbitals is large enough to make W ≈ Ut hold where W ≈ U ; a static CDW and/or spin-density
for 180◦ Ti(III) O Ti(III) interactions. wave (SDW) may be stabilized as indicated in Fig. 22.
The corundum structure of Ti2 O3 contains, on a hexag- Recently, dynamic displacive phase segregations have
onal basis, c-axis pairs of cations sharing a common been identified in the metallic systems Sr1−x Cax VO3 ,
octahedral-site face. Although Ti2 O3 is a metal above La1−x Ndx CuO3 , and Ln1−x Cax MnO3 and the LnNiO3
room temperature, it becomes a semiconductor at lower family (Ln = lanthanide). These dynamic segregations oc-
temperatures because the d electrons become trapped in cur because there is a first-order change in the equi-
homopolar Ti Ti bonds within the c-axis pairs. librium M O bond length on going from localized 3d
TiO, on the other hand, is metallic and a superconductor electrons on the M atoms (W < U ) to itinerant 3d elec-
(Tc = 1.5 K), even though it contains about 15% cation trons (W > U ); the equilibrium M O bond is longer for
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
localized than for itinerant 3d electrons. In the LnNiO3 tivity. In fact, CDWs may also compete in mixed-valent
family, the onset of a static CDW/SDW below a critical oxides.
temperature Tt has been shown to be an order–disorder 5. At the crossover from localized to itinerant electronic
transition of localized-electron fluctuations in an itinerant- behavior (W ≈ U or W ≈ hωR ), a first-order change in
electron host; it is not due to either Fermi-surface nesting the equilibrium M O bond length can give rise to vi-
in a new Brillouin zone created by a change in lattice bronic phenomena in both single-valent and mixed-valent
symmetry or to the homogeneous Mott–Hubbard transi- transition-metal arrays.
tion illustrated in Fig. 21. In the Ln1−x Cax MnO3 system,
all the Mn atoms carry a localized t 3 configuration with i. Peculiarity of copper oxides. Copper oxides are un-
spin S = 3/2, and a single electron per Mn(III) occupies usual in two respects. First, octahedral-site Cu(II):t6 e3
a narrow (W ≈ U ) σ ∗ band of e-orbital parentage. In this contains a single e hole in the 3d shell, which makes it
case, the transition from localized e to itinerant σ ∗ elec- orbitally degenerate and therefore a strong Jahn–Teller
trons is approached from the localized-electron side, and ion; consequently, Cu(II) ions normally occupy octahedral
ordering of the twofold-degenerate e orbitals promotes sites that are deformed to tetragonal (c/a > 1) symmetry
e-electron localization and charge ordering. In the ab- by Jahn–Teller orbital ordering. However, in the absence
sence of charge and orbital ordering, the system becomes of a cooperativity that stabilizes long-range orbital order-
ferromagnetic below a Curie temperature Tc , and a dy- ing, the electrons may couple locally to E-mode vibra-
namic phase segregation in compositions with W ≈ U re- tions, forming vibronic states in a dynamic Jahn–Teller
sults in a “colossal magnetoresistance” (CMR) at tem- coupling. Second the Cu(II):3d 9 energy level lies below
peratures T ≥ Tc ; a dynamically segregated, ferromag- the top of the O2− :2 p 6 valence band in an ionic model;
netic, metallic phase of higher Curie temperature grows the introduction of covalent bonding creates states of e-
in an applied magnetic field at the expense of the host orbital symmetry at the top of the O2− :2 p 6 bands that have
phase until it reaches a percolation threshold. This evi- a large O-2 pσ component (see Fig. 26). Locally this O-2 pσ
dence of vibronic (hybridization of electronic and vibra- component increases dramatically on oxidation of Cu(II)
tional state) phenomena in oxides with perovskite-related to Cu(III). The change in hybridization represents a polar-
structures is of great significance for our understand- ization of the oxygen atoms that decreases the equilibrium
ing of the high-temperature superconductivity in copper Cu O bond length, but the change in polarization is fast
oxides.
B. Copper-Oxide Superconductors
to the other. Physicists refer to such a spontaneous dispro-
1. Structure
portionation as a “negative U ” reaction.
The system BaPb1−x Bix O3 is pseudocubic in the com- Where the tolerance factor in Eq. (77) approaches unity,
positional range 0.05 ≤ x 0.3, and it is an unconven- epitaxial (001) interfaces between an AX rock-salt layer
tional superconductor with a Tc that increases with x to and an ABX3 perovskite layer are lattice matched. Nature
about 13 K at the limiting composition of the pseudocubic recognizes this fact by stabilizing intergrowth structures
single-phase field (Fig. 28). (AX)(ABX3 )n in which perovskite layers alternate with
Superconductivity was first discovered in the rock-salt layers along an [001] axis. The La2 CuO4 struc-
BaPb1−x Bix O3 system by Sleight of DuPont. Al- ture in Fig. 29, for example, is tetragonal at high tempera-
though this perovskite system reaches a maximum Tc tures, with LaO rock-salt layers alternating with LaCuO3
of only about 13 K, it is considered unconventional perovskite layers on traversing the c-axis. Lattice match-
because such a Tc requires a large VBCS N (E F ) product ing requires a 45◦ rotation of the [100] axis of a rock-salt
[Eq. (42)], and a small measured N (E F ) then requires layer relative to that of a perovskite layer. As in the per-
an exceptionally large pairing potential VBCS . Therefore, ovskite structure itself, the Goldschmitt tolerance factor
the system has been examined for clues to the strong of Eq. (77) is a measure of the mismatch of the equilib-
coupling mechanism operative in the higher-Tc copper rium A X and B X bond lengths. Since the A X and
oxides (Fig. 1). B X bonds have different thermal expansions and com-
Superconductivity also appears on suppression of the pressibilities, matching (t = 1) of the bond lengths can be
static CDW of BaBiO3 by substitution of more than 12% perfect only at a specific temperature for a given pres-
K+ for Ba2+ in Ba1−x Kx BiO3 ; a maximum Tc = 32 K is sure, and the value of t calculated from tabulated ionic
found near x = 0.4, where the system becomes cubic. For radii corresponds to room temperature at ambient pres-
x > 0.47, the system behaves as a normal metal without sure. On cooling, t decreases, and a t < 1 is compensated
any superconductivity. The effect on Tc of substituting 18 O by a cooperative rotation of the CuO6/1.5 octahedra about
for 16 O gave a conventional isotope shift, α = 0.4 to 0.5, a tetragonal [110] axis (Fig. 29b), to lower the symme-
which indicates that the BCS phonon-mediated pairing try to orthorhombic. These rotations buckle the Cu–O–Cu
mechanism is operative in these systems. On the other bonds from 180◦ to (180◦ − φ), so the CuO2 planes be-
hand, Kumal, Hall, and Goodrich have shown that the come CuO2 sheets.
transition at Tc is fourth-order, not second-order, in the In the case of La2 CuO4 , the crystallographic c/a ratio
Ehrenfest classification. is anomalously high because the Cu(II) ions distort their
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
the p-type system La2−x Srx CuO4 , the larger Sr2+ ion re-
lieves the compressive stress, and oxygen stoichiometry
is more easily achieved as x increases.
In the T∗ -tetragonal structure of Nd2−x−y Ce y Srx CuO4
(Fig. 30b), the larger Sr2+ ions order into alternate A-
cation bilayers; the Sr2+ ions stabilize rock-salt bilayers,
whereas the alternate bilayers have the fluorite structure.
As a result, the Cu(II) are fivefold coordinated. Whether
the Cu(II) are six-, five-, or fourfold coordinated, su-
perconductivity requires preservation of the translational
symmetry within a CuO2 plane or sheet and, therefore,
the same nearest-neighbor oxygen coordination for every
copper atom within a plane or sheet.
The variable oxygen coordination at a Cu(II) also makes
it possible to remove the apical oxygen atoms from a per-
ovskite multilayer if the A-site cations of the perovskite
block are stable in eightfold oxygen coordination. In fact,
FIGURE 30 (a) T -tetragonal structure of Nd2 CuO4 . (b) T ∗ - all the copper-oxide superconductors that would contain
tetragonal structure of Nd2−x−y Ce y Srx CuO4 .
perovskite multilayers contain eightfold coordination of
the A’ cations (A’ = Ca, Y, or a trivalent lanthanide) to
octahedra to tetragonal (c/a > 1) symmetry with their long form an A’m−1 (CuO2 )m layer, with integral m ≥ 2; these
apical Cu O bond along the c-axis. This c-axis ordering superconductive layers alternate with AO––AO layers
of the filled (3z 2 –r 2 ) orbitals has an important structural (A = La, Sr, Ba) that have a rock-salt AO interface.
consequence: the apical oxygen atoms are not strongly The intralayer composition has a variable oxygen con-
bound to the Cu atoms and may be removed from the oxy- tent and may be quite varied, as illustrated in Figs. 31
gen coordination at a Cu(II). Consequently, the Cu(II) ion to 35.
may be found in six-, five-, or fourfold oxygen coordi- The nonsuperconductive layer may act as a charge
nation, but the strong square-coplanar bonding within a reservoir for the holes in the p-type superconductive
CuO2 sheet is always maintained. layers. This situation is found, for example, in the
Replacement of La with a smaller trivalent rare earth YBa2 Cu3 O6+x system, where the 0 ≤ x < 1 oxygen in
ion illustrates well the weak bonding of the apical oxygen. the BaO CuOx BaO layers order into Cu O Cu chains
Substitution of a smaller A cation lowers t, and the struc-
ture accommodates the bond-length mismatch by displac-
ing the apical oxygen to tetrahedral interstices of an (001)
La bilayer to form a fluorite Ln O2 Ln layer (Ln = Pr
to Gd) as illustrated in Fig. 30a. This structure is labeled
T -tetragonal to distinguish it from the T-tetragonal phase
of high-temperature La2 CuO4 . An important consequence
for the chemistry of these phases is that the displacement
of the apical oxygen in the T phase places the CuO2 planes
under tension, whereas the CuO2 sheets of La2 CuO4 are
under compression as a result of the bond-length mis-
match. A tensile stress is relieved by adding antibond-
ing (x 2 –y 2 ) electrons to the CuO2 planes; a compressive
stress is relieved by removing antibonding (x 2 –y 2 ) elec-
trons from the CuO2 sheets. As a result, the T phase can
only be doped n-type to give n-type superconductivity,
whereas La2 CuO4 can only be doped p-type. In fact, care
must be exercised in the preparation of La2 CuO4 , as it
may accept interstitial oxygen in the tetrahedral sites of
the rock-salt bilayers to give La2 CuO4+δ ; this composition
phase segregates below room temperature to give filamen- FIGURE 31 Structures of (a) tetragonal YBa2 Cu3 O6 and (b) or-
tary p-type superconductivity in the oxygen-rich phase. In thorhombic, ideal YBa2 Cu3 O7 .
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
FIGURE 34 Structures of (a) Tl2 Ba2 CuO6 , (b) Tl2 Ba2 CaCu2 O8 , and (c) Tl2 Ba2 Ca2 Cu3 O10 .
b. Underdoped. The “underdoped” region 0 < x < five Cu centers. A pseudo Jahn–Teller deformation of the
0.1 supports polaronic conduction, but the thermoelec- in-plane square-coplanar coordination at a Cu(III) would
tric power indicates that the nonadiabatic polorans are be resisted by an elastic energy, but this energy would
not small, centered on one Cu atom, but embrace about be reduced by cooperative deformations over several Cu
centers. Calculation has shown that the gain in elastic
energy would result in a polaron that embraced five to
seven Cu centers. However, some other vibronic mecha-
nism may be responsible for preventing the polaron col-
lapse to a single Cu center. Within the polarons, antifer-
romagnetic order is suppressed, which indicates that the
hole occupies a molecular orbital that includes all the Cu
centers of the polaron. In this respect, it represents a mo-
bile metallic phase in the antiferromagnetic matrix. As
the volume of this second electronic phase increases, it
breaks up the long-range antiferromagnetic order of the
parent phase, which causes TN to decrease precipitously
with x from 340 K at x = 0. However, localized spins
in regions of short-range order persist into the supercon-
ducting compositions; they give rise to a maximum in the
paramagnetic susceptibility at a Tmax that decreases with
increasing x.
The appearance of a superconductive Tc that increases
with x for compositions 0.05 ≤ x ≤ 0.10 indicates that the
polarons condense at lower temperatures into supercon-
ductive filaments. The transition temperatures TF and Tρ
FIGURE 35 Structure of HgBa2 Ca2 Cu3 O8+δ . The O4 and O5 are in Fig. 36 mark anomalies in the temperature dependence
the δ interstitial oxygen. of the transport properties; others have noted anomalies in
P1: GPB Final Pages
Encyclopedia of Physical Science and Technology EN016A-751 July 31, 2001 15:35
result in pair wave functions with s-wave symmetry; the superconductive CuO2 Y CuO2 layers and suppress Tc .
superconductive energy gap 2 0 is finite over the entire It is interesting that the Y3+ ion may be substituted by any
Fermi surface. The pair wave functions in the CuO2 sheets trivalent rare earth ion (with the exception of Pr) without
of the copper oxides have (x 2 –y 2 ) d-wave symmetry; the influencing Tc significantly. Only in the case of Pr is
energy gap 2 0 has nodes along the tetragonal [110] and there an important interaction between the lanthanide 4 f
[11̄0] axes. This symmetry reduces the coulomb repulsion orbitals and the (x 2 −y 2 ) band of the CuO2 sheets.
between the paired electrons. YBa2 Cu3 O6 is an antiferromagnetic insulator with
Cu(II) in the CuO2 Y CuO2 layers and Cu(I) in the
d. Overdoped. The overdoped compositions x > BaO Cu BaO layers; antiferromagnetic order between
0.25 are not superconductors, and a change from p- Cu(II) ions sets in at a TN > 500 K. The initial interstitial
type to n-type conduction signals a transfer of spectral oxygen atoms enter the BaO Cu BaO layers randomly
eight from the lower and upper Hubbard bands of the and oxidize the neighboring Cu(I) to Cu(II). However,
x = 0 parent compound to Fermi-liquid states in the gap a threefold-coordinated Cu(II) attracts a second intersti-
(U –W ) of the parent. Nevertheless, the metallic resis- tial oxygen atom to form square-coplanar coordination,
tivity remains high with an anomalous temperature de- which initiates the formation of a chain segment. The
pendence, which indicates that the transition from vi- twofold-coordinated Cu remain Cu(I), so the formation of
bronic to Fermi-liquid states may not be complete by chain segments initiates oxidation of the CuO2 Y CuO2
x = 0.3. layers. For x < 0.3, the chain segments remain disor-
The decrease in Tc with increasing x in the range dered, so the crystallographic symmetry is tetragonal; but
0.22 < x < 0.26 is not smooth; it is characterized by a se- TN drops precipitously with increasing oxidation of the
ries of steps typical of phase segregation. Since Tt crosses CuO2 Y CuO2 layers in the interval 0.1 < x < 0.25 (see
Tc near this compositional range, these steps may reflect Fig. 37). For x ≥ 0.4, the crystallographic symmetry is
segregation of orthorhombic and tetragonal phases.
Blatt, J. M. (1964). “Theory of Superconductivity,” Academic Press, McMillan and Rowell (1969). In “Superconductivity” (R. D. Parks, ed.),
New York. p. 561, Dekker, New York.
de Gennes, P. G. (1966). “Superconductivity of Metals and Alloys,” Manousakis, E. (1991). Rev. Mod. Phys. 63(1), 1–62.
Benjamin, New York. Mendelssohn, K. (1966). “Quest for Absolute Zero,” McGraw–Hill, New
Ginsberg, D. M. (ed.) (1996). “Physical Properties of High-Temperature York.
Superconductors,” Vol. 5, World Scientific, Singapore. Newhouse, V. L. (ed.) (1975). “Applied Superconductivity,” Academic
Ginzburg, V. L., and Kirzhmits, D. A. (eds.) (1982). “High Temperature Press, New York.
Superconductivity” (A. K. Agyei, transl.; J. L. Birman, transl. ed.), Rickayzen, G. (1965). “Theory of Superconductivity,” Wiley (Inter-
Pergamon, Oxford. science), New York.
Goodenough, J. B. (1972). Prog. Solid State Chem. 5, 145. Saint-James, D., Sarma, G., and Thomas, E. J. (1969). “Type II Super-
Goodenough, J. B., and Longo, J. M. (1970). “Crystallographic and Mag- conductivity,” Pergamon, Oxford.
netic Properties of Perovskite and Perovskite Related Compounds, Scalapino, D. J. (1995). Phys. Rev. 250, 329.
in Landolt–Bornstein Tabellen,” New Series Group III/4a, No. 126, Schrieffer, J. R. (1964). “Theory of Superconductivity,” Benjamin, New
Springer-Verlag, Berlin, New York. York.
Kaldis, E. (ed.) (1994). “Materials and Crystallographic Aspects of Solymar, L. (1972). “Superconductive Tunnelling and Applications,”
HTc —Superconductivity,” NATO ASI, Series E: Applied Sciences, Halsted, New York.
Vol. 263, Kluwer Academic, Dordrecht. Taylor, A. W. B. (1970). “Superconductivity,” Wykeham, London,
Kittel, C. (1976). “Introduction to Solid State Physics,” 5th ed., Wiley, Winchester.
New York. Tinkham, M. (1975). “Introduction to Superconductivity,” McGraw–
Kulik, I. O., and Yanson, I. K. (1972). “Josephson Effect in Supercon- Hill, New York.
ductive Tunneling Structures,” Halsted, New York. Wallace, P. R. (ed.) (1969). “Superconductivity,” Gordon and Breach,
Kuper, C. G. (1968). “Introduction to the Theory of Superconductivity,” New York.
Oxford University Press, London, New York. Williams, J. E. C. (1970). “Superconductivity and Its Applications,”
London, F. (1950). “Superfluids,” Vol. I, Wiley, New York. Arrowsmith, Bristol, England.
Lynton, E. A. (1971). “Superconductivity,” 3rd ed., Halsted, New York. Ziman, J. M. (1972). “Principles of the Theory of Solids,” 2nd ed.,
Markiewicz, R. S. (1997). J. Phys. Chem. Solids 58, 1179. Cambridge University Press, London, New York.
P1: GQT Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
Thermoelectricity
Timothy P. Hogan
Michigan State University
681
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
682 Thermoelectricity
Thermoelectricity 683
Thomson then applied the first and second laws of ther- microscopic analyses can be used in deriving many use-
modynamics to the Seebeck, Peltier, and Thomson effects ful formulas for calculating thermoelectric properties of
to find the Kelvin relationships various materials. The following sections are dedicated to
developing the macroscopic and microscopic analyses.
= SAB T, (4)
d SAB A− B
= , (5) II. THERMODYNAMIC RELATIONSHIPS
dT T
where the subscripts A and B correspond to the two mate-
As shown by the Seebeck effect, when a temperature gra-
rials in Fig. 2. The second Kelvin relation suggests that the
dient is placed over the length of a sample, carrier flow
Seebeck coefficient for two materials forming a junction
will be predominantly from the hot side to the cold side.
can be represented as the difference between quantities
This indicates that a temperature gradient, T , is a force
based on the properties of the individual materials making
that can cause a flow of carriers. It is well known that
up the junction. Integration of the second Kelvin relation
applying the force of an electric potential gradient, V ,
gives
can also induce carrier flow. In 1931, Onsager developed
A− B A B a method of relating the flows of matter or energy within a
d SAB = dT = dT − dT
T T T system to the forces present. In this method the forces are
(6) assumed to be sufficiently small so that a linear relation-
ship between the forces, Xi , and the corresponding flows,
or Ji , can be written.
A B
SAB = dT − dT. (7) J1 = L 11 X1 + L 12 X2 + · · · L 1n Xn ,
T T
Defining the first term on the right-hand side as the “ab- J2 = L 21 X1 + L 22 X2 + · · · L 2n Xn , (8)
solute” Seebeck coefficient of material A and the second J3 = L 31 X1 + L 32 X2 + · · · L 3n Xn ,
term as the “absolute” Seebeck coefficient of material B,
we find that the Seebeck coefficient for a junction is equal or
to the difference in “absolute” Seebeck coefficients of the
n
individual materials making the junction. This is a very Ji = L im Xm (i = 1, 2, 3, . . . , n). (9)
significant result, as measurements of the individual ma- m=1
terials can be used to predict how junctions formed from For carrier and heat flow as described above, the Onsager
various combinations of materials will behave, thus re- relationships can be written
moving the need to measure every possible combination
of materials. The “absolute” Seebeck coefficient or ther- J = L 11 ∇V + L 12 ∇T,
(10)
moelectric power of a material, hereafter referred to sim- J Q = L 21 ∇V + L 22 ∇T,
ply as the thermopower of the material, can be found for
material A if the thermopower of material B is known or where J is the current density (electric charge flow), and J Q
if the thermopower of material B is zero. A material in is the heat flux density (heat flow). Without a temperature
the superconducting state has a thermopower of zero, and gradient (T = 0), a heat flux of zero would be expected,
once a material is calibrated against a superconductor, it contrary to what Eq. (10) would indicate. It is, therefore,
can then be used as a reference material to measure more important to understand further the primary coefficients
materials. This has been done for several pure materials L ii and interaction coefficients L i j (i = j) linking these
such as lead, gold, and silver (Roberts, 1977; Wendling equations. To do so requires a consideration of the ther-
et al., 1993). modynamics of an irreversible process (one in which the
Further understanding of the basic thermoelectric prop- change in entropy is greater than zero S̄ > 0).
erties and the relationships between them can be found
through comparisons of macroscopic and microscopic
derivations. The Onsager relations formulate various flows III. THERMODYNAMICS OF AN
(consisting of matter or energy) as functions of the forces IRREVERSIBLE PROCESS
that drive them, thus describing macroscopic observations
of materials. Another useful technique for understand- The general application of the Onsager relationship was
ing these basic thermoelectric properties utilizes semi- derived by Harman and Honig (1967) and is summarized
classical statistical mechanics to describe the microscopic here. For a constant electric potential, V , throughout the
processes. Comparisons between the macroscopic and the sample,
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
684 Thermoelectricity
d Q = T d S̄ = dU + P dV − µi dn i , (11)
i
Thermoelectricity 685
686 Thermoelectricity
Thermoelectricity 687
Onsager relations can then be written, by extracting the The electron wave function, (r, t), itself does not have
forces, Xi , from Eqs. (47), (48), and (50). physical meaning, however, the product of ∗ (r, t)(r, t)
represents the probability of finding an electron at position
Z11 µ̄ 1
J= ∇ + Z12 ∇ ,
r and time t. As a probability implies, there is a factor of
q T T uncertainty, which was quantified in 1927 by Heisenberg.
(52)
Z21 µ̄ 1
JE = ∇ + Z22 ∇ , px x ≥ h, p y y ≥ h, pz z ≥ h, (60)
q T T
B11 B12 where px , p y , and pz are the momentum uncertain-
J= ∇ µ̄ − ∇T,
ties in the x, y, and z directions, respectively. The po-
qT T sitional uncertainties in the three directions are given by
(53)
B21 B22
x, y, and z. It is possible to utilize these uncertainties
Js̄ = ∇ µ̄ − ∇T,
qT T to define the smallest volume (in real space, or momentum
L11 L12 space) that represents a discrete electronic state. Within a
J= ∇ µ̄ − 2 ∇T,
cube of material with dimensions L × L × L, the maxi-
qT T
(54) mum positional uncertainty for a given electron would be
L21 L22
x = y = z = L, since the electron must be located
JQ = ∇ µ̄ − 2 ∇T,
qT T somewhere within the cube. This would correspond to the
thus relating the electrical current density, J, to the energy minimum px , p y , and pz given by
flux density, J E , the entropy flux density, Js̄ , and the heat h h h h
pxmin = = , p ymin = = ,
flux density, J Q . Equations (52), (53), and (54) can now x L y L
be used to identify various thermoelectric properties. (61)
h h
pzmin = = .
z L
IV. STATISTICAL RELATIONSHIPS Thus the product
h3
Within crystalline materials, electron behavior can be de- pxmin p ymin pzmin = (62)
scribed by the wave nature of electrons and Schrödinger’s L3
equation, gives the minimum elemental volume in momentum space
to represent two discrete electronic states (one for spin-up
2m −h ∂E
∇ 2 + (E − V ) = , (55) and one for spin-down). The number of states, dg, per unit
h2 j ∂t volume in an element d px d p y d pz of momentum space
where is the electron wave function, E is the total en- can be written
ergy, and V is the potential energy of the electrons. The 1
solution to this equation is dg = 3 d px d p y d pz . (63)
h
(r, t) = ψ(r)e− jωt , (56) Schrödinger’s time-independent equation for a free
electron (V = 0) is
where is the time-independent solution to Schrödinger’s
equation. This solution forms a wave packet with a group 2m
∇ 2ψ + Eψ = 0, (64)
velocity, v, equal to the average velocity of the particle it h2
describes, such that which has the solution
∂ω 1 1 ∂E ψ = Ae jk·r . (65)
v = ∇k ω = = ∇k E = , (57)
∂k h h ∂k
Substituting back into (64) gives
where the use of Planck’s relationship, E = hν = hω, was
made. Force times distance is equal to energy, or with a h2 2 p2
E= k = , (66)
time derivative, 2m 2m
∂E 1 ∂E ∂k where p 2 = hk was used. Within a crystal, a similar for-
v·F= = ·h , (58)
∂t h ∂k ∂t mula can be found when the concept of effective mass, m ∗ ,
giving is utilized to account for internal forces on the electrons
due to the ion cores at each lattice point. Electrons with
∂k energies below some value E are then
F=h . (59) √defined by a sphere
∂t in momentum space with radius p = 2m ∗ E. The number
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
688 Thermoelectricity
Thermoelectricity 689
B. Carrier Concentrations The total rate of change of the distribution function near
r is then
In a free-electron approximation, the total number of elec-
df 1 ∂f
trons in a given energy band can then be found by integrat- = Ft · ∇k f + v · ∇r f + , (83)
ing the product of the density of states and the probability dt h ∂t
of occupancy of the state for spherical energy surfaces: which is Boltzmann’s transport equation. The first term on
Etop the right-hand side of this equation accounts for contribu-
(2mkT )3/2
n= g(E) f (E) dE = F1/2 (η), (77) tions from forces, Ft , including externally applied forces,
E bottom 2π 2 h3 F, and collision forces, Fc . The middle term adds the con-
where η = E F /kT , and Fν (η) is the Fermi–Dirac function, tributions from concentration gradients, and the last term
∞ is the local changes in the distribution function about the
ην
Fν η = d x. (78) point r. Equation (83) is equal to zero since the total num-
0 1 + e x−η ber of states in the crystal is constant, thus
Here, the bottom of the energy band was taken to be zero ∂f −1
energy corresponding to k = 0, and the integration was = Ft · ∇k f − ν · ∇r f
∂t h
allowed to extend to ∞ since the Fermi–Dirac distribution −1 1
falls to zero at high energy levels. In the degenerate limit, = Fc · ∇k f − F · ∇k f − ν · ∇r f
h h
when E F kT , a series expansion of (78) leads to the
following approximations for metals: ∂f 1
= − F · ∇k f − ν · ∇r f. (84)
2 ∂t c h
π 2 kT
E F ≈ E F0 1 − + ··· , With external forces applied, the distribution function, f ,
12 E F0 will be disturbed from the equilibrium value, f 0 . Upon
(79) the removal of those external forces, equilibrium will be
(2m)2/3 E F 3/2 π 2 kT 2 reestablished through collisions, (∂ f /∂t)c . Calculation of
n≈ 1 + + ··· . this collision term is a formidable task dependent largely
3π 2 h3 kT 8 EF
on the scattering mechanisms for the material investigated.
At T = 0 K, For small disturbances, however, a relaxation-time ap-
proximation is often used which assumes that
π 2 h2 3n 2/3
E F = E F0 = . (80) ∂f −( f − f 0 ) − f1
2m π = = , (85)
∂t c τk τk
For nonspherical energy surfaces, the number of elec-
trons per unit volume within an element of momentum where τk is the momentum relaxation time. In steady state,
space, dpx dp y d pz , is found using (63) and the relation ∂ f /∂t = 0 and Eq. (84) becomes
p = hk = 2π
h
k, − f1 1
0 = − F · ∇k f − v · ∇r f
2 1 τk h
dn = f (p, r) d px d p y d pz = f (k, r) dk x dk y dk z , or (86)
3 4π 3 τk
h f 1 = − F · ∇k f − τk v · ∇r f.
(81) h
where the factor of 2 accounts for two electrons of op-
The electric and heat current densities are given by
posite spin. The total electron density is then found by
q
integration. J = −qvn = −q v dn = − 3 f 1 (k) dk,
4π
(87)
C. The Boltzmann Function 1
JQ = v(E − E F ) f 1 (k) dk.
If the material is disturbed from equilibrium, then the dis- 4π 3
tribution will vary, in general, as a function of wavevector, Substituting Eq. (86) into Eqs. (87) starting with the elec-
k, position, r, and time, t, or f (k, r, t). At a time t + dt, tric current density, J, gives
the probability that a state with wavevector k + dk is
q −τk
occupied by an electron at position r + dr can be found, J=− 3 vF · ∇k f − τk vv · ∇r f dk. (88)
4π h
using Eq. (59), to be
Assuming parabolic bands, the gradient of the distribution
f (k + dk, r + dr, t + dt) function in k space can be written
1 ∂f ∂f
= f k + Ft · ∇k dt, r + v dt, t + dt . (82) ∇k f = ∇k E = hv. (89)
h ∂E ∂E
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
690 Thermoelectricity
Also, the following can be shown by direct substitution of Boltzmann’s equation. Comparison of Eqs. (95) and (97)
the Fermi–Dirac distribution (76): with Eq. (54) shows the following relations:
∂f ∂f ∂ 1 ∂ EF
= T E − , (90) L11
∂x ∂E ∂x T ∂x T = −qK0 ,
qT
with similar results in the y and z directions. Substituting
−L12 q
L11 = −q 2 T K0 ,
= ,
these results into Eq. (88) gives K 1 L = −qT K ,
T2 T 12 1
q −τk ∂ f ∂f
or
(98)
J=− 3 vF hv − vvτk T L21 L = −qT K 1,
= −K1 , 21
4π h ∂E ∂E
qT
L22 = −T K2 .
1 EF −L22 1
× E∇r
T
− ∇r
T
dk (91) = K2 ,
T2 T
or
q ∂f This shows the Onsager reciprocity relation, in that L12 =
J= τk vvF dk L21 . The thermoelectric properties can now be determined
4π 3 ∂E
through an evaluation of the transport integrals and the
∂f 1
+ q τk vv T (E − E F )∇r dk. (92) appropriate boundary conditions of isothermal (∇T = 0),
∂E T isoelectric (∇V = −E = 0), static (J = 0), or adiabatic
For example, the applied force might include a contri- (J Q = 0). For example, under isothermal conditions,
bution from an external electric field (−qE), plus a con- where ∇T = 0 and thus ∇µ = 0 (for a homogeneous
tribution caused by a temperature gradient (see Fig. 3), metal),
or in general as ∇ µ̄ = ∇(µ + q V ) = ∇µ − qE. Then (92)
J = q 2 K0 E = σE, (99)
would be
q ∂f and the electrical conductivity is
J= τk vv (∇µ − qE) dk
4π 3 ∂E
σ = q 2 K0 . (100)
∂f 1
−q τk vv(E − E F ) ∇r T dk, (93) The electronic contribution to the thermal conductivity
∂E T
is defined for static conditions as J E | J =0 = − κe ∇T , or
where ∇r (1/T ) = −(1/T 2 )∇r T was used. The electrical when J E = J Q as in a one-band material, J Q = κe ∇T ,
current density can be simplified and put into a format where (95) becomes
similar to the Onsager relations as shown in (54) by using
q
transport integrals defined as 0 = −qK0 ∇ µ̄ + K1 ∇T. (101)
T
1 ∂ f0
Kn = − 3 τk vv(E − E F )n dk, (94) Solving for ∇ µ̄ and substituting into (97) gives
4π ∂E
where it is assumed that the deviations from equilibrium 1 K1 1
are small, such that ∂ f /∂ E in Eq. (93) may be replaced J Q = −K1 ∇T + K2 ∇T
T K0 T
with ∂ f 0 /∂ E. This leads to an electrical current density of
1 K1 K1
q = K2 − ∇T (102)
J = −qK0 ∇ µ̄ + K1 ∇T. (95) T K0
T
Similarly, the heat current density, J Q , follows the same or
derivation to arrive at 1 K1 K1
κ= K2 − . (103)
1 ∂f T K0
JQ = τk vv (∇µ − qE)(E − E F ) dk
4π 3 ∂E The absolute Seebeck coefficient, or thermopower, S, can
also be found from the static condition, where the use of
1 ∂f 1
− 3 τk vv(E − E F )2 ∇r T dk (96) Eq. (101) gives
4π ∂E T
or 1 ∇ µ̄ 1 K1
S= = . (104)
1 q ∇T qT K0
J Q = −K1 ∇ µ̄ + K2 ∇T. (97) The Peltier coefficient, , can be found by evaluating the
T
heat current density, Eq. (97), for isothermal conditions:
These derivatives form the link between the macroscopic
Onsager equations and the atomistic derivations from J Q = qK1 E. (105)
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
Thermoelectricity 691
σ
σ = q 2 K0 K0 = L11 = −σT
q2
1 K1 σ
S= K1 = T S L12 = L21 = −T 2 σS
qT K0 q
1 K1 K1
κe = K2 − K2 = κe T + T 2 σS2 L22 = −T 2 κe − T 3 σS2
T K0
Substituting for the electric field, E, from Eq. (99) gives A relationship between the electrical conductivity and
the direct relationship between heat current density and the thermopower can be found by series expansion K1 ,
electric current density, which gives
K1
JQ = J = ΠJ, (106) π2 2 ∂σ
K1 = − 2 (kT ) , (112)
qK0 3q ∂ E E=E F
where the proportionality constant is simply the Peltier co-
efficient, . Comparing the Peltier coefficient (106) with along with σ = q 2 K0 and substituting into (104). This
the thermopower (104) leads to Kelvin’s second relation: leads to the Mott–Jones equation (Barnard, 1972):
Π = T S. (107)
−π 2 k 2 T ∂ ln σ
Sd = . (113)
Results of the transport integrals are summarized in 3 q ∂ E EF
Table I. Thus Eqs. (54) can be rewritten in terms of the
thermoelectric properties as A distinction of the diffusion thermopower, Sd , has been
−σ made here to separate it from a low-temperature effect
J = ∇ µ̄ + σS∇T,
that has not been considered above. The low-temperature
q
(108) effect typically appears as a peak in the measured ther-
−T
JQ = σS∇ µ̄ + κe + T σS2 ∇T. mopower (near 60 K for monovalent noble metals) and
q is the result of an increased electron–phonon interaction.
Substitution of the transport integrals can be used to eval- When a temperature gradient exists across a crystal, heat
uate further the thermoelectric properties. Estimations can will flow from the hot side to the cold side through lat-
be made through a series expansion of the transport inte- tice vibrations (phonons) and through electron flow. Vari-
grals using a Sommerfeld expansion, ous interactions among phonons, lattice defects, and elec-
∞ trons can be described by scattering times for each type of
∂ f0 interaction. At high temperatures, phonon–phonon inter-
Kn = − φn (E) dE
0 ∂E actions are more frequent than electron–phonon interac-
π2 d2 tions (τ p, p < τ p,e ). At these high temperatures (above the
= φn (E F ) + (kT )2 φn (E F ) + · · · . (109) Debye temperature, T > θD ), τ p,e is approximately tem-
6 d E F2
perature independent, while τ p, p ∝ 1/T . Under these
For the electrical conductivity, conditions, the total thermopower is dominated by the
diffusion thermopower as given in Eq. (113). At low tem-
q2 ∂ f0
σ = q 2 K0 = − 3 τk vv dk. (110) peratures (T < θD ), τ p,e ∝ 1/T and τ p, p ∝ eθ D /2T , there-
4π ∂E fore, as the temperature drops, τ p, p increases more rapidly
In its simplest form for cubic symmetry, this reduces to than τ p,e . When this occurs, τ p, p > τ p,e and electron–
phonon interactions will occur more frequently, caus-
nq 2 τk
σ = , (111) ing electrons to “dragged” along with the phonons. This
m∗ gives rise to a larger gradient of carrier concentration
where n is the electron density with energies near E F , and across the sample and is additive to the diffusion ther-
m ∗ is the effective mass of the electrons. Both the electron mopower such that S = Sd + Sg , where Sg is the phonon-
density near the Fermi level and the relaxation time are drag component of the thermopower described above. At
functions of energy, such that the electrical conductivity still lower temperatures, phonon-impurity interactions can
can be approximated as σ = const · E ξ , where ξ is some dominate, causing the magnitude of the thermopower to
number. decrease toward zero. For the remainder of this chapter, the
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
692 Thermoelectricity
Thermoelectricity 693
σ = const · (E 0 − E F )(3+s)/2 (126) using (109) and substituting the transport integrals into
(103). Series expansion of K2 gives
or, in (113),
π 2 k2T 2
−π k T2 2 K2 = − σ. (133)
S∼
= (3 + s), (127) 3 q2
6 q(E 0 − E F )
The thermal conductivity is given by (103), repeated here
where (3 + s) < 0. There is an exception to (127) in bis- for convenience:
muth, which shows the expected anisotropic thermopo-
1 K1 K1
wer, but an unexpected negative thermopower (S⊥ ≈ κe = K2 − . (103)
T K0
−50 µV/K, and S|| ≈ −100 µV/K at 273 K). For bismuth,
a value of ξ = (3 + s)/2 should be used in (114) describing In metals, (∂/∂ E)σ (E)| E=E F ≈ (σ/E F ), thus K1 ≈
electron conduction, instead of using Eq. (127), which is −(π 2 /3q 2 )(kT )2 (σ/E F ), or
for conduction by holes. 2 2
K1 K1 (π /3q 2 )(kT )2 (σ/E F )
≈
K0 σ/q 2
4. Alloys 2 2
π π (kT )2
Matthiessen’s rule states that the total resistivity of an alloy = (kT ) σ
2
, (134)
3q 2 3 E F2
formed by two metals can be found by
giving
1
ρ=
= ρi + ρ j , (128)
σ π 2k2 T π 2 (kT )2 π 2k2 T
κe ≈ σ 1 + ≈ σ, (135)
where ρi is the resistivity of the pure solvent metal due 3q 2 3 E F2 3q 2
to scattering of carriers by thermal vibrations, and ρ j rep-
resents scattering of carriers from impurities. This rule is where the approximation of (π 2 /3)((kT )2 /E F2 ) 1 was
often used for approximations but is not widely applicable used, thus arriving at the Wiedemann–Franz law, or
since many cases exhibit anisotropic scattering of carri- κe /σ T = 2.443 × 10−8 ((W · )/K2 ).
ers, causing a large deviation from (128). Assuming the The total thermal conductivity, κ, must also include a
validity of Matthiessen’s rule, (113) can be written lattice contribution, κ L , such that
κ = κ L + κe . (136)
π 2 k 2 T ∂ ln (ρi + ρ j )
S= , (129)
3 q ∂E EF The lattice thermal conductivity for metals is generally
much lower than the electronic contribution.
which can be written in terms of the difference between
S for the alloy and the thermopower of the pure solvent
metal, Si , or S = S − Si leads to D. Semiconductors
S 1 − (x j /xi ) The above analysis is applicable to normal metals, where it
=− , (130) is assumed that the carriers are electrons and ∇µ is a func-
S 1 + (ρi /ρ j )
tion of temperature only. Furthermore, the Onsager rela-
where tions were developed using four contributions, (33), (36),
∂ ln ρi ∂ ln ρ j (37), and (39), to the energy density rate of change, how-
xi = − and xj = − .
∂E EF ∂E EF
ever, two additional contributions exist for semiconduc-
tors. These contributions account for transitions of elec-
(131) trons across the bandgap, or the rate of change in carrier
Using the Gorter–Nordheim relation forthe impurity com- concentrations in each band, and for positional gradients
of the band edges (valence band and conduction band).
ponent of Mattheissen’s rule, ρi = C X 1 − X , where C
is the Nordheim coefficient and X is the atomic fraction of The last contribution could arise from temperature gradi-
the solute atoms in a solid solution, yields a more useful ents and/or compositional variations, for example. These
additional contributions have the form
relationship:
ρi ∂E
S = S j + (Si − S j ), = −qC (−∇ · Jn ) + qV (−∇ · J p ) (137)
ρ
(132) ∂t V
694 Thermoelectricity
Thermoelectricity 695
696 Thermoelectricity
Taking the derivative with respect to current and setting it refrigeration systems is 1.2 to 1.4, for a refrigerator oper-
equal to zero gives ating at a cold temperature of 263 K while the outside (hot
dϕ (−IR)[ST · I + I 2 R] − [ST + 2IR] I − 12 I 2 R − K (T0 − T1 )
=0= , (150)
dI [ST · I + I 2 R]2
where the substitutions S = (S A − S B ) and T = (T0 −T1 ) temperature) is at 323 K. Freon-based cooling systems
were used. After expansion and cancellation in the have coefficients of performance that would correspond
numerator, to a thermoelectric device with Z T between 3 and 4. Also
dϕ shown is the COP for the present value of Z T ∼ 1. The
=0 advantages of thermoelectric devices includes size scal-
dI
ability without loss of efficiency, robustness, low main-
I 2 −R − 12 R · ST + I [2KR(T0 − T1)] + K · ST 2 tenance, a relatively small electromagnetic signature, and
= .
[ST · I + I 2 R]2 the ability both to heat and to cool from a single device, and
(151) they are environmentally cleaner than conventional CFC-
based coolers. Many thermoelectric companies presently
Substituting = (S A − S B )T1 for the Peltier heat removed exist, indicating an existing market such that any increase
at the cold junction gives in Z T through a new material and/or configuration could
0 = I 2 −R(S A − S B ) T1 + 12 (T0 − T1 ) have a direct impact; however, a significant increase in
the market is anticipated for an increase in Z T to 2. This,
+ I [2K R(T0 − T1 )] + K (S A − S B )(T0 − T1 )2 . therefore, represents the current goal in Fig. 9.
(152) These devices are heat pumps, in that it is also pos-
sible to remove the electrical power source, and force
Solving this quadratic equation yields the maximum co- a temperature gradient across the thermoelectric device,
efficient of performance at the optimum current, by contacting one end of it to an external heat source.
(S A − S B )(T0 − T1 ) With a load connected to the device instead of the elec-
Iopt = √ , (153) trical power source, it then functions as a thermoelectric
R( 1 + Z T̄ − 1)
generator. Thus, the application of an electrical potential
where T̄ is the average temperature 12 (T0 + T1 ). Using this gradient causes the generation of a temperature gradient
in Eq. (143) yields (thermoelectric cooler) and the application of a temper-
√ ature gradient causes the generation of electrical power
T1 1 + Z T̄ − (T0 /T1 )
ϕopt = √ , (154) (thermoelectric generator).
(T0 − T1 ) 1 + Z T̄ + 1
where the first term represents the coefficient of perfor-
mance for an ideal heat pump. This shows that both ϕ and
T are directly dependent on the figure of merit, Z . Thus
maximizing the figure of merit for the individual materials,
S2 S2σ
Z= = , (155)
ρκ κ
maximizes the efficiency of the cooler. Desirable materi-
als have large-magnitude thermopowers, S (one n-type
and one p-type), and low electrical resistivities, ρ, or,
equivalently, high electrical conductivities, σ , and low
thermal conductivities, κ. Since the figure of merit has
units of K−1 , the unitless quantity of Z T is often reported.
It should also be noted that the Peltier heat, Q̇ = · I ,
is either absorbed or liberated based on the current di-
rection. Therefore, the same configuration can be used as
either a thermoelectric cooler or a heater.
For comparison, and to illuminate the present challenge, FIGURE 9 The figure of merit versus the coefficient of perfor-
the coefficient of performance for standard Freon-based mance.
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:48
Thermoelectricity 697
In the case of a generator, the efficiency, η, of the device TABLE III Desirable Material Properties for Thermoelectric
is defined as the ratio of the power supplied to the load to Applications (Kanatzidis, 2001)
the heat absorbed at the hot junction: 1. Many valley bands near the Fermi level, but located away from the
√ Brillouin zone boundaries.
TH − TC 1 + Z T̄ − 1
η= √ . (156) 2. Large atomic number elements with large spin–orbit coupling.
TH 1 + Z T̄ + (TC /TH ) 3. Compositions with two or more elements such as ternaries and
This is again dependent on the figure of merit of the device. quaternaries.
Through Thompson’s relations we can split the figure of 4. Low average electronegativity differences between elements.
merit for the device into a figure of merit for each of the two 5. Large unit cells.
legs. When each of these has been maximized individually, 6. Energy gaps near 10 kT.
then the total device figure of merit will also be maximized
assuming that one leg is n-type and one p-type. (PGEC) method (Slack, 1995), in which short phonon
mean free paths and long electron mean free paths are
simultaneously sought in a material. A suggested way for
C. New Directions
a material to exhibit PGEC behavior is making a material
Traditional materials used in thermoelectric devices are that incorporates cages and/or tunnels in its crystal struc-
listed in Table II. Near-room-temperature devices have ture large enough to accommodate an atom. The caged
been designed largely for cooling applications, while atom provides strong phonon scattering by rattling within
higher-temperature materials have been generally used in the cage. Electrons would not be significantly scattered
electrical power generation. Research on thermoelectrics by such “rattlers” since the main crystal structure would
was highly active during the decade following 1954, remain intact.
with the United States showing a great interest in high- Additional guidance has been provided by identifying
temperature power generation applications, such as the a B parameter defined as
Si–Ge-based generators used on the satellites Voyager I
1 2kT 3/2 √ k2
and II. Recently there has been a resurgence of interest B=γ 2 2
mx m ymz µx , (157)
in thermoelectrics, spurred on partly by predictions of the 3π h qκ L
high ZTs possible in quantum confined structures (Hicks where γ is the degeneracy parameter (Hicks and
and Dresselhaus, 1993). It was predicted that in such struc- Dresselhaus, 1993). This function should be maximized
tures, both the electrical conductivity and the thermopower for optimal Z T . With a large number of valleys within a
could be simultaneously increased due to the sharpen- band, fewer carriers can exist in each valley, thus increas-
ing of the density of states as confinement increases from ing the contribution to the thermopower from that valley.
3D → 2D → 1D → 0D (Broido and Reinecke, 1995). The At the same time, the total number of carriers can be main-
influence of such sharpening can be seen clearly within the tained for a high electrical conductivity. High degeneracy
Mott–Jones equation for thermopower (113). An indica- parameters are generally found in highly symmetric crys-
tion of the effect from a rapidly varying density of states tal systems. Large effective masses, or large effective mass
comes from mixed-valent compounds such as CePd3 and components in the axes perpendicular to the current flow,
YbAl3 , which have shown the largest power factor, σ S 2 , allow for a high electrical conductivity in the direction
among all known materials. Unfortunately, the high ther- of interest while maintaining a high B parameter. Equa-
mal conductivity in these materials prevents them from tion (157) also indicates that high mobilities, µx , in the
having a correspondingly high figure of merit. An ad- transport direction, and a low lattice thermal conductivity
ditional increase in Z T for quantum confined materials are also desirable. It has recently been shown that semicon-
comes from a decrease in the thermal conductivity due to ductors with bandgaps of approximately 10 kT best satisfy
confinement barrier scattering. these criteria (Mahan, 1998). Six properties of thermoelec-
Another avenue for investigating thermoelectric mate- tric materials that give the best results are listed in Table III.
rials has been coined the “phonon glass electron crystal”
VI. SUMMARY
TABLE II The Most Widely Used TE Materials
Z max (K−1 ) Useful range (K) T max (K) Macroscopic and atomistic derivations of the thermoelec-
tric properties of electrical conductivity, thermoelectric
Bi2 Te3 3 × 10−3 <500 300
power, and thermal conductivity have been presented, with
PbTe 1.7 × 10−3 <900 650
approximations for various material systems. The deriva-
Si–Ge 1 × 10−3 <1300 1100
tions outlined have considered external forces of electric
P1: GQT Final Pages
Encyclopedia of Physical Science and Technology EN016B-774 July 31, 2001 17:419678
698 Thermoelectricity
fields and temperature gradients. Additional effects are Broido, D. A., and Reinecke, T. L. (1995). “Thermoelectric figure of
realized when more forces such as magnetic fields are merit of quantum wire superlattices,” Appl. Phys. Lett. 67(1), 100–102.
included. These include Hall, magnetoresistance, Nernst, Gray, P. E. (1960). “The Dynamic Behavior of Thermoelectric Devices,”
Technology Press of the Massachusetts Institute of Technology/John
Ettingshausen, and Righi–Leduc effects. These magnetic Wiley & Sons, New York.
fields also affect the operation of thermoelectric coolers, Guggenheim, E. A. (1957). “Thermodynamics,” 3rd ed., North-Holland,
with significant enhancements of the efficiency possible Amsterdam.
under strong fields. Harman, T. C., and Honig, J. M. (1967). “Thermoelectric and Thermo-
magnetic Effects and Applications,” McGraw–Hill, New York.
Hicks, L. D., and Dresselhaus, M. S. (1993). “Effect of quantum-well
ACKNOWLEDGMENT structures on the thermoelectric figure of merit,” Phys. Rev. B 47(19),
12 727–12 731.
Ioffe, A. F. (1957). “Semiconductor Thermoelements and Thermoelec-
I wish to thank Sangeeta Lal, of Bihar University, for her helpful review tric Cooling,” Infosearch, London.
of the manuscript. Kanatzidis, M. G. (2001). The role of solid-state chemistry in the dis-
covery of new thermoelectric materials. In “Solid State Physics”
(H. Ehrenreich and F. Spaepen, eds.), Vol. 69, pp. 51–100, Academic
SEE ALSO THE FOLLOWING ARTICLES Press, New York.
Mahan, G. D. (1998). Good thermoelectrics. In “Solid State Physics”
ELECTROMAGNETICS • ELECTRONS IN SOLIDS • SEMICON- (H. Ehrenreich and F. Spaepen, eds.), Vol. 51, pp. 82–157, Academic
Press, New York.
DUCTOR ALLOYS • SUPERCONDUCTIVITY • THERMODY-
Roberts, R. B. (1977). “The absolute scale of thermoelectricity,” Philos.
NAMICS • THERMOMETRY Mag. 36(1), 91–107.
Slack, G. A. (1995). New materials and performance limits for ther-
moelectric cooling. In “CRC Handbook of Thermoelectrics” (D. M.
BIBLIOGRAPHY Rowe, ed.), CRC Press, New York.
Wendling, N., Chaussy, J., and Mazuer, J. (1993). “Thin gold wires as
Barnard, R. D. (1972). “Thermoelectricity in Metals and Alloys,” reference for thermoelectric power measurements of small samples
Halsted Press (Division of John Wiley & Sons), New York. from 1.3 K to 350 K,” J. Appl. Phys. 73(6), 2878–2881.
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
Transmission Electron
Microscopy
S. Amelinckx
J. Van Landuyt
University of Antwerp
53
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
factor. The image consists of the intensity distribution respects is complementary to other diffraction techniques
in a single diffraction spot. If this spot is the transmit- such as neutrons and X rays. Electron microscopy has been
ted beam, the image is called a bright-field image. If it especially successful in the study of crystal defects (e.g.,
is a diffracted beam, the resulting image is a dark-field dislocations, stacking faults, antiphase boundaries, do-
image. A dark-field image in a reflection for which the main boundaries, inversion boundaries, and discommen-
excitation error s has a large value is called a weak- surations) as well as long-period superstructures resulting
beam image. from the periodic arrangement of such defects (i.e., modu-
Dynamical diffraction theory Diffraction theory that lated structures, long-period antiphase boundary struc-
takes into account multiple scattering events. tures, and polytypes).
Ewald construction Geometrical construction that al- The existence of quasi-crystals, that is, objects with-
lows the directions of scattered beams to be derived out translation symmetry but with noncrystallographic
based on the concept of reciprocal lattice. rotation symmetry elements, such as 5-fold and 10-fold
Ewald sphere Sphere in reciprocal space with a radius of axes, was first demonstrated in 1984 by means of electron
1/λ used in the Ewald construction (λ is the electron diffraction and electron microscopy.
wavelength). Transmission electron microscopy finds applications in
Excitation error Distance in reciprocal space from a re- a wide variety of disciplines (e.g., solid-state physics,
ciprocal lattice node to the Ewald sphere, describing solid-state chemistry, physical metallurgy, mineralogy,
the deviation from the exact Bragg condition. and geology); for instance, the mineralogy of moon rocks
High-resolution electron microscopy The image is was studied mainly by means of electron microscopy.
formed by the interference between a large number In recent years the application of high-resolution elec-
of diffraction beams; with modern instruments, this tron microscopy to problems of crystal chemistry has been
allows the resolution of well-separated atom columns expanding very rapidly, and recently it has, in particular,
in crystal structures. been applied with considerable success to the study of
Kinematical diffraction theory Diffraction theory that high-critical temperature (T c ) superconductors. The study
takes into account only single scattering events. It is a of the atomic structure of surfaces is another very recent
reasonable approximation for neutron and X-ray scat- application.
tering but not for electron scattering. In 1986 the Nobel prize for physics was attributed to
Pendellösung effect Periodic transfer of intensity from the inventors of the electron microscope. Although the
the incident beam to the scattered beam in a two-beam invention dates back to 1937, it was not until 1958 that
situation. At the exact Bragg condition, the depth period transmission electron microscopy was used in the study
of this transfer is the extinction distance, which depends of solids. Rapid progress, following closely the progress in
on the reflection. electron optics, has been made since then; during the 1980s
Reciprocal lattice Lattice of which the base vectors A1 , atomic resolution became possible in many materials.
A2 , and A3 are related to the base vectors a1 , a2 , and
a3 of the direct lattice by the relation
(a2 × a3 ) (a3 × a1 ) I. BRAGG’S LAW
A1 = , A2 = ,
V V
The contrast of image produced in transmission electron
(a1 × a2 ) microscopy of crystalline solids is due mainly to diffrac-
A3 = .
V tion effects and, to a smaller extent, to absorption. In direct
Rocking curve Functional dependence of the transmitted space the diffraction of electrons by a crystal lattice can
and scattered intensity on the excitation error (i.e., on be described in terms of Bragg’s law (Fig. 1c):
the specimen orientation).
2d H sin θn = nλ, (1)
Scherzer focus Focusing condition leading to optimal
image resolution in high-resolution transmission elec- where d H is the interplanar spacing of the lattice planes
tron microscopy. The Scherzer focus does not coin- with Miller indices H (hkl), θn the Bragg angle, n an inte-
cide with the Gaussian focus considered in geometrical ger (i.e., the order of the reflection), and λ the de Broglie
optics. wavelength of the monokinetic electrons. This relation de-
fines the angles θn (i.e., the orientations of the crystal for
which the waves scattered by successive lattice planes are
TRANSMISSION electron microscopy combined with in phase and produce a peak in scattered intensity). The
electron diffraction has become an important tool in the peaks are very sharp because many unit cells contribute
study of the structural geometry of solids, which in many to the interference phenomenon.
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
AS = F sin π st /π s , (4)
V. DYNAMICAL THEORY
FIGURE 10 Schematic beam path in an electron microscope illustrating different imaging modes: (a) diffraction
contrast bright-field image, (b) diffraction contrast dark-field image, and (c) lattice image mode. [From Van Dyck, D.
(1978). In “Diffraction and Imaging Techniques in Material Science,” p. 355, North-Holland, Amsterdam.]
Defects, such as dislocations, produce a strain field that importance is in computing and interpreting the images of
causes local lattice curvature and, hence, produce images defects.
by strain contrast.
The relative displacement of two crystal parts along a
planar interface introduces phase shifts between the elec- IX. MOIRÉ FRINGES
tron beams diffracted by the two crystal parts. The in-
terference between these electron beams produces quasi- Superposed crystals that differ slightly in either orienta-
periodic intensity variations in the scattered as well as in tion or lattice parameter (or both) produce moiré fringes.
the transmitted beam. In a foil with constant thickness, The interference between the beam diffracted first by crys-
the intensities of the emerging beams vary with the depth tal 1 and subsequently transmitted through crystal 2 and
position of the interface in the foil in a quasi-periodic fash- the beam transmitted first through crystal 1 and subse-
ion; the quasi-period being the extinction distance tg (see quently diffracted by crystal 2 produces dark-field moiré
Sections V and VI). fringes. The bright-field moiré fringes result from the in-
The selected beam can be either a transmitted or a scat- terference between the doubly transmitted beam and the
tered beam. In the first case the image is called a bright- doubly scattered beam.
field image (Fig. 10a) and in the second a dark-field image Let the active diffraction vectors in crystal 1 and 2 be,
in the selected reflection (Fig. 10b). respectively, g1 and g2 , which are equal in magnitude but
differ slightly in orientation. The moiré fringes are then
perpendicular to ∆g = g1 − g2 (i.e., roughly parallel to
VIII. COLUMN APPROXIMATION g1
g2 ), and their spacing is = 1/|∆g|; ∆g can be con-
sidered as the wave vector of the moiré fringes.
The possibility to obtain a relatively undistorted image that The formation of moiré fringes can be simulated by
is in fact a highly magnified part of the diffraction spot is means of the optical analogues in Fig. 11, in which two
directly related to the small magnitude of the Bragg an- identical parallel line systems are superposed with a small
gles in electron diffraction. As a consequence of this, it angular difference. It is clear that much wider coincidence
is a good approximation to assume that electrons travel fringes are generated, which are the rotation moiré fringes
through the foil along narrow columns, the lateral dimen- (Fig. 11b).
sions of which are determined by the foil thickness t, by The parallel superposition of two line gratings with
the Bragg angles θ , and the upper limit of the lateral spread slightly different spacings, d1 = 1/|g1 | and d2 = 1/|g2 |,
w given by w = θ t. also produces coincidence fringes perpendicular to
It is, therefore, justified to associate a local intensity g1
g2 (i.e., to ∆g) and with a spacing = d1 d2 /
with the exit point in the back surface of a column, which (d1 − d2 ). The formation of parallel moiré fringes is dem-
has sampled a columnar volume of the foil. In a sense the onstrated by means of the optical analogue in Fig. 11a.
columns can be considered as picture elements (pixels) of Such fringes can be realized in the electron micro-
the image, which diffract quasi-independently. This pro- scope using sandwiches of two thin metal layers grown
cedure is known as the column approximation and its main epitaxially one onto the other. It is clear that if ∆g is
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
X. DISLOCATION CONTRAST
A. Translation Interfaces
The two parts of the crystal are related by a pure translation
described by the constant displacement vector R. If R is
not a lattice vector, such a defect is called a stacking fault
FIGURE 19 Contrast due to surface relaxation at the emergence (Fig. 21a). If the displacement vector R is a lattice vec-
points of dislocations in a platinum foil. The dislocations them- tor, but not a superlattice vector (e.g., in an ordered alloy)
selves are out of contrast in c; only the surface relaxation is visible. (Fig. 21b), the interface is called an antiphase boundary,
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
FIGURE 21 Schematic representation of different planar defects: (a) stacking fault (SF) in elementary crystal,
(b) antiphase boundary (APB) in an ordered alloy, (c) coherent twin boundary (TB), and (d) inversion boundary (IB).
[From Amelinckx, S., and Van Landuyt, J. (1976). In “Electron Microscopy in Mineralogy” (H. R. Wenk, ed.), pp. 68–112,
Springer-Verlag, Berlin.]
but strictly speaking only if R is one-half of a lattice distance, corrected for deviations from the exact Bragg
vector. If this is not the case, the same terminology may conditions, tg [i.e., as given by Eq. (6)].
be used although out-of-phase boundary would be more The final scattered beam results from the interference
correct. between (1) the beam transmitted through the first part and
The lattice planes with diffraction vector g in one part scattered by the second part T1 S2 and (2) the beam scat-
of the crystal are shifted with respect to those in the second tered by the first part and transmitted through the second
part of the crystal over a fraction of the interplanar dis- S1 T2− . Also, this beam periodically varies in intensity with
tance given by g · R. As a result, the electrons diffracted the position of the interface in the foil, the period being
by the second part of the crystal undergo a phase shift the same as for the transmitted beam (Fig. 22).
over α = 2πg · R with respect to those diffracted by the If the interface is inclined with respect to the foil sur-
first part of the crystal. The phase of the transmitted beam faces, one observes a set of fringes with a depth period
is not affected by this shift. Interference between (1) the equal to tg , in the bright-field image as well as in the dark-
beam transmitted by the first and, again, by the second field image, and the projected period depends on the in-
part of the foil T1 T2 and (2) the beam scattered by the first clination of the interface. We now discuss, in some detail,
part and, again, scattered in the incident direction by the the properties of the images of faults for which α = ± 23 π ,
second part, S1 S2− , gives rise to periodic variations of the which occur in close-packed structures.
final transmitted intensity with the position of the inter- In foils sufficiently thick for absorption effects to be im-
face within the foil (Fig. 22). The period is the extinction portant, the bright-field image is symmetrical with respect
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
FIGURE 24 Image formation in an ideal microscope. The diffraction pattern is the Fourier transform of the object
and the image is the inverse Fourier transform of the diffraction pattern.
amplitude is the Fourier transform of the function q(x, y). cal plane (i.e., the focal plane considered in geometrical
In turn, the diffraction pattern in this back focal plane acts optics), breaks down. This is due to the fact that the value
as a source of Huyghens spherical wavelets, which in- of sin β, which enters into the expression for the Lorentz
terfere to produce an enlarged image of the transmission force on a moving charge, can no longer be approximated
function. This image is, again, the Fourier transform of the by the angle β. This is analogous to approximating sin β
diffraction pattern. We can, therefore, conclude that the by β in Snell’s law for paraxial rays in ordinary optics, in
ideal microscope acts as an analogue computer and per- which higher-order terms, up to the third power in β, are
forms a double Fourier transformation apart from a linear required.
magnification, and, thus, reproduces the object. Unfortu- The radius of the disk of confusion in object space re-
nately ideal microscopes are not available and the actual sulting from this lens aberration is then given by
situation is somewhat more complicated.
ρS = C S β 3 , (8)
where CS is the spherical aberration constant, which has
XV. IMAGE FORMATION IN A REAL a value between 1 and 10 mm. A typical high-resolution
MICROSCOPE microscope operating at 200 kV has a value of CS = 1.2
mm. As a result of spherical aberration, electron beams
Real microscopes are subject to a number of limitations inclined at an angle with the optical axis suffer a phase
that induce deviations from the ideal imaging conditions shift χS with respect to the central beam (β = 0), which is
just described. given by
χS = 2π (/λ),
A. Spherical Aberration
where is the path difference caused by the beam that
In real magnetic lenses the paraxial approximation, which does not pass along the axis. From Fig. 25a it can be con-
leads to point-to-point representation in the Gaussian fo- cluded that = ρS sin β
ρS β and hence d = ρ dβ
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
FIGURE 25 Microscope aberrations. Phase shifts χ due to (a) spherical aberration and (b) defocusing. [From
Amelinckx, S. (1986). In “Examining the Submicron World,” p. 71, Plenum, New York.]
and dχS = 2πρS dβ/λ = 2πCS β 3 dβ/λ. After integra- and for the value of ,
tion from 0 up to the angle β,
= ε/(cos β) − ε ≈ 1/2β 2 ε,
χS = (1/2)πCS β /λ.
4
(9)
and hence,
χD = 2π /λ = π εβ 2 /λ. (11)
B. Aperture
ε > 0 means lens strengthening and ε < 0 means lens
The microscope contains an objective aperture that elim-
weakening.
inates beams that enclose an angle β with the optical axis
exceeding an angle βA to reduce the spherical aberration.
This imposes a limit to the theoretically achievable resolu- D. Chromatic Aberration
tion called the Abbe limit. A geometrical point is imaged
As a result of high-voltage instabilities of the microscope,
as a circle (the disk of confusion) with the radius
the incident electron beams exhibit a wavelength spread,
ρA = 0.61λ/βA , (10) since λ is related to the acceleration potential V by the
nonrelativistic approximate relation [Eq. (2)]:
which means, in practice, that only points separated by at
least this distance in the object can be observed as separate λ = h(2meV )−1/2 . (12)
points in the final image.
Moreover, variations in the lens currents I /I also cause
aberrations which are of the same nature.
C. Defocus A third origin of aberration is the inelastic scattering in
the specimen, which is equivalent to a change in energy
Most high-resolution images are automatically made un- of the electrons entering the lens system E/E. The net
der conditions where visual contrast is best. It turns out that effect of all these phenomena on the image formation is a
in the exact Gaussian focal plane the contrast is smallest, spread f on the focal distance f of the objective lens.
at least, for a phase object, that is, an object that changes The latter is proportional to E I −2 , assuming that E and
only the phase of the incident beam. One therefore usually I are uncorrelated; f is given by
works under somewhat defocused conditions. Also, defo-
cusing causes phase shift and a disk of confusion, which f = Cc [(E/E)2 + 4(I /I )2 ]1/2 .
we can estimate with reference to Fig. 25b.
The corresponding disk of confusion in object space has
When defocusing the electron microscope by an amount
a radius
ε, leaving the plane of observation unchanged, the object
being situated near the first focal plane, results in an ap- ρc = β f.
parent displacement ε of the object plane (Fig. 25b). One
clearly has The constant Cc is called the chromatic aberration con-
stant. Since instabilities at high voltages and lens currents
ρD = ε sin β ≈ εβ, can be reduced to smaller than 10−6 , f takes a typical
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
E. Beam Divergence
Because of the finite dimensions of the electron source
and the condensor lense aperture, the incident beam is
somewhat divergent. Under the intense illumination con-
ditions used in high-resolution imaging, the apex angle of
the illumination cone may reach a value of the order of
∼10−3 rad. FIGURE 26 Dependence of the phase shift χ on the angle β.
The influence of incoherent beam divergence on the [From Amelinckx, S. (1986). In “Examining the Submicron World,”
p. 71, Plenum, New York.]
image can be described as being due to the superposition
of independent images (i.e., intensities) corresponding to
different incident directions within the divergence cone.
G. Phase Shift
We have pointed out that spherical aberration and defocus
F. Ultimate Resolution
cause phase shifts of the nonaxial electron beams with
Apart from the lens imperfections discussed previously, respect to the axial beam. These phase shifts depend on β
which lead to image blurring and phase shifts, a number in the following manner:
of other imperfections occur, but these are unimportant
compared to those discussed. Furthermore, resolution also χ (β) = 12 πCs β 4 + π εβ 2 λ. (13)
depends on mechanical stability (e.g., vibration and drift);
these effects can be eliminated to a large extent by a proper Note that χ = 0 for the defocus value
microscope design.
ε = − 12 Cs β 2 .
The ultimate resolution is limited mainly by three, up to
the present, inevitable phenomena: finite aperture, spheri- This value still depends on β can be approximately satis-
cal aberration, and chromatic aberration. fied in only a limited range of β values. It is therefore nec-
The final disk of confusion has a radius given by essary to take the phase shifts into account when perform-
1/2 ing image calculations. The general aspect of the curve is
ρ = ρA2 + ρs2 + ρc2 . as presented in Fig. 26.
Because of the difference in angular dependence of the
different aberrations, it turns out that in present-day high-
resolution electron microscopes, chromatic aberration has XVI. IMAGE FORMATION OF A
only a relatively small influence. The limiting factor at WEAK-PHASE OBJECT
small angles (β < 5 × 10−3 rad) is the aperture, whereas
in the range β > 5 × 10−3 rad, the limiting factor is the The amplitude distribution in the back focal plane of the
spherical aberration. (This is true for E = 100 kV, Cs = objective lens is given by the Fourier transform (F) of
8.2 mm, and Cc = 3.9 mm.) the object function. In the case of a crystalline specimen
The curves ρA and ρs versus β [i.e., Eqs. (8) and the object function is the electron wave function at the
(10)] have opposite slopes. There is therefore a mini- exit face of the thin foil. The amplitude distribution in the
mum value for ρ, which occurs for ∂ρ/∂β = 0, where ρ = diffraction pattern is the Fourier transform of the wave
(ρA2 + ρs2 )1/2 . This minimum, which corresponds to the op- function. The final image amplitude is the Fourier trans-
timum compromise β0 between spherical aberration √ and form of the diffraction amplitude. However, the electrons
aperture effects, occurs for β0 = (0.61λ/Cs 3)1/4 . The are now moving in a lens system and therefore undergo the
corresponding radius of the confusion disk is then ρ0 = phase shifts χ(β) discussed previously. Moreover, an aper-
0.9λ3/4 Cs . Representative values are β0 = 5 × 10−3 rad
1/4
ture is limiting the number of beams transmitted through
and ρ0 ≈ 0.5 nm. This expression makes it clear that more the system. This can be taken care of by introducing an
can be gained in resolution by decreasing the wavelength aperture function in the plane of the diffraction pattern.
(i.e., by increasing the accelerating voltage) than by de- This function is 1 over the surface of the aperture and 0
creasing Cs . outside of this.
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
The main features of the image formation can be illus- value of χ is essentially negative, we cannot satisfy si-
trated in the simplest case, where the specimen can be as- multaneously the requirement sin χ = +1 (i.e., χ = π/2),
similated with a weak-phase grating. Since the wavelength but we can do so for sin χ = −1 (i.e., χ = −π/2).
of the electron is different in vacuum, and in the specimen It is sufficient to choose the defocus ε in such a way
the passage of the electron beam through the specimen that −π ε 2 /2λCs = −π/2 [i.e., εs = −(λCs )1/2 ]. Since
causes a phase shift that can be written χ (x, y) = σ φ(x, y), the sin χ function is stationary around χ = ±π/2, the
where φ is the projected lattice potential along the propa- sin χ -versus-β curve will present a flat part in the region
gation direction of the electrons, x and y are coordinates in of β = (−ε/Cs )1/2 provided ε = εs . The defocus value
the specimen plane, and σ = π/λE (λ is the wavelength of εs = −(λCs )1/2 , which corresponds to the optimum imag-
the electrons in vacuum, and E the accelerating potential). ing conditions of a phase grating, is called the Scherzer
The object function is then defocus, and the quantity (λCs )1/2 is used as a unit of
defocus. A more complete expression is εs = − 43 [λCs ]1/2 .
q(x, y) = eiσ φ(x,y)
1 + iσ φ(x, y). (14)
The dependence of sin χ on the diffraction angle β is
The image amplitude U (x, y) obtained by Fourier trans- represented in Fig. 27 for a typical situation close to the
formation followed by the inverse Fourier transformation Scherzer defocus. It is a rapidly oscillating function and
then becomes of the form therefore it is not possible to fulfill the required condition
for all β values. The curve in Fig. 27, which was drawn for
U (x, y) = 1 + σ φ(x, y) sin χ + iσ φ(x, y) cos χ , (15)
the Scherzer defocus value of −210 nm in this particular
where χ is supposed to be a constant. In reality, χ depends case, does exhibit a region where sin χ is approximately
on β (i.e., on x and y), but we shall show that the imag- −1 as required. Beams that are diffracted in this angular
ing conditions require that sin χ is at least approximately range give rise to an image that is a direct representation of
constant to obtain a directly interpretable image. the object. If not enough beams can be passed through this
In the particular case where sin χ = −1 (and, thus, “window” in the sin χ curve, the image may be rudimen-
cos χ = 0), the intensity distribution (i.e., the image) tary in the sense that it can give true detail only up to some
becomes maximum spatial frequency. This limiting frequency cor-
responds roughly with the β angle for which sin χ goes
I = UU ∗ or I (x, y) = 1 − 2σ φ(x, y), (16)
the first time through zero at the Scherzer focus, that is,
which clearly has a direct relationship with the object rep- βmax = (−2ε/Cs )1/2 ; for χ (βmax ) = 0 with ε = εs , this
resented by its projected potential φ(x, y). The image con- becomes βmax = 2(λ/3Cs )1/4 , and the radius of the cor-
1/4
trast, defined as (I − I0 )/I0
2σ φ(x, y), turns out to be responding disk of confusion is ρ = (λ/2)3/4 Cs , to be
directly proportional to the projected potential. compared with the expression just given. Figures 27a and
If one could make sin χ = +1, one would obtain b show two image transfer functions for two instruments;
the advantage of a high voltage becomes quite apparent
I (x, y) = 1 + 2σ φ(x, y). (17)
from the width of the window.
The intensity is now larger for a larger projected potential.
The situation is somewhat like positive and negative phase
contrast. The lenses have introduced phase shifts of π/2,
similar to what the quarter-wavelength ring does in op-
tical phase contrast microscopy. The lens aberrations are
exploited to produce phase contrast that would be absent
in a perfect microscope.
It is worth noting that the image (i.e., the positive or the observed after magnification by the electron optical sys-
negative deviations from the background) is proportional tem. Elementary geometrical considerations show that the
to the projected potential φ(x, y). spacing between the planes in this pattern is equal to the
spacing of the lattice planes for which the Bragg condi-
tion is satisfied. The direction of the fringes is perpen-
XVIII. LATTICE IMAGES dicular to the acting diffraction vector and thus parallel
with the lattice planes; their intensity distribution is sinu-
In images made using the diffraction contrast mode, only soidal since they are two beam interference fringes. The
one beam is used. A quite different type of image is ob- observed fringes represent admittedly rudimentary images
tained when admitting more than one beam through the of lattice planes, in terms of an intensity variation of the
selector aperture. In the simplest situation, two beams, electron beam. They can also be considered to represent
usually the transmitted beam and one scattered beam, are one Fourier component of the lattice potential.
selected and then made to interfere. The resulting interfer- In a sense, the crystal foil acts in this simple case on
ence pattern behind the exit surface of the crystal consists the incident electron beam in very much the same way
of straight sinusoidal fringes with a spacing equal to the that certain optical devices act on a light wave to produce
interplanar spacing corresponding to the chosen reflec- interference fringes by beam splitting (the Fresnel biprism
tion. Their formation is illustrated in Fig. 28, where the or mirror experiment).
transmitted and a single scattered beam are represented, If the crystal contains a planar defect (such as a stacking
assuming the Bragg condition to be satisfied exactly. The fault, an antiphase boundary, or a discommensuration wall
successive planar wave fronts (spacing λ) of maximum with displacement vector R), an intersecting set of lattice
elongation associated with the two beams ovelap in the fringes undergoes a lateral fractional shift along the inter-
space behind the foil and produce maxima (and minima) section line, given by g·R, where g is the acting diffraction
in a set of parallel planes. Although the plane waves are vector. The model allows a simple understanding of this
propagating, these parallel planes of maximum (or mini- effect. Suppose that in the second part of the crystal the
mum) amplitude form a stationary pattern that can be lattice planes are displaced over a distance y with respect
to the position in the first part, so as to occupy the dotted
position (Fig. 28). This shift will not affect the phase of the
transmitted beam but it will cause a phase shift of the scat-
tered beam; the path difference f is given by f = 2y sin θ .
This quantity also determines the relative phase shift of the
wave fronts diffracted by the displaced part with respect to
that diffracted by the undisplaced part. The stationary in-
terference pattern formed by T and Sd (dotted wave front)
will now be displaced sideways over a distance δ (into the
dotted position). From simple geometrical considerations
one can conclude the δ/ = (g · R)/d, which is the rela-
tion used in practice to determine the displacement vector
R of stacking faults and of antiphase boundaries from the
fringe shifts along the trace of the interface.
Likewise every experimental parameter or instrumental
factor (e.g., defocusing and beam tilt) that influences the
relative phase of T and S will also shift the fringe positions.
It is, in general, not possible to associate a fringe position
with a plane in the crystal structure. The fringe spacing
1/g has a structural significance—it is directly related to
the interplanar spacing in the crystal.
If the foil is oriented exactly perpendicular to the inci-
dent beam, the reflection +g and −g are both excited to the
same extent (the same s value). Lattice images can now
be obtained by selecting the three beams, −g, 0, and + g,
FIGURE 28 Intuitive model for the formation of lattice fringes in which exhibit fringes with a spacing 1/g as well as 12 g with
(a) a perfect crystal and in (b) a crystal containing a stacking fault. a different brightness (Fig. 29). The latter fringes (the sec-
[From Amelinckx, S. (1986). J. Electron Microsc. Tech. 3, 131.] ond harmonic) arise as a result of the interference between
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
A. One-Dimensional Images
If only a one-dimensional representation of the structure
is required because the structure is a long-period one-
dimensional superstructure, one can use the following
imaging techniques (Fig. 32a). FIGURE 31 Computed and corresponding experimental struc-
ture image of Au4 Mn. (a) Four computed images at optimum de-
focus but for different thicknesses. The left-bottom corner was
1. Mode 1 occupied by a manganese column. The brightest dots are thus lo-
cated at manganese columns. (b) Experimental image: Note that
One can select two neighboring superstructure reflections in the thick part only manganese columns produced bright dots,
belonging to the same basic spot in a row of spots passing whereas in the thinnest part all atom columns produce bright dots.
through the origin (i.e., in a central row). One uses only one The inset shows a model of the tetragonal structure. [From Van
Fourier component, and consequently the image reveals Dyck, D., Van Tendeloo, G., and Amelinckx, S. (1982). Ultrami-
croscopy 10, 263–280.]
only the long spacing.
4. Mode 9
All beams originating from the basic as well as from the
superstructure reflections are selected, provided they do
not correspond to spacings that are smaller than the in-
strumental resolving power of the microscope. For most
current instruments this means that up to the first or, pos-
sibly, up to the second shell of reflections, due to an FCC
matrix, can usefully be included. The face-centered-cubic
matrix will now be prominently revealed in the thin parts
of the specimen, whereas in the thicker part the super-
structure will be revealed.
Examples of high-resolution images are reproduced in
Section XXIV.
a succession of parallel two-dimensional phase gratings, or scattered beam(s) produces the image on a recording
separated by layers of vacuum with a thickness equal to medium (film, channel plate-CCD camera, etc.).
the slice thickness. Scattering of the electrons is followed In a second class of electron optical instruments (so-
slice after slice by computing the electron wave amplitude, called scanning electron microscopes), a fine electron
alternatingly taking into account the phase shift due to the probe is scanned across the specimen and the signal of
propagation between two gratings, followed by comput- interest, produced locally by the probe, is selected, de-
ing the diffracted amplitude by the successive gratings. tected, amplified, and displayed by modulating the inten-
The latter operation amounts to a Fourier transformation sity of the electron beam of a TV monitor which is scanned
as shown above. In practice, the slices can be taken to be synchronously with the probe. The signal can be observed
one unit cell thick. For each slice the exit waves of the either in backscattering, as in conventional scanning elec-
previous slice act as the input waves. The electron waves tron microscopy (CSEM or simple SEM), or in transmis-
emerging from the back surface of the specimen are sub- sion (STEM). In this chapter we consider only the latter
sequently assumed to suffer the angle-dependent phase case and compare it to CTEM (Fig. 33).
shifts introduced by the microscope before interfering to In a scanning transmission electron microscope
form the final image. Suitable computer programs using (STEM) the signal often consists of the transmitted (or
the multislice algorithm are available commercially. scattered) electron beam; however, other signals (e.g., X
More recently developed methods proceed by the di- rays) can also be detected, even in parallel with the elec-
rect retrieval of the projected structure. The main problem trons, provided that adequate detectors are available on
with direct retrieval is similar to the phase problem in the instrument.
X-ray diffraction; it is a consequence of the fact that only In CTEM the achievable resolution is determined
intensities are recorded in the image (and in the diffraction mainly by the quality of the imaging optics behind the
pattern), not amplitudes, unless use is made of holographic specimen; in STEM the resolution limit is determined
methods. mainly by the probe size (which is usually of the order
A recent retrieval method relies on the use of a sequence of 1 nm or less), i.e., by the probe forming optics ahead
of images taken at closely spaced foci (focus variation of the specimen.
method). This allows us to eliminate the effects of the In STEM the magnification is purely geometrical, i.e.,
microscopic optics and, thus, to obtain the corrected wave it is given by the ratio of the area on the monitor scanned
function at the exit face of the specimen. Using an analyti- by the electron beam to the corresponding specimen area
cal approach, based on the channeling model, then allows scanned by the probe. Whereas CTEM instruments are
us to obtain the projected structure. conceptually closely related to the classic light micro-
scope, STEM instruments are not unlike TVs; they are
essentially “mapping” or “plotting” devices of the spa-
XX. SCANNING ELECTRON tial variation of the various signals captured by adequate
MICROSCOPES detectors. The analog signals can further be digitized and
images can thus be electronically treated and, for instance,
So far we have discussed conventional transmission elec- magnetically stored.
tron microscopy (CTEM), in which the electron probe is The relationship between STEM and CTEM operat-
a parallel stationary beam incident along a fixed direction ing modes is represented schematically in Fig. 33. The
with respect to the specimen and in which the transmitted diagram should be read from right to left in the case of
FIGURE 33 Schematic ray paths of electron beams in an electron microscope in two modes: conventional electron
microscopy (CTEM) and scanning transmission electron microscopy (STEM).
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
CTEM and from left to right in the case of STEM, i.e., the
electrons are traveling in opposite directions in the two
cases.
In the STEM the source in A is often a field emission
gun, since this has a high brilliance. The objective lens
is a demagnifying lens system producing a slightly con-
vergent electron beam that becomes the probe, which is
scanned over the specimen by means of the deflector fol-
lowing a two-dimensional raster. Behind the objective lens
a convergent-beam electron diffraction pattern is formed.
Part of this pattern is selected, and the signal collected, am-
plified, and displayed on a TV monitor. In this way bright-
and dark-field images from elastically scattered electrons
can be produced; moreover, a variety of other modes of
operation is possible. Collecting many beams allows us to
image atom columns in crystals as a dot pattern.
In the Z -contrast mode an annular detector, capturing
the large-angle scattered beams only, allows us to obtain
an image formed predominantly by incoherently scattered
electrons. High-resolution images made in large-angle in-
coherently scattered electrons have the important property
that the bright dot image does not suffer contrast reversal
with changing defocus, i.e., the columns are always im-
aged as bright dots, irrespective of the amount of defocus
or foil thickness, whereas in CTEM the same atom col- FIGURE 34 Array of dislocations in stainless steel situated in
umn can be imaged either as a bright or as a dark dot, a glide plane observed by means of high-voltage electron mi-
depending on the thickness and focus. This difference in croscopy. [From Dupouy, G., Perrier, F., and Durieu, L. (1970).
behavior is a consequence of the difference in shape of the J. Microsc. 9, 575.]
image contrast transfer function (CTF). Under incoherent
imaging conditions the CTF decreases monotonously with
the spatial frequency, whereas in the coherent case it de- the shorter wavelength of the electrons used, they allow a
pends on the spatial frequency in an oscillatory manner. better instrumental resolution to be achieved. Moreover,
The dot brightness, furthermore, increases with the aver- the contrast transfer function of the lens system can be
age Z value of the atoms along the column. Chemically designed so it produces roughly the same phase shift for
different columns, are thus imaged as dots of different a larger angular range of beams, and hence more faith-
brightnesses. ful representation of crystal structures, than with 100-
Z -contrast high-resolution images can be interpreted in- kV microscopes. The use of high-resolution, high-voltage
tuitively; they are thus particularly well suited for imaging microscopy offers, at present, perspectives for the direct
of the geometry of defect configurations. Often an electron study of crystal structures. However, the displacement
energy loss spectrometer (EELS) is fitted to the outcoming and ionization damage produced by high-energy electrons
electron beam, which makes it possible to produce images constitute an intrinsic limitation that restricts the obser-
using only electrons having suffered no or a characteristic vation time as well as the resolution. Medium-voltage
energy loss, thus allowing chemical mapping. (∼300- to 400-kV) electron microscopes may turn out
to be the best compromise for a number of applications (a
400-kV image is shown in Fig. 51).
XXI. HIGH-VOLTAGE ELECTRON
MICROSCOPY
XXII. ANALYTICAL ELECTRON
Electron microscopes with an accelerating voltage signi- MICROSCOPY
ficantly higher than the conventional 100 kV have come
into use over the last decade. They offer the possibility of Apart from elastic scattering, which is responsible for
greater penetrating power and the use of thicker specimens electron diffraction, inelastic scattering events also oc-
that are more representative of the bulk (Fig. 34). Due to cur as electrons pass through the foil. Inelastic processes
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
FIGURE 37 Imaging filter (GIF) as an attachment to an electron microscope. It produces energy-filtered electron
images and diffraction patterns, also known as electron spectroscopic images and diffraction patterns. It also produces
electron energy loss spectra.
particular elements. A scheme of the experimental setup size available, enabling analysis of particles as small as a
(Gatan imaging filter) is shown in Fig. 37. few nanometers. Furthermore, use can be made of scan-
ning optics to make images in the signal of, for example,
one particular line, thus mapping the distribution of cer-
B. X-Ray Microanalysis
tain elements as in a micrograph, but on a much smaller
The higher energy state of the ionized atom can be reduced scale.
by an electron of an outer shell, e.g., L|| , falling into the
K shell. This process results in the radiative emission of
C. Auger Electron Emission
a photon with an energy equal to the difference between
the two excited states: In contrast with the radiative method of deexciting the
ion by emission of characteristic X rays, the energy lib-
E ph = E K − E L . erated by an electron falling into the inner shell can be
used to expel an electron from one of the higher shells.
The probability of these characteristic X-rays being emit- These have characteristic energies for each element and
ted is called the X-ray fluorescence yield. It increases with are called Auger electrons. These energies are rather low,
atomic number and is larger for K-line emissions, than for resulting in easy absorption and the limitation of the use of
L-line emissions. these electrons to surface analysis only. Because of the ne-
For X-ray microanalysis energy-dispersive crystal cessity of an ultrahigh vacuum for reducing inelastic colli-
detectors have nowadays replaced the curved crystal sions, this technique is not commonly used in conjunction
wavelength-dispersive systems based on Bragg diffrac- with electron microscopes and we do not elaborate on it
tion. This technique employs a lithium-drifted silicon de- further.
tector crystal, which has the advantage of covering a wide Both electron energy loss spectroscopy (EELS) and
range of energies simultaneously. The convenience of dis- energy-dispersive X-ray spectroscopy (EDXS) enable
play and spectrum processing and acceptable resolution (within their limits) convenient qualitative and quantita-
have made this technique very popular. An extensive ac- tive elemental analysis. Whereas EELS is more specifi-
count of the instrumental aspects can be found in the cally suited for light elements, EDXS is not, however, it
Bibliography. can detect a wide spectrum of elements, from beryllium
It is clear that X-ray microanalysis equipment mounted to uranium or from sodium to uranium, depending on the
on a TEM constitutes a powerful tool enabling observation kind of spectrometer used (wavelength dispersive or en-
of the substructure of interest at high magnifications and ergy dispersive). The former, using curved crystal geom-
in situ analysis of the elemental composition. This facility etry, is more suitable for quantitative analysis. The detec-
becomes even more powerful in a scanning transmission tion limit with EELS for an element in a matrix amounts
assembly (STEM) which has a very small electron probe to approximately 10−19 g, which means about 1000–100
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
atoms. For EDXS a sensitivity of 1% can be expected for ers to yield self-supporting disks that can be mounted di-
an element detectable in the presence of another one. rectly in the holder of the microscope.
Of the three techniques applicable in a TEM, X-ray Ceramics and semiconducting Si materials and like al-
microanalysis appears to be the most widely used. It is loys are successfully thinned by ion milling. Ar ions (5 kV)
limited mainly in the low-Z element side and in the resolu- bombard a rotating 3-mm disk under grazing incidence
tion of the imaging capabilities. Energy loss spectroscopy until perforation occurs, usually in the center of the disk.
has advantages in both these aspects, and its potential to Chemical thinning is also used for these materials, but ion
provide nanoscale information on the elemental beams is milling is more universal and reproducible.
most promising. Surface Auger spectroscopy has only lim- For high-resolution observations a number of materials
ited applications except in dedicated STEM instruments, which exhibit a conchoı̈dal fracture habit can be thinned
where instrumental conditions such as UHV and a high for HREM by mechanical crushing. The fragments are dis-
gun brightness are available. The three techniques, hav- persed on holey carbon grids, and after appropriate tilting
ing in common their origin of inner shell excitations by in the goniometer stage of the object holder, the thin frag-
the primary electron beam, appear to be complementary ments provide acceptable HREM observation conditions.
rather than competitive. Layered crystals can usually be sliced down to useful
thicknesses by repeated cleavage, and for some materials
thin films can be prepared directly by one of the vari-
XXIII. SPECIMEN PREPARATION FOR ous types of deposition techniques, e.g., evaporation and
TRANSMISSION ELECTRON chemical vapor deposition, on a substrate which is subse-
MICROSCOPY quently removed by dissolution. Multilayered-device ma-
terials are often prepared by this type of deposition or
One of the major drawbacks of electron microscopy tech- by in-depth doping or chemical treatment. For this type
niques is the necessity of thinning the sample down to of material it is often of major importance to know the
extremely small thicknesses, depending on the accelerat- succession of the layers, their thicknesses, and detailed
ing voltage used and on the resolution [diffraction contrast information on the interfaces. Therefore specimen prepa-
(100 Å) or HREM (1 to 5 Å) desired. The first is due mainly ration in “cross section” is required. Figure 38 shows the
to the higher penetration with increased voltage, and the successive steps in sample preparation in comparison to
second to increased inelastic energy loss, giving rise to “plan-view” specimen preparation as can be performed
spread and chromatic aberration, with a consequent loss with the above-mentioned thinning methods.
of resolution. For 100-kV microscopes a thickness of 1000 Nowadays highly sophisticated instrumentation for ion
to 2000 Å is typically required for the diffraction contrast milling semiconductor device samples allows viewing and
mode, whereas for HREM a thickness of 100 Å or less thinning in the same instrument particular areas in a device
is required. For 1000-kV microscopes thicknesses of up
to 1 to 5 µm of Si are acceptable for diffraction contrast
observations.
These thickness requirements and the preparation of
samples are undoubtedly destructive with respect to the
bulk original material, which is indeed one of the dis-
advantages of TEM techniques. Fortunately appropriate
techniques for specimen thinning have been developed
for specific types of materials whereby the destructive as-
pects are kept under control and very valuable information
on defect and structure characterization can be obtained.
A. Thinning Methods
The main procedures for thinning various types of materi-
als are summarized below. Sawing, slicing, and grinding
or cold work (milling) are usually required as prepara-
tory thinning procedures to obtain a starting thickness of
100 µm, from which the final thinning method proceeds. FIGURE 38 Scheme illustrating the successive steps to prepare
Metals and alloys are generally thinned by electropol- thin transmission electron microscope specimens in the plan-view
ishing. Trepanned disks are mounted in special inert hold- mode (left) and in the cross-section mode (right).
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
A. Dislocations
The first application of transmission electron microscopy
(TEM) was the study, by means of diffraction contrast, of
dislocation configurations in materials heat treated in dif-
ferent ways; TEM allowed the first moving dislocations
to be observed in situ. In Fig. 34 we reproduce an array of
dislocations confined to a well-defined glide plane in stain-
less steel. Since the dislocations in this material are dis-
sociated in Shockley partial dislocations, their movement
is restricted to a well-defined glide plane, as cross-glide
is difficult. This micrograph was made in a high-voltage
electron microscope, which allowed a rather thick foil to
be observed. The thickness can be deduced from the num-
ber of oscillations in the dislocation image.
In Fig. 39 a network of dissociated dislocations in
graphite is shown. The configuration is situated in the
(0001) plane, which is a glide plane and a cleavage plane.
The cleaved specimen is also limited by (0001) planes. FIGURE 40 Dislocation loops in irradiated platinum. A contrast
In Fig. 39a the extended dislocation nodes, which contain experiment allows identification of the loop as being due to vacan-
cies. [Courtesy of E. Ruedl.]
a stacking fault, exhibit a dark contrast. Also, one set of
triple-ribbon dislocations is visible. In Figs. 42b and c, the
configuration is imaged under three two-beam conditions.
In each of the three images one set of dislocations is out of
contrast; the Burger vector of the nonimaged dislocations
is perpendicular to the active diffraction vector.
Under irradiation, by means of neutrons, point defects
are formed that, with the appropriate heat treatment, ag-
glomerate into small dislocation loops. Vacancies and in-
terstitials may precipitate into separate disks, which, after
collapsing, give rise to dislocation loops of two types. It
is possible to distinguish these two types of dislocation
loops by means of contrast experiments since the Burger
vectors of the bordering dislocations have opposite signs
(Fig. 40).
On quenching, similar dislocation loops, resulting from
the agglomeration of vacancies, are observed in a number
of metals (Fig. 41). In metals with a low stacking fault en-
ergy, such as gold and cobalt, the vacancies introduced by
quenching are found to aggregate in stacking fault tetra-
hedra, a defect first discovered by means of TEM. The
edges of such a tetrahedron are formed by stair rod dis-
locations, whereas the faces are stacking faults (Fig. 42).
FIGURE 39 Hexagonal network of dissociated dislocations in the
basal plane of graphite: (a) stacking fault contrast; (b–d) different High-resolution electron microscopy allows one to distin-
line contrasts. [From Amelinckx, S., and Delavignette, P. (1960). guish between tetrahedra due to vacancies and tetrahedra
J. Appl. Phys. 31, 2126]. due to interstitials.
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
FIGURE 41 Dislocation loops introduced by quenching in alu- FIGURE 43 Procession of dislocations in a silicon substrate
minum. [From Hirsch, P. B., Silcox, J., Smallman, R. E., and formed during the fabrication process of a field effect transistor.
Westmacott, K. H. (1958). Phil. Mag. 3, 897.] [From Vanhellemont, J., Amelinckx, S., and Claeys, C. (1987). J.
Appl. Phys. 61, 2176.]
Figure 43 shows an application in the field of micro-
electronics. The sequence of dislocations results from the produced when the fault plane is inclined with respect
stresses caused by the fabrication process of field-effect to the foil plane. Figure 45 shows a bright- and a dark-
devices. From the geometry of this dislocation array, the field image of the same stacking fault, limited to Shock-
magnitude of the stress can be estimated. ley partials in a face-centered cubic copper–aluminum al-
The unambiguous interpretation of dislocation images loy. From the nature of the outer fringes in the dark- and
should be based on a comparison of computer-generated bright-field images, one can deduce whether the stack-
images, using dynamical diffraction theory with experi- ing fault is intrinsic (i.e., of the type ABCA–CABC) or
mental images. extrinsic (i.e., of the type ABCABA–CABC). The dis-
The atomic arrangement around the core of edge dislo- placement vector R of an interface, which relates the two
cations can be imaged by means of high-resolution elec- crystal parts, can be determined by making images in dif-
tron microscopy when viewed along a direction parallel ferent two-beam situations and the condition for the ab-
to the dislocation line. Figure 44 shows a 60◦ dislocation sence of contrast (i.e., so-called extinction) when g · R is
in silicon. an integer. The vector R describes the displacement of the
crystal part last met by the electrons with respect to the
B. Planar Interfaces front part. The problem of determining the nature of the
fault consists in determining the displacement vector R.
As explained, stacking faults are images of different shade
or fringe patterns. The first case arises when the fault plane
is parallel to the foil plane (see, e.g., Fig. 39). Fringes are
F. Surface Studies
Quite recently transmission electron microscopy was ap-
plied to the study of the atomic structure of surfaces. An
excellent vacuum and clean surfaces are required for such
studies. The reconstruction of surfaces has been demon-
strated in this way, as well as the migration of steps on
crystal surfaces during growth.
FIGURE 55 High-resolution image of a stacking fault tetrahedron
due to vacancies in silicon. [From Coene, W., Bender, H., and SEE ALSO THE FOLLOWING ARTICLES
Amelinckx, S. (1985). Phil. Mag. A 52, 369–381.]
BIBLIOGRAPHY
E. In Situ Studies
The specimen chamber of an electron microscope can be Amelinckx, S. (1964). “The Direct Observation of Dislocations,”
transformed into a small laboratory in which specimens Academic Press, New York.
Amelinckx, S., and Nabarro, F. R. N. (eds.) (1979). “Dislocations in
can be exposed to different environments and external pa- Crystals,” North-Holland, Amsterdam.
rameters by the use of the appropriate specimen holders. Amelinckx, S., Gevers, R., and Van Landuyt, J. (eds.) (1978). North-
Table II gives a survey of existing possibilities and some Holland, Amsterdam.
applications of the special specimen holders developed for Amelinckx, S., Van Dyck, D., Van Landuyt, J., and Van Tendeloo,
these purposes. G. (eds.) (1997a). “Handbook of Microscopy. Methods I,” VCH,
Weinheim.
In situ studies of electron radiation damage can be per- Amelinckx, S., Van Dyck, D., Van Landuyt, J., and Van Tendeloo,
formed in a high-voltage electron microscope; the elec- G. (eds.) (1997b). “Handbook of Microscopy. Methods II,” VCH,
trons create the damage and simultaneously form an im- Weinheim.
Amelinckx, S., Van Dyck, D., Van Landuyt, J., and Van Tendeloo,
G. (eds.) (1997c). “Handbook of Microscopy. Applications,” VCH,
TABLE I Microscope Holders and Their Applications Weinheim.
Amelinckx, S., Van Dyck, D., Van Landuyt, J., and Van Tendeloo, G.
Holder Application(s) (eds.) (1997d). “Electron Microscopy, Principles and Fundamentals,”
VCH, Weinheim.
Heating holder Phase transitions Bethge, H., and Heydenreich, J. (eds.) (1982). “Elektronenmikroskopie
Chemical reactions in der Festkörperphysik,” VEB Deutscher Verlag der Wissenschaften,
Order–disorder phenomena Berlin.
Melting phenomena Cowley, J. M. (1975). “Diffraction Physics,” North-Holland,
Amsterdam.
Annealing phenomena—grain growth
Edington, J. W. (1977). “Monographs in Practical Electron Microscopy
Cooling holder Phase transitions in Materials Science,” Mcmillan, New York.
Order–disorder phenomena Glauert, A. M. (ed.) (1981). “Practical Methods: Electron Microscopy,”
Environmental cell Chemical reactions North-Holland, Amsterdam.
Crystal growth Hawkes, P. W. (1972). “Electron Optics and Electron Microscopy,”
Taylor & Francis, London.
Straining holder Plastic deformation dislocation reactions
Head, A. K., Humble, P., Clarebrough, L. M., Morton, A. J., and
(possibly combined with
Forwood, C. T. (1973). “Computed Electron Micrographs and De-
heating and cooling)
fect Identification,” North-Holland, Amsterdam.
P1: GNH/GRD P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN017D-789 August 3, 2001 16:30
Jouffrey, B. (ed.) (1972). “Méthodes et techniques nouvelles d’obser- Williams, D. B., and Carter, C. B. (1996a). “Transmission Electron
vation en métallurgie physique,” SFME, Paris. Microscopy. I. Basics,” Plenum, New York.
Murr, L. E. (1970). “Electron Optical Applications in Materials Sci- Williams, D. B., and Carter, C. B. (1996b). “Transmission Elec-
ence,” McGraw–Hill, New York. tron Microscopy. II. Diffraction from Crystals,” Plenum, New
Spence, J. C. H. (1981). “Experimental High-Resolution Electron York.
Microscopy,” Oxford University Press (Clarendon), London and New Williams, D. B., and Carter, C. B. (1996c). “Transmission Electron
York. Microscopy. III. Imaging,” Plenum, New York.
Wenk, H.-R. (ed.) (1976). “Electron Microscopy in Mineralogy,” Williams, D. B., and Carter, C. B. (1996d). “Transmission Electron
Springer-Verlag, Berlin and New York. Microscopy. IV. Spectrometry,” Plenum, New York.
P1: GRB Final pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
989
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
line defined by the incident beam passing through the mous, and led also to medical applications. Röntgen him-
origin of reciprocal space. self suggested that X-rays might propagate as longitudi-
Extinction Loss of diffracted intensity due to coherent nal vibrations of ether, and their true nature was revealed
coupling of forward and diffracted beams (primary only gradually. The existence of short-wavelength elec-
extinction), or due to extra absorption arising from tromagnetic radiation could be inferred from Maxwell’s
diffraction (secondary extinction). theory, but the wavelength of X-rays is more than 1000
Interference function Function giving the structure of times smaller than the wavelength of visible light, and the
diffracted intensity in reciprocal space. Contains the properties of X-rays could not be extrapolated from ear-
small-angle scattering term and the Fourier transform lier observations. Röntgen received the first Nobel Prize
of the Patterson function. in Physics in 1901, and in the subsequent years several
Kinematical diffraction Interaction between forward Nobel Prizes were awarded to scientists working in this or
and diffracted beams is negligible. closely related fields.
Laue equations Diffraction conditions in reciprocal The wave nature of X-rays was established in the be-
space. ginning of the 20th century, and their wavelength could
Opening angle Angular width of the radiation cone from be estimated. In 1912 M. von Laue had the idea that
a single relativistic electron. X-rays would be diffracted by crystals, which were known
Phase problem Phase angles of the structure factors are to be three-dimensional regular arrays of atoms, resem-
not obtained from diffracted intensity. bling one- or two-dimensional optical diffraction gratings.
Powder sample Large number of randomly oriented The fascinating story of the first X-ray diffraction exper-
crystallites or grains in small particles. iment, which took place in Munich, Germany, has been
Reciprocal lattice vectors Span the reciprocal unit cell, told by P. P. Ewald (1962). The Laue-diffraction pattern
perpendicular to the planes defined by crystal unit cell was recorded on the photographic plate, and the inten-
vectors a, b, c. The volume of the reciprocal unit cell is sity maxima could be correlated with atomic structure of
Vc∗ = 1/Vc . the crystal. Soon after, father and son W. H. Bragg and
Refractive index Complex quantity, where the real part W. L. Bragg observed reflection of X-rays from cleav-
is slightly less than unity for X-rays and neutrons, and age surfaces of crystals, and they established the Bragg
the imaginary part corresponds to absorption. law, which gives the relation between the diffraction an-
Rietveld method Calculated diffraction pattern from a gle, X-ray wavelength, and spacing of atomic planes. The
structural model and peak shape functions is fitted to Braggs solved the first crystal structures, those of cubic
the experimental powder diffraction pattern. ZnS and alkali halides. The results were quite revealing,
Spallation source Neutron source where neutrons are because they showed that at least the crystals of inorganic
stripped from nuclei by a high-energy proton beam. compounds are not composed of units of molecules, but
Unit cell Basic building block of a crystal. The unit cell of atoms making up a three-dimensional framework. This
vectors a, b, c span a parallelepipedon of volume Vc . new concept was difficult to accept for some chemists, who
Wigglers and undulators Periodic magnetic structures considered the molecules the basic units of compounds.
placed in the straight sections of storage rings for en- However, structure determination by X-ray diffraction de-
hanced production of synchrotron radiation. veloped rapidly, and it formed the foundation for under-
standing the nature of solid matter. The importance of this
work was recognized immediately; von Laue received the
THE DISCOVERY of X-rays by W. C. Röntgen in 1895 Nobel Prize in Physics in 1914 and the Braggs the follow-
came in the middle of great changes in physics. Much of ing year.
the research in the last decades of the 19th century con- The neutron was discovered by Chadwick in 1932. By
centrated on the nature and propagation of radiation. New that time, the wave/particle dualism of radiation was gen-
forms of radiation, radio waves, and cathode rays were erally accepted, and diffraction of electrons by crystals
produced, and the old argument of particles vs waves was had been demonstrated. It was suggested that neutrons
rekindled. The prevailing ether theory was subjected to would be diffracted as well, and the first experimental ev-
serious blows, and the need for reformulation of emission idence was obtained in 1936. However, the neutron beams
and absorption laws of radiation was becoming evident. from radioactive sources were too weak for any quantita-
X-rays propagated like visible light, were not affected by tive neutron diffraction experiments, and it was only af-
electric and magnetic fields, but were not refracted and ter 1945, with the advent of nuclear reactors, that neutron
penetrated through light materials, such as soft tissues of diffraction became an important tool in study of condensed
human body. This last property made X-rays instantly fa- matter. The neutron wavelength is
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
“resonance” scattering. Under certain conditions the res- where S is the electron spin quantum number, γ the neu-
onance term is negative and large enough to make the real tron magnetic moment in units of nuclear magnetons, and
part of b negative. f m the magnetic form factor. This result for spin magnetic
If the scattering nucleus has a spin I , the compound nu- moment can be extended in cases where there is orbital
cleus has a spin I + 12 or I − 12 . These nuclei have different moment also. The form factor f m arises from outer elec-
scattering lengths b+ and b− , and the total cross section is tron shells, so that there is a strong dependence on the
scattering angle (see Fig. 28).
σ = 4π b2 = 4π w+ b+ 2
+ w− b−2
(11)
The formalisms of X-ray and neutron scattering are very
Here w+ = (I + 1)/(2I + 1) and w− = I /(2I + 1) are the similar, which makes many results directly comparable.
weight factors of the two possible compound nuclear However, the interactions with the atoms are fundamen-
states. The cross section is split in two parts, those of tally different, which make X-rays and neutron comple-
coherent scattering, S, and incoherent scattering, s, mentary probes. This has become more and more evident
with the use of synchrotron radiation, as will be seen in
σ = S + s = 4π(w+ b+ + w− b− )2
the chapters where applications are discussed.
+ 4πw+ w− (b+ − b− )2 (12)
Only the first term can produce interference, which is ob-
served in the diffraction pattern. II. BASIC FORMULAS FOR DIFFRACTION
There is even more diversity in b due to the existence
of different isotopes. In the general case, each of these has Scattering from an atom can be generalized to concern any
its characteristic b+ and b− . The different isotopes of an electron or nuclear density distribution ρ(r). It is typical
element will be distributed at random among the atomic that ρ(r) has several hierarchical levels. In a crystal the
positions, and in the case of a crystal, scattering can be unit cell is repeated in three dimensions, whereas many
divided into scattering from the average structure (ordered organic materials are built of chainlike molecules, which
scattering) and scattering from fluctuations of the structure form fibers, and these in turn structures of bundles, etc.
(disorder scattering). This division will be discussed in the Any object has a finite size, and in the following the effect
section where a general formulation for diffraction from a of size in diffraction is considered. The results are based
crystal is given. The different factors affecting the nuclear on the use of the convolution theorem of Fourier transform
scattering length complicate the interpretation of neutron F (Guinier, 1963). Functions g(r) and h(r) are defined in
diffraction patterns, but on the other hand they allow a real space and their Fourier transforms G(K) and H (K)
great variety of studies. Experimental determination of b in wavevector or reciprocal space. Then
is the basis of all quantitative scattering studies, and one of F[g(r)h(r)] = G(K) ∗ H (K) (15)
the methods used is determination of the critical angle of
the total external reflection. In the same way as in the case where * is the convolution operator.
of X-rays the refractive index for neutrons is a complex Consider a statistically homogeneous object of volume
quantity, V . When its shape is given by function τ (r), which is 1
inside the object and 0 outside, the electron (or nuclear)
n = 1 − N (λ2 /2π)
b (13) density is
where
b is the average value of the bound coherent scat- ρ(r) = ρ∞ (r)τ (r) (16)
tering amplitude of N nuclei in unit volume. For most
isotopes,
b is positive, and 1 − n ∼
= 10−6 for thermal neu- i.e., the object is “cut” by τ (r) from the infinite, statistically
trons. The critical angle is small, typically of the order of homogeneous object with density ρ∞ (r). The scattered
0.1 mrad, which makes possible optical components and amplitude is
experiments similar to those with X-rays. A(K) = A∞ (K) ∗ T (K) (17)
In addition to nuclear scattering there is scattering due
to the interaction between the neutron magnetic moment where T(K) is the transform of τ (r), and A∞ (K) that of
and that of the atom. The bulk of knowledge of the mag- ρ∞ (r).
netic structures of solids is based on neutron diffraction The intensity is the amplitude multiplied by its complex
studies. There are two important groups of atoms with net conjugate, and it turns out to be the Fourier transform of the
magnetic moments, namely the first transition elements autocorrelation (Patterson) function P(r) of the density
with incomplete 3d shells, and rare-earth elements with of scatterers,
incomplete 4f shells. The scattering cross section is
P(r) = ρ∞ (u)ρ∞ (u + r) τ (u) τ (u + r) d 3 u (18)
dσpm /d = (2/3)S(S + 1)γ 2re2 f m2 (14)
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
equations, we obtain recorded by rotating the crystal and the associated recip-
rocal lattice to make the relps to intersect Ewald’s sphere,
a∗ = b × c/Vc , b∗ = c × a/Vc ; c∗ = a × b/Vc and the detector is placed in the direction of the diffracted
(25) ray (it may be a stationary two-dimensional detector). In
reality, the relps are not points but small domains due to the
where Vc = a · b × c is the volume of the unit cell.
finite width of |T (K)|2 and mosaicity of the crystal. Also,
The significance of these results is the following.
the Ewald sphere is a bit “fuzzy” because of variations in
Diffraction maxima occur at discrete values of the scatter-
k0 . Therefore, the relevant quantity is the integrated reflec-
ing vector K = k − ko . It is easy to show that Khkl is the
tion, which is obtained when the relp traverses the Ewald
normal of the reflecting planes, and the Bragg equation is
sphere during the scan. The other possibility of recording
obtained,
the integrated reflections is to use radiation with a con-
2d sin θ = λ or K = 2π/d (26) tinuous wavelength distribution. The center of the Ewald
sphere covers a range along ko /2π , and relp’s between the
where d is the spacing of the lattice planes. The scattering
spheres of radii from 1/λmax to 1/λmin are intersected. This
amplitude at the diffraction maximum is called the struc-
is the Laue method of X-ray and neutron diffraction. In ei-
ture factor F(hkl), and it is in general a complex quantity.
ther method there are several geometrical factors involved,
The structure factors are the Fourier coefficients of ρ(xyz),
but the main result is that the integrated reflection is pro-
so that
portional to the square of the structure factor, |F(hkl)|2 .
ρ(x yz) = Vc∗ F(hkl) exp[−iKhkl · r] (27) Determination of crystal structure involves two basic
hkl steps: calculation of the unit cell vectors from the diffrac-
The integral values (hkl) span the reciprocal lattice, and tion pattern, i.e., inversion from a∗ , b∗ , c∗ to a, b, c, and
the volume of the reciprocal unit cell is Vc∗ = 1/Vc , which calculation of the electron or nuclear density from the
shows that the density of relp’s and the number of reflec- structure factors by Fourier inversion [Eq. (26)]. How-
tions increases proportionally to the volume Vc of the unit ever, the observed quantity is the intensity of the reflection,
cell. so that the information of the phase of F(hkl) is lost. This
Ewald’s construction illustrates the diffraction condi- is the famous phase problem of diffraction, and several in-
tion and reciprocal lattice (see Fig. 3). The vector −ko /2π genious schemes have been put forward for experimental
drawn from the origin of the reciprocal lattice defines the and theoretical solutions. These are discussed in detail in
center of Ewald’s sphere. Any relp lying on the sphere ful- another article (K. Ann Kerr, 1987). The basic assumption
fills the diffraction condition, and an intensity maximum in the preceding discussion is that the scattering ampli-
is observed in the direction k. The diffraction pattern is tudes of individual unit cells add up. This is the so-called
kinematical approximation of diffraction, where interac-
tions between the incident beam and diffracted beam are
ignored. In many cases this is a good starting point for
crystal structure determination, but usually corrections to
the kinematical approximation are needed.
The crystal is not a static, perfectly periodic three-
dimensional structure, but the atoms are displaced from
their ideal positions by δ j , and the structure factor be-
comes
F(K) = f j (K) exp[iK · (r j + δ j )] (28)
j
described by V (rm ), which allows the summation be ex- X-ray diffraction, but the following discussion applies to
tended to infinity. The use of K instead of Khkl is due to neutron diffraction as well with appropriate changes of the
the fact that in a distorted crystal there is scattered inten- meanings of the symbols.
sity between the relps. The average square of the structure The Darwin treatment begins with description of reflec-
factors is divided in two parts, tion from a layer of atoms. The resultant amplitudes of the
∗
reflected and transmitted beams are calculated using the
ym (K) = Fn Fn+m = |
Fn (K)|2 + m (K) (30)
formalism of Fresnel diffraction. The effects of electron
The first part gives diffracted intensity from the average binding, which lead to a complex scattering amplitude and
structure, the second from fluctuations of the structure. absorption, are ignored, but these can be included without
These may be due to static disorder, and the atoms are changing the essential results of the calculation. It is found
always in thermal motion. The motions are correlated, so that there is a π/2 phase shift in the reflected beam, and
that the thermal diffuse scattering (TDS) is not uniform this shift plays a very important role in crystal diffraction,
in reciprocal space. In fact, long-wavelength in-phase vi- because a beam that has been reflected twice has suffered
brations (acoustic phonons) make the TDS peak under a phase shift of π and is out-of-phase with the incident
the Bragg reflections, and their contribution must be sub- beam. The transmitted beam is the combination of the
tracted. If the average scattering and diffuse scattering incident beam and the forward reflected beam, and this
can be resolved in components, a very complete picture introduces a small phase shift and makes the real part of
of atomic positions and their displacements is obtained. the refractive index slightly smaller than unity, as already
The preceding separation of the average structure is discussed in Section I.A.
valid for X-rays, which travel at the speed of light, and There are two distinctly different cases in perfect crystal
in diffraction the instantaneous structure is “seen” by the diffraction. The geometry where the reflected beam exits
X-rays. In the course of the experiment the average of at the same surface as the incident beam enters is called the
the configurations is seen. This is not necessarily the case reflection or Bragg case, and the geometry where the beam
with neutrons, which may be slower than the elastic waves exits at the opposite surface is called the transmission or
(phonons) in the crystal. In such a case the TDS is modi- Laue case. The following discussion concerns the Bragg
fied, and for instance, the peak under the Bragg reflections case, but the Laue case will be discussed briefly.
is not observed. A detailed discussion is found in Interna- In Darwin’s theory, diffraction in a crystal is treated by
tional Tables for Crystallography, Vol. C, Ch. 7.4 (1999). considering the propagation of the reflected and transmit-
ted beams in a structure of parallel equidistant planes of
atoms. The resultant reflected beam is a coherent sum of
IV. PERFECT CRYSTAL: DYNAMICAL the beams reflected by one atomic layer and transmitted
THEORY OF DIFFRACTION through the layers above. The beam in the direction of the
incident beam is a sum of the transmitted beam and the
A complete description of diffraction from a crystal beams reflected back to the forward direction. The phase
requires that the interaction between the incident and shift of π in two reflections decreases the amplitude of
diffracted beams is taken into account. It may happen that the transmitted beam. Calculation of the amplitudes leads
there are several relps on the Ewald sphere at the same to difference equations for the transmission and reflection
time, so that more than one diffracted beam is excited. In coefficients of successive atomic layers. These are solved
most diffraction measurements the goal is to record the with the assumption that the changes in one layer are small,
intensities of reflections in the two-beam case, where for so that first-order approximations can be used. At the sur-
a given wavelength only one diffracted beam is excited at face of the crystal the ratio of the reflected amplitude to
the time. However, multiple diffraction provides important the incident amplitude is
information about the phases of the structure factors, and
recently this has been used for an experimental solution R0 /T0 = i p/[iε ± ( p 2 − ε 2 )1/2 ] (31)
of the phase problem of diffraction.
The interaction of the incident and diffracted beams where p = δ/ sin θ cos θ and ε = θ − θ0 . Here 1 − δ is the
can be treated on many levels of sophistication. All these real part of the refractive index, and θ the observed Bragg
treatments are called dynamical theories of diffraction. angle. By multiplying R0 /T0 by its complex conjugate
A good account of the approaches introduced by P. P. and reorganizing the terms several interesting features are
Ewald and M. von Laue is given by James (1962), but observed:
here we follow a simplified formulation given by Warren
(1969), based on the work of C. G. Darwin. All the various r When ε/ p is between −1 and +1, the incident beam
forms of dynamical diffraction theory were formulated for is totally reflected, i.e., I /I0 = 1
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
V. REAL CRYSTAL AND EXTINCTION, The size D of a mosaic block is best measured in units
POWDER DIFFRACTION of the extinction distance ,
t = D/ = D/[Vc /re λK pol,d F(hkl)]=A(γ0 γh )1/2 (34)
Most single crystals that are used in studies of crystal
structure are closer to the kinematical than the dynamical Here γ0 and γh are the direction cosines of the incident
limit of diffraction. It is seen from Fig. 5 that at small val- and reflected beams, respectively. It is seen that t can be
ues of A the integrated reflection increases linearly with made small and the kinematical limit approached, if the
the thickness. This is the range of kinematical diffraction, X-ray wavelength or K pol,d is reduced. Model calculations
where the reflected beam is weak and its power increases indicate that
linearly with the diffracting volume. However, deviations
from the conditions of kinematical diffraction are large yp (t) ≈ exp[−Ct 2 ] → 1 − Ct 2 (at small t) (35)
enough to warrant correction. The effect is called extinc- where C is a constant, which depends on the shape of the
tion, and many different methods have been developed to mosaic block. This can be used for extrapolation to zero
correct for it. primary extinction by changing λ and/or K pol,d (Suortti,
Most extinction theories and correction methods are 1982). Recently, synchrotron radiation has provided new
based on the concept of mosaic crystal. If the mosaic possibilities also in this area. In particular, wavelengths
blocks are sufficiently small and their orientations vary in the 0.1 Å regime can be used. These are an order of
appreciably, the blocks diffract independently, and the magnitude smaller than the ones used in traditional crys-
interaction between the incident and reflected beams in- tallography, so that 1 − yp (t) can be reduced drastically.
side the blocks is negligible. In such a case the crystal is Secondary extinction may be substantial even when pri-
called ideally imperfect, and the conditions of kinematical mary extinction is negligible. For instance, some crystals
diffraction prevail. Although the concept of mosaic crys- have a layer-like structure, where the thin layers are highly
tal is oversimplified, since imperfection in a crystal may parallel, but diffract incoherently. The interaction between
be the result of dislocations and inhomogeneous strains, it the incident (P0 ) and diffracted (Ph ) beams is described
has persisted. Darwin introduced two kinds of extinction, by energy transfer equations (Zachariasen, 1945)
primary and secondary, and these concepts have been used
since then. (∂ P0 /∂s0 ) = −µe P0 + σ Ph
Primary extinction is present when the integrated inten- (∂ Ph /∂sh ) = −µe Ph + σ P0 (36)
sity from each of the mosaic blocks is less than predicted
by kinematical theory. Before reaching an interior block Here s0 and sh are the coordinates in the directions
the beam may have been diffracted by several other blocks, of the incident and diffracted beams, respectively, and
so that the beam is attenuated more than by ordinary ab- µe = µ0 + σ the total attenuation coefficient, due to the
sorption. This effect is called secondary extinction, and linear absorption coefficient µ0 and the scattering coeffi-
it becomes negligible only when the disorientation of the cient σ (s0 , sh , θ ). If the transmitted and reflected beams are
blocks becomes large, or the reflection is weak. Conceptu- recorded as functions of θ, and the geometrical factors are
ally, primary extinction may be said to arise from coherent known, the transfer equations can be solved, if σ depends
or amplitude coupling of beams, while secondary extinc- weakly on (s0 , sh ). The extinction-corrected integrated re-
tion is due to incoherent or intensity coupling of beams. flection is σ (θ ) dθ . This method has been used in a few
With these concepts, explicit reference to the mosaic crys- cases, but with highly collimated, monochromatic SR ex-
tal model is avoided. perimental corrections for secondary extinction could be
The effects of primary and secondary extinction on done in routine way.
the integrated reflection are usually given by the expres- Usually, an angular distribution W (θ) of σ (θ ) is as-
sion sumed without measuring the actual reflectivity curve, and
the width of W is a parameter of the model. For a sym-
E obs (hkl) = yp y s E kin (hkl) (33) metrical Bragg reflection from a thick sample,
E obs = Q/2(µ0 + g Q) (37)
where yp and ys are the primary and secondary extinction
coefficients, respectively. These are average values, which where g Q is the mean secondary extinction √ coefficient.
refer to the integral over the reflectivity curve. It would be For a Gaussian distribution of W , g = 1/2 π η, where η is
more appropriate to use extinction coefficients that vary the standard deviation. In crystal structure determination
along the reflectivity curve, so that no assumptions of the a very large number of reflections is recorded, and the
distributions of the sizes and orientations of the mosaic individual reflectivity curves are not examined. Extinction
blocks would be needed. corrections are included in structure refinement, where the
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
model parameters are fitted by least-squares methods. The Third-generation synchrotron radiation laboratories are
most widely used model for extinction corrections is that actually accelerator complexes, which include the source
by Becker and Coppens (1974). of particles (electron gun), a preaccelerator (microtron
Diffraction from powder samples can be used to ap- or linac), a booster synchrotron, where the final particle
proach the kinematical limit. An ideal powder sample has energy is achieved, and the storage ring. The layout of
a randomly oriented distribution of small crystallites. In the European Synchrotron Radiation Facility (ESRF) is
terms of Ewald’s construction, the reciprocal lattice takes shown in Fig. 6. The storage ring is actually a polygon,
all orientations with respect to k0 , so that all reflections where the particles travel in a vacuum tube, and dipole
with K ≤ 2k0 take place simultaneously. The intensity of magnets bend the particle trajectory at the corners to
a given reflection is distributed uniformly on the cone in- make a closed orbit. The particles have formed bunches in
scribed by k about k0 . The diffraction pattern or a part the preaccelerators, and when they are injected to the stor-
of it can be projected on a two-dimensional detector. The age ring, their energy is typically several GeV; the length
Bragg reflections form concentric rings, and the diffrac- of the bunch is of the order of 1 cm, and its diameter is
tion pattern is projected on K by azimuthal integration. smaller than 0.1 mm. The fill of the storage ring can be
Secondary extinction is the same in all reflections, cor- varied from single bunch to about 1000 bunches, so that the
responding to the average scattering contribution to total time difference between bunches is from a few microsec-
attenuation. Primary extinction is usually small because of onds to nanoseconds, and the bunch length corresponds to
the small grain size of the powder particles, but broadening a few tens of picoseconds. The velocity is very close the
and overlap of the reflections make it difficult to separate velocity of light, because the electron or positron mass is
the background from the diffraction pattern. Again, SR about 104 times the rest mass. When changing direction in
has improved the quality of powder diffraction data by the magnetic field the particles radiate along the tangent of
making instrumental broadening of reflections nearly neg- the orbit, as will be seen in the following, and the loss of en-
ligible. Powder diffraction is now used for structure de- ergy is compensated for in radio-frequency cavities. When
termination, which is important because sufficiently large scattering from residual gas molecules in the vacuum tube
single crystals of many interesting materials are not avail- is small the lifetime of the particle beam is about 1 day.
able. However, the principal use of powder diffraction is The closed orbit of the particles can be described in
in materials science, in studies of grain size, strain, and phase space. The 1σ -contours of position and angle are el-
texture in metals, alloys, and ceramics. These quantities lipses in horizontal and vertical directions, and the area/π
are extracted from the shapes and shifts of the reflection of each one is called the horizontal or vertical emittance, εx
profiles, and from nonuniform distribution of intensity on or εz , respectively. A steady state is achieved in a few mil-
diffraction cones. Accounts of developments in powder liseconds after injection, and εx and εz are characteristic
diffraction are found in books edited by Young (1993), constants of the storage ring. The shape of the emittance
and Snyder, Fiala, and Bunge (1999). ellipse changes along the ring depending on the focusing
magnets, but the area is constant. The ratio εz /εx is called
coupling, and in modern storage rings it is a few percent.
VI. SYNCHROTRON RADIATION
The horizontal emittance of the ESRF is 3 × 10−9 m rad,
SOURCES AND PROPERTIES
which corresponds to a typical beam size σx = 300 µm
OF RADIATION
and divergence σx = 10 µrad. In the vertical direction,
both values are reduced by a factor of 10.
A. Sources
Synchrotron radiation (SR) is emitted when light charged B. Properties of SR
particles (electrons or positrons) moving with relativistic
velocity undergo radial acceleration. Small synchrotrons The radiation pattern of a nonrelativistic electron orbit-
have developed to large (circumference about 1 km) stor- ing in a magnetic field has the well-known dipole radiation
age rings, which are dedicated to production of syn- distribution I ∝ sin2 ϕ, where ϕ is the angle between the
chrotron radiation. Periodic magnetic structures called observation direction and the direction of radial acceler-
wigglers and undulators are inserted in the straight sec- ation in the rest frame of the electron. For a relativistic
tions of the storage rings to enhance the flux of the SR electron, this radiation distribution has to be transformed
sources. The latest advance in the field is development of by Lorentz transformation into the laboratory system. As
special storage rings and linear accelerators, where free a result, radiation is observed only within a narrow cone
electron lasers provide tunable coherent radiation of high in the propagation direction of the electron (Fig. 7). The
brilliance. The sources and properties of SR have been opening angle of this cone is approximately
discussed by Brefeld and Gürtler (1991). ψ = 1/γ = E 0 /E (38)
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 6 Layout of the European Synchrotron Radiation Facility (ESRF) in Grenoble, France. The circumference
of the storage ring is 845 m, and presently 40 beamlines are in operation. [ESRF Highlights 1999; reproduced with
permission.]
where E is the electron energy and E 0 is the electron rest ing magnet source, and those are illustrated in Fig. 8.
energy, 511 keV. The frequency spectrum observed by a The wavelength scale is given in units of the critical or
stationary observer is understood by considering the sit- characteristic wavelength λc , which is in practical units of
uation where the narrow radiation cone sweeps past in the bending radius R, magnetic field B, and the electron
a short time approximately equal to R/γ 3 c, where R is energy E,
the radius of curvature of the orbit and c the velocity
of light. For the values relevant for the ESRF bending λc [Å] = 5.59R[m]/E 3 [GeV3 ] = 18.6/B[T]E 2 [GeV2 ]
magnet, γ = 1.174 × 104 , R = 23 m, this single electron (39)
flash lasts 4.7 × 10−20 s. This short pulse, Fourier trans-
formed to frequency, contains a spectrum of harmonics The concrete meaning of λc is that it divides the emitted
up to 2.1 × 1019 Hz, which corresponds to 0.14 Å wave- power in equal halves. The universal curves can be used
length or 90 keV photon energy. In a real storage ring with for calculating the brightness and flux of radiation from a
many emitting electrons, a continuous spectrum covering given bending magnet.
the range from infrared to hard X-ray regime is observed. SR from a bending magnet is linearly horizontally po-
The spectrum of synchrotron radiation can be calculated larized when observed in the orbit plane. Out of the plane,
precisely, and SR is actually used to calibrate instruments the polarization is elliptical and can be decomposed into
utilized in astrophysics. The spectral brilliance is usually its horizontal and vertical components. These are shown
given in units of photons per second per mm2 source area in Fig. 7, and it is seen that the more intense horizontal
per mrad2 source divergence and per 0.1% bandwidth, component is closely approximated by a Gaussian of vari-
I = I (x, z, θ, ψ, E, t). Integration over the source area ance σψ , which is related the full-width-half-maximum by
yields the spectral brightness (intensity), and integration FWHMψ = 2.35 σψ .
over all angles yields the spectral flux. Universal curves of The sources of radiation in modern storage rings
spectral brightness and flux can be calculated for a bend- are mostly wigglers and undulators. These are periodic
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 7 Cone of synchrotron radiation from a relativistic electron, and vertical intensity distributions of the parallel
(electric vector in the orbit plane) and perpendicular components.
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 9 Spectral flux from an undulator. The numerical values correspond to the early operation parameters of
the ESRF.
compromises between optimal performance and technical they are rejected by a mirror set to reflect the fundamental
feasibility must be made. energy.
Two phenomena are utilized in X-ray optics: diffrac- Mirror technology has advanced enormously in recent
tion by single crystal or by synthetic multilayer structure, years. The rms surface roughness of 1 m long mirrors is
and total external reflection from a mirror. Focusing is on the level of atomic size, and the figure errors are less
achieved by curved crystals, multilayers, or mirrors. than 1 µrad. The usual mirror material is Si or SiC, and
the mirror can be coated by a thin metal layer to provide
A. Mirrors the desired critical angle. In order to benefit from the high
quality of the mirrors their shape must be maintained
It was already mentioned that the refractive index is a com- even under the high heat load of the SR beam. The mirror
plex quantity, where the real part 1 − δ is slightly smaller is water-cooled, and its shape is monitored by an optical
than unity. Therefore, total reflection takes place at the sensor. The sensor has feedback to piezoelectric actuators,
interface of vacuum and solid mirror, and if absorption is which correct for the effects of thermal deformations of the
small, there is a well-defined critical angle, mirror.
√
θc [mrad] = 2δ = 2.3λ (ρ Z /A)1/2 (44)
B. Multilayers
where λ is the X-ray wavelength in Å, ρ the mirror den-
sity in units of g/cm3 , Z the atomic number, and A the Multilayers are synthetic periodical structures, where al-
atomic mass. Absorption rounds off the sharp edge at ternating layers of light and heavy elements are deposited
the critical angle. Because of the small opening angle of on a substrate. Typically, the layer thickness for each crys-
SR totally reflecting mirrors provide efficient solutions tal element is 5 to 10 atomic layers, so that the period in
even when θc is only a few mrad in a typical case. Ideal the direction of the surface normal is 20 to 50 Å. Because
point-to-point focusing is achieved with an elliptical mir- of the large period, the Bragg angles are small, as seen
ror, when the source is at one of the focii. In practice, in Fig. 10. The figure shows also the regime of total re-
mirrors are ground to cylindrical shape in the sagittal di- flection at small incident angles. The relative width of the
rection and bent in the meridional direction. In addition Bragg reflection from a multilayer is 1/N in the energy
to focusing, an X-ray mirror acts as a low bandpass fil- scale, where N is the number of periods (typically be-
ter, as the critical angle is inversely proportional to the tween 100 and 1000), so that multilayers can be used as
X-ray energy. In many experiments, the high-energy har- wide bandpass monochromators and focusing elements.
monics from the monochromator cannot be allowed, and Actually, the X-ray optical properties of multilayers are
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 11 Geometry of double-crystal monochromator in nondispersive (n, −n) (solid lines) and dispersive (n, n)
(broken lines) settings.
be used only near normal incidence of the beam, which lenses is their small size, which limits the useful aperture
limits the available X-ray energies. Linear lenses form a of the SR beam.
line focus, but they can be bent cylindrically to focus also The fact that the refractive index of X-rays is smaller
in the other direction. Focus sizes in the micrometer range than unity can be utilized for construction of X-ray lenses.
have been achieved. The limitation of the Bragg–Fresnel For a long time, this was considered impossible, because
FIGURE 12 Laue–Bragg monochromator. Horizontal focusing is achieved by bent crystals in the geometry where the
source points and focii are on the Rowland circles. Vertical focusing is realized by a bent multilayer. [ESRF Highlights
1996/1997, reproduced with permission.]
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
E. Beamline
It was already mentioned that the power of the SR beam
from an undulator or wiggler is typically several kW.
Therefore, the optical elements and beamline instruments
must be placed in “hutches,” which provide radiation
shielding. For instance, the spectrum of the radiation from
the superconducting wavelength shifter at the ESRF ex-
tends beyond 1 MeV, and 60 mm of lead is needed for
shielding. The beamline components are under vacuum
to prevent contamination and production of ozone by the
SR beam. An example of a beamline is shown in Fig. 14.
The beam enters the first optics hutch through the shield
wall. It is limited by fixed apertures, and usually the “soft”
low-energy part of the spectrum is removed by absorbers.
The beam position is monitored, and the size of the beam
is limited by primary slits to match the optical elements
downstream. An adaptive mirror focuses the beam to in-
finity and removes high energies, a narrow energy band is
separated by a two-crystal monochromator, and the beam
is focused on the sample by the second mirror. At high
energies, where the critical angle of the mirrors becomes
too small, the mirrors can be moved away, and horizontal
focusing is achieved by the sagittally bent second crystal
of the monochromator. All the components of the beam-
FIGURE 13 Linear and circular Bragg–Fresnel lenses fabri- line are controlled remotely, and there are several feedback
cated on Si wafers. [ESRF Highlights 1994/1995; reproduced with loops, which are used to maximize and stabilize the inten-
permission.] sity of the exit beam. Usually the sample is enclosed in a
chamber, where it is under vacuum or high pressure, it can
be heated or cooled, and electric or magnetic fields can be
typically δ is of the order of 10−5 . However, a spherical applied. In diffraction studies, usually a four-circle or even
cavity inside a solid acts as a condensing lens, and if a six-circle diffractometer is used. Two angular motions are
long row of holes is milled in a weakly absorbing material needed to orient the crystal for fulfillment of the diffraction
such as Be, a beam traveling along the row is focused. conditions, and two motions are needed for rotation of the
Focusing in the other direction is obtained by a crossed crystal during the scan and for tracking the diffracted beam
row of holes. Again, focus sizes in the micrometer range by the detector. There may be an analyzer crystal in front
are achieved. of the detector to remove parasitic scattering. However,
Still another method of microfocusing is that of us- more and more two-dimensional detectors are used, which
ing tapered capillaries. The SR beam enters the capillary eliminates the detector movement and increases the speed
at the wide end, and after successive total reflections of data collection. On the other hand, most such detectors
from the inner walls of the capillary, exits from the other operate in the integrating mode without any energy res-
end. Submicron beam sizes have been achieved by this olution, which increases the background in the observed
method. The methods described above have their advan- diffraction pattern. A modern diffractometer is shown in
tages and disadvantages. With the exception of the bent Fig. 15.
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
VIII. X-RAY DIFFRACTION METHODS AND orientation of the crystal is greatly facilitated by area de-
APPLICATIONS OF SYNCHROTRON tectors, most notably by image intensifiers coupled to fast
RADIATION readout CCDs. In general, area detectors with associated
software have made collection and analysis of diffraction
The conventional X-ray diffraction methods are covered data a routine step in structure determination. On-line data
in another article (K. Ann Kerr, 1987), and the present analysis runs automatically as diffraction patterns are col-
one concentrates on new developments, which originate lected and keeps up with the data acquisition, so that the
mostly from SR laboratories. The methods are numerous, experimenter knows the quality of the data, completeness
and only short descriptions of the main areas of research etc. A graphical user interface interacts with different pro-
are possible. cesses (beamline control, data acquisition, and data anal-
ysis) running on different computers at different times.
Diffraction experiments with monochromatic radiation
A. Structure Determination,
use mostly the rotating crystal method to scan reciprocal
Protein Crystallography
space. A diffractometer, such as the one shown in Fig. 15,
Recent reviews of protein crystallography based on the use is used for crystal orientation and scanning. The data rates
of SR have been given by Bartunik (1991) and Helliwell are enormous, because the area detector has 106 to 107 pix-
(1992). The first steps of structure determination— els, and the dynamic range is 14 bits. It is estimated that
growing the crystal, determination of the unit cell and the rate of uncorrected data may be up to 150 GB/day. Ev-
the orientation matrix—have stayed the same as before, idently, mass storage and structured data base are needed
but there are some important advances. Very small crys- to handle the data. The essential step is reduction of the
tals can be used, since 1010 to 1011 photons/s can be fo- data to integrated intensities of the reflections, indexing the
cused on a sample of 10 µm diameter. Peak search and reflections, and making corrections for absorption, beam
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 15 Six-axis Kappa diffractometer at the Materials Science beamline of the ESRF. All axes are mechanically
independent, and they allow free choice of the scattering plane. The detector arm can be equipped with a crystal
analyzer or an area detector.
polarization, geometrical factors, and probably for extinc- heavy-atom derivatives is a standard problem in protein
tion. The resulting structure factors without information crystallography, and there is much empirical knowledge.
about their phase angle give the starting point for crystal A quite general technique is to introduce Se during
structure analysis. biosynthesis of proteins, and the K-edge of Se is at the
Synchrotron radiation provides new possibilities for ex- convenient energy of 12.66 keV.
perimental determination of the phase angles of the struc- The use of continuous radiation and stationary crystal,
ture factors. The use of multiple diffraction was mentioned i.e., the Laue method, has had a renaissance with the ad-
earlier, but the methods based on anomalous dispersion vent of synchrotron radiation. The intensity of SR as a
are more important at the moment. The atomic scattering function of energy can be calculated precisely, so that the
factor changes strongly near the absorption edges [cf. integrated intensities of the reflections recorded with dif-
Eq. (4)]. The principle of phase determination from struc- ferent energies can be converted to structure factors on a
ture factors where the scattering factor of one type of atoms common scale. A diffraction pattern is shown in Fig. 17.
is changed is shown in Fig. 16. The changes f and i f It is remarkable that this pattern was taken in a single
can be calculated from theory, but the free-atom values shot of 150 picoseconds using an image intensifier cou-
may be modified in the crystal. The imaginary part, which pled to a CCD camera, i.e., the pattern is due to diffrac-
is due to photoelectric absorption, can be determined tion of X-rays from a single electron bunch of the storage
experimentally by measuring the fluorescence signal as ring. On one hand, this means that it is possible to de-
a function of energy, and the real part can be calculated termine structures of proteins which disintegrate rapidly
from f by Kramers–Kronig relations. Unambiguous due to radiation damage. It is also possible to make pump-
phase angles are obtained from measurements with two or and-probe experiments, where a conformational change
three X-ray energies, where the changes in f and f are of a protein is triggered by a laser pulse, and by vary-
large. Most native proteins do not have heavy atoms with ing the time difference to the X-ray pulse the evolution
absorption edges in the range of energies suitable for of a change in structure can be followed. The changes of
diffraction studies, i.e., above 5 keV. Preparation of the structure factors are observed, and the changes of the
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 16 Determination of the phase angle of the structure factor by using the multiple-wavelength anomalous
dispersion (MAD) method. FA is the structure factor contribution of the anomalous scatterer, and F P that from the
rest of the unit cell. The phase angle φ is solved from the intersection points of the circles of radius |F| (Harker
diagram).
electron density are calculated from Eq. (27). One recent Radiation damage is a limiting factor in protein crys-
example is the study of binding and release of CO at the tallography. Most of the radiation falling on the crystal is
heme site of myoglobin. The movements of CO have been not used, either as wavelengths that are not reflected in the
studied extensively by infrared spectroscopy and molec- Laue method, or during the time spent between the reflec-
ular dynamics calculations, but time-resolved diffraction tions in monochromatic diffraction. Radiation damage is
of SR provides a detailed picture of the conformational not a linear function of dose, but also scales with the total
changes of the protein. The movement of the CO molecule duration of the experiment. Therefore, efficient data col-
to the docking site, where it stays ca 350 ns before leaving lection strategies are essential. It turns out that the damage
the pocket, can be seen. Then the molecule diffuses about caused by high-energy radiation is substantially less than
in the outer protein for a fraction of millisecond and comes damage at the energies normally used in protein crystal-
back to the Fe by random collisions. lography.
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
C. Surface Structures
The present understanding of the atomic structure of sur-
faces is to a large extent based on studies with SR. Because
FIGURE 17 Laue diffraction pattern from a protein crystal taken
of the high intensity, a sufficient signal can be obtained
in a single shot of 150 ps from an ESRF undulator. The band-
width is 7–11 keV, and the image was recorded by a CCD camera from an atomic layer, and total reflection at grazing inci-
coupled to an image intensifier. [ESRF Highlights 1994/1995; re- dence largely eliminates the signal from bulk. The stan-
produced with permission.] dard geometry of surface diffraction studies is shown in
Fig. 19. The SR beam is incident at a small angle α, and
diffraction is observed as a function of the lateral angle 2θ
B. Fiber Diffraction
along a line perpendicular to the surface.
It was already mentioned that many materials and tis- Surface diffraction takes place in a few top atomic lay-
sues found in nature have hierarchical structures. This ers of the surface, so that propagation of X-rays on the
means that there are many different length scales, and the surface must be studied more closely. The critical angle
diffraction pattern extends from ultrasmall angles to the αc , which is given by Eq. (44), corresponds to a wave vec-
wide-angle regime. Typically, the structures are built of tor transfer K c = 2k sin αc ≈ 2kαc , which is independent
long, chainlike molecules, which form fibrils, and these in of wavelength. Snell’s law can be written at small angles
turn are packed in ordered bundles. The interatomic dis- as α 2 = α 2 + αc2 . This implies that α must be imaginary
tances in the molecule are seen in the wide-angle regime for α < αc , which physically means that the transmitted
of diffraction, and the effects of the size and shape of the wave at grazing angle α is transformed into an evanes-
bundles in the ultrasmall-angle regime. Between these ex- cent wave propagating along the surface with an expo-
tremes, ordering is seen as diffraction maxima. Collagen is nentially damped intensity profile. The penetration depth
a good example of these hierarchical structures. There are approaches 1/K c at the limit α αc . The evanescent wave
many variants of collagen, and study of collagen structures can be considered as an incident plane wave, which will
and functions is a wide field of research by itself. Diffrac- be diffracted by the in-plane structures.
tion studies with conventional X-ray sources were limited Surface structures can be classified as termination of the
to well-ordered collagen structures, such as tendon. The three-dimensional crystal at the surface, reconstruction of
high intensity and collimation of SR has made possible the topmost atomic layers of the surface, terraces of atomic
high-resolution studies in the whole angular range. An layers on inclined surfaces, ordering of adsorbed mono- or
example is given in the following. submonolayers on crystal surface, and structure of films
Human dermis under the epidermis is mostly composed on surfaces. The termination of the crystal gives rise to
of fibroblasts in a matrix of collagen with small amounts so-called truncation rods, as the two-dimensional surface
of proteoglycans and elastin. The molecular structure of structure is Fourier-transformed to one-dimensional struc-
collagen is a triple helix, which is about 3000 Å long. The ture in reciprocal space. The relps of the bulk crystal lie on
molecules form twisted fibrils, where adjacent molecules these rods, but the intensity is not zero between the relps.
are parallel and shifted by 650 Å. In the dermis, these fib- Formally, this is most easily seen by using the convolution
rils are also parallel forming bundles of 20 to 100 fibrils, theorem. The three-dimensional δ-function of the crystal
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 18 Small-angle diffraction pattern from human dermis. The maxima in the diffraction pattern arise from the
axial periodicity of the collagen, the diameter of the fibrils and their hexagonal packing. [ESRF Newsletter No. 29,
1997; reproduced with permission.]
lattice is multiplied by a step function. The reciprocal lat- ing of Langmuir films are important for many reasons. The
tice is then convoluted by the Fourier transform of the films are model systems for studies of two-dimensional
step function, which is of form 1/(iK · c), if the lattice ordering processes, including superstructure and domain
vectors a and b are in the surface plane. The atomic lay- formation. Intermolecular interactions can be changed in
ers on the surface reorganize themselves spontaneously, wide limits by varying the surface charge density. Lang-
forming new bonds with other atoms on the surface and muir layers are also important models in membrane bio-
in the layers underneath. This produces two-dimensional physics for the study of lipid–lipid and lipid–protein in-
structures, which do not have the same periodicity as the teractions. Films can be “peeled” from the water surface
terminated crystal, and in reciprocal space fractional order to form mono- or multilayers, called Langmuir–Blodgett
rods are observed. Similar structures develop when atoms films, which have unique optical and electronic properties.
are adsorbed on a surface. An example is shown in Fig. 20. Information of the lateral structure of the Langmuir films is
Oxygen atoms on the (110) surface of Cu create a rect- based mostly on surface diffraction studies by SR. Because
angular unit cell, the so-called c6×2 structure. Studies of of the relatively large thickness of the film, the contribution
the surface structures are important from the technological of the substrate is rather small. A review of these studies
point of view, but also for understanding the mechanisms has been given by Als-Nielsen and Möhwald (1991).
of phase transitions, for instance. A review has been given
by Robinson and Tweet (1992).
D. Microdiffraction
Solid films of organic molecules can be grown on a gas–
water interface; these monomolecular layers are called It was already mentioned that the intensity of SR at an
Langmuir layers. In these films the hydrophilic part of ESRF crystallography beamline is 1010 to 1011 ph/s in a
the long molecule is embedded in the water subphase, spot of 10 µm diameter. The beam size can be further re-
whereas the hydrophobic part, often aliphatic chains, duced by capillary optics, and at the microfocus beamline
points towards the gas. Studies of the structures and order- of the ESRF the flux in a 2 µm2 spot is 4 × 1010 ph/s.
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
composition of an object, and diffraction adds another this evolves with time the speckle pattern changes, and
dimension by revealing the structures on the molecular observation of intensity fluctuations at a single point of
and atomic level. the pattern is a direct measure of the dynamics. The short
wavelength of X-rays provides atomic-scale resolution,
so that critical fluctuations near an order–disorder tran-
E. Powder Diffraction sition can be studied, for instance. Still another impor-
tant field is studies of structures under high pressure. In
One of the most important applications of X-ray diffrac-
diamond anvil cells small amounts of solids can be sub-
tion in materials science is powder diffraction. This
jected to pressure that is close to that at the center of the
method, which is widely used in nondestructive testing
earth. Understanding the structure of solids under such
of materials, has experienced a renaissance as a tool in
conditions is important in geophysics or even in astro-
structure research. The high intensity, collimation, and
physics, and high-pressure studies also address the basic
narrow energy band of a monochromatic SR beam have
questions of interactions between atoms. On the methods
virtually eliminated the instrumental broadening of pow-
side, the continuous and precisely known spectrum of SR
der reflections. It is possible to study small samples in
makes quantitative diffraction measurements with energy-
special environments, to record the full diffraction cones
dispersive techniques possible. In this method, scattering
by two-dimensional detectors for analysis of the texture
is observed at a fixed angle, and the spectrum is ana-
of the sample, or to insert a crystal analyzer between the
lyzed by a solid-state detector, i.e., the diffraction pat-
sample and detector to eliminate all incoherent scatter-
tern is recorded in the energy scale using the Bragg law
ing. Many interesting materials are not available as suf-
[Eq. (26)].
ficiently large crystals for single-crystal diffraction, and
In summary, the use of synchrotron radiation has made
powder diffraction may be the only possibility in structural
possible many new fields of research where X-ray diffrac-
studies of materials with coexisting phases. The density
tion is used. Perhaps the greatest impact is due to the use
of reflections becomes very large when the unit cell di-
of the coherence properties of SR in imaging and stu-
mensions approach 15 or 20 Å. Several pattern decom-
dies of correlations, and on the other hand, due to com-
position and indexing programs are available, and direct
bination of various methods in the submicron-length
methods can be used for solving the structure. In the final
scales.
phase, the atomic positions and even thermal parameters
can be refined by the Rietveld method, where the calculated
diffraction pattern is fitted to the observed pattern by least-
squares methods (see Section X.B.) IX. NEUTRON SOURCES
AND NEUTRON OPTICS
FIGURE 22 Layout of the experimental facilities of the Institut Laue-Langevin (ILL) in Grenoble, France. The diffraction
experiments are indicated by letter “D,” and these include single-crystal and powder diffractometers (e.g., D1) and
small-angle scattering instruments (e.g., D11). In addition, there are instruments for inelastic scattering (“IN”) and
other experiments. Also, the spectral range of neutrons (hot, thermal, cold) is indicated. [ILL Annual Report 1999;
reproduced with permission.]
mean energy of the Maxwellian distribution at 300 K is 38 There are fundamental differences between experi-
meV, which corresponds to 1.8 Å wavelength [cf. Eqs. (1) ments carried out at steady reactor sources and those made
and (2)]. at pulsed sources. This is illustrated in Fig. 23. A crystal
Neutron beams are extracted through holes that pene- monochromator is used to separate a narrow energy band
trate the moderator. The layout of the experimental stations from the distribution of thermal neutrons, and this is used
around the High-Flux reactor at the Institut Laue-Langevin for diffraction experiments much like the beam from an
in Grenoble, France, is shown in Fig. 22. To shift the spec- X-ray tube. On the other hand, the beam from a pulsed
trum in energy, a cold source (liquid deuterium at 25 K) source has a wide energy distribution, and for an efficient
and a hot source (graphite at 2400 K) have been inserted use of the beam time-of-flight (TOF) method is used. The
in the D2 O moderator. These extend the wavelength range pulse length is 1 to 50 µs, depending on energy, and their
between 20 Å and 0.2 Å. separation is 10 to 100 ms. The average intensity is low,
The second method of producing neutrons is with but very high within each pulse. With the TOF method,
charged particles (usually protons) striking target nuclei. all neutrons scattered to a given angle are exploited. The
Short bursts (<1 µs) of about 1 GeV protons produce diffraction pattern is analyzed as a function of the neutron
high-energy “evaporation” neutrons, which have energies wavelength, which is related to the flight time τ over
extending close to that of the incident proton beam. The distance L,
neutrons are moderated as rapidly as possible in hydroge-
λ (Å) = 0.3966τ (µs)/L (cm) (47)
nous materials such as polyethylene to provide a short
pulse of “slow” (E n < 10 eV) neutrons. The frequency of Actually, the TOF method is rather similar to the energy-
the pulses is between 10 and 100 Hz. dispersive diffraction method with X-rays, the difference
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 23 Comparison of steady-state (reactor) and pulsed (e.g., spallation) neutron sources. An energy band is
separated from the Maxwellian distribution of thermal neutrons by a crystal monochromator, while a wide spectral
band from the pulsed source is used for time-of-flight experiments. [International Tables for Crystallography, (1999).
Vol. C, reproduced with permission.]
being that the energy analysis of the neutron is done by to a small spot. It must be remembered, however, that
their speed. the phase space density of neutrons cannot be increased,
i.e., there is a trade-off between beam dimension and
divergence.
B. Neutron Optics
Multilayers were already discussed in Section VII.B.
Many of the optical elements used in neutron diffraction They can be used as broad-band monochromators and fo-
are similar to those used to focus and monochromatize cusing elements. The so-called “supermirror” is a multi-
X-rays or SR. The main differences come from the nature layer structure where the layer thickness changes, super-
of the neutron source, which has large dimensions, and imposing the reflectivity curves over a range that extends
from the fact that the neutron velocity is typically five from the regime of totally reflecting mirror to the limit
orders of magnitude smaller than the velocity of X-rays. determined by the smallest periodicity. In this range the
Absorption of neutrons is weak, allowing construction of reflectivity may exceed 90%, and the critical angle is three
many devices that could not be used with X-rays. or four times larger than that of 58 Ni.
Mirrors and multilayers are widely used in neutron op- Single crystals are used as monochromators in the
tics. The refractive index is less than 1, so that total ex- same way as with X-rays. However, the narrow reflectiv-
ternal reflection takes place below the critical angle θc . ity curve of a perfect crystal is not matched with the low-
This angle is small and proportional to the neutron wave- brilliance neutron source, and perfect crystals are used
length [cf. Eq. (12)]. Of the elements, 58 Ni has the high- only in some high-resolution applications, such as neutron
est critical angle, 0.120 at 1 Å wavelength. Guide tubes interferometry or backscattering instruments. The widths
with totally reflecting inner walls can be used to transport of mosaic crystal reflectivity curves match the neutron
neutrons long distances, as seen in Fig. 22. If the guide beam divergences much better, but they increase the beam
tube is curved, it acts also as a low-pass energy filter. Fo- divergence and the width of the wavelength band. A bet-
cusing mirrors can be built, but these become very long, ter choice is a crystal where the reflecting Bragg planes
because the critical angle is smaller and the beam size are highly parallel, but the lattice spacing varies. Such
larger than in the case of SR. Another solution is the use crystals are called gradient crystals, and they are usually
of capillary waveguides, which can be bent for focusing. Si–Ge crystals where the lattice spacing increases with
These are manufactured of glass, and different focusing increasing Ge content. Another possibility is to introduce
geometries are possible. For instance, an almost parallel a thermal gradient in the direction of the scattering vector.
beam is produced by a parabolic bundle of capillaries, The difference between a perfect, mosaic, and gradient
or if this is used in the reverse direction, a wide beam crystal is shown in Fig. 24 as a reciprocal lattice diagram.
from the end of a large-area waveguide can be focused Monochromators are also built from stacks of thin crystal
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
A. Neutron Crystallography
Properties of neutrons and X-rays for diffraction studies
have been compared by Bacon (1975). Certain things in
that comparison have changed with the use of SR, such
as magnetic scattering of X-rays and free choice of wave-
length in a wide range, but there are some basic differ-
ences, which make neutrons and X-rays complementary
probes. A large fraction of atoms in biological structures
is hydrogen, and the hydrogen contribution to the elec-
tron density is difficult to distinguish in X-ray diffraction,
even with the best methods using SR. On the other hand,
the neutron scattering amplitude of hydrogen is compa-
rable with those of other nuclei, and the positions and
thermal motion of hydrogen atoms in the unit cell can be
determined with almost the same accuracy as the posi-
tions of heavier atoms. The scattering amplitude of nor-
mal hydrogen 1 H is negative because of the large spin
incoherence, whereas the scattering amplitude of deu- FIGURE 25 Phase diagram of water. A new phase, ice XII, has
terium 2 H (or D) is positive, and incoherent scattering is been found at two isolated places of the pressure/temperature di-
small. The incoherent background is large when the hydro- agram, at (0.55 GPa, 260 K) and at (>1 Gpa, <150 K). In the
gen content of the sample is high, and particularly in pow- latter domain, formation of ice XII competes with formation of
the high-density amorphous ice (HDA). [ILL Annual Report 1999;
der diffraction it is advisable to use deuterated compounds, reproduced with permission.]
if possible. In general, comparison of diffraction data of
normal and deuterated samples highlights the positions of
the hydrogen atoms. small differences in atomic numbers. Sufficient contrast
The structure of water and ice is a network of hydrogen may be obtained when anomalous dispersion of X-rays is
bonds between the atoms, and depending on temperature used, but this is not feasible in the case of light elements.
and pressure, many different phases are formed. The phase For instance, the coherent scattering cross section for Mg
diagram in Fig. 25 shows the richness of structures that is more than twice as large as that for Al. Isotopes of the
arise from changes in the bonding network. This network same elements may have very different scattering ampli-
collapses at sufficiently high pressures, and for instance, tudes, as already seen in the case of hydrogen; this fact
the common crystalline hexagonal ice Ih is compressed can be used to tailor the scattering amplitude, if isotopes
into a high-density amorphous state (HDA), which trans- can be enriched.
forms upon heating into low-density amorphous phase The amplitude of nuclear scattering is independent of
(LDA), and finally into the cubic phase Ic . When water is the scattering vector, and the intensity falls off only be-
confined to microporous solids, the boundary conditions cause of the thermal motion of the atoms. The effect on
have strong influence on the structure. The liquid phase the intensity is given by the Debye–Waller factor,
prevails tens of degrees below 00 C, and the structure is
DW = exp[−2B(sin θ/λ)2 ], (51)
the same as in the solid LDA phase (ILL Annual Report
1999). Structures and driving mechanisms of transitions where the thermal motion parameter is related to the mean-
in water and ice are of fundamental importance, and neu- square displacement of the reflecting Bragg plane in the
tron diffraction has provided most of the experimental direction of the scattering vector by B = 8π 2
u 2 . In the
evidence. case of X-rays, the high-order reflections are suppressed
Another field where the nonsystematic variation of the because the atomic scattering amplitudes decrease with
neutron scattering amplitude is exploited is studies of al- increasing scattering vector, and separation of the effects
loys. In many cases the alloys contain atoms that have of thermal motion becomes difficult. Combination of
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
X-ray and neutron diffraction data in the so-called XN calculated intensity. The summation runs over all data
studies has been important in accurate analysis of charge points. The profile function is convolution of the instru-
density distributions of crystals. ment function by the function describing the diffraction
domain in reciprocal space. Customarily, Gaussian shape
of the profile function is assumed, although it is known
B. Powder Diffraction and Rietveld Method
that the tails of the reflections extend further out than those
Refinement of the crystal structure by the Rietveld method of the Gaussian. Other shapes that are used to describe the
was already mentioned in connection of powder diffrac- profile function include Lorentzian, Voigtian (convolution
tion with SR. Actually, the method was developed for anal- of Gaussian and Lorentzian), or Pearson VII functions.
ysis of neutron diffraction data. Within the kinematical The main problem in Rietveld analysis is modeling the
approximation of diffraction, the integrated intensities of background, which is not constant or even smoothly vary-
powder reflections can be calculated from the positions ing because of disorder and thermal diffuse scattering.
and thermal motion amplitudes of the atoms. This distri- However, it is possible to reduce the parameters of the
bution of δ-functions in the scattering angle scale is con- model to those describing the structure, thermal motion,
voluted by the so-called profile function. The model of the grain size, strain, and texture of the powder sample, when
structure is refined by least-squares minimization of the the instrument function is calculated from the known
residual, diffraction geometry, and a proper model for background
is included in the refinement (Honkimäki and Suortti, in
M= wi [yi (obs) − yi (calc)]2 (52)
Snyder et al., 1999). An example of powder diffraction
Here yi (obs) is the intensity measured at a point i in the pattern and the fit of the model by the Rietveld method is
diffraction pattern, wi is its weight, and yi (calc) is the shown in Fig. 26.
FIGURE 26 Neutron and SR powder diffraction patterns of zeolite ZSM-5 with 3.4 benzene molecules absorbed per
unit cell. The neutron wavelength is 3 Å, and the SR wavelength 1 Å, i.e., the Bragg angles in the neutron diffraction
pattern are about 3 times those of the SR diffraction pattern. Note the superb resolution of the SR data, and the
relatively high intensities of the neutron reflections at large scattering angles. The benzene positions in the channels
of ZSM-5 are shown to right. [Courtesy of A. Fitch, ESRF.]
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
C. Magnetic Structures
In spite of all the advances of magnetic SR diffraction,
neutron diffraction remains the primary source of infor-
mation on magnetization of solids. In the same way as
in X-ray diffraction, there are two distinct areas of re-
search: determination of magnetic structures, and studies
of magnetization densities of known structures. Magnetic
scattering arises from the interaction between the neutron
magnetic moment and the magnetic moment of the atom.
Magnetic scattering is superimposed on scattering of neu-
trons from the nuclei, which makes the situation much
more complicated than that of X-ray scattering from elec-
tron charge distribution. The unpaired electrons of atoms
give rise to a net magnetic moment. In the first transi-
tion series these are the 3d and 4s electrons, and in the
rare-earths the 4 f electrons. As to the interaction with the
neighboring atoms, which results in magnetic ordering,
these groups of atoms are very different.
In paramagnetic materials the directions of the atomic
magnetic moments vary randomly, and the magnetic scat- FIGURE 27 Magnetic structure of antiferromagnetic MnO. Mag-
tering is distributed in the background of the diffraction netic moments are parallel to each other in an (hhh) plane of Mn
pattern. The scattering cross section is [generalized from atoms, and the direction of the moments is reversed in the next
Eq. (14)] plane. The magnetic unit cell is twice the chemical unit cell in all
three directions.
dσpm /d = (2/3)J (J + 1)(gγ /2)2re2 f 2 (53)
where g is the Landé splitting factor, J the total angu- net is the same as the chemical unit cell, so that magnetic
lar momentum, γ the neutron magnetic moment (−1.91 reflections coincide with the nuclear reflections. In an an-
nuclear magnetons), re the classical electron radius, and tiferromagnet the magnetic unit cell is usually larger than
f the magnetic scattering factor. The magnetic scattering the chemical unit cell, so that the magnetic structure gives
factor is defined analogously to the X-ray atomic scatter- rise to superlattice reflections between the nuclear reflec-
ing factor, tions. The dependence of the integrated intensity on the
structure factors is given by Eq. (49). It is seen that when
f (K) = M(r) exp (iK · r) d 3 r (54) the neutron beam is unpolarized the middle term averages
to zero. This simple argument applies only to ferromagnets
where M(r) is the normalized magnetization density. If and antiferromagnets, where all the magnetic moments
other contributions to the background (incoherent nu- are either parallel or antiparallel to a single direction. The
clear scattering, multiple scattering, and TDS) can be sub- detailed treatment of polarization is much more compli-
tracted, the magnetic scattering factor can be determined. cated, particularly for noncollinear arrangements of spins
In a ferromagnet all the moments in a single domain such as are found in helimagnetic compounds. In general,
are aligned parallel. A simple antiferromagnetic structure the neutron scattering cross section consists of five sepa-
is shown in Fig. 27. In an antiferromagnet the atoms may rate terms: coherent nuclear scattering, incoherent isotope
be considered to lie on two sublattices whose spins are and nuclear spin scattering, nuclear–magnetic interference
oppositely aligned. In the picture of local magnetic mo- term, purely magnetic scattering, and polarization depen-
ments, exchange interaction couples the spins of neigh- dent term in complicated structures (Bacon, 1975).
boring atoms. The potential energy is Vi j = −2Ji j Si S j , When the purely magnetic scattering contribution to the
where Si and S j are the spin quantum numbers of the two total diffraction pattern can be separated and the structure
atoms, and Ji j is the exchange integral. For a ferromagnet is known, the magnetic form factors of atoms can be calcu-
this integral is positive, leading to parallel orientation of lated. This is important for two reasons: The form factors
spins, while for an antiferromagnet Ji j is negative. The are needed for solving magnetic structures from inten-
exchange integral is due to overlap of atomic orbitals, and sity measurements, and secondly, electron distributions
the value and sign of the integral depend on the distance can be calculated by Fourier transform. The uncompen-
between the atoms. The magnetic unit cell of a ferromag- sated spin and orbital magnetic moments are due to the
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
separated by polarization analysis of the diffracted SR this has made magnetic diffraction of SR complementary
beam. The order is ferromagnetic in the basal plane, i.e., to neutron diffraction
in a monolayer of atoms. Then from one layer to the next, r Diffraction at very high energies: Bulk properties of
the orientation turns a certain angle, setting up a magnetic solids are probed in the same way as in neutron diffraction;
spiral perpendicular to the planes. However, the periodic- extinction-free measurements of structure factors have
ity of the helix does not coincide with the crystal planes, become possible.
i.e., the spin helix is incommensurate. At the transition
temperature the turn angle is 50◦ decreasing continuously In many areas there has been important synergy between
to 30◦ at 20 K. At this temperature the magnetic moments synchrotron radiation and neutron diffraction studies, and
tilt upward out of the basal plane, locking into a commen- this has also stimulated neutron research, in areas such as
surate helical magnetic-cone order. Recently magnetic or- structures of thin films and fibers, and magnetism-driven
dering in a thin epitaxial Ho(00.1) film could be studied by structural changes. It seems evident that there are many
observing the (00.0 + τ ∗ ) magnetic satellite reflection. It new, exciting developments over the horizon, and there
turns out that the Néel temperature is lowered to 105 K and the new sources of neutrons and synchrotron radiation are
there is no lock-in transition to commensurate magnetic expected to play leading roles.
structure. These results are technologically important, as Safety concerns have made building and operation of
magnetic nanostructures are integrated in new electronic nuclear research reactors more and restricted, and in re-
devices. cent years rather few reactors have been built while some
of the existing ones have been closed down. This is a great
concern to scientists, because neutrons are very versatile
XI. FUTURE DEVELOPMENTS probes of condensed matter, and their complementarity
with SR has opened up many new areas of research. There
Some 20 years ago, X-ray and neutron diffraction could is a clear “shortage of neutrons,” and to meet the grow-
be considered mature fields of research, where well- ing demand new projects of spallation sources have been
established methods were used for structural studies in put forward. A project called European Spallation Source
physics, chemistry, and biology. However, the increasing (ESS) has been initiated and, at the time of writing, is
use of synchrotron radiation was changing this picture, in study phase. The corresponding project in the United
and in the past decades the synchrotron laboratories have States is more advanced. The Spallation Neutron Source
been motors of new developments. Research in many tra- (SNS) will be located at Oak Ridge National Laboratory,
ditional fields has been revolutionized by the new tools, Tennessee, and funding for the design and first construc-
and totally new areas of research have been introduced. tion phases has been appropriated. The proton beam en-
The examples given above emphasized this aspect, and it ergy is 1.0 GeV, and there are 2 × 1014 protons in each
may be useful to reiterate the most important advances: 0.7-µs long pulse. The repetition rate is 60 Hz, so that the
proton beam power on the target is 2 MW. The target is
r Microdiffraction: Samples of micron size can be stud- liquid mercury, which is not damaged by radiation, and
ied, even crystals that were considered as small powder because of the high Z the yield of spallation neutrons is
grains; diffraction under very high pressure is possible high. The intensity of a neutron pulse is two orders of mag-
r Anomalous scattering: Experiments can be per- nitude larger than the flux from steady-state reactors, and
formed with wavelengths close to any absorption edge, this high flux is efficiently exploited in TOF experiments.
and this links diffraction to spectroscopic methods On the other hand, there are many experiments where the
r Time-resolved diffraction: A Laue pattern can be much higher average flux of reactors is needed.
recorded with radiation from a single bunch of electrons, The third generation SR sources, such as the ESRF
and evolution of structures under external stimuli can be (Grenoble, France), APS (Chicago), and SPring-8
followed by pump-and-probe experiments (Hyogo, Japan), are approaching the ultimate performance
r Surface structures: The high intensity and collima- of circular accelerators, but there is quest to ever higher
tion of SR allows probing the structures of atomic layers, brilliance and coherence of the source, i.e., to an X-ray
interfaces, and thin films laser. These fourth-generation sources are being studied
r Coherence: The SR beam from an undulator is in several places, and the most advanced projects are those
partially coherent, which makes possible phase-contrast at DESY (Hamburg, Germany) and at SLAC (Stanford,
imaging and studies of spatial and temporal correlations California). In both cases, linear electron accelerators are
r Magnetic diffraction: The weak interaction of the used. A test facility has been built at DESY, and the ex-
electromagnetic field of the photons with the magnetiza- isting linear accelerator will be used at Stanford. It has
tion of solids is enhanced near the absorption edges, and been predicted and experimentally demonstrated that in
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
FIGURE 30 Average and peak flux from undulators planned for the TESLA facility of DESY in Hamburg. The devel-
opment of average and peak brilliance of different X-ray and SR sources since the discovery of X-rays in 1895.
P1: GRB Final pages
Encyclopedia of Physical Science and Technology EN017B-831 August 2, 2001 19:37
a very long undulator the electron and photon beams, SEE ALSO THE FOLLOWING ARTICLES
which travel together, exchange energy. The undulator
wavelength is given by Eq. (42). The electric vector of X-RAY PHOTOELECTRON SPECTROSCOPY • X-RAY
the photon field is parallel to the transverse velocity of SMALL ANGLE SCATTERING
the electron, so that energy is transmitted from the radia-
tion field to the electron or vice versa. Depending on their
precise longitudinal position some electrons are acceler-
ated while others are decelerated, leading to formation of BIBLIOGRAPHY
equidistant microbunches. The length of the microbunch
Als-Nielsen, J., and Möhwald, H. (1999). In “Handbook on Synchrotron
is smaller than the emitted wavelength, so that electrons in
Radiation,” Vol. 4 (Ebashi, S., Koch, M., and Rubenstein, E., eds.),
a microbunch emit coherently as a point charge. The radi- Elsevier, Amsterdam.
ated power is proportional to the square of the number of Bacon, G. E. (1975). “Neutron Diffraction,” Clarendon Press, Oxford.
electrons in the microbunch. Furthermore, the interbunch Bartunik, H. D. (1999). In “Handbook on Synchrotron Radiation,” Else-
distance is related to the wavelength leading to a coherent vier, Amsterdam.
Becker, P. J., and Coppens, P. (1974). Acta Crystallogr. A30, 129–147.
superposition of radiation from all microbunches. The pro-
Brefeld, W., and Gürtler, P. (1999). In “Handbook on Synchrotron Radia-
cess is called self-amplification of spontaneous emission tion,” Vol. 4 (Ebashi, S., Koch, M., and Rubenstein, E., eds.), Elsevier,
(SASE). The necessary condition is a very short electron Amsterdam.
bunch, on the order of 10 µm, and this is achieved by Brunel, M., and de Bergevin, F. (1999). In “Handbook on Synchrotron
special electron guns and pulse compression. Radiation Radiation,” Vol. 3 (Brown, G., and Moncton, D., eds.), Elsevier,
Amsterdam.
from a free electron laser (FEL) is coherent, and the peak
Doniach, S. (1996). J . Synchrotron Rad. 3, 260–267.
brilliance is many orders of magnitude higher than that of European Synchrotron Radiation Facility (1994–1999). “Highlights.”
the third-generation storage rings. On the other hand, the Ewald, P. P., ed. (1962). “Fifty Years of X-Ray Diffraction,” Oosthoek,
duty cycle of a FEL is low, but still very large gains of the Utrecht.
average brilliance are expected, as seen in Fig. 30. Guinier, A. (1963). “X-Ray Diffraction,” Freeman, San Francisco.
Helliwell, J. R. (1992). “Macromolecular Crystallography with Syn-
The high brilliance, time structure, and coherence of the
chrotron Radiation,” Cambridge University Press, Cambridge,
FEL radiation gives prospects for many new types of stud- U.K.
ies, as discussed by Doniach (1996). One interesting possi- Institute Laue-Langevin (1998, 1999). “Annual Reports.”
bility is determination of the structure of individual macro- James, R. W. (1962). “The Optical Principles of the Diffraction of
molecules. Simulations indicate that a diffraction pattern X-Rays,” G. Bell and Sons, London.
Jarlskog, G., Mjörnmark, U., and Sjöstrand, T., eds. (1999). “Workshop
from a single molecule may be recorded using intense
on Future Linear Electron–Positron Colliders,” Lund.
(106 X-ray photons/Å2 ) femtosecond pulses. The structure Kerr, K. Ann (1987). “Crystallography.” In “Encyclopedia of Physical
of the molecule can be reconstructed from a large num- Science and Technology,” Vol. 4, pp. 45–80, Academic Press, San
ber of patterns from randomly oriented molecules using Diego.
Fourier transforms. Another possibility is Fourier holog- Kratky, O., and Laggner, P. (1987). “X-ray small-angle scattering.” In
“Encyclopedia of Physical Science and Technology,” Vol. 14, pp. 693–
raphy, where the reference beam is obtained by diffrac-
742, Academic Press, San Digeo.
tion from a thin film of crystallites. It is evident that Lovesey, S. W., and Collins, S. P. (1996), “X-Ray Scattering and Ab-
the use of femtosecond pulses improves the resolution of sorption by Magnetic Crystals,” Oxford Science Publ., Oxford.
time-resolved protein crystallography by several orders of Robinson, I. K., and Tweet, D. J. (1992). Rep. Prog. Phys. 55, 599–651.
magnitude. Snyder, R. L., Fiala, J., and Bunge, H. J., eds. (1999). “Defect and
Microstructure Analysis by Diffraction,” Oxford University Press,
It may be conjectured that most of the applications of
Oxford.
FEL radiation are still over the horizon. The synchrotron Suortti, P. (1982). Acta Crystallogr. A38, 642–647.
radiation diffraction methods and their fields of applica- Warren, B. E. (1969). “X-Ray Diffraction,” Addison-Wesley, Reading,
tion may be revolutionized in the same way as the tra- MA.
ditional X-ray diffraction was by the use of SR. It is also Wilson, A. J. C., and Prince, E., eds. (1999). “International Tables for
Crystallography,” Vol. C, Kluwer, Dordrecht.
likely that new links to neutron scattering develop, leading
Young, R. A., ed. (1993). “The Rietveld Method,” Oxford University
to increasing synergy of the two methods. The first FEL Press, Oxford.
sources in the X-ray regime should become operative in Zachariasen, W. H. (1945). “Theory of X-Ray Diffraction in Crystals,”
the first decade of this millenium. John Wiley & Sons, New York.