Professional Documents
Culture Documents
many techniques have been proposed, the field has converged to a few basic
CHAPTER 11 methods. These do have their differences, however, and in some cases one
will work where another fails. Thus it is useful to be aware of the dis
tinctions and their implications.
11.1. INEQUALITIES
interest, locating this atom by methods involving the Patterson function, The meaning of Eq. (11.2) can be c1arified by dividing both sides by FÖoo
and using it as an initial phasing model by which to bootstrap to the entire and defining
structure. Methods of direct calculation of phases were known and dis
cussed but were regarded as experimental, constituting a significant
Uhki =F hkd F 000 ( 11.3)
research undertaking in themselves. Furthermore, although a few noncen
trosymmetric structures had been solved by direct phasing,I,2 the methods Equation (11.2) then becomes
had in generl\,1 been restricted to centrosymmetric space groups.
Twenty years later this picture has changed entirely. Direct phasing is
U ~kl :::; ! + .~ U2h.2k,21 (11.4)
now the method of choice, with the result that 80-90% of reported
small-molecule structures are solved in this way, The methods have been The importance of this equation lies in the fact that both the magnitude and
automated to the extent that they are commonly used as "black box" sign (+) of U~kl are known while the phase of U2h.2k.2! remains the only
techniques in which the raw data go in at one end and the essentially solved unknown. Thus
structure appears at the other. As much as any other advance, direct
methods have been responsible for the widespread use of X-ray crystaIlo
U~kl s U2h.2k.21i) (11.5)
graphy as a routine technique for structure determination, attested by the
1985 award of the Nobel peize in chemistry to Jerome Karle and Herbert
lf the magnitudes of UhU and U2h,2k.2l are sufficiently large, they may
Hauptman, two of the pioneers and steady trail breakers in the field.
require the selection of the positive sign for the latter in order to ensure that
Although most direct solutions are performed with one of a few widely
the inequality holds.
distributed programs, we believe that it is important for the user to have
Table 11.1 shows some examples of the use of inequalities as weil as their
some understanding of the principles behind these programs. Although
3D. Harker and J. S. Kasper. Acta Cryst., 1, 70 (1948). These inequalities reftecl the
'I. L, Karle and J. Karle, Acta Cryst., 17, 835 (1964). reslriclion that the electron density musl be everywhere real and nonnegative. See J. Karle and
21. L. Karle, Abstracts 0/ American Crystal/ographic Association Meeting, Austin, Texas, H. Hauptman, Acta Cryst., 3, 181 (1950).
February-March. 1966, p. 18.
...........
{point atom
TADLE 11.1 Examples of Phase Determination an Zjk I'
For a ceB containing only one kind of atom with scattering factor I, this can
be rewritten as and this will serve for most purposes.
N
In practice it is more common to define the unitary structure lactar U
F IL e 2 -rrj("·t) (11.7) such that
or, using Eq, (11.13) and noting that LN Zj = F ooo , atoms are alike, however, it be comes
Fhkl 2
Vhkl =LI" fi (11.16) (see Section 7.3 and references cited there), so the expected value of V is
given by
Thus V is a structure factor that has the same phase as F but whose N
nj = h/Ih I
(11.18) The E's have certain mathematical conveniences for use in probability
calculations, but their particular advantage is that they allow the nor
that is, the fraction of the scattering power represented by the jth atom. malization of all c1asses of reflections to a common basis. In this way it is
Since the scattering factor curves do not all show the same shape, nj is a
function of () if there is more than one kind of atom in the cell. If all the 4J. Karle and H. Hauptman, Acta Cryst., 9, 635 (1956).
-,f
possible to avoid a rather subtle source of error in the comparison of special T ABLE 11.2 Theoretical Values Related to
sets of reflections.
The significant factor in the calculation of E's is that a given V must be Centrosymmctric Noncentrosymmetric
related to the true (V 2 ) for the dass of reflections to which it belongs. For Average lEI' 1.0000 1.0000
general (hkl) reflections, this value will be that given by Eq. (l1.24). As was Average IE'-11 0.968 0.736
pointed out in Chapter 7, however, in some space groups certain dasses of Average lEI 0.798 0.886
reflections have average intensities that are multiples of that of the general lEI> 1 32.0% 36.H%
set. Likewise, the (V 2 ) given by Eq. (l 1.21) represents an average over all IEI>2 5.0% 1.8%
reflections including those systematically absent. If, as is usually the case, IEI>3 0.3% 0.01%
only the nonextinguished reflections of a set are considered, they are
intensified by a factor that depends on the fraction of reflections omitted
(see Sections 7.3 and 8.11).5 Thus combining Eqs. (11.24) and (11.21) into ing IErs that lead to solutions, are often obtained by using local values of
a general form (F 2 ) corresponding to the averages of the observations over small ranges of
(sin 2 (J)/ A2. This method does not give single values for scale and tem
2 V~kl perature factors but does adapt to the variations of the actual structure.
E = (11.25)
hkl e Lj"N' nj2 The distribution of the lEI values is, in principle and often in practice,
independent of the size and content of the unit celL It does depend,
or equivalently, and more commonly used for calculation, however, on the presence or absence of a center of symmetry in the space
group. Table 11.2 gives some useful values describing these distributions. 7
As can be seen, these values provide yet another statistical test for centric
(11.26)
and acentric distributions of intensities (see Section 7.3). Like all tests,
however, they are subject to being disturbed by particular atomic dis
In these expressions, e is an integer that is gene rally 1 but may assurne tributions in the cell and should be treated with caution.
other values for special sets of reflections in certain space groups. Thus in
P2dc, e 2 for the hOL and OkO reflections and 1 for all others. For any
space group, the proper value of e and the affected reflections can be 11.3. STRUCTURE INVARIANTS AND SEMINVARIANTS
obtained from International Tables. 6 Here e S /<p) is given for all
symmetry elements (although in only one orientation; the others can be As we shall see here and in Chapter one of the recurring difficulties in
derived by analogy) and for the classes of observed reflections that they solving the phase problem involves defining the origin of the unit cell
affect. properly with respect to the symmetry elements and all the contents of the
The effect of including e in the calculation of lEI is to reduce the ce11. Shifting the origin arbitrarily does not affect the structure amplitudes
significance of reflections that have large values of IVi but that also belong but may change the phases drastically. Consequently, the selection of
to a class having, for reasons connected with the space group symmetry, an phases is intimately associated with the choice of origin. There exist,
abnormally large (V 2 ). Thus the improbability of these reflections, and the however, certain combinations of phases that do not change regardless of
information carried by them, is not as great as would at first appear. the arbitrary assignment of cell origins (structure invariants), or that do not
In practice, IEI's are often calculated as described, based on the assump change with shifts among alternative origins that have the same symmetry
tion of randomly distributed atoms. If, however, something is known of the characteristics (structure seminvariants).8,9
geometry of the cell contents, this information can be applied to obtain a Among the structure invariants, the most important ones have the form
more specific model of the expected intensity distribution. The result is
generally a better fit to the observed distribution and yields better values for <Pn <Pbl + <Pb2 + . . , + <P1m (11.27)
the scale and temperature factors. Even better results, in terms of estimat
71. L. Karle, K. S. Dragonette, and S. A. Brenner, Acta Cryst., 19,713 (1965).
sH. Hauptman and J. Karle. Solution o{ lhe Phase Problem. I. The Centrosymmelric Cryslal,
5For a more general discussion, see A. J. C. Wilson, ACla CrySI., 17, 1591 (1964). ACA Monograph 3, Polycrystal Book Service, Pittsburgh, PA, 1953.
61ntemational Tables, Vol. 11, pp. 355-356; see also H. Iwosaki and T. Ito, Acta Crysl., A33, 9H. Schenk, in Compulational Cryslallography, D. Sayre, Ed., C1arendon Press, Oxford, pp.
227 (1977).
65-74; H. Hauptman, ibid., pp. 75-91.
,
-~
z mination. As we shall see, in the range of intensities that are too small for
inequalities but still relatively large, it is possible to set up "equations" that
are probably true, and from these to extract phase information. These
methods were first introduced practically for problems in centrosymmetric
space groups and only later extended to noncentrosymmetric groups. For
c1arity we shall follow the same route.
-===T y
Centrosymmetric Methods
The easiest approach for the methods to be described is a 1952 paper by
x Sayre, lQ although mathematically equivalent results had appeared earlier. l !
It can be shown that, subject to certain restrictions,
Figure 11.2. Triplet of vectors forming a
c10sed set in the reciprocal lattice.
F hkl 11 kkl LLL F
h' k' I'
k'kT F k-k',k-k',I-I' (11.31)
where
hl + h2 + ... + hn = 0 = 0,0) (11.28) where 11 hkl is simply a ca1culable scaling term. The implication of Eq.
(11.31) is that any structure factor Fkkl is determined by the products of all
In other words, the sum of the phases of n reflections whose indices sum 10 of the pairs of structure factors whose indices add to give (hk/); Thus F213
zero is a constant for a given structure regardless of the choice of origin in depends on the products of F 322 and F llb F 604 and Ei!), and so on.
the structure. Trivial examples of this are FOCJ{h which always has the phase On the surface, Eq. (11.31) appears rather useless, since to determine
of 0, and Fhkl and whose indices add to (000) and whose phases, in the one Fitis necessary to know the magnitudes and phases of all others. Sayre
absence of dispersion, add to 21f 0 (see Fig. 8.10). The first significant pointed out, however, that for the case where Fhkl is large, the series must
example is the tripie of phases te nd strongly in one direction (+ or and that this direction is generally
(11.29) determined by the agreement in sign among products between large F's.
<1>3 4Jhl + 4JhZ + 4Jhl+hZ Thus for the case of all three reflections large,
Regardless of the magnitudes of the F's, and independent of the choice of (11.32)
S(Fkk/ ) = s(F k'kT} • s(F k-k'.k--k',I-I')
origin, these phases will sum to a constant value. In general it is not possible
to specify this value apriori, but the cases in which the Harker-Kasper or, as it is sometimes given,
inequalities hold are equivalent to demonstrating that <1>3 must be O.
For centrosymmetric space groups, Eq. (11.29) can be rewritten in terms
S(Fkk1 ) • s(F k'k'I') • s(F h-h',k-k' I-I') = + 1 (11.33)
of the product of the signs of the structure factors. Thus,
(11.30) where s means "sign of" and = means His probably equal to." The
S3 = Sbl • Sb2 • SiiI+hZ = ±1 quantities s( ) may be considered as ± 1 and will often be used in the form
s(hk/).12
A convenient visualization of these structure invariants is as a c10sed
Equations (11.32) and (11.33) are the probability equations derived from
polygon of vectors in reciprocal space (Fig. 11.2). This polygon starts and
Eq, (11.27) and serve to favor one of the possible choices of the sign of the
ends at the reciprocal lattice (r .I.) origin and defines a set of reflections
structure invariant [Eq, (11.30)]. Since the phases of U's and E's are the
whose relative phasing remains constant regardless of the choice of origin.
IOD. Sayre, Acta CrySI, , S, 60 (1952), This analysis adds cxplicitly thc requirement that the
clectron density be composed of resolved atoms.
11.4. PROBABILITV METHODS 'J, Karle and H, Hauptman, Acta Cryst., 3,181
12In centrosymmetric space groups, s(hkl) s(hkl) [s(b) s(-b)] so Eq. (11.33) can be
Once structures reach such a size that inequalities are ineffective, it is expressed equally weil as s(h)' s(b') s(b b') or s(h)' s(h') . s(h + h'). In the noncentrosym
necessary to find another approach to the problem of direct phase deter- metric case more care is needed to keep track of ""'V"l~~''--.~'_'_'_'''' ~_~=""'''=-Q~'',,",",,''''"''-.''_~''C:--
-, ~:a
" ::
same as those of the corresponding F's, they also apply to these modified and since
structure factors. In addition, Eq. (11.32) holds in those cases in which
inequalities also apply, that is, the inequalities simply represent the cases in 0'3 N( N)3/2
0'~/2
2
N3 N 2 = N-l/ (11.41)
which the probability becomes certainty. For example,
so that regardless of the sign of Fhkl> FZh ,2k.21 will probably be posItive P ! +! tanh{N-1I2IEhkIEh'k,/,Eh_h',k_k' (11.42)
[ = (+ 1)2 or (- 1)2] if the reftections are sufficiently large. This is the same
result that is given with certainty when the inequality of Eq, (11.4) is Equations (11.40) and (11.42) are safer to use than (11.35) and (11.39)
applicable. For this reason, inequalities are generally ignored in practice, because the peculiarities of the special c1asses of reftections have been
since the same results will appear with very high probabilities from Eq, acounted for in the calculation of the while in Eq, (11,35) they should
(I 1.34). properly be considered by using different values for 0'3 and 0'2 in certain
The question of the exact probabilities of Eq, (11.32) and (11,34) is cases. Since the calculated probabilities are more indications than rigorous
obviously an important one, and many analyses of the problem have evaluations in practical cases, we shall assurne the equal atom case from
appeared. The result that is most commonly used, although it is not now on to keep the equations as simple as possible, In general, however,
absolutely rigorous, is that of Cochran and Woolfson. 13 According to equations involving a tri pie product of U's can be generalized by replacing
N with 0'31 O'~, and those with E 's by replacing N- 1/2 with 0'31 O'~n
P ~+~ tanh{(0'3/0'~)IU hkl V h'k'/'U h-h',k k' (11.35) Table 11.3 shows P for sign relationships involving various values of
IE"I = lCE hk,E"'k'I'E,,-h'.k_k',I_dl lf3 for crystals containing N equal atoms.
where P is the probability that Eq. (I 1.32) will hold, and Probabilities calculated with the equations given hold only if there is no
N
systematic relationship between the indices hkl and h' k'l', In particular, for
the case given by Eq, (11.34) the properformulas are
0'3 L ni (11.36)
0'2 =
N
the nj being those defined by Eq, (l1. If all the atoms of the cell are P+(EZh,Zk,z/)
1 1 {0'3
= 2" + 2" tanh 20'~!2IE2h,2k.2d(E I)} (11
equal, it is easily shown that
where P+ is now the probability that V 2h.2k,2/ has a positive sign. Prob
0'3 N (N)-3 abilities calculated with these equations are always smaller than those that
O'~ N3 N2 N
(11.38)
By a simple conversion, the corresponding formulas for E can be found to IEayl 20 40 60 80 100 120
be 3.0 27.0 1.0 1.0 1.0 1.0 0.99 0.99
2.8 22.0 1.0 1.0 1.0 0.99 0.99 0.98
P ! +! tanh{(0'310'~!2)IEhkIE h'k'!' E h-h',kk',/-I'I} (11.40) 2,6 17.4 1.0 1.0 0.99 0.98 0.97 0.96
2.3 12.2 1.0 0.98 0.96 0.94 0.92 0.90
2.0 8.0 0.97 0.93 0.89 0.86 0.83 0.81
1.8 5.8 0.93 0.86 0.81 0.78 0.76 0.74
1.5 3.4 0.82 0.74 0.71 0.68 0.66 0.65
"w. Cochran and M. M, Woolfson, Acta Cryst., 8, 1 (1955).
f
would be obtained using Eqs. (l 1.35) and (11.40). The implieation is that (11.32) eould be derived for noneentrosymmetrie space groups as weIl.
Eq. (11.34) is less powerful than Eq. (1l.32) as a phase-determining tool These appear as
and should be used with eaution, partieularly if probabilities are not being
ealculated explieitly. q,(h) = q,(b') + q,(h h') (11.49)
It will often oeeur in the later stages of the phasing proeess that there are
or
a number of relationships of the form of Eq. (11.32), all relating to a given
Uhk/' but none having suffieiently high probability to be eompelling. It is q,(h) + q,(h') + q,(h + b') $3 = 0 (11.50)
obvious, however, that the expression 14
where q, is the phase angle expressed in fractions of a cycle. As before, the
= L S(h'k'l')S(h h',k-k',l l') (11.45) probability that Eq. (11.49) holds or that $3 0 increases with the mag
"'kT nitude of the refleetions involved. Because the phases can now assurne
more nearly resembles the full Sayre equltion (Eq. (l 1.31)) than does Eq. general values, the probabilities assurne the form of a distribution that gives
(l1.32) and should have a higher probability of being eorreet than any of the likelihood of various degrees of error. For Eq. (11.49) this distribution is
given b y l6
the individual terms of the summation. Under these eonditions '5
-'-(h)-[ </>(11')+</>(11-", J)
eK(.....·)cos{ 'P
~+~tanh{N1uhk/1 h'kT
(11.51)
P+(Uhk/) L Uh'k,/,Uh'h"k-k',I-I'} (11.46) P{8.q,(b)} = 21Tlo[K(h, h')]
P_ = 1- P+ (I 1.48)
Noncentrosymmetric Methods
Although it was not appreeiated that practieal use eould be made of the
1
information, it was known by 1955 16 • 17 that expressions equivalent to Eq.
Ol 1.>"I:;:=r='
60. 120' 180" Figure 11.3. Probability distributions of q,3 for three
14This expression is frequently called the ~2 (sigma two) relationship after a notation used in R
ilt/J values of K. See text.
Hauptman and J, Karle, Solution 01 the Phase Problem. L The Cenlrosymmetric Crystal, A.C.A.
Monograph No. 3, Polycrystal Book Service, Pittsburgh, 1953. ~, (sigma one) was used in this
IU U-bl_
Uo 2 _ 2
practical methods of phase determination currently in widespread use. A U -Uo-UbU-b-l-U b 0 (11.56)
great many extensions have been considered theoretically and used in b o
experimental cases. Some of these add to our understanding of the underly U- b U Zb
ing theory of direct methods while others appear in particular program
systems or hold promise for the future. Here we shall consider a few of the
Uh 1 U-bl = 1 +2U~U2h- Uih-2U~~0 (11.57)
Uh 1
developments that seem to us to be the most significant
(I+Uzb )(I-Uzh ) 2U~(l UZh)~O
Determinantal Forms (11.58)
One of the earliest 8 ,ll and most important theoretical contributions to direct since (I - U Zh ) must be >0, this yields
methods was the Karle-Hauptman determinant. Time after time it has been
i(1 + UZh)~ U~ ( 11.59)
19S. E. Hull and M. J. Irwin, Acta Cryst., A34, 863 (1978).
Manipulation of the more general third-order KHdet leads to expressions solutions in which all phases are O. Quartets whose values tend to 1T/2 can
that contain products of U h , U h ·, Uh - h" and U- h + h ,. These can be con be used to select enantiomorphs.
nected with the results obtained by Sayre and others. Still higher order polygons of LI. vectors such as quintets and sextets
Extension of the KHdet to higher orders leads to the theory of quartets, have been examined and occasionally used, but their application is not yet
quintets, and higher-order forms (see below), but more significant and widespread. lt would appear likely, however, that a general approach based
specific are those techniques that make use of the capabilities of very large on the KHdet would be more useful than dealing with each individual
determinants. Obviously, as the order increases, the number of possible increase in complexity.
phase permutations becomes immense, and general techniques do not exist
for finding reliably those combinations of phases that meet the requirements
imposed by the determinantal inequality. There are many in any real case, 11.6. PHASE DETERMINATION IN PRACTICE
but they constitute only a srooll fraction of those possible. Tsoucaris 22
demonstrated that the most probable set of phases is the one that maximizes Although the basic theory of direct phase determination is now nearly 40
the KHdet (maximum determinant metlwd) and has proposed solution years old, it has been only in the last decade that its application has
approaches for attacking that problem. Other attempts have used the undergone explosive growth to its present dominance. This is partially due
KHdet and the maximum determinant to extend and refine phases in to the growth in computing power and the realization that brute force is
macromolecular problems. much more effective here than is logical elegance. In part, too, it refleets
the time taken for praetieal understanding to grow from the somewhat
idealized theory.
Quarte1s and Higher Products2 3,24
The practieal objective of direet methods is to phase a sufficient number
Although most of the practical work with direct phasing has involved tripies of reflections to give an identifiable Fourier representation of the moleeule
of reflections whose indices sum to zero, there has been increasing interest being studied. The number required wiII depend on various factors, among
in larger sets of reflections related in the same way. Thus quartets of the them the nature of the moleeular system and how mueh is already known of
form its structure. Roughly, however, 10 refleetions per atom in the asymmetrie
unit seems quite satisfactory, and in some cases as few as three to five per
4>4 = 4>bl + 4>112 + 4>b3 + 4>bl+h2+h3 (11 atom have served.
are also structure invariants and have been found to be of real practical Symbolic Addition 20
utility. In particular, they can be related to the KHdet, and it can be
shown 23 ,24 that for the important case in which the product The method most used for the application of direet methods in their early
EblEII2Eb3Ehl+khh3 is large, the value of $4 may be distributed around 0 years traces back to Zachariasen 26 in a 1952 paper accompanying the
(positive quartets), around 1T/2, or around 1T (negative quartets) depending pioneering one of Sayre. Subsequently it was applied intermittently by
on whether the cross terms E hl +h1 , E h1 +b3, and E h l+b3 are large, medium, or various workers and was finally codified under the name of the symbolic
small. Formulas have been derived for estimating the probable value of $4 addition method. 27
and its uncertainty, but they are still a field of active study. Nevertheless, The method, like so many other crystallographic techniques, is a boot
positive quartets have proved useful for confirming tripie relationships in strapping operation. Here one starts with a very limited number of phases
the early stages of tree building, while negative ones serve as a selection and uses them in conneetion with Eq. (11.32) or Eq. (11.45) to pyramid to a
test among alternative solutions. 25 They are also effective for breaking the number large enough to give a recognizable Fourier representation of the
tendency of certain space groups (e.g., PI and pI) to give falsely consistent strueture. The obvious danger inthis approach is that one is building one's
pyramid the wrong way to; it is balanced on its point, and if one of the
220. Tsoucaris, Acta Cryst., Al6, 492 (1970).
stones in the bottom course is faulty it wilI come down. Fortunately, it turns
23H. Schenk in Crystallographic Computing, 3: Da/a Collection, Structure Determination,
out that the labor invested is not very great and it is possible to improve
Proteins, and Data Basts. O. M. Sheldrick, C. Krüger, and R. Ooddard, Eds., Clarendon Press,
"0. T. DeTitta, J. W. Edmonds, D. A. Langs, and H. Hauptman, Acta Cryst., A31, 367
26W. H. Zachariasen, Acta Cryst .• 5, 68 (1952).
(l975).
271. L. Karle, K. Britts, and P. Oum, Acta Cryst., 17,496 (1964).
266 DIRECT METHODS PHASE DETERMINATION IN PRACTICE 267
. x t Then
N/2
F~kI = 2 L h cos 27r(hx - h/2 + ky + Iz) (11.62)
N/2
F~kl 2 L fj COS[21T(hx + ky + lz) - 7rh] (11.63)
N/2
F~kI = 2 L h[cos 21T(1tx + kv + 7rh)
Figure 11.4. Unit cell of a centrosymmetric + sin 21T(1tx + ky + Iz) sin(-7rh)] (11.64)
structure showing centers of symmetry and
alternative origins. x And, since cos n7r (-1 t, sin n1T = 0 for any integer n,
NI2
one's chances by constructing a number of pyramids and seeing which one F~kl=2 L fi cos 27r(1tx + ky+ Iz)(-I)" (11.65)
stands.
The first problem to be met in practice is that of obtaining some initial F~kl = 1)"F"k, (11.66)
phases to work with. Fortunately, a limited number, usually three but
sometimes more or fewer, can be assigned arbitrary values, subject to Thus the shift of an ongm by al2 results in a change of sign for all
certain restrictions. These arbitrarily assigned phases constitute the initial reftections of odd h but no changes in IFI. Similar derivations could be
set. carried out for each of the centers in Fig. 11.4 with corresponding results; a
The question of what reftections can be assigned phases in various space change by t along any axis produces a change of sign in the reflections with
groups has been studied by many workers. 28 - 30 We shall outline only the the corresponding indices odd. The reflections with even indices are
simplest case, that of the primitive centrosymmetric space groups in the unafIected. If the shift involves translation along two axes, for example, al2
triclinic, monoclinic, and orthorhombic classes. Any such space group can and b /2 to get to origin 5, the result would be
be generalized in the form shown in Fig. 11.4, that is, as a unit cell with
centers of symmetry as shown, all other symmetry being omitted. In dealing F"kI = (-1 )Hk Fhkl (11.67)
with a centrosymmetric space group by this method, it is necessary to pi ace
the origin at a center of symmetry to keep all the phases real, but in these and a sign change would occur only if h or k but not both were odd. Table
cases there are no furt her restrictions on which of the eight centers marked 11.5 gives the complete set of sign changes of an initially positive set of
with numbers is chosen. 3 ) As discussed in Section 11.3, a change of origin reflections for all possible combinations of even/odd indices and origins.
from one center to another will afIect only the phases and not the mag Table 11.5 provides a basis for deciding how arbitrary phases can be
nitudes of any calculated structure factors, and so no one is to be preferred.
The justification of this statement, and the elue as to the efIect of such an TADLE 11.5 Relative Sign Relationsbips for Possible Origins
oligin change on the phases, is gained by considering the general structure
factor expression for a centrosymmetric crystal [Eq. Reflection Kind
N/2 Origin Shift eee oee eoe eeo ooe Deo eoo 000
F"kl = 2 L h cos 27r(1tx + ky + (11.60
1 0 + + + + + + + +
2 al2 + + + +
Suppose that the origin is shifted from point 1 to point 2, that x becomes 3 b/2 + + + +
4 el2 + + + +
28K. Lonsdale and H. J. Grenville-Wells, Acta Cryst., 7, 490 (1954).
5 (a+b)/2 + + + +
29H. Hauptman and J. Karte, Acta Cryst., 12,93 (1959).
6 (a+e)/2 + + + +
30J. Karle, in lntemational Tab/es, Vol. IV, pp. 339-349.
7 (b+e)/2 + + + +
31The others need not be considered since they are all related to some one of the fundamental
(a + b + e)/2
8 + + + +
eight by unit translations.
assigned. Since the phases of eee reflections never change for shifts among F hkl (11.71)
these standard origins (structure seminvariants), it is clear that they cannot
be given values at will. All of the other classes are positive for four origins which implies as weil
and negative far four. As long as both + and signs appear in the table for
a dass of reflections, some member of it can be assigned a phase arhitrarily, F hkl F hÜ (11. 72)
since this merely corresponds to selecting one origin from the two possible
sets. Thus if, for example, an oee reflection such as 742 is defined as being The effect of changing the of k depends, as is generally the case, on
+, this reduces the possible number of origins to four (1, 3, 4, or 7). The the even or odd character of an or sum of indices. Far P2dc this test
phases of all the oee reflections are now fixed, and no furt her choices can is on k + I. Thus,
be made from that set. All of the remaining classes, however, show two -t 's
and two -'s far the four origins that make the oee group +. Thus any k + l even:
reflection of these classes, (or example, 516, an ooe, may be set +. This
narrows the choice to origins 1 and 4, and one more choke can be made to F "kr F "kl F Iikf (11.73)
select one of these. This choke is more restricted than before, however.
F hkr F hiZl F "kr (11.74)
Not only are the oee and ooe groups bar red since they have already been
fixed, but also eoe is forbidden as a choke. Inspection of Table 11.5 shows where the second equalities in Eqs. (11.73) and (11.74) follow from the
that eoe is + for both origins 1 and 4, that is, the phases for this class of application of Eq. (11.71) to the first On the other hand,
reflections have been set by the choices already made and none can be
chosen arbitrarily. The other classes all have a + and a - sign for these k + l odd:
origins, and any of these can be assigned the third phase to finally fix the
origin. It is clear that the choke of the first two phases is easy but that some (11.75)
F hkl - F"iZr F hkf
care must be given to the third in order to avoid choosing one from a dass
that has become invariant. F hkl Fiikl F "kr (11.76)
Within the limits imposed above, any three reflections can be assigned
phases. Selection of these also determines the phase of those other Using these re1ationships or the corresponding ones for other space groups,
reftections that are related to the choices by the symmetry of the reciprocal it is possible to obtain the proper phase for any memher of an index set once
lattice. As was discussed in Section 6.1, the concept of a unique data set that of one memher of the set is known.
arises because of the identity of the intensities from various symmetrically Origin assignment foe noncentrosymmetric crystals is similar to that
related reflections. Thus for monodink crystals, reflections with the same described but is complicated hy the complex phases that occur for general
absolute values of h, k, and I can be divided into two groups [see Eqs. reftections.29.30.33 In many cases there exist special c1asses of reftections
(6.3)-(6.5)], (e,g., hOl in P2 1) for which the allowed phases are restricted to one of two
values, but it is usually helpful to have at least one general reflection in the
IFhk11 IF hkll IF hkll IFhHI (11.68) initial set. One additional starting phase can be assigned for many noncen
trosymmetric cases in order to select the enantiomer. 34 These phases are
IFhkll IFhk11 IFhkrl IFMII ( 11.69) assigned to structure invariants or seminvariants (which may be single
IF hkll f IF Iikd (11.70) reftections that become invariant after the origin has been fixed). In so me
space groups such as P2 12 12 1 there exist reflections for whkh A ~ 0 and
The phase relationships among reftections within one set are determined by whose phases are therefore Tr/2 or 3Tr/2. An appropriate one of these can
the space group and can be ca1culated from the general structure factor be assigned Tr/2 (say) arbitrarily to define the enantiomorph. If no such
expression by expanding it in terms of the equivalent positions. Fortunately, Iimited reftections exist, the problem is more difficult, since it is necessary to
the results have been tabulated for all space groups in International
Tables. 32 Thus for the common case of P2dc we find that far all reflections -"H. Hauplman and J. Karle, Acta Cryst., 9, 45 (1956); J. Karle and H. Hauplman, ACla
CrySI., 14, 217 (1961).
4
3 This is nol frue for fhe enantiomeric space groups such as P3, and P3 2 for whleh the
enantiomer is chosen by the assignment of Ihe space group and its associated phase relation
"Internatwnal Tables, VoJ. I, pp. 374-525. ships.
270 DIRECT METHODS PHASE DETERMINATION IN PRACTICE 271
seleet arefleetion for which the correet phase is near 1T'/2 or 31T'12 (rather The origin was fixed by assigning + phases to the reflections 3 1 17
than 0 or 1T') in order to ehoose ahand. Often the assignment of an (000), 3 4 II (oeo), 5 0 14 (oee). These appeared in two relationships:
enantiomorph-determining reflection is omitted in the early stages of phase
determination and is made only at the end when there is evidence to favor a s(3 1 17)· s(3 4 11) = s(6 5 6); s(6 5 6) = + (I 1.77)
partieular ehoice.
The choice of origin-defining refieetions is normally made by algorithms s(3 1 s(5 () 14) = s(8 I s(8 1 3) = + (11.78)
eneoded in the paekage of phase-determining programs, but it is simple to
check the correetness of the set chosen with a procedure of Hovmöller. 35 The high probabilities of the relationships suggested that these signs might
The preliminary stage 36 of the symbolic addition method consists of be accepted as correct and used as the basis for further sign determinations.
caleulating E 's for the entire data set. For all the E 's greater than some Combination of one of the new refiections with one of the original set
chosen minimum, often about 1 a list is compiled of all the tripies of leads to a new relationship;
reflections belonging to this set for which the indices sum to zero. This is
the most tedious part of the exercise and the hardest to carry out without a s(34 1I)'s(5 3 14)=s(8 I 3) (I 1. 79)
computer. Praetieal programs often also compute probabilities [see Eq.
(11.42)] that SblSb2Sb1+h2 + 1 for a centrosymmetrie problem or the prob Since the space group is P2 d c,
able variance of the $3 [Eq. (11.51)] for a noncentrosymmetric one. These
aid in selecting the probably most reliable relationships for the next step.
s(3 .4 1 l) 4 11) = - (11.80)
The list of strongest (most probable) relationships is used to select those and
reflections that are most often and most reliably interconnected, and appro
priate ones of these are chosen for origin determination. Often the three s(8 I 3) = s(8 I 3) + (11.81)
starting reflections will combine to relate to two new ones and imply their
phases by Eq. (11.45). Perhaps one of these will feed back with the original k + I being odd in the first case and even in the second (see Eqs. (11.73)
set to give yet another. Nevertheless it is generally the case that the process (11.76)]. Consequently Eq. (11.79) may be written
will terminate early by running out of new combinations. The crucial step of
the method then involves bringing in a new, strong refleetion as a variable, . S(5 3 14) = +; s(5314)=- (11.82)
historically represented by a letter. This variable stands for either a general
phase in the noncentrosymmetrie case or a sign in the centrosymmetric No further relationships exist reIating two of the reflections of known
ease. Using this variable, the process is continued until all possible tripies sign with one of the remaining reflections. To continue the proeess, a
have been found, and the eyde is then started again with another variable. variable that represents the unknown + or - phase is assigned as the sign of
This se ries of operations is repeated until a sufficient number of reflections some useful reflection. The phases of other refleetions are then worked out
have been phased, either absolutely or in terms of one or more variables. in terms of the known signs and this variable. Thus, if by definition
In order to outline the phase-determining process, an actual structure
determination will be used as an example. 37 The crystals were those of a s(6 5 a (I 1.83)
natural produet (C n H I6 0 7 ) in space group P2t1c. The indices of reflections the relations hip
with lEI> 3.0 were used to generate the tripies of refleetions in which the
indices of two added to give those of the third. With lEI> 3.0 and with 96 s(2 6 9)'s(6 5 12)=s(8 I 3) (11.84)
nonhydrogen atoms in the eell, the probability that the signs of these
refleetions were related as given in Eq. (11.32) or (11.33) could be eal implies
eulated [Eq. (11.40)] to be greater than 0.99 (see Table 11.3). s(2 6 9) . a =+ (11.85)
"s. Hovmöller, Acta Crysl., A37, 133 (1982). or since
3"For detailed examples of the solution of phase problems by symbolic addition, see I. L. Karle
in Crystallographic Compuring Techniques, F. R. Ahmed, K. Huml, and B. Sedlack, Eds .• a'a +.+=-.-=+ (11.86)
Munksgaard, Copenhagen, 1976, pp. 27-70; M. M. Woolfson, Acta Cryst., A43, 593 (1987);
then
and Luger, Modem X-Ray Analysis on Single Crystals, pp. 253-266.
regardless of the actual sign corresponding to a. Since k + I is odd, a single sign indication as determinative. To make up for this, however,
many unknown signs will be determined by more than one relations hip.
s(2 6 9) = -s(2 6 9) = a (11.88) Thus 2 3 20 (E 2.53) is given by
s(3 1 17)' s(3 6 5) = s(6 5 12) ( 11.89) s(4 8 8)' s(6 5 1 s(2 3 20) (11.98)
s(3 4 11)' s(6 5 12) = s(9 1 I) (11.91) All these imply that
- . a = s(9 1 1); s(9 1 1) a (11.92)
S(2 3 -a (11.
At this point a checking relationship becomes available, all of whose
and the agreement reinforces the strength of the implication [see Eq.
signs are known. Thus,
(11
s(2 6 9)' s(3 6 5) = s(5 0 (1 As the phase-determining process continues, two additional phenomena
occur. First, contradictions will begin to appear. The set of relationships
a· a =+ (11.94) defining a given sign may contain several that indicate it to be ab and one
indicating - ab. Usually, the contradictory relationship is one of relatively
which is correct and lends confidence to the assignments that have been lower probability and c'an be discounted against one of opposite implication.
made. Such disagreements are to be expected, and, in fact, their failure to appear
the extra phases now determined in terms of a, additional rela is a warning that the solution may be headed for the "uranium catastrophe"
tionships can be found. For example. in which alJ the phases are consistent and alJ the electron density is
concentrated at a single massive peak at an origin. J8
s(2 6 9)' s(7 7 10) = s(9 1 1) (11.95) The second phenomenon is the appearance of sets of relationships
suggesting more than one variable combination as appropriate for a given
a . s(7 7 10) = -a; s(7 7 10) = (11.96) reflection. These often serve to suggest relationships between the phases
corresponding to some of the variables, either absolutely or in terms of
so the sign of 7' 7 10 is determined absolutely even though the two other other variables. This serves to reduce the number of unknowns, but usualJy
reflections in the equation are known only in terms of the variable. The list not to zero, and often suggests the most probably successful order of phase
of signs can be extended by repeating the process until a point is reached at assignment to the variables,
which no more relationships involving two known and one unknown signs Symbolic addition in the noncentrosymmetric case is carried out in a
can be found. In this case 11 further signs can be found. Since a number of similar way, but the process is complicated by the need to add phases
equations remain unused, another arbitrary sign can be defined and the modulo 2'1T rat her than multiplying and by the accumulation of errors
process repeated, new signs being found in terms of either or both of the in slow stages even for approximately correct relationships,
variables. When several phase indications are combined by averaging the im
With this set of determined signs as a basis it is now possible to extend plications of individual tripIes, problems arise from the step that occurs
the process to reflections of lower E value. The list of unknowns is between 359° and 1° (2'1T 6 and 0 + 6) or between + '1T (+ 179° and -179°),
extended by adding the set of the next most intense reflections (in terms of If a given average includes contributors from the two sides of a step, the
E), and the relationships are found that connect two reflections of known
phase with a member of this set. In the actual solution of the example being
used, the list was extended in steps. adding first those reflections with 1ßThis effecl occurs when a1l the phases are 0 or 'TI' (+ or signs). lt may not be apparent for
Ihe origin selected, but examination of the signs produced by the changes described in Table
> 2.5, then > 2.0, and finally lEI> 1.5. I 1.4 will show thai for some origin all the signs are the same. The phenomenon is a particular
As the relationships begin to inc\ude reflections of lower lEI values, their problem in PI and PI where triplet sign relationships alone do not provide any means of
probabilities begin to fall from the very high values that allow one to accept breaking oul of an overconsistent set with all the signs the same.
274 OIRECT METHODS PHASE DETERMINATION IN PRACTICE 275
numbers do not average correctly and the apparent distribution of values set of branches to sprout from each existing stern, leading to a very large
around the mean is excessively broad. One solution 20 is to calculate each tree structure of possible phase sets.
average for both of the ranges ± 1T and 0- 21T and select the average that The actual number of trial sets was reduced by the introduction of magie
gives the tighter distribution. Once an extended set of starting phases, often integer methods. 44 ,45 These are sets of integers that serve to link a moderate
100-200, has been obtained, the tangent formula (Eq. (11.53)] can be used number of unknown phases to two or three variables in a way that provides
by cycling repeatedly through the tripies to obtain the most consistent uniform coverage oe approximate alternative phases. Various methods were
phases for the original set as weil as to extend the phasing to reflections with then used to obtain the probably best values for the remaining variables.
lower Evalues. Although widely used as part of program systems, their importance has
probably been lessened by the growth of purely random methods (see
below).
Multiple Solution Methods MULTAN also formalized in CONVERGENCE 46 the pf{lCeSS described
In the early days of direct methods, computers were used to generate the above of working back from the entire set of tripies to the most reIiable and
lists of tri pies, and the subsequent symbolic addition was carried out by productive set of reflections on which to build the tree. This is the weakest
hand. This was fairly easy in the centrosymmetric case, somewhat more of the procedure, since if an early tripie fails to be true, the entire
difficult in the noncentrosymmetric one. Nevertheless, it was obviously process may fail, but it is no more fallible than doing the sAme thing by
desirable to computerize the entire process, and various programs were hand and is certainly more systematically complete.
written to carry out symbolic addition by machine. Computers are not Since each branch of the phase development tree is worked out from the
particularly adept at quasi-algebraic manipulation, however, and the stan initial set and the list of tripies in terms of numeric phases, it is possible to
dard approach to symbolic addition does not make the best use of their apply the tangent formula (Eq. (ll.53)] directly to the process of phase
abilities. They are best at keeping track of numbers, and it was on this basis extension as weil as to subsequent refinement. Unfortunately, the tangent
that WooIfson and his collaborators built their direct methods program formula sometimes fails to refine an initial set of phases properly even if the
39-42 which is extremely widely used either directly or in the phases are correct. 42 In fact, cases are known where an apparently
form of other programs derived from it. 43 In fact, MULT AN has become erroneous starting set will refine to the true solution while a seemingly much
almost a generic term for multisolution programs. doser set will fail. On the other hand, it is the strength of the multisolution
MULTAN in its earliest forms was a mechanization of various elements approach that it provides a large number of opportunities to arrive at the
of the symbolic addition method, with the critical addition that symbolic true solution.
variables were replaced by a list of trial numeric phase possibilities adequate
to hit sufficiently e10se to the true phase of the reflection. For general Random Phase Methods
noncentrosymmetric reflections these were originally 45°, 13 SO, 225°, and
315°, which guarantee that one trial must be within 45° of the correct phase In the course of work with MULT AN it became dear that the most
(at worst; the average error is much less). For centrosymmetric reflections common source of difficulty was the occurrence of failures early in the phase
they are 0° and 180° (or other special values for particular cases). oe course, deduction as a result of tripies whose true value was grossly different from
each of these trials corresponds to a distinct alternative solution and as such that predicted. Some of these faiIures could be avoided
must be kept separate, leading to rat her complex bookkeeping but weil straints to the CONVERGENCE process and
within the power of the computer. Each additional variable causes another developments that accidentally placed the (unknown)
some could not be remedied in this way.
Mlllplest means of avoiding the risks of depending on single triplet
9
3 p. Main, in Cryslallographic Computing, 3: Data Col/eclion, SIrUClure Determination, indications is to do away with the tree development entirely. Various
P,oteins, and Dala Bases, G. M. Sheldrick, C. Krüger, and R. Goddard, Eds., Clarendon
approaches were made to increasing the size of the starting set of tripies so
Press, Oxford, 1985, pp. 206-215.
43For example, MITHRIL, C. J. Gilmore, J. Appl. Crysl., 17, 42 (1984). For a list and
46G. Germain, P. Main, and M. M. Woolfson, ACla Crysl., 826, 274 (1970),
lography, D. Sayre, Ed., C1arendon Press, Oxford, 1982, pp. 126-140. A34. 883 (1978).
276 DIRECT METHODS PHASE DETERMINATION IN PRACTICE 277
logical extreme by Ya048 in his program RANTAN, in which random already known in terms of variables. In this way the correct values of the
phases are assigned to all strong reflections except for those defining the variables are found. 51 -'\3
origin and enantiomorph and the tangent formula is then applied carefully Once a set or sets of phases have been found, the remaining task is to
to refine them to the most consistent possible set. The method does not calculate Fourier syntheses. The majority of reflections phased will have
guarantee to yield a successful result for any single set, but when done large indices and thus correspond to moderate to high values of sin O. For
repeatedly it is possible to obtain and identify good solutions in a relatively this reason their IFI values are small even though the lEI values are large. If
small number of trials (tens, not hundreds, let alone the astronomical the IFl's are used as the coefficients in the Fourier synthesis, many of the
number of possible combinations of phases). This appears at present to be weaker ones tend to be swamped by a few, more intense, low-order
the strongest practical method and has been incorporated into MULT AN reflections. It has been found that this can be avoided by calculating an
and other programs as an option, often the default option. E_map,52.53 that is, a synthesis in which the coefficients are E's rat her than
F's. Since the E's correspond to completely sharpened atoms, the peaks on
the map tend to be very sharp, but nevertheless they can easily be found on
Getting a Model a grid with a spacing of about 0.5 A.
Once the analysis is completed, the result will be a number of possible phase Because of the high degree of sharpening present in an E-map and
sets corresponding either to multiple solutions or to assignment of phases to because the reflections involved are often specially selected by the existence
the variables in a single symbolic list. It is now perfectly feasible to calculate of their interrelations, a considerable number of spurious peaks may be
three-dimensional Fourier syntheses for each of these and to examine the observed. Benzene rings, for example, are often found to have peaks in
maps in turn to see which appear to yield promising molecular systems. The their centers. On the other hand, groups that do not fit into the main pattern
limiting factor is much more one's disinclination to plot and examine a large of the molecular structure, for example, side chains attached to rings, may
number of syntheses than the time required to compute them. In order to not show up at all. Similarly, iiule or no significance can be attached to the
avoid wasting time studying unneccessary syntheses, it is worthwhile to relative heights found for the various peaks. It is for this reason that a
consider the combinations in an order that corresponds at least roughly to knowledge of the probable gross molecular structure can be of help in
their probability of being correct. trying to visualize the way in which atoms are to be assigned to the map.~4
Numerous approaches have been used to try to select the best com If a reasonable set of atomic positions can be found, they can be used for
bination without actually calculating Fourier syntheses. Thus an ap structure factor calculation, and the subsequent development will follow the
proximation to the full Sa yre summation [Eq. (11.31)1 can be calculated for lines described in Chapter 15. If the distribution of peaks looks possible, but
some or all reflections using the phases determined for each case. The a moleeule cannot be made out because of spurious peaks, the development
agreement between the magnitudes (lEI or IFI) predicted by the summation of more phases and the addition of more reflections will, if they are correct,
and those actually observed can be measured by a quantity similar to R,49 enhance the correct peaks and diminish the false ones. If only a portion of
and in principle the set of phases that gives the best agreement should be the moleeule can be recognized, structure factors can be calculated from
correcl. In practice the results are not so clean-cut, but they are certainly these atoms and the phases used as a starting set to feed back into tangent
helpful as a guide. Both very strong and very weak (psi-zero test) reflections formula refinement (Karle recycling).55 Often this process will reveal ad
have been used in these comparisons, but neither has been shown to be ditional atoms and can be repeated to build the entire structure.
obviously superior. Often both are calculated and are combined to give a If no solution can be found, it is necessary to force the system to follow
better test than either alone. 50 some other path of phase determination to avoid errors that may be
A further method that has been used involves the use of other phase appearing too early in the process and that trap the solution into a false
determining relationships to obtain absolute phases for certain reflections result. This can sometimes be done by trying another phasing program, but
it is necessary to be careful since many programs have been adapted from
4"J._X. Yao, Acta Cryst., A37, 642 (1981); A39, 35 (1983). "!. L. Karle, H. Hauplman. J. Karle, and A. B. Wing, Acta Cryst., 11, 257 (1958).
is often cited as the Karle R (R Kart') after J. Karte and I. L. Karte, Acta Cryst., 21, 849
4 9 This 52[,L. Karle, K. Brill~, and S. Brenner, Acta Cryst., 17, 1506 (1964).
(1966), but similar formulations had appeared earlier. See W. Cochran and A. S. Douglas, 'JS. Block and A. Perloff, Acta Cryst., 16, 1233 (1963).
Proc. Roy. Soc. (London), 243A, 281 (1951). 54For a discussion of the ellects of randorn errors in phases on the appearance of E-maps, see
50 Many other tests have been proposed and are used in various programs. For further A. M. Silva and D. Vilerbo, Acta Cryst., A36, 1065 (1980).
discussioll, see G. Cascareno, C. Giacovazzo, and D. Viterbo, Acta Cryst., A43, 22 (1987). "J. Karle, Acta Cryst .. 824, 182 (1968).
-r
.ßlf8~1! 8
others and the internal workings may be very similar. Too, some crystals
prove to be very resistant to direct methods, for reasons that are not weil
understood.
BIBLIOGRAPHY
X-RAV
Dunitz, J., X-Ray Analysis and the Structures 0/ Organic Molecules, Cornell
University Press, Ithaca, NY, 1979, pp. 148-182.
STRUCTURE
Giacovazzo, c., Direct Methods in Crystallography, Academic, London, 1980. DETERMINATION
Karle, J., in Advances in Structure Research, R. SrilI, Ed., lntersciencc, Ncw York,
1964, pp. 55-89.
A PRACTICAL GUIDE
Karle, J., and 1. L. Karle, in Computing Methods in Crystallography, J. S. Rollett,
Ed., Pergamon, Oxford, 1965, pp. 151-165.
Ladd, M. F. c., and R. H. Palmer, Structure Determination by X-Ray Crysta/lo
graphy, Plenum, New York, 1985, pp. 271-287. Second Edition
Lipson, H., and W. Cochran, The Determination 0/ Crystal Structures, Cornell
University Press, Ithaca, NY, 1966, Chapter 9.
Luger, P., Modem X-Ray Analysis on Single Crystals, dc Gruytcr, New York, 1980,
pp. 229-266.
Schenk, H., A. J. C. Wilson, and S. Parthasarathy, Eds., Direct Methods,
Macromolecular Crystallography, and Crystallographic Statistics, World Scientific, GEORGE H. STOUT
Singapore, 1987.
Woolfson, M. M., Direct Methods-From Birth to Maturity, Acta Cryst., A43, LYLE H. JENSEN
593-612 (1987).
..
'\"\ i,~n
r:!'::ll (;te
rtf'
ß I L.! 1'1,.
WILEY
A Wiley-Interscience Publication