International
Tables for Crystallography Volume B Reciprocal space Edited by U. Shmueli © International Union of Crystallography 2006 
International Tables for Crystallography (2006). Vol. B, ch. 2.2, pp. 210234
doi: 10.1107/97809553602060000555 Chapter 2.2. Direct methods ^{a}Dipartimento Geomineralogico, Campus Universitario, I70125 Bari, Italy Direct methods are essentially reciprocalspace techniques, developed historically to solve smallmolecule crystal structures. Their success in this area (in practice, they solve the phase problem) is based on numerous theoretical achievements which concern origin specification (Section 2.2.3), the concepts of structure invariants and seminvariants, normalization of the structure factors (Section 2.2.4), inequalities among structure factors, and probabilistic phase relationships (Section 2.2.5). Probabilistic phase relationships are at the core of direct methods: triplet (via Cochran and via the P10 formula), quartet (according to Hauptman and to Giacovazzo) and quintet invariant phase estimates are described, along with estimates of onephase and of twophase structure seminvariants (via representation theory). Determinantal formulas are also quoted. Techniques for estimating invariants by using some prior information and their realspace counterparts are discussed. The success of direct methods is intimately connected with the phasing procedures. The most important tools of the procedures (e.g. the tangent formula, magic integers, randomstart approaches and figures of merit for recognizing the correct solution) are analysed. The integration of direct methods with macromolecular crystallography is discussed in Section 2.2.10: in particular we refer to the ab initio methods for solving protein structures (in combination with directspace techniques like electrondensity modification, envelope determination, histogram matching etc.) as well as to the combination of direct methods with molecular replacement and SAD–MAD techniques (both in 'onestep' procedures and in the twostep method, the latter requiring the prior determination of the heavyatom or anomalousscatterer substructure). 
Direct methods are today the most widely used tool for solving small crystal structures. They work well both for equalatom molecules and when a few heavy atoms exist in the structure. In recent years the theoretical background of direct methods has been improved to take into account a large variety of prior information (the form of the molecule, its orientation, a partial structure, the presence of pseudosymmetry or of a superstructure, the availability of isomorphous data or of data affected by anomalousdispersion effects, …). Owing to this progress and to the increasing availability of powerful computers, a number of effective, highly automated packages for the practical solution of the phase problem are today available to the scientific community.
The ab initio crystal structure solution of macromolecules seems not to exceed the potential of direct methods. Many efforts will certainly be devoted to this task in the near future: a report of the first achievements is given in Section 2.2.10.
This chapter describes both the traditional direct methods tools and the most recent and revolutionary techniques suitable for macromolecules.
The theoretical background and tables useful for origin specification are given in Section 2.2.3; in Section 2.2.4 the procedures for normalizing structure factors are summarized. Phasedetermining formulae (inequalities, probabilistic formulae for triplet, quartet and quintet invariants, and for one and twophase s.s.'s, determinantal formulae) are given in Section 2.2.5. In Section 2.2.6 the connection between direct methods and related techniques in real space is discussed. Practical procedures for solving crystal structures are described in Sections 2.2.7 and 2.2.8, and references to the most extensively used packages are given in Section 2.2.9. The techniques suitable for the ab initio crystal structure solution of macromolecules are described in Section 2.2.10.2. The integration of direct methods with isomorphousreplacement and anomalousdispersion techniques is briefly described in Sections 2.2.10.3 and 2.2.10.4.
The reader will find full coverage of the most important aspects of direct methods in the recent books by Giacovazzo (1998) and Woolfson & Fan (1995).

The normalized structure factors E (see also Chapter 2.1 ) are calculated according to (Hauptman & Karle, 1953) where is the squared observed structurefactor magnitude on the absolute scale and is the expected value of .
depends on the available a priori information. Often, but not always, this may be considered as a combination of several typical situations. We mention:

When probability theory is not used, the quasinormalized structure factors and the unitary structure factors are often used. and are defined according to Since is the largest possible value for represents the fraction of with respect to its largest possible value. Therefore If atoms are equal, then .
N.s.f.'s cannot be calculated by applying (2.2.4.1) to observed s.f.'s because: (a) the observed magnitudes (already corrected for Lp factor, absorption, …) are on a relative scale; (b) cannot be calculated without having estimated the vibrational motion of the atoms.
This is usually obtained by the well known Wilson plot (Wilson, 1942), according to which observed data are divided into ranges of and averages of intensity are taken in each shell. Reflection multiplicities and other effects of spacegroup symmetry on intensities must be taken into account when such averages are calculated. The shells are symmetrically overlapped in order to reduce statistical fluctuations and are restricted so that the number of reflections in each shell is reasonably large. For each shell should be obtained, where K is the scale factor needed to place Xray intensities on the absolute scale, B is the overall thermal parameter and is the expected value of in which it is assumed that all the atoms are at rest. depends upon the structural information that is available (see Section 2.2.4.1 for some examples).
Equation (2.2.4.3) may be rewritten as which plotted at various should be a straight line of which the slope (2B) and intercept (ln K) on the logarithmic axis can be obtained by applying a linear leastsquares procedure.
Very often molecular geometries produce perceptible departures from linearity in the logarithmic Wilson plot. However, the more extensive the available a priori information on the structure is, the closer, on the average, are the Wilsonplot curves to their leastsquares straight lines.
Accurate estimates of B and K require good strategies (Rogers & Wilson, 1953) for:

Once K and B have been estimated, values can be obtained from experimental data by where is the expected value of for the reflection h on the basis of the available a priori information.
Under some fairly general assumptions (see Chapter 2.1 ) probability distribution functions for the variable for cs. and ncs. structures are (see Fig. 2.2.4.1) and respectively. Corresponding cumulative functions are (see Fig. 2.2.4.2)
Some moments of the distributions (2.2.4.4) and (2.2.4.5) are listed in Table 2.2.4.1. In the absence of other indications for a given crystal structure, a cs. or an ncs. space group will be preferred according to whether the statistical tests yield values closer to column 2 or to column 3 of Table 2.2.4.1.

For further details about the distribution of intensities see Chapter 2.1 .
From the earliest periods of Xray structure analysis several authors (Ott, 1927; Banerjee, 1933; Avrami, 1938) have tried to determine atomic positions directly from diffraction intensities. Significant developments are the derivation of inequalities and the introduction of probabilistic techniques via the use of joint probability distribution methods (Hauptman & Karle, 1953).
An extensive system of inequalities exists for the coefficients of a Fourier series which represents a positive function. This can restrict the allowed values for the phases of the s.f.'s in terms of measured structurefactor magnitudes. Harker & Kasper (1948) derived two types of inequalities:
Type 1. A modulus is bound by a combination of structure factors: where m is the order of the point group and .
Applied to loworder space groups, (2.2.5.1) gives The meaning of each inequality is easily understandable: in , for example, must be positive if is large enough.
Type 2. The modulus of the sum or of the difference of two structure factors is bound by a combination of structure factors: where stands for `real part of'. Equation (2.2.5.2) applied to P1 gives
A variant of (2.2.5.2) valid for cs. space groups is After Harker & Kasper's contributions, several other inequalities were discovered (Gillis, 1948; Goedkoop, 1950; Okaya & Nitta, 1952; de Wolff & Bouman, 1954; Bouman, 1956; Oda et al., 1961). The most general are the Karle–Hauptman inequalities (Karle & Hauptman, 1950): The determinant can be of any order but the leading column (or row) must consist of U's with different indices, although, within the column, symmetryrelated U's may occur. For and , equation (2.2.5.3) reduces to which, for cs. structures, gives the Harker & Kasper inequality For , equation (2.2.5.3) becomes from which where If the moduli , , are large enough, (2.2.5.4) is not satisfied for all values of . In cs. structures the eventual check that one of the two values of does not satisfy (2.2.5.4) brings about the unambiguous identification of the sign of the product .
It was observed (Gillis, 1948) that `there was a number of cases in which both signs satisfied the inequality, one of them by a comfortable margin and the other by only a relatively small margin. In almost all such cases it was the former sign which was the correct one. That suggests that the method may have some power in reserve in the sense that there are still fundamentally stronger inequalities to be discovered'. Today we identify this power in reserve in the use of probability theory.
For any space group (see Section 2.2.3) there are linear combinations of phases with cosines that are, in principle, fixed by the magnitudes alone (s.i.'s) or by the values and the trigonometric form of the structure factor (s.s.'s). This result greatly stimulated the calculation of conditional distribution functions where , is an s.i. or an s.s. and is a suitable set of diffraction magnitudes. The method was first proposed by Hauptman & Karle (1953) and was developed further by several authors (Bertaut, 1955a,b, 1960; Klug, 1958; Naya et al., 1964, 1965; Giacovazzo, 1980a). From a probabilistic point ofview the crystallographic problem is clear: the joint distribution , from which the conditional distributions (2.2.5.5) can be derived, involves a number of normalized structure factors each of which is a linear sum of random variables (the atomic contributions to the structure factors). So, for the probabilistic interpretation of the phase problem, the atomic positions and the reciprocal vectors may be considered as random variables. A further problem is that of identifying, for a given Φ, a suitable set of magnitudes on which Φ primarily depends. The formulation of the nested neighbourhood principle first (Hauptman, 1975) fixed the idea of defining a sequence of sets of reflections each contained in the succeeding one and having the property that any s.i. or s.s. may be estimated via the magnitudes constituting the various neighbourhoods. A subsequent more general theory, the representation method (Giacovazzo, 1977a, 1980b), arranges for any Φ the set of intensities in a sequence of subsets in order of their expected effectiveness (in the statistical sense) for the estimation of Φ.
In the following sections the main formulae estimating loworder invariants and seminvariants or relating phases to other phases and diffraction magnitudes are given.
The basic formula for the estimation of the triplet phase given the parameter is Cochran's (1955) formula where , is the atomic number of the jth atom and is the modified Bessel function of order n. In Fig. 2.2.5.1 the distribution is shown for different values of G.
The conditional probability distribution for , given a set of and , is given (Karle & Hauptman, 1956; Karle & Karle, 1966) by where is the most probable value for . The variance of may be obtained from (2.2.5.7) and is given by which is plotted in Fig. 2.2.5.2.
Equation (2.2.5.9) is the socalled tangent formula. According to (2.2.5.10), the larger is α the more reliable is the relation .
For an equalatom structure .
The basic conditional formula for sign determination of in cs. crystals is Cochran & Woolfson's (1955) formula where is the probability that is positive and k ranges over the set of known values . The larger the absolute value of the argument of tanh, the more reliable is the phase indication.
An auxiliary formula exploiting all the 's in reciprocal space in order to estimate a single Φ is the formula (Hauptman & Karle, 1958; Karle & Hauptman, 1958) given by where C is a constant which differs for cs. and ncs. crystals, is the average value of and p is normally chosen to be some small number. Several modifications of (2.2.5.12) have been proposed (Hauptman, 1964, 1970; Karle, 1970a; Giacovazzo, 1977b).
A recent formula (Cascarano, Giacovazzo, Camalli et al., 1984) exploits information contained within the second representation of Φ, that is to say, within the collection of special quintets (see Section 2.2.5.6): where k is a free vector. The formula retains the same algebraic form as (2.2.5.6), but where , is assumed to be zero if it is experimentally negative. The prime to the summation warns the reader that precautions have to be taken in order to avoid duplications in the contributions.
G may be positive or negative. In particular, if the triplet is estimated negative.
The accuracy with which the value of Φ is estimated strongly depends on . Thus, in practice, only a subset of reciprocal space (the reflections k with large values of ) may be used for estimating Φ.
(2.2.5.13) proved to be quite useful in practice. Positive triplet cosines are ranked in order of reliability by (2.2.5.13) markedly better than by Cochran's parameters. Negative estimated triplet cosines may be excluded from the phasing process and may be used as a figure of merit for finding the correct solution in a multisolution procedure.
A strength of direct methods is that no knowledge of structure is required for their application. However, when some a priori information is available, it should certainly be a weakness of the methods not to make use of this knowledge. The conditional distribution of Φ given and the first three of the five kinds of a priori information described in Section 2.2.4.1 is (Main, 1976; Heinermann, 1977a) where stand for h, , , and for . The quantities have been calculated in Section 2.2.4.1 according to different categories: is a suitable average of the product of three scattering factors for the ith atomic group, p is the number of atomic groups in the cell including those related by symmetry elements. We have the following categories.

In early papers (Hauptman & Karle, 1953; Simerska, 1956) the phase was always expected to be zero. Schenk (1973a,b) [see also Hauptman (1974)] suggested that Φ primarily depends on the seven magnitudes: , called basis magnitudes, and , called cross magnitudes.
The conditional probability of Φ in P1 given seven magnitudes according to Hauptman (1975) is where L is a suitable normalizing constant which can be derived numerically, For equal atoms . Denoting gives Fig. 2.2.5.3 shows the distribution (2.2.5.18) for three typical cases. It is clear from the figure that the cosine estimated near π or in the middle range will be in poorer agreement with the true values than the cosine near 0 because of the relatively larger values of the variance. In principle, however, the formula is able to estimate negative or enantiomorphsensitive quartet cosines from the seven magnitudes.

Distributions (2.2.5.18) (solid curve) and (2.2.5.20) (dashed curve) for the indicated values in three typical cases. 
In the cs. case (2.2.5.18) is replaced (Hauptman & Green, 1976) by where is the probability that the sign of is positive or negative, and The normalized probability may be derived by . More simple probabilistic formulae were derived independently by Giacovazzo (1975, 1976): where and . Q is never allowed to be negative.
According to (2.2.5.20) is expected to be positive or negative according to whether is positive or negative: the larger is C, the more reliable is the phase indication. For , (2.2.5.18) and (2.2.5.20) are practically equivalent in all cases. If N is small, (2.2.5.20) is in good agreement with (2.2.5.18) for quartets strongly defined as positive or negative, but in poor agreement for enantiomorphsensitive quartets (see Fig. 2.2.5.3).
In cs. cases the sign probability for is where G is defined by (2.2.5.21).
All three cross magnitudes are not always in the set of measured reflections. From marginal distributions the following formulae arise (Giacovazzo, 1977c; Heinermann, 1977b):
Equations (2.2.5.20) and (2.2.5.23) are easily modifiable when some cross magnitudes are not in the measurements. If is not measured then (2.2.5.20) or (2.2.5.23) are still valid provided that in G it is assumed that . For example, if and are not in the data then (2.2.5.21) and (2.2.5.22) become In space groups with symmetry higher than more symmetryequivalent quartets can exist of the type where are rotation matrices of the space group. The set is called the first representation of Φ. In this case Φ primarily depends on more than seven magnitudes. For example, let us consider in Pmmm the quartet Quartets symmetry equivalent to Φ and respective cross terms are given in Table 2.2.5.1.

Experimental tests on the application of the representation concept to quartets have recently been made (Busetta et al., 1980). It was shown that quartets with more than three cross magnitudes are more accurately estimated than other quartets. Also, quartets with a cross reflection which is systematically absent were shown to be of significant importance in direct methods. In this context it is noted that systematically absent reflections are not usually included in the set of diffraction data. This custom, not exceptionable when only triplet relations are used, can give rise to a loss of information when quartets are used. In fact the usual programs of direct methods discard quartets as soon as one of the cross reflections is not measured, so that systematic absences are dealt with in the same manner as those reflections which are outside the sphere of measurements.
A quintet phase may be considered as the sum of three suitable triplets or the sum of a triplet and a quartet, i.e. or It depends primarily on 15 magnitudes: the five basis magnitudes and the ten cross magnitudes In the following we will denote Conditional distributions of Φ in P1 and given the 15 magnitudes have been derived by several authors and allow in favourable circumstances in ncs. space groups the quintets having Φ near 0 or near π or near to be identified. Among others, we remember:

For cs. cases (2.2.5.24) reduces to Positive or negative quintets may be identified according to whether G is larger or smaller than zero.
If is not measured then (2.2.5.24) and (2.2.5.25) are still valid provided that in (2.2.5.25) .
If the symmetry is higher than in then more symmetryequivalent quintets can exist of the type where are rotation matrices of the space groups. The set is called the first representation of Φ. In this case Φ primarily depends on more than 15 magnitudes which all have to be taken into account for a careful estimation of Φ (Giacovazzo, 1980a).
A wide use of quintet invariants in direct methods procedures is prevented for two reasons: (a) the large correlation of positive quintet cosines with positive triplets; (b) the large computing time necessary for their estimation [quintets are phase relationships of order , so a large number of quintets have to be estimated in order to pick up a sufficient percentage of reliable ones].
In a crystal structure with N identical atoms the joint probability distribution of n normalized s.f.'s under the following conditions:

Advantages, limitations and applications of determinantal formulae can be found in the literature (Heinermann et al., 1979; de Rango et al., 1975, 1985). Taylor et al. (1978) combined K–H determinants with a magicinteger approach. The computing time, however, was larger than that required by standard computing techniques. The use of K–H matrices has been made faster and more effective by de Gelder et al. (1990) (see also de Gelder, 1992). They developed a phasing procedure (CRUNCH) which uses random phases as starting points for the maximization of the K–H determinants.
According to the representations method (Giacovazzo, 1977a, 1980a,b):

The more general expressions for the s.s.'s of first rank are
In other words:

The set of special quartets (2.2.5.35a) and (2.2.5.35b) constitutes the first representations of Φ.
Structure seminvariants of the second rank can be characterized as follows: suppose that, for a given seminvariant Φ, it is not possible to find a vectorial index h and a rotation matrix such that is a structure invariant. Then Φ is a structure seminvariant of the second rank and a set of structure invariants ψ can certainly be formed, of type by means of suitable indices h and l and rotation matrices and . As an example, for symmetry class 222, or or are s.s.'s of the first rank while is an s.s. of the second rank.
The procedure may easily be generalized to s.s.'s of any order of the first and of the second rank. So far only the role of onephase and twophase s.s.'s of the first rank in direct procedures is well documented (see references quoted in Sections 2.2.5.9 and 2.2.5.10).
Let be our onephase s.s. of the first rank, where In general, more than one rotation matrix and more than one vector h are compatible with (2.2.5.36). The set of special triplets is the first representation of . In cs. space groups the probability that , given and the set , may be estimated (Hauptman & Karle, 1953; Naya et al., 1964; Cochran & Woolfson, 1955) by where In (2.2.5.37), the summation over n goes within the set of matrices for which (2.2.5.35a,b) is compatible, and h varies within the set of vectors which satisfy (2.2.5.36) for each . Equation (2.2.5.36) is actually a generalized way of writing the socalled relationships (Hauptman & Karle, 1953).
If is a phase restricted by symmetry to and in an ncs. space group then (Giacovazzo, 1978) If is a general phase then is distributed according to where with a reliability measured by The second representation of is the set of special quintets provided that h and vary over the vectors and matrices for which (2.2.5.36) is compatible, k over the asymmetric region of the reciprocal space, and over the rotation matrices in the space group. Formulae estimating via the second representation in all the space groups [all the base and cross magnitudes of the quintets (2.2.5.40) now constitute the a priori information] have recently been secured (Giacovazzo, 1978; Cascarano & Giacovazzo, 1983; Cascarano, Giacovazzo, Calabrese et al., 1984). Such formulae contain, besides the contribution of order provided by the first representation, a supplementary (not negligible) contribution of order arising from quintets.
Denoting formulae (2.2.5.37), (2.2.5.38), (2.2.5.39) still hold provided that is replaced by where m is the number of symmetry operators and is the Hermite polynomial of order four.
is assumed to be zero if it is computed negative. The prime to the summation warns the reader that precautions have to be taken in order to avoid duplication in the contributions.
Twophase s.s.'s of the first rank were first evaluated in some cs. space groups by the method of coincidence by Grant et al. (1957); the idea was extended to ncs. space groups by Debaerdemaeker & Woolfson (1972), and in a more general way by Giacovazzo (1977e,f).
The technique was based on the combination of the two triplets which, subtracted from one another, give If all four 's are sufficiently large, an estimate of the twophase seminvariant is available.
Probability distributions valid in according to the neighbourhood principle have been given by Hauptman & Green (1978). Finally, the theory of representations was combined by Giacovazzo (1979a) with the joint probability distribution method in order to estimate twophase s.s.'s in all the space groups.
According to representation theory, the problem is that of evaluating via the special quartets (2.2.5.35a) and (2.2.5.35b). Thus, contributions of order will appear in the probabilistic formulae, which will be functions of the basis and of the cross magnitudes of the quartets (2.2.5.35) . Since more pairs of matrices and can be compatible with (2.2.5.34), and for each pair more pairs of vectors and may satisfy (2.2.5.34), several quartets can in general be exploited for estimating Φ. The simplest case occurs in where the two quartets (2.2.5.35) suggest the calculation of the sixvariate distribution function which leads to the probability formula where is the probability that the product is positive, and It may be seen that in favourable cases .
For the sake of brevity, the probabilistic formulae for the general case are not given and the reader is referred to the original papers.
The statistical treatment suggested by Wilson for scaling observed intensities corresponds, in direct space, to the origin peak of the Patterson function, so it is not surprising that a general correspondence exists between probabilistic formulation in reciprocal space and algebraic properties in direct space.
For a structure containing atoms which are fully resolved from one another, the operation of raising to the nth power retains the condition of resolved atoms but changes the shape of each atom. Let where is an atomic function and is the coordinate of the `centre' of the atom. Then the Fourier transform of the electron density can be written as If the atoms do not overlap and its Fourier transform gives is the scattering factor for the jth peak of :
We now introduce the condition that all atoms are equal, so that and for any j. From (2.2.6.1) and (2.2.6.2) we may write where is a function which corrects for the difference of shape of the atoms with electron distributions and . Since the Fourier transform of both sides gives from which the following relation arises: For , equation (2.2.6.4) reduces to Sayre's (1952) equation [but see also Hughes (1953)] If the structure contains resolved isotropic atoms of two types, P and Q, it is impossible to find a factor such that the relation holds, since this would imply values of such that and simultaneously. However, the following relationship can be stated (Woolfson, 1958): where and are adjustable parameters of . Equation (2.2.6.6) can easily be generalized to the case of structures containing resolved atoms of more than two types (von Eller, 1973).
Besides the algebraic properties of the electron density, Patterson methods also can be developed so that they provide phase indications. For example, it is possible to find the reciprocal counterpart of the function For the function (2.2.6.7) coincides with the usual Patterson function ; for , (2.2.6.7) reduces to the double Patterson function introduced by Sayre (1953). Expansion of as a Fourier series yields Vice versa, the value of a triplet invariant may be considered as the Fourier transform of the double Patterson.
Among the main results relating direct and reciprocalspace properties it may be remembered:

A traditional procedure for phase assignment may be schematically presented as follows:

In very complex structures a large initial set of known phases seems to be a basic requirement for a structure to be determined. This aim can be achieved, for example, by introducing a large number of permutable phases into the initial set. However, the introduction of every new symbol implies a fourfold increase in computing time, which, even in fast computers, quickly leads to computingtime limitations. On the other hand, a relatively large starting set is not in itself enough to ensure a successful structure determination. This is the case, for example, when the triplet invariants used in the initial steps differ significantly from zero. New strategies have therefore been devised to solve more complex structures.

Some references for directmethods packages are given below. Other useful packages using symbolic addition or multisolution procedures do exist but are not well documented.
CRUNCH: Gelder, R. de, de Graaff, R. A. G. & Schenk, H. (1993). Automatic determination of crystal structures using Karle–Hauptman matrices. Acta Cryst. A49, 287–293.
DIRDIF: Beurskens, P. T., Beurskens G., de Gelder, R., GarciaGranda, S., Gould, R. O., Israel, R. & Smits, J. M. M. (1999). The DIRDIF99 program system. Crystallography Laboratory, University of Nijmegen, The Netherlands.
MITHRIL: Gilmore, C. J. (1984). MITHRIL. An integrated directmethods computer program. J. Appl. Cryst. 17, 42–46.
MULTAN88: Main, P., Fiske, S. J., Germain, G., Hull, S. E., Declercq, J.P., Lessinger, L. & Woolfson, M. M. (1999). Crystallographic software: teXsan for Windows. http://www.msc.com/brochures/teXsan/wintex.html.
PATSEE: Egert, E. & Sheldrick, G. M. (1985). Search for a fragment of known geometry by integrated Patterson and direct methods. Acta Cryst. A41, 262–268.
SAPI: Fan, H.F. (1999). Crystallographic software: teXsan for Windows. http://www.msc.com/brochures/teXsan/wintex.html.
SnB: Weeks, C. M. & Miller, R. (1999). The design and implementation of SnB version 2.0. J. Appl. Cryst. 32, 120–124.
SHELX97: Sheldrick, G. M. (2000a). The SHELX home page. http://shelx.uniac.gwdg.de/SHELX/ .
SHELXS: Sheldrick, G. M. (2000b). SHELX. http://www.ucg.ie/cryst/shelx.htm .
SIR97: Altomare, A., Burla, M. C., Camalli, M., Cascarano,G. L., Giacovazzo, C., Guagliardi, A., Moliterni, A. G. G., Polidori, G. & Spagna, R. (1999). SIR97: a new tool for crystal structure determination and refinement. J. Appl. Cryst. 32, 115–119.
XTAL3.6.1: Hall, S. R., du Boulay, D. J. & OlthofHazekamp, R. (1999). Xtal3.6 crystallographic software. http://www.crystal.uwa.edu.au/Crystal/xtal .
Protein structures cannot be solved ab initio by traditional direct methods (i.e., by application of the tangent formula alone). Accordingly, the first applications were focused on two tasks:
The application of standard tangent techniques to (a) and (b) has not been found to be very satisfactory (Coulter & Dewar, 1971; Hendrickson et al., 1973; Weinzierl et al., 1969). Tangent methods, in fact, require atomicity and nonnegativity of the electron density. Both these properties are not satisfied if data do not extend to atomic resolution . Because of series termination and other errors the electrondensity map at presents large negative regions which will appear as false peaks in the squared structure. However, tangent methods use only a part of the information given by the Sayre equation (2.2.6.5). In fact, (2.2.6.5) express two equations relating the radial and angular parts of the two sides, so obtaining a large degree of overdetermination of the phases. To achieve this Sayre (1972) [see also Sayre & Toupin (1975)] suggested minimizing (2.2.10.1) by least squares as a function of the phases: Even if tests on rubredoxin (extensions of phases from 2.5 to 1.5 Å resolution) and insulin (Cutfield et al., 1975) (from 1.9 to 1.5 Å resolution) were successful, the limitations of the method are its high cost and, especially, the higher efficiency of the leastsquares method. Equivalent considerations hold for the application of determinantal methods to proteins [see Podjarny et al. (1981); de Rango et al. (1985) and literature cited therein].
A question now arises: why is the tangent formula unable to solve protein structures? Fan et al. (1991) considered the question from a firstprinciple approach and concluded that:
Sheldrick (1990) suggested that direct methods are not expected to succeed if fewer than half of the reflections in the range 1.1–1.2 Å are observed with (a condition seldom satisfied by protein data).
The most complete analysis of the problem has been made by Giacovazzo, Guagliardi et al. (1994). They observed that the expected value of α (see Section 2.2.7) suggested by the tangent formula for proteins is comparable with the variance of the α parameter. In other words, for proteins the signal determining the phase is comparable with the noise, and therefore the phase indication is expected to be unreliable.
Section 2.2.10.1 suggests that the mere use of the tangent formula or the Sayre equation cannot solve ab initio protein structures of usual size. However, even in an ab initio situation, there is a source of supplementary information which may be used. Good examples are the `peaklist optimization' procedure (Sheldrick & Gould, 1995) and the SIR97 procedure (Altomare et al., 1999) for refining and completing the trial structure offered by the first E map.
In both cases there are reasons to suspect that the correct structure is sometimes extracted from a totally incorrect directmethods solution. These results suggest that a directspace procedure can provide some form of structural information complementary to that used in reciprocal space by the tangent or similar formulae. The combination of real and reciprocalspace techniques could therefore enlarge the size of crystal structures solvable by direct methods. The first program to explicitly propose the combined use of direct and reciprocal space was Shake and Bake (SnB), which inspired a second package, halfbake (HB). A third program, SIR99, uses a different algorithm.
The SnB method (DeTitta et al., 1994; Weeks et al., 1994; Hauptman, 1995) is the heir of the cosine leastsquares method described in Section 2.2.8, point (4). The function where is the triplet phase, and .
is expected to have a global minimum, provided the number of phases involved is sufficiently large, when all the phases are equal to their true values for some choice of origin and enantiomorph. Thus the phasing problem reduces to that of finding the global minimum of (the minimum principle).
SnB comprises a shake step (phase refinement) and a bake step (electrondensity modification), the second step aiming to impose phase constraints implicit in real space. Accordingly, the program requires two Fourier transforms per cycle, and numerous cycles. Thus it may be very time consuming and it is not competitive with other direct methods for the solution of the crystal structures of small molecules. However, it introduced into the field the tremendous usefulness of intensive computations for the direct solution of complex crystal structures.
Owing to Sheldrick (1997), HB does most of its work in direct space. Random atomic positions are generated, to which a modified peaklist optimization process is applied. A number of peaks are eliminated subject to the condition that remains as large as possible (only reflections with are involved, where ). The phases of a suitable subset of reflections are then used as input for a tangent expansion. Then an E map is calculated from which peaks are selected: these are submitted to the elimination procedure.
Typically 5–20 cycles of this internal loop are performed. Then a correlation coefficient (CC) between and is calculated for all the data. If the CC is good (i.e. larger than a given threshold), then a new loop is performed: a new E map is obtained, from which a list of peaks is selected for submission to the elimination procedure. The criterion now is the value of the CC, which is calculated for all the reflections. Typically two to five cycles of this external loop are performed.
The program works indefinitely, restarting from random atoms until interrupted. It may work either by applying the true spacegroup symmetry or after having expanded the data to P1.
The SIR99 procedure (Burla et al., 1999) may be divided into two distinct parts: the tangent section (i.e., a double tangent process using triplet and quartet invariants) is followed by a realspace refinement procedure. As in SIR97, the reciprocalspace part is followed by the realspace refinement, but this time this last part is much more complex. It involves three different techniques: EDM (an electrondensity modification process), the HAFR part (in which all the peaks are associated with the heaviest atomic species) and the DLSQ procedure (a leastsquares Fourier refinement process). The atomicity is gradually introduced into the procedure. The entire process requires, for each trial, several cycles of EDM and HAFR: the realspace part is able to lead to the correct solution even when the tangent formula does not provide favourable phase values.
The modulus of the isomorphous difference may be assumed at a first approximation as an estimate of the heavyatom s.f. . Normalization of 's and application of the tangent formula may reveal the heavyatom structure (Wilson, 1978).
The theoretical basis for integrating the techniques of direct methods and isomorphous replacement was introduced by Hauptman (1982a). According to his notation let us denote by and atomic scattering factors for the atom labelled j in a pair of isomorphous structures, and let and denote corresponding normalized structure factors. Then where The conditional probability of the twophase structure invariant given and is (Hauptman, 1982a) where Threephase structure invariants were evaluated by considering that eight invariants exist for a given triple of indices h, k, l : So, for the estimation of any , the joint probability distribution has to be studied, from which eight conditional probability densities can be obtained: for .
The analytical expressions of are too intricate and are not given here (the reader is referred to the original paper). We only say that may be positive or negative, so that reliable triplet phase estimates near 0 or near π are possible: the larger , the more reliable the phase estimate.
A useful interpretation of the formulae in terms of experimental parameters was suggested by Fortier et al. (1984): according to them, distributions do not depend, as in the case of the traditional threephase invariants, on the total number of atoms per unit cell but rather on the scattering difference between the native protein and the derivative (that is, on the scattering of the heavy atoms in the derivative).
Hauptman's formulae were generalized by Giacovazzo et al. (1988): the new expressions were able to take into account the resolution effects on distribution parameters. The formulae are completely general and include as special cases native protein and heavyatom isomorphous derivatives as well as Xray and neutron diffraction data. Their complicated algebraic forms are easily reduced to a simple expression in the case of a native protein heavyatom derivative: in particular, the reliability parameter for is where indices P and H warn that parameters have to be calculated over protein atoms and over heavy atoms, respectively, and Δ is a pseudonormalized difference (with respect to the heavyatom structure) between moduli of structure factors.
Equation (2.2.10.2) may be compared with Karle's (1983) qualitative rule: if the sign of is plus then the value of is estimated to be zero; if its sign is minus then the expected value of is close to π. In practice Karle's rule agrees with (2.2.10.2) only if the Cochrantype term in (2.2.10.2) may be neglected. Furthermore, (2.2.10.2) shows that large reliability values do not depend on the triple product of structurefactor differences, but on the triple product of pseudonormalized differences. A series of papers (Giacovazzo, Siliqi & Ralph, 1994; Giacovazzo, Siliqi & Spagna, 1994; Giacovazzo, Siliqi & Platas, 1995; Giacovazzo, Siliqi & Zanotti, 1995; Giacovazzo et al., 1996) shows how equation (2.2.10.2) may be implemented in a direct procedure which proved to be able to estimate the protein phases correctly without any preliminary information on the heavyatom substructure.
Combination of direct methods with the twoderivative case is also possible (Fortier et al., 1984) and leads to more accurate estimates of triplet invariants provided experimental data are of sufficient accuracy.
If the frequency of the radiation is close to an absorption edge of an atom, then that atom will scatter the Xrays anomalously (see Chapter 2.4 ) according to . This results in the breakdown of Friedel's law. It was soon realized that the Bijvoet difference could also be used in the determination of phases (Peerdeman & Bijvoet, 1956; Ramachandran & Raman, 1956; Okaya & Pepinsky, 1956). Since then, a great deal of work has been done both from algebraic (see Chapter 2.4 ) and from probabilistic points of view. In this section we are only interested in the second.
We will mention the following different cases:

Probability distributions of diffraction intensities and of selected functions of diffraction intensities for dispersive structures have been given by various authors [Parthasarathy & Srinivasan (1964), see also Srinivasan & Parthasarathy (1976) and relevant literature cited therein]. We describe here some probabilistic formulae for estimating invariants of low order.

Let us now describe some practical aspects of the integration of direct methods with OAS techniques.
Anomalous difference structure factors can be used for locating the positions of the anomalous scatterers (Mukherjee et al., 1989). Tests prove that accuracy in the difference magnitudes is critical for the success of the phasing process.
Suppose now that the positions of the heavy atoms have been found. How do we estimate the phase values for the protein? The phase ambiguity strictly connected with OAS techniques can be overcome by different methods: we quote the Qs method by Hao & Woolfson (1989), the Wilson distribution method and the MPS method by Ralph & Woolfson (1991), and the Bijvoet–Ramachandran–Raman method by Peerdeman & Bijvoet (1956), Raman (1959) and Moncrief & Lipscomb (1966). More recently, a probabilistic method by Fan & Gu (1985) gained additional insight into the problem.
Isomorphous replacement and anomalous scattering are discussed in Chapter 2.4 and in IT F (2001). We observe here only that the SIRAS case can lead algebraically to unambiguous phase determination provided the experimental data are sufficiently good. Thus, any probabilistic treatment must take into consideration errors in the measurements.
In the MIRAS and MAD cases the system is overconditioned: again any probabilistic treatment must consider errors in the measurements, but now overconditioning allows the reduction of the perverse effects of the experimental errors and (in MIRAS) of the lack of isomorphism.
A particular application of extreme relevance concerns the location of anomalous scatterers when selenomethioninesubstituted proteins and MAD data are available (Hendrickson & Ogata, 1997; Smith, 1998). In this case, many selenium sites should be identified and usual Pattersoninterpretation methods can be expected to fail. The successes of SnB and HB prove the essential role of direct methods in this important area.
References
Allegra, G. (1979). Derivation of threephase invariants from the Patterson function. Acta Cryst. A35, 213–220.Altomare, A., Burla, M. C., Camalli, M., Cascarano, G. L., Giacovazzo, C., Guagliardi, A., Moliterni, A. G. G., Polidori, G. & Spagna, R.(1999). SIR97: a new tool for crystal structure determination and refinement. J. Appl. Cryst. 32, 115–119.
Anzenhofer, K. & Hoppe, W. (1962). Phys. Verh. Mosbach. 13, 119.
Ardito, G., Cascarano, G., Giacovazzo, C. & Luić, M. (1985). 1Phase seminvariants and Harker sections. Z. Kristallogr. 172, 25–34.
Argos, P. & Rossmann, M. G. (1980). Molecular replacement method. In Theory and practice of direct methods in crystallography, edited by M. F. C. Ladd & R. A. Palmer, pp. 381–389. New York: Plenum.
Avrami, M. (1938). Direct determination of crystal structure from Xray data. Phys. Rev. 54, 300–303.
Baggio, R., Woolfson, M. M., Declercq, J.P. & Germain, G. (1978). On the application of phase relationships to complex structures. XVI. A random approach to structure determination. Acta Cryst. A34, 883–892.
Banerjee, K. (1933). Determination of the signs of the Fourier terms in complete crystal structure analysis. Proc. R. Soc. London Ser. A, 141, 188–193.
Bertaut, E. F. (1955a). La méthode statistique en cristallographie. I. Acta Cryst. 8, 537–543.
Bertaut, E. F. (1955b). La méthode statistique en cristallographie. II. Quelques applications. Acta Cryst. 8, 544–548.
Bertaut, E. F. (1960). Ordre logarithmique des densités de répartition. I. Acta Cryst. 13, 546–552.
Beurskens, P. T., Beurskens, G., de Gelder, R., GarciaGranda, S., Gould, R. O., Israel, R. & Smits, J. M. M.(1999). The DIRDIF99 program system. Crystallography Laboratory, University of Nijmegen, The Netherlands.
Beurskens, P. T., Gould, R. O., Bruins Slot, H. J. & Bosman, W. P. (1987). Translation functions for the positioning of a well oriented molecular fragment. Z. Kristallogr. 179, 127–159.
Beurskens, P. T., Prick, A. J., Doesburg, H. M. & Gould, R. O. (1979). Statistical properties of normalized differencestructure factors for noncentrosymmetric structures. Acta Cryst. A35, 765–772.
Böhme, R. (1982). Direkte Methoden für Strukturen mit Uberstruktureffekten. Acta Cryst. A38, 318–326.
Bouman, J. (1956). A general theory of inequalities. Acta Cryst. 9, 777–780.
Bricogne, G. (1984). Maximum entropy and the foundation of direct methods. Acta Cryst. A40, 410–415.
Britten, P. L. & Collins, D. M. (1982). Information theory as a basis for the maximum determinant. Acta Cryst. A38, 129–132.
Buerger, M. J. (1959). Vector space and its applications in crystal structure investigation. New York: John Wiley.
Burla, M. C., Camalli, M., Carrozzini, B., Cascarano, G. L., Giacovazzo, C., Polidori, G. & Spagna, R. (1999). SIR99, a program for the automatic solution of small and large crystal structures. Acta Cryst. A55, 991–999.
Burla, M. C., Cascarano, G., Giacovazzo, C., Nunzi, A. & Polidori, G. (1987). A weighting scheme for tangent formula development. III. The weighting scheme of the SIR program. Acta Cryst. A43, 370–374.
Busetta, B. (1976). An example of the use of quartet and triplet structure invariants when enantiomorph discrimination is difficult. Acta Cryst. A32, 139–143.
Busetta, B., Giacovazzo, C., Burla, M. C., Nunzi, A., Polidori, G. & Viterbo, D. (1980). The SIR program. I. Use of negative quartets. Acta Cryst. A36, 68–74.
Camalli, M., Giacovazzo, C. & Spagna, R. (1985). From a partial to the complete crystal structure. II. The procedure and its applications. Acta Cryst. A41, 605–613.
Cascarano, G. & Giacovazzo, C. (1983). Onephase seminvariants of first rank. I. Algebraic considerations. Z. Kristallogr. 165, 169–174.
Cascarano, G. & Giacovazzo, C. (1985). Onewavelength technique: some probabilistic formulas using the anomalous dispersion effect. Acta Cryst. A41, 408–413.
Cascarano, G., Giacovazzo, C., Burla, M. C., Nunzi, A. & Polidori, G. (1984). The distribution of . Acta Cryst. A40, 389–394.
Cascarano, G., Giacovazzo, C., Calabrese, G., Burla, M. C., Nunzi, A., Polidori, G. & Viterbo, D. (1984). Onephase seminvariants of first rank. II. Probabilistic considerations. Z. Kristallogr. 167, 37–47.
Cascarano, G., Giacovazzo, C., Camalli, M., Spagna, R., Burla, M. C., Nunzi, A. & Polidori, G. (1984). The method of representations of structure seminvariants. The strengthening of triplet relationships. Acta Cryst. A40, 278–283.
Cascarano, G., Giacovazzo, C. & Luić, M. (1985a). Noncrystallographic translational symmetry: effects on diffractionintensity statistics. In Structure and statistics in crystallography, edited by A. J. C. Wilson, pp. 67–77. Guilderland, USA: Adenine Press.
Cascarano, G., Giacovazzo, C. & Luić, M. (1985b). Direct methods and superstructures. I. Effects of the pseudotranslation on the reciprocal space. Acta Cryst. A41, 544–551.
Cascarano, G., Giacovazzo, C. & Luić, M. (1987). Direct methods and structures showing superstructure effects. II. A probabilistic theory of triplet invariants. Acta Cryst. A43, 14–22.
Cascarano, G., Giacovazzo, C. & Luić, M. (1988a). Direct methods and structures showing superstructure effects. III. A general mathematical model. Acta Cryst. A44, 176–183.
Cascarano, G., Giacovazzo, C. & Luić, M. (1988b). Direct methods and structures showing superstructure effects. IV. A new approach for phase solution. Acta Cryst. A44, 183–188.
Cascarano, G., Giacovazzo, C., Luić, M., Pifferi, A. & Spagna, R. (1987). 1Phase seminvariants and Harker sections. II. A new procedure. Z. Kristallogr. 179, 113–125.
Cascarano, G., Giacovazzo, C. & Viterbo, D. (1987). Figures of merit in direct methods: a new point of view. Acta Cryst. A43, 22–29.
Castellano, E. E., Podjarny, A. D. & Navaza, J. (1973). A multivariate joint probability distribution of phase determination. Acta Cryst. A29, 609–615.
Cochran, W. (1955). Relations between the phases of structure factors. Acta Cryst. 8, 473–478.
Cochran, W. & Douglas, A. S. (1957). The use of a highspeed digital computer for the direct determination of crystal structure. II. Proc. R. Soc. London Ser. A, 243, 281–288.
Cochran, W. & Woolfson, M. M. (1955). The theory of sign relations between structure factors. Acta Cryst. 8, 1–12.
Coulter, C. L. & Dewar, R. B. K. (1971). Tangent formula applications in protein crystallography: an evaluation. Acta Cryst. B27, 1730–1740.
Crowther, R. A. & Blow, D. M. (1967). A method of positioning of a known molecule in an unknown crystal structure. Acta Cryst. 23, 544–548.
Cutfield, J. F., Dodson, E. J., Dodson, G. G., Hodgkin, D. C., Isaacs, N. W., Sakabe, K. & Sakabe, N. (1975). The high resolution structure of insulin: a comparison of results obtained from leastsquares phase refinement and difference Fourier refinement. Acta Cryst. A31, S21.
Debaerdemaeker, T., Tate, C. & Woolfson, M. M. (1985). On the application of phase relationships to complex structures. XXIV. The Sayre tangent formula. Acta Cryst. A41, 286–290.
Debaerdemaeker, T. & Woolfson, M. M. (1972). On the application of phase relationships to complex structures. IV. The coincidence method applied to general phases. Acta Cryst. A28, 477–481.
Debaerdemaeker, T. & Woolfson, M. M. (1983). On the application of phase relationships to complex structures. XXII. Techniques for random refinement. Acta Cryst. A39, 193–196.
Declercq, J.P., Germain, G., Main, P. & Woolfson, M. M. (1973). On the application of phase relationships to complex structures. V. Finding the solution. Acta Cryst. A29, 231–234.
Declercq, J.P., Germain, G. & Woolfson, M. M. (1975). On the application of phase relationships to complex structures. VIII. Extension of the magicinteger approach. Acta Cryst. A31, 367–372.
De Titta, G. T., Edmonds, J. W., Langs, D. A. & Hauptman, H. (1975). Use of the negative quartet cosine invariants as a phasing figure of merit: NQEST. Acta Cryst. A31, 472–479.
DeTitta, G. T., Weeks, C. M., Thuman, P., Miller, R. & Hauptman, H. A. (1994). Structure solution by minimalfunction phase refinement and Fourier filtering. I. Theoretical basis. Acta Cryst. A50, 203–210.
Egert, E. (1983). Patterson search – an alternative to direct methods. Acta Cryst. A39, 936–940.
Egert, E. & Sheldrick, G. M. (1985). Search for a fragment of known geometry by integrated Patterson and direct methods. Acta Cryst. A41, 262–268.
Eller, G. von (1973). Génération de formules statistiques entre facteurs de structure: la méthode du polynome. Acta Cryst. A29, 63–67.
Fan, H.F. (1999). Crystallographic software: teXsan for Windows. http://www.msc.com/brochures/teXsan/wintex.html.
Fan, H.F. & Gu, Y.X. (1985). Combining direct methods with isomorphous replacement or anomalous scattering data. III. The incorporation of partial structure information. Acta Cryst. A41, 280–284.
Fan, H. F., Hao, Q. & Woolfson, M. M. (1991). Proteins and direct methods. Z. Kristallogr. 197, 197–208.
Fan, H.F., Yao, J.X., Main, P. & Woolfson, M. M. (1983). On the application of phase relationships to complex structures. XXIII. Automatic determination of crystal structures having pseudotranslational symmetry by a modified MULTAN procedure. Acta Cryst. A39, 566–569.
Fortier, S. & Hauptman, H. (1977). Quintets in : probabilistic theory of the fivephase structure invariant in the space group . Acta Cryst. A33, 829–833.
Fortier, S., Weeks, C. M. & Hauptman, H. (1984). On integrating the techniques of direct methods and isomorphous replacement. III. The threephase invariant for the native and twoderivative case. Acta Cryst. A40, 646–651.
Freer, A. A. & Gilmore, C. J. (1980). The use of higher invariants in MULTAN. Acta Cryst. A36, 470–475.
French, S. & Wilson, K. (1978). On the treatment of negative intensity observations. Acta Cryst. A34, 517–525.
Gelder, R. de (1992). Thesis. University of Leiden, The Netherlands.
Gelder, R. de, de Graaff, R. A. G. & Schenk, H. (1990). On the construction of Karle–Hauptman matrices. Acta Cryst. A46, 688–692.
Gelder, R. de, de Graaff, R. A. G. & Schenk, H. (1993). Automatic determination of crystal structures using Karle–Hauptman matrices. Acta Cryst. A49, 287–293.
Germain, G., Main, P. & Woolfson, M. M. (1970). On the application of phase relationships to complex structures. II. Getting a good start. Acta Cryst. B26, 274–285.
Germain, G., Main, P. & Woolfson, M. M. (1971). The application of phase relationships to complex structures. III. The optimum use of phase relationships. Acta Cryst. A27, 368–376.
Giacovazzo, C. (1974). A new scheme for seminvariant tables in all space groups. Acta Cryst. A30, 390–395.
Giacovazzo, C. (1975). A probabilistic theory in of the invariant . Acta Cryst. A31, 252–259.
Giacovazzo, C. (1976). A probabilistic theory of the cosine invariant . Acta Cryst. A32, 91–99.
Giacovazzo, C. (1977a). A general approach to phase relationships: the method of representations. Acta Cryst. A33, 933–944.
Giacovazzo, C. (1977b). Strengthening of the triplet relationships. II. A new probabilistic approach in . Acta Cryst. A33, 527–531.
Giacovazzo, C. (1977c). On different probabilistic approaches to quartet theory. Acta Cryst. A33, 50–54.
Giacovazzo, C. (1977d). Quintets in and related phase relationships: a probabilistic approach. Acta Cryst. A33, 944–948.
Giacovazzo, C. (1977e). A probabilistic theory of the coincidence method. I. Centrosymmetric space groups. Acta Cryst. A33, 531–538.
Giacovazzo, C. (1977f). A probabilistic theory of the coincidence method. II. Noncentrosymmetric space groups. Acta Cryst. A33, 539–547.
Giacovazzo, C. (1978). The estimation of the onephase structure seminvariants of first rank by means of their first and second representation. Acta Cryst. A34, 562–574.
Giacovazzo, C. (1979a). A probabilistic theory of twophase seminvariants of first rank via the method of representations. III. Acta Cryst. A35, 296–305.
Giacovazzo, C. (1979b). A theoretical weighting scheme for tangentformula development and refinement and Fourier synthesis. Acta Cryst. A35, 757–764.
Giacovazzo, C. (1980a). Direct methods in crystallography. London: Academic Press.
Giacovazzo, C. (1980b). The method of representations of structure seminvariants. II. New theoretical and practical aspects. Acta Cryst. A36, 362–372.
Giacovazzo, C. (1980c). Triplet and quartet relations: their use in direct procedures. Acta Cryst. A36, 74–82.
Giacovazzo, C. (1983a). From a partial to the complete crystal structure. Acta Cryst. A39, 685–692.
Giacovazzo, C. (1983b). The estimation of twophase invariants in when anomalous scatterers are present. Acta Cryst. A39, 585–592.
Giacovazzo, C. (1987). One wavelength technique: estimation of centrosymmetrical twophase invariants in dispersive structures. Acta Cryst. A43, 73–75.
Giacovazzo, C. (1988a). New probabilistic formulas for finding the positions of correctly oriented atomic groups. Acta Cryst. A44, 294–300.
Giacovazzo, C. (1988b). Direct phasing in crystallography. New York: IUCr, Oxford University Press.
Giacovazzo, C., Cascarano, G. & Zheng, C.D. (1988). On integrating the techniques of direct methods and isomorphous replacement. A new probabilistic formula for triplet invariants. Acta Cryst. A44, 45–51.
Giacovazzo, C., Guagliardi, A., Ravelli, R. & Siliqi, D. (1994). Ab initio direct phasing of proteins: the limits. Z. Kristallogr. 209, 136–142.
Giacovazzo, C., Siliqi, D. & Platas, J. G. (1995). The ab initio crystal structure solution of proteins by direct methods. V. A new normalizing procedure. Acta Cryst. A51, 811–820.
Giacovazzo, C., Siliqi, D., Platas, J. G., Hecht, H.J., Zanotti, G. & York, B. (1996). The ab initio crystal structure solution of proteins by direct methods. VI. Complete phasing up to derivative resolution. Acta Cryst. D52, 813–825.
Giacovazzo, C., Siliqi, D. & Ralph, A. (1994). The ab initio crystal structure solution of proteins by direct methods. I. Feasibility. Acta Cryst. A50, 503–510.
Giacovazzo, C., Siliqi, D. & Spagna, R. (1994). The ab initio crystal structure solution of proteins by direct methods. II. The procedure and its first applications. Acta Cryst. A50, 609–621.
Giacovazzo, C., Siliqi, D. & Zanotti, G. (1995). The ab initio crystal structure solution of proteins by direct methods. III. The phase extension process. Acta Cryst. A51, 177–188.
Gillis, J. (1948). Structure factor relations and phase determination. Acta Cryst. 1, 76–80.
Gilmore, C. J. (1984). MITHRIL. An integrated directmethods computer program. J. Appl. Cryst. 17, 42–46.
Goedkoop, J. A. (1950). Remarks on the theory of phase limiting inequalities and equalities. Acta Cryst. 3, 374–378.
Gramlich, V. (1984). The influence of rational dependence on the probability distribution of structure factors. Acta Cryst. A40, 610–616.
Grant, D. F., Howells, R. G. & Rogers, D. (1957). A method for the systematic application of sign relations. Acta Cryst. 10, 489–497.
Hall, S. R., du Boulay, D. J. & OlthofHazekamp, R. (1999). Xtal3.6 crystallographic software. http://www.crystal.uwa.edu.au/Crystal/xtal .
Hao, Q. & Woolfson, M. M. (1989). Application of the P_{s}function method to macromolecular structure determination. Acta Cryst. A45, 794–797.
Harker, D. & Kasper, J. S. (1948). Phases of Fourier coefficients directly from crystal diffraction data. Acta Cryst. 1, 70–75.
Hauptman, H. (1964). The role of molecular structure in the direct determination of phase. Acta Cryst. 17, 1421–1433.
Hauptman, H. (1965). The average value of . Z. Kristallogr. 121, 1–8.
Hauptman, H. (1970). Communication at New Orleans Meeting of Am. Crystallogr. Assoc.
Hauptman, H. (1974). On the identity and estimation of those cosine invariants, , which are probably negative. Acta Cryst. A30, 472–476.
Hauptman, H. (1975). A new method in the probabilistic theory of the structure invariants. Acta Cryst. A31, 680–687.
Hauptman, H. (1982a). On integrating the techniques of direct methods and isomorphous replacement. I. The theoretical basis. Acta Cryst. A38, 289–294.
Hauptman, H. (1982b). On integrating the techniques of direct methods with anomalous dispersion. I. The theoretical basis. Acta Cryst. A38, 632–641.
Hauptman, H. (1995). Looking ahead. Acta Cryst. B51, 416–422.
Hauptman, H., Fisher, J., Hancock, H. & Norton, D. A. (1969). Phase determination for the estriol structure. Acta Cryst. B25, 811–814.
Hauptman, H. & Green, E. A. (1976). Conditional probability distributions of the fourphase structure invariant in . Acta Cryst. A32, 45–49.
Hauptman, H. & Green, E. A. (1978). Pairs in : probability distributions which lead to estimates of the twophase structure seminvariants in the vicinity of π/2. Acta Cryst. A34, 224–229.
Hauptman, H. & Karle, J. (1953). Solution of the phase problem. I. The centrosymmetric crystal. Am. Crystallogr. Assoc. Monograph No. 3. Dayton, Ohio: Polycrystal Book Service.
Hauptman, H. & Karle, J. (1956). Structure invariants and seminvariants for noncentrosymmetric space groups. Acta Cryst. 9, 45–55.
Hauptman, H. & Karle, J. (1958). Phase determination from new joint probability distributions: space group . Acta Cryst. 11, 149–157.
Hauptman, H. & Karle, J. (1959). Table 2. Equivalence classes, seminvariant vectors and seminvariant moduli for the centered centrosymmetric space groups, referred to a primitive unit cell. Acta Cryst. 12, 93–97.
Heinermann, J. J. L. (1977a). The use of structural information in the phase probability of a triple product. Acta Cryst. A33, 100–106.
Heinermann, J. J. L. (1977b). Thesis. University of Utrecht.
Heinermann, J. J. L., Krabbendam, H. & Kroon, J. (1979). The joint probability distribution of the structure factors in a Karle–Hauptman matrix. Acta Cryst. A35, 101–105.
Heinermann, J. J. L., Krabbendam, H., Kroon, J. & Spek, A. L. (1978). Direct phase determination of triple products from Bijvoet inequalities. II. A probabilistic approach. Acta Cryst. A34, 447–450.
Hendrickson, W. A., Love, W. E. & Karle, J. (1973). Crystal structure analysis of sea lamprey hemoglobin at 2 Å resolution. J. Mol. Biol. 74, 331–361.
Hendrickson, W. A. & Ogata, C. M. (1997). Phase determination from multiwavelength anomalous diffraction measurements. Methods Enzymol. 276, 494–523.
Hoppe, W. (1963). Phase determination and zero points in the Patterson function. Acta Cryst. 16, 1056–1057.
Hughes, E. W. (1953). The signs of products of structure factors. Acta Cryst. 6, 871.
Hull, S. E. & Irwin, M. J. (1978). On the application of phase relationships to complex structures. XIV. The additional use of statistical information in tangentformula refinement. Acta Cryst. A34, 863–870.
Hull, S. E., Viterbo, D., Woolfson, M. M. & ShaoHui, Z. (1981). On the application of phase relationships to complex structures. XIX. Magicinteger representation of a large set of phases: the MAGEX procedure. Acta Cryst. A37, 566–572.
International Tables for Crystallography (2001). Vol. F. Macromolecular crystallography, edited by M. G. Rossmann & E. Arnold. Dordrecht: Kluwer Academic Publishers.
Jaynes, E. T. (1957). Information theory and statistical mechanics. Phys. Rev. 106, 620–630.
Karle, J. (1970a). An alternative form for , a phase determining formula. Acta Cryst. B26, 1614–1617.
Karle, J. (1970b). Partial structures and use of the tangent formula and translation functions. In Crystallographic computing, pp. 155–164. Copenhagen: Munksgaard.
Karle, J. (1972). Translation functions and direct methods. Acta Cryst. B28, 820–824.
Karle, J. (1979). Triple phase invariants: formula for centric case from fourthorder determinantal joint probability distributions. Proc. Natl Acad. Sci. USA, 76, 2089–2093.
Karle, J. (1980). Triplet phase invariants: formula for acentric case from fourthorder determinantal joint probability distributions. Proc. Natl Acad. Sci. USA, 77, 5–9.
Karle, J. (1983). A simple rule for finding and distinguishing triplet phase invariants with values near 0 or π with isomorphous replacement data. Acta Cryst. A39, 800–805.
Karle, J. (1984). Rules for evaluating triplet phase invariants by use of anomalous dispersion data. Acta Cryst. A40, 4–11.
Karle, J. (1985). Many algebraic formulas for the evaluation of triplet phase invariants from isomorphous replacement and anomalous dispersion data. Acta Cryst. A41, 182–189.
Karle, J. & Hauptman, H. (1950). The phases and magnitudes of the structure factors. Acta Cryst. 3, 181–187.
Karle, J. & Hauptman, H. (1956). A theory of phase determination for the four types of noncentrosymmetric space groups . Acta Cryst. 9, 635–651.
Karle, J. & Hauptman, H. (1958). Phase determination from new joint probability distributions: space group . Acta Cryst. 11, 264–269.
Karle, J. & Hauptman, H. (1961). Seminvariants for noncentrosymmetric space groups with conventional centered cells. Acta Cryst. 14, 217–223.
Karle, J. & Karle, I. L. (1966). The symbolic addition procedure for phase determination for centrosymmetric and noncentrosymmetric crystals. Acta Cryst. 21, 849–859.
Klug, A. (1958). Joint probability distributions of structure factors and the phase problem. Acta Cryst. 11, 515–543.
Koch, M. H. J. (1974). On the application of phase relationships to complex structures. IV. Automatic interpretation of electrondensity maps for organic structures. Acta Cryst. A30, 67–70.
Krabbendam, H. & Kroon, J. (1971). A relation between structure factor, triple products and a single Patterson vector, and its application to sign determination. Acta Cryst. A27, 362–367.
Kroon, J., Spek, A. L. & Krabbendam, H. (1977). Direct phase determination of triple products from Bijvoet inequalities. Acta Cryst. A33, 382–385.
Lajzérowicz, J. & Lajzérowicz, J. (1966). Loi de distribution des facteurs de structure pour un répartition non uniforme des atomes. Acta Cryst. 21, 8–12.
Langs, D. A. (1985). Translation functions: the elimination of structuredependent spurious maxima. Acta Cryst. A41, 305–308.
Lessinger, L. & Wondratschek, H. (1975). Seminvariants for space groups and . Acta Cryst. A31, 521.
Mackay, A. L. (1953). A statistical treatment of superlattice reflexions. Acta Cryst. 6, 214–215.
Main, P. (1976). Recent developments in the MULTAN system. The use of molecular structure. In Crystallographic computing techniques, edited by F. R. Ahmed, pp. 97–105. Copenhagen: Munksgaard.
Main, P. (1977). On the application of phase relationships to complex structures. XI. A theory of magic integers. Acta Cryst. A33, 750–757.
Main, P., Fiske, S. J., Germain, G., Hull, S. E., Declercq, J.P., Lessinger, L. & Woolfson, M. M. (1999). Crystallographic software: teXsan for Windows. http://www.msc.com/brochures/teXsan/wintex.html.
Main, P. & Hull, S. E. (1978). The recognition of molecular fragments in E maps and electron density maps. Acta Cryst. A34, 353–361.
Moncrief, J. W. & Lipscomb, W. N. (1966). Structure of leurocristine methiodide dihydrate by anomalous scattering methods; relation to leurocristine (vincristine) and vincaleukoblastine (vinblastine). Acta Cryst. A21, 322–331.
Mukherjee, A. K., Helliwell, J. R. & Main, P. (1989). The use of MULTAN to locate the positions of anomalous scatterers. Acta Cryst. A45, 715–718.
Narayan, R. & Nityananda, R. (1982). The maximum determinant method and the maximum entropy method. Acta Cryst. A38, 122–128.
Navaza, J. (1985). On the maximumentropy estimate of the electron density function. Acta Cryst. A41, 232–244.
Navaza, J., Castellano, E. E. & Tsoucaris, G. (1983). Constrained density modifications by variational techniques. Acta Cryst. A39, 622–631.
Naya, S., Nitta, I. & Oda, T. (1964). A study on the statistical method for determination of signs of structure factors. Acta Cryst. 17, 421–433.
Naya, S., Nitta, I. & Oda, T. (1965). Affinement tridimensional du sulfanilamide β. Acta Cryst. 19, 734–747.
Nordman, C. E. (1985). Introduction to Patterson search methods. In Crystallographic computing 3. Data collection, structure determination, proteins and databases, edited by G. M. Sheldrick, G. Kruger & R. Goddard, pp. 232–244. Oxford: Clarendon Press.
Oda, T., Naya, S. & Taguchi, I. (1961). Matrix theoretical derivation of inequalities. II. Acta Cryst. 14, 456–458.
Okaya, J. & Nitta, I. (1952). Linear structure factor inequalities and the application to the structure determination of tetragonal ethylenediamine sulphate. Acta Cryst. 5, 564–570.
Okaya, Y. & Pepinsky, R. (1956). New formulation and solution of the phase problem in Xray analysis of noncentric crystals containing anomalous scatterers. Phys. Rev. 103, 1645–1647.
Ott, H. (1927). Zur Methodik der Struckturanalyse. Z. Kristallogr. 66, 136–153.
Parthasarathy, S. & Srinivasan, R. (1964). The probability distribution of Bijvoet differences. Acta Cryst. 17, 1400–1407.
Peerdeman, A. F. & Bijvoet, J. M. (1956). The indexing of reflexions in investigations involving the use of the anomalous scattering effect. Acta Cryst. 9, 1012–1015.
Piro, O. E. (1983). Information theory and the phase problem in crystallography. Acta Cryst. A39, 61–68.
Podjarny, A. D., Schevitz, R. W. & Sigler, P. B. (1981). Phasing lowresolution macromolecular structure factors by matricial direct methods. Acta Cryst. A37, 662–668.
Podjarny, A. D., Yonath, A. & Traub, W. (1976). Application of multivariate distribution theory to phase extension for a crystalline protein. Acta Cryst. A32, 281–292.
Rae, A. D. (1977). The use of structure factors to find the origin of an oriented molecular fragment. Acta Cryst. A33, 423–425.
Ralph, A. C. & Woolfson, M. M. (1991). On the application of onewavelength anomalous scattering. III. The Wilsondistribution and MPS methods. Acta Cryst. A47, 533–537.
Ramachandran, G. N. & Raman, S. (1956). A new method for the structure analysis of noncentrosymmetric crystals. Curr. Sci. (India), 25, 348.
Raman, S. (1959). Syntheses for the deconvolution of the Patterson function. Part II. Detailed theory for noncentrosymmetric crystals. Acta Cryst. 12, 964–975.
Rango, C. de (1969). Thesis. Paris.
Rango, C. de, Mauguen, Y. & Tsoucaris, G. (1975). Use of highorder probability laws in phase refinement and extension of protein structures. Acta Cryst. A31, 227–233.
Rango, C. de, Mauguen, Y., Tsoucaris, G., Dodson, E. J., Dodson, G. G. & Taylor, D. J. (1985). The extension and refinement of the 1.9 Å spacing isomorphous phases to 1.5 Å spacing in 2Zn insulin by determinantal methods. Acta Cryst. A41, 3–17.
Rango, C. de, Tsoucaris, G. & Zelwer, C. (1974). Phase determination from the Karle–Hauptman determinant. II. Connexion between inequalities and probabilities. Acta Cryst. A30, 342–353.
Rogers, D., Stanley, E. & Wilson, A. J. C. (1955). The probability distribution of intensities. VI. The influence of intensity errors on the statistical tests. Acta Cryst. 8, 383–393.
Rogers, D. & Wilson, A. J. C. (1953). The probability distribution of Xray intensities. V. A note on some hypersymmetric distributions. Acta Cryst. 6, 439–449.
Rossmann, M. G., Blow, D. M., Harding, M. M. & Coller, E. (1964). The relative positions of independent molecules within the same asymmetric unit. Acta Cryst. 17, 338–342.
Sayre, D. (1952). The squaring method: a new method for phase determination. Acta Cryst. 5, 60–65.
Sayre, D. (1953). Double Patterson function. Acta Cryst. 6, 430–431.
Sayre, D. (1972). On leastsquares refinement of the phases of crystallographic structure factors. Acta Cryst. A28, 210–212.
Sayre, D. & Toupin, R. (1975). Major increase in speed of leastsquares phase refinement. Acta Cryst. A31, S20.
Schenk, H. (1973a). Direct structure determination in and other noncentrosymmetric symmorphic space groups. Acta Cryst. A29, 480–481.
Schenk, H. (1973b). The use of phase relationships between quartets of reflexions. Acta Cryst. A29, 77–82.
Sheldrick, G. M. (1990). Phase annealing in SHELX90: direct methods for larger structures. Acta Cryst. A46, 467–473.
Sheldrick, G. M. (1997). In Direct methods for solving macromolecular structures. NATO Advanced Study Institute, Erice, Italy.
Sheldrick, G. M. (2000a). The SHELX home page. http://shelx.uniac.gwdg.de/SHELX/ .
Sheldrick, G. M. (2000b). SHELX. http://www.ucg.ie/cryst/shelx.htm .
Sheldrick, G. M. & Gould, R. O. (1995). Structure solution by iterative peaklist optimization and tangent expansion in space group . Acta Cryst. B51, 423–431.
Sim, G. A. (1959). The distribution of phase angles for structures containing heavy atoms. II. A modification of the normal heavyatoms method for noncentrosymmetrical structures. Acta Cryst. 12, 813–815.
Simerska, M. (1956). Czech. J. Phys. 6, 1.
Simonov, V. I. & Weissberg, A. M. (1970). Calculation of the signs of structure amplitudes by a binary function section of interatomic vectors. Sov. Phys. Dokl. 15, 321–323. [Translated from Dokl. Akad. Nauk SSSR, 191, 1050–1052.]
Sint, L. & Schenk, H. (1975). Phase extension and refinement in noncentrosymmetric structures containing large molecules. Acta Cryst. A31, S22.
Smith, J. L. (1998). Multiwavelength anomalous diffraction in macromolecular crystallography. In Direct methods for solving macromolecular structures, edited by S. Fortier, pp. 221–225. Dordrecht: Kluwer Academic Publishers.
Srinivasan, R. & Parthasarathy, S. (1976). Some statistical applications in Xray crystallography. Oxford: Pergamon Press.
Taylor, D. J., Woolfson, M. M. & Main, P. (1978). On the application of phase relationships to complex structures. XV. Magic determinants. Acta Cryst. A34, 870–883.
Tsoucaris, G. (1970). A new method for phase determination. The maximum determinant rule. Acta Cryst. A26, 492–499.
Van der Putten, N. & Schenk, H. (1977). On the conditional probability of quintets. Acta Cryst. A33, 856–858.
Vaughan, P. A. (1958). A phasedetermining procedure related to the vectorcoincidence method. Acta Cryst. 11, 111–115.
Vermin, W. J. & de Graaff, R. A. G. (1978). The use of Karle–Hauptman determinants in smallstructure determinations. Acta Cryst. A34, 892–894.
Vicković, I. & Viterbo, D. (1979). A simple statistical treatment of unobserved reflexions. Application to two organic substances. Acta Cryst. A35, 500–501.
Weeks, C. M., DeTitta, G. T., Hauptman, H. A., Thuman, P. & Miller, R. (1994). Structure solution by minimalfunction phase refinement and Fourier filtering. II. Implementation and applications. Acta Cryst. A50, 210–220.
Weeks, C. M. & Miller, R. (1999). The design and implementation of SnB version 2.0. J. Appl. Cryst. 32, 120–124.
Weinzierl, J. E., Eisenberg, D. & Dickerson, R. E. (1969). Refinement of protein phases with the Karle–Hauptman tangent fomula. Acta Cryst. B25, 380–387.
White, P. & Woolfson, M. M. (1975). The application of phase relationships to complex structures. VII. Magic integers. Acta Cryst. A31, 53–56.
Wilkins, S. W., Varghese, J. N. & Lehmann, M. S. (1983). Statistical geometry. I. A selfconsistent approach to the crystallographic inversion problem based on information theory. Acta Cryst. A39, 47–60.
Wilson, A. J. C. (1942). Determination of absolute from relative Xray intensity data. Nature (London), 150, 151–152.
Wilson, K. S. (1978). The application of MULTAN to the analysis of isomorphous derivatives in protein crystallography. Acta Cryst. B34, 1599–1608.
Wolff, P. M. de & Bouman, J. (1954). A fundamental set of structure factor inequalities. Acta Cryst. 7, 328–333.
Woolfson, M. M. (1958). Crystal and molecular structure of p,p′dimethoxybenzophenone by the direct probability method. Acta Cryst. 11, 277–283.
Woolfson, M. M. (1977). On the application of phase relationships to complex structures. X. MAGLIN – a successor to MULTAN. Acta Cryst. A33, 219–225.
Woolfson, M. & Fan, H.F. (1995). Physical and nonphysical methods of solving crystal structures. Cambridge University Press.
Yao, J.X. (1981). On the application of phase relationships to complex structures. XVIII. RANTAN – random MULTAN. Acta Cryst. A37, 642–664.