International
Tables for Crystallography Volume B Reciprocal space Edited by U. Shmueli © International Union of Crystallography 2006 
International Tables for Crystallography (2006). Vol. B, ch. 2.3, pp. 260262
Section 2.3.8. Molecular replacement ^{a}Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA, and ^{b}CABM & Rutgers University, 679 Hoes Lane, Piscataway, New Jersey 088545638, USA 
The most straightforward application of the molecularreplacement method occurs when the orientation and position of a known molecular fragment in an unknown cell have been previously determined. The simple procedure is to apply the rotation and translation operations to the known fragment. This will place it into one `standard' asymmetric unit of the unknown cell. Then the crystal operators (assuming no further noncrystallographic operators are present in the unknown cell) are applied to generate the complete unit cell of the unknown structure. Structure factors can then be calculated from the rotated and translated known molecule into the unknown cell. The resultant model can be refined in numerous ways.
More generally, consider a molecule placed in any crystal cell (h), within which coordinate positions shall be designated by x. Let the corresponding structure factors be . It is then possible to compute the structure factors for another cell (p) into which the same molecule has been placed N times related by the crystallographic symmetry operators . Let the electron density at a point in the first crystallographic asymmetric unit be spatially related to the point in the nth asymmetric unit of the p crystal such that where From the definition of a structure factor, where the integral is taken over the volume U of one molecule. But since each molecule is identical as expressed in equation (2.3.8.1) and since (2.3.8.2) can be substituted in equation (2.3.8.3), we have Now let the molecule in the h crystal be related to the molecule in the first asymmetric unit of the p crystal by the noncrystallographic symmetry operation which implies Furthermore, in the h cell and thus, by combining with (2.3.8.5), (2.3.8.6) and (2.3.8.7), Now using (2.3.8.4) and (2.3.8.8) it can be shown that where S is a chosen molecular origin in the h crystal and is the corresponding molecular position in the nth asymmetric unit of the p crystal.
The use of noncrystallographic symmetry for phase determination was proposed by Rossmann & Blow (1962, 1963) and subsequently explored by Crowther (1967, 1969) and Main & Rossmann (1966). These methods were developed in reciprocal space and were primarily concerned with ab initio phase determination. Realspace averaging of electron density between noncrystallographically related molecules was used in the structure determination of deoxyhaemoglobin (Muirhead et al., 1967) and of αchymotrypsin (Matthews et al., 1967). The improvement derived from the averaging between the two noncrystallographic units was, however, not clear in either case. The first obviously successful application was in the structure determination of lobster glyceraldehyde3phosphate dehydrogenase (Buehner et al., 1974; Argos et al., 1975), where the tetrameric molecule of symmetry 222 occupied one crystallographic asymmetric unit. The improvement in the essentially SIR electrondensity map was considerable and the results changed from uninterpretable to interpretable. The uniqueness and validity of the solution lay in the obvious chemical correctness of the polypeptide fold and its agreement with known aminoacidsequence data. In contrast to the earlier reciprocalspace methods, noncrystallographic symmetry was used as a method to improve poor phases rather than to determine phases ab initio.
Many other applications followed rapidly, aided greatly by the versatile techniques developed by Bricogne (1976). Of particular interest is the application to the structure determination of hexokinase (Fletterick & Steitz, 1976), where the averaging occurred both between different crystal forms and within the same crystal.
The most widely used procedure for realspace averaging is the `double sorting' technique developed by Bricogne (1976) and also by Johnson (1978). An alternative method is to maintain the complete map stored in the computer (Nordman, 1980b). This avoids the sorting operation, but is only possible given a very large computer or a lowresolution map containing relatively few grid points.
Bricogne's double sorting technique involves generating realspace nonintegral points which are related to integral grid points in the cell asymmetric unit by the noncrystallographic symmetry operators. The elements of the set are then brought back to their equivalent points in the cell asymmetric unit and sorted by their proximity to two adjacent realspace sections. The set , calculated on a finer grid than and stored in the computer memory two sections at a time, is then used for linear interpolation to determine the density values at which are successively stored and summed in the related array . A count is kept of the number of densities received at each , resulting in a final averaged aggregate, when all realspace sections have been utilized. The density to be assigned outside the molecular envelope (defined with respect to the set ) is determined by averaging the density of all unused points in . The grid interval for the set should be about onesixth of the resolution to avoid serious errors from interpolation (Bricogne, 1976). The grid point separation in the set need only be sufficient for representation of electron density, or about onethird of the resolution.
Molecular replacement in real space consists of the following steps (Table 2.3.8.1): (a) calculation of electron density based on a starting phase set and observed amplitudes; (b) averaging of this density among the noncrystallographic asymmetric units or molecular copies in several crystal forms, a process which defines a molecular envelope as the averaging is only valid within the range of the noncrystallographic symmetry; (c) reconstructing the unit cell based on averaged density in every noncrystallographic asymmetric unit; (d) calculating structure factors from the reconstructed cell; (e) combining the new phases with others to obtain a weighted bestphase set; and (f) returning to step (a) at the previous or an extended resolution. Decisions made in steps (b) and (e) determine the rate of convergence (see Table 2.3.8.1) to a solution (Arnold et al., 1987).

The power of the molecularreplacement procedure for either phase improvement or phase extension depends on the number of noncrystallographic asymmetric units, the size of the excluded volume expressed in terms of the ratio and the magnitude of the measurement error on the structure amplitudes. Crowther (1967, 1969) and Bricogne (1974) have investigated the dependence on the number of noncrystallographic asymmetric units and conclude that three or more copies are sufficient to ensure convergence of an iterative phase improvement procedure in the absence of errors on the structure amplitudes. As with the analogous case of isomorphous replacement in which three data sets ensure reasonable phase determination, additional copies will enhance the power of the method, although their usefulness is subject to the law of diminishing returns. Another example of this principle is the sign determination of the h0l reflections of horse haemoglobin (Perutz, 1954) in which seven shrinkage stages constituted the sampling of the transform of a single copy.
Procedures for realspace averaging have been used extensively with great success. The interesting work of Wilson et al. (1981) is noteworthy for the continuous adjustment of molecular envelope with increased map definition. Furthermore, the analysis of complete virus structures has only been possible as a consequence of this technique (Bloomer et al., 1978; Harrison et al., 1978; AbadZapatero et al., 1980; Liljas et al., 1982). Although the procedure has been used primarily for phase improvement, apparently successful attempts have been made at phase extension (Nordman, 1980b; Gaykema et al., 1984; Rossmann et al., 1985). Ab initio phasing of glyceraldehyde3phosphate dehydrogenase (Argos et al., 1975) was successfully attempted by initially filling the known envelope with uniform density to determine the phases of the innermost reflections and then gradually extending phases to 6.3 Å resolution. Johnson et al. (1976) used the same procedure to determine the structure of southern bean mosaic virus to 22.5 Å resolution. Particularly impressive was the work on polyoma virus (Rayment et al., 1982; Rayment, 1983; Rayment et al., 1983) where crude initial models led to an entirely unexpected breakdown of the Caspar & Klug (1962) concept of quasisymmetry. Ab initio phasing has also been used by combining the electrondiffraction projection data of two different crystal forms of bacterial rhodopsin (Rossmann & Henderson, 1982).
Let us proceed in reciprocal space doing exactly the same as is done in realspace averaging. Thus where Therefore, The next step is to perform the backtransform of the averaged electron density. Hence, where U is the volume within the averaged part of the cell. Hence, substituting for , which is readily simplified to Setting the molecularreplacement equations can be written as (Main & Rossmann, 1966), or in matrix form which is the form of the equations used by Main (1967) and by Crowther (1967). Colman (1974) arrived at the same conclusions by an application of Shannon's sampling theorem. It should be noted that the elements of [B] are dependent only on knowledge of the noncrystallographic symmetry and the volume within which it is valid. Substitution of approximate phases into the righthand side of (2.3.8.11) produces a set of calculated structure factors exactly analogous to those produced by backtransforming the averaged electron density in real space. The new phases can then be used in a renewed cycle of molecular replacement.
Computationally, it has been found more convenient and faster to work in real space. This may, however, change with the advent of vector processing in `supercomputers'. Obtaining improved phases by substitution of current phases on the righthand side of the molecularreplacement equations (2.3.8.1) seems less cumbersome than the repeated forward and backward Fourier transformation, intermediate sorting, and averaging required in the realspace procedure.
References
AbadZapatero, C., AbdelMeguid, S. S., Johnson, J. E., Leslie, A. G. W., Rayment, I., Rossmann, M. G., Suck, D. & Tsukihara, T. (1980). Structure of southern bean mosaic virus at 2.8 Å resolution. Nature (London), 286, 33–39.Argos, P., Ford, G. C. & Rossmann, M. G. (1975). An application of the molecular replacement technique in direct space to a known protein structure. Acta Cryst. A31, 499–506.
Arnold, E., Vriend, G., Luo, M., Griffith, J. P., Kamer, G., Erickson, J. W., Johnson, J. E. & Rossmann, M. G. (1987). The structure determination of a common cold virus, human rhinovirus 14. Acta Cryst. A43, 346–361.
Bloomer, A. C., Champness, J. N., Bricogne, G., Staden, R. & Klug, A. (1978). Protein disk of tobacco mosaic virus at 2.8 Å resolution showing the interactions within and between subunits. Nature (London), 276, 362–368.
Bricogne, G. (1974). Geometric sources of redundancy in intensity data and their use for phase determination. Acta Cryst. A30, 395–405.
Bricogne, G. (1976). Methods and programs for the direct space exploitation of geometric redundancies. Acta Cryst. A32, 832–847.
Buehner, M., Ford, G. C., Moras, D., Olsen, K. W. & Rossmann, M. G. (1974). Structure determination of crystalline lobster Dglyceraldehyde3phosphate dehydrogenase. J. Mol. Biol. 82, 563–585.
Caspar, D. L. D. & Klug, A. (1962). Physical principles in the construction of regular viruses. Cold Spring Harbor Symp. Quant. Biol. 27, 1–24.
Colman, P. M. (1974). Noncrystallographic symmetry and the sampling theorem. Z. Kristallogr. 140, 344–349.
Crowther, R. A. (1967). A linear analysis of the noncrystallographic symmetry problem. Acta Cryst. 22, 758–764.
Crowther, R. A. (1969). The use of noncrystallographic symmetry for phase determination. Acta Cryst. B25, 2571–2580.
Fletterick, R. J. & Steitz, T. A. (1976). The combination of independent phase information obtained from separate protein structure determinations of yeast hexokinase. Acta Cryst. A32, 125–132.
Gaykema, W. P. J., Hol, W. G. J., Vereijken, J. M., Soeter, N. M., Bak, H. J. & Beintema, J. J. (1984). 3.2 Å structure of the coppercontaining, oxygencarrying protein Panulirus interruptus haemocyanin. Nature (London), 309, 23–29.
Harrison, S. C., Olson, A. J., Schutt, C. E., Winkler, F. K. & Bricogne, G. (1978). Tomato bushy stunt virus at 2.9 Å resolution. Nature (London), 276, 368–373.
Johnson, J. E. (1978). Appendix II. Averaging of electron density maps. Acta Cryst. B34, 576–577.
Johnson, J. E., Akimoto, T., Suck, D., Rayment, I. & Rossmann, M. G. (1976). The structure of southern bean mosaic virus at 22.5 Å resolution. Virology, 75, 394–400.
Liljas, L., Unge, T., Jones, T. A., Fridborg, K., Lövgren, S., Skoglund, U. & Strandberg, B. (1982). Structure of satellite tobacco necrosis virus at 3.0 Å resolution. J. Mol. Biol. 159, 93–108.
Main, P. (1967). Phase determination using noncrystallographic symmetry. Acta Cryst. 23, 50–54.
Main, P. & Rossmann, M. G. (1966). Relationships among structure factors due to identical molecules in different crystallographic environments. Acta Cryst. 21, 67–72.
Matthews, B. W., Sigler, P. B., Henderson, R. & Blow, D. M. (1967). Threedimensional structure of tosylαchymotrypsin. Nature (London), 214, 652–656.
Muirhead, H., Cox, J. M., Mazzarella, L. & Perutz, M. F. (1967). Structure and function of haemoglobin. III. A threedimensional Fourier synthesis of human deoxyhaemoglobin at 5.5 Å resolution. J. Mol. Biol. 28, 117–156.
Nordman, C. E. (1980b). Procedures for detection and idealization of noncrystallographic symmetry with application to phase refinement of the satellite tobacco necrosis virus structure. Acta Cryst. A36, 747–754.
Perutz, M. F. (1954). The structure of haemoglobin. III. Direct determination of the molecular transform. Proc. R. Soc. London Ser. A, 225, 264–286.
Rayment, I. (1983). Molecular replacement method at low resolution: optimum strategy and intrinsic limitations as determined by calculations on icosahedral virus models. Acta Cryst. A39, 102–116.
Rayment, I., Baker, T. S. & Caspar, D. L. D. (1983). A description of the techniques and application of molecular replacement used to determine the structure of polyoma virus capsid at 22.5 Å resolution. Acta Cryst. B39, 505–516.
Rayment, I., Baker, T. S., Caspar, D. L. D. & Murakami, W. T. (1982). Polyoma virus capsid structure at 22.5 Å resolution. Nature (London), 295, 110–115.
Rossmann, M. G., Arnold, E., Erickson, J. W., Frankenberger, E. A., Griffith, J. P., Hecht, H. J., Johnson, J. E., Kamer, G., Luo, M., Mosser, A. G., Rueckert, R. R., Sherry, B. & Vriend, G. (1985). Structure of a human common cold virus and functional relationship to other picornaviruses. Nature (London), 317, 145–153.
Rossmann, M. G. & Blow, D. M. (1962). The detection of subunits within the crystallographic asymmetric unit. Acta Cryst. 15, 24–31.
Rossmann, M. G. & Blow, D. M. (1963). Determination of phases by the conditions of noncrystallographic symmetry. Acta Cryst. 16, 39–45.
Rossmann, M. G. & Henderson, R. (1982). Phasing electron diffraction amplitudes with the molecular replacement method. Acta Cryst. A38, 13–20.
Wilson, I. A., Skehel, J. J. & Wiley, D. C. (1981). Structure of the haemagglutinin membrane glycoprotein of influenza virus at 3 Å resolution. Nature (London), 289, 366–373.