International
Tables for
Crystallography
Volume B
Reciprocal space
Edited by U. Shmueli

International Tables for Crystallography (2006). Vol. B, ch. 4.5, pp. 466-485   | 1 | 2 |
https://doi.org/10.1107/97809553602060000567

## Chapter 4.5. Polymer crystallography

R. P. Millanea and D. L. Dorsetb

aWhistler Center for Carbohydrate Research, and Computational Science and Engineering Program, Purdue University, West Lafayette, Indiana 47907-1160, USA, and bExxonMobil Research and Engineering Co., 1545 Route 22 East, Clinton Township, Annandale, New Jersey 08801, USA

The two primary crystallographic techniques used for studying polymer structure, X-ray fibre diffraction analysis and polymer electron crystallography, are described in this chapter. X-ray fibre diffraction analysis is a collection of crystallographic techniques used to determine molecular and crystal structures of molecules, or molecular assemblies, that form specimens (often fibres) in which the molecules, assemblies or crystallites are approximately parallel but not otherwise ordered. The theory and techniques of structure determination by X-ray fibre diffraction analysis are reviewed. Fibre specimens, the theory of diffraction by these specimens, intensity data collection and processing, and the variety of structure determination methods used for the various kinds of specimens studied by fibre diffraction are described. Polymer electron crystallography may be defined as the quantitative use of electron micrographs and electron diffraction intensities for the determination of polymer crystal structures. Data from thin single microcrystals are required for successful structure determination. Crystallization and data-collection techniques are described and two approaches to crystal structure analysis are discussed.

### 4.5.1. Overview

| top | pdf |
R. P. Millanea* and D. L. Dorsetb

Linear polymers from natural or synthetic sources are actually polydisperse aggregates of high-molecular-weight chains. Nevertheless, many of these essentially infinite-length molecules can be prepared as solid-state specimens that contain ordered molecular segments or crystalline inclusions (Vainshtein, 1966; Tadokoro, 1979; Mandelkern, 1989; Barham, 1993). In general, ordering can occur in a number of ways. Hence an oriented and/or somewhat ordered packing of chain segments might be found in a stretched fibre, or in the chain-folded arrangement of a lamellar crystallite. Lamellae themselves may exist as single plates or in the more complex array of a spherulite (Geil, 1963). Diffraction data can be obtained from these various kinds of specimens and used to determine molecular and crystal structures.

There are numerous reasons why crystallography of polymers is important. Although it may be possible to crystallize small constituent fragments of these large molecules and determine their crystal structures, one often wishes to study the intact (and biologically or functionally active) polymeric system. The molecular conformations and intermolecular interactions are determinants of parameters such as persistence length which affect, for example, solution conformations (random or worm-like coils) which determine viscosity. Molecular conformations also influence intermolecular interactions, which determine physical properties in gels and melts. Molecular conformations are, of course, of critical importance in many biological recognition processes. Knowledge of the stereochemical constraints that are placed on the molecular packing to maximize unit-cell density is particularly relevant to the fact that many linear molecules (as well as monodisperse substances with low molecular weight) can adopt several different allomorphic forms, depending on the crystallization conditions employed or the biological origin. Since different allomorphs can behave quite differently from one another, it is clear that the mode of chain packing is related to the bulk properties of the polymer (Grubb, 1993). The three-dimensional geometry of the chain packing obtained from a crystal structure analysis can be used to investigate other phenomena such as the possible inclusion of disordered material in chain-fold regions (Mandelkern, 1989; Lotz & Wittmann, 1993), the ordered interaction of crystallite sectors across grain boundaries where tight interactions are found between domains, or the specific interactions of polymer chains with another substance in a composite material (Lotz & Wittmann, 1993).

The two primary crystallographic techniques used for studying polymer structure are described in this chapter. The first is X-ray fibre diffraction analysis, described in Section 4.5.2; and the second is polymer electron crystallography, described in Section 4.5.3.

Crystallographic studies of polymers were first performed using X-ray diffraction from oriented fibre specimens. Early applications were to cellulose and DNA from the 1930s to the 1950s, and the technique has subsequently been applied to hundreds of biological and synthetic polymers (Arnott, 1980; Millane, 1988). This technique is now referred to as X-ray fibre diffraction analysis. In fact, fibre diffraction analysis can be employed not only for polymers, but for any system that can be oriented. Indeed, one of the first applications of the technique was to tobacco mosaic virus (Franklin, 1955). Fibre diffraction analysis has also utilized, in some cases, neutrons instead of X-rays (e.g. Stark et al., 1988; Forsyth et al., 1989). X-ray fibre diffraction analysis is particularly suitable for biological polymers that form natural fibrous superstructures and even for many synthetic polymers that exist in either a fibrous or a liquid-crystalline state. Fibre diffraction has played an important role in structural studies of polynucleotides, polysaccharides, polypeptides and polyesters, as well as rod-like helical viruses, bacteriophages, microtubules and muscle fibres (Arnott, 1980; French & Gardner, 1980; Hall, 1984; Millane, 1988; Atkins, 1989). The common, and unique, feature of these systems is that the molecules (or their aggregates) are randomly rotated about an axis of preferred orientation. As a result, the measured diffraction is the cylindrical average of that from a single molecule or aggregate. The challenge for the structural scientist, therefore, is that of structure determination from cylindrically averaged diffraction intensities. Since a wide range of types and degrees of order (or disorder) occur in fibrous specimens, as well as a wide range of sizes of the repeating units, a variety of methods are used for structure determination.

The second technique used for structural studies of polymers is polymer electron crystallography. This involves measuring electron intensity data from individual crystalline regions or lamellae in the diffraction plane of an electron microscope. This is possible because a narrow electron beam can be focused on a single thin microcrystal and because of the enhanced scattering cross section of matter for electrons. By tilting the specimen, three-dimensional diffraction intensities from a single microcrystal can be collected. This means that the unit-cell dimensions and symmetry can be obtained unambiguously in electron-diffraction experiments on individual chain-folded lamellae, and the data can be used for actual single-crystal structure determinations. One of the first informative electron-diffraction studies of crystalline polymer films was made by Storks (1938), who formulated the concept of chain folding in polymer lamellae. Among the first quantitative structure determinations from electron-diffraction intensities was that of Tatarinova & Vainshtein (1962) on the α form of poly-γ-methyl-L-glutamate. Quantitative interpretation of the intensity data may benefit from the assumption of quasi-kinematical scattering (Dorset, 1995a), as long as the proper constraints are placed on the experiment. Structure determination may then proceed using the traditional techniques of X-ray crystallography. While molecular-modelling approaches (in which atomic level molecular and crystal structure models are constructed and refined) have been employed with single-crystal electron-diffraction data (Brisse, 1989), the importance of ab initio structure determination has been established in recent years (Dorset, 1995b), demonstrating that no initial assumptions about the molecular geometry need be made before the determination is begun. In some cases too, high-resolution electron micrographs of the polymer crystal structure can be used as an additional means for determining crystallographic phases and/or to visualize lattice defects.

Each of the two techniques described above has its own advantages and disadvantages. While specimen disorder can limit the application of X-ray fibre diffraction analysis, polymer electron diffraction is limited to materials that can be be prepared as crystalline lamallae and that can withstand the high vacuum environment of an electron microscope (although the latter restriction can now be largely overcome by the use of low-temperature specimen holders and/or environmental chambers).

### 4.5.2. X-ray fibre diffraction analysis

| top | pdf |
R. P. Millanea*

#### 4.5.2.1. Introduction

| top | pdf |

X-ray fibre diffraction analysis is a collection of crystallographic techniques that are used to determine molecular and crystal structures of molecules, or molecular assemblies, that form specimens (often fibres) in which the molecules, assemblies or crystallites are approximately parallel but not otherwise ordered (Arnott, 1980; French & Gardner, 1980; Hall, 1984; Vibert, 1987; Millane, 1988; Atkins, 1989; Stubbs, 1999). These are usually long, slender molecules and they are often inherently flexible, which usually precludes the formation of regular three-dimensional crystals suitable for conventional crystallographic analysis. X-ray fibre diffraction therefore provides a route for structure determination for certain kinds of specimens that cannot be crystallized. Although it may be possible to crystallize small fragments or subunits of these molecules, and determine the crystal structures of these, X-ray fibre diffraction provides a means for studying the intact, and often the biologically or functionally active, system. Fibre diffraction has played an important role in the determination of biopolymers such as polynucleotides, polysaccharides (both linear and branched), polypeptides and a wide variety of synthetic polymers (such as polyesters), as well as larger assemblies including rod-like helical viruses, bacteriophages, microtubules and muscle fibres (Arnott, 1980; Arnott & Mitra, 1984; Millane, 1990c; Squire & Vibert, 1987).

Specimens appropriate for fibre diffraction analysis exhibit rotational disorder (of the molecules, aggregates or crystallites) about a preferred axis, resulting in cylindrical averaging of the diffracted intensity in reciprocal space. Therefore, fibre diffraction analysis can be thought of as structure determination from cylindrically averaged diffraction intensities' (Millane, 1993). In a powder specimen the crystallites are completely (spherically) disordered, so that structure determination by fibre diffraction can be considered to be intermediate between structure determination from single crystals and from powders.

This section is a review of the theory and techniques of structure determination by X-ray fibre diffraction analysis. It includes descriptions of fibre specimens, the theory of diffraction by these specimens, intensity data collection and processing, and the variety of structure determination methods used for the various kinds of specimens studied by fibre diffraction. It does not include descriptions of specimen preparation (those can be found in the references given for specific systems), or of applications of X-ray diffraction to determining polymer morphology (e.g. particle or void sizes and shapes, texture, domain structure etc.).

#### 4.5.2.2. Fibre specimens

| top | pdf |

A wide variety of kinds of fibre specimen exist. All exhibit preferred orientation; the variety results from variability in the degree of order (crystallinity) in the lateral plane (the plane perpendicular to the axis of preferred orientation). This leads to categorization of three kinds of fibre specimen: noncrystalline fibres, in which there is no order in the lateral plane; polycrystalline fibres, in which there is near-perfect crystallinity in the lateral plane; and disordered fibres, in which there is disorder either within the molecules or in their crystalline packing (or both). The kind of fibre specimen affects the kind of diffraction pattern obtained, the relationships between the molecular and crystal structures and the diffraction data, methods of data collection, and methods of structure determination.

Noncrystalline fibres are made up of a collection of molecules that are oriented. This means that there is a common axis in each molecule (referred to here as the molecular axis), the axes being parallel in the specimen. The direction of preferred orientation is called the fibre axis. The molecule itself is usually considered to be a rigid body. There is no other ordering within the specimen. The molecules are therefore randomly positioned in the lateral plane and are randomly rotated about their molecular axes. Furthermore, if the molecule does not have a twofold rotation axis normal to the molecular axis, then the molecular axis has a direction associated with it, and the molecular axes are oriented randomly parallel or antiparallel to each other. This is often called directional disorder, or the molecules are said to be oriented randomly up and down. The average length of the ordered molecular segments in a noncrystalline fibre is referred to as the coherence length.

Polycrystalline fibres are characterized by molecular segments packing together to form well ordered microcrystallites within the specimen. The crystallites effectively take the place of the molecules in a noncrystalline specimen as described above. The crystallites are oriented, and since the axis within each crystallite that is aligned parallel to those in other crystallites usually corresponds to the long axes of the constituent molecules, it is also referred to here as the molecular axis. The crystallites are randomly positioned in the lateral plane, randomly rotated about the molecular axis, and randomly oriented up or down. The size of the crystalline domains can be characterized by their average dimensions in the directions of the a, b and c unit-cell vectors. However, because of the rotational disorder of the crystallites, any differences between crystallite dimensions in different directions normal to the fibre axis tend to be smeared out in the diffraction pattern, and the crystallite size is usefully characterized by the average dimensions of the crystallites normal and parallel to the fibre axis.

The molecules or crystallites in a fibre specimen are not perfectly oriented, and the variation in inclinations of the molecular axes to the fibre axis is referred to as disorientation. Assuming that the orientation is axisymmetric, then it can be described by an orientation density function such that is the fraction of molecules in an element of solid angle inclined at an angle α to the fibre axis. The exact form of is generally not known for any particular fibre and it is often sufficient to assume a Gaussian orientation density function, so that where is a measure of the degree of disorientation.

Fibre specimens often exhibit various kinds of disorder. The disorder may be within the molecules or in their packing. Disorder affects the relationship between the molecular and crystal structure and the diffracted intensities. Disorder within the molecules may result from a degree of randomness in the chemical sequence of the molecule or from variability in the interactions between the units that make up the molecule. Such molecules may (at least in principle) form noncrystalline, polycrystalline or partially crystalline (described below) fibres. Disordered packing of molecules within crystallites can result from a variety of ways in which the molecules can interact with each other. Fibre specimens made up of disordered crystallites are referred to here as partially crystalline fibres.

#### 4.5.2.3. Diffraction by helical structures

| top | pdf |

Molecules or assemblies studied by fibre diffraction are usually made up of a large number of identical, or nearly identical, residues, or subunits, that in an oriented specimen are distributed along an axis; this leads naturally to helical symmetry. Since a periodic structure with no helix symmetry can be treated as a onefold helix, the assumption of helix symmetry is not restrictive.

#### 4.5.2.3.1. Helix symmetry

| top | pdf |

The presence of a unique axis about which there is rotational disorder means that it is convenient to use cylindrical polar coordinate systems in fibre diffraction. We denote by a cylindrical polar coordinate system in real space, in which the z axis is parallel to the molecular axes. The molecule is said to have helix symmetry, where u and v are integers, if the electron density satisfies where m is any integer. The constant c is the period along the z direction, which is referred to variously as the molecular repeat distance, the crystallographic repeat, or the c repeat. The helix pitch P is equal to . Helix symmetry is easily interpreted as follows. There are u subunits, or helix repeat units, in one c repeat of the molecule. The helix repeat units are repeated by integral rotations of about, and translations of along, the molecular (or helix) axis. The helix repeat units may therefore be referenced to a helical lattice that consists of points at a fixed radius, with relative rotations and translations as described above. These points lie on a helix of pitch P, there are v turns (or pitch-lengths) of the helix in one c repeat, and there are u helical lattice points in one c repeat. A helix is said to have u residues in v turns'.

Since the electron density is periodic in ϕ and z, it can be decomposed into a Fourier series as where the coefficients are given by Assume now that the electron density has helical symmetry. Denote by the electron density in the region ; the electron density being zero outside this region, i.e. is the electron density of a single helix repeat unit. It follows that Substituting equation (4.5.2.5) into equation (4.5.2.4) shows that vanishes unless is a multiple of u, i.e. unless for any integer m. Equation (4.5.2.6) is called the helix selection rule. The electron density in the helix repeat unit is therefore given by where and where in equation (4.5.2.7) (and in the remainder of this section) the sum over l is over all integers, the sum over n is over all integers satisfying the helix selection rule and the integral in equation (4.5.2.8) is over one helix repeat unit. The effect of helix symmetry, therefore, is to restrict the number of Fourier coefficients required to represent the electron density to those whose index n satisfies the selection rule. Note that the selection rule is usually derived using a rather more complicated argument by considering the convolution of the Fourier transform of a continuous filamentary helix with a set of planes in reciprocal space (Cochran et al., 1952). The approach described above, which follows that of Millane (1991), is much more straightforward.

#### 4.5.2.3.2. Diffraction by helical structures

| top | pdf |

Denote by a cylindrical polar coordinate system in reciprocal space (with the Z and z axes parallel), and by the Fourier transform of . Since is periodic in z with period c, its Fourier transform is nonzero only on the layer planes where l is an integer. Denote by ; using the cylindrical form of the Fourier transform shows that

It is convenient to rewrite equation (4.5.2.9) making use of the Fourier decomposition described in Section 4.5.2.3.1, since this allows utilization of the helix selection rule. The Fourier–Bessel structure factors (Klug et al., 1958), , are defined as the Hankel transform of the Fourier coefficients , i.e. and the inverse transform is Using equations (4.5.2.7) and (4.5.2.11) shows that equation (4.5.2.9) can be written as where, as usual, the sum is over only those values of n that satisfy the helix selection rule. Using equations (4.5.2.8) and (4.5.2.10) shows that the Fourier–Bessel structure factors may be written in terms of the atomic coordinates as where is the (spherically symmetric) atomic scattering factor (usually including an isotropic temperature factor) of the jth atom and is the spherical radius in reciprocal space. Equations (4.5.2.12) and (4.5.2.13) allow the complex diffracted amplitudes for a helical molecule to be calculated from the atomic coordinates, and are analogous to expressions for the structure factors in conventional crystallography.

The significance of the selection rule is now more apparent. On a particular layer plane l, not all Fourier–Bessel structure factors contribute; only those whose Bessel order n satisfies the selection rule for that value of l contribute. Since any molecule has a maximum radius, denoted here by , and since is small for and diffraction data are measured out to only a finite value of R, reference to equation (4.5.2.10) [or equation (4.5.2.13)] shows that there is a maximum Bessel order that contributes significant value to equation (4.5.2.12) (Crowther et al., 1970; Makowski, 1982), so that the infinite sum over n in equation (4.5.2.12) can be replaced by a finite sum. On each layer plane there is also a minimum value of , denoted by , that satisfies the helix selection rule, so that the region is devoid of diffracted amplitude where

The selection rule therefore results in a region around the Z axis of reciprocal space that is devoid of diffraction, the shape of the region depending on the helix symmetry.

#### 4.5.2.3.3. Approximate helix symmetry

| top | pdf |

In some cases the nature of the subunits and their interactions results in a structure that is not exactly periodic. Consider a helical structure with subunits in v turns, where x is a small real number; i.e. the structure has approximate, but not exact, helix symmetry. Since the molecule has an approximate repeat distance c, only those layer planes close to those at show significant diffraction. Denoting by the Z coordinate of the nth Bessel order and its associated value of m, and using the selection rule shows that so that the positions of the Bessel orders are shifted by from their positions if the helix symmetry is exactly . At moderate resolution m is small so the shift is small. Hence Bessel orders that would have been coincident on a particular layer plane are now separated in reciprocal space. This is referred to as layer-plane splitting and was first observed in fibre diffraction patterns from tobacco mosaic virus (TMV) (Franklin & Klug, 1955). Splitting can be used to advantage in structure determination (Section 4.5.2.6.6).

As an example, TMV has approximately 493 helix symmetry with a c repeat of 69 Å. However, close inspection of diffraction patterns from TMV shows that there are actually about 49.02 subunits in three turns (Stubbs & Makowski, 1982). The virus is therefore more accurately described as a 2451150 helix with a c repeat of 3450 Å. The layer lines corresponding to this larger repeat distance are not observed, but the effects of layer-plane splitting are detectable (Stubbs & Makowski, 1982).

#### 4.5.2.4. Diffraction by fibres

| top | pdf |

The kind of diffraction pattern obtained from a fibre specimen made up of helical molecules depends on the kind of specimen as described in Section 4.5.2.2. This section is divided into four parts. The first two describe diffraction patterns obtained from noncrystalline and polycrystalline fibres (which are the most common kinds used for structural analysis), and the last two describe diffraction by partially crystalline fibres.

#### 4.5.2.4.1. Noncrystalline fibres

| top | pdf |

A noncrystalline fibre is made up of a collection of helical molecules that are oriented parallel to each other, but are otherwise randomly positioned and rotated relative to each other. The recorded intensity, , is therefore that diffracted by a single molecule cylindrically averaged about the Z axis in reciprocal space i.e. using equation (4.5.2.12) shows that where, as usual, the sum is over the values of n that satisfy the helix selection rule. On the diffraction pattern, reciprocal space collapses to the two dimensions (R, Z). The R axis is called the equator and the Z axis the meridian. The layer planes collapse to layer lines, at , which are indexed by l. Equation (4.5.2.17) gives a rather simple relationship between the recorded intensity and the Fourier–Bessel structure factors.

Coherence length and disorientation, as described in Section 4.5.2.2, also affect the form of the diffraction pattern. These effects are described here, although they also apply to other than noncrystalline fibres. A finite coherence length leads to smearing of the layer lines along the Z direction. If the average coherence length of the molecules is , the intensity distribution about the lth layer line can be approximated by

It is convenient to express the effects of disorientation on the intensity distribution of a fibre diffraction pattern by writing the latter as a function of the polar coordinates (where σ is the angle with the Z axis) in (R, Z) space. Assuming a Gaussian orientation density function [equation (4.5.2.1)], if is small and the effects of disorientation dominate over those of coherence length (which is usually the case except close to the meridian), then the distribution of intensity about one layer line can be approximated by (Holmes & Barrington Leigh, 1974; Stubbs, 1974) where (Millane & Arnott, 1986; Millane, 1989c) and is the polar angle at the centre of the layer line, i.e. . The effect of disorientation, therefore, is to smear each layer line about the origin of reciprocal space.

#### 4.5.2.4.2. Polycrystalline fibres

| top | pdf |

A polycrystalline fibre is made up of crystallites that are oriented parallel to each other, but are randomly positioned and randomly rotated about their molecular axes. The recorded diffraction pattern is the intensity diffracted by a single crystallite, cylindrically averaged about the Z axis. On a fibre diffraction pattern, therefore, the Bragg reflections are cylindrically projected onto the (R, Z) plane and their positions are described by the cylindrically projected reciprocal lattice (Finkenstadt & Millane, 1998).

The molecules are periodic and are therefore usually aligned with one of the unit-cell vectors. Since the z axis is defined as the fibre axis, it is usual in fibre diffraction to take the c lattice vector as the unique axis and as the lattice vector parallel to the molecular axes. It is almost always the case that the fibre is rotationally disordered about the molecular axes, i.e. about c. Consider first the case of a monoclinic unit cell so that the reciprocal lattice is cylindrically projected about . The cylindrical coordinates of the projected reciprocal-lattice points are then given by and so that R depends only on h and k, and Z depends only on l. Reflections with fixed h and k lie on straight row lines. Certain sets of distinct reciprocal-lattice points will have the same value of and therefore superimpose in cylindrical projection. For example, for an orthorhombic system the reciprocal-lattice points (hkl), , and superimpose. Furthermore, the crystallites in a fibre specimen are usually oriented randomly up and down so that the reciprocal-lattice points (hkl) and superimpose, so that in the orthorhombic case eight reciprocal-lattice points superimpose. Also, as described below, reciprocal-lattice points that have similar values of R can effectively superimpose.

If the unit cell is either triclinic, or is monoclinic with or , then is inclined to c and the Z axis, and the reciprocal lattice is not cylindrically projected about . Equation (4.5.2.22) for still applies, but the cylindrical radius is given by and the row lines are curved (Finkenstadt & Millane, 1998).

The most complicated situation arises if the crystallites are rotationally disordered about an axis that is inclined to c. Reciprocal space is then rotated about an axis that is inclined to the normal to the plane, and are both functions of h, k and l, equation (4.5.2.23) does not apply, and reciprocal-lattice points for fixed l do not lie on layer lines of constant Z. Although this situation is rather unusual, it does occur (Daubeny et al., 1954), and is described in detail by Finkenstadt & Millane (1998).

The observed fibre diffraction pattern consists of reflections at the projected reciprocal-lattice points whose intensities are equal to the sums of the intensities of the contributing structure factors. The observed intensity, denoted by , at a projected reciprocal-lattice point on the lth layer line and with is therefore given by (assuming, for simplicity, a monoclinic system) where denotes the set of indices such that . The number of independent reflections contributing in equation (4.5.2.24) depends on the space-group symmetry of the crystallites, because of either systematic absences or structure factors whose values are related.

The effect of a finite crystallite size is to smear what would otherwise be infinitely sharp reflections into broadened reflections of a finite size. If the average crystallite dimensions normal and parallel to the z axis are (i.e. in the lateral' direction) and (i.e. in the axial' direction), respectively, the profile of the reflection centred at can be written as (Fraser et al., 1984; Millane & Arnott, 1986; Millane, 1989c) where the profile function can be approximated by

The effect of crystallite disorientation is to smear the reflections given by equation (4.5.2.26) about the origin of the projected reciprocal space. If the effects of disorientation dominate over those of crystallite size, then the profile of a reflection can be approximated by (Fraser et al., 1984; Millane & Arnott, 1986; Millane, 1989c) where are the polar coordinates of the reflection, and

Reflections that have similar enough coordinates overlap severely with each other and are also included in the sum in equation (4.5.2.24). This is quite common in practice because a number of sets of reflections may have similar values of .

#### 4.5.2.4.3. Random copolymers

| top | pdf |

Random copolymers are made up of a small number of different kinds of monomer, whose sequence along the polymer chain is not regular, but is random, or partially random. A particularly interesting class are synthetic polymers such as copolyesters that form a variety of liquid-crystalline phases and have useful mechanical properties (Biswas & Blackwell, 1988a). The structures of these materials have been studied quite extensively using X-ray fibre diffraction analysis. Because the molecules do not have an average c repeat, their diffraction patterns do not consist of equally spaced layer lines. However, as a result of the small number of distinct spacings associated with the monomers, diffracted intensity is concentrated about layer lines, but these are irregularly spaced (along Z) and are aperiodic. Since the molecule is not periodic, the basic theory of diffraction by helical molecules described in Section 4.5.2.3.2 does not apply in this case. Cylindrically averaged diffraction from random copolymers is described here. Related approaches have been described independently by Hendricks & Teller (1942) and Blackwell et al. (1984). Hendricks & Teller (1942) considered the rather general problem of diffraction by layered structures made up of different kinds of layers, the probability of a layer at a particular level depending on the layers present at adjacent levels. This is a one-dimensional disordered structure that can be used to describe a random copolymer. Blackwell and co-workers have developed a similar theory in terms of a one-dimensional paracrystalline model (Hosemann & Bagchi, 1962) for diffraction by random copolymers (Blackwell et al., 1984; Biswas & Blackwell, 1988a), and this is the theory described here.

Consider a random copolymer made up of monomer units (residues) of N different types. Since the disorder is along the length of the polymer, some of the main characteristics of diffraction from such a molecule can be elucidated by studying the diffraction along the meridian of the diffraction pattern. The meridional diffraction is the intensity of the Fourier transform of the polymer chain projected onto the z axis and averaged over all possible monomer sequences. The diffraction pattern depends on the monomer (molar) compositions, denoted by , the statistics of the monomer sequence (described by the probability of the different possible monomer pairs in this model) and the Fourier transform of the monomer units. Development of this model shows that the meridional diffracted intensity can be written in the form (Blackwell et al., 1984; Biswas & Blackwell, 1988a; Schneider et al., 1991) where the summations are over the different monomer types and is the axial Fourier transform of the ith monomer unit (each referenced to a common origin). The are most conveniently described by defining them as the ijth element of an matrix T, which is given by where and I are matrices. I is the unit matrix and P is a diagonal matrix with elements . The elements of M are the probabilities of forming ij monomer pairs and can be generated for different kinds of random sequence (e.g. chemical restrictions on the occurrence of particular monomer pairs, random chains, varying degrees of blockiness, tendency towards alternating sequences etc.) (Schneider et al., 1991). The matrix is diagonal with elements equal to where the are the projected monomer lengths and the average is over all chain conformations.

Equation (4.5.2.30) can be used to calculate the meridional diffraction for a particular random copolymer. The most important result of such a calculation is that intensity maxima are spaced irregularly along the meridian. The positions of the maxima depend on the monomer proportions, the sequence statistics and the projected monomer lengths.

The full cylindrically averaged diffraction pattern, denoted by , from a noncrystalline specimen containing oriented random copolymer chains can be calculated by replacing in equation (4.5.2.30) by , i.e. (Biswas & Blackwell, 1988a). Note that we write rather than since the pattern cannot be indexed on the basis of regularly spaced layer lines. The in equation (4.5.2.32) depend on the chain conformation, since this affects the range of monomer orientations and hence their average diffraction. Chivers & Blackwell (1985) have considered two extreme cases, one corresponding to fixed conformations between monomers and the other corresponding to completely random conformations between monomers, and have derived expressions for the diffracted intensity in both cases. Equation (4.5.2.32) allows one to calculate the fibre diffraction pattern from an array of parallel random copolymers that exhibit no lateral ordering. The diffraction pattern consists of irregularly spaced layer lines whose spacings (in Z) are the same as those described above for the meridional maxima. Measurement of layer-line spacings and intensities and comparison with calculations based on the constituent monomers allows chain conformations to be estimated (Biswas & Blackwell, 1988a).

Diffraction patterns from liquid-crystalline random copolymers often contain sharp Bragg maxima on the layer lines. This indicates that, despite the random sequence and the possible dissimilarity of the component monomers, the chains are able to pack together in a regular way (Biswas & Blackwell, 1988b,c). Expressions that allow calculation of diffraction patterns for arrays of polymers with minimal registration, in which short, non-identical sequences form layers, have been derived (Biswas & Blackwell, 1988b,c). Calculation of diffracted intensities, coupled with molecular-mechanics modelling, allows chain conformations and packing to be investigated (Hofmann et al., 1994).

#### 4.5.2.4.4. Partially crystalline fibres

| top | pdf |

In this section we address disorder in the packing of the molecules in a polycrystalline fibre. The presence of disorder within the crystallites modifies the intensities of the Bragg reflections, as well as introducing continuous diffraction. The dominant effect, sometimes seen on fibre diffraction patterns (Stroud & Millane, 1995a), is for Bragg reflections to remain at low resolution but to be replaced by continuous diffraction at high resolution. There are two distinct cases to consider. The first is where the distortions at different lattice points in the crystallite are uncorrelated, and the second is where they are correlated.

Disorder within a crystallite in a polycrystalline fibre may consist of (1) deviations in the positions of the molecules (which are treated as rigid bodies) from their positions in a regular lattice, (2) rotations of the molecules about their molecular axes from their rotational positions in an ordered crystal, and (3) random orientations (up or down) of the molecules. The first of these is called lattice disorder, and the second and third are components of substitution disorder.

Uncorrelated disorder has been treated by a number of authors (Clark & Muus, 1962; Tanaka & Naya, 1969; Fraser & MacRae, 1973; Arnott, 1980). A rather complete model has been described by Millane & Stroud (1991) and Stroud & Millane (1995b), which is presented here. If the lattice and substitution disorders are independent, and the lattice and substitution distortions at different lattice sites are uncorrelated, then the cylindrically averaged layer-line intensities diffracted from a fibre can be written as a sum of Bragg and diffuse (continuous) intensities (Tanaka & Naya, 1969):

The profiles of the Bragg reflections are independent of the position of the reflection in reciprocal space. If the Cartesian components of the lattice distortions are independent, normally distributed, and the x and y components have equal variances, cylindrical averaging of the diffracted intensity can be performed analytically.

The lattice disorder consists of distortions of the two-dimensional lattice (in the lateral plane) into three-dimensional space, and in the absence of substitution disorder the Bragg component is given by (Stroud & Millane, 1995b) where is given by equation (4.5.2.24), the lattice disorder weight, , is given by where and are the variances of the lattice distortions normal (lateral') and parallel (axial') to the z axis, respectively. The diffuse component is given by where is given by equation (4.5.2.17) and is the average cross-sectional area of the crystallites. Inspection of equations (4.5.2.34) and (4.5.2.36) shows that the effect of the lattice disorder is to weight the amplitudes of the Bragg reflections down with increasing R and l, and to introduce a continuous intensity component whose amplitude increases with R and Z. Furthermore, the amplitude of the diffuse component relative to the Bragg component is inversely proportional to , and therefore is not significant unless the crystallites are small.

If substitution disorder is also present, then equation (4.5.2.33) still applies but equations (4.5.2.34) and (4.5.2.36) are replaced by and respectively, where the substitution disorder weight, , is given by In equation (4.5.2.39), is the probability density function (p.d.f.) that describes the substitution disorder, i.e. the p.d.f. for a molecule being rotated by ϕ about and translated by z along the molecular axis relative to its position in the undistorted lattice. Inspection of equations (4.5.2.37) and (4.5.2.38) shows that the substitution disorder weights the different contributing Bessel terms differently. This can lead to quite complicated effects on the diffraction pattern for various kinds of substitution disorder, resulting in different distributions and amplitudes of Bragg and diffuse diffraction over the diffraction pattern (Stroud & Millane, 1995b). If one assumes either uniform or normal distributions for ϕ and z, then expressions can be obtained for the in terms of the variances of the distributions of ϕ and z (Stroud & Millane, 1995b). The cases where distortions in ϕ are correlated with distortions in z (e.g. screw disorder'), and directional (up and down) disorder, can also be accommodated. This model has been shown to be capable of predicting diffraction patterns which are in good agreement with those measured from some disordered polycrystalline fibres (Stroud & Millane, 1995a).

We consider now the case of correlated packing disorder. As a result of intermolecular contacts within a polycrystalline specimen, it is possible that distortions at one lattice site will affect the degree of distortion at neighbouring sites. Coupling between distortions at different lattice sites can be included in the model of disorder by allowing the distortions at different lattice sites to be correlated. The effect of correlated distortions on diffraction patterns is that the diffracted intensity does not separate into Bragg and diffuse components as it does in the case of uncorrelated distortions [equation (4.5.2.33)]. The intensity can be described as being diffuse on the whole diffraction pattern, with (often broad) maxima occurring at some of the reciprocal-lattice points, but with no significant maxima at other reciprocal-lattice points. The widths of the profiles of the maxima generally increase with increasing resolution, whereas the widths of the Bragg maxima resulting from uncorrelated disorder as described above are independent of resolution. A broadening of diffraction maxima with increasing resolution and blending into continuous diffraction is sometimes seen on diffraction patterns from polycrystalline fibres, indicating the presence of correlated disorder (Stroud & Millane, 1996b).

Correlated lattice disorder consists of correlated distortions of the two-dimensional lattice into three-dimensional space. A flexible model of crystalline disorder is that based on the perturbed lattice approach (Welberry et al., 1980). While formulating perturbed lattices with only nearest-neighbour interactions is complicated, a more tractable approach is to base the statistics on an imposed correlation field (de Graaf, 1989; Stroud & Millane, 1996a). This approach has been used to describe cylindrically averaged diffraction from polycrystalline fibres that contain correlated lattice disorder and uncorrelated substitution disorder (Stroud & Millane, 1996a,b), and is presented here.

To develop a flexible and tractable theory for diffraction from crystallites with correlated disorder, it is necessary to formulate the problem in real space. The size and shape of a crystallite in the xy (lateral) plane is described by a shape function , where r denotes the position vector in real space, which is equal to unity inside the crystallite and zero outside. The autocorrelation of the shape function, , is given by

The correlations between the x components, and between the y components, of the distortions at any two lattice sites are taken to be identical. The correlations between distortion vectors are defined in terms of lateral, , and axial, , correlation fields such that the correlation coefficients between components of the distortions in the x (or y) and z directions, respectively, are equal to the correlation field evaluated for r equal to the inter-site vector. Various functional forms for the correlation fields are possible, but exponential correlation functions are usually used (Stroud & Millane, 1996a). If and the correlation fields are circularly symmetric, then cylindrical averaging of the diffracted intensity can be performed analytically.

For a polycrystalline fibre with correlated lattice disorder and uncorrelated substitution disorder, the diffracted intensity is given by (Stroud & Millane, 1996b) where , the sum over is over all the sites of the undistorted lattice within the region occupied by the autocorrelation function, are the polar coordinates of the lattice sites, and the lateral and axial lattice disorder weights are given by and

Equation (4.5.2.41) is an expression for the continuous intensity distribution along the layer lines and does not separate into Bragg and continuous components as in the case of uncorrelated disorder. However, calculations using these expressions show that the continuous intensity is sharply peaked around the projected reciprocal-lattice points at low resolution, the peaks broadening with increasing resolution until they have the character of continuous diffraction at high resolution (Stroud & Millane, 1996a). This is consistent with the character of diffraction patterns from some disordered polycrystalline fibres. A detailed study of the effects of correlated disorder on fibre diffraction patterns, and analysis of such disorder, can be found in Stroud & Millane (1996a) and Stroud & Millane (1996b).

#### 4.5.2.5. Processing diffraction data

| top | pdf |

Since the diffraction pattern from a fibre is two-dimensional, it can be collected with a single exposure of a stationary specimen. Diffraction data are collected either on film, which is subsequently scanned by a two-dimensional microdensitometer to obtain a digitized representation of the diffracted intensity, or using an electronic area detector (imaging plate, CCD camera, wire detector etc.) (Fraser et al., 1976; Namba, Yamashita & Vonderviszt, 1989; Lorenz & Holmes, 1993). We assume here that the diffraction pattern is recorded on a flat film (or detector) that is normal to the incident X-ray beam, although other film geometries are easily accommodated (Fraser et al., 1976). The fibre specimen is usually oriented with its axis normal to the incident X-ray beam, although, as is described below, it is sometimes tilted by a small angle to the normal in order to better access reciprocal space close to the meridian. The diffraction and camera geometry are shown in Fig. 4.5.2.1. Referring to this figure, P and S denote the intersections of the diffracted beam with the sphere of reflection and the film, respectively. The fibre, and therefore reciprocal space, is tilted by an angle β to the normal to the incident beam. The angles μ and χ define the direction of the diffracted beam and θ is the Bragg angle. Cartesian and polar coordinates on the film are denoted by (u, v) and , respectively, and D denotes the film-to-specimen distance.

 Figure 4.5.2.1 | top | pdf |Fibre diffraction geometry (see text). O is the origin of reciprocal space and OA is normal to the incident X-ray beam. Reciprocal space is rotated about OY so that the Z axis is inclined at an angle β to OA. Q is the projection of P onto the plane containing the incident beam and OY.

Inspection of Fig. 4.5.2.1 shows that the cylindrical and spherical polar coordinates in reciprocal space are related to μ and χ by and

The coordinates on the film are related to μ and χ by and and we also have that

Use of the above equations allows the reciprocal-space coordinates to be calculated from film-space coordinates, and vice versa. The film coordinates represent a relatively undistorted map of reciprocal space , except near the v (vertical) axis of the diffraction pattern. The meridian of reciprocal space does not map onto the film. Inspection of Fig. 4.5.2.1 shows that the only point on the meridian that does appear on the film is at . The region close to the meridian that appears on the film can therefore be manipulated by adjusting the fibre tilt.

The film-to-specimen distance can be determined by including with the specimen a crystalline power that gives a diffraction ring of known spacing and adjusting the film-to-specimen distance so that the calculated and observed rings coincide. A nonzero fibre tilt leads to differences between the upper and lower halves of the diffraction pattern, and these differences can be used to determine the tilt. This can be done by either calculating the ρ and χ values for several sets of the same reflection above and below the equator and using the relationship where and refer to the upper and lower reflections (Millane & Arnott, 1986; Lorenz & Holmes, 1993), or by finding the tilt that minimizes the differences between optical densities at the same reciprocal-space coordinates above and below the equator (Fraser et al., 1976). The optical densities may also be corrected for the effects of film (or detector) nonlinearity (Fraser et al., 1976) and the effects of variable absorption owing to the oblique passage of the beam through the film using expressions given by Fraser et al. (1976) and Lorenz & Holmes (1993).

Accurate subtraction of background diffraction is important in order to obtain accurate intensity measurements. One approach to estimating background diffraction is to fit a global background function, usually expanded as a polynomial (Lorenz & Holmes, 1993) or a Fourier–Bessel (Millane & Arnott, 1985) series, to optical densities at a set of points on the diffraction pattern that represent background alone. The background function may or may not be circularly symmetric. The background function is subtracted from the whole diffraction pattern. Another approach, suitable only for Bragg diffraction patterns, is to fit a plane under each reflection, either to the peripheral regions of the reflection or as part of a profile-fitting procedure (Fraser et al., 1976). A different plane is required for each reflection. A third approach, more suitable for continuous diffraction patterns, is to fit a one-dimensional polynomial in angle, for each value of r on the film, possibly as part of a deconvolution procedure (Makowski, 1978). Recently, Ivanova & Makowski (1998) have described an iterative low-pass filtering technique for estimating the background on diffraction patterns from poorly oriented specimens in which there is little space between the layer lines for sampling the background.

A polarization correction is applied to the diffraction pattern, where for unpolarized X-rays (laboratory sources) the polarization factor p is given by (Fraser et al., 1976) The diffraction pattern is usually mapped into reciprocal space for subsequent analysis. The mapping is performed by assigning to the intensity at position in reciprocal space the value given by (Fraser et al., 1976) where denotes the intensity on the film. The functions and can be derived from the equations given above. Note that equation (4.5.2.54) includes, implicitly, the Lorentz factor.

Subsequent processing depends on whether the diffraction pattern is continuous (i.e. from a noncrystalline specimen) or Bragg (i.e. from a polycrystalline specimen). Diffraction patterns from partially crystalline specimens that contain both components have been analysed using a combination of both approaches (Arnott et al., 1986; Park et al., 1987).

For a diffraction pattern containing continuous diffraction on layer lines, one usually extracts the cylindrically averaged transform from the intensity on the diffraction pattern mapped into reciprocal space. This involves correcting for the effects of coherence length and disorientation expressed by equation (4.5.2.19), and for the overlap of the smeared layer lines that results from their increasing width with increasing R. The diffracted intensity in polar coordinates in reciprocal space is equal to the sum of the diffraction due to each (overlapping) layer line so that Referring to equations (4.5.2.55), (4.5.2.19) and (4.5.2.20) shows that if the smearing due to disorientation dominates over that due to coherence length, then for fixed ρ, equation (4.5.2.55) represents a convolution along σ of the layer-line intensities with the Gaussian angular profile in equation (4.5.2.19). By mapping the intensity into polar coordinates as , or by simply sampling for fixed ρ and equally spaced samples of σ, can be calculated from by deconvolution, usually by some appropriate solution of the resulting system of linear equations (Makowski, 1978). If the effects of coherence length are significant, as they often are, then equation (4.5.2.55) does not represent a convolution since the width of the Gaussian smearing function depends on σ through equation (4.5.2.20). However, the problem can still be posed as the solution of a system of linear equations and becomes one of profile fitting rather than deconvolution (Millane & Arnott, 1986). This allows the layer-line intensities to be extracted from the data beyond the resolution where they overlap, although there is a limiting resolution, owing to excessive overlap, beyond which reliable data cannot be obtained (Makowski, 1978; Millane & Arnott, 1986). This procedure requires that and be known; these parameters can be estimated from the angular profiles at low resolution where there is no overlap, or they can be determined as part of the profile-fitting procedure.

For a diffraction pattern from a polycrystalline specimen containing Bragg reflections, the intensities given by equation (4.5.2.24) need to be extracted from the intensity on the diffraction pattern mapped into reciprocal space. Each composite reflection is smeared into a spot whose intensity profile is given by equation (4.5.2.27), and adjacent reflections may overlap. The intensity is equal to the intensity integrated over the region of the spot, and the intensity at the centre of a spot is reduced, relative to , by a factor that increases with the degree of smearing.

The c repeat can be obtained immediately from the layer-line spacing. Initial estimates of the remaining cell constants can be made from inspection of the coordinates of low-order reflections. These values are refined by minimizing the difference between the calculated and measured coordinates of all the sharp reflections on the pattern.

One approach to measuring the intensities of Bragg reflections is to estimate the boundary of each spot (or a fixed proportion of the region occupied by each spot) and integrate the intensity over that region (Millane & Arnott, 1986; Hall et al., 1987). For spots that overlap, an integration region that is the union of the region occupied by each contributing spot can be used, allowing the intensities for composite spots to be calculated (Millane & Arnott, 1986). This is more accurate than methods based on the measurement of the peak intensity followed by a correction for smearing. Integration methods suffer from problems associated with determining accurate spot boundaries and they are not capable of separating weakly overlapping spots. A more effective approach is one based on profile fitting. The intensity distribution on the diffraction pattern can be written as where denotes the intensity distribution of the spot , and the sums are over all spots on the diffraction pattern. Using equation (4.5.2.27) shows that equation (4.5.2.56) can be written as where denotes the profile of the spot centred at [which can be derived from equation (4.5.2.27)]. Given estimates of the parameters , and , equation (4.5.2.57) can be written as a system of linear equations that can be solved for the intensities from the data on the diffraction pattern. The parameters , and , as well as the cell constants and possibly other parameters, can also be refined as part of the profile-fitting procedure using nonlinear optimization.

A suite of programs for processing fibre diffraction data is distributed (and often developed) by the Collaborative Computational Project for Fibre and Polymer Diffraction (CCP13) in the UK (http://www.ccp13.ac.uk/ ) (Shotton et al., 1998).

| top | pdf |

#### 4.5.2.6.1. Overview

| top | pdf |

Structure determination in fibre diffraction is concerned with determining atomic coordinates or some other structural parameters, from the measured cylindrically averaged diffraction data. Fibre diffraction analysis suffers from the phase problem and low resolution (diffraction data rarely extend beyond 3 Å resolution), but this is no worse than in protein crystallography where phases derived from, say, isomorphous replacement or molecular replacement, coupled with the considerable stereochemical information usually available on the molecule under study, together contribute enough information to lead to precise structures. What makes structure determination by fibre diffraction more difficult is the loss of information owing to the cylindrical averaging of the diffraction data. However, in spite of these difficulties, fibre diffraction has been used to determine, with high precision, the structures of a wide variety of biological and synthetic polymers, and other macromolecular assemblies. Because of the size of the repeating unit and the resolution of the diffraction data, methods for structure determination in fibre diffraction tend to mimic those of macromolecular (protein) crystallography, rather than small-molecule crystallography (direct methods).

For a noncrystalline fibre one can determine only the molecular structure from the continuous diffraction data, whereas for a polycrystalline fibre one can determine crystal structures from the Bragg diffraction data. However, there is little fundamental difference between methods used for structure determination with noncrystalline and polycrystalline fibres. For partially crystalline fibres, little has so far been attempted with regard to rigorous structure determination.

As is the case with protein crystallography, the precise methods used for structure determination by fibre diffraction depend on the particular problem at hand. A variety of tools are available and one selects from these those that are appropriate given the data available in a particular case. For example, the structure of a polycrystalline polynucleotide might be determined by using Patterson functions to determine possible packing arrangements, molecular model building to define, refine and arbitrate between structures, difference Fourier synthesis to locate ions or solvent molecules, and finally assessment of the reliability of the structure. As a second example, to determine the structure of a helical virus, one might use isomorphous replacement to obtain phase estimates, calculate an electron-density map, fit a preliminary model and refine it using simulated annealing alternating with difference Fourier analysis, and assess the results. The various tools available, together with indications of where and how they are used, are described in the following sections.

Although a variety of techniques are used to solve structures using fibre diffraction, most of the methods do fall broadly into one of three classes that depend primarily on the size of the helical repeat unit. The first class applies to molecules whose repeating units are small, i.e. are represented by a relatively small number of independent parameters or degrees of freedom (after all stereochemical constraints have been incorporated). The structure can then be determined by an exhaustive exploration of the parameter space using molecular model building. The first example above would belong to this class. The second class of methods is appropriate when the size of the helical repeating unit is such that its structure is described by too many variable parameters for the parameter space to be explored a priori. It is then necessary to phase the fibre diffraction data and construct an electron-density map into which the molecular structure can be fitted and then refined. The second example above would belong to this class. The second class of methods therefore mimics conventional protein crystallography quite closely. The third class of problems applies when the structure is large, but there are too few diffraction data to attempt phasing and the usual determination of atomic coordinates. The solution to such problems varies from case to case and usually involves modelling and optimization of some kind.

An important parameter in structure determination by fibre diffraction is the degree of overlap (that results from the cylindrical averaging) in the data. This parameter is equal to the number of significant terms in equation (4.5.2.17) or the number of independent terms in equation (4.5.2.24), and depends on the position in reciprocal space and, for a polycrystalline fibre, the space-group symmetry. The number of degrees of freedom in a particular datum is equal to twice this number (since each structure factor generally has real and imaginary parts), and is denoted in this section by m. Determination of the from the cylindrically averaged data therefore involves separating the amplitudes and assigning phases to each. The electron density can be calculated from the using equations (4.5.2.7) and (4.5.2.11).

#### 4.5.2.6.2. Helix symmetry, cell constants and space-group symmetry

| top | pdf |

The first step in analysis of any fibre diffraction pattern is determination of the molecular helix symmetry . Only the zero-order Bessel term contributes diffracted intensity on the meridian, and referring to equation (4.5.2.6) shows that the zero-order term occurs only on layer lines for which l is a multiple of u. Therefore, inspection of the distribution of diffraction along the meridian allows the value of u to be inferred. This procedure is usually effective, but can be difficult if u is large, because the first meridional maximum may be on a layer line that is difficult to measure. This difficulty was overcome in one case by Franklin & Holmes (1958) by noting that the second Bessel term on the equator is , estimating using data from a heavy-atom derivative (see Section 4.5.2.6.6), subtracting this from , and using the behaviour of the remaining intensity for small R to infer the order of the next Bessel term [using equation (4.5.2.14)] and thence u.

Referring to equations (4.5.2.6) and (4.5.2.14) shows that the distribution of for depends on the value of v. Therefore, inspection of the intensity distribution close to the meridian often allows v to be inferred. Note, however, that the distribution of does not distinguish between the helix symmetries and . Any remaining ambiguities in the helix symmetry need to be resolved by steric considerations, or by detailed testing of models with the different symmetries against the available data.

For a polycrystalline system, the cell constants are determined from the coordinates of the spots on the diffraction pattern as described in Section 4.5.2.6.4. Space-group assignment is based on analysis of systematic absences, as in conventional crystallography. However, in some cases, because of possible overlap of systematic absences with other reflections, there may be some ambiguity in space-group assignment. However, the space group can always be limited to one of a few possibilities, and ambiguities can usually be resolved during structure determination (Section 4.5.2.6.4).

#### 4.5.2.6.3. Patterson functions

| top | pdf |

In fibre diffraction, the conventional Patterson function cannot be calculated since the individual structure-factor intensities are not available. However, MacGillavry & Bruins (1948) showed that the cylindrically averaged Patterson function can be calculated from fibre diffraction data. Consider the function defined by where for and 2 for , which can be calculated from the intensity distribution on a continuous fibre diffraction pattern. Using equations (4.5.2.7), (4.5.2.10), (4.5.2.17) and (4.5.2.58) shows that is the cylindrical average of the Patterson function, , of one molecule, i.e. The symbols on and indicate that these are Patterson functions of a single molecule, as distinct from the usual Patterson function of a crystal, which contains intermolecular interatomic vectors and is periodic with the same periodicity as the crystal. is periodic only along z and is therefore, strictly, a Patterson function along z and an autocorrelation function along x and y (Millane, 1990b). The cylindrically averaged Patterson contains information on interatomic separations along the axial direction and in the lateral plane, but no information on orientations of the vectors in the lateral plane.

For a polycrystalline system; consider the function given by where the sums are over all the overlapped reflections on the diffraction pattern, given by equation (4.5.2.24). It is easily shown that is related to the Patterson function by where, in this case, is the usual Patterson function (expressed in cylindrical polar coordinates), i.e. it contains all intermolecular (both intra- and inter-unit cell) interatomic vectors and has the same translational symmetry as the unit cell. The cylindrically averaged Patterson function for polycrystalline fibres therefore contains the same information as it does for noncrystalline fibres (i.e. no angular information in the lateral plane), except that it also contains information on intermolecular separations.

Low resolution and cylindrical averaging, in addition to the usual difficulties with interpretation of Patterson functions, has resulted in the cylindrically averaged Patterson function not playing a major role in structure determination by fibre diffraction. However, information provided by the cylindrically averaged Patterson function has, in a number of instances, been a useful component in fibre diffraction analyses. A good review of the application of Patterson functions in fibre diffraction is given by Stubbs (1987). Removing data from the low-resolution part (or all) of the equator when calculating the cylindrically averaged Patterson function removes the strong vectors related to axially invariant (or cylindrically symmetric) parts of the map, and can aid interpretation (Namba et al., 1980; Stubbs, 1987). It is also important when calculating cylindrically averaged Patterson functions to use data only at a resolution that is appropriate to the size and spacings of features one is looking for (Stubbs, 1987).

Cylindrically averaged Patterson functions were used in early applications of fibre diffraction analysis (Franklin & Gosling, 1953; Franklin & Klug, 1955). The intermolecular peaks that usually dominate in a cylindrically averaged Patterson function can help to define the locations of multiple molecules in the unit cell. Depending on the space-group symmetry, it is sometimes possible to calculate the complete three-dimensional Patterson function (or certain projections of it). This comes about because of the equivalence of the amplitudes of overlapping reflections in some high-symmetry space groups. The intensity of each reflection can then be determined and a full three-dimensional Patterson map calculated (Alexeev et al., 1992). The only difficulty is that non-systematic overlaps are often present, although these are usually relatively few in number and the intensity can be apportioned equally amongst them, the resulting errors usually being small relative to the level of detail present in the Patterson map. For lower space-group symmetries, it may not be possible to calculate a three-dimensional Patterson map, but it may be possible to calculate certain projections of the map. For example, if the overlapped hk0 reflections have the same intensities, a projection of the Patterson map down the c axis can be calculated. Since such a projection is along the polymer axes, it gives the relative positions of the molecules in the ab plane. If the combined helix and space-group symmetry is high, an estimate of the electron density can be obtained by averaging appropriate copies of the three-dimensional Patterson function (Alexeev et al., 1992).

#### 4.5.2.6.4. Molecular model building

| top | pdf |

The majority of the structures determined by X-ray fibre diffraction analysis have been determined by molecular model building (Campbell Smith & Arnott, 1978; Arnott, 1980; Millane, 1988). Most applications of molecular model building have been to polycrystalline systems, although there have been a number of applications to noncrystalline systems (Park et al., 1987; Millane et al., 1988). The approach is to use spacings and symmetry information derived directly from the diffraction pattern, coupled with the primary structure and stereochemical information on the molecule under study, to construct models of all kinds of possible molecular or crystal structure. These models are each refined (optimized) against the diffraction data, as well as stereochemical restraints, to produce the best model of each kind. The optimized models can be compared using various figures of merit, and in favourable cases one model will be sufficiently superior to the remainder for it to represent unequivocally the correct structure. The principle of this approach is that by making use of stereochemical constraints, the molecular and crystal structure have few enough degrees of freedom that the parameter space has a sufficiently small number of local minima for these to be identified and individually examined to find the global minimum. The X-ray phases are therefore not determined explicitly.

There are three steps involved in structure determination by molecular model building: (1) construction of all possible molecular and crystal structure models, (2) refinement of each model against the X-ray data and stereochemical restraints, and (3) adjudication among the refined models. The overall procedure for determining polymer structures using molecular model building is summarized by the flow chart in Fig. 4.5.2.2, and is described below.

 Figure 4.5.2.2 | top | pdf |Flow chart of the molecular-model-building approach to structure determination (Arnott, 1980).

The helix symmetry of the molecule, or one of a few helix symmetries, can be determined as described in Section 4.5.2.6.2. Different kinds of molecular model may correspond to one of a few different helix symmetries, usually corresponding to different values of v. For example, helix symmetries and , which correspond to the left- and right-handed helices, cannot be distinguished on the basis of the overall intensity distribution alone. Other examples of different kinds of molecular model may include single, double or multiple helices, parallel or antiparallel double helices, different juxtapositions of chains within multiple helices and different conformational domains within the molecule. For polycrystalline systems, in addition to different kinds of molecular structures, there are often different kinds of possible packing arrangements within the unit cell. There may be a number of possible packings which correspond to different arrangements within the crystallographic asymmetric unit, and there may be more than one space group that needs to be considered.

Despite the apparent large number of potential starting models implied by the above discussion, in practice the number of feasible models is usually quite small, and many of these are often eliminated at an early stage. Definition and refinement of helical polymers [steps (1) and (2) above] are carried out using computer programs, the most popular and versatile being the linked-atom least-squares (LALS) system (Campbell Smith & Arnott, 1978; Millane et al., 1985), originally developed by Arnott and co-workers in the early 1960s (Arnott & Wonacott, 1966). This system has been used to determine the structures of a wide variety of polynucleotides, polysaccharides, polyesters and polypeptides (Arnott, 1980; Arnott & Mitra, 1984; Chandrasekaran & Arnott, 1989; Millane, 1990c). Other refinement systems exist (Zugenmaier & Sarko, 1980; Iannelli, 1994), but the principles are essentially the same and the following discussion is in terms of the LALS system. The atomic coordinates are defined, using a linked-atom description, in terms of bond lengths, bond angles and conformation (torsion) angles (Campbell Smith & Arnott, 1978). Stereochemical constraints are imposed, and the number of parameters reduced, by fixing the bond lengths, often (but not always) the bond angles, and possibly some of the conformation angles. The molecular conformation is then defined by the remaining parameters. For polycrystalline systems, there are usually additional variable parameters that define the packing of the molecule(s) in the unit cell. A further source of stereochemical data is the requirement that a model exhibit no over-short nonbonded interatomic distances. These are incorporated by a quadratic nonbonded potential that is matched to a Buckingham potential (Campbell Smith & Arnott, 1978). A variety of other restraints can also be incorporated.

In the LALS system, the quantity Ω given by is minimized by varying a set of chosen parameters consisting of conformation angles, possibly bond angles, and packing parameters. The term X involves the differences between the model and experimental X-ray amplitudes – Bragg and/or continuous. The term C involves restraints to ensure that over-short nonbonded interatomic distances are driven beyond acceptable minimum values, that conformations are within desired domains, that hydrogen-bond and coordination geometries are close to the expected configurations, and a variety of other relationships are satisfied (Campbell Smith & Arnott, 1978). The and are weights that are inversely proportional to the estimated variances of the data. The term L involves constraints which are relationships that are to be satisfied exactly and the are Lagrange multipliers. Constraints are used, for example, to ensure connectivity from one helix pitch to the next and to ensure that chemical ring systems are closed. The cost function Ω is minimized using full-matrix nonlinear least squares and singular value decomposition (Campbell Smith & Arnott, 1978).

Structure determination usually involves first using equation (4.5.2.62) with the terms C and L only, to establish the stereochemical viability of each kind of possible molecular model and packing arrangement. It is worth emphasizing that it is usually advantageous if the specimen is polycrystalline, even though the continuous diffraction contains, in principle, more information than the Bragg reflections (since the latter are sampled). This is because the molecule in a noncrystalline specimen must be refined in steric isolation, whereas for a polycrystalline specimen it is refined while packed in the crystal lattice. The extra information provided by the intermolecular contacts can often help to eliminate incorrect models. This can be particularly significant if the molecule has flexible sidechains. The initial models that survive the steric optimization are then optimized also against the X-ray data, by further refinement with X included in equation (4.5.2.62). The ratios and can be used in Hamilton's test (Hamilton, 1965) to evaluate the differences between models P and Q. On the basis of these statistical tests, one can decide if one model is superior to the others at an acceptable confidence level. In the final stages of refinement, bond angles may be varied in a stiffly elastic' fashion from their mean values if there are sufficient data to justify the increase in the number of degrees of freedom.

If sufficient X-ray data are available, it is sometimes possible to locate additional ordered molecules such as counterions or solvent molecules by difference Fourier synthesis as described in Section 4.5.2.6.5. Their positions can then be co-refined with the polymer structure while hydrogen bonds and coordination geometries are optimized. The resulting structure can then be used to compute improved phases to search for additional molecules. Since the signal-to-noise ratio in fibre difference syntheses is usually low, difference maps must be interpreted with caution. The assignment of counterions or solvent molecules to peaks in the difference synthesis must be supported by plausible interactions with the rest of the structure and, following refinement of the structure, by elimination of the peak in the difference map and by a significant improvement in the agreement between the calculated and measured X-ray amplitudes.

#### 4.5.2.6.5. Difference Fourier synthesis

| top | pdf |

Difference Fourier syntheses are widely used in both protein and small-molecule crystallography to detect structural errors or to complete partial structures (Drenth, 1994). The difficulty in applying difference Fourier techniques in fibre diffraction is that the individual observed amplitudes are not available. However, difference syntheses have found wide use in fibre diffraction analysis, one of the earliest applications being to polycrystalline fibres of polynucleotides (e.g. Arnott et al., 1967). Calculation of a three-dimensional difference map (for the unit cell) from Bragg fibre diffraction data requires that the observed intensity be apportioned among the contributing intensities . There are two ways of doing this. The intensities may be divided equally among the contributing reflections [i.e. ], or they may be divided in the same proportions as those in the model, i.e. The advantage of the former is that it is unbiased, and the advantage of the latter is that it may be more accurate but is biased towards the model. Equal division of the intensities is often (but not always) used to minimize model bias. Once the observed amplitudes have been apportioned, an map can be calculated as in conventional crystallography, although noise levels will be higher owing to errors in apportioning the amplitudes. As a result of overlapping of the reflections, a synthesis based on coefficients gives a more accurate estimate of the true density than does one based on , as is described below. Difference syntheses for polycrystalline specimens calculated in this way have been used, for example, to locate cations and water molecules in polynucleotide and polysaccharide structures (e.g. Cael et al., 1978), to help position molecules in the unit cell (e.g. Chandrasekaran et al., 1994) and to help position side chains, and have also been applied in neutron fibre diffraction studies of polynucleotides (Forsyth et al., 1989).

Sim (1960) has shown that the mean-squared error in difference syntheses can be minimized by weighting the coefficients based on the agreement between the calculated and observed structure amplitudes. Such an analysis has recently been conducted for fibre diffraction, and shows that the optimum difference synthesis is obtained by using coefficients (Millane & Baskaran, 1997; Baskaran & Millane, 1999a) where m is the number of degrees of freedom as defined in Section 4.5.2.6.1. If the reflections contributing to are either all centric or all acentric, then the weights are given by where denotes the modified Bessel function of the first kind of order m, and X is given by where for centric reflections and 2 for acentric reflections. The form of the weighting function is more complicated if both centric and acentric reflections contribute, but it can be approximated as given by where and are the number of acentric and centric reflections, respectively, contributing. Use of the weighted maps reduces bias towards the model (Baskaran & Millane, 1999b).

For continuous diffraction data from noncrystalline specimens, the situation is essentially identical except that one works in cylindrical coordinates. Referring to equations (4.5.2.7) and (4.5.2.10), the desired difference synthesis, , is the Fourier–Bessel transform of where and denote the observed and calculated, respectively, Fourier–Bessel structure factors . Since is not known, the synthesis is based on the Fourier–Bessel transform of , where is the phase of . As in the polycrystalline case, the individual need to be estimated from the data given by equation (4.5.2.17), and can be based on either equal division of the data, or division in the same proportion as the amplitudes from the model.

Namba & Stubbs (1987a) have shown that the peak heights in a difference synthesis are times their true value, as opposed to half their true value in a conventional difference synthesis. The best estimate of the true map is therefore provided by a synthesis based on the coefficients , rather than on . Test examples showed that the noise in the synthesis can be reduced by using a value for m that is fixed over the diffraction pattern and approximately equal to the average value of m over the pattern (Namba & Stubbs, 1987a). Difference Fourier maps for noncrystalline systems have been used in studies of helical viruses to locate heavy atoms, to correct errors in atomic models and to locate water molecules (Mandelkow et al., 1981; Lobert et al., 1987; Namba, Pattanayek & Stubbs, 1989; Wang & Stubbs, 1994).

#### 4.5.2.6.6. Multidimensional isomorphous replacement

| top | pdf |

At low enough resolution, only one Fourier–Bessel structure factor contributes on each layer line of a fibre diffraction pattern, so that only the phase needs to be determined and the situation is no different to that in protein crystallography. If heavy-atom-derivative specimens can be prepared, the usual method of multiple isomorphous replacement (MIR) (Drenth, 1994) can be applied, which in principle requires only two heavy-atom derivatives. At higher resolution, however, more than one Fourier–Bessel structure factor contributes on each layer line. A generalized form of isomorphous replacement which involves using diffraction data from several heavy-atom derivatives to determine the real and imaginary components of each contributing is referred to as multidimensional isomorphous replacement (MDIR) (Namba & Stubbs, 1985). MDIR was first described and used to determine the structure of TMV at 6.7 Å resolution (Stubbs & Diamond, 1975; Holmes et al., 1975), and has since been used to extend the resolution to 2.9 Å (Namba, Pattanayek & Stubbs, 1989). A consequence of cylindrical averaging is that large numbers of heavy-atom derivatives are required: at least two for each Bessel term to be separated. The theory of MDIR is outlined here.

The first step in MDIR is location of the heavy atoms in the derivative structures. The radial coordinate of a heavy atom can be determined by analysis of the intensity distribution in the low-resolution region of the equator where only the Bessel term contributes. Since is real, and can be measured continuously in R, inspection of the positions of the minima and maxima in the low-resolution region of the equator generally allows the sign of to be assigned to , i.e. can be determined from . If the sign is determined for both the native and a heavy-atom derivative, referring to equation (4.5.2.13) shows that where is the value derived from the derivative data, o denotes the occupancy and the subscript h denotes values for the heavy atom. The parameters and on the right-hand side of equation (4.5.2.68) can be searched in a trial-and-error fashion to obtain the best agreement with the left-hand side (calculated from the data) to determine the radial coordinate of the heavy atom (Mandelkow & Holmes, 1974). Lobert et al. (1987) applied the same method to cucumber green mottle mosaic virus (CGMMV), except that the sign of was taken from that of TMV.

Two approaches have been used to determine the angular and axial coordinates of the heavy atom. Mandelkow & Holmes (1974) and Holmes et al. (1975) used a search procedure in which the quantity is varied and used to calculate the intensity of the Fourier–Bessel structure factor for the heavy atom alone. This is compared to on each layer line, where only one Bessel order contributes, and Φ chosen to minimize the mean-square difference. The values of Φ found for each layer line can then be combined to determine and . In the case of CGMMV, Lobert et al. (1987) used the phases and Bessel-order separations from TMV to calculate Fourier–Bessel difference maps between the native and derivative data to determine the heavy-atom coordinates .

Consider a set of isomorphous heavy-atom derivatives indexed by j. Since the analysis is applied at any point on the fibre diffraction pattern, the symbol will be used for where no confusion arises. Denote by the value of for the jth derivative, so that where denotes the Fourier–Bessel structure factor of a structure containing the heavy atom only. Denote by and the real and imaginary parts, respectively, of (for the native structure), and by and the real and imaginary parts of , i.e. for the jth heavy-atom structure alone. Equation (4.5.2.17) can then be written as for the native and for the jth derivative. If intensity data are available from J heavy-atom derivatives, and can be calculated from the heavy-atom positions, and equations (4.5.2.70) and (4.5.2.71) represent a system of second-order equations for the m unknowns and . If , then the system of equations is overdetermined and can be solved for the and . The solution of this nonlinear system can be eased by deriving a system of linear equations by substituting from (4.5.2.70) into (4.5.2.71), giving Equation (4.5.2.72) is a system of linear equations for the unknowns and , the solution being subject to the constraint equation (4.5.2.70). However, since the original problem is second-order, there may be up to m local minima. Stubbs & Diamond (1975) describe a numerical procedure for locating all the local minima and selecting the best of these based on continuity' of the . This method was used to determine the structure of TMV at 6.7 Å resolution (Holmes et al., 1975) and 4 Å resolution (Stubbs et al., 1977). In current applications of MDIR a more direct solution technique is used in which the phase-determining equations (4.5.2.70) and (4.5.2.71) are solved by first solving the linear equations (4.5.2.72) by linear least squares to obtain an approximate solution, which is then refined by solving the quadratic equations (4.5.2.70) and (4.5.2.71) directly using nonlinear least squares (Namba & Stubbs, 1985).

The number of heavy-atom derivatives required can be quite demanding experimentally, although phasing with fewer heavy-atom derivatives is possible, particularly if additional information is available, such as from a related structure. The different Bessel terms may be assumed to contribute the same amplitude each, or, if the structure of a related molecule is known, the ratios of the amplitudes can be taken as being the same as those for the related molecule. Using the amplitude estimates derived using either of these two approaches, applied to both native and derivative data, the phases of the Bessel terms can be estimated using conventional MIR and data from at least two heavy-atom derivatives, allowing an initial electron-density map to be calculated. If only one heavy-atom derivative is available then two phase solutions are obtained, but the method of conventional single isomorphous replacement (SIR) (Drenth, 1994) can be used to obtain an estimate of the electron density. The electron density obtained by MIR, and particularly by SIR, in this way tends to be noisy and low contrast as a result of inaccurate division of the intensities, as well as the usual sources of errors in MIR. The electron density can, however, be improved using solvent levelling. If no heavy-atom derivatives are available, both the relative amplitudes and the phases can be based on those of a related structure. Model bias can, however, be more serious than in conventional crystallography since both the phases and the relative amplitudes are based on the model.

The feasibility of structure determination with a limited number of heavy-atom derivatives was first demonstrated by Namba & Stubbs (1987b) using data from TMV at 4 Å resolution. The structure of CGMMV has been determined at 5 Å resolution using data from two heavy-atom derivatives and the techniques described above (Lobert et al., 1987; Lobert & Stubbs, 1990). Structure determination at this resolution using MDIR would theoretically require six heavy-atom derivatives. Initial separation of the Bessel-term amplitudes was based on the equal-amplitude assumption and also on the relative amplitudes for (homologous) TMV.

In general, the equal-amplitude assumption appears to produce reliable electron-density maps where only two or three Bessel terms contribute. The corresponding resolution depends on the helix symmetry and the molecular diameter, but can be relatively high for molecules with high helix symmetry. At higher resolution where more Bessel terms contribute, use of related or partial structures can be used to calculate initial Bessel-term amplitudes and can lead to successful phasing.

If the molecule has only approximate helix symmetry, then layer-line splitting (Section 4.5.2.3.3) can provide additional information which reduces the number of heavy-atom derivatives required. The degree of splitting is usually significantly less than the breadth of the layer lines so that the different Bessel terms within a (split) layer line overlap. The effect of splitting can be observed, however, since the centre of a layer line, at a particular value of R, is shifted towards the position of the stronger Bessel term contributing at that radius. The shift depends on the relative magnitudes of the contributing Bessel terms, and can be measured and used in phase determination as detailed by Stubbs & Makowski (1982). If P of the heavy-atom derivatives (in addition to the native) give accurate splitting information, then an additional P linear equations [analogous to equation (4.5.2.72)] and one quadratic equation [analogous to equation (4.5.2.70)] are available for solution of the phase problem, and the number of heavy-atom derivatives required is reduced by a factor of up to two. The value of layer-line splitting was first demonstrated by recalculating an electron-density map of TMV at 6.7 Å resolution using only two derivatives, rather than using six derivatives without the use of splitting data (Stubbs & Makowski, 1982). Layer-line splitting was subsequently used in a structure determination of TMV at 3.6 Å resolution (Namba & Stubbs, 1985).

Macromolecular fibre structures that have been built into an electron-density map have been refined using both restrained least-squares (RLS) and molecular-dynamics (MD) refinements. Restrained least squares has been used to refine the structure of TMV at 2.9 Å resolution (Namba, Pattanayek & Stubbs, 1989); however, Wang & Stubbs (1993) have shown that a larger radius of convergence is obtained using MD refinement (as in protein crystallography).

Molecular-dynamics refinement in fibre diffraction has been implemented by adding a fibre diffraction option (Wang & Stubbs, 1993) to the X-PLOR program (Brünger, 1992). This involves including the cylindrically averaged fibre diffraction intensities in the energy term and taking account of the inter-helical subunit contacts and covalent connections in the same way as described above for RLS refinement. The effective potential-energy function E used is where is the empirical energy function (which typically includes bond-length, bond-angle and torsion-angle distortions, van der Waals and electrostatic interactions, and other terms such as ring planarity), and are the observed and calculated, respectively, cylindrically averaged diffraction intensities sampled at , the are weights for the observed intensities and k is a scale factor between the calculated and observed data. The quantity S is a weight to make the gradients of the two terms in equation (4.5.2.73) comparable (Wang & Stubbs, 1993), and can be estimated using the method of Brünger (1992). Molecular-dynamics refinement has been successfully used to refine the structure of CGMMV at 3.4 Å resolution (Wang & Stubbs, 1994). In the case of ribgrass mosaic virus (RMV), the close isomorphism with TMV (identical helix symmetry, similar repeat distance, significant sequence homology and similar diffraction pattern) allowed an initial model to be built based on the TMV structure, and a solution obtained at 2.9 Å by alternating molecular-dynamics refinement with difference-map and omit-map calculations (Wang et al., 1997).

#### 4.5.2.6.7. Other techniques

| top | pdf |

Aside from the techniques for structure determination described in the previous sections, a variety of other techniques have been applied to specific problems where the methods described above are not suitable. This situation usually arises where the diffraction data available are far too few, by themselves, to determine the individual atomic coordinates of a structure, even with the usual stereochemical constraints. Often only relatively low-resolution data are available, but they can be supplemented by either a low-resolution or high-resolution model of either a whole molecule or relatively large subunits. Structure determination often amounts to positioning the molecules or subunits within a larger assembly. The results can be quite precise, depending on the information available. The problem is almost always one of refinement or optimization, since it invariably involves optimizing some kind of model directly against the fibre diffraction data. The problem is usually twofold: (1) parameterizing the model with few enough parameters to obtain a usable data-to-parameter ratio, but retaining enough degrees of freedom to represent the important structural features; and (2) devising an optimization procedure that will locate the global minimum of the resulting complicated cost function. There have been numerous such applications in fibre diffraction, and rather than attempt to be exhaustive or detailed, I will briefly mention a few of the more prominent applications and techniques.

The structure of the bacteriophage Pf1 was determined at 7 Å resolution using a model in which the α-helical segments of the structure were represented by rods of electron density of appropriate dimensions and spacings (Makowski et al., 1980). The positions and orientations of the rods were refined in an iterative procedure that alternated between real space and reciprocal space and also incorporated solvent levelling. Neutron fibre diffraction data have been collected from specifically deuterated phages and, starting with a model of the kind described above, iterative application of difference maps (between the deuterated and native data) was used to locate 15 (of the 46) residues, allowing construction of a model of the coat protein (Stark et al., 1988; Nambudripad et al., 1991).

Pf1 undergoes a temperature-induced structural transition that involves a small change in the helix symmetry. The low-temperature form has 7113 helix symmetry with a c repeat of 216.5 Å, and the high-temperature form (that discussed in the previous paragraph) has 275 helix symmetry and a c repeat of 78.3 Å. These two symmetries are very similar since and , i.e. the rotations and translations from one subunit to the next are very similar in both structures.

The structure of the low-temperature form of Pf1 has been determined at 3.3 Å resolution by starting with an α-helical polyalanine model (Marvin et al., 1987) and alternating rounds of molecular-dynamics refinement and model rebuilding based on maps and omit maps (Gonzalez et al., 1995). The structure of the high-temperature form of Pf1 was determined using data to 3 Å resolution, starting with a model based on the low-temperature form, making small adjustments to satisfy the slightly different helix symmetry, and refining the model using molecular dynamics (Welsh et al., 2000).

The bacteriophage Pf3 is related to Pf1 but does not undergo a structural transition, and fibre diffraction patterns are similar to those from the high-temperature form of Pf1. An α-helical polyalanine model of Pf3 based on the Pf1 structure was used to separate and phase the Bessel terms, which were then used to calculate maps. These maps were used to align and position the polypeptide chain, and the resulting model was refined by molecular dynamics (Welsh et al., 1998).

The R-type bacterial flagellar filament structure (that has a very high molecular weight subunit) has been determined at 9 Å resolution by X-ray fibre diffraction (Yamashita et al., 1998). Accurate intensities were taken from high-quality X-ray diffraction patterns and combined with phases obtained from electron cryomicroscopy, and solvent levelling was used to refine the phases.

Some studies of muscle provide a good example of the use of low-resolution fibre diffraction data, coupled with high-resolution crystal structures of some of the component molecules, to determine the structure of a complex. Holmes et al. (1990) constructed a model of F-actin based on the crystal structure of the monomer, G-actin, and 8 Å fibre diffraction data, by either treating the monomer as a rigid body or dividing it into four separate rigid domains, and using a search procedure followed by least-squares refinement. The results gave the orientation of the actin monomer in the actin helix. This structure has since been refined using a genetic algorithm (Lorenz et al., 1993) and normal-mode analysis (Tirion et al., 1995). The genetic algorithm involved a Monte Carlo method of selecting subdomains to be refined and nonlinear least squares to obtain the best fit for the selected domains. In the normal-mode analysis, the model was parameterized in terms of its low-frequency vibrational modes to allow low-energy conformational changes and reduce the number of parameters which were optimized against the fibre diffraction data using nonlinear least squares.

Squire et al. (1993) have refined a low-resolution model of the muscle thin-filament structure that consists of four spheres representing each of the F-actin monomer subdomains and five spheres (fixed relative to each other) representing tropomyosin. Steric restraints were placed on the actin subdomain and thin-filament structures. The positions of the actin subdomains and the orientation of the tropomyosin were refined using a search procedure against fibre diffraction data from both resting' and activated' muscle at 25 Å resolution. More recent work has used a low-resolution model of the myosin head (based on the single-crystal atomic structure), a search procedure and simulated-annealing refinements to study myosin head configuration (Hudson et al., 1997) and myosin rod packing (Squire et al., 1998).

#### 4.5.2.6.8. Reliability

| top | pdf |

As with structure determination in any area of crystallography, assessment of the reliability or precision of a structure is critically important. The most commonly used measure of reliability in fibre diffraction is the R factor, calculated as where and denote the observed (measured) and calculated, respectively, amplitude of either the samples (along R) of the cylindrically averaged intensity (for a noncrystalline specimen) or the cylindrically averaged structure factors (for a polycrystalline specimen). One way of assessing the significance of the R factor obtained in a particular structure determination is by comparing it with the largest likely R factor' (Wilson, 1950), i.e. the expected value of the R factor for a random distribution of atoms. Wilson (1950) showed that the largest likely R factor is 0.83 for a centric crystal and 0.59 for an acentric crystal. Although it does not provide a quantitative measure of structural reliability, the largest likely R factor does provide a useful yardstick for evaluating the significance of R factors obtained in structure determinations.

The largest likely R factor for fibre diffraction can be calculated from the amplitude statistics, which depend on the number of degrees of freedom, m, in the measured intensity (Stubbs, 1989; Millane, 1990a). Making use of these statistics shows that the largest likely R factor, , for m components is given by (Stubbs, 1989; Millane, 1989a) where is the binomial coefficient and the incomplete beta function. The beta function in equation (4.5.2.75) can be replaced by a finite series that is easy to evaluate (Millane, 1989a). The expression in equation (4.5.2.75) for can be written in various approximate forms (Millane, 1990d, 1992a), the simplest being (Millane, 1990d), which shows that the largest likely R factor falls off approximately as with increasing m. This is because it is easier to match the sum of a number of structure amplitudes than to match each of them individually. The important conclusion is that the largest likely R factor is smaller in fibre diffraction than in conventional crystallography (where or 2), and it is smaller when there are more overlapping reflections. This means that for equivalent precision, the R factor must be smaller for a structure determined by fibre diffraction than for one determined by conventional crystallography. How much smaller depends on the number of overlapping reflections on the diffraction pattern.

In a structure determination, the data have different values of m at different positions on the diffraction pattern. Using the definition of the R factor, equation (4.5.2.74), shows that the largest likely R factor for a structure determination is given by (Millane, 1989b) where the sums are over the values of m on the diffraction pattern, is the number of data that have m components, is given by equation (4.5.2.75) and is given by where is the gamma function. The quantities on the right-hand side of equation (4.5.2.77) are easily determined for a particular data set. The largest likely R factor decreases (since m increases) with increasing resolution of the data, increasing diameter of the molecule and decreasing order u of the helix symmetry. For example, for TMV at 5 Å resolution the largest likely R factor is 0.37, and at 3 Å resolution it is 0.31, whereas for a tenfold nucleic acid structure at 3 Å resolution it is 0.40 (Millane, 1989b, 1992b). This underlines the importance of comparing R factors obtained in a fibre diffraction analysis with the largest likely R factor; an R factor of 0.25 that may indicate a good protein structure may, or may not, indicate a well determined fibre structure.

Using approximations for , and m allows the following approximation for the largest likely R factor for a noncrystalline fibre to be derived (Millane, 1992b): where is the resolution of the data. The approximation (4.5.2.79) is generally not good enough for calculating accurate largest likely R factors, but it does show the general behaviour with helix symmetry, molecular diameter and diffraction-data resolution. Other approximations to largest likely R factors have been derived that are quite accurate and also include the effect of a minimum resolution for the data (Millane, 1992b).

Largest likely R factors in fibre diffraction studies are typically between about 0.3 and 0.5, depending on the particular structure (Millane, 1989b, 1992b; Millane & Stubbs, 1992). Although the largest likely R factor does not give a quantitative assessment of the significance of an R factor obtained in a particular structure determination, it can be used as a guide to the significance. R factors obtained for well determined protein structures are typically between about one-third and one-half of the corresponding largest likely R factor, depending on the resolution. It is therefore reasonable to expect the R factor for a well determined fibre structure to be between one-third and one-half of the largest likely R factor calculated for the structure. R factors should, therefore, generally be less than 0.15 to 0.25, depending on the particular structure and the resolution as illustrated by the examples presented in Millane & Stubbs (1992).

The free R factor (Brünger, 1997) has become popular in single-crystal crystallography as a tool for validation of refinements. The free R factor is more difficult to implement (but is probably even more important) in fibre diffraction studies because of the smaller data sets, but has been used to advantage in recent studies (Hudson et al., 1997; Welsh et al., 1998, 2000).

### 4.5.3. Electron crystallography of polymers

| top | pdf |
D. L. Dorsetb*

#### 4.5.3.1. Is polymer electron crystallography possible?

| top | pdf |

As a crystallographic tool, the electron microscope has also made an important impact in polymer science. Historically, single-crystal electron diffraction information has been very useful for the interpretation of cylindrically averaged fibre X-ray patterns (Atkins, 1989), particularly when there is an extensive overlap of diffracted intensities. An electron diffraction pattern aids indexing of the fibre pattern and facilitates measurement of unit-cell constants, and the observation of undistorted plane-group symmetry similarly places important constraints on the identification of the space group (Geil, 1963; Wunderlich, 1973).

The concept of using electron diffraction intensities by themselves for the quantitative determination of crystal structures of polymers or other organics often has been met with scepticism (Lipson & Cochran, 1966). Difficulties experienced in the quantitative interpretation of images and diffraction intensities from hard' materials composed of heavy atoms (Hirsch et al., 1965; Cowley, 1981), for example, has adversely affected the outlook for polymer structure analysis, irrespective of whether these reservations are important or not for soft' materials comprising light atoms. Despite the still commonly held opinion that no new crystal structures will be determined that are solely based on data collected in the electron microscope, it can be shown that this extremely pessimistic outlook is unwarranted. With proper control of crystallization (i.e. crystal thickness) and data collection, the electron microscope can be used quite productively for the direct determination of macromolecular structures at atomic resolution, not only to verify some of the previous findings of fibre X-ray diffraction analysis, but, more importantly, to determine new structures, even of crystalline forms that cannot be studied conveniently by X-rays as drawn fibres (Dorset, 1995b). The potential advantages of electron crystallography are therefore clear. The great advantage in scattering cross section of matter for electrons over X-rays permits much smaller samples to be examined by electron diffraction as single-crystalline preparations (Vainshtein, 1964). (Typical dimensions are given below.)

Electron crystallography can be defined as the quantitative use of electron micrographs and electron diffraction intensities for the determination of crystal structures. In the electron microscope, an electron beam illuminates a semitransparent object and the microscope objective lens produces an enlarged representation of the object as an image. If the specimen is thin enough and/or the electron energy is high enough, the weak-phase-object or kinematical' approximation is valid (Cowley, 1981), see Chapter 2.5 . That is to say, there is an approximate one-to-one mapping of density points between the object mass distribution and the image, within the resolution limits of the instrument (as set by the objective lens aberrations and electron wavelength). The spatial relationships between diffraction and image planes of an electron microscope objective lens are reciprocal and related by Fourier transform operations (Cowley, 1988). While it is easy to transform from the image to the diffraction pattern, the reverse Fourier transform of the diffraction pattern to a high-resolution image requires solution of the famous crystallographic phase problem (as discussed for electron diffraction in Section 2.5.7 ).

Certainly, in electron diffraction studies, one must still be cognizant of the limitations imposed by the underlying scattering theory. An approximate quasi-kinematical' data set is often sufficient for the analysis (Dorset, 1995a). However (Dorset, 1995b), there are other important perturbations to diffraction intensities which should be minimized. For example, the effects of radiation damage while recording a high-resolution image are minimized by so-called low-dose' procedures (Tsuji, 1989).

#### 4.5.3.2. Crystallization and data collection

| top | pdf |

The success of electron crystallographic determinations relies on the possibility of collecting data from thin single microcrystals. These can be grown by several methods, including self-seeding, epitaxic orientation, in situ polymerization on a substrate, in a Langmuir–Blodgett layer, in situ polymerization within a thin layer and polymerization in dilute solution. If these preparations do not provide sufficient information, then data can also be collected from microfibres. Thin cast films have also been examined after stretching.

Self-seeding (Blundell et al., 1966) has been one of the most important techniques for growing single chain-folded lamellae. The technique is very simple. A dilute suspension of the polymer is made in a poor solvent. The temperature is raised to cause total solubilization of the macromolecule and then lowered to room temperature to crystallize ill-formed particles (mostly dendrites). The temperature is then elevated again until the suspension just clears, leaving small seeds of the polymer crystals behind. Upon lowering to a suitable temperature above ambience, which is then fixed, isothermal crystallization of well formed lamellae is allowed to occur over time. When the crystallization procedure is complete, the suspension can be cooled again to room temperature and the lamellae harvested. These lamellae are typically less than 10 nm thick, with lateral dimensions between 1.0 and 10.0 µm.

Epitaxic orientation techniques, to give alternative projections of the chain packing, have become increasingly important in recent years. While inorganic substrates have been described (Mauritz et al., 1978), the use of organic layers for this purpose (Wittmann & Lotz, 1990; Lotz & Wittmann, 1993) has been more promising because these substrates are less easily contaminated by adsorbed gases and water vapour, and because the nucleation is anisotropic. Often the crystallization can be carried out from a cooled co-melt, i.e. a dilute solution of the polymer in the organic small molecule. When the liquidus curve of the eutectic phase diagram is crossed, the diluent crystals form first. Since these have a surface lattice spacing closely resembling that of the polymer-chain packing, the polymer chains can be directed to lie along the substrate surface, rather than normal to it, as the solidus line of the phase diagram is crossed. The substrate can then be removed by some suitable technique (sublimation, selective solvation) to permit the investigation of the oriented film. Variations of this procedure include crystallization of polymer-chain segments from the vapour phase onto a substrate (Wittmann & Lotz, 1985) and in situ crystallization of monomers that have first been epitaxically oriented on a suitable substrate (Rickert et al., 1979).

A number of other possibilities for crystal growth also exist. Langmuir troughs have been used to orient monomers that may have hydrophilic moieties. If the monomers contain triple bonds that can be cross-linked, then a polymer film can be formed, e.g., if the condensed monomer film is exposed to ultraviolet light (Day & Lando, 1980). It may be possible to carry out the polymerization within a confined space (Rybnikar et al., 1994) or in dilute solution (Liu & Geil, 1993) to form crystals suitable for electron diffraction data collection. In the latter case, whisker formation with the chain axis parallel to the lath plane has been observed. Films can be cast on a water surface by evaporation of an organic solvent from a polymer solution. The film can then be stretched to give a suitably oriented specimen for data collection (Vainshtein & Tatarinova, 1967). In addition, it may just be possible to obtain suitable data from drawn microfibres to supplement the single-crystal diffraction information from other preparations.

Data collection from these thin microcrystals often employs the selected-area diffraction technique in the electron microscope that is described in detail elsewhere (Dorset, 1995b). Using an approximately eucentric goniometric tilting device in the electron microscope, the sampling of three-dimensional reciprocal space is tomographic, i.e. the tilts of a nearly planar Ewald sphere surface (owing to the very small electron wavelength) are always referred to a set of reciprocal axes that intersect (0, 0, 0). For any given crystal habit, there is always a missing set of data owing to the physical limitation to the tilt imposed by the finite thickness of the specimen holder within the pole-piece gap of the electron microscope objective lens (Vainshtein, 1964). For this reason, it is desirable to crystallize two orthogonal orientations of the chain packing (using the above-mentioned approaches), if possible, so that all of the reciprocal lattice can be sampled. If electron micrographs are to be used as an additional source of crystallographic phases then low-dose' techniques for recording such images should be employed to reduce the deleterious effects of radiation damage caused by the inelastic interactions of the electron beam with the crystalline sample (Tsuji, 1989).

When the diffraction patterns are recorded on photographic film and these are then measured with a densitometer, relative reflection intensities can often be expressed simply as the integrated peak area without need for a Lorentz correction (Dorset, 1995b). Only if the diffraction maxima are extensively arced (e.g. in patterns from epitaxic films) is such a correction required. That is to say, where is the observed structure-factor magnitude. Assuming the kinematical approximation holds, the calculated value is where are the electron scattering factors (Doyle & Turner, 1968), e.g. as tabulated in Table 4.3.1.1 in IT C. By analogy with X-ray crystallography (see Chapter 2.2 ), normalized values can be found from with the usual scaling condition that . [Note, however, that these intensities only describe the chain monomer packing in the stem' region of the lamellar microcrystal. Details owing to the surface chain folds are lost (even if they are strictly periodic), because of reasons similar to those described by Cowley (1961) for the electron scattering from elastically bent silicate crystals.]

#### 4.5.3.3. Crystal structure analysis

| top | pdf |

Two approaches to crystal structure analysis are generally employed in polymer electron crystallography. As already mentioned, the procedure adapted from fibre X-ray crystallography relies on the construction of a model (Brisse, 1989; Perez & Chanzy, 1989). Conformational searches (Campbell Smith & Arnott, 1978) simultaneously minimize the fit of observed diffraction data to calculated values (the R factor based on structure factors computed via known atomic scattering factors) and a nonbonded atom–atom potential function (Tadokoro, 1979). Reviews of structures solved by this approach have been published (Dorset, 1989, 1995b).

Recently, direct phasing methods of the kind used in X-ray crystallography (Chapter 2.2 and, applied to electron diffraction, Section 2.5.7 ) have also been found to be particularly effective for electron crystallographic structure analyses (Dorset, 1995b). While the Fourier transform of an electron micrograph would be the most easily imagined direct method, yielding crystallographic phases after image analysis (see Section 2.5.5 ), this use of micrographs has been of less importance to polymer crystallography than it has been in the study of globular proteins, even though there is at least one notable example where it has been helpful (Isoda et al., 1983a) for the determination of a structure from X-ray fibre data. On the other hand, high-resolution images of polymer crystals are of considerable use for the characterization of packing defects (Isoda et al., 1983b).

In polymer electron crystallography, the sole reliance on the diffraction intensities for structure analysis has proven, in recent years, to be quite effective. Several direct-methods approaches have been pursued, including the use of probabilistic techniques, either in the symbolic addition procedure, or in more automated procedures involving the tangent formula (see Chapter 2.2 ). The Sayre (1952) equation has been found to be particularly effective, where the correct structure is identified via some figure of merit after algebraic phase values are used to generate multiple solutions (Stanley, 1986). More recently, maximum-entropy and likelihood methods (Gilmore et al., 1993) have also been effective for solving such structures. After the initial atomic model is found, it can be improved by refinement, generally using Fourier techniques. Least-squares refinement can be carried out under most favourable circumstances (Dorset, 1995a), but requires the availability of a sufficient number of diffraction data. Even so, the refinement of thermal parameters must be uncoupled from that of the atomic positions. Also, positional shifts must be dampened (if X-ray crystallographic software is used) to prevent finding a false minimum, especially if the kinematical R factor is used as a figure of merit.

#### 4.5.3.4. Examples of crystal structure analyses

| top | pdf |

At least four kinds of electron diffraction intensity data sets have been used as tests for direct phase determination via the approaches mentioned above.

• Case 1: Zonal data sets – view down the chain axis. Such data are from the least optimal projection of the polymer packing, because of extensive atomic overlap along the chain axis. Initially, symbolic addition was used to find phase values for hk0 data sets from six representative polymers, including three complicated saccharide structures (Dorset, 1992). Most of the determinations were strikingly successful. Later, an unknown data set from the polysaccharide chitosan was obtained from Grenoble (Mazeau et al., 1994) and direct phase determination was able to find a correct model (Dorset, 1995b). More recently, other polymers have been tested [including one case where an electron micrograph provided many of the starting phase terms (Dorset, 1995b)] also comparing favourably with the solution found by energy minimization of a linkage model. For all examples considered so far, the projected symmetry was centrosymmetric.

Later, it was found that a partial phase set provided by symbolic addition could be expanded to the complete zone by the Sayre equation (Dorset et al., 1995). In all of these tests (Dorset, 1995b), there were only one or two examples where there were serious deviations from the phase terms found by other methods. Even in these instances, the potential maps could still be used as envelopes for the actual projection of the chain structure (Dorset, 1992).

• Case 2: Zonal data set – view onto the chain axes. Electron diffraction data from a projection onto the polymer chain axes would be more useful if individual atomic positions were to be resolved. An interesting example where such a view can be obtained is an data set from the polypeptide poly-γ-methyl-L-glutamate. Electron diffraction data were collected from stretched films by Vainshtein & Tatarinova (1967). In projection, the cell constants are , c = 6.83 Å with plane-group symmetry pg. As shown in Table 4.5.3.1, there were 19 unique intensity data used for the analysis. After initial phase assignment by symbolic addition, a correct solution could be visualized which, after Fourier refinement (Dorset, 1995b), differed from the original one by a mean phase difference of only 6°.

 Table 4.5.3.1| top | pdf | Structure analysis of poly-γ-methyl-L-glutamate in the β form
(previous) (this study)
002 0.48 0.72 0.57 −63 −51
004 0.43 0.38 0.31 49 73
006 3.01 1.47 0.88 1 −3
100 1.48 2.12 2.37 0 0
200 1.03 1.04 1.06 0 0
300 0.30 0.65 0.89 0 0
400 0.35 0.15 0.46 0 0
500 0.23 0.07 0.04 180 180
101 0.75 1.02 0.67 −169 −178
201 0.32 0.31 0.42 90 108
102 0.42 0.48 0.56 17 14
202 0.40 0.33 0.64 41 43
103 0.95 0.85 0.77 88 90
203 0.51 0.36 0.42 91 88
303 0.12 0.06 0.31 92 87
403 0.13 0.04 0.54 90 90
104 0.66 0.45 0.27 −22 −13
105 0.55 0.28 0.29 −26 −7
106 1.75 0.69 0.58 5 −5
 Fractional coordinates
This studyVainshtein & Tatarinova (1967)
xzxz
0.048 0.000 0.042 0.000
C′ 0.067 0.331 0.092 0.330
O 0.281 0.335 0.300 0.330
N 0.000 0.161 −0.025 0.175

The progress of this structure analysis can be reviewed to give a representative example. Since the h00 reflections have centrosymmetric phases, the value was chosen as a single origin-defining point. From high-probability three-phase invariants (assessed after calculation of normalized structure factors ), one could assign . Symbolic values were then given to three other phases, viz. ; ; . From this entire basis, other values could be found from highly probable three-phase invariants, as follows: (These invariant relationships include phase interactions among symmetry-related Miller indices characteristic of the plane group.) Additionally could be specified to complete origin definition for the zone. It was then possible to permute values of a and b to arrive at test phase values for this subset, i.e. to generate a multiple set of solutions. When , , the map in Fig. 4.5.3.1 was observed. After finding trial atomic positions for Fourier refinement (assuming that two carbon-atom positions were eclipsed in this projection), the final phase set was found as shown in Table 4.5.3.1. Although the crystallographic residual to the observed data, calculated with the model coordinates, was rather large (0.32), there was a close agreement with the earlier determination.

 Figure 4.5.3.1 | top | pdf |Initial potential map for poly-γ-methyl-L-glutamate (plane group pg) found with phases generated by the Sayre equation.

More recently a similar data set, collected from oriented crystal whiskers' of poly(p-oxybenzoate) in plane group pg was analysed. Again the Sayre equation, via a multisolution approach, was used to produce a map that contained 13 of 18 possible atomic positions for the two subunits in the asymmetric unit. The complete structure was observed after the remaining five atom sites were identified in two subsequent cycles of Fourier refinement (Liu et al., 1997) and the average atomic positions were found to be within 0.2 Å of the model derived from an energy minimization.

• Case 3: Three-dimensional data – single crystal orientation. The first data set from a chain-folded lamella for a direct structure analysis was a centrosymmetric set (space group ) from poly(1,4-trans-cyclohexanediyl dimethylene succinate), composed of 87 reflections (Brisse et al., 1984). The phase determination was quite successful and atomic positions could be found as somewhat blurred density maxima in the three-dimensional maps (Dorset, 1991a). A model was constructed from these positions and the bonding parameters optimized to give the best fit to the data (R = 0.29).

Noncentrosymmetric three-dimensional intensity sets (orthorhombic space group ) from the polysaccharides mannan (form I) (Chanzy et al., 1987) and chitosan (Mazeau et al., 1994) were also collected from tilted crystals. In both cases, direct phase determination by symbolic addition via an algebraic unknown was successful, even though the data were not sampled along the chain repeat. For the former polymer, a monomer model could be fitted to the blurred density profile, much as one would fit a polypeptide sequence to a continuous electron-density map (Dorset & McCourt, 1993; Dorset, 1995c). If the Sayre equation were used to predict phases and amplitudes within the missing cone' of unsampled data, then the fit of the monomer could be much more highly constrained.

• Case 4: Three-dimensional data – two crystal orientations. The optimal case for collection of diffraction data is when two orthogonal projections of the same polymer polymorph can be obtained, respectively, by self-seeding and epitaxic orientation. While tilting these specimens, all of reciprocal space can be sampled for intensity data collection.

Polyethylene crystals were used to collect 50 unique maxima (Hu & Dorset, 1989) and, via symbolic addition, the centrosymmetric phases of 40 reflections (space group Pnma) could be readily determined (Dorset, 1991b). The structural features were readily observed in the three-dimensional potential maps (Fig. 4.5.3.2a), and atomic coordinates (with estimated values for hydrogen-atom positions) could be refined by least squares (Dorset, 1995b) to give a final R value of 0.19.

 Figure 4.5.3.2 | top | pdf |Crystal structures of linear polymers determined from three-dimensional data. (a) Polyethylene; (b) poly(-caprolactone); (c) poly(1-butene), form (III).

Poly(-caprolactone) was epitaxically crystallized on benzoic acid and, with hk0 data from solution-crystallized samples, a unique set of 47 intensities was collected for the noncentrosymmetric orthorhombic unit cell (space group ) (Hu & Dorset, 1990). Direct phase determination was achieved via symbolic addition, using one algebraic unknown to assign values to 30 reflections (Dorset, 1991c). Atomic positions along the chain repeat, including the carbonyl position, were clearly discerned in the [100] projection (Fig. 4.5.3.2b) and the three-dimensional model was constructed to fit to the map calculated from all phased data, yielding a final crystallographic residual R = 0.21. This independent determination was able to distinguish between two rival fibre X-ray structures, in favour of the one that predicted a non-planar chain conformation. Because of the methylene repeat, this is actually a difficult structure to solve by automated techniques. For example, the tangent formula and SnB (Miller et al., 1993) could only find chain zigzag positions and not the position of the carbonyl oxygen atom (Dorset, 1995b).

The most complicated complete polymer crystal structure solved so far by direct methods using electron diffraction data (Dorset et al., 1994) was based on 125 unique data (space group ) from isotactic poly(1-butene), form (III), using orthogonal molecular orientations crystallized in Strasbourg (Kopp et al., 1994). Initially, the standard NQEST figure of merit (FOM) (De Titta et al., 1975) was not suitable for identifying the correct solution among the multiple sets generated with the tangent formula. A solution could only be found when a separate phase determination was carried out with the hk0 data to compare with the multiple solutions generated. More recently, the minimal principle (Hauptman, 1993), used as a FOM with the tangent formula or with a multiple random structure generator, SnB, correctly identified the structure on the first try (Dorset, 1995b). The maps clearly show individual carbon-atom positions in a 41 helix that parallels 21 helices of the space group (Fig. 4.5.3.2c). After Fourier refinement, the crystallographic residual was R = 0.26. The previous powder X-ray diffraction determination was based on only 21 diffraction maxima, some of which had as many as 15 individual contributors.

### Acknowledgements

The work of both RPM and DLD was supported by the US National Science Foundation (grant DBI-9722862 for RPM).

### References

Alexeev, D. G., Lipanov, A. A. & Skuratovskii, I. Y. (1992). Patterson methods in fibre diffraction. Int. J. Biol. Macromol. 14, 139–144.
Arnott, S. (1980). Twenty years hard labor as a fibre diffractionist. In Fibre diffraction methods, ACS Symposium Series, Vol. 141, edited by A. D. French & K. H. Gardner, pp. 1–30. Washington DC: American Chemical Society.
Arnott, S., Chandrasekaran, R., Millane, R. P. & Park, H. (1986). DNA–RNA hybrid secondary structures. J. Mol. Biol. 188, 631–640.
Arnott, S. & Mitra, A. K. (1984). X-ray diffraction analyses of glycosamionoglycans. In Molecular biophysics of the extracellular matrix, edited by S. Arnott, D. A. Rees & E. R. Morris, pp. 41–67. Clifton: Humana Press.
Arnott, S., Wilkins, M. H. F., Fuller, W. & Langridge, R. (1967). Molecular and crystal structures of double-helical RNA III. An 11-fold molecular model and comparison of the agreement between the observed and calculated three-dimensional diffraction data for 10- and 11-fold models. J. Mol. Biol. 27, 535–548.
Arnott, S. & Wonacott, A. J. (1966). The refinement of the crystal and molecular structures of polymers using X-ray data and stereochemical constraints. Polymer, 7, 157–166.
Atkins, E. D. T. (1989). Crystal structure by X-ray diffraction. In Comprehensive polymer science, Vol. 1. Polymer characterization, edited by G. A. Allen, pp. 613–650. Oxford: Pergamon Press.
Barham, P. J. (1993). Crystallization and morphology of semicrystalline polymers. In Materials science and technology. A comprehensive treatment, Vol. 12. Structure and properties of polymers, edited by E. L. Thomas, pp. 153–212. Weinheim: VCH.
Baskaran, S. & Millane, R. P. (1999a). Bayesian image reconstruction from partial image and aliased spectral intensity data. IEEE Trans. Image Process. 8, 1420–1434.
Baskaran, S. & Millane, R. P. (1999b). Model bias in Bayesian image reconstruction from X-ray fiber diffraction data. J. Opt. Soc. Am. A, 16, 236–245.
Biswas, A. & Blackwell, J. (1988a). Three-dimensional structure of main-chain liquid-crystalline copolymers. 1. Cylindrically averaged intensity transforms of single chains. Macromolecules, 21, 3146–3151.
Biswas, A. & Blackwell, J. (1988b). Three-dimensional structure of main-chain liquid-crystalline copolymers. 2. Interchain interference effects. Macromolecules, 21, 3152–3158.
Biswas, A. & Blackwell, J. (1988c). Three-dimensional structure of main-chain liquid-crystalline copolymers. 3. Chain packing in the solid state. Macromolecules, 21, 3158–3164.
Blackwell, J., Gutierrez, G. A. & Chivers, R. A. (1984). Diffraction by aperiodic polymer chains: the structure of liquid crystalline copolyesters. Macromolecules, 17, 1219–1224.
Blundell, D. J., Keller, A. & Kovacs, A. J. (1966). A new self-nucleation phenomenon and its application to the growing of polymer crystals from solution. J. Polym. Sci. Polym. Lett. Ed. 4, 481–486.
Brisse, F. (1989). Electron diffraction of synthetic polymers: the model compound approach to polymer structure. J. Electron Microsc. Tech. 11, 272–279.
Brisse, F., Remillard, B. & Chanzy, H. (1984). Poly(1,4-trans-cyclohexanediyldimethylene succinate). A structural determination using X-ray and electron diffraction. Macromolecules, 17, 1980–1987.
Brünger, A. T. (1992). X-PLOR. Version 3.1. New Haven: Yale University Press.
Brünger, A. T. (1997). Free R value: cross-validation in crystallography. Methods Enzymol. 277, 366–396.
Cael, J. J., Winter, W. T. & Arnott, S. (1978). Calcium chondroitin 4-sulfate: molecular conformation and organization of polysaccharide chains in a proteoglycan. J. Mol. Biol. 125, 21–42.
Campbell Smith, P. J. & Arnott, S. (1978). LALS: a linked-atom least-squares reciprocal-space refinement system incorporating stereochemical constraints to supplement sparse diffraction data. Acta Cryst. A34, 3–11.
Chandrasekaran, R. & Arnott, S. (1989). The structures of DNA and RNA helices in oriented fibres. In Landolt–Bornstein numerical data and functional relationships in science and technology, Vol. VII/1b, edited by W. Saenger, pp. 31–170. Berlin, Heidelberg: Springer-Verlag.
Chandrasekaran, R., Radha, A. & Lee, E. J. (1994). Structural roles of calcium ions and side chains in welan: an X-ray study. Carbohydr. Res. 252, 183–207.
Chanzy, H., Perez, S., Miller, D. P., Paradossi, G. & Winter, W. T. (1987). An electron diffraction study of mannan I. Crystal and molecular structure. Macromolecules, 20, 2407–2413.
Chivers, R. A. & Blackwell, J. (1985). Three-dimensional structure of copolymers of p-hydroxybenzoic acid and 2-hydroxy-6-naphthoic acid: a model for diffraction from a nematic structure. Polymer, 26, 997–1002.
Clark, E. S. & Muus, I. T. (1962). The relationship between Bragg reflections and disorder in crystalline polymers. Z. Kristallogr. 117, 108–118.
Cochran, W., Crick, F. H. C. & Vand, V. (1952). The structure of synthetic polypeptides. I. The transform of atoms on a helix. Acta Cryst. 5, 581–586.
Cowley, J. M. (1961). Diffraction intensities from bent crystals. Acta Cryst. 14, 920–927.
Cowley, J. M. (1981). Diffraction physics. Second revised edition. Amsterdam: North-Holland.
Cowley, J. M. (1988). Imaging [and] imaging theory. In High-resolution transmission electron microscopy and associated techniques, edited by P. Busek, J. M. Cowley & L. Eyring, pp. 3–57. New York: Oxford University Press.
Crowther, R. A., DeRosier, D. J. & Klug, A. (1970). The reconstruction of a three-dimensional structure from projections and its application to electron microscopy. Proc. R. Soc. London Ser. A, 317, 319–344.
Daubeny, R. de P., Bunn, C. W. & Brown, C. J. (1954). The crystal structure of polyethylene terephthalate. Proc. R. Soc. London Ser. A, 226, 531–542.
Day, D. & Lando, J. B. (1980). Structure determination of a poly(diacetylene) monolayer. Macromolecules, 13, 1483–1487.
De Titta, G. T., Edmonds, J. W., Langs, D. A. & Hauptman, H. (1975). Use of negative quartet cosine invariants as a phasing figure of merit: NQEST. Acta Cryst. A31, 472–479.
Dorset, D. L. (1989). Electron diffraction from crystalline polymers. In Comprehensive polymer science, Vol. 1. Polymer characterization, edited by G. A. Allen, pp. 651–668. Oxford: Pergamon Press.
Dorset, D. L. (1991a). Is electron crystallography possible? The direct determination of organic crystal structures. Ultramicroscopy, 38, 23–40.
Dorset, D. L. (1991b). Electron diffraction structure analysis of polyethylene. A direct phase determination. Macromolecules, 24, 1175–1178.
Dorset, D. L. (1991c). Electron crystallography of linear polymers: direct structure analysis of poly(-caprolactone). Proc. Natl Acad. Sci. USA, 88, 5499–5502.
Dorset, D. L. (1992). Electron crystallography of linear polymers: direct phase determination for zonal data sets. Macromolecules, 25, 4425–4430.
Dorset, D. L. (1995a). Comments on the validity of the direct phasing and Fourier methods in electron crystallography. Acta Cryst. A51, 869–879.
Dorset, D. L. (1995b). Structural electron crystallography. New York: Plenum.
Dorset, D. L. (1995c). Filling the cone – overcoming the goniometric tilt limit in electron crystallography by direct methods. Am. Cryst. Assoc. Abstr. Series 2, 23, p. 89.
Dorset, D. L., Kopp, S., Fryer, J. R. & Tivol, W. T. (1995). The Sayre equation in electron crystallography. Ultramicroscopy, 57, 59–89.
Dorset, D. L. & McCourt, M. P. (1993). Electron crystallographic analysis of a polysaccharide structure – direct phase determination and model refinement for mannan I. J. Struct. Biol. 111, 118–124.
Dorset, D. L., McCourt, M. P., Kopp, S., Wittmann, J.-C. & Lotz, B. (1994). Direct determination of polymer crystal structures by electron crystallography – isotactic poly(1-butene), form (III). Acta Cryst. B50, 201–208.
Doyle, P. A. & Turner, P. S. (1968). Relativistic Hartree–Fock X-ray and electron scattering factors. Acta Cryst. A24, 390–397.
Drenth, J. (1994). Principles of protein X-ray crystallography. New York: Springer-Verlag.
Finkenstadt, V. L. & Millane, R. P. (1998). Fiber diffraction patterns for general unit cells: the cylindrically projected reciprocal lattice. Acta Cryst. A54, 240–248.
Forsyth, V. T., Mahendrasingam, A., Pigram, W. J., Greenall, R. J., Bellamy, K., Fuller, W. & Mason, S. A. (1989). Neutron fibre diffraction study of DNA hydration. Int. J. Biol. Macromol. 11, 236–240.
Franklin, R. E. (1955). Structure of tobacco mosaic virus. Nature (London), 175, 379–381.
Franklin, R. E. & Gosling, R. G. (1953). The structure of sodium thymonucleate fibres. II. The cylindrically symmetrical Patterson function. Acta Cryst. 6, 678–685.
Franklin, R. E. & Holmes, K. C. (1958). Tobacco mosaic virus: application of the method of isomorphous replacement to the determination of the helical parameters and radial density distribution. Acta Cryst. 11, 213–220.
Franklin, R. E. & Klug, A. (1955). The splitting of layer lines in X-ray fibre diagrams of helical structures: application to tobacco mosaic virus. Acta Cryst. 8, 777–780.
Fraser, R. D. B. & MacRae, T. P. (1973). Conformations in fibrous proteins. New York: Academic Press.
Fraser, R. D. B., MacRae, T. P., Miller, A. & Rowlands, R. J. (1976). Digital processing of fibre diffraction patterns. J. Appl. Cryst. 9, 81–94.
Fraser, R. D. B., Suzuki, E. & MacRae, T. P. (1984). Computer analysis of X-ray diffraction patterns. In Structure of crystalline polymers, edited by I. H. Hall, pp. 1–37. New York: Elsevier.
French, A. D. & Gardner, K. H. (1980). Editors. Fibre diffraction methods. ACS Symposium Series, Vol. 141. Washington DC: American Chemical Society.
Geil, P. H. (1963). Polymer single crystals. New York: John Wiley & Sons.
Gilmore, C. J., Shankland, K. & Bricogne, G. (1993). Application of the maximum entropy method to powder diffraction and electron crystallography. Proc. R. Soc. London Ser. A, 442, 97–111.
Gonzalez, A., Nave, C. & Marvin, D. A. (1995). Pf1 filamentous bacteriophage: refinement of a molecular model by simulated annealing using 3.3 Å resolution X-ray fiber diffraction data. Acta Cryst. D51, 792–804.
Graaf, H. de (1989). On the calculation of small-angle diffraction patterns from distorted lattices. Acta Cryst. A45, 861–870.
Grubb, D. T. (1993). Elastic properties of crystalline polymers. In Materials science and technology. A comprehensive treatment, Vol. 12. Structure and properties of polymers, edited by E. L. Thomas, pp. 301–356. Weinheim: VCH.
Hall, I. H. (1984). Editor. Structure of crystalline polymers. New York: Elsevier.
Hall, I. H., Neisser, J. Z. & Elder, M. (1987). A computer-based method of measuring the integrated intensities of the reflections on the X-ray diffraction photograph of an oriented crystalline polymer. J. Appl. Cryst. 20, 246–255.
Hamilton, W. C. (1965). Significance tests on the crystallographic R factor. Acta Cryst. 18, 502–510.
Hauptman, H. A. (1993). A minimal principle in X-ray crystallography: starting in a small way. Proc. R. Soc. London Ser. A, 442, 3–12.
Hendricks, S. & Teller, E. (1942). X-ray interference in partially ordered layer lattices. J. Chem. Phys. 10, 147–167.
Hirsch, P. B., Howie, A., Nicholson, P. B., Pashley, D. W. & Whelan, M. J. (1965). Electron microscopy of thin crystals. London: Butterworths.
Hofmann, D., Schneider, A. I. & Blackwell, J. (1994). Molecular modelling of the structure of a wholly aromatic thermotropic copolyester. Polymer, 35, 5603–5610.
Holmes, K. C. & Barrington Leigh, J. (1974). The effect of disorientation on the intensity distribution of non-crystalline fibres. I. Theory. Acta Cryst. A30, 635–638.
Holmes, K. C., Popp, D., Gebhard, W. & Kabsch, W. (1990). Atomic model of the actin filament. Nature (London), 347, 44–49.
Holmes, K. C., Stubbs, G. J., Mandelkow, E. & Gallwitz, U. (1975). Structure of tobacco mosaic virus at 6.7 Å resolution. Nature (London), 254, 192–196.
Hosemann, R. & Bagchi, S. N. (1962). Direct analysis of diffraction by matter. Amsterdam: North-Holland.
Hu, H. & Dorset, D. L. (1989). Three-dimensional electron diffraction structure analysis of polyethylene. Acta Cryst. B45, 283–290.
Hu, H. & Dorset, D. L. (1990). Crystal structure of poly(-caprolactone). Macromolecules, 23, 4604–4607.
Hudson, L., Harford, J. J., Denny, R. C. & Squire, J. M. (1997). Myosin head configuration in relaxed fish muscle: resting state myosin heads must swing axially by up to 150 Å or turn upside down to reach rigor. J. Mol. Biol. 273, 440–455.
Iannelli, P. (1994). FWR: a computer program for refining the molecular structure in the crystalline phase of polymers based on the analysis of the whole X-ray fibre diffraction patterns. J. Appl. Cryst. 27, 1055–1060.
Isoda, S., Tsuji, M., Ohara, M., Kawaguchi, A. & Katayama, K. (1983a). Structural analysis of β-form poly(p-xylene) starting from a high-resolution image. Polymer, 24, 1155–1161.
Isoda, S., Tsuji, M., Ohara, M., Kawaguchi, A. & Katayama, K. (1983b). Direct observation of dislocations in polymer single crystals. Makromol. Chem. Rapid Commun. 4, 141–144.
Ivanova, M. I. & Makowski, L. (1998). Iterative low-pass filtering for estimation of the background in fiber diffraction patterns. Acta Cryst. A54, 626–631.
Klug, A., Crick, F. H. C. & Wyckoff, H. W. (1958). Diffraction from helical structures. Acta Cryst. 11, 199–213.
Kopp, S., Wittmann, J. C. & Lotz, B. (1994). Epitaxial crystallization and crystalline polymorphism of poly(1-butene): form (III) and (II). Polymer, 35, 908–915.
Lipson, H. & Cochran, W. (1966). The determination of crystal structures, p. 381. Ithaca: Cornell University Press.
Liu, J. & Geil, P. H. (1993). Morphological observations of nascent poly(p-oxabenzoate). Polymer, 34, 1366–1374.
Liu, J., Yuan, B.-L., Geil, P. H. & Dorset, D. L. (1997). Chain conformation and molecular packing in poly(p-oxybenzoate) single crystals at ambient temperature. Polymer, 38, 6031–6047.
Lobert, S., Heil, P. D., Namba, K. & Stubbs, G. (1987). Preliminary X-ray fibre diffraction studies of cucumber green mottle mosaic virus, watermelon strain. J. Mol. Biol. 196, 935–938.
Lobert, S. & Stubbs, G. (1990). Fibre diffraction analysis of cucumber green mottle mosaic virus using limited numbers of heavy-atom derivatives. Acta Cryst. A46, 993–997.
Lorenz, M. & Holmes, K. C. (1993). Computer processing and analysis of X-ray fibre diffraction data. J. Appl. Cryst. 26, 82–91.
Lorenz, M., Popp, D. & Holmes, K. C. (1993). Refinement of the F-actin model against X-ray fibre diffraction data by the use of a directed mutation algorithm. J. Mol. Biol. 234, 826–836.
Lotz, B. & Wittmann, J. C. (1993). Structure of polymer single crystals. In Materials science and technology. A comprehensive treatment, Vol. 12. Structure and properties of polymers, edited by E. L. Thomas, pp. 79–154. Weinheim: VCH.
MacGillavry, C. H. & Bruins, E. M. (1948). On the Patterson transforms of fibre diagrams. Acta Cryst. 1, 156–158.
Makowski, L. (1978). Processing of X-ray diffraction data from partially oriented specimens. J. Appl. Cryst. 11, 273–283.
Makowski, L. (1982). The use of continuous diffraction data as a phase constraint. II. Application to fibre diffraction data. J. Appl. Cryst. 15, 546–557.
Makowski, L., Caspar, D. L. D. & Marvin, D. A. (1980). Filamentous bacteriophage Pf1 structure determined at 7 Å resolution by refinement of models for the α-helical subunit. J. Mol. Biol. 140, 149–181.
Mandelkern, L. (1989). Crystallization and melting. In Comprehensive polymer science, Vol. 2. Polymer properties, edited by C. Booth & C. Price, pp. 363–414. Oxford: Pergamon Press.
Mandelkow, E. & Holmes, K. C. (1974). The positions of the N-terminus and residue 68 in tobacco mosaic virus. J. Mol. Biol. 87, 265–273.
Mandelkow, E., Stubbs, G. & Warren, S. (1981). Structures of the helical aggregates of tobacco mosaic virus protein. J. Mol. Biol. 152, 375–386.
Marvin, D. A., Bryan, R. K. & Nave, C. (1987). Pf1 inovirus. Electron density distribution calculated by a maximum entropy algorithm from native fiber diffraction data to 3 Å resolution and single isomorphous replacement data to 5 Å resolution. J. Mol. Biol. 193, 315–343.
Mauritz, K. A., Baer, E. & Hopfinger, A. J. (1978). The epitaxial crystallization of macromolecules. J. Polym. Sci. Macromol. Rev. 13, 1–61.
Mazeau, K., Winter, W. T. & Chanzy, H. (1994). Molecular and crystal structure of a high-temperature polymorph of chitosan from electron diffraction data. Macromolecules, 27, 7606–7612.
Millane, R. P. (1988). X-ray fibre diffraction. In Crystallographic computing 4. Techniques and new technologies, edited by N. W. Isaacs & M. R. Taylor, pp. 169–186. Oxford University Press.
Millane, R. P. (1989a). R factors in X-ray fibre diffraction. I. Largest likely R factors for N overlapping terms. Acta Cryst. A45, 258–260.
Millane, R. P. (1989b). R factors in X-ray fibre diffraction. II. Largest likely R factors. Acta Cryst. A45, 573–576.
Millane, R. P. (1989c). Relating reflection boundaries in X-ray fibre diffraction patterns to specimen morphology and their use for intensity measurement. J. Macromol. Sci. Phys. B28, 149–166.
Millane, R. P. (1990a). Intensity distributions in fibre diffraction. Acta Cryst. A46, 552–559.
Millane, R. P. (1990b). Phase retrieval in crystallography and optics. J. Opt. Soc. Am. A, 7, 394–411.
Millane, R. P. (1990c). Polysaccharide structures: X-ray fibre diffraction studies. In Computer modeling of carbohydrate molecules. ACS Symposium Series No. 430, edited by A. D. French & J. W. Brady, pp. 315–331. Washington DC: American Chemical Society.
Millane, R. P. (1990d). R factors in X-ray fibre diffraction. III. Asymptotic approximations to largest likely R factors. Acta Cryst. A46, 68–72.
Millane, R. P. (1991). An alternative approach to helical diffraction. Acta Cryst. A47, 449–451.
Millane, R. P. (1992a). Largest likely R factors for normal distributions. Acta Cryst. A48, 649–650.
Millane, R. P. (1992b). R factors in X-ray fibre diffraction. IV. Analytic expressions for largest likely R factors. Acta Cryst. A48, 209–215.
Millane, R. P. (1993). Image reconstruction from cylindrically averaged diffraction intensities. In Digital image recovery and synthesis II, Proc. SPIE, Vol. 2029, edited by P. S. Idell, pp. 137–143. Bellingham, WA: SPIE.
Millane, R. P. & Arnott, S. (1985). Background removal in X-ray fibre diffraction patterns. J. Appl. Cryst. 18, 419–423.
Millane, R. P. & Arnott, S. (1986). Digital processing of X-ray diffraction patterns from oriented fibres. J. Macromol. Sci. Phys. B24, 193–227.
Millane, R. P. & Baskaran, S. (1997). Optimal difference Fourier synthesis in fibre diffraction. Fiber Diffr. Rev. 6, 14–18.
Millane, R. P., Byler, M. A. & Arnott, S. (1985). Implementing constrained least squares refinement of helical polymers on a vector pipeline machine. In Supercomputer applications, edited by R. W. Numrich, pp. 137–143, New York: Plenum.
Millane, R. P., Chandrasekaran, R., Arnott, S. & Dea, I. C. M. (1988). The molecular structure of kappa-carrageenan and comparison with iota-carrageenan. Carbohydr. Res. 182, 1–17.
Millane, R. P. & Stroud, W. J. (1991). Effects of disorder on fibre diffraction patterns. Int. J. Biol. Macromol. 13, 202–208.
Millane, R. P. & Stubbs, G. (1992). The significance of R factors in fibre diffraction. Polym. Prepr. 33(1), 321–322.
Miller, R., DeTitta, G. T., Jones, R., Langs, D. A., Weeks, C. M. & Hauptman, H. A. (1993). On the application of the minimal principle to solve unknown structures. Science, 259, 1430–1433.
Namba, K., Pattanayek, R. & Stubbs, G. J. (1989). Visualization of protein–nucleic acid interactions in a virus. Refined structure of intact tobacco mosaic virus at 2.9 Å resolution by X-ray fibre diffraction. J. Mol. Biol. 208, 307–325.
Namba, K. & Stubbs, G. (1985). Solving the phase problem in fibre diffraction. Application to tobacco mosaic virus at 3.6 Å resolution. Acta Cryst. A41, 252–262.
Namba, K. & Stubbs, G. (1987a). Difference Fourier syntheses in fibre diffraction. Acta Cryst. A43, 533–539.
Namba, K. & Stubbs, G. (1987b). Isomorphous replacement in fibre diffraction using limited numbers of heavy-atom derivatives. Acta Cryst. A43, 64–69.
Namba, K., Wakabayashi, K. & Mitsui, T. (1980). X-ray structure analysis of the thin filament of crab striated muscle in the rigor state. J. Mol. Biol. 138, 1–26.
Namba, K., Yamashita, I. & Vonderviszt, F. (1989). Structure of the core and central channel of bacterial flagella. Nature (London), 342, 648–654.
Nambudripad, R., Stark, W. & Makowski, L. (1991). Neutron diffraction studies of the structure of filamentous bacteriophage Pf1. J. Mol. Biol. 220, 359–379.
Park, H., Arnott, S., Chandrasekaran, R., Millane, R. P. & Campagnari, F. (1987). Structure of the α-form of poly(dA)·poly(dT) and related polynucleotide duplexes. J. Mol. Biol. 197, 513–523.
Perez, S. & Chanzy, H. (1989). Electron crystallography of linear polysaccharides. J. Electron Microsc. Tech. 11, 280–285.
Rickert, S. E., Lando, J. B., Hopfinger, A. J. & Baer, E. (1979). Epitaxial polymerization of (SN)x 1. Structure and morphology of single crystals on alkali halide substrates. Macromolecules, 12, 1053–1057.
Rybnikar, F., Liu, J. & Geil, P. H. (1994). Thin film melt-polymerized single crystals of poly(p-oxybenzoate). Makromol. Chem. Phys. 195, 81–104.
Sayre, D. (1952). The squaring method: a new method for phase determination. Acta Cryst. 5, 60–65.
Schneider, A. I., Blackwell, J., Pielartzik, H. & Karbach, A. (1991). Structure analysis of copoly(ester carbonate). Macromolecules, 24, 5676–5682.
Shotton, M. W., Denny, R. C. & Forsyth, V. T. (1998). CCP13 software development. Fiber Diffr. Rev. 7, 40–44.
Sim, G. A. (1960). A note on the heavy atom method. Acta Cryst. 13, 511–512.
Squire, J. M., Al-Khayat, H. A. & Yagi, N. (1993). Muscle thin-filament structure and regulation. Actin sub-domain movements and the tropomyosin shift modelled from low-angle X-ray diffraction. J. Chem. Soc. Faraday Trans. 89, 2717–2726.
Squire, J., Cantino, M., Chew, M., Denny, R., Harford, J., Hudson, L. & Luther, P. (1998). Myosin rod-packing schemes in vertebrate muscle thick filaments. J. Struct. Biol. 122, 128–138.
Squire, J. M. & Vibert, P. J. (1987). Editors. Fibrous protein structure. London: Academic Press.
Stanley, E. (1986). Peakiness' test functions. Acta Cryst. A42, 297–299.
Stark, W., Glucksman, M. J. & Makowski, L. (1988). Conformation of the coat protein of filamentous bacteriophage Pf1 determined by neutron diffraction from magnetically oriented gels of specifically deuterated virions. J. Mol. Biol. 199, 171–182.
Storks, K. H. (1938). An electron diffraction examination of some linear high polymers. J. Am. Chem. Soc. 60, 1753–1761.
Stroud, W. J. & Millane, R. P. (1995a). Analysis of disorder in biopolymer fibres. Acta Cryst. A51, 790–800.
Stroud, W. J. & Millane, R. P. (1995b). Diffraction by disordered polycrystalline fibres. Acta Cryst. A51, 771–790.
Stroud, W. J. & Millane, R. P. (1996a). Cylindrically averaged diffraction by distorted lattices. Proc. R. Soc. London, 452, 151–173.
Stroud, W. J. & Millane, R. P. (1996b). Diffraction by polycrystalline fibres with correlated disorder. Acta Cryst. A52, 812–829.
Stubbs, G. (1987). The Patterson function in fibre diffraction. In Patterson and Pattersons, edited by J. P. Glusker, B. K. Patterson & M. Rossi, pp. 548–557. Oxford University Press.
Stubbs, G. (1989). The probability distributions of X-ray intensities in fibre diffraction: largest likely values for fibre diffraction R factors. Acta Cryst. A45, 254–258.
Stubbs, G. (1999). Developments in fiber diffraction. Curr. Opin. Struct. Biol. 9, 615–619.
Stubbs, G., Warren, S. & Holmes, K. (1977). Structure of RNA and RNA binding site in tobacco mosaic virus from a 4 Å map calculated from X-ray fibre diagrams. Nature (London), 267, 216–221.
Stubbs, G. J. (1974). The effect of disorientation on the intensity distribution of non-crystalline fibres. II. Applications. Acta Cryst. A30, 639–645.
Stubbs, G. J. & Diamond, R. (1975). The phase problem for cylindrically averaged diffraction patterns. Solution by isomorphous replacement and application to tobacco mosaic virus. Acta Cryst. A31, 709–718.
Stubbs, G. J. & Makowski, L. (1982). Coordinated use of isomorphous replacement and layer-line splitting in the phasing of fibre diffraction data. Acta Cryst. A38, 417–425.
Tadokoro, H. (1979). Structure of crystalline polymers. New York: Wiley.
Tanaka, S. & Naya, S. (1969). Theory of X-ray scattering by disordered polymer crystals. J. Phys. Soc. Jpn, 26, 982–993.
Tatarinova, L. I. & Vainshtein, B. K. (1962). Issledovanie poli-γ-metil-L-glutamata v α-forme metodom difraktsi elektronov. (Electron diffraction study of poly-γ-methyl-L-glutamate in the α-form.) Visokomol. Soed. 4, 261–269. (In Russian.)
Tirion, M., ben Avraham, D., Lorenz, M. & Holmes, K. C. (1995). Normal modes as refinement parameters for the F-actin model. Biophys. J. 68, 5–12.
Tsuji, M. (1989). Electron microscopy. In Comprehensive polymer science, Vol. 1. Polymer characterization, edited by G. A. Allen, pp. 785–866. Oxford: Pergamon Press.
Vainshtein, B. K. (1964). Structure analysis by electron diffraction. Oxford: Pergamon Press.
Vainshtein, B. K. (1966). Diffraction of X-rays by chain molecules. Amsterdam: Elsevier.
Vainshtein, B. K. & Tatarinova, L. I. (1967). The β-form of poly-γ-methyl-L-glutamate. Sov. Phys. Crystallogr. 11, 494–498.
Vibert, P. J. (1987). Fibre diffraction methods. In Fibrous protein structure, edited by J. M. Squire & P. J. Vibert, pp. 23–45. New York: Academic Press.
Wang, H., Culver, J. N. & Stubbs, G. (1997). Structure of ribgrass mosaic virus at 2.9 Å resolution: evolution and taxonomy of tobamoviruses. J. Mol. Biol. 269, 769–779.
Wang, H. & Stubbs, G. (1993). Molecular dynamics refinement against fibre diffraction data. Acta Cryst. A49, 504–513.
Wang, H. & Stubbs, G. J. (1994). Structure determination of cucumber green mottle mosaic virus by X-ray fibre diffraction. Significance for the evolution of tobamoviruses. J. Mol. Biol. 239, 371–384.
Welberry, T. R., Miller, G. H. & Carroll, C. E. (1980). Paracrystals and growth-disorder models. Acta Cryst. A36, 921–929.
Welsh, L. C., Symmons, M. F. & Marvin, D. A. (2000). The molecular structure and structural transition of the α-helical capsid in filamentous bacteriophage Pf1. Acta Cryst. D56, 137–150.
Welsh, L. C., Symmons, M. F., Sturtevant, J. M., Marvin, D. A. & Perham, R. N. (1998). Structure of the capsid of Pf3 filamentous phage determined from X-ray fiber diffraction data at 3.1 Å resolution. J. Mol. Biol. 283, 155–177.
Wilson, A. J. C. (1950). Largest likely values for the reliability index. Acta Cryst. 3, 397–399.
Wittmann, J. C. & Lotz, B. (1985). Polymer decoration: the orientation of polymer folds as revealed by the crystallization of polymer vapor. J. Polym. Sci. Polym. Phys. Ed. 23, 205–226.
Wittmann, J. C. & Lotz, B. (1990). Epitaxial crystallization of polymers on organic and polymeric substrates. Prog. Polym. Sci. 15, 909–948.
Wunderlich, B. (1973). Macromolecular physics, Vol. 1. Crystal structure, morphology, defects. New York: Academic Press.
Yamashita, I., Hasegawa, K., Suzuki, H., Vonderviszt, F., Mimori-Kiyosue, Y. & Namba, K. (1998). Structure and switching of bacterial flagellar filaments studied by X-ray fiber diffraction. Nature Struct. Biol. 5, 125–132.
Zugenmaier, P. & Sarko, A. (1980). The variable virtual bond. In Fibre diffraction methods, ACS Symposium Series Vol. 141, edited by A. D. French & K. H. Gardner, pp. 225–237. Washington DC: American Chemical Society.