Tables for
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F, ch. 4.3, pp. 100-110   | 1 | 2 |

Chapter 4.3. Application of protein engineering to improve crystal properties

D. R. Daviesa* and A. Burgess Hickmana

aLaboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0560, USA
Correspondence e-mail:

There is accelerating use of protein engineering by protein crystallographers. In this chapter, procedures that have been used successfully in protein crystallography are outlined. Topics covered include: improving solubility; use of fusion proteins; mutations to improve diffraction quality; avoiding protein heterogeneity; engineering crystal contacts to enhance crystallization in a particular crystal form; and engineering heavy-atom sites.

4.3.1. Introduction

| top | pdf |

There is accelerating use of protein engineering by protein crystallographers. Site-directed mutations are being used for a variety of purposes, including solubilizing the protein, developing new crystal forms, providing sites for heavy-atom derivatives, constructing proteolysis-resistant mutants and enhancing the rate of crystallization. Traditionally, if the chosen protein failed to crystallize, a good strategy was to examine a homologous protein from a related species. Now, the crystallographer has a variety of tools for directly modifying the protein according to his or her choice. This is owing to the development of techniques that make it easy to produce a large number of mutant proteins in a timely manner (see Chapter 3.1[link] ).

The relevance to macromolecular crystallography of these mutational procedures rests on the assumption that the mutations do not produce conformation changes in the protein. It is often possible to measure the activity of the protein in vitro and, therefore, test directly whether mutation has affected the protein's properties. Several observations suggest that changes of a small number of surface residues can be tolerated without changing the three-dimensional structure of a protein. The work on haemoglobins demonstrated that mutant proteins generally have similar topologies to the wild type (Fermi & Perutz, 1981[link]). The systematic study of T4 phage lysozyme mutants by the Matthews group (Matthews, 1993[link]; Zhang et al., 1995[link]) has confirmed and significantly extended these studies and has provided a basis for mutant design. This work revealed that, for monomeric proteins, `Substitutions of solvent-exposed amino acids on the surfaces of proteins are seen to have little if any effect on protein stability or structure, leading to the view that it is the rigid parts of proteins that are critical for folding and stability' (Matthews, 1993[link]). It was also concluded that point mutants do not interfere with crystallization unless they affect crystal contacts. The corollary from this is that if the topology of the protein is known from sequence homology with a known structure, the residues that are likely to be located on the surface can be defined and will provide suitable targets for mutation. Fortunately, even in the absence of such information, it is usually possible to make an informed prediction of which residues (generally charged or polar) will, with reasonable probability, be found on the surface.

Here, we shall outline some of the procedures that have been used successfully in protein crystallography. We have tried to provide representative examples of the variety of techniques and creative approaches that have been used, rather than attempting to assemble a comprehensive review of the field. The identification of appropriate references is a somewhat unreliable process, because information regarding these attempts is usually buried in texts; we apologize in advance for any significant omissions.

There have been several reviews on the general topic of the application of protein engineering to crystallography. An overview of the subject is provided by D'Arcy (1994[link]), while Price & Nagai (1995[link]) `focus on strategies either to obtain crystals with good diffraction properties or to improve existing crystals through protein engineering'. In addition to attempts at a rational approach to protein engineering, it is worth emphasizing the role of serendipity in achieving the goal of diffraction-quality crystals. One example is given by the structure of GroEL (Braig et al., 1994[link]), where better crystals were obtained by the accidental introduction of a double mutation, which arose from a polymerase error during the cloning process. The second example is provided by the search for crystals of the complex between the U1A spliceosomal protein and its RNA hairpin substrate (Oubridge et al., 1995[link]). Initially, only poorly diffracting crystals (7–8 Å) could be obtained, which were similar in morphology to those of the protein alone. A series of mutations were made, designed to improve the crystal contacts, but the end result was a new crystal form that diffracted to 1.7 Å.

Dasgupta et al. (1997[link]), in an informative review, have compared the contacts formed between molecules in crystal lattices and in protein oligomerization. They found that there are more polar interactions in crystal contacts, while oligomer contacts favour aromatic residues and methionine. Arginine is the only residue prominent in both, and for a protein that is difficult to crystallize, they recommend replacing lysine with arginine or glutamine. Carugo & Argos (1997[link]) also examined crystal-packing contacts between protein molecules and compared these with contacts formed in oligomers. They observed that the area of the crystal contacts is generally smaller, but that the amino-acid composition of the contacts is indistinguishable from that of the solvent-accessible surface of the protein and is dramatically different from that observed in oligomer interfaces.

4.3.2. Improving solubility

| top | pdf |

Frequently, a protein is so insoluble that there is only a small probability of direct crystallization. Not only does the limited amount of protein hinder crystallization, but the departure from optimal solubility conditions by the addition of almost any crystallization medium frequently results in rapid precipitation of the protein from solution. When this happens, it is sometimes possible to find surface mutations that enhance solubility. Two strategies have been successfully applied, depending on whether or not the overall topology is known.

An early investigation of the effects of surface mutations (McElroy et al., 1992[link]) involved the crystallization of human thymidylate synthase, where the Escherichia coli enzyme structure was known, but the human enzyme could only be crystallized in an apo form unsuitable for studying inhibitors owing to disorder in the active site. The effect of surface mutations was systematically explored by making 12 mutations in 11 positions, and it was found that some of the mutations dramatically changed the protein solubility. Some of the mutant proteins were easier to crystallize than the wild type, and, furthermore, three crystal forms were obtained that differed from that of the wild type.

A second example of the rational design of surface mutations based on prior knowledge of the structure of a related protein is demonstrated by the studies of the trimethoprim-resistant type S1 hydrofolate reductase (Dale et al., 1994[link]). This protein was rather insoluble and precipitated at concentrations greater than 2 mg ml−1. The authors changed four neutral, amide-containing side chains to carboxylates and examined the expressed proteins for improved solubility. Three of the four mutant proteins were more soluble than the wild-type protein, and a double mutant, [\hbox{Asn48}\rightarrow \hbox{Glu}] and [\hbox{Asn130}\rightarrow \hbox{Asp}], was particularly soluble; this mutant protein crystallized in thick plates, ultimately enabling the structure to be determined.

In the absence of any knowledge of the structure, more heroic procedures are required, as illustrated by the crystallization of the HIV-1 integrase catalytic domain (residues 50–212). This domain had been a focus of intensive crystallization attempts, which were hindered by the low solubility of the protein. The strategy used was to replace all the single hydrophobic residues with lysine and to replace groups of adjacent hydrophobic amino acids with alanines (Jenkins et al., 1995[link]). A simple assay for improved solubility based on the overexpression of the protein was employed, which did not require isolating the purified protein; cell lysis followed by centrifugation and SDS–PAGE analysis were used to determine which mutant proteins were sufficiently soluble to appear in the supernatant. The initial application of this method to 30 mutants resulted in one, [\hbox{Phe185}\rightarrow \hbox{Lys}], which was soluble and which was subsequently crystallized and its structure determined (Dyda et al., 1994[link]). The protein formed a dimer, and the mutated residue was observed at the periphery of the dimer interface where the introduced lysine formed a hydrogen bond with a backbone atom of the second subunit, an interaction not possible for the unmutated protein. The position of the mutation was remote from the active site, and the physiological relevance of the observed dimer interaction was later confirmed by studies on an avian retroviral integrase (Bujacz et al., 1995[link]).

In further mutational work, it was observed that the HIV-1 integrase core-domain mutant suffered from an inability to bind to Mg2+ in the crystal, despite the evidence that Mg2+ or Mn2+ is needed for activity. The original crystallization took place using cacodylate as a buffer and also had dithiothreitol present in the crystallization medium. Under these conditions, cacodylate can react with –SH groups, and there were two cysteines in the structure that were clearly bonded to arsenic atoms. To avoid this problem, attempts were made to crystallize in the absence of cacodylate. These were successful only when a second mutation, designed to improve solubility, was introduced, [\hbox{Trp131}\rightarrow \hbox{Glu}] (Jenkins et al., 1995[link]; Goldgur et al., 1998[link]). The use of this mutant led to crystals that had the desired property of binding to Mg2+ and, in addition, revealed the conformation of a flexible loop that had not been previously defined.

4.3.3. Use of fusion proteins

| top | pdf |

Fusion proteins have been frequently used in a variety of applications (reviewed by Nilsson et al., 1992[link]), such as preventing proteolysis, changing solubility and increasing stability. They have also been used – although less frequently – for crystallization. The disadvantage in the context of crystallography is that the length and flexibility of the linker chain often introduce mobility of one protein domain relative to the other, which can impede, rather than enhance, crystallization.

Donahue et al. (1994[link]) were able to determine the three-dimensional structure of the 14 residues representing the platelet integrin recognition segment of the fibrinogen γ chain by constructing a fusion protein with lysozyme, which was then crystallized from ammonium sulfate. Kuge et al. (1997[link]) successfully obtained crystals of a fusion protein consisting of glutathione S-transferase (GST) and the DNA-binding domain (residues 16–115) of the DNA replication-related element-binding factor, DREF, under crystallization conditions similar to those used for GST alone.

In many cases, a fusion protein is made to aid in the isolation and purification of the target protein, and the intervening linker is engineered to contain a proteolytically susceptible sequence. However, subsequent cleavage to separate the two proteins can introduce the possibility of accidental proteolysis elsewhere in the protein. This was observed with a fusion protein between thioredoxin and VanH, a D-lactate dehydrogenase, where attempts to remove the carrier resulted in non-specific proteolysis and VanH inactivation (Stoll et al., 1998[link]). Fortunately, cleavage was unnecessary, and conditions were identified under which the authors were able to crystallize the intact fusion protein.

A novel approach to crystallizing membrane proteins is provided by the fusion protein in which cytochrome [b_{562}] was inserted into a central cytoplasmic loop of the lactose permease from Escherichia coli (Privé et al., 1994[link]). Although crystals have not yet been reported, the cytochrome attachment provides increased solubility together with the ability to use the red colour to assay the progress of crystallization trials.

4.3.4. Mutations to accelerate crystallization

| top | pdf |

A common problem encountered in crystallization is that certain crystals appear late and grow slowly. Sometimes, the slow appearance of crystals is the result of proteolytic processing, but often the reasons are not apparent. There are several examples where protein engineering has resulted in an increase in the rate of crystallization.

Heinz & Matthews (1994)[link] explored the crystallization of T4 phage lysozyme using a strategy based on their understanding of the structure of the enzyme and its crystallization properties. The crystallization of the wild-type protein required the presence of β-mercaptoethanol (BME), an additive which could not be replaced with dithiothreitol. It had also been observed that the oxidized form of BME, hydroxyethyl disulfide, was trapped in the dimer interface between two lysozyme molecules (Bell et al., 1991[link]). It was hypothesized that dimer formation might be the rate-limiting step in crystallization, so dimerization was enhanced by cross-linking two monomers by disulfide-bridge formation. Applying rules developed for constructing S–S bridges, they selected [\hbox{Asn68}\rightarrow \hbox{Cys}] and [\hbox{Ala93}\rightarrow \hbox{Cys}]. In the presence of oxidized BME, the rate of crystallization of these mutant proteins was substantially increased, with crystals reaching full size in two days, in contrast to two weeks for the unmutated protein. Furthermore, they were able to crystallize a previously uncrystallizable mutant. Unexpectedly, however, the dimer formed in this way was lacking in activity, despite the selection of mutation sites on the opposite side of the molecule to the active site.

Mittl et al. (1994[link]) wanted to improve the resolution of their crystals of glutathione reductase. From the 3 Å map, they could see a hole in the crystal packing where two molecules within 6 Å of each other just missed forming a crystal contact; they filled this hole by mutating [\hbox{Ala90}\rightarrow \hbox{Tyr}] and [\hbox{Ala86}\rightarrow \hbox{His}]. This designed double mutant did not improve the resolution, but did increase the rate of crystallization 40-fold, i.e., initial crystals were observed within 1.5 h versus 60 h for the wild-type enzyme.

4.3.5. Mutations to improve diffraction quality

| top | pdf |

Another commonly encountered situation is that crystals can be obtained, but they diffract poorly. There are many examples where investigators have applied protein engineering in an effort to overcome this problem.

Proteolytic trimming is one possible approach to improving diffraction quality. For example, Zhang et al. (1997[link]) attempted to crystallize a homodimer of the C2 domain of adenylyl cyclase. The initial crystals diffracted poorly (to 3.8 Å), so the effects of limited proteolysis with chymotrypsin, trypsin, GluC and LysC were investigated. A stable cleavage product was observed with GluC, approximately 4 kDa smaller than the full-length protein, but in order to avoid minor products formed during GluC proteolysis, the cleavage site was re-engineered as a thrombin site. Since there was already an atypical thrombin site seven residues from this site, proteolysis resulted in a smaller protein than expected; nevertheless, this modified protein crystallized readily and diffracted to 2.2 Å.

The importance of applying a variety of strategies to improve crystal quality is exemplified by the work of Oubridge et al. (1995[link]), in which initial attempts to crystallize wild-type U1A complexed with RNA hairpins resulted in cubic crystals diffracting to 7–8 Å. By mutating surface residues, changing the N-terminal sequence to reduce heterogeneity and varying the sequence of the RNA hairpin, a new crystal form which diffracted to 1.7 Å was ultimately crystallized. However, in order to achieve this result, many variants were constructed and examined. For the protein, mutations were introduced which it was believed (incorrectly) would affect the crystal packing, and which were selected based on the observed similarity of space group and cell dimensions between crystals of the complex and those of the protein alone. One of these mutations, together with an additional mutation resulting from a polymerase chain reaction (PCR) artefact, yielded crystals that diffracted to 3.5 Å. Additional variation of the length and composition of the RNA hairpin led to a new crystal form of this double mutant in the presence of a 21-base RNA that diffracted to 1.7 Å. A further mutation, [\hbox{Ser29}\rightarrow \hbox{Cys}], was made to allow mercury binding (see Section 4.3.8[link]), also resulting in crystals that diffracted to 1.7 Å. The authors commented that `If any principle emerges from this study, it is that the key to success is not in concentrating on exhausting any one approach, but in the diversity of approaches used.'

The relevance of this comment is illustrated by the attempts of Scott et al. (1998)[link] to obtain diffraction-quality crystals of the I-[\hbox{A}^{\rm d}] class II major histocompatibility complex (MHC) protein. This complex exists in vivo as a heterodimer, but expression in recombinant form did not lead to satisfactory dimer formation. A leucine zipper peptide was therefore added to each chain to enhance dimerization. Attempts to crystallize this heterodimer after removal of the leucine zippers and in the presence of bound peptides led to poorly diffracting crystals. To enhance the affinity of an ovalbumin peptide for the MHC dimer, the peptide was then attached through a six-residue linker to the N-terminus of the chain, tethering it in the vicinity of the binding site. This construct, in conjunction with removal of the leucine zippers from the heterodimer, resulted in crystals that diffracted to 2.6 Å.

4.3.6. Avoiding protein heterogeneity

| top | pdf |

Protein heterogeneity can arise from many sources, including proteolysis, oxidation and post-translational modifications, and can have a severe effect on crystal quality or can prevent crystallization altogether. Limited proteolysis has frequently been used to modify proteins for crystallization, in order to avoid heterogeneity from proteolysis occurring during expression and to remove relatively unstructured regions that might hinder crystallization. Some examples are given below.

Windsor et al. (1996[link]) crystallized a complex of interferon γ with the extracellular domain of the interferon γ cell surface receptor. To obtain satisfactory crystals, it was necessary to re-engineer the receptor with an eight-amino-acid residue deletion at the N-terminus to avoid the observed heterogeneity owing to proteolysis, since 2–10% of the purified protein was cleaved during expression.

Crucial to the structure determination of the complex of transducin-α bound to GTPγS (Noel et al., 1993[link]) was the systematic examination of proteolysis of the intact protein (Mazzoni et al., 1991[link]). This work revealed a cluster of protease-sensitive sites near residues Lys17–Lys25. Homogeneous material consisting of residues 26–350 of activated rod transducin, [\hbox{G}_{{\rm t}\alpha}], was obtained by proteolysis of the full-length protein with endoproteinase LysC; the truncated protein was subsequently used to solve the structure.

Hickman et al. (1997[link]) identified a site near the C-terminus of HIV-1 integrase that was susceptible to proteolytic cleavage during protein expression, resulting in severe protein heterogeneity in which up to 30% of the purified protein was cleaved. The proteolysis site was identified by mass spectrometry analysis, and several point mutations on either side of this site were made and evaluated for their effect on proteolysis. Substitution of either Gly or Lys for Arg284 eliminated the protease sensitivity, yielding homogeneous material.

Some proteins have surface cysteines that are susceptible to oxidation and can be adventitiously cross-linked via a disulfide bridge that does not exist in the native protein. If there are relatively few cysteines, this problem may be circumvented by mutating the individual cysteines to determine which ones are responsible. Conversely, cysteines can be introduced into proteins to enhance the binding of interacting molecules (see also Section 4.3.8[link]). An elegant example of the latter case is provided by the recent structure of HIV-1 reverse transcriptase (Huang et al., 1998[link]), which was mutated to introduce a cysteine in a position near the known binding side of the double-stranded DNA substrate. Using an oligonucleotide with a modified base that contained a free thiol group, cross-links were specifically introduced between the protein and the DNA; this covalently linked complex was used to obtain crystals that contained the incoming nucleoside triphosphate, a crystallographic problem that had defied other solutions.

Post-translationally modified proteins, such as glycoproteins, present some of the most difficult problems in X-ray crystallography, since the carbohydrate side chains are usually flexible and often heterogeneous. In some cases, enzymes can be used to trim the carbohydrate and produce a protein suitable for crystallization. Alternatively, the protein sequence can be altered so that unwanted glycosylation does not occur. A combination of approaches was used by Kwong et al. (1998[link]) to determine the structure of the HIV-1 envelope glycoprotein, gp120, a protein which is extensively modified in vivo. The N- and C-termini were truncated, 90% of the carbohydrate was removed by deglycosylation and two large, flexible loops of the protein were replaced by tripeptides. The resulting simplified version of the glycoprotein retained its ability to bind the CD4 receptor, and crystals were ultimately obtained of a ternary complex of the envelope glycoprotein, a two-domain fragment of CD4 and an antibody Fab.

Occasionally, an mRNA sequence will fortuitously result in a false initiation of translation, resulting in a truncated form co-purifying with the intended protein. In attempting to crystallize a trimethoprim-resistant form of dihydrofolate reductase, Dale et al. (1994[link]) observed that a fragment of the protein was being expressed through false initiation of translation, beginning at Ala43. They also found most of the protein in inclusion bodies and recovery was poor. They noticed that there was a putative Shine–Dalgarno sequence ten nucleotides up from the AUG codon of Met42, which could result in the expression of a smaller protein. They replaced the middle base of the Shine–Dalgarno sequence, GGGAA, with GGCAA and removed unusual codons from the first 18 amino acids. These two changes resulted in a 20-fold increase in expression level, together with removal of the contaminating fragment. Similar heterogeneity problems owing to translation initiation at an internal Shine–Dalgarno sequence upstream of Met50 were observed during expression of full-length recombinant HIV-1 integrase and were also resolved by altering the DNA to eliminate the Shine–Dalgarno sequence without changing the sequence of encoded amino acids (Hizi & Hughes, 1988[link]).

4.3.7. Engineering crystal contacts to enhance crystallization in a particular crystal form

| top | pdf |

It is often the case that the structure of some related form of a protein is known, but the protein of interest crystallizes in a different space group. There have been attempts to use this knowledge to obtain crystals in a form that could be readily analysed. However, it may not be necessary to resort to molecular engineering approaches, since molecular replacement methods can often be successfully applied to determine the protein structure.

In one of the first applications of protein engineering to obtain crystals, Lawson et al. (1991)[link] reported the crystal structure of ferritin H. Ferritin has two types of chains, H and L; the structure of rat L ferritin was known. Despite high sequence identity to L ferritin, human recombinant H ferritin did not crystallize satisfactorily. To obtain the structure of a human H ferritin homopolymer, the sequence in the subunit interface was modified to give crystals that were isomorphous with the rat L ferritin. The mutation [\hbox{Lys86}\rightarrow \hbox{Gln}] was introduced, which enabled metal bridge contacts to form, resulting in crystals that diffracted to 1.9 Å. Although the mutant was designed to crystallize from CdSO4, it did not do so. Rather, CaCl2 gave large crystals which were isomorphous with rat and horse L ferritin crystals. In these latter crystals, Ca2+ is coordinated between Asp84 and Gln86, providing the rationale for the mutation.

4.3.8. Engineering heavy-atom sites

| top | pdf |

Another application of protein engineering to crystallography involves the mutation of wild-type residues to cysteines, thus creating potential heavy-atom binding sites (reviewed by Price & Nagai, 1995[link]). This was first systematically investigated by Sun et al. (1987[link]), who made five cysteine mutants of T4 phage lysozyme. They demonstrated that modification of the protein usually, but not always, introduced differences in isomorphism with the wild type. When the lack of isomorphism was not large, its effects could be reduced by comparing the mutant crystal with and without heavy atoms. The authors suggested that serine would be an attractive site for substitution, since it is structurally similar to cysteine and has a high probability of being on the protein surface.

However, in the absence of a known structure, the choice of a successful cysteine substitution site involves some luck. A general sense of the success rate of this approach can be gauged from three studies. Martinez et al. (1992[link]) prepared 14 mutant forms of Fusarium solani cutinase in which each serine was replaced by cysteine. Four of these gave isomorphous crystals and led to useful derivatives with mercuric acetate. Nagai et al. (1990[link]), as part of an attempt to crystallize a domain of the U1 small RNA-binding protein, engineered ten mutants to give cysteine replacements for polar side chains; of these, four yielded mercury derivatives that were isomorphous with the native protein. Finally, in a study of the ribosomal protein L9 (Hoffman et al., 1994[link]), eight cysteine mutants were prepared, but only one crystallized well and was isomorphous with wild-type crystals. In addition, two methionine mutants were engineered, and both crystallized isomorphously to the wild type (discussed below).

When the protein being examined belongs to a homologous superfamily, the sequences can be analysed to provide likely sites. For example, in a structure determination of ribosomal protein L6 (Golden et al., 1993[link]), a heavy-atom binding site was constructed with the mutant [\hbox{Val124}\rightarrow \hbox{Cys}]. This site was chosen because it is a cysteine residue in other L6 proteins. The mutant protein crystallized with the same space group and cell dimensions as the wild type. It was reacted with parachloromercuribenzoate to provide a heavy-atom derivative. However, at high resolution, the crystals were not isomorphous with the wild type, so a derivative was prepared by replacing the two methionines with selenomethionine, illustrating a second approach to engineering heavy-atom sites, discussed below. The structure was ultimately solved with a combination of multiple isomorphous replacement, anomalous scattering and solvent flattening. The same approach was used for OmpR (Martínez-Hackert et al., 1996[link]), in which cysteine residues were similarly engineered into positions determined by comparison with other proteins of the superfamily.

The increasing popularity of multiwavelength anomalous dispersion (Karle, 1980[link]; Hendrickson, 1991[link]; Hendrickson & Ogata, 1997[link]) for phase determination, using selenomethionine (Se-Met) in place of methionine, has led to the engineering of proteins to create selenomethionine sites. The original substitution of Se-Met for Met was described by Cowie & Cohen (1957[link]). The potential for crystallography was demonstrated for thioredoxin (Hendrickson et al., 1990[link]) and was used to solve the structure of ribonuclease H (Yang, Hendrickson, Crouch & Satow, 1990[link]; Yang, Hendrickson, Kalman & Crouch, 1990[link]). Methods for preparing Se-Met-substituted proteins are reviewed by Doublié (1997[link]). Budisa et al. (1995[link]) have also reported successful incorporation of telluromethionine into a protein, although this approach is not yet routine.

Since the frequency of methionines in proteins is about 1 in 60 (Dayhoff, 1978[link]; Hendrickson et al., 1990[link]), it is not unusual for the protein being studied to contain no methionine residues. A number of investigators have introduced methionine into a protein sequence so that it can subsequently be replaced by Se-Met. These include Leahy et al. (1994[link]), who crystallized domains FN7–10 of human fibronectin. Attempts to obtain mercury-soaked diffraction-quality crystals of FN7–10P, a double mutant that resulted from a (yet another!) PCR error, were unsuccessful, as were attempts to solve the structure by molecular replacement. They therefore prepared a mutant in which three residues (two leucines and one isoleucine) were substituted with methionine. Diffraction-quality data were subsequently obtained from the Se-Met derivatives.

Sometimes the protein cannot be crystallized satisfactorily in the Se-Met form, and further modification is required. The Se-Met derivative of UmuD′, an Escherichia coli SOS response protein, did not crystallize under conditions that gave native crystals (Peat et al., 1996[link]). Comparison with homologous proteins indicated that two of the Met sites were either conserved or replaced by hydrophobic residues. The third site, Met138, was variable and often replaced by a polar residue. The authors hypothesized that this methionine might, therefore, be surface-exposed, rendering the Se-Met version highly susceptible to oxidation and heterogeneity. When this penultimate Met was mutated to [\hbox{Met138}\rightarrow \hbox{Val}] or [\hbox{Met138}\rightarrow \hbox{Thr}], these mutant proteins yielded crystals both with and without introduction of Se-Met.

As a final note, it is worth returning to the study involving the crystallization of the U1A/RNA complex (Oubridge et al., 1995[link]), in which the authors comment: `In retrospect it is clear that too much was assumed about interactions within crystals, and that the “design” of good crystals per se was not feasible …. It may be that almost anything can be crystallized to give well ordered crystals as long as enough constructs are tried; however, one only knows the right condition when the crystals are obtained.'


We gratefully acknowledge the kind assistance of B. Cudney, A. D'Arcy, J. Ladner and A. McPherson in the early stages of researching this article, and A. McPherson and K. Nagai for reviewing the manuscript. We also thank A. Stock and S. Hughes for their help in coordinating references and for sharing their manuscript prior to publication.


Bell, J. A., Wilson, K. P., Zhang, X.-J., Faber, H. R., Nicholson, H. & Matthews, B. W. (1991). Comparison of the crystal structure of bacteriophage T4 lysozyme at low, medium, and high ionic strengths. Proteins, 10, 10–21.
Braig, K., Otwinowski, Z., Hegde, R., Boisvert, D. C., Joachimiak, A., Horwich, A. L. & Sigler, P. B. (1994). The crystal structure of the bacterial chaperonin GroEL at 2.8 Å. Nature (London), 371, 578–586.
Budisa, N., Steipe, B., Demange, P., Eckerskorn, C., Kellermann, J. & Huber, R. (1995). High-level biosynthetic substitution of methionine in proteins by its analogs 2-aminohexanoic acid, selenomethionine, telluromethionine and ethionine in Escherichia coli. Eur. J. Biochem. 230, 788–796.
Bujacz, G., Jaskolski, M., Alexandratos, J., Wlodawer, A., Merkel, G., Katz, R. A. & Skalka, A. M. (1995). High resolution structure of the catalytic domain of avian sarcoma virus integrase. J. Mol. Biol. 253, 333–346.
Carugo, O. & Argos, P. (1997). Protein–protein crystal-packing contacts. Protein Sci. 6, 2261–2263.
Cowie, D. B. & Cohen, G. N. (1957). Biosynthesis by Escherichia coli of active altered proteins containing selenium instead of sulfur. Biochim. Biophys. Acta, 26, 252–261.
Dale, G. E., Broger, C., Langen, H., D'Arcy, A. & Stüber, D. (1994). Improving protein solubility through rationally designed amino acid replacements: solubilization of the trimethoprim-resistant type S1 dihydrofolate reductase. Protein Eng. 7, 933–939.
D'Arcy, A. (1994). Crystallizing proteins – a rational approach? Acta Cryst. D50, 469–471.
Dasgupta, S., Iyer, G. H., Bryant, S. H., Lawrence, C. E. & Bell, J. A. (1997). Extent and nature of contacts between protein molecules in crystal lattices and between subunits of protein oligomers. Proteins, 28, 494–514.
Dayhoff, M. O. (1978). Atlas of protein sequence and structure, Vol. 5, Suppl. 3, p. 363. Washington DC: National Biomedical Research Foundation.
Donahue, J. P., Patel, H., Anderson, W. F. & Hawiger, J. (1994). Three-dimensional structure of the platelet integrin recognition segment of the fibrinogen γ chain obtained by carrier protein-driven crystallization. Proc. Natl Acad. Sci. USA, 91, 12178–12182.
Doublié, S. (1997). Preparation of selenomethionyl proteins for phase determination. Methods Enzymol. 276, 523–530.
Dyda, F., Hickman, A. B., Jenkins, T. M., Engelman, A., Craigie, R. & Davies, D. R. (1994). Crystal structure of the catalytic domain of HIV-1 integrase: similarity to other polynucleotidyl transferases. Science, 266, 1981–1986.
Fermi, G. & Perutz, M. F. (1981). Atlas of molecular structures in biology, Vol. 2. Oxford: Clarendon Press.
Golden, B. L., Ramakrishnan, V. & White, S. W. (1993). Ribosomal protein L6: structural evidence of gene duplicaton from a primitive RNA binding protein. EMBO J. 12, 4901–4908.
Goldgur, Y., Dyda, F., Hickman, A. B., Jenkins, T. M., Craigie, R. & Davies, D. R. (1998). Three new structures of the core domain of HIV-1 integrase: an active site that binds magnesium. Proc. Natl Acad. Sci. USA, 95, 9150–9154.
Heinz, D. W. & Matthews, B. W. (1994). Rapid crystallization of T4 lysozyme by intermolecular disulfide cross-linking. Protein Eng. 7, 301–307.
Hendrickson, W. A. (1991). Determination of macromolecular structures from anomalous diffraction of synchrotron radiation. Science, 254, 51–58.
Hendrickson, W. A., Horton, J. R. & LeMaster, D. M. (1990). Selenomethionyl proteins produced for analysis by multiwavelength anomalous diffraction (MAD): a vehicle for direct determination of three-dimensional structure. EMBO J. 9, 1665–1672.
Hendrickson, W. A. & Ogata, C. M. (1997). Phase determination from multiwavelength anomalous diffraction measurements. Methods Enzymol. 276, 494–523.
Hickman, A. B., Dyda, F. & Craigie, R. (1997). Heterogeneity in recombinant HIV-1 integrase corrected by site-directed mutagenesis: the identification and elimination of a protease cleavage site. Protein Eng. 10, 601–606.
Hizi, A. & Hughes, S. H. (1988). Expression of the Moloney murine leukemia virus and human immunodeficiency virus integration proteins in Escherichia coli. Virology, 167, 634–638.
Hoffman, D. W., Davies, C., Gerchman, S. E., Kycia, J. H., Porter, S. J., White, S. W. & Ramakrishnan, V. (1994). Crystal structure of prokaryotic ribosomal protein L9: a bi-lobed RNA-binding protein. EMBO J. 13, 205–212.
Huang, H., Chopra, R., Verdine, G. L. & Harrison, S. C. (1998). Structure of a covalently trapped catalytic complex of HIV-1 reverse transcriptase: implications for drug resistance. Science, 282, 1669–1675.
Jenkins, T. M., Hickman, A. B., Dyda, F., Ghirlando, R., Davies, D. R. & Craigie, R. (1995). Catalytic domain of human immunodeficiency virus type 1 integrase: identification of a soluble mutant by systematic replacement of hydrophobic residues. Proc. Natl Acad. Sci. USA, 92, 6057–6061.
Karle, J. (1980). Some developments in anomalous dispersion for the structural investigation of macromolecular systems in biology. Int. J. Quantum Chem. Symp. 7, 357–367.
Kuge, M., Fujii, Y., Shimizu, T., Hirose, F., Matsukage, A. & Hakoshima, T. (1997). Use of a fusion protein to obtain crystals suitable for X-ray analysis: crystallization of a GST-fused protein containing the DNA-binding domain of DNA replication-related element-binding factor, DREF. Protein Sci. 6, 1783–1786.
Kwong, P. D., Wyatt, R., Robinson, J., Sweet, R. W., Sodroski, J. & Hendrickson, W. A. (1998). Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody. Nature (London), 393, 648–659.
Lawson, D. M., Artymiuk, P. J., Yewdall, S. J., Smith, J. M. A., Livingstone, J. C., Treffry, A., Luzzago, A., Levi, S., Arosio, P., Cesareni, G., Thomas, C. D., Shaw, W. V. & Harrison, P. M. (1991). Solving the structure of human H ferritin by genetically engineering intermolecular crystal contacts. Nature (London), 349, 541–544.
Leahy, D. J., Erickson, H. P., Aukhil, I., Joshi, P. & Hendrickson, W. A. (1994). Crystallization of a fragment of human fibronectin: introduction of methionine by site-directed mutagenesis to allow phasing via selenomethionine. Proteins, 19, 48–54.
McElroy, H. E., Sisson, G. W., Schoettlin, W. E., Aust, R. M. & Villafranca, J. E. (1992). Studies on engineering crystallizability by mutation of surface residues of human thymidylate synthase. J. Cryst. Growth, 122, 265–272.
Martinez, C., De Geus, P., Lauwereys, M., Matthyssens, G. & Cambillau, C. (1992). Fusarium solani cutinase is a lipolytic enzyme with a catalytic serine accessible to solvent. Nature (London), 356, 615–618.
Martínez-Hackert, E., Harlocker, S., Inouye, M., Berman, H. M. & Stock, A. M. (1996). Crystallization, X-ray studies, and site-directed cysteine mutagenesis of the DNA-binding domain of OmpR. Protein Sci. 5, 1429–1433.
Matthews, B. W. (1993). Structural and genetic analysis of protein stability. Annu. Rev. Biochem. 62, 139–160.
Mazzoni, M. R., Malinski, J. A. & Hamm, H. E. (1991). Structural analysis of rod GTP-binding protein, Gt. J. Biol. Chem. 266, 14072–14081.
Mittl, P. R. E., Berry, A., Scrutton, N. S., Perham, R. N. & Schulz, G. E. (1994). A designed mutant of the enzyme glutathione reductase shortens the crystallization time by a factor of forty. Acta Cryst. D50, 228–231.
Nagai, K., Oubridge, C., Jessen, T. H., Li, J. & Evans, P. R. (1990). Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A. Nature (London), 348, 515–520.
Nilsson, B., Forsberg, G., Moks, T., Hartmanis, M. & Uhlén, M. (1992). Fusion proteins in biotechnology and structural biology. Curr. Opin. Struct. Biol. 2, 569–575.
Noel, J. P., Hamm, H. E. & Sigler, P. B. (1993). The 2.2 Å crystal structure of transducin-α complexed with GTPγS. Nature (London), 366, 654–663.
Oubridge, C., Ito, N., Teo, C.-H., Fearnley, I. & Nagai, K. (1995). Crystallisation of RNA-protein complexes II. The application of protein engineering for crystallisation of the U1A protein–RNA complex. J. Mol. Biol. 249, 409–423.
Peat, T. S., Frank, E. G., Woodgate, R. & Hendrickson, W. A. (1996). Production and crystallization of a selenomethionyl variant of UmuD′, an Escherichia coli SOS response protein. Proteins, 25, 506–509.
Price, S. R. & Nagai, K. (1995). Protein engineering as a tool for crystallography. Curr. Opin. Biotech. 6, 425–430.
Privé, G. G., Verner, G. E., Weitzman, C., Zen, K. H., Eisenberg, D. & Kaback, H. R. (1994). Fusion proteins as tools for crystallization: the lactose permease from Escherichia coli. Acta Cryst. D50, 375–379.
Scott, C. A., Garcia, K. C., Stura, E. A., Peterson, P. A., Wilson, I. A. & Teyton, L. (1998). Engineering protein for X-ray crystallography: the murine major histocompatibility complex class II molecule I-A. Protein Sci. 7, 413–418.
Stoll, V. S., Manohar, A. V., Gillon, W., Macfarlane, E. L. A., Hynes, R. C. & Pai, E. F. (1998). A thioredoxin fusion protein of VanH, a D-lactate dehydrogenase from Enterococcus faecium: cloning, expression, purification, kinetic analysis, and crystallization. Protein Sci. 7, 1147–1155.
Sun, D.-P., Alber, T., Bell, J. A., Weaver, L. H. & Matthews, B. W. (1987). Use of site-directed mutagenesis to obtain isomorphous heavy-atom derivatives for protein crystallography: cysteine-containing mutants of phage T4 lysozyme. Protein Eng. 1, 115–123.
Windsor, W. T., Walter, L. J., Syto, R., Fossetta, J., Cook, W. J., Nagabhushan, T. L. & Walter, M. R. (1996). Purification and crystallization of a complex between human interferon γ receptor (extracellular domain) and human interferon γ. Proteins, 26, 108–114.
Yang, W., Hendrickson, W. A., Crouch, R. J. & Satow, Y. (1990). Structure of ribonuclease H phased at 2 Å resolution by MAD analysis of the selenomethionyl protein. Science, 249, 1398–1405.
Yang, W., Hendrickson, W. A., Kalman, E. T. & Crouch, R. J. (1990). Expression, purification, and crystallization of natural and selenomethionyl recombinant ribonuclease H from Escherichia coli. J. Biol. Chem. 265, 13553–13559.
Zhang, G., Liu, Y., Qin, J., Vo, B., Tang, W.-J., Ruoho, A. E. & Hurley, J. H. (1997). Characterization and crystallization of a minimal catalytic core domain from mammalian type II adenylyl cyclase. Protein Sci. 6, 903–908.
Zhang, X., Wozniak, J. A. & Matthews, B. W. (1995). Protein flexibility and adaptability seen in 25 crystal forms of T4 lysozyme. J. Mol. Biol. 250, 527–552.

to end of page
to top of page