Structure Vol. 2, No. 7, 15 July 1994 Volume changes on protein folding [Research Article] Yehouda Harpaz, Mark Gerstein, Cyrus Chothia Structure 1994, 2:641-649. |
|
Protein volumes change very little on folding at low pressure, but at high pressure the unfolded state is more compact. So far, the molecular origins of this behaviour have not been explained: it is the opposite of that expected from the model of the hydrophobic effect based on the transfer of non-polar solutes from water to organic solvent.
We redetermined the mean volumes occupied by residues in the interior of proteins. The new residue volumes are smaller than those given by previous calculations which were based on much more limited data. They show that the packing density in protein interiors is exceptionally high. Comparison of the volumes that residues occupy in proteins with those they occupy in solution shows that aliphatic groups have smaller volumes in protein interiors than in solution, while peptide and charged groups have larger volumes. The cancellation of these volume changes is the reason that the net change on folding is very small.
The exceptionally high density of the protein interior shown here implies that packing forces play a more important role in protein stability than has been believed hitherto.
The volume changes that occur on protein folding are directly related to the forces responsible for the stability of proteins. They are the result of the differences in the molecular interactions that occur within folded proteins and those that occur between unfolded proteins and water. More than 20 years ago, several groups determined experimentally the extent of volume changes that occur on folding [1] [2] [3]. These experiments were carried out to test the hydrophobic model for protein stability [4]: the results were not those expected.
It was found that on protein folding the volume changes are very small (< 0.5%) at normal pressures but large and positive at high pressures i.e. at low pressures the folded and unfolded states occupy the same volume but at high pressure the unfolded state is more compact. Models of the hydrophobic effect, based on the transfer of non-polar solutes from water to organic solvent, predict that the volume changes on folding would be positive at low pressure and negative at high pressure i.e. the unfolded state should be more compact at low pressure and less compact at high pressure [4].
A consequence of the failure of the solution transfer models to account for the observed volume changes was that the molecular mechanisms that underlay the changes were not understood [1] [2] [3]. However, as Kauzmann has argued [5], for any model of protein stability to be successful it must account for these observations.
Soon after these experiments, Richards introduced a procedure for determining directly the volumes of residues buried in the interior of protein structures [6]. Its application to the early protein crystal structures showed clearly that their interiors are close-packed as in crystals of amino acids [6] [7]. Though this discovery has been important for subsequent calculations and experiments on protein stability it raised a new problem: the total protein volumes given by these calculations were significantly larger than the volumes determined by solution experiments [6].
Here we provide an explanation for the volume changes on protein folding. This explanation is based upon a redetermination of the volumes residues occupy in folded proteins (which demonstrates that they are smaller than previously thought) and a comparison of these new volumes with the volumes residues occupy in solution.
The current values for the mean volumes of residues buried in protein interiors were determined from the atomic coordinates of 15 proteins (the set of structures that was available at the time) [7]. Much more accurate and extensive crystallographic data are now available and we used these to redetermine the volumes of buried residues.
We initially took 119 different proteins from the protein structure data bank [8]. All had different sequences, were determined at high resolution (between 1.0 Å and 1.9 Å), had been refined to R-factors of 20% or less, and had good sterochemistry [9]. Four proteins were found to have no completely buried residues and seven had sequence identities of between 67% and 90% with other members of the set. These 11 proteins were removed from our set and calculations carried out on the remaining 108. Of these, 74 had no significant sequence identity with each other, 25 had identities of between 17% and 45% with one of the 74 and 9 had identities of between 45% and 63%. (A list of these structures is available on request from the authors.)
The volumes of atoms buried in the interior of the protein structures were calculated by the Voronoi polyhedra procedure [6] [7] [10] (Fig. 1) using a computer program based on the one originally written by Richards [6]. In addition, the accessible surface areas of the residues in each protein were calculated using the procedure described by Lee and Richards [11]. From these data, we determined the volumes of all residues that are completely buried, i.e. those that have no surface area accessible to solvent. We give in Table 1, for each residue type, the number of residues found buried in our set of proteins (N), the mean volume (in Å 3), the standard error of the mean volume ( / N) and the standard deviation of the observed volumes () in cubic Å and as a percentage of the mean.
Fig. 1. The determination of the volumes occupied by residues in protein interiors. (a) A two-dimensional graphical illustration of the Voronoi procedure [6] [10] for determining the volumes occupied by atoms buried in an assembly of atoms. The points represent the position of atoms. We show how the space effectively occupied by one atom is determined. Vectors are drawn to all neighbours within a radius of 8 Å. Planes perpendicular to each of these vectors are then constructed at a point related to the van der Waals radii of the two atoms forming the vector. The smallest polygon (polyhedron in the three-dimensional case) so constructed is the Voronoi polygon and defines the space effectively occupied by the atom. If the procedure is carried out for all buried atoms the whole of the interior space is allocated to them. The packing density of the interior of the assembly is proportional to the mean area of the polygons (or, in three dimensions, the mean volumes of the polyhedra). (b) The Voronoi polyhedron of the O group in a buried serine residue. The atoms in the residue are drawn as spheres with a radius one-quarter of the van der Waals radii. For reasons of clarity, we do not show the atoms in neighbouring residues which bury the O group and which are used in the construction of the polyhedron [see (a)]. Each of the other atoms in the buried residue is surrounded by similar polyhedra; the sum of their volumes gives the volume occupied by the residue in the protein interior. |
Table 1. Mean volumes of residues buried in protein interiors. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Volumes are given as Å 3 residue- 1; to convert these values to cm3 mol- 1, they should be multiplied by 0.6023 (see text). Cys is involved in a disulfide linkage while Cyh is not. |
The new mean residue volumes are smaller than those determined in the previous work [7]: values for individual residue types are smaller by between 1% and 10%, and by 3.7% on average. The aliphatic and sulphur-containing residues, alanine, valine, leucine, isoleucine, methionine and cysteine are smaller by 12.5%; the aromatic residues, histidine, phenylalanine, tyrosine and tryptophan, and the small residues, glycine, alanine, serine and threonine by 25% and the charged and amide residues, aspartate, asparagine, glutamate, glutamine and arginine by 410%.
The new and old calculations differ in the accuracy and number of the protein structures used in the calculations. They also differ in the definition of buried residues: here, buried residues are those with no surface accessible to water; previously, they had 05% of their surface exposed to water. When our present calculations are repeated using the old definition for buried residues, the mean volumes are 2.5% larger. This suggests that, of the 3.7% difference between the old and new values, ~ 1% comes from using more accurate structures and ~ 2.5% from using a strict definition for buried residues. These results explain why sets of residues have characteristic values for the differences between the old and new volumes (see above). Within the 05% accessibility range used in the old calculations, the aliphatic and sulphur-containing residues tend to be more buried than other residue types and the differences in their volumes are relatively small (12.5%) because the change in the definition for buried residues has only a small effect. The charged and amide residues, on the other hand, tend to be more accessible than other residue types and the strict definition for buried residues will make an important contribution to their larger differences (410%).
In the new calculations, the standard deviations of the observed volumes around the mean values are also smaller than those in the previous work: the average deviation is 3.7% in place of 6% [7] (Table 1). This reflects again the accuracy of the structures used in the present calculations.
In the following sections, we discuss the differences between the mean volumes residues occupy in the protein interior relative to the volumes residues occupy in solution. The significance of these differences will depend upon the standard error of the mean residue volumes. This is not the same as the standard deviations of the observed volumes, , but is related to it by the expression / N, where N is the number of observations used to calculate the mean. The standard error of each mean residue volume is given in Table 1. For 18 residues, the values are between 0.2 Å 3 and 1.2 Å 3 . Three residues, arginine, glutamate and lysine, have larger standard error values (between 1.8 Å 3 and 2.1 Å 3) which arise from the small number (N) of these residues that are buried.
The values for the residue volumes in Table 1 are derived from the relatively small proportion of residues that are completely buried within the proteins: those that form its 'deep' structure. To determine whether these values have a wider application, they were used to calculate the volumes of whole protein molecules in solution.
A major difference between residues in protein interiors and those on the surface is the electrostriction that occurs around charged groups in contact with water. The effect of electrostriction on the volumes of atomic groups was discussed in some detail by Cohn and Edsall [12] who concluded that it reduces the volumes of carboxyl oxygens by 10 Å 3 and amino and guanadino groups by 18 Å 3 values that agree with more recent determinations [13].
The experimental solution volume of a protein is usually expressed as a partial specific volume (PSV), its volume (V) divided by its molecular weight (W). PSVs were calculated for 13 proteins (Table 2). The volume, V, was computed by summing the volumes of each residue in the protein using the mean values given in Table 1; adding the atomic volume of a carboxyl oxygen for the carboxyl terminus and, when necessary, an acetyl group for the amino terminus; and correcting the resulting volume for electrostriction by subtracting 10 Å 3 for each carboxyl oxygen and 18 Å 3 for each amino and guanadino group in the protein (inclusive of the termini). The molecular weight, M, was computed from the amino acid sequences. Now, by convention, the units of PSV are ml g - 1 and the atomic units are Å 3 and Daltons. To calculate the PSVs in conventional units we used the formula:
6.023 × 10 23 × (10 - 8) 3 × (V/W) = 0.6023(V/W).
where 6.023 × 10 23 is Avogadro's number and (10 - 8) 3 converts cubic Å to millilitres.
Table 2. Observed and calculated partial specific volumes for proteins. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
aObserved values taken from the compilation of Squire and Himmel [38] except that for lysozyme, which is taken from Gekko and Noguchi [21]. b The calculation of the partial specific volumes is described in the text. |
The calculated and experimental PSVs for the 13 proteins are given in Table 2. They differ by between 0.0% and 2.2%, and by 0.65% on average. This close agreement between the calculated and observed values shows that, if allowance is made for electrostriction around the charged groups on the protein surface, the volumes of folded proteins can be accurately determined from the mean volumes of buried residues given in Table 1.
There are no data for the volumes residues occupy in solution per se . However, since the work of Cohn et al . [14], residue volumes have been estimated from the volumes of amino acids and peptides in solution. Data are available for the volumes in solution of all 20 amino acids [12] [13] [14] [15] [16] and are given in Table 3. Data are also available for certain tripeptides of the form Gly-X-Gly (see below for discussion of these data).
Table 3. The volumesa side chains occupy in protein interiors and in amino acids in solutions. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
aAll volumes are given here as Å 3 residue- 1 etc.; to convert these values to cm mol- 1, they should be multiplied by 0.6023 (see text). bResidue volumes in the protein interior, Vp, are from Table 1. cAmino acid volumes in water, Vs, are from Mishra and Ahluwalia [13], Rao et al. [15] and Jolicoeur et al. [16]. Many of these values are close to the original measurements of Cohn and Edsall [12] and Cohn et al. [14]. dThe volumes occupied by side chains in proteins and amino acids were calculated by simply subtracting the corresponding volume of the glycyl residue or glycine from the rest of the residue or amino acid volumes: e.g., for an alanine in the protein interior: SCp(Ala) = Vp(Ala) - Vp(Gly). |
It was shown more than 50 years ago that the volumes of proteins in solution can be accurately calculated from the volumes of amino acids in solution [12], and we used the amino acid volumes in Table 3 to calculate the PSVs of the 13 proteins in Table 2. The protein volumes were calculated by summing the amino acid volumes (taken from Table 3) and correcting for the formation of the peptide links by subtracting 10.4 Å 3 for each such link in the protein.
The PSVs calculated from the amino acid solution volumes differ from the experimental values by between 0.0% and 2.0%, and by 0.8% on average. Note that these calculated values are as accurate as those calculated using the mean volumes of residues in folded proteins (Table 2).
The calculations described in the previous sections show that protein volumes can be accurately determined from both the volumes amino acids occupy in solution and from the volumes residues occupy in protein interiors. This conclusion might have been expected from the experimental observations that volume changes on folding are small (< 0.5%; see above). However, it does not arise because there are no changes in residue volumes on folding. Changes in residue volume do occur. The size of these changes can be calculated from the comparison of the volumes of residues in the protein interior with those of amino acids in solution.
The comparison of residue volumes with those of amino acids is not entirely straightforward because of the additional atoms in amino acids. We overcome this problem by treating the side chains and the peptide group separately. The volume of a side chain in solution is taken to be the volume of its amino acid less that of glycine; the volume of a side chain in the folded protein is taken to be that of its residue less that of the glycyl residue. The change in the volume of the side chain on folding is the difference between these two values.
As shown in Table 3, the side chain volumes in folded and unfolded proteins have differences that are systematic enough to place them in three distinct groups. The aliphatic side chains occupy volumes that are 2.6 Å 3 to 6.2 Å 3 smaller in proteins than in solution. Aromatic side chains, and those with sulphur or hydroxyl groups, show only small volume changes on going between the two environments. Side chains that are charged or have amide groups have volumes that are 3.3 Å to 11.9 Å 3 larger in proteins than in solution. The volume changes for side chains with polar atoms are the net result of the increase in volume of the polar atoms and the decrease in volume of the non-polar atoms. Note that the changes for the aliphatic, charged and amide side chains are much larger than the standard errors of the mean volumes (Table 1) and are very significant.
Experimental measurements of the volume of a peptide group in solution [12] [13] give a value of ~ 33 Å 3 . For peptides buried in protein interiors, our calculations give a mean volume of 39.2 Å 3 (Table 1). An increase in volume of 6 Å 3 may seem large, but a peptide contains both an NH and an O group and the increase is smaller than that which occurs on loss of electrostriction by charged polar groups: 10 Å 3 for a carboxylate group and 18 Å 3 for amino and guanadino groups [12] [13].
The positive and negative changes in volume are not due to any special interactions within folded proteins. The gain in volume of the polar side chains arises from efficient internal packing not fully compensating for the loss of electrostriction and hydrogen-bonded hydration. The reduction in the volume of aliphatic side chains results from the high efficiency of the protein packing in contrast to the low density of the 'iceberg' structure formed by hydrophobic hydration (see also [17] [18] [19] [20]).
The systematic differences in the volumes of folded and unfolded side chains and peptides provide a clear explanation of why the volume changes on folding are small at low pressures and large and positive at high pressures. The change in volume at low pressure (~ 1 atm) and room temperature is less than 0.5% ([1] [2] [3] and references therein). Such a small change requires either that residues have the same volumes in the folded and unfolded states or that positive and negative changes cancel on folding. Here we have shown that, upon folding, hydrophobic groups decrease in volume while hydrophilic groups increase in volume. The cancellation of these effects is demonstrated by the accurate calculation of PSVs from the volumes of both residues in proteins and amino acids in solution.
Furthermore, the cancellation of residue volume changes implies that the total volume change at high pressure depends primarily on the compressibility of the solution around the protein. The adiabatic compressibility of a protein is typically ~ 5 × 10 - 11 m 2 N - 1 [21] [22], a value between that of organic solids and of soft metals. Unfolded proteins have compressibilities ~ 25% greater than folded proteins [1] [2] and so are more compact at high pressures. Most of this increase in compressibility must come from the many non-polar groups exposed to solvent upon denaturation since the compressibility of water around non-polar groups (~ 37 × 10 - 11 m 2 N - 1) is greater than that of water around polar and charged groups [23]. Thus, the decrease in volume that accompanies protein unfolding at high pressure results from the great compressibility of the water around non-polar groups that become exposed.
Experiments on amino acids provide the most complete data for side chain volumes in solution. These side chain volumes, however, are not exactly comparable with those in proteins because, in amino acids, the electrostriction around charged amino and carboxyl groups affects the solvation of side chains so as to reduce their volumes [17] [24]. This effect can be mitigated by separating the side chains from these charged groups. Thus, more accurate but less extensive data are given by the work of Reading and Hedwig [18] [19], who determined the volumes in solution of tripeptides of the form Gly-X-Gly, where X is glycine, alanine, valine, leucine, serine or asparagine. The volumes of these peptides and their side chain components are listed in Table 4. The solution side chain volumes calculated from the tripeptide data are 0.62.9 Å 3 larger than those calculated from amino acids.
Table 4. The volumesa side chains occupy in protein interiors and in Gly-X-Gly tripeptides in solution. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||
aAll volumes are given here as Å 3 residue - 1 etc.; to convert these values to cm3 mol- 1, they should be multiplied by 0.6023 (see text). bResidue volumes in the protein interior, Vp, are from Table 1. cVolumes of the Gly-X-Gly tripeptides in solution, Vs, are from Reading and Hedwig [18] [19]. dThe volumes occupied by side chains in proteins and in tripeptides were calculated by simply subtracting the corresponding volume of the glycyl residue, or glycine tripeptide, from the residue, or tripeptide, volumes: e.g., for an alanine in the protein interior: SCp(Ala) = Vp(Ala) - Vp(Gly). |
This means that on folding, the negative volume changes that occur for aliphatic residues are somewhat larger than those calculated from the amino acid data and the positve changes for polar side chains are somewhat smaller (see Table 3 and Table 4). Note that these corrections do not affect the general conclusions obtained from the amino acid data on the nature of the volume changes that occur on protein folding.
The earlier calculations of the volumes residues occupy in protein interiors indicated that the packing density is the same as that in crystals of amino acids [6] [7]. The new smaller values show that the density is, in fact, somewhat higher. To determine the extent to which this is the case, we compared the mean volumes of side chains in protein interiors with the volumes of the side chains in crystals of amino acids (Table 5).
Table 5. The volumesa side chains occupy in protein interiors and in amino acids in crystals. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
aAll volumes are given here as Å 3 residue- 1 etc.; to convert these values to cm3 mol- 1, they should be multiplied by 0.6023. bResidue volumes in the protein interior, Vp, are from Table 1. cAmino acid volumes in crystals, Vx, are from [7]. There are no data for crystals of asparagine, lysine or arginine. dThe volumes occupied by side chains in proteins and amino acids were calculated by simply subtracting the corresponding volume of the glycyl residue or glycine from the rest of the residue or amino acid volumes: e.g., for an alanine in the protein interior: SCp(Ala) = Vp(Ala) - Vp(Gly). |
The volumes of both polar and non-polar side chains in protein interiors are generally smaller than in crystals: of the 17 side chains for which we have data, 15 have smaller volumes in proteins and 2 have smaller volumes in amino acid crystals (Table 5). The extent of the differences varies but, on average, side chains in protein interiors occupy 4% less volume. This value can be compared with the 15% volume reduction that occurs typically when organic molecules go from the liquid to the solid state.
These results mean that the packing in protein interiors is more efficient than that in organic crystals. This tight packing arises from two factors. First, proteins, unlike amino acids, are formed from heterogeneous side chains and, in general, objects of different sizes and shapes can pack more tightly than objects of the same size and shape. Second, and perhaps more importantly, the efficiency of packing within the unit cells of crystals is limited by the necessity of forming a three-dimensional lattice. Protein structures have no such requirement: they are free to shift their component parts relative to each other to produce an exceptionally high packing density (see also [25] [26] [27]).
In the previous sections we have argued that the very small net change in volume that occurs on folding arises from the positive changes that take place when polar groups are buried, cancelling the negative changes produced by aliphatic groups. An implication of this argument is that to maintain this balance, the ratio of different types of groups buried in proteins should be essentially constant.
Previous work has shown that the amount of surface buried in a protein is a function of molecular weight and that, for monomeric proteins in the size range 15 00035 000 kDa, the chemical composition of the buried surface is essentially constant [7] [28]. (Most smaller proteins have the same composition for their buried surfaces but there are a few exceptions involving structures in which several buried disulphide bridges form a significant fraction of the interior [28].)
We extended this work by calculating, as described previously [28], the chemical composition of the buried surface in well-determined proteins outside the range of the previous work: for monomers with molecular weights between 42 500 kDa and 82 500 kDa, and for oligomeric proteins with molecular weights between 15 800 kDa and 229 300 kDa. The results of the new calculations are given in Table 6. They show that the chemical character of surface buried in both sets of proteins is also constant: non-polar groups (those based on carbon) contribute 62 ± 1%, polar groups 31 ± 2%, charged groups 7 ± 2%, and polar and charged groups together 38 ± 1%.
Table 6. Chemical character of buried and accessible surface in proteins. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
aProtein abbreviations are as follows: SOD, superoxide dismutase; PRAI-1GPS, phosphoribosylanthranilate isomerase indoleglycerolphosphate synthase; AAT, aspartate aminotransferase; G-3-PDH, glyceraldehyde-3-phosphate dehydrogenase. The accessible surface areas of the unfolded protein and of the surface and interior of the folded proteins were calculated from the protein atomic coordinates using the procedures described previously [27]. The PDB coordinate files and references for the structure determinations of the proteins are: 1UTG [39]; 3WRP [40]; 2SOD [41]; 1OVA [42]; 1PII [43]; 6TAA [44]; 5ACN [45]; 7AAT [46]; 1GD1 [47]; 1GPB [48] and 8CAT [49]. bThe number of subunits per molecule. |
The groups that make the largest contribution to the volume changes are those totally buried in the protein interiors. We determined the composition of these groups in the proteins used to calculate the residue volumes (see Table 7). Table 7 shows that the major changes on folding come from the volume reductions produced by burying aliphatic groups and the volume gains produced by burying peptide groups.
Table 7. The composition of the atomic groups buried in the protein interiors. | |||||||||||||||||||||||||||||||||||||||||||
|
The explanation of the volume changes on protein folding given here has implications for experimental models of protein stability. The model most widely used is the transfer of small molecules from water to organic solvent [4] [29] [30] [31]. At normal pressures, the transfer of hydrophobic molecules from water to organic solvent is accompanied by a volume increase. For example, methylene groups, which have a volume in water of 26.5 Å 3, increase in volume by 23 Å 3 to about 29 Å 3 [23] [32]. As discussed above, such volume increases can not be reconciled with the observations that the volume changes on protein unfolding are small and that protein interiors are well packed [1] [3] [5] [33]. Here we have shown that this difficulty with the solution transfer model is even more serious since we have found that the close packing that occurs on folding actually involves reductions in the volumes of hydrophobic side chains. So, for example, methylene groups in proteins have a mean volume of 23.5 Å 3, about one-fifth less than their volume in organic solvents [23] [32].
As the chemical environments of non-polar side chains in organic solvents and in proteins are similar, van der Waals forces will have a fairly direct relation to packing density. Thus, the large differences in packing density in the two environments means that the role van der Waals forces play in protein folding are not properly represented by solution transfer models.
An alternative model for protein stability is based on the thermodynamics of dissolving crystalline cyclic dipeptides in water [34] [35]. Murphy and Gill [35] argue that crystalline dipeptides more accurately represent the protein interior because of the peptide hydrogen bonding that occurs in both. Crystalline dipeptides should also more accurately represent the packing of the side chains in protein interiors. We compared the volumes of side chains in the few known cyclic dipeptide crystal structures with those of side chains in the protein interior. As is found for amino acids, the volumes of both polar and non-polar side chains in the cyclic dipeptide crystals tend to be slightly larger (4% on average) than those in proteins (unpublished data). Thus, the solid model compounds are a good but not exact representation of the close packing in protein interiors.
On protein folding, residues are transferred from solution to the close-packed interior of the molecule. This change produces different effects on different groups: aliphatic groups occupy smaller volumes in proteins than they do in solution; polar groups occupy larger volumes. Examination of the surfaces buried in a wide range of proteins shows that the proportions of these groups that become buried is essentially constant and are such that the positive and negative volume changes cancel each other out. This accounts for the experimental observations [1] [2] [3] that the net volume changes on folding are very small.
We have demonstrated here that the overall packing density of both polar and non-polar residues in protein interiors is exceptionally high. This high density produces the well-ordered structures that give proteins their high specificity in recognition and catalysis and which are essential for their biological functions.
What occurs in natural proteins can be contrasted with what is found in the proteins that have been created by de novo design. Designed proteins are compact with stable secondary structures and high stability, but are like molten globules in having no fixed tertiary structure and interiors with mobile side chains [36]. The stability of these structures, which may be greater than that of natural proteins, is produced by hydrophobic forces: the design experiments demonstrate the essential role of close packing in producing structures that are specific [36].
In recent discussions of protein stability, the emphasis has been placed on the contributions made by hydrophobic forces and hydrogen bonds [29] [30] [31] [34] [35] [37]. It is difficult to determine from our results the extent of the contribution that packing forces make to the balance between the folded and unfolded states: the chemical environment of residues in solution and in protein interiors is very different, so the volumes they occupy in the two states cannot simply be related to energies. However, the high packing density observed in proteins must play a significant role because any diminution of density would reduce the contribution packing forces make to the stability of the folded state. Thus, an essential condition for an amino acid sequence to form a structure that is both stable and specific is the ability of the buried residues to pack not only in low-energy conformations, but also with high density.
We thank Drs JT Edsall, FM Richards, RL Baldwin and M Han for discussions of our results and J Goodfellow and TH Lilley for information. MG is supported by an Damon-Runyon Walter-Winchell Fellowship.
This paper is dedicated to John Edsall on the sixtieth anniversary of his paper with Cohn, McKeekin and Blanchard [14] on the volumes of residues in solution.